TY - GEN
T1 - Deep Learning Causal Attributions of Breast Cancer
AU - Chen, Daqing
AU - Li, Bo
PY - 2021/7/6
Y1 - 2021/7/6
N2 - In this paper, a deep learning-based approach is applied to high dimen- sional, high-volume, and high-sparsity medical data to identify critical casual attributions that might affect the survival of a breast cancer patient. The Surveil- lance Epidemiology and End Results (SEER) breast cancer data is explored in this study. The SEER data set contains accumulated patient-level and treatment-level information, such as cancer site, cancer stage, treatment received, and cause of death. Restricted Boltzmann machines (RBMs) are proposed for dimensionality reduction in the analysis. RBM is a popular paradigm of deep learning networks and can be used to extract features from a given data set and transform data in a non-linear manner into a lower dimensional space for further modelling. In this study, a group of RBMs has been trained to sequentially transform the original data into a very low dimensional space, and then the k-means clustering is conducted in this space. Furthermore, the results obtained about the cluster membership of the data samples are mapped back to the original sample space for interpretation and insight creation. The analysis has demonstrated that essential features relating to breast cancer survival can be effectively extracted and brought forward into a much lower dimensional space formed by RBMs.
AB - In this paper, a deep learning-based approach is applied to high dimen- sional, high-volume, and high-sparsity medical data to identify critical casual attributions that might affect the survival of a breast cancer patient. The Surveil- lance Epidemiology and End Results (SEER) breast cancer data is explored in this study. The SEER data set contains accumulated patient-level and treatment-level information, such as cancer site, cancer stage, treatment received, and cause of death. Restricted Boltzmann machines (RBMs) are proposed for dimensionality reduction in the analysis. RBM is a popular paradigm of deep learning networks and can be used to extract features from a given data set and transform data in a non-linear manner into a lower dimensional space for further modelling. In this study, a group of RBMs has been trained to sequentially transform the original data into a very low dimensional space, and then the k-means clustering is conducted in this space. Furthermore, the results obtained about the cluster membership of the data samples are mapped back to the original sample space for interpretation and insight creation. The analysis has demonstrated that essential features relating to breast cancer survival can be effectively extracted and brought forward into a much lower dimensional space formed by RBMs.
KW - Survival Analysis
KW - Deep Learning
KW - Restricted Boltzmann Machines
KW - Principal Component Analysis
KW - k-means Clustering Analysis
UR - https://link.springer.com/chapter/10.1007/978-3-030-80129-8_10
U2 - 10.1007/978-3-030-80129-8_10
DO - 10.1007/978-3-030-80129-8_10
M3 - Conference contribution
SN - 978-3-030-80128-1
VL - 3
T3 - Lecture Notes in Networks and Systems
SP - 124
EP - 135
BT - Intelligent Computing - Proceedings of the 2021 Computing Conference
A2 - Arai, Kohei
PB - Springer
T2 - Computing 2021
Y2 - 7 June 2021
ER -