Resumen
Cancer is the leading cause of death in Taiwan. Compared with other types of cancer, the incidence of lung cancer is high. In this study, the National Health In-surance Research Database (NHIRDB) was used to determine the diseases and symptoms associ-ated with lung cancer, and a 10-year probability deep neural network prediction model for lung cancer was developed. The proposed model could allow patients with a high risk of lung cancer to receive an earlier diagnosis and support the physicians? clinical decision-making. As a result, a total of 13 diseases were selected as the predicting factors. The proposed model showed an accuracy of 85.4%, a sensitivity of 72.4% and a specificity of 85%, as well as an 87.4% area under ROC (95%, 0.8604?0.8885) model precision. Based on data analysis and deep learning, our prediction model discovered some features that had not been previously identified by clinical knowledge. This study tracks a decade of clinical diagnostic records to identify possible symptoms and comorbidities of lung cancer, allows early prediction of the disease, and assists more patients with early diagnosis.