Acute Leukemia is classified in terms of two distinct classes: Acute Lymphoblastic Leukemia (ALL) and Acute Myeloid Leukemia (AML). This paper aims at defining a feature selection analysis process mainly based on Deep Learning for classifying the acute leukemia type. The considered dataset consists in data of patients affected by both the leukemia types. Both the leukemia types are characterized by a list of identical genes for all the patients. The analysis exploits feature selection techniques for reducing the consistent number of variables (genes). To this aim, we use linear models for differential expression for microarray data, and an autoencoder based unsupervised deep learning model to simplify and speed up the classification. Then, classification models have been implemented with the use of a deep neural network (DNN), obtaining an accuracy of approximately 92%. Moreover, the results have been compared with the ones provided by an approach based on support vector machines (SVM), giving an accuracy of 87,39%. Another feature selection approach based on genetic algorithms has been experimented, with worse performances. We also conducted a gene enrichment analysis based on the functional annotation of the differentially expressed genes. As a result, a differentially expressed pathway between the two pathologies has been detected.
A deep learning and genetic algorithm based feature selection processes on Leukemia Data / R. Francese, M. Frasca, M. Risi, G. Tortora (IEEE SYMPOSIUM ON INFORMATION VISUALIZATION). - In: IV[s.l] : Institute of Electrical and Electronics Engineers (IEEE), 2022 Jul. - ISBN 9781665490078. - pp. 412-417 (( Intervento presentato al 26. convegno International Conference Information Visualisation : 19 through 22 July tenutosi a Wien nel 2022 [10.1109/iv56949.2022.00074].
A deep learning and genetic algorithm based feature selection processes on Leukemia Data
M. FrascaSecondo
;
2022
Abstract
Acute Leukemia is classified in terms of two distinct classes: Acute Lymphoblastic Leukemia (ALL) and Acute Myeloid Leukemia (AML). This paper aims at defining a feature selection analysis process mainly based on Deep Learning for classifying the acute leukemia type. The considered dataset consists in data of patients affected by both the leukemia types. Both the leukemia types are characterized by a list of identical genes for all the patients. The analysis exploits feature selection techniques for reducing the consistent number of variables (genes). To this aim, we use linear models for differential expression for microarray data, and an autoencoder based unsupervised deep learning model to simplify and speed up the classification. Then, classification models have been implemented with the use of a deep neural network (DNN), obtaining an accuracy of approximately 92%. Moreover, the results have been compared with the ones provided by an approach based on support vector machines (SVM), giving an accuracy of 87,39%. Another feature selection approach based on genetic algorithms has been experimented, with worse performances. We also conducted a gene enrichment analysis based on the functional annotation of the differentially expressed genes. As a result, a differentially expressed pathway between the two pathologies has been detected.| File | Dimensione | Formato | |
|---|---|---|---|
|
A_deep_learning_and_genetic_algorithm_based_feature_selection_processes_on_Leukemia_Data.pdf
accesso riservato
Tipologia:
Publisher's version/PDF
Dimensione
334.91 kB
Formato
Adobe PDF
|
334.91 kB | Adobe PDF | Visualizza/Apri Richiedi una copia |
Pubblicazioni consigliate
I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.




