Acute Leukemia is classified in terms of two distinct classes: Acute Lymphoblastic Leukemia (ALL) and Acute Myeloid Leukemia (AML). This paper aims at defining a feature selection analysis process mainly based on Deep Learning for classifying the acute leukemia type. The considered dataset consists in data of patients affected by both the leukemia types. Both the leukemia types are characterized by a list of identical genes for all the patients. The analysis exploits feature selection techniques for reducing the consistent number of variables (genes). To this aim, we use linear models for differential expression for microarray data, and an autoencoder based unsupervised deep learning model to simplify and speed up the classification. Then, classification models have been implemented with the use of a deep neural network (DNN), obtaining an accuracy of approximately 92%. Moreover, the results have been compared with the ones provided by an approach based on support vector machines (SVM), giving an accuracy of 87,39%. Another feature selection approach based on genetic algorithms has been experimented, with worse performances. We also conducted a gene enrichment analysis based on the functional annotation of the differentially expressed genes. As a result, a differentially expressed pathway between the two pathologies has been detected.

A deep learning and genetic algorithm based feature selection processes on Leukemia Data / R. Francese, M. Frasca, M. Risi, G. Tortora (IEEE SYMPOSIUM ON INFORMATION VISUALIZATION). - In: IV[s.l] : Institute of Electrical and Electronics Engineers (IEEE), 2022 Jul. - ISBN 9781665490078. - pp. 412-417 (( Intervento presentato al 26. convegno International Conference Information Visualisation : 19 through 22 July tenutosi a Wien nel 2022 [10.1109/iv56949.2022.00074].

A deep learning and genetic algorithm based feature selection processes on Leukemia Data

M. Frasca
Secondo
;
2022

Abstract

Acute Leukemia is classified in terms of two distinct classes: Acute Lymphoblastic Leukemia (ALL) and Acute Myeloid Leukemia (AML). This paper aims at defining a feature selection analysis process mainly based on Deep Learning for classifying the acute leukemia type. The considered dataset consists in data of patients affected by both the leukemia types. Both the leukemia types are characterized by a list of identical genes for all the patients. The analysis exploits feature selection techniques for reducing the consistent number of variables (genes). To this aim, we use linear models for differential expression for microarray data, and an autoencoder based unsupervised deep learning model to simplify and speed up the classification. Then, classification models have been implemented with the use of a deep neural network (DNN), obtaining an accuracy of approximately 92%. Moreover, the results have been compared with the ones provided by an approach based on support vector machines (SVM), giving an accuracy of 87,39%. Another feature selection approach based on genetic algorithms has been experimented, with worse performances. We also conducted a gene enrichment analysis based on the functional annotation of the differentially expressed genes. As a result, a differentially expressed pathway between the two pathologies has been detected.
Feature selection; Genomic data; Leukemia; Neural Network; Pathways;
Settore INFO-01/A - Informatica
lug-2022
Institute of Electrical and Electronics Engineers (IEEE)
https://ieeexplore.ieee.org/abstract/document/10017871
Book Part (author)
File in questo prodotto:
File Dimensione Formato  
A_deep_learning_and_genetic_algorithm_based_feature_selection_processes_on_Leukemia_Data.pdf

accesso riservato

Tipologia: Publisher's version/PDF
Dimensione 334.91 kB
Formato Adobe PDF
334.91 kB Adobe PDF   Visualizza/Apri   Richiedi una copia
Pubblicazioni consigliate

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/2434/1148787
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus 3
  • ???jsp.display-item.citation.isi??? 2
  • OpenAlex ND
social impact