Traditional clinical diagnostic approaches may sometimes fail in detecting tumors (Alizadeh et al. 2001). Several results showed that bio-molecular analysis of malignancies may help to better characterize malignancies (e.g. gene expression profiling). Information for supporting both diagnosis and prognosis of malignancies at bio-molecular level may be obtained from high-throughput biotechnologies (e.g. DNA microarray). Recent work on unsupervised analysis of complex bio-molecular data (Bertoni and Valentini, 2006) showed that random projections obeying the Johnson-Lindenstrauss lemma can be used for: – Discovering structures in bio-molecular data – Validating clustering results – Improving clustering results RS ensembles can improve the accuracy of biomolecular diagnosis characterized by very high dimensional data. They could be also easily applied to heterogeneous bio-molecular and clinical data. A new promising approach consists in combining state of the art feature (gene) selection methods and RS ensembles. RS ensembles are computationally intensive but can be easily parallelized using clusters of workstations (e.g. in a MPI framework).

Bio-molecular diagnosis through Random Subspace Ensembles of Learning Machines / A. Bertoni, R. Folgieri, G. Valentini. ((Intervento presentato al convegno CAPI 2006 Convegno Calcolo ad Alte Prestazioni "Biocomputing" tenutosi a Milano nel 2006.

Bio-molecular diagnosis through Random Subspace Ensembles of Learning Machines.

A. Bertoni;R. Folgieri;G. Valentini
2006

Abstract

Traditional clinical diagnostic approaches may sometimes fail in detecting tumors (Alizadeh et al. 2001). Several results showed that bio-molecular analysis of malignancies may help to better characterize malignancies (e.g. gene expression profiling). Information for supporting both diagnosis and prognosis of malignancies at bio-molecular level may be obtained from high-throughput biotechnologies (e.g. DNA microarray). Recent work on unsupervised analysis of complex bio-molecular data (Bertoni and Valentini, 2006) showed that random projections obeying the Johnson-Lindenstrauss lemma can be used for: – Discovering structures in bio-molecular data – Validating clustering results – Improving clustering results RS ensembles can improve the accuracy of biomolecular diagnosis characterized by very high dimensional data. They could be also easily applied to heterogeneous bio-molecular and clinical data. A new promising approach consists in combining state of the art feature (gene) selection methods and RS ensembles. RS ensembles are computationally intensive but can be easily parallelized using clusters of workstations (e.g. in a MPI framework).
2006
clustering, machine learning, random subspace, ensemble
Settore INF/01 - Informatica
Bio-molecular diagnosis through Random Subspace Ensembles of Learning Machines / A. Bertoni, R. Folgieri, G. Valentini. ((Intervento presentato al convegno CAPI 2006 Convegno Calcolo ad Alte Prestazioni "Biocomputing" tenutosi a Milano nel 2006.
Conference Object
File in questo prodotto:
File Dimensione Formato  
CAPI2006.pdf

accesso aperto

Tipologia: Altro
Dimensione 114.82 kB
Formato Adobe PDF
114.82 kB Adobe PDF Visualizza/Apri
Pubblicazioni consigliate

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/2434/49206
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus ND
  • ???jsp.display-item.citation.isi??? ND
social impact