Traditional clinical diagnostic approaches may sometimes fail in detecting tumors (Alizadeh et al. 2001). Several results showed that bio-molecular analysis of malignancies may help to better characterize malignancies (e.g. gene expression profiling). Information for supporting both diagnosis and prognosis of malignancies at bio-molecular level may be obtained from high-throughput biotechnologies (e.g. DNA microarray). Recent work on unsupervised analysis of complex bio-molecular data (Bertoni and Valentini, 2006) showed that random projections obeying the Johnson-Lindenstrauss lemma can be used for: – Discovering structures in bio-molecular data – Validating clustering results – Improving clustering results RS ensembles can improve the accuracy of biomolecular diagnosis characterized by very high dimensional data. They could be also easily applied to heterogeneous bio-molecular and clinical data. A new promising approach consists in combining state of the art feature (gene) selection methods and RS ensembles. RS ensembles are computationally intensive but can be easily parallelized using clusters of workstations (e.g. in a MPI framework).
Bio-molecular diagnosis through Random Subspace Ensembles of Learning Machines / A. Bertoni, R. Folgieri, G. Valentini. ((Intervento presentato al convegno CAPI 2006 Convegno Calcolo ad Alte Prestazioni "Biocomputing" tenutosi a Milano nel 2006.
Bio-molecular diagnosis through Random Subspace Ensembles of Learning Machines.
A. Bertoni;R. Folgieri;G. Valentini
2006
Abstract
Traditional clinical diagnostic approaches may sometimes fail in detecting tumors (Alizadeh et al. 2001). Several results showed that bio-molecular analysis of malignancies may help to better characterize malignancies (e.g. gene expression profiling). Information for supporting both diagnosis and prognosis of malignancies at bio-molecular level may be obtained from high-throughput biotechnologies (e.g. DNA microarray). Recent work on unsupervised analysis of complex bio-molecular data (Bertoni and Valentini, 2006) showed that random projections obeying the Johnson-Lindenstrauss lemma can be used for: – Discovering structures in bio-molecular data – Validating clustering results – Improving clustering results RS ensembles can improve the accuracy of biomolecular diagnosis characterized by very high dimensional data. They could be also easily applied to heterogeneous bio-molecular and clinical data. A new promising approach consists in combining state of the art feature (gene) selection methods and RS ensembles. RS ensembles are computationally intensive but can be easily parallelized using clusters of workstations (e.g. in a MPI framework).File | Dimensione | Formato | |
---|---|---|---|
CAPI2006.pdf
accesso aperto
Tipologia:
Altro
Dimensione
114.82 kB
Formato
Adobe PDF
|
114.82 kB | Adobe PDF | Visualizza/Apri |
Pubblicazioni consigliate
I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.