Functional enrichment analysis is an analytical method to extract biological insights from gene expression data, popularized by the ever‐growing application of high‐throughput techniques. Typically, expression profiles are generated for hundreds to thousands of genes/proteins from samples belonging to two experimental groups, and after ad‐hoc statistical tests, researchers are left with lists of statistically significant entities, possibly lacking any unifying biological theme. Functional enrichment tackles the problem of putting overall gene expression changes into a broader biological context, based on pre‐existing knowledge bases of reference: database collections of known expression regulation, relationships and molecular interactions. STRING is among the most popular tools, providing both protein–protein interaction networks and functional enrichment analysis for any given set of identifiers. For complex experimental designs, manually retrieving, interpreting, analyzing and abridging functional enrichment results is a daunting task, usually performed by hand by the average wet‐biology researcher. We have developed reString, a cross‐platform software that seamlessly retrieves from STRING functional enrichments from multiple user‐supplied gene sets, with just a few clicks, without any need for specific bioinformatics skills. Further, it aggregates all findings into human‐readable table summaries, with built‐in features to easily produce user‐customizable publication‐grade clustermaps and bubble plots. Herein, we outline a complete reString protocol, showcasing its features on a real use‐case.

reString: an open-source Python software to perform automatic functional enrichment retrieval, results aggregation and data visualization / S. Manzini, M. Busnelli, A. Colombo, E. Franchi, P. Grossano, G. Chiesa. - In: SCIENTIFIC REPORTS. - ISSN 2045-2322. - 11:1(2021 Dec 06), pp. 23458.1-23458.15. [10.1038/s41598-021-02528-0]

reString: an open-source Python software to perform automatic functional enrichment retrieval, results aggregation and data visualization

S. Manzini
Primo
;
M. Busnelli
Secondo
;
A. Colombo;E. Franchi;G. Chiesa
Ultimo
2021

Abstract

Functional enrichment analysis is an analytical method to extract biological insights from gene expression data, popularized by the ever‐growing application of high‐throughput techniques. Typically, expression profiles are generated for hundreds to thousands of genes/proteins from samples belonging to two experimental groups, and after ad‐hoc statistical tests, researchers are left with lists of statistically significant entities, possibly lacking any unifying biological theme. Functional enrichment tackles the problem of putting overall gene expression changes into a broader biological context, based on pre‐existing knowledge bases of reference: database collections of known expression regulation, relationships and molecular interactions. STRING is among the most popular tools, providing both protein–protein interaction networks and functional enrichment analysis for any given set of identifiers. For complex experimental designs, manually retrieving, interpreting, analyzing and abridging functional enrichment results is a daunting task, usually performed by hand by the average wet‐biology researcher. We have developed reString, a cross‐platform software that seamlessly retrieves from STRING functional enrichments from multiple user‐supplied gene sets, with just a few clicks, without any need for specific bioinformatics skills. Further, it aggregates all findings into human‐readable table summaries, with built‐in features to easily produce user‐customizable publication‐grade clustermaps and bubble plots. Herein, we outline a complete reString protocol, showcasing its features on a real use‐case.
Settore BIO/14 - Farmacologia
Settore BIO/16 - Anatomia Umana
   Personalized diagnostics and treatment of high risk coronary artery disease patients
   RISKYCAD
   EUROPEAN COMMISSION
   FP7
   305739

   Knowledge Platform Intestinal Microbiomics: KP-896 Area 4 - Qualità, tipicità e sicurezza degli alimenti e stili di vita sani. Valorizzazione della relazione tra alimentazione e salute e della valenza nutraceutica dei prodotti agroalimentari
   MINISTERO DELLE POLITICHE AGRICOLE ALIMENTARI, FORESTALI E DEL TURISMO
   ID 834

   Characterization of microRNA functional role in cholesterol metabolism and in the pathogenesis of atherosclerosis
   FONDAZIONE CARIPLO
   2011-0645
6-dic-2021
Article (author)
File in questo prodotto:
File Dimensione Formato  
s41598-021-02528-0.pdf

accesso aperto

Tipologia: Publisher's version/PDF
Dimensione 6.15 MB
Formato Adobe PDF
6.15 MB Adobe PDF Visualizza/Apri
Pubblicazioni consigliate

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/2434/889044
Citazioni
  • ???jsp.display-item.citation.pmc??? 4
  • Scopus 5
  • ???jsp.display-item.citation.isi??? 4
social impact