Identity and divergence of protein domain architectures after the yeast-whole genome duplication event / L. Grassi, D. Fusco, A.L. Sellerio, D. Corà, B.F. Bassetti, M. Caselle, M. Cosentino Lagomarsino. - In: MOLECULAR BIOSYSTEMS. - ISSN 1742-206X. - 6:11(2010), pp. 2305-2315. [10.1039/C003507F]

Identity and divergence of protein domain architectures after the yeast-whole genome duplication event

B.F. Bassetti; M. Cosentino Lagomarsino (last author)
2010

Abstract

Gene duplication is a key evolutionary mechanism for generating new functionality, and it is known to have produced a large proportion of genes. Duplication mechanisms include small-scale, or "local", events such as unequal crossing over and retroposition, as well as global events such as chromosomal or whole-genome duplication (WGD). In particular, several studies have confirmed that the yeast S. cerevisiae arose from a whole-genome duplication that occurred 100-150 million years ago. Detection and study of duplications are usually based on sequence alignment, synteny and phylogenetic techniques, but protein domains are also useful in assessing protein homology. We develop a simple and computationally efficient method for comparing protein domain architectures, based on the domain assignments available from public databases. We test the accuracy and reliability of this method in detecting instances of gene duplication in S. cerevisiae. In particular, we analyze the evolution of WGD and non-WGD paralogs from the domain viewpoint, in comparison with a more standard functional analysis of the genes. A large number of domains are shared by genes that underwent local and global duplications, indicating the existence of a common set of "duplicable" domains. On the other hand, WGD and non-WGD paralogs tend to have different functions. We find evidence that this comes from functional migration within similar domain superfamilies, but also from the existence of small sets of WGD-specific and non-WGD-specific domain superfamilies with largely different functions. This observation gives a novel perspective on the finding that WGD paralogs tend to be functionally different from small-scale paralogs. Finally, the Gene Ontology similarity of paralogs tends to decrease with duplication age, while this tendency is weaker or not observable when the domain architectures of paralogs are compared. This suggests that the set of domains composing a protein tends to be maintained, while its function, cellular process or localization diversifies. Overall, the gathered evidence gives a different viewpoint on the biological specificity of the WGD and, at the same time, supports the validity of domain architecture comparison as a tool for detecting homology.
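To make the idea of domain architecture comparison concrete, the following is a minimal Python sketch, not the authors' method. It assumes each gene is annotated with a list of domain superfamily identifiers and scores a pair of genes by the Jaccard index of their domain sets. The gene names, superfamily labels, function names and the threshold are all hypothetical; the abstract does not specify the actual scoring scheme.

```python
from itertools import combinations


def architecture_similarity(domains_a, domains_b):
    """Jaccard index between two collections of domain identifiers.

    Assumption: an architecture is treated as an unordered set of
    superfamily labels; domain order and repeats are ignored.
    """
    a, b = set(domains_a), set(domains_b)
    if not a and not b:
        # Two genes without any assigned domain carry no evidence of homology.
        return 0.0
    return len(a & b) / len(a | b)


def candidate_paralogs(assignments, threshold=1.0):
    """Return (gene1, gene2, score) for every pair scoring >= threshold.

    `assignments` maps a gene name to its list of domain identifiers.
    The default threshold of 1.0 keeps only identical domain sets.
    """
    pairs = []
    for (g1, d1), (g2, d2) in combinations(sorted(assignments.items()), 2):
        score = architecture_similarity(d1, d2)
        if score >= threshold:
            pairs.append((g1, g2, score))
    return pairs


# Hypothetical toy annotations, for illustration only.
toy = {
    "geneA": ["SF1", "SF2"],
    "geneB": ["SF2", "SF1"],
    "geneC": ["SF1", "SF3"],
    "geneD": [],
}

if __name__ == "__main__":
    print(candidate_paralogs(toy))       # identical domain sets only
    print(candidate_paralogs(toy, 0.3))  # also partially overlapping sets
```

A set-based score of this kind is cheap to compute over all gene pairs, which fits the abstract's emphasis on computational efficiency. It discards domain order and copy number, which a real comparison might need to take into account.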
English
Sector BIO/04 - Plant Physiology
Article
Yes, but type not specified
RSC Publishing, 2010
Volume 6, issue 11, pp. 2305-2315
Journal of international relevance
info:eu-repo/semantics/article
Research products::01 - Journal article
Article (author)
Journal with Impact Factor
L. Grassi, D. Fusco, A.L. Sellerio, D. Corà, B.F. Bassetti, M. Caselle, M. Cosentino Lagomarsino

Documents in IRIS are protected by copyright and all rights are reserved, unless otherwise indicated.

Use this identifier to cite or link to this document: https://hdl.handle.net/2434/157084
Citations
  • PMC 10
  • Scopus 17
  • ISI 16