Common bean (Phaseolus vulgaris L.) is the primary grain legume consumed directly by humans worldwide and is increasingly valued for its high nutritional quality, making it a prime target for genomics research. Despite its importance, genomic resources for this species remain limited. Within the PhasTILL and AGRITECH projects, an EMS-mutagenized population of 2,345 M1 lines was recently developed from an Italian cultivated variety (Meccearly dwarf Borlotto type, Verisem Italia). This population represents a unique resource for isolating mutants with desirable traits. To date, M3 seeds obtained from approximately 2,300 M2 plants derived from 590 M1 lines are available. Phenotypic screening in M2 and M3 generations has identified major alterations in germination and plant morphology. DNA has been collected from all M2 plants, and whole-genome sequencing (WGS) is ongoing to characterize induced variants. A pilot WGS study was conducted to optimize variant calling pipelines for EMS-induced mutations. The dataset included two wild-type controls and six M2 pools of four individuals each (4X pooling) sequenced at 40X coverage, and four individual plants from two M2 pools sequenced at 10X coverage. Variant calling was performed using five strategies varying in caller, pooling, and merging approaches, including Freebayes (single-run, batch-wise, octaploid pooling) and HaplotypeCaller with a per-sample gVCF workflow. After filtering, an average of 26,713 variants per sample was identified. HaplotypeCaller detected approximately five times more canonical variants than Freebayes (85,657 vs 11,110–23,363), but with lower confirmation rate in 1X samples (53–62% vs 92–99%). Importantly, 53–94% of canonical variants specific to two M2 pools and confirmed in 1X samples were also recovered by HaplotypeCaller. These results demonstrate that EMS-induced variants can be reliably identified using short-read WGS of both pools and individual mutants. The observed differences in variant calling highlight the importance of balancing sensitivity and precision. In conclusion, this population broadens genetic diversity in common bean and represents a valuable platform to validate candidate genes, identify potential targets for susceptibility to diseases and accelerate functional genomics studies aimed at improving performance, productivity, and nutritional quality.
Expanding genetic diversity in common bean through EMS mutagenesis: a novel platform for variant discovery and candidate genes validation / A. Gaiti, C. Russo, E. Cominelli, G. Testone, P. Cozzi, D. Paolo, M. Galbiati, A. Losa, T. Sala, E. Avite, P. Andreatta, C. Pozzi, F. Sparvoli. International Bean Improvement Cooperative (BIC) and North American Pulse Improvement Association (NAPIA) Biennial Meeting Lincoln, Nebraska, USA 2025.
Expanding genetic diversity in common bean through EMS mutagenesis: a novel platform for variant discovery and candidate genes validation
A. Gaiti
;C. Pozzi;
2025
Abstract
Common bean (Phaseolus vulgaris L.) is the primary grain legume consumed directly by humans worldwide and is increasingly valued for its high nutritional quality, making it a prime target for genomics research. Despite its importance, genomic resources for this species remain limited. Within the PhasTILL and AGRITECH projects, an EMS-mutagenized population of 2,345 M1 lines was recently developed from an Italian cultivated variety (Meccearly dwarf Borlotto type, Verisem Italia). This population represents a unique resource for isolating mutants with desirable traits. To date, M3 seeds obtained from approximately 2,300 M2 plants derived from 590 M1 lines are available. Phenotypic screening in M2 and M3 generations has identified major alterations in germination and plant morphology. DNA has been collected from all M2 plants, and whole-genome sequencing (WGS) is ongoing to characterize induced variants. A pilot WGS study was conducted to optimize variant calling pipelines for EMS-induced mutations. The dataset included two wild-type controls and six M2 pools of four individuals each (4X pooling) sequenced at 40X coverage, and four individual plants from two M2 pools sequenced at 10X coverage. Variant calling was performed using five strategies varying in caller, pooling, and merging approaches, including Freebayes (single-run, batch-wise, octaploid pooling) and HaplotypeCaller with a per-sample gVCF workflow. After filtering, an average of 26,713 variants per sample was identified. HaplotypeCaller detected approximately five times more canonical variants than Freebayes (85,657 vs 11,110–23,363), but with lower confirmation rate in 1X samples (53–62% vs 92–99%). Importantly, 53–94% of canonical variants specific to two M2 pools and confirmed in 1X samples were also recovered by HaplotypeCaller. These results demonstrate that EMS-induced variants can be reliably identified using short-read WGS of both pools and individual mutants. The observed differences in variant calling highlight the importance of balancing sensitivity and precision. In conclusion, this population broadens genetic diversity in common bean and represents a valuable platform to validate candidate genes, identify potential targets for susceptibility to diseases and accelerate functional genomics studies aimed at improving performance, productivity, and nutritional quality.| File | Dimensione | Formato | |
|---|---|---|---|
|
Abstract_BIC_poster_PhasTILL_final(1).pdf
accesso aperto
Descrizione: Abstract Poster Gaiti BIC2025
Tipologia:
Post-print, accepted manuscript ecc. (versione accettata dall'editore)
Licenza:
Creative commons
Dimensione
208.36 kB
Formato
Adobe PDF
|
208.36 kB | Adobe PDF | Visualizza/Apri |
Pubblicazioni consigliate
I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.




