Pan-genomes built from high-quality de novo assemblies are shifting the paradigm of biological research on genome evolution, speciation, and functional annotation. An array of new bioinformatic tools, spanning data storage, annotation, polymorphism identification, and visualization, has been developed to capitalize on pan-genome resources. Large insertion and deletion (indel) polymorphisms, which can alter gene structure or expression, are a class of structural variants that need to be catalogued from pan-genomes. However, the nature of indels, namely their unknown size and uneven distribution across genome assemblies, complicates their identification. The process remains a challenge and often requires painstaking probing and decision-making from users.
Here, we introduce BRIDGE (Blastn Recovered Insertions and Deletions near Gene Explorer) for surveying potential indels around genes of interest in five publicly accessible cereal pan-genomes.
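To make concrete why indel size matters when comparing assemblies, the following is a minimal sketch of one way candidate indels could be inferred from blastn alignment blocks (HSPs). It is not BRIDGE's actual implementation. The `Hsp` type, the `candidate_indels` function, the `min_size` threshold, and the example coordinates are all hypothetical, and the sketch assumes plus-strand, collinear alignments only.

```python
from typing import NamedTuple


class Hsp(NamedTuple):
    """One alignment block; coordinates are 1-based and inclusive (hypothetical type)."""
    q_start: int  # start on the query (the reference gene region)
    q_end: int    # end on the query
    s_start: int  # start on the subject (another assembly)
    s_end: int    # end on the subject


def candidate_indels(hsps, min_size=50):
    """Compare the unaligned gaps between adjacent HSPs on query and subject.

    A longer gap on the subject suggests an insertion relative to the query;
    a longer gap on the query suggests a deletion. Assumes plus-strand,
    collinear HSPs; anything else is skipped.
    """
    calls = []
    ordered = sorted(hsps, key=lambda h: h.q_start)
    for left, right in zip(ordered, ordered[1:]):
        q_gap = right.q_start - left.q_end - 1
        s_gap = right.s_start - left.s_end - 1
        if q_gap < 0 or s_gap < 0:
            continue  # overlapping or rearranged blocks are out of scope here
        diff = s_gap - q_gap
        if abs(diff) >= min_size:
            kind = "insertion" if diff > 0 else "deletion"
            # Report the type, the size in bp, and the query position before the gap.
            calls.append((kind, abs(diff), left.q_end))
    return calls


# Made-up coordinates for illustration only.
hsps = [
    Hsp(1, 1000, 5001, 6000),
    Hsp(1001, 2000, 6501, 7500),  # subject carries 500 extra bp before this block
    Hsp(2301, 3000, 7501, 8200),  # 300 bp of the query are absent from the subject
]
print(candidate_indels(hsps))
```

In practice the size threshold and the handling of overlapping, inverted, or fragmented alignments are where the user decisions described above come in; this sketch sidesteps them entirely.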
BRIDGEcereal currently holds 120 genomes:
11 Wheat genomes.
38 Maize genomes.
18 Sorghum genomes.
33 Rice genomes.
20 Barley genomes.
Acknowledgements: We thank the USDA-ARS SCINet for computing resources and the collaboration of the USDA-ARS Partnerships for Data Innovations (PDI, https://pdi.scinet.usda.gov/), which provided data stewardship solutions to enable secure data management, storage, and sharing.
Contact: xianran.li@usda.gov