A high-quality genome assembly and annotation of Thielaviopsis punctulata DSM102798

Gouthaman P. Purayil, Esam Eldin Saeed, Archana M. Mathai, Khaled A. El-Tarabily, Synan F. AbuQamar

Research output: Contribution to journal › Article › peer-review

Abstract

Black scorch disease (BSD), caused by the fungal pathogen Thielaviopsis punctulata (Tp) DSM102798, poses a significant threat to date palm cultivation in the United Arab Emirates (UAE). In this study, Chicago and Hi-C libraries were prepared as input for the Dovetail HiRise pipeline to scaffold the genome of Tp DSM102798. We generated an assembly with a total length of 28.23 Mb comprising 1,256 scaffolds, and the assembly had a contig N50 of 18.56 kb, L50 of three, and a BUSCO completeness score of 98.6% for 758 orthologous genes. Annotation of this assembly produced 7,169 genes and 3,501 Gene Ontology (GO) terms. Compared to five other Thielaviopsis genomes, Tp DSM102798 exhibited the highest continuity with a cumulative size of 27.598 Mb for the first seven scaffolds, surpassing the assemblies of all examined strains. These findings offer a foundation for targeted strategies that enhance date palm resistance against BSD, and foster more sustainable and resilient agricultural systems.

Original language: English
Article number: 745
Journal: Scientific Data
Volume: 11
Issue number: 1
Publication status: Published - Dec 2024

ASJC Scopus subject areas

  • Statistics and Probability
  • Information Systems
  • Education
  • Computer Science Applications
  • Statistics, Probability and Uncertainty
  • Library and Information Sciences
