Assembled and annotated 26.5 Gbp coast redwood genome: a resource for estimating evolutionary adaptive potential and investigating hexaploid origin

David B. Neale, Aleksey V. Zimin, Sumaira Zaman, Alison D. Scott, Bikash Shrestha, Rachael E. Workman, Daniela Puiu, Brian J. Allen, Zane J. Moore, Manoj K. Sekhwal, Amanda R. de la Torre, Patrick E. McGuire, Emily Burns, Winston Timp, Jill L. Wegrzyn, Steven L. Salzberg

Research output: Contribution to journal › Article › peer-review

Abstract

Sequencing, assembly, and annotation of the 26.5 Gbp hexaploid genome of coast redwood (Sequoia sempervirens) were completed, leading toward discovery of genes related to climate adaptation and investigation of the origin of the hexaploid genome. Deep-coverage short-read Illumina sequencing data from haploid tissue from a single seed were combined with long-read Oxford Nanopore Technologies sequencing data from diploid needle tissue to create an initial assembly, which was then scaffolded using proximity ligation data to produce a highly contiguous final assembly, SESE 2.1, with a scaffold N50 size of 44.9 Mbp. The assembly included several scaffolds that span entire chromosome arms, confirmed by the presence of telomere and centromere sequences on the ends of the scaffolds. The structural annotation produced 118,906 genes, 113 of which contain introns that exceed 500 Kbp in length, with one intron reaching 2 Mbp. Nearly 19 Gbp of the genome represented repetitive content, the vast majority characterized as long terminal repeats, with a 2.9:1 ratio of Copia to Gypsy elements that may aid in gene expression control. Comparison of coast redwood to other conifers revealed species-specific expansions for many abiotic and biotic stress response genes, including those involved in fungal disease resistance, detoxification, and physical injury/structural remodeling, and others supporting flavonoid biosynthesis. Analysis of multiple genes that exist in triplicate in coast redwood but only once in its diploid relative, giant sequoia, supports a previous hypothesis that the hexaploidy is the result of autopolyploidy rather than any hybridizations with separate but closely related conifer species.
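
The scaffold N50 quoted in the abstract is a standard contiguity statistic: the length L such that scaffolds of length L or longer together cover at least half of the total assembly. The sketch below illustrates that definition only. The function name `n50` and the toy lengths are assumptions made for illustration; they are not taken from the paper or from the SESE 2.1 assembly, and the authors' own tooling is not described here.

```python
def n50(lengths):
    """Return the N50 of a collection of sequence lengths.

    N50 is the length of the shortest sequence in the smallest set of
    longest sequences that together cover at least half of the total.
    """
    if not lengths:
        raise ValueError("lengths must be non-empty")
    total = sum(lengths)
    running = 0
    # Walk from the longest sequence down, accumulating covered length.
    for length in sorted(lengths, reverse=True):
        running += length
        # Compare doubled sum to total to stay in integer arithmetic.
        if 2 * running >= total:
            return length


# Hypothetical scaffold lengths (arbitrary units), not real assembly data.
toy_scaffolds = [50, 40, 30, 20, 10]
print(n50(toy_scaffolds))  # 40: 50 + 40 = 90 covers at least half of 150
```

Applied to a real assembly, the input would be the list of scaffold lengths in base pairs, and a result of 44.9 Mbp would mean that half of the assembled sequence lies in scaffolds at least that long.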

Original language: English (US)
Article number: jkab380
Journal: G3: Genes, Genomes, Genetics
Volume: 12
Issue number: 1
State: Published - Jan 2022

Keywords

  • Coast redwood
  • Conifer
  • Genome assembly and annotation
  • Gymnosperm
  • Hexaploid genome
  • Sequoia sempervirens

ASJC Scopus subject areas

  • Genetics (clinical)
  • Genetics
  • Molecular Biology
