Amborella trichopoda Assembly and Gene Annotation
About Amborella trichopoda
Amborella trichopoda is a small, tropical shrub endemic to New Caledonia. It is the only species in the genus Amborella, which is the only member of the family Amborellaceae. As the only living species on the sister lineage to all other flowering plants, it is an important reference for studying plant evolution. Individual Amborella trichopoda are usually sprawling understory shrubs, although occasionally they grow up to eight metres high. They have evergreen leaves and small, white to yellow flowers. The species is dioecious; female plants producing carpals and males producing stamens. Individual plants can change sex between flowerings. Unlike nearly all other flowering plants, they do not posses vessel elements for water conduction.
Amborella's genome is a relatively compact 870Mb, arranged into 13 chromosome pairs.
Amborella's draft genome sequence was published in December, 2013, by the Amborella Genome Project. Initial sequencing and contig assembly was performed using a whole-genome shotgun (WGS) strategy. Contigs and scaffold assignments were refined, and superscaffolds constructed from them using evidence from end-sequenced BACs, single-molecule restriction digest (OpGen) experiments, and fluorescent in situ hybridisation (FISH) experiments.
The Amborella genome project carried out ab initio annotation of genes and repetitive elements using the DAWGPAWS and EVidenceModeler software packages. These annotations were refined through manual comparison with assembled Amborella cDNA transcripts, gene family analyses, and homology studies.
- Assembly and validation of the genome of the nonmodel basal angiosperm Amborella.
Chamala S, Chanderbali AS, Der JP, Lan T, Walts B, Albert VA, dePamphilis CW, Leebens-Mack J, Rounsley S, Schuster SC et al. 2013. Science. 342:1516-1517.
- The Amborella genome and the evolution of flowering plants.
2013. Science. 342:1241089.
- The DAWGPAWS pipeline for the annotation of genes and transposable elements in plant genomes.
Estill JC, Bennetzen JL. 2009. Plant Methods. 5:8.
- Automated eukaryotic gene structure annotation using EVidenceModeler and the Program to Assemble Spliced Alignments.
Haas BJ, Salzberg SL, Zhu W, Pertea M, Allen JE, Orvis J, White O, Buell CR, Wortman JR. 2008. Genome Biol.. 9:R7.
Picture credit: Scott Zona via Wikimedia Commons: http://commons.wikimedia.org/wiki/File:Amborella_trichopoda_%283173820625%29.jpg
General information about this species can be found in Wikipedia.
|Assembly||AMTR1.0, INSDC Assembly GCA_000471905.1, Nov 2013|
|Golden Path Length||706,332,640|
|Genebuild method||Imported from ENA|
|Data source||Amborella Genome Sequencing Project|
|Non coding genes||1,040|
|Small non coding genes||979|
|Long non coding genes||61|