This Cacao Genome Project is a collaboration among MARS, USDA-ARS, IBM, NCGR, Clemson University, HudsonAlpha Institute for Biotechnology, Indiana University and Washington State University with funding from MARS, USDA-ARS, and NSF, and contributions in effort from cacao breeders around the world.

About Theobroma cacao

Theobroma cacao (cacao or chocolate tree) is a neotropical plant native to Amazonian rainforests. It is now cultivated in over 50 countries. A member of Malvaceae family, its beans are harvested from pods for use as the food chocolate, in confections and cosmetics. This is the genome assembly and annotation of the Matina 1-6 cultivar, which belongs to the most cultivated cacao type worldwide.

Taxonomy ID 3641

Data source The Cacao Genome Database

More information and statistics

Gene annotation

What can I find? Protein-coding and non-coding genes, splice variants, cDNA and protein sequences, non-coding RNAs.

More about this genebuild

Download genes, cDNAs, ncRNA, proteins - FASTA - GFF3

Update your old Ensembl IDs

Comparative genomics

What can I find? Homologues, gene trees, and whole genome alignments across multiple species.

More about comparative analyses

Phylogenetic overview of gene families

Download alignments (EMF)

Variation

This species currently has no variation database. However you can process your own variants using the Variant Effect Predictor:

Variant Effect Predictor

About this species