This Cacao Genome Project is a collaboration among MARS, USDA-ARS, IBM, NCGR, Clemson University, HudsonAlpha Institute for Biotechnology, Indiana University and Washington State University with funding from MARS, USDA-ARS, and NSF, and contributions in effort from cacao breeders around the world.

About Theobroma cacao

Theobroma cacao (cacao or chocolate tree) is a neotropical plant native to Amazonian rainforests. It is now cultivated in over 50 countries. A member of Malvaceae family, its beans are harvested from pods for use as the food chocolate, in confections and cosmetics. This is the genome assembly and annotation of the Matina 1-6 cultivar, which belongs to the most cultivated cacao type worldwide.

Taxonomy ID 3641

Data source Cacao Genome Consortium

More information and statistics

Gene annotation

What can I find? Protein-coding and non-coding genes, splice variants, cDNA and protein sequences, non-coding RNAs.

More about this genebuild

Download genes, cDNAs, ncRNA, proteins - FASTA - GFF3

Update your old Ensembl IDs

Variation

This species currently has no variation database. However you can process your own variants using the Variant Effect Predictor:

Variant Effect Predictor

About this species