Trifolium pratense Assembly and Gene Annotation
About Trifolium pratense
Red clover (Trifolium pratense) is one of the most important forage legume crops in temperate agriculture, and a key component of sustainable intensification of livestock farming systems. Red clover is a highly heterozygous diploid (2n = 14) species due to its gametophytic self-incompatibility system. This assembly provides a chromosome-scale reference draft genome for a red clover genotype of the variety Milvus (Milvus B).
Assembly
This assembly from the Earlham Institute provides a chromosome-scale reference draft genome for a red clover genotype of the variety Milvus (Milvus B) by integration of Whole Genome Sequencing (WGS) of short-length reads, Sanger-based bacterial artificial chromosome (BAC) end sequences, a physical and two genetic maps. WGS was assembled from paired-end and mate-pair libraries using the Platanus assembler. Three BAC libraries were created using high molecular weight DNA from a specific genotype of the Milvus variety (Milvus B). The mapping population used in this work consisted of 188 genotypes of F1 progeny from a cross between a genotype of the variety Milvus and a genotype of the variety Britta. 1,031 of the 1,388 markers were aligned from the two maps to place 532 of the longest scaffolds and used the BAC-end sequences as markers to further link unplaced scaffolds with already placed scaffolds from the same physical contig. The physical map contained 29,730 BACs, of which almost 23,000 were in contigs (77.3%).
Annotation
The genome was annotated by De Vega, JJ, et al.. Repeats were masked using RepeatMasker with the RepBase database, LTRharvest38. Transcripts were annotated using RNA-seq data with ab initio predictions and homologous transcripts from soybean, common bean, and M. truncatula.
Repeated sequences were called with the Repeat Detector, which is part of the Ensembl Genomes repeat feature pipelines. Repeats length: 91429549 - Repeats content: 30%
References
- Red clover (Trifolium pratense L.) draft genome provides a platform
for trait improvement.
De Vega JJ, Ayling S, Hegarty M, Kudrna D, Goicoechea JL, Ergon , Rognli OA, Jones C, Swain M, Geurts R et al. 2015. Sci Rep. 5:17394.
Picture credit: By Masaki Ikeda - Own work, CC BY-SA 3.0
​Links
- http://legumeinfo.org/organism/Trifolium/pratense
- http://www.nature.com/articles/srep17394
- http://marker.kazusa.or.jp/Red_clover
More information
General information about this species can be found in Wikipedia.
Statistics
Summary
Assembly | Trpr, INSDC Assembly GCA_900079335.1, May 2016 |
Database version | 113.1 |
Golden Path Length | 304,842,038 |
Genebuild by | Earlham |
Genebuild method | Import |
Data source | Earlham Institute |
Gene counts
Coding genes | 39,948 |
Non coding genes | 1,183 |
Small non coding genes | 1,172 |
Long non coding genes | 11 |
Gene transcripts | 42,485 |