Plains zebra (Equus quagga)
Plains zebras are the most common zebra species. They live in small family groups consisting of a male (stallion), several females, and their young. The units may combine with others to form awe-inspiring herds thousands of head strong, but family members will remain close within the herd! Read more about plains zebras on National Geographic website.
"Mother and foal in Lakeland, Florida" by Seth Eisenberg, [CC BY-SA 4.0], via wikipedia.com
Chromosome-length genome assembly
Download the Equus_quagga_HiC.fasta.gz file containing the chromosome-length (2n=44) assembly of the plains zebra genome. All modifications with respect to the draft (see below) are annotated in the Equus_quagga_HiC.assembly file. Some basic stats associated with the new reference, Equus_quagga_HiC, are listed below. The full data release can be explored here.
Contig length (bp) | Number of contigs | Contig N50 (bp) | Longest contig (bp) |
---|---|---|---|
2,574,522,325 | 722,004 | 89,783 | 1,083,825 |
Scaffold length (bp) | Number of scaffolds | Scaffold N50 (bp) | Longest scaffold (bp) |
---|---|---|---|
2,586,661,125 | 689,504 | 114,504,116 | 176,892,750 |
Draft
The chromosome-length genome assembly is based on the draft assembly Equus_quagga, credited below.
The draft assembly was generated by the DNA Zoo team from short insert-size PCR-free DNA-Seq data using w2rap-contigger (Clavijo et al. 2017), see (Dudchenko et al., 2018) for details.
Method
3D Assembly was performed using 3D-DNA pipeline (Dudchenko et al., Science, 2017). The genome was reviewed using Juicebox Assembly Tools (Dudchenko et al., bioRxiv, 2018). See Methods for more information.
Hi-C sample
The blood sample for in situ Hi-C preparation was donated by a female individual named Zena, and obtained from Nancy Nunke (Hearts & Hands Animal Rescue), Greg Barsh (Stanford, Hudson Alpha), Ren Larison (UCLA). We thank Greg Barsh, Stanford, Hudson Alpha and Houston Zoo for additional plains zebra samples.
Hi-C Contact maps
Hi-C data was aligned to the draft reference using Juicer (Durand, Shamim et al., Cell Systems, 2016), and contact maps visualizing the alignments with respect to the draft and the new reference were built using 3D-DNA (Dudchenko et al., Science, 2017). The contact maps can be explored below via Juicebox.js interactive tool (Robinson et al., Cell Systems, 2018). (Please note that the interactive figures are scaled 1:2.) To explore the assembly in greater detail, please download the .hic and .assembly files from the data release folder and use Juicebox Assembly Tools (Dudchenko et al., bioRxiv, 2018).
References
If you use this genome assembly in your research, please check that the conditions of use associated with the draft permit it, and acknowledge the following work.
The draft assembly was generated by the DNA Zoo team from short insert-size PCR-free DNA-Seq data using w2rap-contigger (Clavijo et al. 2017), see (Dudchenko et al., 2018) for details.
Dudchenko, O., Batra, S.S., Omer, A.D., Nyquist, S.K., Hoeger, M., Durand, N.C., Shamim, M.S., Machol, I., Lander, E.S., Aiden, A.P., Aiden, E.L., 2017. De novo assembly of the Aedes aegypti genome using Hi-C yields chromosome-length scaffolds. Science 356, 92–95. https://doi.org/10.1126/science.aal3327.
Dudchenko, O., Shamim, M.S., Batra, S., Durand, N.C., Musial, N.T., Mostofa, R., Pham, M., Hilaire, B.G.S., Yao, W., Stamenova, E., Hoeger, M., Nyquist, S.K., Korchina, V., Pletch, K., Flanagan, J.P., Tomaszewicz, A., McAloose, D., Estrada, C.P., Novak, B.J., Omer, A.D., Aiden, E.L., 2018. The Juicebox Assembly Tools module facilitates de novo assembly of mammalian genomes with chromosome-length scaffolds for under $1000. bioRxiv 254797. https://doi.org/10.1101/254797.
Disclaimer
This is a work in progress. If you notice any discrepancies in the map or have data that confirms or contradicts the suggested reference, please email us at thednazoo@gmail.com or leave a comment on the Forum.