EMBL-EBI User Survey 2024

Do data resources managed by EMBL-EBI and our collaborators make a difference to your work?

Please take 10 minutes to fill in our annual user survey, and help us make the case for why sustaining open data resources is critical for life sciences research.

Survey link: https://www.surveymonkey.com/r/HJKYKTT?channel=[webpage]

WormBase ParaSite HomeVersion: WBPS19 (WS291)-  Archive: WBPS18

Dibothriocephalus latus

BioProject PRJEB1206 | Data Source Wellcome Sanger Institute | Taxonomy ID 60516

About Dibothriocephalus latus

The cestode Dibothriocephalus latus, also known as Diphyllobothrium latum or broad fish tapeworm, is a parasite of fish and mammals. The parasite causes diphyllobothriasis in humans through consumption of raw or undercooked fish. Symptoms of diphyllobothriasis are generally mild, and can include diarrhoea, abdominal pain, vomiting, weight loss, fatigue, constipation and discomfort.

Genome Assembly & Annotation


The draft genome assembly was produced by the Parasite Genomic group at the Wellcome Trust Sanger Institute, in collaboration with Tomáš Scholz (Academy of Sciences of the Czech Republic) as part of the 50 Helminth Genomes project. The assembly uses Illumina paired-end sequencing followed by an in-house genome assembly pipeline comprising various steps, including contig assembly, scaffolding, gap-filling and error-correction.


The gene predictions were made by the Parasite Genomics group at the Wellcome Trust Sanger Institute and WormBase, as part of the 50 Helminth Genomes project. An in-house pipeline was developed that used MAKER to generate high-quality annotations by integrating evidence from multiple sources: ab initio gene predictions from AUGUSTUS, GeneMark-ES, and SNAP; projected annotation from C. elegans (using GenBlastG) and the taxonomically nearest reference helminth genome (using RATT); and ESTs, mRNAs and proteins from related organisms aligned to the genome using BLAST, with refinement of alignments using Exonerate.

Key Publications

Assembly Statistics

AssemblyD_latum_Geneva_0011_upd, GCA_900617775.1
Database VersionWBPS19
Genome Size531,434,409
Data SourceWellcome Sanger Institute
Annotation Version2014-06-50HGPpatch

Gene counts

Coding genes19,966
Gene transcripts19,966

Learn more about this widget in our help section

This widget has been derived from the assembly-stats code developed by the Lepbase project at the University of Edinburgh