EMBL-EBI User Survey 2024

Do data resources managed by EMBL-EBI and our collaborators make a difference to your work?

Please take 10 minutes to fill in our annual user survey, and help us make the case for why sustaining open data resources is critical for life sciences research.

Survey link: https://www.surveymonkey.com/r/HJKYKTT?channel=[webpage]

WormBase ParaSite HomeVersion: WBPS19 (WS291)-  Archive: WBPS18

Toxocara canis

BioProject PRJEB533 | Data Source Wellcome Sanger Institute | Taxonomy ID 6265

About Toxocara canis

The nematode Toxocara canis, or dog roundworm, is a worldwide-distributed intestinal parasite of dogs and other canids, and is thought to be the most common parasite infection of humans in the United States of America. It has a wide host range. Humans are infected by accidental ingestion of infective eggs from dog faeces, causing toxocariasis, an infection of organs such as the liver, lungs, eyes or brain. T. canis has many characteristics that make it a useful model system.

There is 1 alternative genome project for Toxocara canis available in WormBase ParaSite: PRJNA248777

Genome Assembly & Annotation


The draft genome assembly was produced by the Parasite Genomic group at the Wellcome Trust Sanger Institute, in collaboration with Philip Cooper (Liverpool School of Tropical Medicine) as part of the 50 Helminth Genomes project. The assembly uses Illumina paired-end sequencing followed by an in-house genome assembly pipeline comprising various steps, including contig assembly, scaffolding, gap-filling and error-correction.


The gene predictions were made by the Parasite Genomics group at the Wellcome Trust Sanger Institute and WormBase, as part of the 50 Helminth Genomes project. An in-house pipeline was developed that used MAKER to generate high-quality annotations by integrating evidence from multiple sources: ab initio gene predictions from AUGUSTUS, GeneMark-ES, and SNAP; projected annotation from C. elegans (using GenBlastG) and the taxonomically nearest reference helminth genome (using RATT); and ESTs, mRNAs and proteins from related organisms aligned to the genome using BLAST, with refinement of alignments using Exonerate.

Key Publications

Assembly Statistics

AssemblyT_canis_Equador_0011_upd, GCA_900622545.1
Database VersionWBPS19
Genome Size299,788,016
Data SourceWellcome Sanger Institute
Annotation Version2014-06-50HGPpatch

Gene counts

Coding genes20,264
Gene transcripts20,264

Learn more about this widget in our help section

This widget has been derived from the assembly-stats code developed by the Lepbase project at the University of Edinburgh