List of sequenced protist genomes
This list of sequenced protist genomes contains all the protist species known to have publicly available complete genome sequences that have been assembled, annotated and published; draft genomes are not included, nor are organelle only sequences.
Alveolata
Dinoflagellata
. Members of this group are of particular interest to science as the cause of serious human and livestock diseases.
Organism | Type | Relevance | Genome size | Number of genes predicted | Organization | Year of completion | Assembly status | Links |
---|---|---|---|---|---|---|---|---|
Babesia bovis | Apicomplexan | Cattle pathogen | 8.2 Mb | 3,671 | 2007[1] | |||
Breviolum minutim (Symbiodinium minutum; clade B1) | Dinoflagellate | Coral symbiont | 1.5 Gb | 47,014 | Okinawa Institute of Science and Technology | 2013[2] | Draft | OIST Marine Genomics[3] |
Cladocopium goreaui (Symbiodinium goreaui; Clade C1) | Dinoflagellate | Coral symbiont | 1.19 Gb | 35,913 | Reef Future Genomics (ReFuGe) 2020/ University of Queensland | 2018[4] | Draft | ReFuGe 2020[5] |
Cladocopium C92 strain Y103 (Symbiodinium sp. clade C; putative type C92) | Dinoflagellate | Foraminiferan symbiont | Unknown (assembly size 0.70 Gb) | 65,832 | Okinawa Institute of Science and Technology | 2018[6] | Draft | OIST Marine Genomics[3] |
Cryptosporidium hominis Strain:TU502 |
Apicomplexan | Human pathogen | 10.4 Mb | 3,994[7] | Virginia Commonwealth University | 2004[7] | ||
Cryptosporidium parvum C- or genotype 2 isolate |
Apicomplexan | Human pathogen | 16.5 Mb | 3,807[8] | UCSF and University of Minnesota | 2004[8] | ||
Eimeria tenella Houghton strain |
Apicomplexan | Intestinal parasite of domestic fowl | 55-60 Mb[9] | The Wellcome Trust Sanger Institute[10] | Available for download;[10] 2007 for Chr 1[11] | |||
Fugacium kawagutii CS156=CCMP2468 (Symbiodinium kawagutii; clade F1) | Dinoflagellate | Coral symbiont? | 1.07 Gb | 26,609 | Reef Future Genomics (ReFuGe) 2020 / University of Queensland | 2018[4] | Draft | ReFuGe 2020[5] |
Fugacium kawagutii CCMP2468 (Symbiodinium kawagutii; clade F1) | Dinoflagellate | Coral symbiont? | 1.18 Gb | 36,850 | University of Connecticut / Xiamen University | 2015[12] | Draft | S. kawagutii genome project[13] |
Neospora caninum | Apicomplexan | Pathogen for cattle and dogs | 62 Mb[14] | The Wellcome Trust Sanger Institute[15] | Available for download[15] | |||
Paramecium tetraurelia | Ciliate | Model organism | 72 Mb | 39,642[16] | Genoscope | 2006[16] | ||
Polarella glacialis CCMP1383 | Dinoflagellate | Psychrophile, Antarctic | 3.02 Gb (diploid), 1.48 Gbp (haploid) | 58,232 | University of Queensland | 2020[17] | Draft | UQ eSpace[18] |
Polarella glacialis CCMP2088 | Dinoflagellate | Psychrophile, Arctic | 2.65 Gb (diploid), 1.30 Gbp (haploid) | 51,713 | University of Queensland | 2020[17] | Draft | UQ eSpace[18] |
Plasmodium berghei ANKA
|
Apicomplexan | Rabbit malaria | 18.5 Mb[19] | 4,900;[19] 11,654 (UniProt) | ||||
Plasmodium chabaudi | Apicomplexan | Rodent malaria | 19.8 Mb[20] | 5,000[20] | ||||
Plasmodium falciparum Clone:3D7 |
Apicomplexan | Human pathogen (malaria) | 22.9 Mb | 5,268[21] | Malaria Genome Project Consortium | 2002[21] | ||
Plasmodium knowlesi | Apicomplexan | Primate pathogen (malaria) | 23.5 Mb | 5,188[22] | 2008[22] | |||
Plasmodium vivax | Apicomplexan | Human pathogen (malaria) | 26.8 Mb | 5,433[23] | 2008[23] | |||
Plasmodium yoelii yoelii Strain:17XNL |
Apicomplexan | Rodent pathogen (malaria) | 23.1 Mb | 5,878[24] | TIGR and NMRC | 2002[24] | ||
Symbiodinium microadriaticum (clade A) | Dinoflagellate | Coral symbiont | 1.1 Gb | 49,109 | King Abdullah University of Science and Technology | 2016[25] | Draft | Reef Genomics[26] |
Symbiodinium A3 strain Y106 (Symbiodinium sp. clade A3) | Dinoflagellate | symbiont | Unknown (assembly size 0.77 Gb) | 69,018 | Okinawa Institute of Science and Technology | 2018[6] | Draft | OIST Marine Genomics[3] |
Tetrahymena thermophila | Ciliate | Model organism | 104 Mb | 27,000[27] | 2006[27] | |||
Theileria annulata Ankara clone C9 |
Apicomplexan | Cattle pathogen | 8.3 Mb | 3,792 | Sanger | 2005[28] | ||
Theileria parva Strain:Muguga |
Apicomplexan | Cattle pathogen (African east coast fever) | 8.3 Mb | 4,035[29] | TIGR and the International Livestock Research Institute | 2005[29] | ||
Toxoplasma gondii GT1, ME49, VEG strains |
Apicomplexan | Mammal pathogen | 63 Mb (RefSeq) | 8,100 (UniProt) - 9,000 (EuPathDB) | J. Craig Venter Inst., TIGR, UPenn. | 2008[30] |
Amoebozoa
pseudopods. The best known member of this group is the slime mold, which has been studied for centuries; other members include the Archamoebae, Tubulinea and Flabellinia
. Some Amoeboza cause disease.
Organism | Type | Relevance | Genome size | Number of genes predicted | Organization | Year of completion |
---|---|---|---|---|---|---|
Dictyostelium discoideum Strain:AX4 |
Slime mold | Model organism | 34 Mb | 12,500[31] | Consortium from University of Cologne, Baylor College of Medicine and the Sanger Centre | 2005[31] |
Entamoeba histolytica HM1:IMSS |
Parasitic protozoan | Human pathogen ( amoebic dysentery )
|
23.8 Mb | 9,938[32] | TIGR, Sanger Institute and the London School of Hygiene and Tropical Medicine | 2005[32] |
Polysphondylium pallidum Strain:PN500 |
Slime mold | Model organism | 12,939,[33] 12,350 (UniProt) | Leibniz Institute for Age Research | 2009[33] |
Chromista
The
Cryptophyta
. Members of this group are mostly studied for evolutionary interest.
Organism | Type | Relevance | Genome size | Number of genes predicted | Organization | Year of completion |
---|---|---|---|---|---|---|
Albugo laibachii | Oomycete | Arabidopsis parasite, biotroph | 37 Mb[34] | 13,032[34] | 2011[34] | |
Aureococcus anophagefferens Strain:CCMP1984 |
Pelagophyte
|
DOE Joint Genome Institute | 2011[35] | |||
Bigelowiella natans | Chlorarachniophyte | Model organism | nucleomorph: 0.331 Mb nuclear: 95 Mb |
nucleomorph: 373[36] nuclear: >21,000[37] |
Hall Institute Australia , Univ. Melbourne, Univ. BCnuclear: Dalhousie University, Halifax, Nova Scotia, Canada |
2006,[36] 2012[37] |
Chroomonas mesostigmatica CCMP1168 | Cryptophyta
|
2012[38] | ||||
Cryptomonas paramecium | Cryptophyta
|
2010[39] | ||||
Emiliania huxleyi CCMP1516 |
Coccolithophore (phytoplankton) | 141.7 Mb[40] | 30,569[40] | Joint Genome Institute | 2013[40] | |
Emiliania huxleyi RCC1217 |
Coccolithophore (phytoplankton) | Available for download[41] | ||||
Fragilariopsis cylindrus | Diatom | 61.1 Mb[42] | 21,066[42] | Joint Genome Institute | 2017[42] | |
Guillardia theta
|
Cryptomonad | Model organism | 0.551 Mb (nucleomorph genome only) 87 Mb (nuclear genome) |
nucleomorph: 465[43] 513, 598 (UniProt) nuclear: >21,000[37] |
nucleomorph: Canadian Institute of Advanced Research, Philipps-University Marburg and the University of British Columbia nuclear: Dalhousie University, Halifax, Nova Scotia, Canada |
2001,[43] 2012[37] |
Hemiselmis andersenii CCMP7644 |
Cryptomonad | Model organism | 0.572 Mb (nucleomorph genome only) |
472,[44] 502 (UniProt) | Canadian Institute of Advanced Research | 2007[44] |
Hyaloperonospora arabidopsidis | Oomycete | obligate biotroph, Arabidopsis pathogen
|
WUGSC | 2010[45] | ||
Nannochloropis gaditana Strain: CCMP526 |
Eustigmatophyte | Lipid-producing, biotechnology applications | Virginia Bioinformatics Institute | 2012[46] | ||
Phaeodactylum tricornutum Strain: CCAP1055/1 |
Diatom | 27.4 Mb | 10,402 | Joint Genome Institute | 2008[47] | |
Phytophthora infestans Strain:T30-4 |
Oomycete | Great Famine of Ireland pathogen | Broad Institute | 2009[48] | ||
Phytophthora ramorum | Oomycete | Sudden oak death pathogen
|
65 Mb (7x) | 15,743 | Joint Genome Institute et al. | 2006[49] |
Phytophthora sojae | Oomycete | Soybean pathogen | 95 Mb (9x) | 19,027 | Joint Genome Institute et al. | 2006[49] |
Pseudo-nitzschia multiseries | Diatom | Joint Genome Institute | ||||
Plasmodiophora brassicae | Plasmodiophorid
|
Clubroot disease pathogen | 25.5 Mb | 9,730 | SLU Uppsala et al. | 2015[50] |
Pythium ultimum | Oomycete | ubiquitous plant pathogen | 42.8 Mb | 15,290 | Michigan State University et al. | 2010[51] |
Thalassiosira pseudonana Strain:CCMP 1335 |
Diatom | 34.5 Mb | 11,242[52] | Joint Genome Institute and the University of Washington | 2004[52] |
Excavata
. They are researched for their role in human disease.Organism | Type | Relevance | Genome size | Number of genes predicted | Organization | Year of completion |
---|---|---|---|---|---|---|
Giardia enterica (G. duodenalis assemblage B)
|
Parasitic protozoan | Human pathogen (Giardiasis) | 11.7 Mb | 4,470[53] | multicenter collaboration | 2009[53] |
Giardia duodenalis ATCC 50803 (Giardia duodenalis assemblage A) |
Parasitic protozoan | Human pathogen (Giardiasis) | 11.7 Mb | 6,470,[54] 7,153 (UniProt) | Karolinska Institutet, Marine Biological Laboratory | 2007[54] |
Leishmania braziliensis MHOM/BR/75M2904 |
Parasitic protozoan | Human pathogen (Leishmaniasis) | 33 Mb | 8,314[55] | Sanger Institute , Universidade de São Paulo, Imperial College
|
2007[55] |
Leishmania infantum JPCM5 |
Parasitic protozoan | Human pathogen (Visceral leishmaniasis) | 33 Mb | 8,195[55] | Sanger Institute, Imperial College and University of Glasgow | 2007[55] |
Leishmania major Strain:Friedlin |
Parasitic protozoan | Human pathogen (Cutaneous leishmaniasis) | 32.8 Mb | 8,272[56] | Seattle Biomedical Research Institute
|
2005[56] |
Naegleria gruberi | amoeboflagellate | Diverged from other eukaryotes over 1 billion years ago | 41 Mb[57] | 15,727[57] | 2010[57] | |
Trichomonas vaginalis | Parasitic protozoan | Human pathogen (Trichomoniasis) | 160 Mb | 59,681[58] | TIGR | 2007[58] |
Trypanosoma brucei Strain:TREU927/4 GUTat10.1 |
Parasitic protozoan | Human pathogen (Sleeping sickness) | 26 Mb | 9,068[59] | Sanger Institute and TIGR | 2005[59] |
Trypanosoma cruzi Strain:CL Brener TC3 |
Parasitic protozoan | Human pathogen (Chagas disease) | 34 Mb | 22,570[60] | TIGR, Seattle Biomedical Research Institute and Uppsala University | 2005[60] |
Opisthokonts, basal
fungi as well as basal groups that are not classified in these groups. These basal opisthokonts are reasonably categorized as protists and include choanoflagellates
, which are the sister or near-sister group of animals.
Organism | Type | Relevance | Genome size | Number of genes predicted | Organization | Year of completion |
---|---|---|---|---|---|---|
Monosiga brevicollis | Choanoflagellate | close relative of metazoans | 41.6 Mb | 9,200[61] | Joint Genome Institute | 2007[61] |
See also
- List of sequenced bacterial genomes
- List of sequenced animal genomes
- List of sequenced eukaryotic genomes
- List of sequenced fungi genomes
- List of sequenced plant genomes
- List of sequenced algae genomes
References
- PMID 17953480.
- PMID 23850284.
- ^ a b c "OIST Marine Genomics". marinegenomics.oist.jp. Retrieved 2018-08-22.
- ^ PMID 30271976.
- ^ a b "ReFuGe 2020 Data Site". refuge2020.reefgenomics.org. Retrieved 2018-09-07.
- ^ PMID 29898658.
- ^ PMID 15510150.
- ^ S2CID 26434820.
- ^ genedb
- ^ a b Sanger
- PMID 17284678.
- PMID 26542574.
- ^ "S. kawagutii data site". web.malab.cn/symka_new. Retrieved 2018-08-22.
- ^ genedb
- ^ a b Sanger
- ^ PMID 17086204.
- ^ PMID 32448240.
- ^ )
- ^ a b Ensembl entry
- ^ a b Ensembl entry
- ^ PMID 12368864.
- ^ PMID 18843368.
- ^ PMID 18843361.
- ^ PMID 12368865.
- PMID 28004835.
- ^ "Reef Genomics Data Site". smic.reefgenomics.org. Retrieved 2018-08-22.
- ^ PMID 16933976.
- PMID 15994557.
- ^ PMID 15994558.
- ^ NCBI Genome T. gondii ME49
- ^ PMID 15875012.
- ^ S2CID 14231289.
- ^ a b NCBI accession
- ^ PMID 21750662.
- PMID 21368207.
- ^ PMID 16760254.
- ^ PMID 23201678.
- PMID 23042551.
- PMID 21147880.
- ^ PMID 23760476.
- ^ Entry
- ^ PMID 28092920.
- ^ PMID 11323671.
- ^ PMID 18077423.
- PMID 21148394.
- PMID 22353717.
- PMID 18923393.
- S2CID 4385549.
- ^ S2CID 21287860.
- PMID 26084520.
- S2CID 44649671.
- ^ S2CID 8593895.
- ^ PMID 19696920.
- ^ S2CID 29299317.
- ^ PMID 17572675.
- ^ PMID 16020728.
- ^ S2CID 13901186.
- ^ PMID 17218520.
- ^ S2CID 18649858.
- ^ S2CID 3830267.
- ^ PMID 18273011.