Genomes

Introduction

Protein sequences originating from complete genomes and that can be assigned to CAZy families are listed in the links below. The only genomes that are consistently surveyed in the CAZy database are those released by the NCBI as regular entries in the daily releases of GenBank. In a very limited number of cases, we have included data from RefSeq genomes.

The collection of carbohydrate-active enzymes encoded by the genome of an organism ("CAZome") provides an insight into the nature and extent of the metabolism of complex carbohydrates of the species. The CAZomes of free living organisms typically correspond to 1-5% of the predicted coding sequences. Extremely reduced CAZomes are characteristic of species with a strict intracellular parasitic lifestyle. Because of the massive chemical, structural and functional variability of carbohydrates, CAZome analyses and comparisons highlight significant differences between species.

Although often useful, the simple assignment of a protein sequence to a CAZy family does not constitute a refined functional prediction for genomic annotation. For the later task, we are developping a CAZy-based annotation methodology, which takes into account protein modularity, family and subfamily assignment, relatedness to experimentally characterized enzymes and expertise in the varying substrate specificity of carbohydrate-active enzymes. This methodology, which results in coherent, expert and comparable sets of annotations, is applied to novel genomes and metagenomes on a collaborative basis.

Tables for Direct Genome Access by Kingdom

Bacteria1830
Archaea124
Eukaryota63
Viruses138

Our published work on CAZymes in genomes and metagenomes

[58] Price et al. (2012) Cyanophora paradoxa genome elucidates origin of photosynthesis in algae and plants. Science 335, 843-847 [PMID: 22344442].

[57] McNulty et al. (2011) The impact of a consortium of fermented milk strains on the human gut microbiome: a study involving monozygotic twins and gnotobiotic mice. Science Transl. Med. 3(106):106ra106 [PMID: 22030749].

[56] Berka et al. (2011) Comparative genomic analysis of the thermophilic biomass-degrading fungi Myceliophthora thermophila and Thielavia terrestris. Nature Biotechnol. 29, 922-927 [PMID: 21964414].

[55] De Luca et al. (2011) The cyst-dividing bacterium Ramlibacter tataouinensis TTB310 genome reveals a well-stocked toolbox for adaptation to a desert environment. PLoS One 6: e23784 [PMID: 21912644].

[54] Manzo et al. (2011) Carbohydrate-active enzymes from pigmented Bacilli: a genomic approach to assess carbohydrate utilization and degradation. BMC Microbiol 11, 198 [PMID: 21892951].

[53] Amselem et al. (2011) Genomic analysis of the necrotrophic fungal pathogens Sclerotinia sclerotiorum and Botrytis cinerea. PLoS Genetics 7, e1002230 [PMID: 21876677].

[52] Klosterman et al. (2011) Comparative genomics yields insights into niche adaptation of plant vascular wilt pathogens. PLoS Pathog. 7, e1002137 [PMID: 21829347].

[51] Eastwood et al. (2011) The plant cell wall decomposing machinery underlies the functional diversity of forest fungi. Science, 333, 762-765 [PMID: 21764756].

[50] Muegge et al. (2011) Diet drives convergence in gut microbiome functions across mammalian phylogeny and within humans. Science 332, 970-974 [PMID: 21596990].

[49] Duplessis et al. (2011) Obligate biotrophy features unraveled by the genomic analysis of rust fungi. Proc. Natl. Acad. Sci. USA 108, 9166-9171 [PMID: 21536894].

[48] Goodwin et al. (2011) Finished genome of the fungal wheat pathogen Mycosphaerella graminicola reveals dispensome structure, chromosome plasticity and stealth pathogenesis. PLoS Genetics 7, e1002070. [PMID: 21695235].

[47] Kubicek et al. (2011) Comparative genome sequence analysis underscores mycoparasitism as the ancestral life style of Trichoderma. Genome Biol. 12, R40 [PMID: 21501500].

[46] Dam et al. (2011) Insights into plant biomass conversion from the genome of the anaerobic thermophilic bacterium Caldicellulosiruptor bescii DSM 6725. Nucleic Acids Res. 39, 3240-3254 [PMID: 21227922].

[45] Sucgang et al. (2011) Comparative genomics of the social amoebae Dictyostelium discoideum and Dictyostelium purpureum. Genome Biol. 12, R20 [PMID: 21356102].

[44] Diguistini et al. (2011) Genome and transcriptome analyses of the mountain pine beetle-fungal symbiont Grosmannia clavigera, a lodgepole pine pathogen. Proc. Natl. Acad. Sci. USA 108, 2504-2509 [PMID: 21262841].

[43] Swanson et al. (2011) Phylogenetic and gene-centric metagenomics of the canine gastrointestinal microbiome reveals similarities with human and mouse gut metagenomes. ISME J 5, 639-649 [PMID: 20962874].

[42] Battaglia et al. (2011) Carbohydrate-active enzymes from the zygomycete fungus Rhizopus oryzae: a highly specialized approach to carbohydrate degradation depicted at genome level. BMC Genomics 12, 38 [PMID: 21241472].

[41] Turroni et al. (2010) Genome analysis of Bifidobacterium bifidum PRL2010 reveals metabolic pathways for host-derived glycan foraging. Proc. Natl. Acad. Sci. USA 107, 19514-19519 [PMID: 20974960].

[40] Hemme et al. (2010) Genome announcement: sequencing of multiple clostridia genomes related to biomass conversion and biofuels production. J. Bacteriol. 192, 6494-6496 [PMID: 20889752].

[39] Tasse et al. (2010) Functional metagenomics to mine the human gut microbiome for dietary fiber catabolic enzymes. Genome Res. 20, 1605-1612 [PMID: 20841432].

[38] Ohm et al. (2010) Formation of mushrooms and lignocellulose degradation encoded in the genome sequence of Schizophyllum commune. Nature Biotechnol. 28, 957-963 [PMID: 20622885].

[37] Purushe et al. (2010) Comparative genome analysis of Prevotella ruminicola and Prevotella bryantii; insights into their environmental niche. Microbial Ecology 60, 721-729 [PMID: 20585943].

[36] Rincon et al. (2010) Abundance and diversity of dockerin-containing proteins in the fiber-degrading rumen bacterium, Ruminococcus flavefaciens FD-1. (2010) PLoS One 5, e12476 [PMID: 20814577].

[35] Levesque et al. (2010) Genome sequence of the necrotrophic plant pathogen, Pythium ultimum, reveals original pathogenicity mechanisms and effector repertoire. Genome Biology 11, R73 [PMID: 20626842].

[34] Turnbaugh et al. (2010) Organismal, genetic, and transcriptional variation in the deeply sequenced gut microbiomes of identical twins. Proc. Natl. Acad. Sci USA 107, 7503-7508 [PMID: 20363958].

[33] Martin et al. (2010) Périgord black truffle genome uncovers evolutionary origins and mechanisms of symbiosis. Nature 464, 1033-1038 [PMID: 20348908].

[32] Ma et al. (2010) Comparative genomics reveals mobile pathogenicity chromosomes in Fusarium. Nature 464, 367-373 [PMID: 20237561].

[31] Ventura et al. (2009) The Bifidobacterium dentium Bd1 genome sequence reflects its genetic adaptation to the human oral cavity. PLoS Genet 5(12) e1000785 [PMID: 20041198].

[30] Coleman et al. (2009) The genome of Nectria haematococca: contribution of supernumerary chromosomes to gene expansion. PLoS Genet 5, e1000618 [PMID: 19714214].

[29] Yang et al. (2009) The complete genome of Teredinibacter turnerae T7901: an intracellular endosymbiont of marine wood-boring bivalves (shipworms). PloS One 4, e6085 [PMID: 19568419].

[28] Worden et al. (2009) The genomes of Micromonas: global reporters in marine environments. Science 324, 268-272 [PMID: 19359590].

[27] Turnbaugh et al. (2009) A core gut microbiome in obese and lean twins. Nature 457, 480-484 [PMID: 19043404].

[26] McBride et al. (2009) Novel features of the polysaccharide digesting gliding bacterium Flavobacterium johnsoniae revealed by genome sequence analysis. Appl. Environm. Microbiol. 75, 6864-6875 [PMID: 19717629].

[25] Berg Miller et al. (2009) Diversity and strain specificity of plant cell wall degrading enzymes revealed by the draft genome of Ruminococcus flavefaciens FD-1. PLoS One 4, e6650. [PMID: 19680555].

[24] Mahowald et al. (2009) Characterizing a model human gut microbiota composed of members of its two dominant bacterial phyla. Proc. Natl. Acad. Sci. USA 106, 5859-5864 [PMID: 19321416].

[23] Ward et al. (2009) Three genomes from the phylum Acidobacteria provide insight into their lifestyles in soils. Appl. Environ. Microbiol. 75, 2046-2056 [PMID: 19201974].

[22] Brulc et al. (2009) Gene-centric metagenomics of the fiber-adherent bovine rumen microbiome reveals forage specific glycoside hydrolases. Proc. Natl. Acad. Sci. USA 106, 1948-1953 [PMID: 19181843].

[21] Martinez et al. (2009) Genome, transcriptome, and secretome analysis of wood decay fungus Postia placenta supports unique mechanisms of lignocellulose conversion. Proc. Natl. Acad. Sci. USA 106, 1954-1959 [PMID: 19193860].

[20] Coutinho et al. (2009) Post-genomic insights into the plant polysaccharide degradation potential of Aspergillus nidulans and comparison to Aspergillus niger and Aspergillus oryzae. Fungal Genet. Biol. 46, S161-S169 [PMID: 19618505].

[19] Wortman et al. (2009) The 2008 update of the Aspergillus nidulans genome annotation: a community effort. Fungal Genet. Biol. 46, S2-S13 [PMID: 19146970].

[18] Martin et al. (2008) The genome of Laccaria bicolor provides insights into mycorrhizal symbiosis. Nature 452, 88-92 [PMID: 18322534].

[17] Lozupone et al. (2008) The convergence of carbohydrate active gene repertoires in human gut microbes. Proc. Natl. Acad. Sci. USA, 105, 15076-15081 [PMID: 18806222].

[16] Abad et al. (2008) Genome sequence of the metazoan plant-specific nematode Meloidogyne incognita. Nature Biotechnol. 26, 909-915. [PMID: 18660804].

[15] Deboy et al. (2008) Insights into plant cell-wall degradation from the genome sequence of the soil bacterium Cellvibrio japonicus. J. Bacteriol. 190, 5455-5463 [PMID: 18556790].

[14] Weiner et al. (2008) Complete genome sequence of the complex carbohydrate-degrading marine bacterium Saccharophagus degradans strain 2-40T. PLoS Genet., 4(5):e1000087. [PMID: 18516288].

[13] Martinez et al. (2008) Genome sequence analysis of the cellulolytic fungus Trichoderma reesei (syn. Hypocrea jecorina) reveals a surprisingly limited inventory of carbohydrate-active enzymes. Nature Biotechnol. 26, 553-560. [PMID: 18454138].

[12] Espagne et al. (2008) The genome sequence of the model Ascomycete fungus Podospora anserina. Genome Biol. 9:R77 (doi:10.1186/gb-2008-9-5-r77) [PMID: 18460219].

[11] Pel et al. (2007) Genome sequencing and analysis of the versatile cell factory Aspergillus niger CBS 513.88. Nature Biotechnol. 25, 221-231 [PMID: 17259976].

[10] Xu et al. (2007) Evolution of symbiotic bacteria in the distal human intestine. PLoS Biology 5, e156 [PMID: 17579514].

[9] Samuel et al. (2007) Genomic and metabolic adaptations of Methanobrevibacter smithii to the human gut. Proc. Natl. Acad. Sci. USA 104, 10643-10648. [PMID: 17563350].

[8] Xie et al. (2007) Genome sequence of the cellulolytic gliding bacterium Cytophaga hutchinsonii. Appl. Environm. Microbiol. 73, 3536-3546. [PMID: 17400776].

[7] Geisler-Lee et al (2006) Poplar Carbohydrate-Active Enzymes (CAZymes). Gene identification and expression analyses. Plant Physiol 140, 946-962. [PMID: 16415215].

[6] Tuskan et al. (2006) The genome of black cottonwood, Populus trichocarpa (Torr. & Gray ex Brayshaw). Science 313, 1596-1604 [PMID: 16973872].

[5] West CM, van der Wel H, Coutinho P M, Henrissat B (2005) Glycosyltransferase genomics in Dictyostelium discoideum (2005) In Dictyostelium Genomics. W.F. Loomis & A. Kuspa, eds., Horizon Scientific Press 235-264.

[4] Martinez et al. (2004) Genome sequence of the lignocellulose degrading fungus Phanerochaete chrysosporium strain RP78. Nature Biotechnol 22 695-700 [PMID: 15122302].

[3] Coutinho PM, Stam M, Blanc E & Henrissat B (2003) Why so many carbohydrate-active enzymes related genes in plants ? Trends Plant Sci. 8, 563-565 [PMID: 14659702].

[2] Henrissat B, Deleury E, Coutinho PM (2002) Glycogen metabolism loss : a common marker of parasitic behaviour in bacteria ? Trends Genet 18 437-440 [PMID: 12175798].

[1] Henrissat B, Coutinho P M, Davies GJ (2001) A census of carbohydrate-active enzymes in the genome of Arabidopsis thaliana. Plant Mol Biol 47 55-72 [PMID: 11554480].

Last update: 2012-05-09 © Copyright 1998-2012
AFMB - CNRS - Université d'Aix-Marseille