Carbohydrate-Binding Module family classification


A carbohydrate-binding module (CBM) is defined as a contiguous amino acid sequence within a carbohydrate-active enzyme with a discreet fold having carbohydrate-binding activity . A few exceptions are CBMs in cellulosomal scaffoldin proteins and rare instances of independent putative CBMs. The requirement of CBMs existing as modules within larger enzymes sets this class of carbohydrate-binding protein apart from other non-catalytic sugar binding proteins such as lectins and sugar transport proteins.

CBMs were previously classified as cellulose-binding domains (CBDs) based on the initial discovery of several modules that bound cellulose [1] [2]. However, additional modules in carbohydrate-active enzymes are continually being found that bind carbohydrates other than cellulose yet otherwise meet the CBM criteria, hence the need to reclassify these polypeptides using more inclusive terminology.

Family classification

Previous classification of cellulose-binding domains were based on amino acid similarity [3]. Groupings of CBDs were called "Types" and numbered with roman numerals (e.g. Type I or Type II CBDs). In keeping with the glycoside hydrolase classification, these groupings are now called families and numbered with Arabic numerals. Families 1 to 13 are the same as Types I to XIII [3]. For a detailed review on the structure and binding modes of CBMs see [4].

The family classification of CBMs is expected to:

1. aid in the identification of CBMs

2. in some cases, predict binding specificity

3. aid in identifying functional residues

4. reveal evolutionary relationships

5. be predictive of polypeptide folds

The CAZy database provides an updated list of the CBM-containing proteins, arranged by CBM family. When a protein contains CBMs from different families, it appears in each appropriate family. Because the fold of proteins is better conserved than their sequences, some of the CBM families can be grouped into superfamilies or clans.

Tables for Direct Access

- CBM Family Number

Non-Classified Sequences

- CBM Classification Statistics

Modules in present families 288306
Non-Classified modules 1651


[1Tomme, P., Van Tilbeurgh, H., Pettersson, G., Van Damme, J., Vandekerckhove, J., Knowles, J., Teeri, T. & Claeyssens, M. (1988) Studies of the cellulolytic system of Trichoderma reesei QM 9414. Analysis of domain function in two cellobiohydrolases by limited proteolysis. Eur. J. Biochem. 170, 575-581 [PMID: 3338453].

[2Gilkes, N.R., Warren, R.A., Miller, R.C.J. & Kilburn, D.G. (1988) Precise excision of the cellulose binding domains from two Cellulomonas fimi cellulases by a homologous protease and the effect on catalysis. J. Biol. Chem. 263, 10401-10407 [PMID: 3134347].

[3Tomme, P., Warren, R.A., Miller, R.C., Jr., Kilburn, D.G. & Gilkes, N.R. (1995) in Enzymatic Degradation of Insoluble Polysaccharides (Saddler, J.N. & Penner, M., eds.), Cellulose-binding domains: classification and properties. pp. 142-163, American Chemical Society, Washington.

[4Boraston, A.B., Bolam, D.N., Gilbert, H.J. & G.J. Davies (2004). Carbohydrate-binding modules: fine tuning polysaccharide recognition. Biochem. J. 382, 769-81 [PMID: 15214846].

Last update: 2021-11-30 © Copyright 1998-2021
AFMB - CNRS - Université d'Aix-Marseille