Polysaccharide-Utilization Loci DataBase (PULDB)

PULDB describes Polysaccharide Utilization Loci (PULs) experimentally characterized in the literature, but also 81,235 PUL predictions in 2,494 Bacteroidetes species.
A PUL is a set of physically-linked genes organized around a susCD gene pair. PULs are prevalent in the Bacteroidetes phylum. They provide an evolutionary advantage to these species by orchestrating the breakdown of complex glycans thanks to the encoded CAZymes. Please note that our PUL prediction tool tends to predict more PULs with multiple tandem susCD pairs than what is seen in experimentally determined cases.

Citing PULDB

A new reference for PULDB ! In the 2018 database issue of Nucleic Acids Research, we summarize the many changes that have occurred in the PULDB during the previous two years.
Terrapon N, Lombard V, Drula É, Lapébie P, Al-Masaudi S, Gilbert HJ, Henrissat B (2017) PULDB: the expanded database of Polysaccharide Utilization Loci; Nucleic Acids Research Link to the publication

To refer the prediction method
Terrapon, N., Lombard, V., Gilbert, H.J. and Henrissat, B. (2015) Automatic prediction of polysaccharide utilization loci in Bacteroidetes species from the human gut microbiota; Bioinformatics 31(5):647-55 - Link to the publication

List by species/taxon or browse PULDB taxonomy


Include CAZyme clusters?
Search for PULs with specific modules
Enter one or multiple modules (list available here) separated by '+'
Module(s)  Force order? Add CAZyme clusters?
Search for a protein

or a substrate
   Note: [Fn], [Fc] and [F] indicate a possibly fragmented gene (e.g. a missing signal peptide or an incomplete catalytic module) at N- or C-terminus or both/undetermined, respectively, and [EC] denotes functional characterization of the gene in the litterature (EC number assigned by CAZy curators).


Organism name/NCBI id Bacteroides thetaiotaomicron VPI-5482 - 226186
Identifier/JBrowse link Predicted PUL 84JBrowse icon 
Modularity
◀ GH2_1
 
◀ GH53
 
◀ unk
 
◀ SusD
 
◀ SusC
 
◀ unk
 
◀ HTCS
 
unk ▶
 
PL13 ▶
Protein details
ProteinModularityCAZy functionGenome Sequencing descriptionPfam domains
BT4667
◀ GH2_1
[EC]
EC3.2.1.23 b-galactosidase Pubmed icon beta-galactosidasePF02837
PF00703
PF02836
PF16355
PF18565
BT4668
◀ GH53
[EC]
EC3.2.1.89 endo-b-1,4-galactanase Pubmed icon arabinogalactan endo-1,4-beta-galactosidasePF07745
BT4669
◀ unk
hypothetical proteinPF14292
PF17138
PF17141
BT4670
◀ SusD
putative outer membrane protein, probably involved in nutrient bindingPF07980
BT4671
◀ SusC
putative outer membrane protein, probably involved in nutrient bindingPF13715
PF07715
PF00593
BT4672
◀ unk
hypothetical protein
BT4673
◀ HTCS
two-component system sensor histidine kinasePF07494
PF07495
PF00512
PF02518
PF12833
BT4674
unk ▶
hypothetical proteinPF15889
BT4675
PL13 ▶
[EC]
EC4.2.2.7 heparin lyase / heparin lyase I Pubmed icon heparin lyase I precursorPF14099
Overlap with PULs
Literature-derived PUL 86 pectic galactan - pectic galactanJBrowse icon
◀ GH2_1
 
◀ GH53
 
◀ unk
 
◀ SusD
 
◀ SusC
 
◀ unk
 
◀ HTCS
Literature-derived PUL 85 heparin/heparan sulfate - heparin/heparan sulfateJBrowse icon
◀ PL15_2
 
◀ MFS
 
◀ ROK
 
◀ unk
 
◀ Sulf1_11
 
◀ PL12_2
 
◀ GH88
 
◀ SusD
 
◀ SusC
 
◀ unk
 
◀ PL12_2
 
◀ HTCS
 
PL13 ▶
Similarity search
  Select the number of most similar PULs (hits) to display    Change parameters