PR00732

Identifier
GLHYDRLASE4
Accession
PR00732
No. of Motifs
7
Creation Date
26-MAY-1997  (UPDATE 07-JUN-1999)
Title
Glycosyl hydrolase family 4 signature
Database References

PROSITE; PS01324 GLYCOSYL_HYDROL_F4
INTERPRO; IPR001088
Literature References
1. HENRISSAT, B. AND BAIROCH, A.
New families in the classification of glycosyl hydrolases based on amino
acid sequence similarities.
BIOCHEM.J. 293 781-788 (1993).
 
2. HENRISSAT, B.
A classification of glycosyl hydrolases based on amino acid sequence
similarities.
BIOCHEM.J. 280 309-316 (1991).
 
3. DAVIES, G. AND HENRISSAT, B.
Structures and mechanisms of glycosyl hydrolases.
STRUCTURE 3 853-859 (1995).
 
4. HENRISSAT, B. AND BAIROCH, A.
Updating the sequence-based classification of glycosyl hydrolases.
BIOCHEM.J. 316 695-696 (1996).

Documentation
O-Glycosyl hydrolases (EC 3.2.1.-) are a widespread group of enzymes that
hydrolyse the glycosidic bond between two or more carbohydrates, or between
a carbohydrate and a non-carbohydrate moiety. A classification system for
glycosyl hydrolases, based on sequence similarity, has led to the definition
of up to 60 different families [1-4] (http://expasy.hcuge.ch/cgi-bin/lists?glycosid.txt).
Family 4 includes 6-phospho-beta-glucosidases, alpha-galactosidases, and
the B. subtilis LPLD protein.
 
GLHYDRLASE4 is a 7-element fingerprint that provides a signature for 
family 4 glycosyl hydrolases. The fingerprint was derived from an initial
alignment of 7 sequences: the motifs were drawn from short conserved regions 
spanning the N-terminal half of the alignment. A single iteration on OWL29.3
was required to reach convergence, no further sequences being identified
beyond the starting set.
 
An update on SPTR37_9f identified a true set of 8 sequences, and 1
partial match.
Summary Information
8 codes involving 7 elements
1 codes involving 6 elements
0 codes involving 5 elements
0 codes involving 4 elements
0 codes involving 3 elements
0 codes involving 2 elements
Composite Feature Index
   7|   8   8   8   8   8   8   8
   6|   0   1   1   1   1   1   1
   5|   0   0   0   0   0   0   0
   4|   0   0   0   0   0   0   0
   3|   0   0   0   0   0   0   0
   2|   0   0   0   0   0   0   0
  --+------------------------------
    |   1   2   3   4   5   6   7
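The summary counts and the composite feature index can be recomputed from a per-sequence record of which motifs were matched. The sketch below is illustrative only: it reads each index row as "sequences matching exactly k elements" and each column as a motif number, and it assumes that the partial match O88026 lacks motif 1, as the 0 in the first column of the 6-element row implies. The names `matches`, `summary` and `composite_feature_index` are hypothetical and are not part of PRINTS.

```python
from collections import Counter

N_MOTIFS = 7

# Sequences listed under "True Positives": each matches all 7 elements.
full_matches = [
    "AGAL_BACSU", "AGAL_ECOLI", "CELF_BACSU", "CELF_ECOLI",
    "GLVG_BACSU", "GLVG_ECOLI", "LPLD_BACSU", "MALH_FUSMR",
]

# Map each sequence code to the set of motif numbers it matches.
matches = {code: set(range(1, N_MOTIFS + 1)) for code in full_matches}
# Assumption: the 6-element partial match is missing motif 1.
matches["O88026"] = set(range(2, N_MOTIFS + 1))


def summary(matches):
    """Count sequences by the number of elements they match."""
    return Counter(len(motifs) for motifs in matches.values())


def composite_feature_index(matches, n_motifs=N_MOTIFS):
    """Row k, column m: sequences matching exactly k elements, including motif m."""
    table = {k: [0] * n_motifs for k in range(2, n_motifs + 1)}
    for motifs in matches.values():
        k = len(motifs)
        if k in table:
            for m in motifs:
                table[k][m - 1] += 1
    return table


codes = summary(matches)
cfi = composite_feature_index(matches)
for k in range(N_MOTIFS, 1, -1):
    print(k, codes.get(k, 0), cfi[k])
```

Under these assumptions the 7-element row holds 8 in every column and the 6-element row holds 0 for motif 1 and 1 for motifs 2 to 7, in agreement with the index shown above.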
True Positives
AGAL_BACSU    AGAL_ECOLI    CELF_BACSU    CELF_ECOLI
GLVG_BACSU    GLVG_ECOLI    LPLD_BACSU    MALH_FUSMR
True Positive Partials
Codes involving 6 elements
O88026
Sequence Titles
AGAL_BACSU ALPHA-GALACTOSIDASE (EC 3.2.1.22) (MELIBIASE) - BACILLUS SUBTILIS.
AGAL_ECOLI ALPHA-GALACTOSIDASE (EC 3.2.1.22) (MELIBIASE) - ESCHERICHIA COLI.
CELF_BACSU PROBABLE 6-PHOSPHO-BETA-GLUCOSIDASE (EC 3.2.1.86) - BACILLUS SUBTILIS.
CELF_ECOLI 6-PHOSPHO-BETA-GLUCOSIDASE (EC 3.2.1.86) - ESCHERICHIA COLI.
GLVG_BACSU MALTOSE-6'-PHOSPHATE GLUCOSIDASE (EC 3.2.1.122) (6-PHOSPHO-ALPHA-D-GLUCOSIDASE) - BACILLUS SUBTILIS.
GLVG_ECOLI PROBABLE 6-PHOSPHO-BETA-GLUCOSIDASE (EC 3.2.1.86) - ESCHERICHIA COLI.
LPLD_BACSU LPLD PROTEIN - BACILLUS SUBTILIS.
MALH_FUSMR MALTOSE-6'-PHOSPHATE GLUCOSIDASE (EC 3.2.1.122) (6-PHOSPHO-ALPHA-D-GLUCOSIDASE) - FUSOBACTERIUM MORTIFERUM.

O88026 PUTATIVE GLUCOSIDASE - STREPTOMYCES COELICOLOR.
Scan History
OWL29_3      1  100  NSINGLE
SPTR37_9f    2   14  NSINGLE
Initial Motifs
Motif 1 width=16
Element Seqn Id St Int Rpt
SIVIAGGGSTFTPGIV GLVG_BACSU 7 7 -
SVVVAGGGSTFTPGIV GLVG_ECOLI 5 5 -
KIVTIGGGSSYTPELV CELF_BACSU 6 6 -
KVVTIGGGSSYTPELL CELF_ECOLI 6 6 -
KITFIGAGSTIFVKNI AGAL_ECOLI 6 6 -
KIAYIGGGSQGWARSL LPLD_BACSU 11 11 -
KVVTIGGGSSYTPELL D908165 6 6 -

Motif 2 width=17
Element Seqn Id St Int Rpt
AFTDVDFVMAHIRVGKY GLVG_BACSU 76 53 -
ALKDADFVTTQFRVGLL CELF_BACSU 77 55 -
ALKDADFVTTQLRVGQL D908165 77 55 -
ALSAADIVIISILPGSL LPLD_BACSU 76 49 -
ALEDADFVVVAFQIGGY AGAL_ECOLI 75 53 -
ALKDADFVTTQLRVGQL CELF_ECOLI 77 55 -
AFSDVDFVMAHIRVGKY GLVG_ECOLI 74 53 -

Motif 3 width=14
Element Seqn Id St Int Rpt
VDVHLPERCGIYQS LPLD_BACSU 97 4 -
LDERIPLSHGYLGQ D908165 98 4 -
LDEQIPLKYGVVGQ GLVG_BACSU 97 4 -
LDEKIPLRHGVVGQ GLVG_ECOLI 95 4 -
KDERIPLKYGVIGQ CELF_BACSU 98 4 -
LDERIPLSHGYLGQ CELF_ECOLI 98 4 -
TDFEVCKRHGLEQT AGAL_ECOLI 97 5 -

Motif 4 width=21
Element Seqn Id St Int Rpt
ETNGAGGLFKGLRTIPVIFDI CELF_ECOLI 112 0 -
DTLGPGGIMRALRTIPHLWQI AGAL_ECOLI 113 2 -
DTVGPGGIIRGLRAVPIFAEI LPLD_BACSU 113 2 -
ETNGAGGLFKGLRTIPVIFDI D908165 112 0 -
ETCGPGGIAYGMRSIGGVLEI GLVG_BACSU 111 0 -
ETCGPGGIAYGMRSIGGVLEL GLVG_ECOLI 109 0 -
ETNGPGGLFKGLRTIPVILEI CELF_BACSU 112 0 -

Motif 5 width=18
Element Seqn Id St Int Rpt
PNAWVINFTNPAGMVTEA CELF_ECOLI 141 8 -
PESWVINYTNPMYVCTRV LPLD_BACSU 142 8 -
PNAWVINFTNPAGMVTEA D908165 141 8 -
PDAWMLNYSNPAAIVAEA GLVG_BACSU 140 8 -
PNAWMLNYSNPAAIVAEA GLVG_ECOLI 138 8 -
PNAWLVNFTNPAGMVTEA CELF_BACSU 141 8 -
PDATMLNYVNPMAMNTWA AGAL_ECOLI 142 8 -

Motif 6 width=12
Element Seqn Id St Int Rpt
KQVGLCHSVQGT AGAL_ECOLI 168 8 -
RFIGVCNIPIGM D908165 167 8 -
KILNICDMPVGI GLVG_BACSU 166 8 -
KILNICDMPIGI GLVG_ECOLI 164 8 -
KVVGLCNVPIGI CELF_BACSU 167 8 -
RFIGVCNIPIGM CELF_ECOLI 167 8 -
KAIGCCHEVFGT LPLD_BACSU 168 8 -

Motif 7 width=13
Element Seqn Id St Int Rpt
VEVQFAGLNHMVF CELF_BACSU 193 14 -
LRYRCAGINHMAF AGAL_ECOLI 194 14 -
IRVNVLGINHFTW LPLD_BACSU 201 21 -
MKVRYYGLNHFGW GLVG_BACSU 193 15 -
LSIDLFGLNHMVF D908165 194 15 -
MRVRYYGLNHWWS GLVG_ECOLI 191 15 -
LSIDLFGLNHMVF CELF_ECOLI 194 15 -
Final Motifs
Motif 1 width=16
Element Seqn Id St Int Rpt
SIVIAGGGSTFTPGIV GLVG_BACSU 7 7 -
SVVVAGGGSTFTPGIV GLVG_ECOLI 5 5 -
SILIAGGGSTFTPGII MALH_FUSMR 5 5 -
KIVTIGGGSSYTPELV CELF_BACSU 6 6 -
KVVTIGGGSSYTPELL CELF_ECOLI 6 6 -
KITFIGAGSTIFAKNV AGAL_BACSU 3 3 -
KITFIGAGSTIFVKNI AGAL_ECOLI 6 6 -
KIAYIGGGSQGWARSL LPLD_BACSU 11 11 -
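To see which positions of a motif are invariant across the true set, the aligned elements can be compared column by column. The following is a minimal sketch using the eight final Motif 1 elements copied from the table above; the helper name `conserved_columns` is hypothetical, and a strict identity test is only one of several possible conservation measures.

```python
# Final Motif 1 elements, keyed by sequence code (copied from the table).
MOTIF_1 = {
    "GLVG_BACSU": "SIVIAGGGSTFTPGIV",
    "GLVG_ECOLI": "SVVVAGGGSTFTPGIV",
    "MALH_FUSMR": "SILIAGGGSTFTPGII",
    "CELF_BACSU": "KIVTIGGGSSYTPELV",
    "CELF_ECOLI": "KVVTIGGGSSYTPELL",
    "AGAL_BACSU": "KITFIGAGSTIFAKNV",
    "AGAL_ECOLI": "KITFIGAGSTIFVKNI",
    "LPLD_BACSU": "KIAYIGGGSQGWARSL",
}


def conserved_columns(elements):
    """Return (1-based position, residue) for columns where every element agrees."""
    columns = zip(*elements)  # transpose: one tuple of residues per column
    return [
        (position, column[0])
        for position, column in enumerate(columns, start=1)
        if len(set(column)) == 1
    ]


invariant = conserved_columns(MOTIF_1.values())
print(invariant)
```

For these eight elements the only fully invariant columns are G at positions 6 and 8 and S at position 9; position 7 is G in six elements and A in the two alpha-galactosidases.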

Motif 2 width=17
Element Seqn Id St Int Rpt
AFTDVDFVMAHIRVGKY GLVG_BACSU 76 53 -
AFSDVDFVMAHIRVGKY GLVG_ECOLI 74 53 -
AFTDIDFVMAHIRVGKY MALH_FUSMR 74 53 -
ALKDADFVTTQFRVGLL CELF_BACSU 77 55 -
ALKDADFVTTQLRVGQL CELF_ECOLI 77 55 -
ALQNAGYVINAIQVGGY AGAL_BACSU 72 53 -
ALEDADFVVVAFQIGGY AGAL_ECOLI 75 53 -
ALSAADIVIISILPGSL LPLD_BACSU 76 49 -

Motif 3 width=14
Element Seqn Id St Int Rpt
LDEQIPLKYGVVGQ GLVG_BACSU 97 4 -
LDEKIPLRHGVVGQ GLVG_ECOLI 95 4 -
LDEKIPLRHGVVGQ MALH_FUSMR 95 4 -
KDERIPLKYGVIGQ CELF_BACSU 98 4 -
LDERIPLSHGYLGQ CELF_ECOLI 98 4 -
IDFEIPKRYGLRQT AGAL_BACSU 94 5 -
TDFEVCKRHGLEQT AGAL_ECOLI 97 5 -
VDVHLPERCGIYQS LPLD_BACSU 97 4 -

Motif 4 width=21
Element Seqn Id St Int Rpt
ETCGPGGIAYGMRSIGGVLEI GLVG_BACSU 111 0 -
ETCGPGGIAYGMRSIGGVLEL GLVG_ECOLI 109 0 -
ETCGPGGIAYGMRSIGGVIGL MALH_FUSMR 109 0 -
ETNGPGGLFKGLRTIPVILEI CELF_BACSU 112 0 -
ETNGAGGLFKGLRTIPVIFDI CELF_ECOLI 112 0 -
DTVGIGGIFRSLRTIPVLFDI AGAL_BACSU 110 2 -
DTLGPGGIMRALRTIPHLWQI AGAL_ECOLI 113 2 -
DTVGPGGIIRGLRAVPIFAEI LPLD_BACSU 113 2 -

Motif 5 width=18
Element Seqn Id St Int Rpt
PDAWMLNYSNPAAIVAEA GLVG_BACSU 140 8 -
PNAWMLNYSNPAAIVAEA GLVG_ECOLI 138 8 -
PNAWMLNYSNPAAIVAEA MALH_FUSMR 138 8 -
PNAWLVNFTNPAGMVTEA CELF_BACSU 141 8 -
PNAWVINFTNPAGMVTEA CELF_ECOLI 141 8 -
PDAWFLNYTNPMATLTGA AGAL_BACSU 139 8 -
PDATMLNYVNPMAMNTWA AGAL_ECOLI 142 8 -
PESWVINYTNPMYVCTRV LPLD_BACSU 142 8 -

Motif 6 width=12
Element Seqn Id St Int Rpt
KILNICDMPVGI GLVG_BACSU 166 8 -
KILNICDMPIGI GLVG_ECOLI 164 8 -
KVLNICDMPIGI MALH_FUSMR 164 8 -
KVVGLCNVPIGI CELF_BACSU 167 8 -
RFIGVCNIPIGM CELF_ECOLI 167 8 -
KTIGLCHSVQVC AGAL_BACSU 164 7 -
KQVGLCHSVQGT AGAL_ECOLI 168 8 -
KAIGCCHEVFGT LPLD_BACSU 168 8 -

Motif 7 width=13
Element Seqn Id St Int Rpt
MKVRYYGLNHFGW GLVG_BACSU 193 15 -
MRVRYYGLNHWWS GLVG_ECOLI 191 15 -
MDIMYYGLNHFGW MALH_FUSMR 191 15 -
VEVQFAGLNHMVF CELF_BACSU 193 14 -
LSIDLFGLNHMVF CELF_ECOLI 194 15 -
IEERIAGINHMAW AGAL_BACSU 190 14 -
LRYRCAGINHMAF AGAL_ECOLI 194 14 -
IRVNVLGINHFTW LPLD_BACSU 201 21 -
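In the motif tables, the St and Int columns appear to be related by simple arithmetic: for motifs 2 to 7, Int equals the start of the motif minus the start of the preceding motif minus that motif's width, and for motif 1 Int equals St. This reading is inferred from the values in the tables rather than taken from PRINTS documentation. The sketch below checks it for two sequences from the final motif set; the function name `expected_intervals` is hypothetical.

```python
# Motif widths 1..7, from the "Motif n width=" lines.
WIDTHS = [16, 17, 14, 21, 18, 12, 13]

# (St, Int) per motif, copied from the Final Motifs tables.
ELEMENTS = {
    "GLVG_BACSU": [(7, 7), (76, 53), (97, 4), (111, 0), (140, 8), (166, 8), (193, 15)],
    "LPLD_BACSU": [(11, 11), (76, 49), (97, 4), (113, 2), (142, 8), (168, 8), (201, 21)],
}


def expected_intervals(starts, widths):
    """Recompute Int from St values and motif widths."""
    # Assumption: for the first motif the tables simply repeat St as Int.
    intervals = [starts[0]]
    for i in range(1, len(starts)):
        previous_end = starts[i - 1] + widths[i - 1]  # first residue after motif i-1
        intervals.append(starts[i] - previous_end)
    return intervals


for code, rows in ELEMENTS.items():
    starts = [st for st, _ in rows]
    listed = [interval for _, interval in rows]
    print(code, expected_intervals(starts, WIDTHS) == listed)
```

An interval of 0, as between motifs 3 and 4 of GLVG_BACSU, means the two motifs are directly adjacent in the sequence.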