WORKLIST ENTRIES (1):
GLHYDRLASE15 View alignment View Structure Glycosyl hydrolase family 15 signature
Type of fingerprint: COMPOUND with 7 elements
Links:
PRINTS; PR00131 GLHYDRLASE1; PR00132 GLHYDRLASE2; PR00133 GLHYDRLASE3
PRINTS; PR00732 GLHYDRLASE4; PR00733 GLHYDRLASE6; PR00734 GLHYDRLASE7
PRINTS; PR00735 GLHYDRLASE8; PR00134 GLHYDRLASE10; PR00911 GLHYDRLASE11
PRINTS; PR00737 GLHYDRLASE16; PR00738 GLHYDRLASE20; PR00739 GLHYDRLASE26
PRINTS; PR00740 GLHYDRLASE27; PR00741 GLHYDRLASE29; PR00843 GLHYDRLASE30
PRINTS; PR00742 GLHYDRLASE35; PR00743 GLHYDRLASE36; PR00744 GLHYDRLASE37
PRINTS; PR00745 GLHYDRLASE39; PR00746 GLHYDRLASE41; PR00747 GLHYDRLASE47
PRINTS; PR00844 GLHYDRLASE48; PR00845 GLHYDRLASE52; PR00846 GLHYDRLASE56
PRINTS; PR00849 GLHYDRLASE58; PR00850 GLHYDRLASE59; PR00748 MELIBIASE
PRINTS; PR00137 LYSOZYME; PR00684 T4LYSOZYME; PR00749 LYSOZYMEG
PRINTS; PR00110 ALPHAAMYLASE; PR00750 BETAAMYLASE
INTERPRO; IPR000165
PROSITE; PS00820 GLUCOAMYLASE
PFAM; PF00723 glycosyl_hydr10
PDB; 1AGM 3Dinfo
SCOP; 1AGM
CATH; 1AGM
Creation date 05-JUN-1997; UPDATE 10-JUN-1999
1. HENRISSAT, B. AND BAIROCH, A.
New families in the classification of glycosyl hydrolases based on amino
acid sequence similarities.
BIOCHEM.J. 293 781-788 (1993).
2. HENRISSAT, B.
A classification of glycosyl hydrolases based on amino acid sequence
similarities.
BIOCHEM.J. 280 309-316 (1991).
3. DAVIES, G. AND HENRISSAT, B.
Structures and mechanisms of glycosyl hydrolases.
STRUCTURE 3 853-859 (1995).
4. HENRISSAT, B. AND BAIROCH, A.
Updating the sequence-based classification of glycosyl hydrolases.
BIOCHEM.J. 316 695-696 (1996).
5. SIERKS, M.R., FORD, C., REILLY, P.J. AND SVENSSON, B.
Catalytic mechanism of fungal glucoamylase as defined by mutagenesis
of Asp176, Glu179 and Glu180 in the enzyme from Aspergillus awamori.
PROTEIN ENG. 3 193-198 (1990).
6. OHNISHI, H., KITAMURA, H., MINOWA, T., SAKAI, H. AND OHTA, T.
Molecular-cloning of a glucoamylase gene from a thermophilic Clostridium
and kinetics of the cloned enzyme.
EUR.J.BIOCHEM. 207 413-418 (1992).
7. ALESHIN, A.E., FIRSOV, L.M. AND HONZATKO, R.B.
Refined structure for the complex of acarbose with glucoamylase from
Aspergillus awamori var. x100 to 2.4A resolution.
J.BIOL.CHEM. 269(22) 15631-15639 (1994).
O-Glycosyl hydrolases (EC 3.2.1.-) are a widespread group of enzymes that
hydrolyse the glycosidic bond between two or more carbohydrates, or between
a carbohydrate and a non-carbohydrate moiety. A classification system for
glycosyl hydrolases, based on sequence similarity, has led to the definition
of up to 60 different families [1-4] (http://expasy.hcuge.ch/cgi-bin/lists?
glycosid.txt).
Family 15 encompasses the glucoamylases (GA). GA catalyses the release of
D-glucose from the non-reducing ends of starch and other oligo- or poly-
saccharides. Studies of fungal GA have indicated 3 closely-clustered acidic
residues that play a role in the catalytic mechanism [5]. This region is
also conserved in a recently sequenced bacterial GA [6].
The 3D structure of the pseudo-tetrasaccharide acarbose complexed with
glucoamylase II(471) from Aspergillus awamori var. X100 has been determined
to 2.4A resolution [7]. The protein belongs to the mainly-alpha class, and
contains 19 helices and 9 strands.
GLHYDRLASE15 is a 7-element fingerprint that provides a signature for
family 15 glycosyl hydrolases. The fingerprint was derived from an initial
alignment of 6 sequences: the motifs were drawn from short conserved regions
spanning the N-terminal half of the alignment - motif 2 includes part of
helix 5 and the second beta-strand; motif 3 encompasses most of helix 6;
motif 4 spans strands 3 and 4, and includes the region encoded by PROSITE
pattern GLUCOAMYLASE (PS00820), which contains the catalytic cluster of
acidic residues; motif 5 includes part of helix 13; motif 6 spans the C-
terminus of helix 17 and strand 8; and motif 7 spans helix 18. Two
iterations on OWL29.3 was required to reach convergence, at which point
a true set comprising 18 sequences was identified. Several partial matches
were also found, all of which are related sequences that fail to make
significant matches with one or more motifs.
An update on SPTR37_9f identified a true set of 18 sequences, and 3
partial matches.
SUMMARY INFORMATION
18 codes involving 7 elements
0 codes involving 6 elements
3 codes involving 5 elements
0 codes involving 4 elements
0 codes involving 3 elements
0 codes involving 2 elements
COMPOSITE FINGERPRINT INDEX
7| 18 18 18 18 18 18 18
6| 0 0 0 0 0 0 0
5| 3 3 3 3 3 0 0
4| 0 0 0 0 0 0 0
3| 0 0 0 0 0 0 0
2| 0 0 0 0 0 0 0
--+------------------------------------
| 1 2 3 4 5 6 7
True positives..
Q12537 AMYG_ASPAK Q92201 AMYG_ASPNG
AMYG_ASPSH Q02296 AMYG_ASPOR AMYG_NEUCR
Q12596 Q12623 AMYG_HORRE O59846
AMYH_SACFI AMYG_SACFI O60087 AMYG_YEAST
AMYG_RHIOR AMYG_ARXAD
Subfamily: Codes involving 5 elements
Subfamily True positives..
AMYH_SACDI Q92314 AMYI_SACDI
PROTEIN TITLES
Q12537 GLUCOAMYLASE PRECURSOR (EC 3.2.1.3) (GLUCAN 1,4-ALPHA-GLUCOS
AMYG_ASPAK GLUCOAMYLASE I PRECURSOR (EC 3.2.1.3) (GLUCAN 1,4-ALPHA-GLUC
Q92201 GLUCOAMYLASE G1 AND G2 PRECURSOR (EC 3.2.1.3) (GLUCAN 1,4-AL
AMYG_ASPNG GLUCOAMYLASE G1 AND G2 PRECURSOR (EC 3.2.1.3) (GLUCAN 1,4-AL
AMYG_ASPSH GLUCOAMYLASE PRECURSOR (EC 3.2.1.3) (GLUCAN 1,4-ALPHA-GLUCOS
Q02296 GLUCOAMYLASE - ASPERGILLUS NIGER.
AMYG_ASPOR GLUCOAMYLASE PRECURSOR (EC 3.2.1.3) (GLUCAN 1,4-ALPHA-GLUCOS
AMYG_NEUCR GLUCOAMYLASE PRECURSOR (EC 3.2.1.3) (GLUCAN 1,4-ALPHA-GLUCOS
Q12596 GLUCOAMYLASE G2 (EC 3.2.1.3) - CORTICIUM ROLFSII.
Q12623 GLUCOAMYLASE (EC 3.2.1.3) (GLUCAN 1,4-ALPHA-GLUCOSIDASE) (1,
AMYG_HORRE GLUCOAMYLASE P PRECURSOR (EC 3.2.1.3) (GLUCAN 1,4-ALPHA-GLUC
O59846 GLUCOAMYLASE - ASPERGILLUS ORYZAE.
AMYH_SACFI GLUCOAMYLASE GLA1 PRECURSOR (EC 3.2.1.3) (GLUCAN 1,4-ALPHA-
AMYG_SACFI GLUCOAMYLASE GLU1 PRECURSOR (EC 3.2.1.3) (GLUCAN 1,4-ALPHA-
O60087 GLUCOAMYLASE - SCHIZOSACCHAROMYCES POMBE (FISSION YEAST).
AMYG_YEAST GLUCOAMYLASE, INTRACELLULAR SPORULATION-SPECIFIC (EC 3.2.1.3
AMYG_RHIOR GLUCOAMYLASE PRECURSOR (EC 3.2.1.3) (GLUCAN 1,4-ALPHA-GLUCOS
AMYG_ARXAD GLUCOAMYLASE PRECURSOR (EC 3.2.1.3) (GLUCAN 1,4-ALPHA-GLUCOS
AMYH_SACDI GLUCOAMYLASE S1 PRECURSOR (EC 3.2.1.3) (GLUCAN 1,4-ALPHA-GLU
Q92314 GLUCOAMYLASE S1 PRECURSOR (EC 3.2.1.3) (GLUCAN 1,4-ALPHA-GLU
AMYI_SACDI GLUCOAMYLASE S2 PRECURSOR (EC 3.2.1.3) (GLUCAN 1,4-ALPHA-GLU
SCAN HISTORY
OWL29_3 2 100 NSINGLE
SPTR37_9f 2 21 NSINGLE
INITIAL MOTIF SETS
GLHYDRLASE151 Length of motif = 18 Motif number = 1
Glycosyl hydrolase family 15 motif I - 1
PCODE ST INT
GADSGIVVASPSTDNPDY AMYG_ASPAK 55 55
GADSGIVVASPSTDNPDY AMYG_ASPNG 55 55
GASPGVVIASPSKSDPDY AMYG_ASPOR 58 58
GAGAGFVVASPSKANPDY AMYG_HORRE 59 59
GAASGVVVASPSKSSPDW AMYG_NEUCR 65 65
SISPGVVIASPSQTHPDY AMYG_YEAST 108 108
GLHYDRLASE152 Length of motif = 18 Motif number = 2
Glycosyl hydrolase family 15 motif II - 1
PCODE ST INT
LGEPKFNVDETAYTGSWG AMYG_ASPAK 127 54
LGEPKFNVDETAYTGSWG AMYG_ASPNG 128 55
LGEPKFNVDETAFTGAWG AMYG_ASPOR 130 54
LGEPKFMVDGTRFNGPWG AMYG_HORRE 133 56
LGEPKFMVDLQQFTGAWG AMYG_NEUCR 138 55
LGDPKWNVDNTAFTEDWG AMYG_YEAST 182 56
GLHYDRLASE153 Length of motif = 20 Motif number = 3
Glycosyl hydrolase family 15 motif III - 1
PCODE ST INT
RPQRDGPALRATAMIGFGQV AMYG_ASPAK 145 0
RPQRDGPALRATAMIGFGQW AMYG_ASPNG 146 0
RPQRDGPALRATAMISFGEW AMYG_ASPOR 148 0
RPQRDGPALRAIALMTYSNW AMYG_HORRE 151 0
RPQRDGPPLRAIALIGYGKW AMYG_NEUCR 156 0
RPQNDGPALRSIAILKIIDY AMYG_YEAST 200 0
GLHYDRLASE154 Length of motif = 19 Motif number = 4
Glycosyl hydrolase family 15 motif IV - 1
PCODE ST INT
WNQTGYDLWEEVNGSSFFT AMYG_ASPAK 193 28
WNQTGYDLWEEVNGSSFFT AMYG_ASPNG 194 28
WSQSGFDLWEEVQGTSFFT AMYG_ASPOR 196 28
WNQSGFDLWEETYASSFFT AMYG_HORRE 199 28
WNNTGFDLWEEVNSSSFFT AMYG_NEUCR 204 28
WNSSGFDLWEEVNGMHFFT AMYG_YEAST 255 35
GLHYDRLASE155 Length of motif = 20 Motif number = 5
Glycosyl hydrolase family 15 motif V - 1
PCODE ST INT
NDGLSDSEAVAVGRYPEDSY AMYG_ASPAK 315 103
NDGLSDSEAVAVGRYPEDTY AMYG_ASPNG 316 103
NSGRAENQAVAVGRYPEDSY AMYG_ASPOR 318 103
NAGIPEGQGVAVGRYAEDVY AMYG_HORRE 322 104
NSGRTAGKAAAVGRYAEDVY AMYG_NEUCR 326 103
NDSSKNATGIALGRYPEDVY AMYG_YEAST 388 114
GLHYDRLASE156 Length of motif = 22 Motif number = 6
Glycosyl hydrolase family 15 motif VI - 1
PCODE ST INT
ADGFVSIVETHAASNGSLSEQF AMYG_ASPAK 404 69
ADGFVSIVETHAASNGSMSEQY AMYG_ASPNG 405 69
ADGYVQIVQTYAASTGSMAEQY AMYG_ASPOR 407 69
ADSYVAIAEKYIPSNGSLSEQF AMYG_HORRE 413 71
ADGFVDIVAQYTPSDGSLAEQF AMYG_NEUCR 415 69
ADSFLVKLKAHVGTDGELSEQF AMYG_YEAST 494 86
GLHYDRLASE157 Length of motif = 16 Motif number = 7
Glycosyl hydrolase family 15 motif VII - 1
PCODE ST INT
DLTWSYAALLTANNRR AMYG_ASPAK 437 11
DLTWSYAALLTANNRR AMYG_ASPNG 438 11
DLTWSYAALLTANNRR AMYG_ASPOR 440 11
DLTWSYAAFITMSQRR AMYG_HORRE 446 11
HLTWSYASFLSAAARR AMYG_NEUCR 448 11
HLTWSYTSFWDAYQIR AMYG_YEAST 527 11
FINAL MOTIF SETS
GLHYDRLASE151 Length of motif = 18 Motif number = 1
Glycosyl hydrolase family 15 motif I - 2
PCODE ST INT
GADSGIVVASPSTDNPDY Q12537 55 55
GADSGIVVASPSTDNPDY AMYG_ASPAK 55 55
GADSGIVVASPSTDNPDY AMYG_ASPNG 55 55
GADSGIVVASPSTDNPDY Q92201 55 55
GADSGIVVASPSTDNPDY AMYG_ASPSH 55 55
GADSGIVVASPSTDNPDY Q02296 55 55
GASPGVVIASPSKSDPDY AMYG_ASPOR 58 58
GAASGVVVASPSKSSPDW AMYG_NEUCR 65 65
GAYSGIVIASPSKTSPDY Q12596 51 51
GAAAGVVIASPSRTDPPY Q12623 61 61
GAGAGFVVASPSKANPDY AMYG_HORRE 59 59
GAAAGIVVASPSKSNPDY O59846 58 58
DGVPGTVIASPSTSNPDY AMYH_SACFI 73 73
NGVPGTVIASPSTSNPDY AMYG_SACFI 73 73
DINPGCIIASPSTDSPDY O60087 59 59
SISPGVVIASPSQTHPDY AMYG_YEAST 108 108
GSATGFIAASLSTAGPDY AMYG_RHIOR 193 193
GAAPGTVIAAQSYSEPDY AMYG_ARXAD 187 187
GLHYDRLASE152 Length of motif = 18 Motif number = 2
Glycosyl hydrolase family 15 motif II - 2
PCODE ST INT
LGEPKFNVDETAYTGSWG Q12537 127 54
LGEPKFNVDETAYTGSWG AMYG_ASPAK 127 54
LGEPKFNVDETAYTGSWG AMYG_ASPNG 128 55
LGEPKFNVDETAYTGSWG Q92201 128 55
LGEPKFNVDETAYAGSWG AMYG_ASPSH 127 54
LGEPKFNVDETAYTGSWG Q02296 128 55
LGEPKFNVDETAFTGAWG AMYG_ASPOR 130 54
LGEPKFMVDLQQFTGAWG AMYG_NEUCR 138 55
LGEPKFNIDETAFTGAWG Q12596 124 55
LGEAKFNVDLTAFTGEWG Q12623 121 42
LGEPKFMVDGTRFNGPWG AMYG_HORRE 133 56
LAEPKFYVNISQFTDSWG O59846 131 55
LGEPKFNTDGSAYTGAWG AMYH_SACFI 150 59
LGEPKFNTDGSAYTGAWG AMYG_SACFI 150 59
LGEPKFNVDGTSYDGDWG O60087 131 54
LGDPKWNVDNTAFTEDWG AMYG_YEAST 182 56
LGEPKFNPDASGYTGAWG AMYG_RHIOR 263 52
MGEPKFYLNNTAFTGSWG AMYG_ARXAD 259 54
GLHYDRLASE153 Length of motif = 20 Motif number = 3
Glycosyl hydrolase family 15 motif III - 2
PCODE ST INT
RPQRDGPALRATAMIGFGQW Q12537 145 0
RPQRDGPALRATAMIGFGQV AMYG_ASPAK 145 0
RPQRDGPALRATAMIGFGQW AMYG_ASPNG 146 0
RPQRDGPALRATAMIGFGQW Q92201 146 0
RPQRDGPALRATAMIGFGQW AMYG_ASPSH 145 0
RPQRDGPALRATAMIGFGQW Q02296 146 0
RPQRDGPALRATAMISFGEW AMYG_ASPOR 148 0
RPQRDGPPLRAIALIGYGKW AMYG_NEUCR 156 0
RPQRDGPALRATAIMTYATY Q12596 142 0
RPQRDGPPLRAIALIQYAKW Q12623 139 0
RPQRDGPALRAIALMTYSNW AMYG_HORRE 151 0
RPQRDGPALRASALIAYGNS O59846 149 0
RPQNDGPALRAYAISRYLND AMYH_SACFI 168 0
RPQNDGPALRAYAISRYLND AMYG_SACFI 168 0
RPQNDSPALRAIAFIKYMNY O60087 149 0
RPQNDGPALRSIAILKIIDY AMYG_YEAST 200 0
RPQNDGPAERATTFILFADS AMYG_RHIOR 281 0
RPQNDGPATRAITLIEFANA AMYG_ARXAD 277 0
GLHYDRLASE154 Length of motif = 19 Motif number = 4
Glycosyl hydrolase family 15 motif IV - 2
PCODE ST INT
WNQTGYDLWEEVNGSSFFT Q12537 193 28
WNQTGYDLWEEVNGSSFFT AMYG_ASPAK 193 28
WNQTGYDLWEEVNGSSFFT AMYG_ASPNG 194 28
WNQTGYDLWEEVNGSSFFT Q92201 194 28
WNQTGYDLWEEVNGSSFFT AMYG_ASPSH 193 28
WNQTGYDLWEVNGSSFFTI Q02296 194 28
WSQSGFDLWEEVQGTSFFT AMYG_ASPOR 196 28
WNNTGFDLWEEVNSSSFFT AMYG_NEUCR 204 28
WNQTTFDLWEEVDSSSFFT Q12596 190 28
WNETGFDLWEEVPGSSFFT Q12623 187 28
WNQSGFDLWEETYASSFFT AMYG_HORRE 199 28
WNQTGFDLWEEVQGSSFFT O59846 197 28
WDSTGFDLWEENQGRHFFT AMYH_SACFI 228 40
WDSTGFDLWEENQGRHFFT AMYG_SACFI 228 40
WTEASFDLWEEIKDVHYFT O60087 197 28
WNSSGFDLWEEVNGMHFFT AMYG_YEAST 255 35
WSNGCFDLWEEVNGVHFYT AMYG_RHIOR 330 29
WSSPSFDLWEEEESAHFYT AMYG_ARXAD 334 37
GLHYDRLASE155 Length of motif = 20 Motif number = 5
Glycosyl hydrolase family 15 motif V - 2
PCODE ST INT
NDGLSDSEAVAVGRYPEDSY Q12537 315 103
NDGLSDSEAVAVGRYPEDSY AMYG_ASPAK 315 103
NDGLSDSEAVAVGRYPEDTY AMYG_ASPNG 316 103
NDGLSDSEAVAVGRYPEDTY Q92201 316 103
NDGLSDSEAVAVGRYPEDSY AMYG_ASPSH 315 103
NDGLSDSEAVAVGRYPEDTY Q02296 315 102
NSGRAENQAVAVGRYPEDSY AMYG_ASPOR 318 103
NSGRTAGKAAAVGRYAEDVY AMYG_NEUCR 326 103
NSGISSTSGVATGRYPEDSY Q12596 315 106
NKGIAQGKAVAVGRYSEDVY Q12623 313 107
NAGIPEGQGVAVGRYAEDVY AMYG_HORRE 322 104
NNGRGAGKAAAVGPYAEDTY O59846 319 103
SVNSAYSAGAAIGRYPEDVY AMYH_SACFI 359 112
SVNSAYSAGAAIGRYPEDVY AMYG_SACFI 359 112
DYPVNQGWKQAMGRYPEDVY O60087 308 92
NDSSKNATGIALGRYPEDVY AMYG_YEAST 388 114
NKNLPSYLGNSIGRYPEDTY AMYG_RHIOR 455 106
SDESGKPLGIPVGRYPEDVY AMYG_ARXAD 463 110
GLHYDRLASE156 Length of motif = 22 Motif number = 6
Glycosyl hydrolase family 15 motif VI - 2
PCODE ST INT
ADGFVSIVETHAASNGSLSEQF Q12537 404 69
ADGFVSIVETHAASNGSLSEQF AMYG_ASPAK 404 69
ADGFVSIVETHAASNGSMSEQY AMYG_ASPNG 405 69
ADGFVSIVETHAASNGSMSEQY Q92201 405 69
ADGFVSIVETHAASNGSLSEQF AMYG_ASPSH 404 69
ADGFVSIVETHAASNGSMSEQY Q02296 403 68
ADGYVQIVQTYAASTGSMAEQY AMYG_ASPOR 407 69
ADGFVDIVAQYTPSDGSLAEQF AMYG_NEUCR 415 69
ADEFVDIVAKYTPSSGFLSEQY Q12596 404 69
ADGFIEVAAKYTPSNGALAEQY Q12623 402 69
ADSYVAIAEKYIPSNGSLSEQF AMYG_HORRE 413 71
ADGFISVVQEYTPDGGALAEQY O59846 408 69
GDSFLQVILDHINDDGSLNEQL AMYH_SACFI 464 85
GDSFLQVILDHINDDGSLNEQL AMYG_SACFI 464 85
ADNFLKAVAEFQHPNGSMSEQF O60087 395 67
ADSFLVKLKAHVGTDGELSEQF AMYG_YEAST 494 86
ADRFLSTVQLHAHNNGSLAEEF AMYG_RHIOR 550 75
GDAFMRRAKYHTPSSGHMSEEF AMYG_ARXAD 560 77
GLHYDRLASE157 Length of motif = 16 Motif number = 7
Glycosyl hydrolase family 15 motif VII - 2
PCODE ST INT
DLTWSYAALLTANNRR Q12537 437 11
DLTWSYAALLTANNRR AMYG_ASPAK 437 11
DLTWSYAALLTANNRR AMYG_ASPNG 438 11
DLTWSYAALLTANNRR Q92201 438 11
DLTWSYAALLTANNRR AMYG_ASPSH 437 11
DLTWSYAALLTANNRR Q02296 436 11
DLTWSYAALLTANNRR AMYG_ASPOR 440 11
HLTWSYASFLSAAARR AMYG_NEUCR 448 11
NLTWSYAAAITAYQAR Q12596 437 11
DLTWSYSAFLSAIDRR Q12623 435 11
DLTWSYAAFITMSQRR AMYG_HORRE 446 11
DLTWSYAAFLSAVGRR O59846 441 11
SLTWSSGALLEAIRLR AMYH_SACFI 497 11
SLTWSSGALLEAIRLR AMYG_SACFI 497 11
DLTWSYSSLLNAIYRR O60087 428 11
HLTWSYTSFWDAYQIR AMYG_YEAST 527 11
DLTWSHASLITASYAK AMYG_RHIOR 583 11
DLTWSYASLLSAAFAR AMYG_ARXAD 593 11
User query: Display/Full Code "GLHYDRLASE15"