WORKLIST ENTRIES (1):
GLHYDRLASE37 View alignment Glycosyl hydrolase family 37 signature
Type of fingerprint: COMPOUND with 7 elements
Links:
PRINTS; PR00131 GLHYDRLASE1; PR00132 GLHYDRLASE2; PR00133 GLHYDRLASE3
PRINTS; PR00732 GLHYDRLASE4; PR00733 GLHYDRLASE6; PR00734 GLHYDRLASE7
PRINTS; PR00735 GLHYDRLASE8; PR00134 GLHYDRLASE10; PR00911 GLHYDRLASE11
PRINTS; PR00736 GLHYDRLASE15; PR00737 GLHYDRLASE16; PR00738 GLHYDRLASE20
PRINTS; PR00739 GLHYDRLASE26; PR00740 GLHYDRLASE27; PR00741 GLHYDRLASE29
PRINTS; PR00843 GLHYDRLASE30; PR00742 GLHYDRLASE35; PR00743 GLHYDRLASE36
PRINTS; PR00745 GLHYDRLASE39; PR00746 GLHYDRLASE41; PR00747 GLHYDRLASE47
PRINTS; PR00844 GLHYDRLASE48; PR00845 GLHYDRLASE52; PR00846 GLHYDRLASE56
PRINTS; PR00849 GLHYDRLASE58; PR00850 GLHYDRLASE59; PR00748 MELIBIASE
PRINTS; PR00137 LYSOZYME; PR00684 T4LYSOZYME; PR00749 LYSOZYMEG
PRINTS; PR00110 ALPHAAMYLASE; PR00750 BETAAMYLASE
INTERPRO; IPR001661
PROSITE; PS00927 TREHALASE_1; PS00928 TREHALASE_2
PFAM; PF01204 Trehalase
Creation date 12-JUN-1997; UPDATE 07-JUN-1999
1. HENRISSAT, B. AND BAIROCH, A.
New families in the classification of glycosyl hydrolases based on amino
acid sequence similarities.
BIOCHEM.J. 293 781-788 (1993).
2. HENRISSAT, B.
A classification of glycosyl hydrolases based on amino acid sequence
similarities.
BIOCHEM.J. 280 309-316 (1991).
3. DAVIES, G. AND HENRISSAT, B.
Structures and mechanisms of glycosyl hydrolases.
STRUCTURE 3 853-859 (1995).
4. HENRISSAT, B. AND BAIROCH, A.
Updating the sequence-based classification of glycosyl hydrolases.
BIOCHEM.J. 316 695-696 (1996).
5. KOPP, M., MUELLER, H. AND HOLZER, H.
Molecular analysis of the neutral trehalase gene from Saccharomyces
cerevisiae.
J.BIOL.CHEM. 268 4766-4774 (1993).
O-Glycosyl hydrolases (EC 3.2.1.-) are a widespread group of enzymes that
hydrolyse the glycosidic bond between two or more carbohydrates, or between
a carbohydrate and a non-carbohydrate moiety. A classification system for
glycosyl hydrolases, based on sequence similarity, has led to the definition
of up to 60 different families [1-4] (http://expasy.hcuge.ch/cgi-bin/lists?
glycosid.txt).
Family 37 encompasses trehalases. These enzymes are responsible for
degradation of the disaccharide alpha,alpha-trehalose, yielding two glucose
units [5]:
alpha,alpha-trehalose + H(2)O = 2 D-glucose
GLHYDRLASE37 is a 7-element fingerprint that provides a signature for
family 37 glycosyl hydrolases. The fingerprint was derived from an initial
alignment of 8 sequences: the motifs were drawn from conserved regions
spanning the C-terminal half of the alignment - motif 6 includes the
region encoded by PROSITE pattern TREHALASE_2(PS00928). Two iterations
on OWL29.3 were required to reach convergence, at which point a true set
comprising 14 sequences was identified. Three partial matches were also
found, all of which are fragments.
An update on SPTR37_9f identified a true set of 19 sequences, and 1
partial match.
SUMMARY INFORMATION
19 codes involving 7 elements
1 codes involving 6 elements
0 codes involving 5 elements
0 codes involving 4 elements
0 codes involving 3 elements
0 codes involving 2 elements
COMPOSITE FINGERPRINT INDEX
7| 19 19 19 19 19 19 19
6| 1 0 1 1 1 1 1
5| 0 0 0 0 0 0 0
4| 0 0 0 0 0 0 0
3| 0 0 0 0 0 0 0
2| 0 0 0 0 0 0 0
--+------------------------------------
| 1 2 3 4 5 6 7
True positives..
TREB_MAGGR TREB_EMENI TREB_NEUCR TREA_CANAL
TREA_KLULA TREA_YEAST TREA_SCHPO TREA_RABIT
TREB_YEAST TREA_BOMMO TREA_HUMAN Q27453
TREA_ECOLI Q22195 O22986 TREF_ECOLI
TREA_TENMO Q23176 O44736
Subfamily: Codes involving 6 elements
Subfamily True positives..
Q19472
PROTEIN TITLES
TREB_MAGGR NEUTRAL TREHALASE (EC 3.2.1.28) (ALPHA,ALPHA-TREHALASE) (ALP
TREB_EMENI NEUTRAL TREHALASE (EC 3.2.1.28) (ALPHA,ALPHA-TREHALASE) (ALP
TREB_NEUCR NEUTRAL TREHALASE (EC 3.2.1.28) (ALPHA,ALPHA-TREHALASE) (ALP
TREA_CANAL NEUTRAL TREHALASE (EC 3.2.1.28) (ALPHA,ALPHA-TREHALASE) (ALP
TREA_KLULA NEUTRAL TREHALASE (EC 3.2.1.28) (ALPHA,ALPHA-TREHALASE) (ALP
TREA_YEAST NEUTRAL TREHALASE (EC 3.2.1.28) (ALPHA,ALPHA-TREHALASE) (ALP
TREA_SCHPO NEUTRAL TREHALASE (EC 3.2.1.28) (ALPHA,ALPHA-TREHALASE) (ALP
TREA_RABIT TREHALASE PRECURSOR (EC 3.2.1.28) (ALPHA,ALPHA-TREHALASE) (A
TREB_YEAST PROBABLE TREHALASE (EC 3.2.1.28) (ALPHA,ALPHA-TREHALASE) (AL
TREA_BOMMO TREHALASE PRECURSOR (EC 3.2.1.28) (ALPHA,ALPHA-TREHALASE) (A
TREA_HUMAN TREHALASE PRECURSOR (EC 3.2.1.28) (ALPHA,ALPHA-TREHALASE) (A
Q27453 TREHALASE (EC 3.2.1.28) - BOMBYX MORI (SILK MOTH).
TREA_ECOLI PERIPLASMIC TREHALASE PRECURSOR (EC 3.2.1.28) (ALPHA,ALPHA-T
Q22195 SIMILAR TO TREHALASE PRECURSOR. NCBI GI: 1086612 - CAENORHAB
O22986 TREHALASE PRECUSOR ISOLOG - ARABIDOPSIS THALIANA (MOUSE-EAR
TREF_ECOLI PROBABLE CYTOPLASMIC TREHALASE (EC 3.2.1.28) (ALPHA,ALPHA-TR
TREA_TENMO TREHALASE PRECURSOR (EC 3.2.1.28) (ALPHA,ALPHA-TREHALASE) (A
Q23176 W05E10.4 PROTEIN - CAENORHABDITIS ELEGANS.
O44736 F57B10.7 PROTEIN - CAENORHABDITIS ELEGANS.
Q19472 F15A2.2 PROTEIN - CAENORHABDITIS ELEGANS.
SCAN HISTORY
OWL29_3 2 100 NSINGLE
SPTR37_9f 2 24 NSINGLE
INITIAL MOTIF SETS
GLHYDRLASE371 Length of motif = 19 Motif number = 1
Glycosyl hydrolase family 37 motif I - 1
PCODE ST INT
YGHIPNGNRSYYLSRSQPP TREA_ECOLI 191 191
YGHIPNGGRVYYLQRSQPP TREA_RABIT 207 207
YGHIPNGNRTYYLSRSQPP TREF_ECOLI 207 207
YGKILNANRSYYLCRSQPP TREA_KLULA 344 344
YGKILNANRSYYLCRSQPP TREA_YEAST 341 341
YSKILNANRSYYLCRSQPP TREB_YEAST 370 370
YGFIPNGARVFYLNRSQPP TREA_TENMO 203 203
FGHIPNGSRWYYQERSQPP TREA_BOMMO 200 200
GLHYDRLASE372 Length of motif = 18 Motif number = 2
Glycosyl hydrolase family 37 motif II - 1
PCODE ST INT
LNRYWDDRDTPRPESWVE TREA_ECOLI 266 56
LNRYHVPYGGPRPESYSK TREA_RABIT 275 49
LNRYWDDRDTPRDESWLE TREF_ECOLI 281 55
LSCYHPDGIGIPPETEPG TREA_KLULA 414 51
LSRYHPNGLGIPPETESD TREA_YEAST 411 51
LSCYHSDGIGIPPETEPD TREB_YEAST 440 51
LAQYNSNSGSPRPESYYE TREA_TENMO 271 49
LLRYYIPSAGPRPESYYE TREA_BOMMO 268 49
GLHYDRLASE373 Length of motif = 18 Motif number = 3
Glycosyl hydrolase family 37 motif III - 1
PCODE ST INT
YRDLRSAAASGWDFSSRW TREA_ECOLI 300 16
WAELKAGAESGWDFSSRW TREA_RABIT 309 16
YRDLRAGAASGWDYSSRW TREF_ECOLI 314 15
FLHDRGVRESGHDTTYRF TREA_KLULA 469 37
FLHDRGVRESGHDTTYRF TREA_YEAST 466 37
FLHDRAVRESGHDTTYRF TREB_YEAST 495 37
YMDLKSAAESGWDFSSRW TREA_TENMO 306 17
YADLKSAAESGWDFSTRW TREA_BOMMO 303 17
GLHYDRLASE374 Length of motif = 18 Motif number = 4
Glycosyl hydrolase family 37 motif IV - 1
PCODE ST INT
IVPVDLNSLMFKMEKILA TREA_ECOLI 332 14
LVPVDLNAFLCQAEELLS TREA_RABIT 344 17
FIPIDLNAFLFKLESAIA TREF_ECOLI 346 14
LATIDLNSLLYKYEVDIA TREA_KLULA 493 6
LATIDLNSLLYKYEIDIA TREA_YEAST 490 6
LATIDLNSLLYKYEKDIA TREB_YEAST 519 6
IIPVDLNAFLCQAFQKLS TREA_TENMO 343 19
VIPVDLNAIFAGALQITA TREA_BOMMO 340 19
GLHYDRLASE375 Length of motif = 17 Motif number = 5
Glycosyl hydrolase family 37 motif V - 1
PCODE ST INT
IEKYLWNDQQGWYADYD TREA_ECOLI 375 25
LTALLWDEDKGAWFDYD TREA_RABIT 387 25
VNRYLWDDENGIYRDYD TREF_ECOLI 389 25
IDKYLWDEETGFYYDYN TREA_KLULA 545 34
ITKYMWDDESGFFFDYN TREA_YEAST 543 35
INKYMWDEDSGFFFYYN TREB_YEAST 572 35
IEMVHYNRDDGIWYDWD TREA_TENMO 386 25
IEQALWDEEDGVWHDYD TREA_BOMMO 383 25
GLHYDRLASE376 Length of motif = 17 Motif number = 6
Glycosyl hydrolase family 37 motif VI - 1
PCODE ST INT
TSVKSGQQWDAPNGWAP TREA_ECOLI 439 47
SLQNTGQQWDFPNAWAP TREA_RABIT 454 50
SEYETGEQWDKPNGWAP TREF_ECOLI 452 46
SLDRPTRQWDYPFGWAP TREA_KLULA 618 56
SISRPIRQWDYPFGWAP TREA_YEAST 616 56
SIDRPIRQWDYPFGWAP TREB_YEAST 645 56
SLSHTGEQWDYPNAWPP TREA_TENMO 456 53
SLVNSGEQWDFPNAWPP TREA_BOMMO 453 53
GLHYDRLASE377 Length of motif = 14 Motif number = 7
Glycosyl hydrolase family 37 motif VII - 1
PCODE ST INT
DGFGWTNGVTLKML TREA_ECOLI 516 60
EGFGWTNGVALMLL TREA_RABIT 533 62
DGFGWTNGVVRRLI TREF_ECOLI 530 61
EGFGWVNSSYLLGM TREA_KLULA 706 71
EGFGWVNASYILGL TREA_YEAST 704 71
EGFGWVNTSYLLGL TREB_YEAST 733 71
SGFGWTNGVVLEFI TREA_TENMO 536 63
DGFGWSNGVVLEFL TREA_BOMMO 533 63
FINAL MOTIF SETS
GLHYDRLASE371 Length of motif = 19 Motif number = 1
Glycosyl hydrolase family 37 motif I - 2
PCODE ST INT
YGKILNATRSYYLCRSQPP TREB_MAGGR 318 318
YGKILNANRSYYLTRSQPP TREB_EMENI 328 328
YGKILNATRSYYLCRSQPP TREB_NEUCR 325 325
YGKILNANRSYYLGRSQPP TREA_CANAL 490 490
YGKILNANRSYYLCRSQPP TREA_KLULA 344 344
YGKILNANRSYYLCRSQPP TREA_YEAST 341 341
YGKILNANRTYYLLRSQPP TREA_SCHPO 322 322
YGHIPNGGRVYYLQRSQPP TREA_RABIT 207 207
YSKILNANRSYYLCRSQPP TREB_YEAST 370 370
FGHIPNGSRWYYQERSQPP TREA_BOMMO 200 200
YGHVPNGGRVYYLQRSQPP TREA_HUMAN 207 207
FGHIPNGSRWYYQERSQPP Q27453 200 200
YGHIPNGNRSYYLSRSQPP TREA_ECOLI 191 191
YGFIPNGTRVYYLNRSQPP Q22195 179 179
YGYALNGARAYYTNRSQPP O22986 202 202
YGHIPNGNRTYYLSRSQPP TREF_ECOLI 207 207
YGFIPNGARVFYLNRSQPP TREA_TENMO 203 203
YGFVPNGGRVYYLQRSQPP Q23176 195 195
HGFVPNGGRVYYLTRSQPP O44736 184 184
GLHYDRLASE372 Length of motif = 18 Motif number = 2
Glycosyl hydrolase family 37 motif II - 2
PCODE ST INT
LSRYRPEGAGVPPETEAD TREB_MAGGR 386 49
LSRYRSPGIGVPPETEAS TREB_EMENI 396 49
LSRYRPEGLGVPPETEAG TREB_NEUCR 393 49
LSCYHPEGKGIPPETEPT TREA_CANAL 563 54
LSCYHPDGIGIPPETEPG TREA_KLULA 414 51
LSRYHPNGLGIPPETESD TREA_YEAST 411 51
LSRYRPGGLGIPPETEAS TREA_SCHPO 390 49
LNRYHVPYGGPRPESYSK TREA_RABIT 275 49
LSCYHSDGIGIPPETEPD TREB_YEAST 440 51
LLRYYIPSAGPRPESYYE TREA_BOMMO 268 49
LNRYYVPYGGPRPESYSK TREA_HUMAN 275 49
LLRYYIPSAGPRPESYYE Q27453 268 49
LNRYWDDRDTPRPESWVE TREA_ECOLI 266 56
LYRFVVETSHPRPESYRE Q22195 245 47
LSRYYAMWNKPRPESSVF O22986 272 51
LNRYWDDRDTPRDESWLE TREF_ECOLI 281 55
LAQYNSNSGSPRPESYYE TREA_TENMO 271 49
VYQYKTASNVPRPESYRV Q23176 263 49
YYQYKAKLKVPRPESYRE O44736 254 51
GLHYDRLASE373 Length of motif = 18 Motif number = 3
Glycosyl hydrolase family 37 motif III - 2
PCODE ST INT
FLHDRAVRESGHDTSYRL TREB_MAGGR 441 37
FMHDRAVRESGHDTSYRL TREB_EMENI 451 37
FMHDRAVRESGHDTTYRF TREB_NEUCR 448 37
FLHDRAVRESGHDTSYRL TREA_CANAL 619 38
FLHDRGVRESGHDTTYRF TREA_KLULA 469 37
FLHDRGVRESGHDTTYRF TREA_YEAST 466 37
FVHDRAVRESGHDTTYRL TREA_SCHPO 445 37
WAELKAGAESGWDFSSRW TREA_RABIT 309 16
FLHDRAVRESGHDTTYRF TREB_YEAST 495 37
YADLKSAAESGWDFSTRW TREA_BOMMO 303 17
WAELKAGAESGWDFSSRW TREA_HUMAN 309 16
YADLKSAAESGWDFSTRW Q27453 303 17
YRDLRSAAASGWDFSSRW TREA_ECOLI 300 16
WGDLAAAAESGRDFSSRF Q22195 280 17
HRDIATAAESGCDFSTRW O22986 307 17
YRDLRAGAASGWDYSSRW TREF_ECOLI 314 15
YMDLKSAAESGWDFSSRW TREA_TENMO 306 17
YQDLASAAESGWDFSTRW Q23176 298 17
WSEIASAAETGWDFSTRW O44736 289 17
GLHYDRLASE374 Length of motif = 18 Motif number = 4
Glycosyl hydrolase family 37 motif IV - 2
PCODE ST INT
LATVDLNTLLFKYETDIA TREB_MAGGR 465 6
LATVDLNSLLYKYEVDIA TREB_EMENI 475 6
LATIDLNSLLFKYETDIA TREB_NEUCR 472 6
LATVDLNSLLYKYENDIA TREA_CANAL 643 6
LATIDLNSLLYKYEVDIA TREA_KLULA 493 6
LATIDLNSLLYKYEIDIA TREA_YEAST 490 6
LATVDLNSLLYKYETDIS TREA_SCHPO 469 6
LVPVDLNAFLCQAEELLS TREA_RABIT 344 17
LATIDLNSLLYKYEKDIA TREB_YEAST 519 6
VIPVDLNAIFAGALQITA TREA_BOMMO 340 19
LVPVDLNAFLCQAEELMS TREA_HUMAN 344 17
VIPVDLNAIFAGALQITA Q27453 340 19
IVPVDLNSLMFKMEKILA TREA_ECOLI 332 14
LIPVDLNSIICGNMKTLS Q22195 316 18
VVPVDLNVFLLKMELDIA O22986 339 14
FIPIDLNAFLFKLESAIA TREF_ECOLI 346 14
IIPVDLNAFLCQAFQKLS TREA_TENMO 343 19
VLPVDLNGLLCWNMDIME Q23176 330 14
IIPADLNAFMCANARILA O44736 328 21
GLHYDRLASE375 Length of motif = 17 Motif number = 5
Glycosyl hydrolase family 37 motif V - 2
PCODE ST INT
IDKYLWNEEAGMYFDYD TREB_MAGGR 526 43
MDTYLWDEEKGMYFDYD TREB_EMENI 536 43
IDKYLWNEEAGMYFDYN TREB_NEUCR 534 44
VDKYLWNEQDGIYYDYN TREA_CANAL 700 39
IDKYLWDEETGFYYDYN TREA_KLULA 545 34
ITKYMWDDESGFFFDYN TREA_YEAST 543 35
MEKYLWSEADSMWYDYN TREA_SCHPO 521 34
LTALLWDEDKGAWFDYD TREA_RABIT 387 25
INKYMWDEDSGFFFYYN TREB_YEAST 572 35
IEQALWDEEDGVWHDYD TREA_BOMMO 383 25
LNTVLWDEQTGAWFDYD TREA_HUMAN 387 25
IEQALWDEEDGVWHDYD Q27453 383 25
IEKYLWNDQQGWYADYD TREA_ECOLI 375 25
IRQVLWNEEHNCWFDFD Q22195 359 25
FQTVFWNEKAGQWLDYW O22986 382 25
VNRYLWDDENGIYRDYD TREF_ECOLI 389 25
IEMVHYNRDDGIWYDWD TREA_TENMO 386 25
VQNVFYNRTDGTWYDYN Q23176 373 25
MRELHWNETDGIWYDYD O44736 371 25
GLHYDRLASE376 Length of motif = 17 Motif number = 6
Glycosyl hydrolase family 37 motif VI - 2
PCODE ST INT
GLDRPNRQWDYPYGWAP TREB_MAGGR 599 56
GLNRPTRQWDYPYGWAP TREB_EMENI 609 56
GLERPNRQWDYPYGWAP TREB_NEUCR 607 56
GLTRPSRQWDYPFGWAP TREA_CANAL 773 56
SLDRPTRQWDYPFGWAP TREA_KLULA 618 56
SISRPIRQWDYPFGWAP TREA_YEAST 616 56
GLDNPSRQWDYPNGWSP TREA_SCHPO 594 56
SLQNTGQQWDFPNAWAP TREA_RABIT 454 50
SIDRPIRQWDYPFGWAP TREB_YEAST 645 56
SLVNSGEQWDFPNAWPP TREA_BOMMO 453 53
SLQKTGQQWDFPNAWAP TREA_HUMAN 454 50
SLINSGEQWDFPNAWPP Q27453 453 53
TSVKSGQQWDAPNGWAP TREA_ECOLI 439 47
SLVNSGEQWDFPNSWPP Q22195 426 50
SLTNSGQQWDSPNGWAP O22986 451 52
SEYETGEQWDKPNGWAP TREF_ECOLI 452 46
SLSHTGEQWDYPNAWPP TREA_TENMO 456 53
MSQESDQQWDFPNGWSP Q23176 443 53
LAMSSTQQWDKENAWPP O44736 440 52
GLHYDRLASE377 Length of motif = 14 Motif number = 7
Glycosyl hydrolase family 37 motif VII - 2
PCODE ST INT
EGFGWVNASYIYGL TREB_MAGGR 687 71
EGFGWVNASYVYGL TREB_EMENI 697 71
EGFGWVNASYVYGL TREB_NEUCR 695 71
EGFGWVNASYVFGL TREA_CANAL 861 71
EGFGWVNSSYLLGM TREA_KLULA 706 71
EGFGWVNASYILGL TREA_YEAST 704 71
EGFGWVNASYEVGL TREA_SCHPO 682 71
EGFGWTNGVALMLL TREA_RABIT 533 62
EGFGWVNTSYLLGL TREB_YEAST 733 71
DGFGWSNGVVLEFL TREA_BOMMO 533 63
EGFGWDEGVVLMLL TREA_HUMAN 534 63
DGFGWSNGVVLEFL Q27453 533 63
DGFGWTNGVTLKML TREA_ECOLI 516 60
EGFGWTNGVILDFL Q22195 505 62
TGFGWSNGVILAFL O22986 531 63
DGFGWTNGVVRRLI TREF_ECOLI 530 61
SGFGWTNGVVLEFI TREA_TENMO 536 63
DGFGWSNGAILDLL Q23176 524 64
TGFGWTNGVILDLL O44736 523 66
User query: Display/Full Code "GLHYDRLASE37"