WORKLIST ENTRIES (1):
GLHYDRLASE56 View alignment View Structure Glycosyl hydrolase family 56 signature
Type of fingerprint: COMPOUND with 6 elements
Links:
PRINTS; PR00131 GLHYDRLASE1; PR00132 GLHYDRLASE2; PR00133 GLHYDRLASE3
PRINTS; PR00732 GLHYDRLASE4; PR00733 GLHYDRLASE6; PR00734 GLHYDRLASE7
PRINTS; PR00735 GLHYDRLASE8; PR00134 GLHYDRLASE10; PR00911 GLHYDRLASE11
PRINTS; PR00736 GLHYDRLASE15; PR00737 GLHYDRLASE16; PR00738 GLHYDRLASE20
PRINTS; PR00739 GLHYDRLASE26; PR00740 GLHYDRLASE27; PR00741 GLHYDRLASE29
PRINTS; PR00843 GLHYDRLASE30; PR00742 GLHYDRLASE35; PR00743 GLHYDRLASE36
PRINTS; PR00744 GLHYDRLASE37; PR00745 GLHYDRLASE39; PR00746 GLHYDRLASE41
PRINTS; PR00747 GLHYDRLASE47; PR00844 GLHYDRLASE48; PR00845 GLHYDRLASE52
PRINTS; PR00849 GLHYDRLASE58; PR00850 GLHYDRLASE59; PR00748 MELIBIASE
PRINTS; PR00137 LYSOZYME; PR00684 T4LYSOZYME; PR00749 LYSOZYMEG
PRINTS; PR00110 ALPHAAMYLASE; PR00750 BETAAMYLASE
PRINTS; PR00847 HYALURONDASE; PR00848 SPERMPH20
INTERPRO; IPR001968
Creation date 15-FEB-1998; UPDATE 07-JUN-1999
1. HENRISSAT, B.
A classification of glycosyl hydrolases based on amino acid sequence
similarities.
BIOCHEM.J. 280 309-316 (1991).
2. HENRISSAT, B. AND BAIROCH, A.
New families in the classification of glycosyl hydrolases based on amino
acid sequence similarities.
BIOCHEM.J. 293 781-788 (1993).
3. HENRISSAT, B. AND BAIROCH, A.
Updating the sequence-based classification of glycosyl hydrolases.
BIOCHEM.J. 316 695-696 (1996).
4. EL HASSOUNI, M., HENRISSAT, B., CHIPPAUX, M. AND BARRAS, F.
Nucleotide sequences of the Arb genes, which control beta-glucosidase
utilisation in Erwinia chrysanthemi - Comparison with the Escherichia
coli Bgl operon and evidence for a new beta-glycohydrolase family
including enzymes from eubacteria, archaebacteria and humans.
J.BACTERIOL. 174 765-777 (1992).
5. GMACHL, M. AND KREIL, G.
Bee venom hyaluronidase is homologous to a membrane protein of mammalian
sperm.
PROC.NATL.ACAD.SCI.U.S.A. 90 3569-3573 (1993).
6. LATHROP W.F., CARMICHAEL E.P., MYLES D.G., PRIMAKOFF P.
cDNA cloning reveals the molecular structure of a sperm surface protein,
PH-20, involved in sperm-egg adhesion and the wide distribution of its
gene among mammals.
J.CELL BIOL. 111 2939-2949 (1990).
O-Glycosyl hydrolases (EC 3.2.1.-) are a widespread group of enzymes that
hydrolyse the glycosidic bond between two or more carbohydrates, or between
a carbohydrate and a non-carbohydrate moiety. A classification system for
glycosyl hydrolases, based on sequence similarity, has led to the definition
of up to 60 different families [1-4] (http://expasy.hcuge.ch/cgi-bin/lists?
glycosid.txt). Family 56 encompasses a group of hyaluronidases (EC 3.2.1.35)
that includes venom hyaluronidases [5] and mammalian sperm surface proteins
(PH-20) [6].
The venom of honeybees contains several biologically-active peptides and
two enzymes, one of which is a hyaluronidase [5]. The amino acid sequence
of bee venom hyaluronidase contains 349 amino acids, and includes four
cysteines and a number of potential glycosylation sites [5]. The sequence
shows a high degree of similarity to PH-20, a membrane protein of mammalian
sperm involved in sperm-egg adhesion, supporting the view that hyaluronidases
play a role in fertilisation [5].
PH-20 is required for sperm adhesion to the egg zona pellucida; it is
located on both the sperm plasma membrane and acrosomal membrane [6]. The
amino acid sequence of the mature protein contains 468 amino acids, and
includes six potential N-linked glycosylation sites and twelve cysteines,
eight of which are tightly clustered near the C-terminus [6].
GLHYDRLASE56 is a 6-element fingerprint that provides a signature for
family 56 glycosyl hydrolases. The fingerprint was derived from an initial
alignment of 13 sequences: the motifs were drawn from conserved regions
spanning virtually the full alignment length. Two iterations on OWL30.0
were required to reach convergence, at which point a true set comprising
18 sequences was identified.
An update on SPTR37_9f identified a true set of 20 sequences.
SUMMARY INFORMATION
20 codes involving 6 elements
0 codes involving 5 elements
0 codes involving 4 elements
0 codes involving 3 elements
0 codes involving 2 elements
COMPOSITE FINGERPRINT INDEX
6| 20 20 20 20 20 20
5| 0 0 0 0 0 0
4| 0 0 0 0 0 0
3| 0 0 0 0 0 0
2| 0 0 0 0 0 0
--+-------------------------------
| 1 2 3 4 5 6
True positives..
O35632 O35631 HYA1_RABIT HYA1_MACFA
HYA1_HUMAN O15177 Q12891 Q62803
Q93013 Q12794 Q29152 HYA1_MOUSE
HYA1_CAVPO O70229 O60540 O43820
Q22675 HUGA_DOLMA HUGA_VESVU HUGA_APIME
PROTEIN TITLES
O35632 HYALURONIDASE - MUS MUSCULUS (MOUSE).
O35631 HYALURONIDASE - MUS MUSCULUS (MOUSE).
HYA1_RABIT HYALURONIDASE PRECURSOR (EC 3.2.1.35) (SPERM SURFACE PROTEIN
HYA1_MACFA HYALURONIDASE PRECURSOR (EC 3.2.1.35) (SPERM SURFACE PROTEIN
HYA1_HUMAN HYALURONIDASE PRECURSOR (EC 3.2.1.35) (SPERM SURFACE PROTEIN
O15177 LYSOSOMAL HYALURONIDASE - HOMO SAPIENS (HUMAN).
Q12891 LYSOSOMAL HYALURONIDASE (PH-20 HOMOLOG) (LUCA-2) (LUCA2) (PH
Q62803 PROBABLE HYALURONIDASE PRECURSOR (EC 3.2.1.35) (SPERM SURFAC
Q93013 HYALURONOGLUCOSAMINIDASE 1 (LUCA-1) (TUMOR SUPPRESSOR LUCA-1
Q12794 TUMOR SUPPRESSOR (LUCA-1) - HOMO SAPIENS (HUMAN).
Q29152 PROBABLE HYALURONIDASE PRECURSOR (EC 3.2.1.35) (SPERM SURFAC
HYA1_MOUSE HYALURONIDASE PRECURSOR (EC 3.2.1.35) (SPERM SURFACE PROTEIN
HYA1_CAVPO HYALURONIDASE PRECURSOR (EC 3.2.1.35) (SPERM SURFACE PROTEIN
O70229 HYALURONIDASE 1 (HYALURONOGLUCOSAMINIDASE 1) - MUS MUSCULUS
O60540 HYALURONIDASE - HOMO SAPIENS (HUMAN).
O43820 SIMILAR TO HYALURONOGLUCOSAMINIDASE - HOMO SAPIENS (HUMAN).
Q22675 T22C8.2 PROTEIN - CAENORHABDITIS ELEGANS.
HUGA_DOLMA HYALURONOGLUCOSAMINIDASE (EC 3.2.1.35) (HYALURONIDASE) (ALLE
HUGA_VESVU HYALURONOGLUCOSAMINIDASE (EC 3.2.1.35) (HYALURONIDASE) (ALLE
HUGA_APIME HYALURONOGLUCOSAMINIDASE PRECURSOR (EC 3.2.1.35) (HYALURONID
SCAN HISTORY
OWL30_0 2 40 NSINGLE
SPTR37_9f 2 21 NSINGLE
INITIAL MOTIF SETS
GLHYDRLASE561 Length of motif = 13 Motif number = 1
Glycosyl hydrolase family 56 motif I - 1
PCODE ST INT
FLWAWNAPTEFCL HYA1_RABIT 49 49
FVVAWNVPTQECA MMAJ0059 36 36
FLWIWNVPTERCV HYA1_MOUSE 49 49
FVVAWDVPTQDCG AC002455 36 36
FNVYWNVPTFMCH HUGA_APIME 43 43
FLWAWNAPSEFCL HYA1_HUMAN 49 49
FLWAWNAPSEFCL HYA1_MACFA 49 49
FNIYWNVPTFMCH HUGA_DOLMA 8 8
FNIYWNVPTFMCH HUGA_VESVU 8 8
LLWVWNAPTEFCI HYA1_CAVPO 48 48
FTTVWNANTQWCL JC5584 32 32
FTTVWNANTQWCL HSU03056 32 32
TDVVWMVPSWTCK CET22C81 36 36
GLHYDRLASE562 Length of motif = 15 Motif number = 2
Glycosyl hydrolase family 56 motif II - 1
PCODE ST INT
VHGRIPQLGPLQQHL HYA1_RABIT 110 48
VHGGVPQNGSLCAHL MMAJ0059 96 47
HYGGIPQRGDYQAHL HYA1_MOUSE 109 47
VHGGVPQNVSLWAHR AC002455 96 47
RNGGVPQLGNLTKHL HUGA_APIME 106 50
VNGGIPQKISLQDHL HYA1_HUMAN 110 48
VHGGIPQKVSLQDHL HYA1_MACFA 110 48
RNGGVPQEGNITIHL HUGA_DOLMA 70 49
RNGGVPQEGNITIHL HUGA_VESVU 70 49
VHGGLPQLMNLQQHL HYA1_CAVPO 109 48
VFGGLPQNASLIAHL JC5584 92 47
VFGGLPQNASLIAHL HSU03056 92 47
KNGGLPQMGDLEAHL CET22C81 97 48
GLHYDRLASE563 Length of motif = 18 Motif number = 3
Glycosyl hydrolase family 56 motif III - 1
PCODE ST INT
GLAVIDWEEWLPTWLRNW HYA1_RABIT 141 16
GLAVIDWEEWRPVWVRNW MMAJ0059 128 17
GLAIIDWEEWRPTWLRNW HYA1_MOUSE 140 16
GLAVIDWEDWRPVWVRNW AC002455 128 17
GVGVIDFESWRPIFRQNW HUGA_APIME 138 17
GMAVIDWEEWRPTWARNW HYA1_HUMAN 141 16
GMAVIDWEEWRPTWARNW HYA1_MACFA 141 16
GIGVIDFERWRPIFRQNW HUGA_DOLMA 102 17
GIGVIDFERWRPIFRQNW HUGA_VESVU 102 17
GLAVIDWEEWRPTWTRNW HYA1_CAVPO 140 16
GLAVIDWEAWRPRWAFNW JC5584 124 17
GLAVIDWEAWRPRWAFNW HSU03056 124 17
GIAVIDIEEFRPMWELSW CET22C81 129 17
GLHYDRLASE564 Length of motif = 26 Motif number = 4
Glycosyl hydrolase family 56 motif IV - 1
PCODE ST INT
FMEETLKLGRLLRPNHLWGYYLFPDC HYA1_RABIT 199 40
FMLNTLRLRQGSQTQHLWGFYLFPDC MMAJ0059 186 40
FMEGTLHLGKFLRPNQLWGYYLFPDC HYA1_MOUSE 198 40
FMLETLRYVKAVRPRHLWGFYLFPDC AC002455 186 40
FMEETLKAAKRMRPAANWGYYAYPYC HUGA_APIME 196 40
FLVETIKLGKLLRPNHLWGYYLFPDC HYA1_HUMAN 199 40
FMLETIKLGRSLRPNHLWGYYLFPDC HYA1_MACFA 199 40
FMEETLKLAKKTRKQADWGYYGYPYC HUGA_DOLMA 160 40
FMEETLKLAKKTRKQADWGYYGYPYC HUGA_VESVU 160 40
FMLETLKLGKSLRPSSLWGYYLFPDC HYA1_CAVPO 198 40
WMAGTLQLGRALRPRGLWGFYGFPDC JC5584 182 40
WMAGTLQLGGALRPRGLWGFYGFPDC HSU03056 182 40
FFIETLRLGKRLRPNAKWGYYLFPKC CET22C81 187 40
GLHYDRLASE565 Length of motif = 15 Motif number = 5
Glycosyl hydrolase family 56 motif V - 1
PCODE ST INT
NDDLSWLWKESTALF HYA1_RABIT 247 22
NDQLAWLWAESTALF MMAJ0059 235 23
NDNLKWLWKASTGLY HYA1_MOUSE 245 21
NDQLAWLWAESTALF AC002455 235 23
NDKMSWLFESEDVLL HUGA_APIME 241 19
NDDLSWLWNESTALY HYA1_HUMAN 246 21
NDDLSWLWNESTALY HYA1_MACFA 246 21
NDKMSWLFNNQNVLL HUGA_DOLMA 205 19
NDKMSWLFNNQNVLL HUGA_VESVU 205 19
NNDLQWLWNDSTALY HYA1_CAVPO 245 21
NDQLGWLWGQSRALY JC5584 229 21
NDQLGWLWGQSRALY HSU03056 229 21
NDNLHWLWGESTALF CET22C81 232 19
GLHYDRLASE566 Length of motif = 14 Motif number = 6
Glycosyl hydrolase family 56 motif VI - 1
PCODE ST INT
CLHLDNYMKTILNP HYA1_RABIT 355 93
CQYLKNYLTQLLVP MMAJ0059 339 89
CPILHKYMQTTLNP HYA1_MOUSE 351 91
CQYLKDYLTRLLVP AC002455 340 90
CLQFREYLNNELGP HUGA_APIME 345 89
CLLLDNYMETILNP HYA1_HUMAN 351 90
CLLLDTYMETILNP HYA1_MACFA 351 90
CKRLREYLLTVLGP HUGA_DOLMA 308 88
CKRLQDYLLTVLGP HUGA_VESVU 308 88
CIGLENYMKGTLLP HYA1_CAVPO 351 91
CQAIKEYMDTTLGP JC5584 333 89
CQAIKEYMDTTLGP HSU03056 333 89
CGSLQTYVDNTLGP CET22C81 337 90
FINAL MOTIF SETS
GLHYDRLASE561 Length of motif = 13 Motif number = 1
Glycosyl hydrolase family 56 motif I - 2
PCODE ST INT
FVVAWNVPTQECA O35631 36 36
FVVAWNVPTQECA O35632 36 36
FLWAWNAPTEFCL HYA1_RABIT 49 49
FLWAWNAPSEFCL HYA1_MACFA 49 49
FLWAWNAPSEFCL HYA1_HUMAN 49 49
FVVAWDVPTQDCG O15177 36 36
FVVAWDVPTQDCG Q12891 36 36
FVWVWNVPTEACV Q62803 49 49
FTTVWNANTQWCL Q93013 32 32
FTTVWNANTQWCL Q12794 32 32
FLWGWNAPTELCA Q29152 49 49
FLWIWNVPTERCV HYA1_MOUSE 49 49
LLWVWNAPTEFCI HYA1_CAVPO 48 48
FITVWNGDTHWCL O70229 60 60
FSVLWNVPSAHCE O43820 31 31
FSVLWNVPSAHCE O60540 31 31
TDVVWMVPSWTCK Q22675 36 36
FNIYWNVPTFMCH HUGA_DOLMA 8 8
FNIYWNVPTFMCH HUGA_VESVU 8 8
FNVYWNVPTFMCH HUGA_APIME 43 43
GLHYDRLASE562 Length of motif = 15 Motif number = 2
Glycosyl hydrolase family 56 motif II - 2
PCODE ST INT
VHGGVPQNGSLCAHL O35631 96 47
VHGGVPQNGSLCAHL O35632 96 47
VHGRIPQLGPLQQHL HYA1_RABIT 110 48
VHGGIPQKVSLQDHL HYA1_MACFA 110 48
VNGGIPQKISLQDHL HYA1_HUMAN 110 48
VHGGVPQNVSLWAHR O15177 96 47
VHGGVPQNVSLWAHR Q12891 96 47
HHGGIPQKGDLTTHL Q62803 109 47
VFGGLPQNASLIAHL Q93013 92 47
VFGGLPQNASLIAHL Q12794 92 47
VNGGIPQLGSLKKHL Q29152 110 48
HYGGIPQRGDYQAHL HYA1_MOUSE 109 47
VHGGLPQLMNLQQHL HYA1_CAVPO 109 48
VFGGLPQNASLVTHL O70229 120 47
HNGGIPQALPLDRHL O43820 91 47
HNGGIPQALPLDRHL O60540 91 47
KNGGLPQMGDLEAHL Q22675 97 48
RNGGVPQEGNITIHL HUGA_DOLMA 70 49
RNGGVPQEGNITIHL HUGA_VESVU 70 49
RNGGVPQLGNLTKHL HUGA_APIME 106 50
GLHYDRLASE563 Length of motif = 18 Motif number = 3
Glycosyl hydrolase family 56 motif III - 2
PCODE ST INT
GLAVIDWEEWRPVWVRNW O35631 128 17
GLAVIDWEEWRPVWVRNW O35632 128 17
GLAVIDWEEWLPTWLRNW HYA1_RABIT 141 16
GMAVIDWEEWRPTWARNW HYA1_MACFA 141 16
GMAVIDWEEWRPTWARNW HYA1_HUMAN 141 16
GLAVIDWEDWRPVWVRNW O15177 128 17
GLAVIDWEDWRPVWVRNW Q12891 128 17
GLAIIDWEEWRPTWMRNW Q62803 140 16
GLAVIDWEAWRPRWAFNW Q93013 124 17
GLAVIDWEAWRPRWAFNW Q12794 124 17
GLAVIDWDSWRPNWARNW Q29152 141 16
GLAIIDWEEWRPTWLRNW HYA1_MOUSE 140 16
GLAVIDWEEWRPTWTRNW HYA1_CAVPO 140 16
GLAVIDWEAWRPRWAFNW O70229 152 17
GPAVLDWEEWCPLWAGNW O43820 122 16
GPAVLDWEEWCPLWAGNW O60540 122 16
GIAVIDIEEFRPMWELSW Q22675 129 17
GIGVIDFERWRPIFRQNW HUGA_DOLMA 102 17
GIGVIDFERWRPIFRQNW HUGA_VESVU 102 17
GVGVIDFESWRPIFRQNW HUGA_APIME 138 17
GLHYDRLASE564 Length of motif = 26 Motif number = 4
Glycosyl hydrolase family 56 motif IV - 2
PCODE ST INT
FMLNTLRLRQGSQTQHLWGFYLFPDC O35631 186 40
FMLNTLRLRQGSQTQHLWGFYLFPDC O35632 186 40
FMEETLKLGRLLRPNHLWGYYLFPDC HYA1_RABIT 199 40
FMLETIKLGRSLRPNHLWGYYLFPDC HYA1_MACFA 199 40
FLVETIKLGKLLRPNHLWGYYLFPDC HYA1_HUMAN 199 40
FMLETLRYVKAVRPRHLWGFYLFPDC O15177 186 40
FMLETLRYVKAVRPRHLWGFYLFPDC Q12891 186 40
FMEGTLKLGKHIRPKHLWGFYLFPDC Q62803 198 40
WMAGTLQLGRALRPRGLWGFYGFPDC Q93013 182 40
WMAGTLQLGGALRPRGLWGFYGFPDC Q12794 182 40
FMQETLKLGKFLRPNYLWGFYLYPDC Q29152 199 40
FMEGTLHLGKFLRPNQLWGYYLFPDC HYA1_MOUSE 198 40
FMLETLKLGKSLRPSSLWGYYLFPDC HYA1_CAVPO 198 40
WMAGTLQLGQVLRPRGLWGYYGFPDC O70229 210 40
LMEDTLRVAQALRPHGLWGFYHYPAC O43820 180 40
LMEDTLRVAQALRPHGLWGFYHYPAC O60540 180 40
FFIETLRLGKRLRPNAKWGYYLFPKC Q22675 187 40
FMEETLKLAKKTRKQADWGYYGYPYC HUGA_DOLMA 160 40
FMEETLKLAKKTRKQADWGYYGYPYC HUGA_VESVU 160 40
FMEETLKAAKRMRPAANWGYYAYPYC HUGA_APIME 196 40
GLHYDRLASE565 Length of motif = 15 Motif number = 5
Glycosyl hydrolase family 56 motif V - 2
PCODE ST INT
NDQLAWLWAESTALF O35631 235 23
NDQLAWLWAESTALF O35632 235 23
NDDLSWLWKESTALF HYA1_RABIT 247 22
NDDLSWLWNESTALY HYA1_MACFA 246 21
NDDLSWLWNESTALY HYA1_HUMAN 246 21
NDQLAWLWAESTALF O15177 235 23
NDQLAWLWAESTALF Q12891 235 23
NDDLDWLWKESTGLY Q62803 245 21
NDQLGWLWGQSRALY Q93013 229 21
NDQLGWLWGQSRALY Q12794 229 21
NDEIDWLWKESTALF Q29152 246 21
NDNLKWLWKASTGLY HYA1_MOUSE 245 21
NNDLQWLWNDSTALY HYA1_CAVPO 245 21
NDQLGWLWNQSYALY O70229 257 21
NTQLHWLWAASSALF O43820 228 22
NTQLHWLWAASSALF O60540 228 22
NDNLHWLWGESTALF Q22675 232 19
NDKMSWLFNNQNVLL HUGA_DOLMA 205 19
NDKMSWLFNNQNVLL HUGA_VESVU 205 19
NDKMSWLFESEDVLL HUGA_APIME 241 19
GLHYDRLASE566 Length of motif = 14 Motif number = 6
Glycosyl hydrolase family 56 motif VI - 2
PCODE ST INT
CQYLKNYLTQLLVP O35631 339 89
CQYLKNYLTQLLVP O35632 340 90
CLHLDNYMKTILNP HYA1_RABIT 355 93
CLLLDTYMETILNP HYA1_MACFA 351 90
CLLLDNYMETILNP HYA1_HUMAN 351 90
CQYLKDYLTRLLVP O15177 339 89
CQYLKDYLTRLLVP Q12891 340 90
CPILRQYMKTTLNP Q62803 351 91
CQAIKEYMDTTLGP Q93013 333 89
CQAIKEYMDTTLGP Q12794 333 89
CTELDTYIKNKLNP Q29152 352 91
CPILHKYMQTTLNP HYA1_MOUSE 351 91
CIGLENYMKGTLLP HYA1_CAVPO 351 91
CQAIKAYMDSTLGP O70229 361 89
CWHLHDYLVDTLGP O43820 331 88
CWHLHDYLVDTLGP O60540 331 88
CGSLQTYVDNTLGP Q22675 337 90
CKRLREYLLTVLGP HUGA_DOLMA 308 88
CKRLQDYLLTVLGP HUGA_VESVU 308 88
CLQFREYLNNELGP HUGA_APIME 345 89
User query: Display/Full Code "GLHYDRLASE56"