WORKLIST ENTRIES (1):
CAULIMOPTASE View alignment Cauliflower mosaic virus peptidase (A3) signature
Type of fingerprint: COMPOUND with 4 elements
Links:
PRINTS; PR00792 PEPSIN; PR00977 SCYTLDPTASE; PR00863 NODAVIRPTASE
PRINTS; PR00781 LIPOSIGPTASE; PR00920 SPUMVIRPTASE
INTERPRO; IPR000588
Creation date 01-MAY-1997; UPDATE 14-JUN-1999
1. RALINGS, N.D. AND BARRETT, A.J.
Families of aspartic peptidases, and those of unknown catalytic mechanism.
METHODS ENZYMOL. 248 105-120 (1995).
Cauliflower mosaic viruses belong to a group of plant viruses known as
pararetroviruses, which have a double-stranded DNA genome [1]. The genome
includes an open reading frame (ORF V) that shows similarities to the pol
gene of retroviruses [1]. This ORF codes for a polyprotein that includes
a reverse transcriptase, which, on the basis of a DTG triplet near the
N-terminus, was suggested to include an aspartic protease [1].
The presence of an aspartic protease has been confirmed by mutational
studies, implicating Asp-45 in catalysis [1]. The protease releases itself
from the polyprotein and is involved in reactions required to process the
ORF IV polyprotein, which includes the viral coat protein gene [1].
CAULIMOPTASE is a 4-element fingerprint that provides a signature for the
cauliflower mosaic virus aspartate protease. The fingerprint was derived
from an initial alignment of 7 sequences: the motifs were drawn from
conserved regions within the gag/pol protease domain, motif 2 containing
the catalytic aspartate residue. Two iterations on OWL29.2 were required
to reach convergence, at which point a true set comprising 12 sequences
was identified. A single partial match was also found, PCU139881, a
fragment lacking the region of sequence bearing the first motif.
An update on SPTR37_9f identified a true set of 12 sequences.
SUMMARY INFORMATION
12 codes involving 4 elements
0 codes involving 3 elements
0 codes involving 2 elements
COMPOSITE FINGERPRINT INDEX
4| 12 12 12 12
3| 0 0 0 0
2| 0 0 0 0
--+---------------------
| 1 2 3 4
True positives..
POL_CAMVC Q83169 Q66162 POL_CAMVN
POL_CAMVE POL_CAMVS POL_CAMVD POL_FMVD
POL_CERV Q88442 POL_SOCMV Q84682
PROTEIN TITLES
POL_CAMVC ENZYMATIC POLYPROTEIN [CONTAINS: ASPARTIC PROTEASE (EC 3.4.2
Q83169 REVERSE TRANSCRIPTASE - CAULIFLOWER MOSAIC VIRUS.
Q66162 ORF V - CAULIFLOWER MOSAIC VIRUS.
POL_CAMVN ENZYMATIC POLYPROTEIN [CONTAINS: ASPARTIC PROTEASE (EC 3.4.2
POL_CAMVE ENZYMATIC POLYPROTEIN [CONTAINS: ASPARTIC PROTEASE (EC 3.4.2
POL_CAMVS ENZYMATIC POLYPROTEIN [CONTAINS: ASPARTIC PROTEASE (EC 3.4.2
POL_CAMVD ENZYMATIC POLYPROTEIN [CONTAINS: ASPARTIC PROTEASE (EC 3.4.2
POL_FMVD ENZYMATIC POLYPROTEIN [CONTAINS: ASPARTIC PROTEASE (EC 3.4.2
POL_CERV ENZYMATIC POLYPROTEIN [CONTAINS: ASPARTIC PROTEASE (EC 3.4.2
Q88442 COMPLETE GENOME - STRAWBERRY VEIN BANDING VIRUS.
POL_SOCMV ENZYMATIC POLYPROTEIN [CONTAINS: ASPARTIC PROTEASE (EC 3.4.2
Q84682 REVERSE TRANSCRIPTASE - PEANUT CHLOROTIC STREAK VIRUS.
SCAN HISTORY
OWL29_2 2 100 NSINGLE
SPTR37_9f 2 200 NSINGLE
INITIAL MOTIF SETS
CAULIMOPTASE1 Length of motif = 14 Motif number = 1
Cauliflower mosaic virus peptidase motif I - 1
PCODE ST INT
NVTNPNSIYIKGRL POL_CAMVC 17 17
NVTNPNSIYIKGRL POL_CAMVN 18 18
NVTNPNSIYIKGRL POL_CAMVS 17 17
NITNPNSIYIKGRL POL_CAMVD 19 19
NVTNPNSIYIEGKL POL_FMVD 26 26
NRTNPNSIYVKGIL POL_CERV 5 5
TKGNPNVTFIKVSI POL_SOCMV 13 13
CAULIMOPTASE2 Length of motif = 15 Motif number = 2
Cauliflower mosaic virus peptidase motif II - 1
PCODE ST INT
FVDTGASLCIASKFV POL_CAMVC 43 12
FVDTGASLCIASKFV POL_CAMVN 44 12
FVDTGASLCIASKFV POL_CAMVS 43 12
FVDTGASLCIASKFV POL_CAMVD 45 12
YVDTGASLCIASRYI POL_FMVD 52 12
YVDTGSSLCMASKYV POL_CERV 32 13
YIDTGATLCFGKRKI POL_SOCMV 34 7
CAULIMOPTASE3 Length of motif = 16 Motif number = 3
Cauliflower mosaic virus peptidase motif III - 1
PCODE ST INT
FKIPTVYQQESGIDFI POL_CAMVC 98 40
FKIPTVYQQESGIDFI POL_CAMVN 99 40
FRIPTVYQQESGIDFI POL_CAMVS 98 40
FHIPTVYQQESGIDFI POL_CAMVD 100 40
FEIPTVYQQETGIDFL POL_FMVD 107 40
FLIPTLFQQESGIDLL POL_CERV 87 40
FLIPIIYLHDSGLDLI POL_SOCMV 87 38
CAULIMOPTASE4 Length of motif = 15 Motif number = 4
Cauliflower mosaic virus peptidase motif IV - 1
PCODE ST INT
IGNNFCQLYEPFIQF POL_CAMVC 114 0
IGNNFCQLYEPFIQF POL_CAMVN 115 0
IGNNFCQLYEPFIQF POL_CAMVS 114 0
IGNNFCQLYEPFIQF POL_CAMVD 116 0
IGNNFCRLYNPFIQW POL_FMVD 123 0
LGNNFCQLYSPFIQY POL_CERV 103 0
IGNNFLKLYQPFIQR POL_SOCMV 103 0
FINAL MOTIF SETS
CAULIMOPTASE1 Length of motif = 14 Motif number = 1
Cauliflower mosaic virus peptidase motif I - 2
PCODE ST INT
NVTNPNSIYIKGRL POL_CAMVC 17 17
NVTNPNSIYIKGRL Q83169 18 18
NVTNPNSIYIKGRL Q66162 18 18
NVTNPNSIYIKGRL POL_CAMVN 18 18
NVTNPNSIYIKGRL POL_CAMVE 17 17
NVTNPNSIYIKGRL POL_CAMVS 17 17
NITNPNSIYIKGRL POL_CAMVD 19 19
NVTNPNSIYIEGKL POL_FMVD 26 26
NRTNPNSIYVKGIL POL_CERV 5 5
TKTNPNSIYIRGNF Q88442 51 51
TKGNPNVTFIKVSI POL_SOCMV 13 13
SSKNSSFIKVKLFN Q84682 2 2
CAULIMOPTASE2 Length of motif = 15 Motif number = 2
Cauliflower mosaic virus peptidase motif II - 2
PCODE ST INT
FVDTGASLCIASKFV POL_CAMVC 43 12
FVDTGASLCIASKFV Q83169 44 12
FVDTGASLCIASKFV Q66162 44 12
FVDTGASLCIASKFV POL_CAMVN 44 12
FVDTGASLCIASKFV POL_CAMVE 43 12
FVDTGASLCIASKFV POL_CAMVS 43 12
FVDTGASLCIASKFV POL_CAMVD 45 12
YVDTGASLCIASRYI POL_FMVD 52 12
YVDTGSSLCMASKYV POL_CERV 32 13
YVDTGASMCTANKHV Q88442 77 12
YIDTGATLCFGKRKI POL_SOCMV 34 7
YIDTGATICLAQAKI Q84682 21 5
CAULIMOPTASE3 Length of motif = 16 Motif number = 3
Cauliflower mosaic virus peptidase motif III - 2
PCODE ST INT
FKIPTVYQQESGIDFI POL_CAMVC 98 40
FKIPTVYQQESGIDFI Q83169 99 40
FKIPTVYQQESGIDFI Q66162 99 40
FKIPTVYQQESGIDFI POL_CAMVN 99 40
FKIPTVYQQESGIDFI POL_CAMVE 98 40
FRIPTVYQQESGIDFI POL_CAMVS 98 40
FHIPTVYQQESGIDFI POL_CAMVD 100 40
FEIPTVYQQETGIDFL POL_FMVD 107 40
FLIPTLFQQESGIDLL POL_CERV 87 40
FIIPTLYQATTKGDIT Q88442 132 40
FLIPIIYLHDSGLDLI POL_SOCMV 87 38
FPLPSVYQQDAGLPLI Q84682 76 40
CAULIMOPTASE4 Length of motif = 15 Motif number = 4
Cauliflower mosaic virus peptidase motif IV - 2
PCODE ST INT
IGNNFCQLYEPFIQF POL_CAMVC 114 0
IGNNFCQLYEPFIQF Q83169 115 0
IGNNFCQLYEPFIQF Q66162 115 0
IGNNFCQLYEPFIQF POL_CAMVN 115 0
IGNNFCQLYEPFIQF POL_CAMVE 114 0
IGNNFCQLYEPFIQF POL_CAMVS 114 0
IGNNFCQLYEPFIQF POL_CAMVD 116 0
IGNNFCRLYNPFIQW POL_FMVD 123 0
LGNNFCQLYSPFIQY POL_CERV 103 0
LGNNFCRLYEPFVQY Q88442 148 0
IGNNFLKLYQPFIQR POL_SOCMV 103 0
LGNNFLKLYNPFIQT Q84682 92 0
User query: Display/Full Code "CAULIMOPTASE"