WORKLIST ENTRIES (1):

CAULIMOPTASE View alignment     Cauliflower mosaic virus peptidase (A3) signature
 Type of fingerprint: COMPOUND with 4  elements
Links:
   PRINTS; PR00792 PEPSIN; PR00977 SCYTLDPTASE; PR00863 NODAVIRPTASE
   PRINTS; PR00781 LIPOSIGPTASE; PR00920 SPUMVIRPTASE
   INTERPRO; IPR000588

 Creation date 01-MAY-1997; UPDATE 14-JUN-1999

   1. RALINGS, N.D. AND BARRETT, A.J.
   Families of aspartic peptidases, and those of unknown catalytic mechanism.
   METHODS ENZYMOL. 248 105-120 (1995).

   Cauliflower mosaic viruses belong to a group of plant viruses known as
   pararetroviruses, which have a double-stranded DNA genome [1]. The genome
   includes an open reading frame (ORF V) that shows similarities to the pol
   gene of retroviruses [1]. This ORF codes for a polyprotein that includes
   a reverse transcriptase, which, on the basis of a DTG triplet near the
   N-terminus, was suggested to include an aspartic protease [1].
   
   The presence of an aspartic protease has been confirmed by mutational
   studies, implicating Asp-45 in catalysis [1]. The protease releases itself
   from the polyprotein and is involved in reactions required to process the
   ORF IV polyprotein, which includes the viral coat protein gene [1]. 
   
   CAULIMOPTASE is a 4-element fingerprint that provides a signature for the
   cauliflower mosaic virus aspartate protease. The fingerprint was derived
   from an initial alignment of 7 sequences: the motifs were drawn from
   conserved regions within the gag/pol protease domain, motif 2 containing 
   the catalytic aspartate residue. Two iterations on OWL29.2 were required
   to reach convergence, at which point a true set comprising 12 sequences
   was identified. A single partial match was also found, PCU139881, a
   fragment lacking the region of sequence bearing the first motif.
  
   An update on SPTR37_9f identified a true set of 12 sequences.

  SUMMARY INFORMATION
     12 codes involving  4 elements
      0 codes involving  3 elements
      0 codes involving  2 elements

   COMPOSITE FINGERPRINT INDEX
  
    4|  12   12   12   12  
    3|   0    0    0    0  
    2|   0    0    0    0  
   --+---------------------
     |   1    2    3    4  

True positives..
 POL_CAMVC      Q83169         Q66162         POL_CAMVN      
 POL_CAMVE      POL_CAMVS      POL_CAMVD      POL_FMVD       
 POL_CERV       Q88442         POL_SOCMV      Q84682         


  PROTEIN TITLES
   POL_CAMVC        ENZYMATIC POLYPROTEIN [CONTAINS: ASPARTIC PROTEASE (EC 3.4.2
   Q83169           REVERSE TRANSCRIPTASE - CAULIFLOWER MOSAIC VIRUS.
   Q66162           ORF V - CAULIFLOWER MOSAIC VIRUS.
   POL_CAMVN        ENZYMATIC POLYPROTEIN [CONTAINS: ASPARTIC PROTEASE (EC 3.4.2
   POL_CAMVE        ENZYMATIC POLYPROTEIN [CONTAINS: ASPARTIC PROTEASE (EC 3.4.2
   POL_CAMVS        ENZYMATIC POLYPROTEIN [CONTAINS: ASPARTIC PROTEASE (EC 3.4.2
   POL_CAMVD        ENZYMATIC POLYPROTEIN [CONTAINS: ASPARTIC PROTEASE (EC 3.4.2
   POL_FMVD         ENZYMATIC POLYPROTEIN [CONTAINS: ASPARTIC PROTEASE (EC 3.4.2
   POL_CERV         ENZYMATIC POLYPROTEIN [CONTAINS: ASPARTIC PROTEASE (EC 3.4.2
   Q88442           COMPLETE GENOME - STRAWBERRY VEIN BANDING VIRUS.
   POL_SOCMV        ENZYMATIC POLYPROTEIN [CONTAINS: ASPARTIC PROTEASE (EC 3.4.2
   Q84682           REVERSE TRANSCRIPTASE - PEANUT CHLOROTIC STREAK VIRUS.

SCAN HISTORY OWL29_2 2 100 NSINGLE SPTR37_9f 2 200 NSINGLE INITIAL MOTIF SETS CAULIMOPTASE1 Length of motif = 14 Motif number = 1 Cauliflower mosaic virus peptidase motif I - 1 PCODE ST INT NVTNPNSIYIKGRL POL_CAMVC 17 17 NVTNPNSIYIKGRL POL_CAMVN 18 18 NVTNPNSIYIKGRL POL_CAMVS 17 17 NITNPNSIYIKGRL POL_CAMVD 19 19 NVTNPNSIYIEGKL POL_FMVD 26 26 NRTNPNSIYVKGIL POL_CERV 5 5 TKGNPNVTFIKVSI POL_SOCMV 13 13 CAULIMOPTASE2 Length of motif = 15 Motif number = 2 Cauliflower mosaic virus peptidase motif II - 1 PCODE ST INT FVDTGASLCIASKFV POL_CAMVC 43 12 FVDTGASLCIASKFV POL_CAMVN 44 12 FVDTGASLCIASKFV POL_CAMVS 43 12 FVDTGASLCIASKFV POL_CAMVD 45 12 YVDTGASLCIASRYI POL_FMVD 52 12 YVDTGSSLCMASKYV POL_CERV 32 13 YIDTGATLCFGKRKI POL_SOCMV 34 7 CAULIMOPTASE3 Length of motif = 16 Motif number = 3 Cauliflower mosaic virus peptidase motif III - 1 PCODE ST INT FKIPTVYQQESGIDFI POL_CAMVC 98 40 FKIPTVYQQESGIDFI POL_CAMVN 99 40 FRIPTVYQQESGIDFI POL_CAMVS 98 40 FHIPTVYQQESGIDFI POL_CAMVD 100 40 FEIPTVYQQETGIDFL POL_FMVD 107 40 FLIPTLFQQESGIDLL POL_CERV 87 40 FLIPIIYLHDSGLDLI POL_SOCMV 87 38 CAULIMOPTASE4 Length of motif = 15 Motif number = 4 Cauliflower mosaic virus peptidase motif IV - 1 PCODE ST INT IGNNFCQLYEPFIQF POL_CAMVC 114 0 IGNNFCQLYEPFIQF POL_CAMVN 115 0 IGNNFCQLYEPFIQF POL_CAMVS 114 0 IGNNFCQLYEPFIQF POL_CAMVD 116 0 IGNNFCRLYNPFIQW POL_FMVD 123 0 LGNNFCQLYSPFIQY POL_CERV 103 0 IGNNFLKLYQPFIQR POL_SOCMV 103 0 FINAL MOTIF SETS CAULIMOPTASE1 Length of motif = 14 Motif number = 1 Cauliflower mosaic virus peptidase motif I - 2 PCODE ST INT NVTNPNSIYIKGRL POL_CAMVC 17 17 NVTNPNSIYIKGRL Q83169 18 18 NVTNPNSIYIKGRL Q66162 18 18 NVTNPNSIYIKGRL POL_CAMVN 18 18 NVTNPNSIYIKGRL POL_CAMVE 17 17 NVTNPNSIYIKGRL POL_CAMVS 17 17 NITNPNSIYIKGRL POL_CAMVD 19 19 NVTNPNSIYIEGKL POL_FMVD 26 26 NRTNPNSIYVKGIL POL_CERV 5 5 TKTNPNSIYIRGNF Q88442 51 51 TKGNPNVTFIKVSI POL_SOCMV 13 13 SSKNSSFIKVKLFN Q84682 2 2 CAULIMOPTASE2 Length of motif = 15 Motif number = 2 Cauliflower mosaic virus peptidase motif II - 2 PCODE ST INT FVDTGASLCIASKFV POL_CAMVC 43 12 FVDTGASLCIASKFV Q83169 44 12 FVDTGASLCIASKFV Q66162 44 12 FVDTGASLCIASKFV POL_CAMVN 44 12 FVDTGASLCIASKFV POL_CAMVE 43 12 FVDTGASLCIASKFV POL_CAMVS 43 12 FVDTGASLCIASKFV POL_CAMVD 45 12 YVDTGASLCIASRYI POL_FMVD 52 12 YVDTGSSLCMASKYV POL_CERV 32 13 YVDTGASMCTANKHV Q88442 77 12 YIDTGATLCFGKRKI POL_SOCMV 34 7 YIDTGATICLAQAKI Q84682 21 5 CAULIMOPTASE3 Length of motif = 16 Motif number = 3 Cauliflower mosaic virus peptidase motif III - 2 PCODE ST INT FKIPTVYQQESGIDFI POL_CAMVC 98 40 FKIPTVYQQESGIDFI Q83169 99 40 FKIPTVYQQESGIDFI Q66162 99 40 FKIPTVYQQESGIDFI POL_CAMVN 99 40 FKIPTVYQQESGIDFI POL_CAMVE 98 40 FRIPTVYQQESGIDFI POL_CAMVS 98 40 FHIPTVYQQESGIDFI POL_CAMVD 100 40 FEIPTVYQQETGIDFL POL_FMVD 107 40 FLIPTLFQQESGIDLL POL_CERV 87 40 FIIPTLYQATTKGDIT Q88442 132 40 FLIPIIYLHDSGLDLI POL_SOCMV 87 38 FPLPSVYQQDAGLPLI Q84682 76 40 CAULIMOPTASE4 Length of motif = 15 Motif number = 4 Cauliflower mosaic virus peptidase motif IV - 2 PCODE ST INT IGNNFCQLYEPFIQF POL_CAMVC 114 0 IGNNFCQLYEPFIQF Q83169 115 0 IGNNFCQLYEPFIQF Q66162 115 0 IGNNFCQLYEPFIQF POL_CAMVN 115 0 IGNNFCQLYEPFIQF POL_CAMVE 114 0 IGNNFCQLYEPFIQF POL_CAMVS 114 0 IGNNFCQLYEPFIQF POL_CAMVD 116 0 IGNNFCRLYNPFIQW POL_FMVD 123 0 LGNNFCQLYSPFIQY POL_CERV 103 0 LGNNFCRLYEPFVQY Q88442 148 0 IGNNFLKLYQPFIQR POL_SOCMV 103 0 LGNNFLKLYNPFIQT Q84682 92 0

User query: Display/Full Code "CAULIMOPTASE"