WORKLIST ENTRIES (1):
LEUZIPPRFOS View alignment View Structure Fos transforming protein signature
Type of fingerprint: COMPOUND with 5 elements
Links:
PRINTS; PR00041 LEUZIPPRCREB; PR00043 LEUZIPPRJUN; PR00044 LEUZIPPRMYC
INTERPRO; IPR000837
PROSITE; PS00036 FOS_JUN_BASIC; PS00029 LEUCINE_ZIPPER
Creation date 17-MAY-1993; UPDATE 10-JUN-1999
1. BOHMANN, D., BOS, T.J., ADMON, A., NISHIMURA, T., VOGT, P.K. AND
TIJAN, R.
Human proto-oncogene c-jun encodes a DNA-binding protein with structural
and functional properties of transcription factor AP-1.
SCIENCE 238 1386-1392 (1987).
2. COHEN, D.R. AND CURRAN, T.
Fra-1 - A serum-inducible, cellular imediate early gene that encodes a
fos-related antigen.
MOL.CELL BIOL. 8(5) 2063-2069 (1988).
3. VAN STRAATEN, F., MULLER, R., CURRAN, T., VAN BEVEREN, C. AND VERMA, I.
Complete nucleotide sequence of a human c-onc gene - deduced amino acid
sequence of the human c-fos protein.
PROC.NATL.ACAD.SCI.U.S.A. 80(11) 3183-3187 (1983).
Implicit in the growth regulatory functions of all proto-oncogenes is the
potential to induce abnormal cell growth [1] and cancer as a result of
alterations in gene expression. This may be a qualitative or quantitative
alteration, the viral oncogenes activating this potential by transducing a
truncated or mutated form of the protein product, or by increasing
transcription of the proto-oncogene by the integration of a viral promoter
and enhancer sequence in its vicinity.
Both the cellular and viral forms of the fos gene encode a phosphoprotein
that is located in the nucleus of cells, and forms a noncovalent complex
with several other proteins, a leucine zipper holding the dimer together.
The dimer is associated with chromatin and demonstrates specific and non-
specific DNA-binding properties [2], the DNA being bound by a highly basic
area in the protein sequence immediately preceding the zipper domain.
Expression of the fos gene is stimulated by mitogens, suggesting that the
gene product is involved in cell growth [3], and may act as a nuclear
signal in a more general sense.
The 'leucine zipper' is a structure that is believed to mediate the
function of several eukaryotic gene regulatory proteins. The zipper
consists of a periodic repetition of leucine residues at every seventh
position, and regions containing them appear to span 8 turns of alpha-
helix. The leucine side chains that extend from one helix interact with
those from a similar helix, hence facilitating dimerisation in the form
of a coiled-coil. Leucine zippers are present in many gene regulatory
proteins, including the CREB proteins, Jun/AP1 transcription factors,
fos oncogene and fos-related proteins, C-myc, L-myc and N-myc oncogenes,
and so on.
LEUZIPPRFOS is a 5-element fingerprint that provides a signature for the
leucine zipper and DNA-binding domains characteristic of the fos oncogenes
and fos-related proteins. The fingerprint was derived from an initial
alignment of 6 sequences: motifs 2 and 3 span the highly basic DNA-
binding domain, while motifs 4 and 5 encode the zipper region (cf.
PROSITE patterns FOS_JUN_BASIC (PS00036) and LEUCINE_ZIPPER (PS00029)).
Two iterations on OWL19.1 were required to reach convergence, at which
point a true set comprising 14 sequences was identified. Several partial
matches were also found: of those matching just 4 motifs, both are CREB
protein fragments that are highly similar to the DNA-binding and zipper
domains of the fos gene products; those matching just 2 or 3 motifs are
myosin heavy chains, which form coiled coils using a system similar to
leucine zippers.
An update on SPTR37_9f identified a true set of 24 sequences, and 6
partial matches.
SUMMARY INFORMATION
24 codes involving 5 elements
1 codes involving 4 elements
3 codes involving 3 elements
2 codes involving 2 elements
COMPOSITE FINGERPRINT INDEX
5| 24 24 24 24 24
4| 0 1 1 1 1
3| 0 3 3 3 0
2| 0 1 2 1 0
--+--------------------------
| 1 2 3 4 5
True positives..
FOS_HUMAN O88479 FOS_MOUSE FOS_RAT
FOS_CHICK FOS_AVINK FOSX_MSVFR O56223
Q62592 FOS_MSVFB FOS_FUGRU FRA2_HUMAN
FOS_CYPCA FOS_TETFL FRA2_CHICK Q91639
FRA2_MOUSE FOSB_MOUSE FOSB_HUMAN FRA2_RAT
FRA1_RAT FRA1_HUMAN O35285 FRA1_MOUSE
Subfamily: Codes involving 4 elements
Subfamily True positives..
Q62738
Subfamily: Codes involving 3 elements
Subfamily True positives..
Q62281 ATF3_RAT ATF3_MOUSE
Subfamily: Codes involving 2 elements
Subfamily True positives..
ATF3_HUMAN FRA_DROME
PROTEIN TITLES
FOS_HUMAN P55-C-FOS PROTO-ONCOGENE PROTEIN (G0S7 PROTEIN) - HOMO SAPIE
O88479 C-FOS PROTO-ONCOGENE PROTEIN - MESOCRICETUS AURATUS (GOLDEN
FOS_MOUSE P55-C-FOS PROTO-ONCOGENE PROTEIN - MUS MUSCULUS (MOUSE).
FOS_RAT P55-C-FOS PROTO-ONCOGENE PROTEIN - RATTUS NORVEGICUS (RAT).
FOS_CHICK P55-C-FOS PROTO-ONCOGENE PROTEIN - GALLUS GALLUS (CHICKEN).
FOS_AVINK P55-V-FOS TRANSFORMING PROTEIN - AVIAN RETROVIRUS NK24.
FOSX_MSVFR V-FOS/FOX TRANSFORMING PROTEIN - FBR MURINE OSTEOSARCOMA VIR
O56223 COMPLETE GENOME - MURINE OSTEOSARCOMA VIRUS.
Q62592 FBR-MURINE OSTEOSARCOMA PROVIRUS GENOME - RATTUS NORVEGICUS
FOS_MSVFB P55-V-FOS TRANSFORMING PROTEIN - FBJ MURINE OSTEOSARCOMA VIR
FOS_FUGRU P55-C-FOS PROTO-ONCOGENE PROTEIN - FUGU RUBRIPES (JAPANESE P
FRA2_HUMAN FOS-RELATED ANTIGEN 2 - HOMO SAPIENS (HUMAN).
FOS_CYPCA P55-C-FOS PROTO-ONCOGENE PROTEIN - CYPRINUS CARPIO (COMMON C
FOS_TETFL P55-C-FOS PROTO-ONCOGENE PROTEIN - TETRAODON FLUVIATILIS (PU
FRA2_CHICK FOS-RELATED ANTIGEN 2 - GALLUS GALLUS (CHICKEN).
Q91639 FOS-RELATED ANTIGEN-2 - XENOPUS LAEVIS (AFRICAN CLAWED FROG)
FRA2_MOUSE FOS-RELATED ANTIGEN 2 - MUS MUSCULUS (MOUSE).
FOSB_MOUSE FOSB PROTEIN - MUS MUSCULUS (MOUSE).
FOSB_HUMAN FOSB PROTEIN (G0/G1 SWITCH REGULATORY PROTEIN 3) - HOMO SAPI
FRA2_RAT FOS-RELATED ANTIGEN 2 - RATTUS NORVEGICUS (RAT).
FRA1_RAT FOS-RELATED ANTIGEN 1 - RATTUS NORVEGICUS (RAT).
FRA1_HUMAN FOS-RELATED ANTIGEN 1 - HOMO SAPIENS (HUMAN).
O35285 FOS-LIKE ANTIGEN 1 (FOS-RELATED ANTIGEN 1) - MUS MUSCULUS (M
FRA1_MOUSE FOS-RELATED ANTIGEN-1 - MUS MUSCULUS (MOUSE).
Q62738 FOS-RELATED ANTIGEN 2 - RATTUS NORVEGICUS (RAT).
Q62281 TI-241 - MUS MUSCULUS (MOUSE).
ATF3_RAT CYCLIC-AMP-DEPENDENT TRANSCRIPTION FACTOR ATF-3 (ACTIVATING
ATF3_MOUSE CYCLIC-AMP-DEPENDENT TRANSCRIPTION FACTOR ATF-3 (ACTIVATING
ATF3_HUMAN CYCLIC-AMP-DEPENDENT TRANSCRIPTION FACTOR ATF-3 (ACTIVATING
FRA_DROME TRANSCRIPTION FACTOR DFRA (FOS-RELATED ANTIGEN) (AP-1) (KAYA
SCAN HISTORY
OWL19_1 2 100 NSINGLE
OWL26_0 1 200 NSINGLE
SPTR37_9f 2 67 NSINGLE
INITIAL MOTIF SETS
LEUZIPPRFOS1 Length of motif = 18 Motif number = 1
FOS Transforming protein motif I - 1
PCODE ST INT
PTVTAISTSPDLQWLVQP FOS_AVINK 17 17
PTVTAISTSPDLQWLVQP FOS_HUMAN 62 62
PTETAISTSPDLQWLVQP FOSX_MSVFR 38 38
PTINAITTSQDLQWMVQP FRA2_CHICK 48 48
PSINAVSGSQELQWMVQP FRA1_RAT 41 41
PSINTMSGSQELQWMVQP FRA1_HUMAN 39 39
LEUZIPPRFOS2 Length of motif = 17 Motif number = 2
FOS Transforming protein motif II - 1
PCODE ST INT
EQLSPEEEEKRRIRRER FOS_AVINK 84 49
EQLSPEEEEKRRIRRER FOS_HUMAN 130 50
EQLSPEEEVKRRIRRER FOSX_MSVFR 106 50
EQLSPEEEEKRRIRRER FRA2_CHICK 117 51
EQISPEEEERRRVRRER FRA1_RAT 100 41
EQISPEEEERRRVRRER FRA1_HUMAN 98 41
LEUZIPPRFOS3 Length of motif = 17 Motif number = 3
FOS Transforming protein motif III - 1
PCODE ST INT
NKMAAAKCRNRRRELTD FOS_AVINK 101 0
NKMAAAKCRNRRRELTD FOS_HUMAN 147 0
NKMAAAKCRNRRRELTD FOSX_MSVFR 123 0
NKLAAAKCRNRRRELTE FRA2_CHICK 134 0
NKLAAAKCRNRRKELTD FRA1_RAT 117 0
NKLAAAKCRNRRKELTD FRA1_HUMAN 115 0
LEUZIPPRFOS4 Length of motif = 22 Motif number = 4
FOS Transforming protein motif IV - 1
PCODE ST INT
LQAETDQLEEEKSALQAEIANL FOS_AVINK 119 1
LQAETDQLEDEKSALQTEIANL FOS_HUMAN 165 1
LQAETDQLEDEKSALQTEIANL FOSX_MSVFR 141 1
LQAETEVLEEEKSVLQKEIAEL FRA2_CHICK 152 1
LQAETDKLEDEKSGLQREIEEL FRA1_RAT 135 1
LQAETDKLEDEKSGLQREIEEL FRA1_HUMAN 133 1
LEUZIPPRFOS5 Length of motif = 24 Motif number = 5
FOS Transforming protein motif V - 1
PCODE ST INT
LLKEKEKLEFILAAHRPACKMPEE FOS_AVINK 140 -1
LLKEKEKLEFILAAHRPACKIPDD FOS_HUMAN 186 -1
LLKEKEKLEFILAAHRPACKIPDD FOSX_MSVFR 162 -1
LQKEKEKLEFMLVAHSPVCKISPE FRA2_CHICK 173 -1
LQKQKERLELVLEAHRPICKIPEE FRA1_RAT 156 -1
LQKQKERLELVLEAHRPICKIPEG FRA1_HUMAN 154 -1
FINAL MOTIF SETS
LEUZIPPRFOS1 Length of motif = 18 Motif number = 1
FOS Transforming protein motif I - 2
PCODE ST INT
PTVTAISTSPDLQWLVQP FOS_HUMAN 62 62
PTVTAISTSPDLQWLVQP O88479 62 62
PTVTAISTSPDLQWLVQP FOS_MOUSE 62 62
PTVTAISTSPDLQWLVQP FOS_RAT 62 62
PTVTAISTSPDLQWLVQP FOS_AVINK 17 17
PTVTAISTSPDLQWLVQP FOS_CHICK 62 62
PTETAISTSPDLQWLVQP FOSX_MSVFR 38 38
PTETAISTSPDLQWLVQP O56223 347 347
PTETAISTSPDLQWLVQP Q62592 348 348
PTVTATSTSPDLQWLVQP FOS_MSVFB 62 62
PTVTAISTSPDLQWMVQP FOS_FUGRU 57 57
PTINAITTSQDLQWMVQP FRA2_HUMAN 49 49
PTVTAISSCPDLQWMVQP FOS_CYPCA 49 49
PTVTAISTSPDLQWMVQP FOS_TETFL 56 56
PTINAITTSQDLQWMVQP FRA2_CHICK 48 48
PTVNAITTSQDLQWMVQP Q91639 52 52
PTINAITTSQDLQWMVQP FRA2_MOUSE 49 49
PTVTAITTSQDLQWLVQP FOSB_HUMAN 56 56
PTVTAITTSQDLQWLVQP FOSB_MOUSE 56 56
TINAITTTSQDLQWMVQP FRA2_RAT 50 50
PSINAVSGSQELQWMVQP FRA1_RAT 41 41
PSINTMSGSQELQWMVQP FRA1_HUMAN 39 39
LVPSIDSSSQELHWMVQP O35285 39 39
FVPSIDSSSQELHWMVQP FRA1_MOUSE 39 39
LEUZIPPRFOS2 Length of motif = 17 Motif number = 2
FOS Transforming protein motif II - 2
PCODE ST INT
EQLSPEEEEKRRIRRER FOS_HUMAN 130 50
EQLSPEEEEKRRIRRER O88479 130 50
EQLSPEEEEKRRIRRER FOS_MOUSE 130 50
EQLSPEEEEKRRIRRER FOS_RAT 130 50
EQLSPEEEEKRRIRRER FOS_AVINK 84 49
EQLSPEEEEKRRIRRER FOS_CHICK 129 49
EQLSPEEEVKRRIRRER FOSX_MSVFR 106 50
EQLSPEEEVKRRIRRER O56223 415 50
EQLSPEEEVKRRIRRER Q62592 416 50
EQLSPEEEEKRRIRRER FOS_MSVFB 130 50
EQTTPEEEEKKRIRRER FOS_FUGRU 114 39
EQLSPEEEEKRRIRRER FRA2_HUMAN 117 50
EQLSPEEEEKKRVRRER FOS_CYPCA 106 39
EQTTPEEEEKKRIRRER FOS_TETFL 113 39
EQLSPEEEEKRRIRRER FRA2_CHICK 117 51
EQLSPEEEEKRRVRRER Q91639 121 51
EQLSPEEEEKRRIRRER FRA2_MOUSE 117 50
ETLTPEEEEKRRVRRER FOSB_HUMAN 148 74
ETLTPEEEEKRRVRRER FOSB_MOUSE 148 74
EQLSPEEEEKRRIRRER FRA2_RAT 118 50
EQISPEEEERRRVRRER FRA1_RAT 100 41
EQISPEEEERRRVRRER FRA1_HUMAN 98 41
EQISPEEEERRRVRRER O35285 98 41
EQISPEEEERRRVRRER FRA1_MOUSE 98 41
LEUZIPPRFOS3 Length of motif = 17 Motif number = 3
FOS Transforming protein motif III - 2
PCODE ST INT
NKMAAAKCRNRRRELTD FOS_HUMAN 147 0
NKMAAAKCRNRRRELTD O88479 147 0
NKMAAAKCRNRRRELTD FOS_MOUSE 147 0
NKMAAAKCRNRRRELTD FOS_RAT 147 0
NKMAAAKCRNRRRELTD FOS_AVINK 101 0
NKMAAAKCRNRRRELTD FOS_CHICK 146 0
NKMAAAKCRNRRRELTD FOSX_MSVFR 123 0
NKMAAAKCRNRRRELTD O56223 432 0
NKMAAAKCRNRRRELTD Q62592 433 0
NKMAAAKCRNRRRELTD FOS_MSVFB 147 0
NKQAAAKCRNRRRELTD FOS_FUGRU 131 0
NKLAAAKCRNRRRELTE FRA2_HUMAN 134 0
NKMAAAKCRNRRRELTD FOS_CYPCA 123 0
NKQAAAKCRNRRRELTD FOS_TETFL 130 0
NKLAAAKCRNRRRELTE FRA2_CHICK 134 0
NKLAAAKCRNRRRELTD Q91639 138 0
NKLAAAKCRNRRRELTE FRA2_MOUSE 134 0
NKLAAAKCRNRRRELTD FOSB_HUMAN 165 0
NKLAAAKCRNRRRELTD FOSB_MOUSE 165 0
NKLAAAKCRNRRRELTE FRA2_RAT 135 0
NKLAAAKCRNRRKELTD FRA1_RAT 117 0
NKLAAAKCRNRRKELTD FRA1_HUMAN 115 0
NKLAAAKCRNRRKELTD O35285 115 0
NKLAAAKCRNRRKELTD FRA1_MOUSE 115 0
LEUZIPPRFOS4 Length of motif = 22 Motif number = 4
FOS Transforming protein motif IV - 2
PCODE ST INT
LQAETDQLEDEKSALQTEIANL FOS_HUMAN 165 1
LQAETDQLEDEKSALQTEIANL O88479 165 1
LQAETDQLEDEKSALQTEIANL FOS_MOUSE 165 1
LQAETDQLEDEKSALQTEIANL FOS_RAT 165 1
LQAETDQLEEEKSALQAEIANL FOS_AVINK 119 1
LQAETDQLEEEKSALQAEIANL FOS_CHICK 164 1
LQAETDQLEDEKSALQTEIANL FOSX_MSVFR 141 1
LQAETDQLEDEKSALQTEIANL O56223 450 1
LQAETDQLEDEKSALQTEIANL Q62592 451 1
LQAETDQLEDKKSALQTEIANL FOS_MSVFB 165 1
LQAETDQLEDEKSSLQNDIANL FOS_FUGRU 149 1
LQAETEELEEEKSGLQKEIAEL FRA2_HUMAN 152 1
LQAETDELEDEKSALQNDIANL FOS_CYPCA 141 1
LQAETDQLEAEKSSLQNDIANL FOS_TETFL 148 1
LQAETEVLEEEKSVLQKEIAEL FRA2_CHICK 152 1
LQAETEKLEQEKSGLQKEIADL Q91639 156 1
LQAETEELEEEKSGLQKEIAEL FRA2_MOUSE 152 1
LQAETDQLEEEKAELESEIAEL FOSB_HUMAN 183 1
LQAETDQLEEEKAELESEIAEL FOSB_MOUSE 183 1
LQTETEELEEEKSGLQKEIAEL FRA2_RAT 153 1
LQAETDKLEDEKSGLQREIEEL FRA1_RAT 135 1
LQAETDKLEDEKSGLQREIEEL FRA1_HUMAN 133 1
LQAETDKLEDEKSGLQREIEEL O35285 133 1
LQAETDKLEDEKSGLQREIEEL FRA1_MOUSE 133 1
LEUZIPPRFOS5 Length of motif = 24 Motif number = 5
FOS Transforming protein motif V - 2
PCODE ST INT
LLKEKEKLEFILAAHRPACKIPDD FOS_HUMAN 186 -1
LLKEKEKLEFILAAHRPACKIPDD O88479 186 -1
LLKEKEKLEFILAAHRPACKIPDD FOS_MOUSE 186 -1
LLKEKEKLEFILAAHRPACKIPND FOS_RAT 186 -1
LLKEKEKLEFILAAHRPACKMPEE FOS_AVINK 140 -1
LLKEKEKLEFILAAHRPACKMPEE FOS_CHICK 185 -1
LLKEKEKLEFILAAHRPACKIPDD FOSX_MSVFR 162 -1
LLKEKEKLEFILAAHRPACKIPDD O56223 471 -1
LLKEKEKLEFILAAHRPACKIPDD Q62592 472 -1
LLKEKEKLEFILAAHRPACKIPDD FOS_MSVFB 186 -1
LLKEKERLEFILAAHQPICKIPSQ FOS_FUGRU 170 -1
LQKEKEKLEFMLVAHGPVCKISPE FRA2_HUMAN 173 -1
LLKEKERLEFILAAHKPICKIPSS FOS_CYPCA 162 -1
LLKEKERLEFILAAHQPICKIPSQ FOS_TETFL 169 -1
LQKEKEKLEFMLVAHSPVCKISPE FRA2_CHICK 173 -1
LQKEKDKLEFMLVAHSPVCKISTD Q91639 177 -1
LQKEKEKLEFMKVAHGPVCKISPE FRA2_MOUSE 173 -1
LQKEKERLEFVLVAHKPGCKIPYE FOSB_HUMAN 204 -1
LQKEKERLEFVLVAHKPGCKIPYE FOSB_MOUSE 204 -1
LQKEKEKLEFMLVAHGPVCKISPE FRA2_RAT 174 -1
LQKQKERLELVLEAHRPICKIPEE FRA1_RAT 156 -1
LQKQKERLELVLEAHRPICKIPEG FRA1_HUMAN 154 -1
LQKQKERLELVLEAHRPICKIPEG O35285 154 -1
LQKQKERLELVLEAHRLICKIPEG FRA1_MOUSE 154 -1
User query: Display/Full Code "LEUZIPPRFOS"