
==SPRINT==> PRINTS View




Identifier
ARSENICPUMP
Accession
PR00758
No. of Motifs
8
Creation Date
23-AUG-1997  (UPDATE 10-JUN-1999)
Title
Arsenical pump membrane protein signature
Database References

INTERPRO; IPR000802
Literature References
1. DIORIO C., CAI J., MARMOR J., SHINDER R., DUBOW M.S.
An Escherichia coli chromosomal ars operon homolog is functional in
arsenic detoxification and is conserved in Gram-negative bacteria.
J.BACTERIOL. 177(8) 2050-2056 (1995).
 
2. TISA, L.S. AND ROSEN, B.P.
Molecular characterization of an anion pump. The ArsB protein is the
membrane anchor for the ArsA protein.
J.BIOL.CHEM. 265(1) 190-194 (1990).

Documentation
Arsenic is a toxic metalloid whose trivalent and pentavalent ions inhibit
a variety of biochemical processes. Operons that encode arsenic resistance
have been found in multicopy plasmids from both Gram-positive and
Gram-negative bacteria [1]. The resistance mechanism is encoded by a single
operon, which specifies an anion pump. The pump has two polypeptide
components: a catalytic subunit (the ArsA protein), which functions as an
oxyanion-stimulated ATPase; and an arsenite export component (the ArsB
protein), which is associated with the inner membrane [2]. The ArsA and ArsB
proteins are thought to form a membrane complex that functions as an
anion-translocating ATPase.
 
The ArsB protein is distinguished by its overall hydrophobic character, 
in keeping with its role as a membrane-associated channel. Sequence
analysis reveals the presence of 13 putative transmembrane (TM) regions.
 
ARSENICPUMP is an 8-element fingerprint that provides a signature for
arsenical pump ArsB membrane-associated proteins. The fingerprint was
derived from an initial alignment of 8 sequences: the motifs were drawn
from conserved regions spanning virtually the full alignment length - motif
1 spans putative TM regions 3 and 4; motif 2 includes part of the fifth TM
region; motif 3 spans the sixth TM region; motifs 4, 5 and 6 include parts
of the seventh, ninth and tenth putative TM regions; and motifs 7 and 8 span
the twelfth and final TM regions. A single iteration on OWL29.4 was required
to reach convergence, no further sequences being identified beyond the
starting set. Several partial matches were also found, all of which are
fragments and family members that fail to make significant matches with
one or more motifs.
 
An update on SPTR37_9f identified a true set of 9 sequences, and 5
partial matches.
Summary Information
9 codes involving 8 elements
0 codes involving 7 elements
0 codes involving 6 elements
0 codes involving 5 elements
2 codes involving 4 elements
1 codes involving 3 elements
1 codes involving 2 elements
Composite Feature Index
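   8| 9  9  9  9  9  9  9  9
   7| 0  0  0  0  0  0  0  0
   6| 0  0  0  0  0  0  0  0
   5| 0  0  0  0  0  0  0  0
   4| 0  0  0  1  2  2  2  1
   3| 0  1  0  0  0  1  1  0
   2| 0  1  1  0  0  0  0  0
  --+------------------------
    | 1  2  3  4  5  6  7  8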
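The Composite Feature Index can be cross-checked against the Summary Information. The sketch below assumes a particular reading of the index that the entry itself does not spell out: the leading number of each row is the element count k, the remaining eight numbers count, motif by motif, the codes that match exactly k elements, and the last row simply numbers the motifs. Under that assumption, the counts in row k should sum to k times the number of codes listed for k. The variable and function names are hypothetical; the values are copied from this entry.

```python
# Summary Information: number of codes that match exactly k elements.
codes_per_k = {8: 9, 7: 0, 6: 0, 5: 0, 4: 2, 3: 1, 2: 1}

# Composite Feature Index: for each k, the per-motif counts (motifs 1..8).
# Assumed reading: row k counts how many of the k-element codes include each motif.
cfi = {
    8: [9, 9, 9, 9, 9, 9, 9, 9],
    7: [0, 0, 0, 0, 0, 0, 0, 0],
    6: [0, 0, 0, 0, 0, 0, 0, 0],
    5: [0, 0, 0, 0, 0, 0, 0, 0],
    4: [0, 0, 0, 1, 2, 2, 2, 1],
    3: [0, 1, 0, 0, 0, 1, 1, 0],
    2: [0, 1, 1, 0, 0, 0, 0, 0],
}


def cfi_is_consistent(codes_per_k, cfi):
    """True if every row k of the index sums to k * (number of k-element codes)."""
    return all(sum(cfi[k]) == k * n for k, n in codes_per_k.items())


def motif_totals(cfi):
    """Total number of matches recorded for each motif across all rows."""
    return [sum(column) for column in zip(*cfi.values())]


if __name__ == "__main__":
    print(cfi_is_consistent(codes_per_k, cfi))
    print(motif_totals(cfi))
```

The per-motif totals give a rough view of which motifs are matched most often by the partial matches as well as by the full set.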
True Positives
ARB1_ECOLI ARB2_ECOLI ARSB_ECOLI ARSB_STAAU
ARSB_STAXY ARSB_YEREN O50594 O68021
P96678
True Positive Partials
Codes involving 4 elements
P76607 Q54091
Codes involving 3 elements
P96860
Codes involving 2 elements
P76608
Sequence Titles
ARB1_ECOLI ARSENICAL PUMP MEMBRANE PROTEIN - ESCHERICHIA COLI.
ARB2_ECOLI ARSENICAL PUMP MEMBRANE PROTEIN - ESCHERICHIA COLI.
ARSB_ECOLI ARSENICAL PUMP MEMBRANE PROTEIN - ESCHERICHIA COLI.
ARSB_STAAU ARSENICAL PUMP MEMBRANE PROTEIN - STAPHYLOCOCCUS AUREUS.
ARSB_STAXY ARSENICAL PUMP MEMBRANE PROTEIN - STAPHYLOCOCCUS XYLOSUS.
ARSB_YEREN TRANSMEMBRANE PROTEIN OF ARSENITE PUMP - YERSINIA ENTEROCOLITICA.
O50594 ARSB - ACIDIPHILIUM MULTIVORUM.
O68021 ARSB - PSEUDOMONAS AERUGINOSA.
P96678 YDFA PROTEIN - BACILLUS SUBTILIS.

P76607 FROM BASES 2765294 TO 2775787 (SECTION 239 OF 400) OF THE COMPLETE GENOME (SECTION 239 OF 400) - ESCHERICHIA COLI.
Q54091 SERINE PROTEASE - STAPHYLOCOCCUS EPIDERMIDIS.

P96860 ARSB - MYCOBACTERIUM TUBERCULOSIS.

P76608 FROM BASES 2765294 TO 2775787 (SECTION 239 OF 400) OF THE COMPLETE GENOME (SECTION 239 OF 400) - ESCHERICHIA COLI.
Scan History
OWL29_4 1 100 NSINGLE
SPTR37_9f 2 89 NSINGLE
Initial Motifs
Motif 1 width=25
Element Seqn Id St Int Rpt
LGAIVAAFFANDGAALILTPIVLAM ARSB_STAAU 102 102 -
LGAAVAALFANDGAALILTPIVIAM ARSB_ECOLI 101 101 -
LGAAVAALFANDGAALILTPIVIAM ARB2_ECOLI 101 101 -
LGAAVDALFANDGAALILTPIVIAM YFJV_ECOLI 10 10 -
LGAIVAALFANDGAALILTPIVLAM AB001488114 104 104 -
LGAAVAALFANDGAALILTPIVIAM YEU583662 101 101 -
LGAIVAAFFANDGAALILTPIVLAM ARSB_STAXY 102 102 -
LGAAVAALFANDGAALILTPIVIAM ARB1_ECOLI 101 101 -

Motif 2 width=25
Element Seqn Id St Int Rpt
GFIADTASLPLIVSNLVNIVSADFF ARB2_ECOLI 144 18 -
GFIADTTSLPLIVSNLVNIVSADYF ARSB_STAXY 145 18 -
GFIADTTSLPLIVSNLVNIVSADYF ARSB_STAAU 145 18 -
GFIADTASLPLIVSNLVNIVSADFF YEU583662 144 18 -
GFIADTASLPLIVSNLVNIVSADFF YFJV_ECOLI 53 18 -
GFIADTTSLPFVVSNLVNIVSADYF AB001488114 147 18 -
GFISDTASLPLIVSNLVNIVSADFF ARB1_ECOLI 144 18 -
GFIADTASLPLIVSNLVNIVSADFF ARSB_ECOLI 144 18 -

Motif 3 width=24
Element Seqn Id St Int Rpt
SRMVVPYLFSLLASIIVLYLFFRK AB001488114 180 8 -
SVMLPVDIAAIAATLGMLHLFFRR YEU583662 177 8 -
SVMISVDAAAIAATLIMLYLFFRR YFJV_ECOLI 86 8 -
SVMVPVDIAAIIATLVMLHLFFRK ARB2_ECOLI 177 8 -
SVMVPVDIAAIVATLVMLHLYFRK ARSB_ECOLI 177 8 -
SVMVPVDIAAIIATLVMLHLFFRK ARB1_ECOLI 177 8 -
SRMIIPNIFSLIASILVLWLYFRK ARSB_STAAU 178 8 -
SRMIIPNIFSLIASILVLWLYFRK ARSB_STAXY 178 8 -

Motif 4 width=26
Element Seqn Id St Int Rpt
AIKDLATFRAGWIVLLLLLVGFFFLE YFJV_ECOLI 126 16 -
AIKDPATFRAGWIVLVLLLVGFFVLE YEU583662 217 16 -
AIKDPATFKTGWVVLLLLLVGFFVLE ARSB_ECOLI 217 16 -
AIKDLATFRTGWIVLILLLVGFFVLE ARB2_ECOLI 217 16 -
VIKDPKLFKLSWIVLAILLVGYLVSE ARSB_STAXY 218 16 -
AIKDSKLFKLSWIVLAVLLVGYLVSE ARSB_STAAU 218 16 -
AIKDPATFRTGWVVLLLLLVGFFVLE ARB1_ECOLI 217 16 -
AIKDQNMFRLSWYILGLLLIGYFASE AB001488114 220 16 -

Motif 5 width=25
Element Seqn Id St Int Rpt
APWNIVVFSIGMYLVVFGLKNVGIT ARSB_STAXY 279 35 -
APWAIVFFSIGMYVVVYGVRNAGLT AB001488114 281 35 -
APWQIVIFSLGMYIVVYGLRNAGFT YFJV_ECOLI 187 35 -
APWQIVVFSLGMYLVVYGLRNAGLT YEU583662 278 35 -
APWQIVIFSLGMYLVVYGLRNAGLT ARB2_ECOLI 278 35 -
APWQIVIFSLGMYLVVYGLRNAGLT ARSB_ECOLI 278 35 -
APWQIVIFSLGMYLVIYGLRNAGLT ARB1_ECOLI 278 35 -
APWNIVVFSIGMYLVVFGLKNVGIT ARSB_STAAU 279 35 -

Motif 6 width=23
Element Seqn Id St Int Rpt
GMGFIAAFLSSIMNNMPTVLIDA ARSB_STAXY 324 20 -
GTGFLTAFLSSVMNNMPTVLVGA YEU583662 323 20 -
GTGFLTAFLSSVMNNMPTVLIGA YFJV_ECOLI 232 20 -
GMGFIAAILSSIMNNLPTVMIDA AB001488114 326 20 -
GTGFLTALLSSIMNNMPTVLIGA ARB1_ECOLI 323 20 -
GTGFLTAFLSSIMNNMPTVLVGA ARSB_ECOLI 323 20 -
GTGFLTAFLSSIMNNMPTVLVGA ARB2_ECOLI 323 20 -
GMGFIAAFLSSIMNNMPTVLIDA ARSB_STAAU 324 20 -

Motif 7 width=26
Element Seqn Id St Int Rpt
ANVIGCDLGPKITPIGSLATLLWLHV YEU583662 364 18 -
ANVIGCDLGPKITPIGSLATLLWLHV ARB2_ECOLI 364 18 -
ANVIGCDLGPKITPIGSLATLLWLHV ARSB_ECOLI 364 18 -
ANVIGCDLGPKITPIGSLATLLWLHV ARB1_ECOLI 364 18 -
ANVIGSDLGPKITPIGSLATLLWLHV ARSB_STAAU 365 18 -
ANVIGSDLGPKITPIGSLATLLWLHV ARSB_STAXY 365 18 -
ANVIGCDLGPKITPIGSLATLLWLHV YFJV_ECOLI 273 18 -
ANVIGSDLGPKITPIGSLATLLWLHV AB001488114 367 18 -

Motif 8 width=25
Element Seqn Id St Int Rpt
ITWGYYFRTGIVMTVPVLFVTLAAL YFJV_ECOLI 306 7 -
ISWGTYFKTGIIITIPVLFVTLLGL ARSB_STAXY 398 7 -
ISWGTYFKTGIIITIPVLFVTLLGL ARSB_STAAU 398 7 -
ITWGYYFRTGIVMTLPVLFVTLAAL ARB1_ECOLI 397 7 -
ISWGYYFRTGIIMTLPVLFVTLAAL ARSB_ECOLI 397 7 -
ITWGYYFRTGIVMTLPVLFVTLAAL ARB2_ECOLI 397 7 -
ISWGTYFKTGIILTIPTLLITLVGL AB001488114 400 7 -
ISWGYYFRTGIIMTLPVLFVTLAAL YEU583662 397 7 -
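How strongly a motif is conserved can be read directly from its aligned elements. As a minimal sketch (the function names are hypothetical, and this is not the procedure used to build the fingerprint), the Python below takes the eight elements of initial Motif 7 listed above, reports the alignment positions at which they differ, and derives a simple majority-rule consensus.

```python
from collections import Counter

# The eight elements of initial Motif 7 (width=26), copied from the table above.
motif7 = [
    "ANVIGCDLGPKITPIGSLATLLWLHV",  # YEU583662
    "ANVIGCDLGPKITPIGSLATLLWLHV",  # ARB2_ECOLI
    "ANVIGCDLGPKITPIGSLATLLWLHV",  # ARSB_ECOLI
    "ANVIGCDLGPKITPIGSLATLLWLHV",  # ARB1_ECOLI
    "ANVIGSDLGPKITPIGSLATLLWLHV",  # ARSB_STAAU
    "ANVIGSDLGPKITPIGSLATLLWLHV",  # ARSB_STAXY
    "ANVIGCDLGPKITPIGSLATLLWLHV",  # YFJV_ECOLI
    "ANVIGSDLGPKITPIGSLATLLWLHV",  # AB001488114
]


def variable_positions(elements):
    """Return the 1-based positions at which the aligned elements are not identical."""
    return [
        pos
        for pos, column in enumerate(zip(*elements), start=1)
        if len(set(column)) > 1
    ]


def majority_consensus(elements):
    """Return the most common residue at each position (simple majority rule)."""
    return "".join(
        Counter(column).most_common(1)[0][0] for column in zip(*elements)
    )


if __name__ == "__main__":
    print(variable_positions(motif7))
    print(majority_consensus(motif7))
```

For this motif the elements differ at a single position (C or S at position 6), which makes it the most uniform of the eight initial motifs.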
Final Motifs
Motif 1 width=25
Element Seqn Id St Int Rpt
LGAAVAALFANDGAALILTPIVIAM ARB2_ECOLI 101 101 -
LGAAVAALFANDGAALILTPIVIAM O50594 101 101 -
LGAAVAALFANDGAALILTPIVIAM ARSB_ECOLI 101 101 -
LGAAVAALFANDGAALILTPIVIAM ARB1_ECOLI 101 101 -
LGAAVAALFANDGAALILTPIVIAM ARSB_YEREN 101 101 -
LGAIVAAFFANDGAALILTPIVLAM ARSB_STAAU 102 102 -
LGAIVAAFFANDGAALILTPIVLAM ARSB_STAXY 102 102 -
LGAIVAALFANDGAALILTPIVLAM P96678 104 104 -
LGAAVSALFANDGAALILTPIVMSM O68021 100 100 -

Motif 2 width=25
Element Seqn Id St Int Rpt
GFIADTASLPLIVSNLVNIVSADFF ARB2_ECOLI 144 18 -
GFIADTASLPLIVSNLVNIVSADFF O50594 144 18 -
GFIADTASLPLIVSNLVNIVSADFF ARSB_ECOLI 144 18 -
GFISDTASLPLIVSNLVNIVSADFF ARB1_ECOLI 144 18 -
GFIADTASLPLIVSNLVNIVSADFF ARSB_YEREN 144 18 -
GFIADTTSLPLIVSNLVNIVSADYF ARSB_STAAU 145 18 -
GFIADTTSLPLIVSNLVNIVSADYF ARSB_STAXY 145 18 -
GFIADTTSLPFVVSNLVNIVSADYF P96678 147 18 -
GFIADSASLPLVVSNLVNIVSADYF O68021 143 18 -

Motif 3 width=24
Element Seqn Id St Int Rpt
SVMVPVDIAAIIATLVMLHLFFRK ARB2_ECOLI 177 8 -
SVMVPVDIAAIVATLVMLHLFFRK O50594 177 8 -
SVMVPVDIAAIVATLVMLHLYFRK ARSB_ECOLI 177 8 -
SVMVPVDIAAIIATLVMLHLFFRK ARB1_ECOLI 177 8 -
SVMLPVDIAAIAATLGMLHLFFRR ARSB_YEREN 177 8 -
SRMIIPNIFSLIASILVLWLYFRK ARSB_STAAU 178 8 -
SRMIIPNIFSLIASILVLWLYFRK ARSB_STAXY 178 8 -
SRMVVPYLFSLLASIIVLYLFFRK P96678 180 8 -
AVMLPVNLVSVATSLLVLFLYFRR O68021 176 8 -

Motif 4 width=26
Element Seqn Id St Int Rpt
AIKDLATFRTGWIVLILLLVGFFVLE ARB2_ECOLI 217 16 -
AIKDLATFRTGWIVLILLLVGFFVLE O50594 217 16 -
AIKDPATFKTGWVVLLLLLVGFFVLE ARSB_ECOLI 217 16 -
AIKDPATFRTGWVVLLLLLVGFFVLE ARB1_ECOLI 217 16 -
AIKDPATFRAGWIVLVLLLVGFFVLE ARSB_YEREN 217 16 -
AIKDSKLFKLSWIVLAVLLVGYLVSE ARSB_STAAU 218 16 -
VIKDPKLFKLSWIVLAILLVGYLVSE ARSB_STAXY 218 16 -
AIKDQNMFRLSWYILGLLLIGYFASE P96678 220 16 -
AIRDRATFVVGGWMLLVLLAGLFALE O68021 216 16 -

Motif 5 width=25
Element Seqn Id St Int Rpt
APWQIVIFSLGMYLVVYGLRNAGLT ARB2_ECOLI 278 35 -
APWQIVIFSLGMYLVVYGLRNAGLT O50594 278 35 -
APWQIVIFSLGMYLVVYGLRNAGLT ARSB_ECOLI 278 35 -
APWQIVIFSLGMYLVIYGLRNAGLT ARB1_ECOLI 278 35 -
APWQIVVFSLGMYLVVYGLRNAGLT ARSB_YEREN 278 35 -
APWNIVVFSIGMYLVVFGLKNVGIT ARSB_STAAU 279 35 -
APWNIVVFSIGMYLVVFGLKNVGIT ARSB_STAXY 279 35 -
APWAIVFFSIGMYVVVYGVRNAGLT P96678 281 35 -
APWQIVVFSLGMYLVVYGLKNAGLT O68021 276 34 -

Motif 6 width=23
Element Seqn Id St Int Rpt
GTGFLTAFLSSIMNNMPTVLVGA ARB2_ECOLI 323 20 -
GTGFLTAFLSSIMNNMPTVLVGA O50594 323 20 -
GTGFLTAFLSSIMNNMPTVLVGA ARSB_ECOLI 323 20 -
GTGFLTALLSSIMNNMPTVLIGA ARB1_ECOLI 323 20 -
GTGFLTAFLSSVMNNMPTVLVGA ARSB_YEREN 323 20 -
GMGFIAAFLSSIMNNMPTVLIDA ARSB_STAAU 324 20 -
GMGFIAAFLSSIMNNMPTVLIDA ARSB_STAXY 324 20 -
GMGFIAAILSSIMNNLPTVMIDA P96678 326 20 -
GTGLLSAALSSVMNNMPSMLLGA O68021 321 20 -

Motif 7 width=26
Element Seqn Id St Int Rpt
ANVIGCDLGPKITPIGSLATLLWLHV ARB2_ECOLI 364 18 -
ANVIGCDLGPKITPIGSLATLLWLHV O50594 364 18 -
ANVIGCDLGPKITPIGSLATLLWLHV ARSB_ECOLI 364 18 -
ANVIGCDLGPKITPIGSLATLLWLHV ARB1_ECOLI 364 18 -
ANVIGCDLGPKITPIGSLATLLWLHV ARSB_YEREN 364 18 -
ANVIGSDLGPKITPIGSLATLLWLHV ARSB_STAAU 365 18 -
ANVIGSDLGPKITPIGSLATLLWLHV ARSB_STAXY 365 18 -
ANVIGSDLGPKITPIGSLATLLWLHV P96678 367 18 -
ANVIGCDLGPKITPIGSLATLLWLHV O68021 362 18 -

Motif 8 width=25
Element Seqn Id St Int Rpt
ITWGYYFRTGIVMTLPVLFVTLAAL ARB2_ECOLI 397 7 -
ITWGYYFRTGIVMTLPVLFVTLAAL O50594 397 7 -
ISWGYYFRTGIIMTLPVLFVTLAAL ARSB_ECOLI 397 7 -
ITWGYYFRTGIVMTLPVLFVTLAAL ARB1_ECOLI 397 7 -
ISWGYYFRTGIIMTLPVLFVTLAAL ARSB_YEREN 397 7 -
ISWGTYFKTGIIITIPVLFVTLLGL ARSB_STAAU 398 7 -
ISWGTYFKTGIIITIPVLFVTLLGL ARSB_STAXY 398 7 -
ISWGTYFKTGIILTIPTLLITLVGL P96678 400 7 -
ITWGYYFRVGALLTLPVLLATLSAL O68021 395 7 -
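The St and Int columns in the motif tables appear to be related: judging from the numbers alone (the entry does not define the columns), St is the start position of the element in the sequence and Int is the gap between the end of the previous motif and the start of the current one, with Int equal to St for the first motif. A short Python sketch of that assumed relationship, checked against two sequences from the Final Motifs tables (names are hypothetical):

```python
# Motif widths 1..8, from the "width=" headers of the motif tables.
widths = [25, 25, 24, 26, 25, 23, 26, 25]

# (St, Int) for motifs 1..8, copied from the Final Motifs tables.
final_rows = {
    "ARB2_ECOLI": [(101, 101), (144, 18), (177, 8), (217, 16),
                   (278, 35), (323, 20), (364, 18), (397, 7)],
    "O68021": [(100, 100), (143, 18), (176, 8), (216, 16),
               (276, 34), (321, 20), (362, 18), (395, 7)],
}


def intervals_from_starts(starts, widths):
    """Recompute Int from St under the assumed rule:
    Int[0] = St[0]; Int[i] = St[i] - (St[i-1] + width[i-1])."""
    intervals = [starts[0]]
    for i in range(1, len(starts)):
        intervals.append(starts[i] - (starts[i - 1] + widths[i - 1]))
    return intervals


def row_is_consistent(row, widths):
    """True if the listed Int values match those recomputed from St."""
    starts = [st for st, _ in row]
    listed = [interval for _, interval in row]
    return intervals_from_starts(starts, widths) == listed


if __name__ == "__main__":
    for seq_id, row in final_rows.items():
        print(seq_id, row_is_consistent(row, widths))
```

Under this reading, the smaller Int of 34 for O68021 at Motif 5 reflects a one-residue shorter spacer between Motifs 4 and 5 than in the other sequences.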