AMYB_SOYBN.abs:
PMID - 7777485
Sweet potato beta-amylase is a tetramer of identical subunits, which are arranged to exhibit 222 molecular symmetry. Its subunit consists of 498 amino acid residues (Mr 55,880). It has been crystallized at room temperature using polyethylene glycol 1500 as precipitant. The crystals, growing to dimensions of 0.4 mm x 0.4 mm x 1.0 mm within 2 weeks, belong to the tetragonal space group P4(2)2(1)2 with unit cell dimensions of a = b = 129.63 A and c = 68.42 A. The asymmetric unit contains 1 subunit of beta-amylase, with a crystal volume per protein mass (VM) of 2.57 A3/Da and a solvent content of 52% by volume. The three-dimensional structure of the tetrameric beta-amylase from sweet potato has been determined by molecular replacement methods using the monomeric structure of soybean enzyme as the starting model. The refined subunit model contains 3,863 nonhydrogen protein atoms (488 amino acid residues) and 319 water oxygen atoms. The current R-value is 20.3% for data in the resolution range of 8-2.3 A (with 2 sigma cut-off) with good stereochemistry. The subunit structure of sweet potato beta-amylase (crystallized in the absence of alpha-cyclodextrin) is very similar to that of soybean beta-amylase (complexed with alpha-cyclodextrin). The root-mean-square (RMS) difference for 487 equivalent C alpha atoms of the two beta-amylases is 0.96 A. Each subunit of sweet potato beta-amylase is composed of a large (alpha/beta)8 core domain, a small one made up of three long loops [L3 (residues 91-150), L4 (residues 183-258), and L5 (residues 300-327)], and a long C-terminal loop formed by residues 445-493. Conserved Glu 187, believed to play an important role in catalysis, is located at the cleft between the (alpha/beta)8 barrel core and a small domain made up of three long loops (L3, L4, and L5). Conserved Cys 96, important in the inactivation of enzyme activity by sulfhydryl reagents, is located at the entrance of the (alpha/beta)8 barrel.
PMID - 2430952
The blocked N-terminus and N-terminal sequence of soybean beta-amylase were determined by analyzing the acidic peptides derived on peptic digestion of the enzyme. The acidic peptides were separated from the digest on a Dowex 50 X 2 column and purified by reversed phase-high performance liquid chromatography (RP-HPLC). The major acidic peptide, Pep-4, was a heptapeptide with a molecular weight of 766. Forty-eight hundredths mol acetyl group and 0.61 mol acetyl-Ala per mol of Pep-4 were detected on RP-HPLC analysis. The N-terminal 9 amino acid sequence of soybean beta-amylase was deduced to be acetyl-Ala-Thr-Ser-Asp-Ser-Asn-Met-(Gly-Leu) from the results of sequence analysis of Pep-4 and amino acid analysis of other acidic peptides.
PMID - 9918723
The three-dimensional structure of the sevenfold mutant of barley beta-amylase (BBA-7s) with increased thermostability was determined by X-ray crystallography. The enzyme was purified as a single component and crystallized by a hanging drop method in the presence of 14 % PEG 6000. The crystals belong to space group P43212 with cell dimensions a=b=72.11 A, c=250.51 A. The diffraction data up to 2.5 A were collected after soaking the crystal in 100 mM maltose with Rsym of 8.6 %. The structure was determined by a molecular replacement method using soybean beta-amylase (SBA) as a search model and refined to an R-factor of 18.7 %. The final model included 500 amino acid residues, 141 water molecules and three glucose residues, which were located at subsites 1-2 and 4 in the active site. The r.m.s. distance of 485 Calpha atoms between BBA-7s and SBA was 0.62 A. Out of the seven mutated amino acids, four (Ser295Ala, Ile297Val, Ser351Pro and Ala376Ser) were substitutions from the common residues with SBA to the thermostable forms. A comparison of the structures of BBA-7s and SBA indicated that the side-chain of Ser376 makes new hydrogen bonds to the main-chain of an adjacent beta-strand, and that the side-chains of Val297 reduce an unfavorable interaction between the side-chains of Ala314. The mutation of Ser295Ala breaks the hydrogen bond between Ser295 OG and Tyr195 OH, which seems to be the reason for the unoccupied glucose residue at subsite 3. The tandem mutations at 350-352 including substitutions to two Pro residues suggested the reduction of main-chain entropy in the unfolded structure of this solvent-exposed protruded loop.
PMID - 8103452
The complete amino acid sequence of a subunit of sweet potato beta-amylase, a homotetramer, was established by sequence analysis of peptides obtained by digestions with Achromobacter protease I and Staphylococcus aureus V8 protease and by cyanogen bromide cleavage of the S-carboxymethylated subunit. The subunit of the enzyme is a single polypeptide consisting of 498 amino acid residues. It showed 50-60% identity in the amino acid sequence with those of beta-amylases from soybean and barley, while it about 25% with those of three bacterial beta-amylases deduced from the cDNA sequences. Sweet potato beta-amylase was completely inactivated with 2,3-epoxypropyl alpha-D-[U-14C]glucopyranoside. Sequence analysis of the inactivated enzyme revealed that Glu187 was specifically esterified by the affinity labeling with the above reagent, proposing that Glu187 is a potent candidate involved directly in the catalysis with this plant beta-amylase.
PMID - 10617198
The higher plant Arabidopsis thaliana (Arabidopsis) is an important model for identifying plant genes and determining their function. To assist biological investigations and to define chromosome structure, a coordinated effort to sequence the Arabidopsis genome was initiated in late 1996. Here we report one of the first milestones of this project, the sequence of chromosome 4. Analysis of 17.38 megabases of unique sequence, representing about 17% of the genome, reveals 3,744 protein coding genes, 81 transfer RNAs and numerous repeat elements. Heterochromatic regions surrounding the putative centromere, which has not yet been completely sequenced, are characterized by an increased frequency of a variety of repeats, new repeats, reduced recombination, lowered gene density and lowered gene expression. Roughly 60% of the predicted protein-coding genes have been functionally characterized on the basis of their homology to known genes. Many genes encode predicted proteins that are homologous to human and Caenorhabditis elegans proteins.
PMID - 9847126
Alfalfa (Medicago sativa L.) roots contain large quantities of beta-amylase, but little is known about its role in vivo. We studied this by isolating a beta-amylase cDNA and by examining signals that affect its expression. The beta-amylase cDNA encoded a 55.95-kD polypeptide with a deduced amino acid sequence showing high similarity to other plant beta-amylases. Starch concentrations, beta-amylase activities, and beta-amylase mRNA levels were measured in roots of alfalfa after defoliation, in suspension-cultured cells incubated in sucrose-rich or -deprived media, and in roots of cold-acclimated germ plasms. Starch levels, beta-amylase activities, and beta-amylase transcripts were reduced significantly in roots of defoliated plants and in sucrose-deprived cell cultures. beta-Amylase transcript was high in roots of intact plants but could not be detected 2 to 8 d after defoliation. beta-Amylase transcript levels increased in roots between September and October and then declined 10-fold in November and December after shoots were killed by frost. Alfalfa roots contain greater beta-amylase transcript levels compared with roots of sweetclover (Melilotus officinalis L.), red clover (Trifolium pratense L.), and birdsfoot trefoil (Lotus corniculatus L.). Southern analysis indicated that beta-amylase is present as a multigene family in alfalfa. Our results show no clear association between beta-amylase activity or transcript abundance and starch hydrolysis in alfalfa roots. The great abundance of beta-amylase and its unexpected patterns of gene expression and protein accumulation support our current belief that this protein serves a storage function in roots of this perennial species.
PMID - 1837016
Tuberous roots of the sweet potato are unusually rich in beta-amylase, and the beta-amylase polypeptides account for about 5% of the total soluble protein of the organ. Unlike beta-amylases from other origins, the sweet potato beta-amylase is a tetramer of identical subunits, and it also bears starch phosphorylase-inhibitor activity. A cDNA for the subunit of sweet potato beta-amylase was obtained by immunological screening of an expression cDNA library constructed by the vector-primer and linker method using a plasmid vector containing tac-SP6 promoters. The SP6 transcript of a 2,000 base-pair-long cDNA insert directed the synthesis in vitro of a precursor to the subunit of beta-amylase which was identical in size with the mature subunit, and the beta-amylase mRNA detected by Northern blot hybridization was identical in size with the SP6 transcript of the cDNA insert. The cDNA insert contained 1,494 base pairs of an open reading frame which codes for the 499-amino-acid-long precursor to the subunit of beta-amylase. An amino acid sequence identical to the N-terminal amino acid sequence of the mature subunit appeared immediately after the initiator methionine of the precursor, indicating that the subunit of beta-amylase is synthesized as a mature form. Comparison of the amino acid sequences of subunits of sweet potato beta-amylase and seed beta-amylases from barley and soybean indicated that these enzymes share about 68% amino acid identities among each other. Escherichia coli cells harboring the cDNA clone produced the mature-sized subunit of the beta-amylase, and the soluble extract exhibited amylolytic activity which migrated to the same position as the beta-amylase purified from the sweet potato in non-denaturing polyacrylamide gel containing soluble starch indicating that oligomerization of the subunit occurred properly in E. coli cells.
PMID - 2446870
The primary structure of barley endosperm beta-amylase, an enzyme which catalyses the liberation of maltose from 1,4-alpha-D-glucans, has been deduced from the nucleotide sequence of a cloned full-length cDNA. The mRNA is 1754 nucleotides long [excluding the poly(A) tail] and codes for a polypeptide of 535 amino acids with a relative molecular mass of 59,663. The deduced amino acid sequence was compared with the sequences of ten peptides obtained from the purified enzyme and unambiguous identification was obtained. The N-terminal region of the deduced sequence was identical to a 12-residue cyanogen-bromide-peptide sequence, indicating that beta-amylase is synthesized as the mature protein. A graphic matrix homology plot shows four glycine-rich repeats, each of 11 residues, preceding the C-terminus. Southern blotting of genomic DNA demonstrates that beta-amylase is encoded by a small gene family, while cDNA sequence analysis indicates the presence of at least two types of mRNA in the endosperm. Dot and northern blot analysis show that Hiproly barley contains greatly increased levels of beta-amylase mRNA compared to the normal cultivar Sundance, whereas Risø mutant 1508 contains only trace amounts. These results correlate well with the deposition of beta-amylase during endosperm development in these lines. Low but similar amounts of beta-amylase mRNAs sequences were detected in leaves and shoots from normal and mutant barleys, demonstrating that the mutant lys3a (1508) and lysl (Hiproly) genes do not affect the expression of beta-amylase in these tissues.
PMID - 1383095
A nuclear AmyB gene from sweet potato encoding beta-amylase (beta Amy) that is abundant in tuberous roots and inducible in other organs by an exogenous supply of sucrose or polygalacturonic acid, was isolated and characterized. Genomic Southern blot hybridization, restriction maps of independently isolated phage lambda genomic clones, and the nucleotide sequence of AmyB compared with that of the cDNA, all suggested that beta Amy of sweet potato is encoded by a gene that is present in a single copy per haploid genome. In the sequence of AmyB, the sequence that is identical to that of the cDNA was split into seven exons by six introns, and the transcription of this gene starts from multiple sites 26 to 30 bp downstream from a potential TATA-box sequence, 5'-TATATAA. In the 5'-upstream region of AmyB, there are sequences homologous to those conserved in the 5'-upstream regions of genes encoding sporamin, which are regulated similarly to AmyB. The 5'-upstream region of AmyB also contains sequences to which several previously known plant nuclear factors bind.
PMID - 8334116
New crystallographic findings are presented which offer a deeper understanding of the structure and functioning of beta-amylase, the first known exo-type starch-hydrolyzing enzyme. A refined three-dimensional structure of soybean beta-amylase, complexed with the inhibitor alpha-cyclodextrin, has been determined at 2.0-A resolution with a conventional R-value of 17.5%. The model contains 491 amino acid residues, 319 water molecules, 1 sulfate ion, and 1 alpha-cyclodextrin molecule. The protein consists of a core with an (alpha/beta)8 supersecondary structure, plus a smaller globular region formed by long loops (L3, L4, and L5) extending from beta-strands beta 3, beta 4, and beta 5. Between the two regions is a cleft that opens into a pocket whose floor contains the postulated catalytic center near the carboxyl group of Glu 186. The annular alpha-cyclodextrin binds in (and partly projects from) the cleft with its glucosyl O-2/O-3 face abutting the (alpha/beta)8 side and with its alpha-D(1 --> 4) glucosidic linkage progression running clockwise as viewed from that side. The ligand does not bind deeply enough to interact with the carboxyl group of Glu 186. Rather, it occupies most of the cleft entrance, strongly suggesting that alpha-cyclodextrin inhibits catalysis by blocking substrate access to the more deeply located reaction center. Of the various alpha-cyclodextrin interactions with protein residues in loops L4, L5, L6, and L7, most notable is the shallow inclusion complex formed with Leu 383 (in L7, on the core side of the cleft) through contacts of its methyl groups with the C-3 atoms of four of the ligand's D-glucopyranosyl residues. All six residues of the bound alpha-cyclodextrin are of 4C1 conformation and are joined by alpha-1,4 linkages with similar torsional angles to form a nearly symmetrical torus as reported for crystalline inclusion complexes with alpha-cyclodextrin. We envision a significant role for the methyl groups of Leu 383 at the cleft entrance with respect to the productive binding of the outer chains of starch.
PMID - 8174545
To determine which amino acid residues are essential for the catalytic activity of soybean beta-amylase, deoxyoligonucleotide site-directed mutagenesis was employed against aspartyl, glutamyl, and cysteinyl residues located in highly conserved regions found in beta-amylase family to date. Both substitution of aspartic acid at position 101 and that of glutamic acid at position 186 of the enzyme by neutral and acidic amino acids, respectively, led to the complete elimination of activity, but did not induce any significant changes in circular dichroic spectra or the binding affinity for cyclomaltohexaose, a substrate analogue. Taking account of the results obtained here, the above two amino acid residues are involved in the catalytic site of soybean beta-amylase. The replacement of glutamic acid at position 345 decreased activity to below 6% of the non-mutant level, implying that this residue may also play a crucial role in beta-amylase activity, although it may not be involved at the catalytic site itself. In contrast, substitution of cysteinyl residue at position 95 by a serinyl residue led to a drastic reducing of the optimal temperature (from 50 degrees C to 30 degrees C), suggesting that this cysteinyl residue is responsible for the thermal stability of the enzyme.
PMID - 9677422
In order to study the interaction of soybean beta-amylase with substrate, we solved the crystal structure of beta-cyclodextrin-enzyme complex and compared it with that of alpha-cyclodextrin-enzyme complex. The enzyme was expressed in Escherichia coli at a high level as a soluble and catalytically active protein. The purified recombinant enzyme had properties nearly identical to those of native soybean beta-amylase and formed the same crystals as the native enzyme. The crystal structure of recombinant enzyme complexed with beta-cyclodextrin was refined at 2. 07-A resolution with a final crystallographic R value of 15.8% (Rfree = 21.1%). The root mean square deviation in the position of C-alpha atoms between this recombinant enzyme and the native enzyme was 0.22 A. These results indicate that the expression system established here is suitable for studying structure-function relationships of beta-amylase. The conformation of the bound beta-cyclodextrin takes an ellipsoid shape in contrast to the circular shape of the bound alpha-cyclodextrin. The cyclodextrins shared mainly two glucose binding sites, 3 and 4. The glucose residue 4 was slightly shifted from the maltose binding site. This suggests that the binding site of the cyclodextrins is important for its holding of a cleaved substrate, which enables the multiple attack mechanism of beta-amylase.
PMID - 7716246
The levels of beta-amylase activity and of the mRNA for beta-amylase in rosette leaves of Arabidopsis thaliana (L.) Heynh. increased significantly, with the concomitant accumulation of starch, when whole plants or excised mature leaves were supplied with sucrose. A supply of glucose or fructose, but not of mannitol or sorbitol, to plants also induced the expression of the gene for beta-amylase, and the induction occurred not only in rosette leaves but also in roots, stems, and bracts. These results suggest that the gene for beta-amylase of Arabidopsis is subject to regulation by a carbohydrate metabolic signal, and expression of the gene in various tissues may be regulated by the carbon partitioning and sink-source interactions in the whole plant. The sugar-inducible expression of the gene in Arabidopsis was severely repressed in the absence of light. The sugar-inducible expression in the light was not inhibited by 3(3,4-dichlorophenyl)-1,1-dimethylurea or by chloramphenicol, but it was inhibited by cycloheximide. These results suggest that a light-induced signal and de novo synthesis of proteins in the cytoplasm are involved in the regulation. A fusion gene composed of the 5' upstream region of the gene for beta-amylase from Arabidopsis and the coding sequence of beta-glucuronidase showed the sugar-inducible expression in a light-dependent manner in rosette leaves of transgenic Arabidopsis.
PMID - 8011643
The crystal structures of catalytically competent soybean beta-amylase, unliganded and bathed with small substrates (beta-maltose, maltal), were determined at 1.9-2.2-A resolution. Two molecules of beta-maltose substrate bind to the protein in tandem, with some maltotetraose enzymic condensation product sharing the same binding sites. The beta-amylase soaked with maltal shows a similar arrangement of two bound molecules of 2-deoxymaltose, the enzymic hydration product. In each case the nonreducing ends of the saccharide ligands are oriented toward the base of the protein's active site pocket. The catalytic center, located between the bound disaccharides and found deeper in the pocket than where the inhibitor alpha-cyclodextrin binds, is characterized by the presence of oppositely disposed carboxyl groups of two conserved glutamic acid residues. The OE2 carboxyl of Glu 186 is below the plane of the penultimate glucose residue (Glc 2) of bound maltotetraose, 2.6 A from the oxygen atom of that ligand's penultimate alpha-1,4-glucosidic linkage. The OE2 carboxyl of Glu 380 lies above the plane of Glc 2, 2.8 A from the O-1 atom of the more deeply bound beta-maltose. Saccharide binding does not alter the spatial coordinates of these two carboxyl groups or the overall conformation of the 57-kDa protein. However, the saccharide complexes of the active enzyme are associated with a significant (10 A) local conformational change in a peptide segment of a loop (L3) that borders the active site pocket.(ABSTRACT TRUNCATED AT 250 WORDS)
PMID - 14593172
Functional analysis of a genome requires accurate gene structure information and a complete gene inventory. A dual experimental strategy was used to verify and correct the initial genome sequence annotation of the reference plant Arabidopsis. Sequencing full-length cDNAs and hybridizations using RNA populations from various tissues to a set of high-density oligonucleotide arrays spanning the entire genome allowed the accurate annotation of thousands of gene structures. We identified 5817 novel transcription units, including a substantial amount of antisense gene transcription, and 40 genes within the genetically defined centromeres. This approach resulted in completion of approximately 30% of the Arabidopsis ORFeome as a resource for global functional experimentation of the plant proteome.
PMID - 9046591
A maize (Zea mays L.) cDNA clone (pZMB2) encoding beta-amylase was isolated from a cDNA library prepared from the aleurone RNA of germinating kernels. The cDNA encodes a predicted product of 488 amino acids with significant similarity to known beta-amylases from barley (Hordeum vulgare), rye (Secale cereale), and rice (Oryza sativa). Glycine-rich repeats found in the carboxyl terminus of the endosperm-specific beta-amylase of barley and rye are absent from the maize gene product. The N-terminal sequence of the first 20 amino acids of a beta-amylase peptide derived from purified protein is identical to the 5th through 24th amino acids of the predicted cDNA product, indicating the absence of a conventional signal peptide in the maize protein. Recombinant inbred mapping data indicate that the cDNA clone is single-copy gene that maps to chromosome 7L at position 83 centimorgans. Northern blot analysis and in vitro translation-immunoprecipitation data indicate that the maize beta-amylase is synthesized de novo in the aleurone cells but not in the scutellum during seed germination.
PMID - 8188635
Polymerase chain reaction (PCR) amplification of mRNA from developing barley (cultivar Haruna two-rows) endosperm was used to clone and sequence full-length cDNA encoding beta-amylase. The beta-amylase cDNA was 1,775 bp in length. The beta-amylase was deduced to be composed of 535 amino acid residues and its molecular weight was calculated to be 59,610. Kreis et al. reported that the beta-amylase cDNA from barley (cultivar Hiproly) was 1,754 bp in length and coded for a polypeptide of 535 amino acids [Eur. J. Biochem. (1987) 169, 517-525]. A comparison of the beta-amylase sequences from Haruna two-rows and Hiproly barleys revealed nine differences in the nucleotide sequence which resulted in three changes in the amino acid residues and 21 additional nucleotides at its 3'-end in the cultivar Haruna two-rows. The three changes were as follows: Ala-233, Ser-347, Met-527 (Haruna two-rows) and Val-233, Met-347, Ile-527 (Hiproly). Lundgard and Svensson pointed out that 23 amino acid residues of the peptide fragment derived from the COOH-terminal region of barley (cultivar Gula) beta-amylase were in agreement with the deduced amino acid sequence reported by Kreis et al., with the exception of a single position (Met-527 compared to Ile) [Carlsberg Res. Commun. (1986) 51, 487-491]. Our findings described above showed Met-527 is reasonable. In the cases of beta-amylases from soybean and sweet potato, the positions that corresponded to those at 233 and 347 in the amino acid sequence of beta-amylase from barley were Ala and Ser, respectively.(ABSTRACT TRUNCATED AT 250 WORDS)
PMID - 9461215
The plant Arabidopsis thaliana (Arabidopsis) has become an important model species for the study of many aspects of plant biology. The relatively small size of the nuclear genome and the availability of extensive physical maps of the five chromosomes provide a feasible basis for initiating sequencing of the five chromosomes. The YAC (yeast artificial chromosome)-based physical map of chromosome 4 was used to construct a sequence-ready map of cosmid and BAC (bacterial artificial chromosome) clones covering a 1.9-megabase (Mb) contiguous region, and the sequence of this region is reported here. Analysis of the sequence revealed an average gene density of one gene every 4.8 kilobases (kb), and 54% of the predicted genes had significant similarity to known genes. Other interesting features were found, such as the sequence of a disease-resistance gene locus, the distribution of retroelements, the frequent occurrence of clustered gene families, and the sequence of several classes of genes not previously encountered in plants.
PMID - 8319688
The cDNA clones corresponding to soybean beta-amylase mRNA were isolated and sequenced. The cDNA contained an open-reading frame composed of 496 amino acids. The comparison of the amino acid sequence deduced from the cDNA with the N-terminal peptide sequence from mature enzyme proved that beta-amylase had no leader sequence. Employing the cDNA, the beta-amylase was directly synthesized in Escherichia coli by the expression vector pKK233-2 controlled by the tac promoter. The enzyme activity detected in E. coli lysate drastically increased with a lower cultivation temperature, and the total activity and specific activity of the enzyme in E. coli lysate cultured at 13 degrees C was 130-fold and 280-fold, respectively, the value at 37 degrees C. The enzyme produced in E. coli was purified by the affinity column chromatography of cyclomaltohexaose-immobilized Sepharose 6B. Employing the established expression and purification system of the enzyme, the functional ionizable groups in the active site were searched. His93, involving an imidazole, and Asp348, involving a carboxylate, in the highly conserved regions within the beta-amylases were replaced by Arg (H93R) and Ash (D348N) by site-directed mutagenesis, respectively. All beta-amylases, including the non-mutant and mutant beta-amylases, produced in E. coli exhibited lower Vmax values than that of beta-amylase isolated conventionally from soybean seeds. Especially the Vmax value of [H93R]beta-amylase was reduced drastically compared to that of the non-mutant; however, none of them lost their enzyme activities completely. Therefore, neither His93 nor Asp348 may participate in the catalytic reaction directly.
PMID - 8534999
A structural gene encoding the beta-amylase that is abundant in the starchy endosperm of ungerminated barley seeds was isolated and characterized. It was 3825 bp in length. In the sequence of the structural gene, a sequence identical to that of the cDNA was found to contain seven exons and six introns.
PMID - 2474529
Soybean beta-amylase was modified with 2,3-epoxypropyl alpha-D-[U-14C]glucopyranoside ([14C]alpha-EPG), a radioactive affinity-labeling reagent for beta-amylase, until it lost 95% of its enzyme activity. After S-carboxymethylation at pH 8.0 of SH groups, the modified enzyme was digested at pH 7.0 with Achromobacter protease I and the digest was fractionated by reverse-phase HPLC. A radioactive peptide was finally isolated and its amino acid sequence was determined to be 181Leu-Gly-Pro-Ala-Gly-Glu186. Radioactivity derived from [14C]-alpha-EPG was found exclusively at Glu-186, the gamma-carboxyl group of which is esterified with the affinity label. It was concluded that the carboxylate of Glu-186 is a functional group at the catalytic site of soybean beta-amylase.
PMID - 1491009
The three-dimensional structure of a complex of soybean beta-amylase [EC 3.2.1.2] with an inhibitor, alpha-cyclodextrin, has been determined at 3.0 A resolution by X-ray diffraction analysis. Preliminary chain tracing showed that the enzyme folded into large and small domains. The large domain has a (beta alpha)8 super-secondary structure, while the smaller one is formed from two long loops extending from the beta 3 and beta 4 strands of the (beta alpha)8 structure. The interface of the two domains together with shorter loops from the (beta alpha)8 structure form a deep cleft, in which alpha-cyclodextrin binds slightly away from the center. Two maltose molecules also bind in the cleft. One shares a binding site with alpha-cyclodextrin and the other is situated more deeply in the cleft.

ANXB_HUMAN.abs:
PMID - 7508441
Anti-56K autoantibodies are present in sera from patients with various autoimmune diseases, predominantly in sera from patients with rheumatoid arthritis, systemic lupus erythematosus, or Sjögren's syndrome. To clarify the molecular structure of this autoantigen, we isolated a 2.0-kilobase pair cDNA clone considered to encode the full-length 56K autoantigen. The longest open reading frame encodes a 505-amino acid polypeptide, with a predicted molecular mass of 54.4 kDa. The in vitro translated protein is recognized by all anti-56K positive patient sera tested. Antibodies affinity-purified using the bacterially expressed recombinant protein recognized the 56K autoantigen in a HeLa cell extract. cDNA sequencing revealed that the 56K cDNA shares a high degree of homology in both nucleotide (87%) and amino acid sequence (92.5%) with bovine annexin XI, indicating that the 56K cDNA encodes the human homologue of annexin XI, a member of the Ca(2+)-dependent phospholipid binding protein family. Anti-56K autoantibody exhibits both a cytoplasmic and a nuclear staining in immunofluorescence experiments. Patients' sera recognize preferentially the N-terminal region of the protein, which is specific for 56K/annexin XI and not shared by other annexins, indicating that the autoimmune response to 56K/annexin XI in these patients is specific for this annexin family member.
PMID - 12477932
The National Institutes of Health Mammalian Gene Collection (MGC) Program is a multiinstitutional effort to identify and sequence a cDNA clone containing a complete ORF for each human and mouse gene. ESTs were generated from libraries enriched for full-length cDNAs and analyzed to identify candidate full-ORF clones, which then were sequenced to high accuracy. The MGC has currently sequenced and verified the full ORF for a nonredundant set of >9,000 human and >6,000 mouse genes. Candidate full-ORF clones for an additional 7,800 human and 3,500 mouse genes also have been identified. All MGC sequences and clones are available without restriction through public databases and clone distribution networks (see http:mgc.nci.nih.gov).
PMID - 11013079
The genomic organization of the annexin A11 gene was determined in mouse and human to assess its congruity with other family members and to examine the species variation in alternative splicing patterns. Mouse annexin A11 genomic clones were characterized by restriction analysis, Southern blotting, and DNA sequencing, and the homologous human gene (HGMW-approved gene symbol ANXA11) was deciphered from high-throughput genomic sequence with coanalysis of expressed sequence tags. Exons 6-15 of the tetrad core repeat region differ from annexins A7 and A13 but are spliced identically to other phylogenetic descendents, making annexin A11 the putative primary progenitor of up to nine paralogous human annexins. The 5' regions consist of untranslated exon 1, followed by an extensive intron 1 comprising almost half the total gene length of >40 kb, and additional GC-rich exons 2-5 encoding the proline- and glycine-rich amino-terminus. Distinct cDNA isoforms in cow and human were determined to be unique to each species and hence of dubious general significance for this gene's function. Multiple transcription start sites were revealed by primer extension analysis of the mouse gene, and transfection constructs containing the prospective promoter generated transcriptional activity comparable to that of the SV40 promoter. Internal repetitive elements and vicinal gene markers were mapped for the complete human annexin A11 gene sequence to characterize the surrounding genomic environment.
PMID - 1535225
Annexins are a family of structurally related calcium-dependent phospholipid binding proteins. We recently described a new member of this family, bovine annexin XI [1]. Two kinds of cDNAs were identified corresponding to annexin XI mRNA variants A and B, which are generated by alternative splicing of identical primary transcripts. Annexin XI isoforms are predicted to differ at the amino-terminus, suggesting that they may have distinct biological roles.
PMID - 8938449
Mouse annexin XI (anx11)2 was cloned from a macrophage cDNA library and characterized by genetic linkage mapping, DNA sequencing, and structural comparison with other annexins. The Anx11 gene localized to mouse chromosome 14 in close linkage with the Rarb, Plau, and Wnt5a genes near the centromere and 1.8 cM distal from the Anx7 gene. The open reading frame was flanked by long, untranslated regions and encoded a 503-amino-acid protein with 93.1% identity to its human orthologue. Its 189-aa amino terminus corresponded to the widely expressed variant 1 of two possible, alternatively spliced forms. A previously described peptide from Aplysia brasiliana was identified as a closely related invertebrate homologue. Since annexin XI is known to be localized in the nucleus at certain stages of development, the identification of a region in tetrad repeats 3 and 4 resembling the "chromo box" domain may be relevant to a nuclear regulatory function of annexin XI. Knowledge of the mouse cDNA sequence and genetic map location will assist in the analysis of genomic organization and expression and provide a useful animal model to investigate gene function and hereditary phenotype for annexin XI.
PMID - 1380798
CAP-50 is a member of annexin family proteins which binds specifically to calcyclin in a Ca2+ dependent manner (Tokumitsu. H., Mizutani. A., Minami. H., Kobayashi. R., and Hidaka. H. (1992) J. Biol. Chem. 267,8919-8924). The cDNA representing the rabbit form of this protein has been cloned from rabbit lung cDNA library. Sequence analysis of two overlapping clones revealed a 81-nucleotides 5'-nontranslated region, 1512-nucleotides of open reading frame, a 672-nucleotides 3'-nontranslated region, and a poly(A) tail. Authenticity of the clones was confirmed by comparison of portions of the deduced amino acid sequence with eight sequences of proteolytic peptides obtained from rabbit lung protein. CAP-50 cDNA encodes a 503 residue protein with a calculated M(r) of 54,043 and shows that the protein is composed of four imperfect repeats and hydrophobic N-terminal region. C-terminal region including four imperfect repeats shows 58.1% identity with human synexin (annexin VII), 48.0% identity with annexin I, 47.4% identity with annexin II, 60.1% identity with annexin IV, 54.5% identity with annexin V. Hydrophobic N-terminal region composed of 202 amino acid residues is not homologous with other annexin proteins suggesting that CAP-50 is a novel member of annexin family proteins.
PMID - 1372001
Annexins (or lipocortins) are a family of at least 10 structurally related calcium- and phospholipid-binding proteins. Each protein consists of a conserved core domain having four (or eight) repeats of a segment approximately 70 amino acids in length and a nonconserved, usually short, amino-terminal domain. To date, amino acid sequences for eight distinct mammalian annexins have been predicted from cDNAs. This report describes an additional member of this family, bovine annexin XI, identified by cDNA cloning and sequence analysis. The 503-amino acid deduced protein consists of a core domain of four annexin repeats and a long amino-terminal domain rich in glycine, proline, and tyrosine. This novel annexin gene is expressed in a wide variety of tissues and isolated cells in culture.

BRS3_HUMAN.abs:
PMID - 8131855
The gene encoding the human homologue of the guinea pig uterine bombesin receptor [(1992) Eur. J. Biochem. 208, 405] was isolated from a genomic lambda library by the PCR/homology screening approach. The gene spans more than 4 kb and consists of 3 exons and 2 introns. The deduced amino acid sequence shows about 86% identity to that of guinea pig bombesin receptor. This subtype of bombesin receptor is expressed in the pregnant uterus and in two human tumour cell lines, T47D (ductal breast carcinoma) and A431 (epidermal carcinoma). PCR analysis of genomic DNA from human-mouse cell hybrids allows the cloned gene to be localized to the region q26-q28 on chromosome X.
PMID - 9573346
Bombesin (BN)-like peptides/neurotransmitters mediate a broad range of physiological funtions in the gastrointestinal tract and the central nervous system through binding to their specific, high-affinity mammalian bombesin receptors. This family of heptahelical, G-protein coupled receptors includes the gastrin-releasing peptide receptor (GRP-R, or bb2), neuromedin B receptor (NMB-R, or bb1), and the bombesin receptor subtype 3 (BRS-3, or bb3). The tissue distribution of BRS-3 is quite dissimilar compared to the other two BN receptors, GRP-R and NMB-R, and a natural ligand for BRS-3 is currently unknown. Nothing is known about mechanisms regulating BRS-3 gene expression and possible association with disease. To gain insight into the underlying structure and chromosomal localization of the BRS-3 genes, bacteriophage P1 genomic clones, harboring the genes for the human and mouse BRS-3, respectively, were isolated and their structure and chromosomal localizations determined. The protein-coding region of both genes is divided into three exons and spans approximately 5kb. The loci of the BRS-3 genes were mapped to a syntenic region of the human (Xq25) and mouse (XA7.1-7.2) X-chromosome, respectively. The structural data of the BRS-3 genes derived from this study will permit future investigations of the mechanisms regulating their expression.
PMID - 8383682
The bombesin (BN)-like peptides mediate a diverse spectrum of biological activities and have been implicated as autocrine growth factors in the pathogenesis and progression of some human small cell lung carcinoma tumors. Previously, two mammalian BN-like peptide receptor subtypes, gastrin-releasing peptide receptor and neuromedin-B receptor, have been cloned and characterized. In this study, we have isolated and characterized human genomic and complementary DNA (cDNA) clones encoding a new BN-like peptide receptor subtype, BN receptor subtype 3 (BRS-3). Expression of BRS-3 cDNA in Xenopus oocytes encodes a functional receptor that is specifically activated by BN-like peptides. Chromosome mapping studies indicate that the BRS-3 gene is located on human chromosome X. BRS-3 mRNA expression in rat tissues is limited to secondary spermatocytes in testis. In contrast, BRS-3 mRNA is widely expressed in a panel of human cell lines from all histological types of lung carcinoma. These results suggest a role for BN-like peptides and their receptors in mammalian reproductive physiology and also indicate that BRS-3 could serve as a potential therapeutic target for human lung carcinoma.
PMID - 1325907
The homology screening approach has been used to clone a new member of the guanine-nucleotide-binding-protein-coupled receptor superfamily from guinea pig uterus. The cloned cDNA encodes a 399-amino-acid protein and shows the highest amino acid similarity to members of the bombesin receptor family; 52% and 47% similarity to the gastrin-releasing-peptide (GRP) receptor and the neuromedin-B receptor, respectively. Binding experiments with the stably transfected LLC-PK1 cell line expressing the new receptor protein confirmed the bombesin-like nature of the cloned receptor. The relative order of ligand affinity, GRP = neuromedin C much greater than neuromedin B, suggests that the cloned cDNA represents the GRP subtype rather than the neuromedin-B subtype of bombesin receptors. Northern-blot analysis of mRNA species from several guinea-pig tissues showed that the mRNA for the new bombesin receptor subtype is expressed mainly in uteri of pregnant animals.
PMID - 10425452
The bombesin receptor subtype 3 (BRS-3) is considered an orphan receptor as it has a low affinity for bombesin-like peptides and no identified natural ligand. We have reported a novel form of gastrin-releasing peptide (GRP) present in high abundance in the pregnant uterus of women and sheep. As BRS-3 was originally cloned from guinea pig uterus, we postulated that the uterine GRP-like peptide may be its natural ligand. We have therefore cloned the gene for the sheep homologue of BRS-3 and determined its distribution. The sheep BRS-3 gene spans 4 kbp and comprises three exons with intron-exon borders at positions similar to those observed for the human and mouse BRS-3 genes. The predicted amino acid sequence of ovine BRS-3 has approximately 85% identity with the human, mouse and guinea pig receptors. Highly conserved amino acids important in mediating receptor G-protein coupling to second messengers and important in ligand binding were found to be conserved in ovine BRS-3. One potentially important deviation was noted: ovine BRS-3 possesses an arginine residue at position 294 instead of a histidine residue as found in all other BRS-3. His(294) was previously identified as important in ligand-receptor interactions while Arg(294) was implicated in high ligand affinity. Thus ovine BRS-3 may have binding characteristics different from those of the human, mouse and guinea pig BRS-3 receptors. In the ewe, BRS-3 mRNA expression was detected in pituitary and hypothalamus but not in tissues of the pregnant uterus (endometrium, myometrium, chorioallantois or amnion). Nor was BRS-3 expression detected in the non-pregnant uterus or in testis. This pattern of BRS-3 expression is similar to that observed in the mouse but different from that observed in the human, rat and guinea pig. We conclude that there is no local interaction between uterine GRP-like peptide and BRS-3. However, the high expression of BRS-3 in the pituitary coupled with elevated circulating levels of this GRP-like peptide during pregnancy suggests an alternate pathway. Cloning of the ovine BRS-3 gene will permit a detailed functional analysis of this receptor in the sheep and its role in the mediation of action of uterine GRP.
PMID - 9262170
We cloned the genes for the mouse homologue of the neuromedin B receptor (NMB-R) and the bombesin receptor subtype 3 (BRS-3). Both receptor genes consist of three exons with well-conserved intron-exon borders. Although the NMB-R gene spans more than 10 kb, the BRS-3 gene spans only about 4 kb. Comparison of the mouse and human receptor sequences indicates 90% (NMB-R) and 85% (BRS-3) sequence homology at the amino-acid level. In the adult mouse, the NMB-R mRNA is expressed in the brain, testis, esophagus, intestine and uterus, whereas the BRS-3 mRNA is expressed predominantly in the brain. In the brain, the NMB-R gene expression is prominent in the thalamic and olfactory regions, and the BRS-3 gene is expressed particularly in the hypothalamic region. In mouse testis, the NMB-R gene expression is prominent, and the expression of BRS-3 mRNA is barely detected. In contrast, BRS-3 has been shown to be expressed in rat testis and guinea-pig uterus, therefore it is possible that a different subtype of the bombesin receptor mediates the same response in different species. Together with the mouse GRP-R gene cloned previously, cloning of the mouse NMB-R and BRS-3 genes permits comparison of function and structure of the three bombesin receptor subtypes in the mouse.

CB2R_HUMAN.abs:
PMID - 7689702
The major active ingredient of marijuana, delta 9-tetrahydrocannabinol (delta 9-THC), has been used as a psychoactive agent for thousands of years. Marijuana, and delta 9-THC, also exert a wide range of other effects including analgesia, anti-inflammation, immunosuppression, anticonvulsion, alleviation of intraocular pressure in glaucoma, and attenuation of vomiting. The clinical application of cannabinoids has, however, been limited by their psychoactive effects, and this has led to interest in the biochemical bases of their action. Progress stemmed initially from the synthesis of potent derivatives of delta 9-THC, and more recently from the cloning of a gene encoding a G-protein-coupled receptor for cannabinoids. This receptor is expressed in the brain but not in the periphery, except for a low level in testes. It has been proposed that the nonpsychoactive effects of cannabinoids are either mediated centrally or through direct interaction with other, non-receptor proteins. Here we report the cloning of a receptor for cannabinoids that is not expressed in the brain but rather in macrophages in the marginal zone of spleen.
PMID - 10688601
Many of the pharmacological effects of Delta(9)-tetrahydrocannabinol are mediated through CB(1) and CB(2) cannabinoid receptors. However, with the discovery of endogenous cannabinoids, some discrepancies have arisen. Furthermore, unlike the CB(1) receptor, the sequences of the mouse and human CB(2) receptor are divergent, raising the possibility of species specificity. The gene for the rat CB(2) receptor was cloned, expressed, and its properties compared with those of mouse and human CB(2) receptors. Sequence analysis of the coding region of the rat CB(2) genomic clone indicates 90% nucleic acid identity (93% amino acid identity) between rat and mouse and 81% nucleic acid identity (81% amino acid identity) between rat and human. The rat CB(2) receptor was stably expressed in human embryonic kidney-293 cells to examine its pharmacology. The rat CB(2) showed low affinity for anandamide, an endogenous ligand shown to act at the CB(1) receptor. In contrast, high-affinity binding for SR144528 (CB(2)-selective antagonist) as well as several cannabinoid receptor agonists was observed. Coupling to adenylate cyclase was observed. Aspects of the pharmacology of palmitoylethanolamide were also examined. It bound to CB(1) and CB(2) receptors with low affinity and stimulated GTPgammaS binding in the cerebellum and CB(2)-expressing cell lines with low potency. The data in this study suggest that the discrepancies in affinities between rat and human may represent species differences. The rat CB(2) receptor genomic clone will be a useful tool for studying the function and regulation of CB(2) in rats.
PMID - 8679694
We have cloned the peripheral cannabinoid receptor, mCB2, from a mouse splenocyte cDNA library. The 3.7 kb sequence contains an open reading frame encoding a protein of 347 residues sharing 82% overall identity with the only other known peripheral receptor, human CB2 (hCB2) and shorter than hCB2 by 13 amino acids at the carboxyl terminus. Binding experiments with membranes from COS-3 cells transiently expressing mCB2 showed that the synthetic cannabinoid WIN 55212-2 had a 6-fold lower affinity for mCB2 than for hCB2, whereas both receptors showed similar affinities for the agonists CP 55,940, delta(9)-THC and anandamide and almost no affinity for the central receptor- (CB1) specific antagonist SR 141716A. Both hCB2 and mCB2 mediate agonist-stimulated inhibition of forskolin-induced cAMP production in CHO cell lines permanently expressing the receptors. SR 141716A failed to antagonize this activity in either cell line, confirming its specificity for CB1.
PMID - 12477932
The National Institutes of Health Mammalian Gene Collection (MGC) Program is a multiinstitutional effort to identify and sequence a cDNA clone containing a complete ORF for each human and mouse gene. ESTs were generated from libraries enriched for full-length cDNAs and analyzed to identify candidate full-ORF clones, which then were sequenced to high accuracy. The MGC has currently sequenced and verified the full ORF for a nonredundant set of >9,000 human and >6,000 mouse genes. Candidate full-ORF clones for an additional 7,800 human and 3,500 mouse genes also have been identified. All MGC sequences and clones are available without restriction through public databases and clone distribution networks (see http:mgc.nci.nih.gov).
PMID - 9261404
A new common region of virus integration, Evi11, has been identified in two retrovirally induced murine myeloid leukemia cell lines, NFS107 and NFS78. By interspecific backcross analysis, it was shown that Evi11 is located at the distal end of mouse chromosome 4, in a region that shows homology with human 1p36. The genes encoding the peripheral cannabinoid receptor (Cnr2) and alpha-L-fucosidase (Fuca1) were identified near the integration site by using a novel exon trapping system. Cnr2 is suggested to be the target gene for viral interference in Evi11, since proviruses are integrated in the first intron of Cnr2 and retroviral integrations alter mRNA expression of Cnr2 in NFS107 and NFS78. In addition, proviral integrations were demonstrated within the 3' untranslated region of Cnr2 in five independent newly derived CasBrM-MuLV (mouse murine leukemia virus) tumors, CSL13, CSL14, CSL16, CSL27, and CSL97. The Cnr2 gene encodes a seven-transmembrane G-protein-coupled receptor which is normally expressed in hematopoietic tissues. Our data suggest that the peripheral cannabinoid receptor gene might be involved in leukemogenesis as a result of aberrant expression of Cnr2 due to retroviral integration in Evi11.
PMID - 12084572
The rat peripheral cannabinoid receptor (rCB2) was cloned from a Sprague-Dawley rat spleen cDNA library and when translated, encodes a protein of 410 amino acids. Alignment of rCB2 with mouse (mCB2) and human (hCB2) peripheral cannabinoid receptors reveals a high degree of homology except in the carboxy terminus where rCB2 is 50 and 63 residues longer than hCB2 and mCB2, respectively. PCR screening and sequencing of rat genomic DNA showed that rCB2 is encoded by three exons interrupted by two introns, one of which is polymorphic and contains a 209 base pair B2 (SINE) element. By Northern hybridization and ribonuclease protection assay (RPA), rCB2 mRNA was detected in rat spleen, testis, thymus and lung but not in rat brain, heart, kidney or liver. Like hCB2 and mCB2 receptors, rCB2 activates mitogen-activated protein kinase when it is stably expressed in Chinese Hamster Ovary (CHO) cells. The importance of the carboxy terminus in regulating CB2 receptor desensitization and internalization is well-established. Thus, the profound differences identified in this region of the CB2 receptor between species mandates caution when extrapolating experimental results from non-human models to the effects of chronic CB2 receptor stimulation in humans.

DVL3_MOUSE.abs:
PMID - 8922524
The Drosophila dishevelled (dsh) segment polarity gene is required to establish cell fates specified by wingless/Wnt signal transduction during development. We have previously reported the cloning and characterization of a mouse homolog of dishevelled, Dvl1. Utilizing RT-PCR with degenerate primers, we isolated another member of the mouse Dishevelled (Dvl) gene family, Dvl3. The Dvl3 gene maps to mouse chromosome 16. The predicted amino acid sequence shares 64 and 62% identity to Dvl1 and Dvl2, respectively. The region of highest conservation between all three Dvl coding regions, at 97% identity, is noted at the PDZ domain (also termed the DHR domain or GLGF motif), a motif of 60 amino acids present in all dishevelled encoded proteins and first described in the Drosophila discs large (dlg) tumor suppressor gene. In adult mice, Dvl3 expression is widespread with highest levels exhibited in brain, ovary, and heart. In embryos, Dvl3 is expressed in every tissue between 7.5 and 9.5 days postcoitum, and by 10.5 days postcoitum highest expression was seen in the dorsal root ganglia, somites, limb buds, branchial arches, heart, gut and throughout the developing central nervous system.
PMID - 8817329
The Drosophila dishevelled gene (dsh) encodes a secreted glycoprotein, which regulates cell proliferation, acting as a transducer molecule for developmental processes, including segmentation and neuroblast specification. We have isolated and characterized cDNA clones from two different human dsh-homologous genes, designated as DVL-1 and DVL-3. DVL-1 and DVL-3 putative protein products show 64% amino acid identity. The DVL-1 product is 50% identical to dsh and 92% to a murine dsh homologue (Dvl-1). Both human DVL genes are widely expressed in fetal and adult tissues, including brain, lung, kidney, skeletal muscle and heart. DVL-1 locus maps to chromosome 1p36 and DVL-3 to chromosome 3q27. DVL-1 locus on chromosome 1 corresponds to the murine syntenic region where Dvl-1 is located. DVL-1 and DVL-3 are members of a human dsh-like gene family, which is probably involved in human development. Although the precise role of these genes in embryogenesis is only conjectural at present, the structural and evolutionary characteristics suggest that mutations at their loci may be involved in neural and heart developmental defects.
PMID - 11354832
A quantitative trait locus on chromosome 5 in the rat is linked to sensitivity to brain ischemia in the stroke-prone spontaneously hypertensive rat (SHRSP). The genes encoding atrial natriuretic peptide (ANP) and brain natriuretic peptide (BNP) that map to this location have been excluded as candidate genes. We examined dishevelled-1 (DVL-1) as a further candidate gene. DVL-1 had not yet been identified in the rat, but Anp, Bnp, and DVL-1 map to the homologous regions of the rat chromosome 5 quantitative trait locus in both mice and man. Furthermore, DVL-1 is involved in the Notch signalling system, which plays a role in the disorder cerebral autosomal dominant arteriopathy with subcortical infarcts and leukoencephalopathy, the symptoms of, which include ischaemic stroke. We show with radiation hybrid mapping that rat DVL-1 indeed maps to chromosome 5, where it is positioned immediately next to microsatellite marker D5Rat49. We sequenced the complete coding sequence and a large part of the intronic genomic sequence for the SHRSP strain and its reference Wistar-Kyoto strain. The DVL-1 sequence in the two strains was identical. Our results essentially exclude the DVL-1 gene as the cause for sensitivity to cerebral ischaemic insult in this rat model of stroke.
PMID - 8644734
DiGeorge syndrome (DGS) is a developmental defect of some of the neural crest derivatives. Most DGS patients show haploinsufficiency due to interstitial deletions of the proximal long arm of chromosome 22. Deletions of 22q11 have also been reported with patients with the velocardio-facial syndrome and familial conotruncal heart defects. It has been suggested that the wide phenotype spectrum associated with 22q11 monosomy is a consequence of contiguous-gene deletions. We report the isolation of human cDNAs homologous to the Drosophila dishevelled (dsh) segment-polarity gene. Sequences homologous to the 3' UTR of these transcripts (DVL-22) were positioned within the DGS critical region and were found to be deleted in DGS patients. Human DVL mRNAs are expressed in several fetal and adult tissues, including the thymus and, at high levels, the heart. Two transcripts, 3.2 and 5kb, were detected, in northern blot analysis, with different expression patterns in the surveyed tissues when different cDNAs were used. The isolated cDNAs exhibit high amino acid homology with the mouse and Xenopus Dvl-1 gene, the only other vertebrate dsh homologues so far isolated. The pivotal role of dsh in fly development suggests an analogous key function in vertebrate embryogenesis of its homologue genes. Since DGS may be due to perturbation of differentiation mechanisms at decisive embryological stages, a Dsh-like gene in the small-region overlap (SRO) might be a candidate for the pathogenesis of this disorder.
PMID - 9132266
We have characterized the genomic organization of a mouse homolog (Dvl-1) of Drosophila dishevelled, a segment polarity gene required for wingless signal transduction. The Dvl-1 gene is organized into 15 exons ranging in size from 68 to 1315 bp spanning a region of 12,409 bp, with the largest and smallest intron being 5545 and 71 bp, respectively. Sequence analysis of the 5'-flanking region of the gene revealed a high GC content, six CCGCCC Sp-1-binding motifs, CREB, LBP-1 (leader-binding protein 1), and TGGCA-binding consensus sites. However, neither TATA or CAAT boxes are present, a characteristic shared by other GC-rich promoters. The 5'-flanking region has strong promoter activity when placed upstream of the luciferase gene. Promoter-luciferase constructs have demonstrated that the promoter is functional in transfection assays and that its activity is orientation dependent. Promoter deletions were used to define the 5' and 3' boundaries for promoter activity and revealed the presence of both positive and negative regulatory elements. Multiple transcription initiation sites were mapped by primer extension analysis and confirmed by reporter gene assay.
PMID - 7744250
Wingless (Wg) is an important signaling molecule in the development of Drosophila, but little is known about its signal transduction pathway. Genetic evidence indicates that another segment polarity gene, dishevelled (dsh) is required for Wg signaling. We have recently developed a cell culture system for Wg protein activity, and using this in vitro system as well as intact Drosophila embryos, we have analyzed biochemical changes in the Dsh protein as a consequence of Wg signaling. We find that Dsh is a phosphoprotein, normally present in the cytoplasm. Wg signaling generates a hyperphosphorylated form of Dsh, which is associated with a membrane fraction. Overexpressed Dsh becomes hyperphosphorylated in the absence of extracellular Wg and increases levels of the Armadillo protein, thereby mimicking the Wg signal. A deletional analysis of Dsh identifies several conserved domains essential for activity, among which is a so-called GLGF/DHR motif. We conclude that dsh, a highly conserved gene, is not merely a permissive factor in Wg signaling but encodes a novel signal transduction molecule, which may function between the Wg receptor and more downstream signaling molecules.
PMID - 8887313
The segment polarity gene dishevelled (dsh) of Drosophila is required for pattern formation of the embryonic segments and the adult imaginal discs. dsh encodes the earliest-acting and most specific known component of the signal transduction pathway of Wingless, an extracellular signal homologous to Wnt1 in mice. We have previously described the isolation and characterization of the Dvl1 mouse dsh homolog. We report here the isolation of a second mouse dsh homolog, Dvl2, which maps to chromosome 11. The Dvl2 amino acid sequence is equally related to the dsh sequence as is that of Dvl1, but Dvl2 is most similar to the Xenopus homolog Xdsh. However, unlike the other vertebrate dsh homologs. Like the other genes, Dvl2 is ubiquitously expressed throughout most of embryogenesis and is expressed in many adult organs. We have developed an assay for dsh function in fly embryos, and show that Dvl2 can partially rescue the segmentation defects of embryos devoid of dsh. Thus, Dvl2 encodes a mammalian homolog of dsh which can transduce the Wingless signal.
PMID - 8149913
The dishevelled gene of Drosophila is required to establish coherent arrays of polarized cells and is also required to establish segments in the embryo. Here, we show that loss of dishevelled function in clones, in double heterozygotes with wingless mutants and in flies bearing a weak dishevelled transgene leads to patterning defects which phenocopy defects observed in wingless mutants alone. Further, polarized cells in all body segments require dishevelled function to establish planar cell polarity, and some wingless alleles and dishevelled; wingless double heterozygotes exhibit bristle polarity defects identical to those seen in dishevelled alone. The requirement for dishevelled in establishing polarity in cell autonomous. The dishevelled gene encodes a novel intracellular protein that shares an amino acid motif with several other proteins that are found associated with cell junctions. Clonal analysis of dishevelled in leg discs provides a unique opportunity to test the hypothesis that the wingless dishevelled interaction species at least one of the circumferential positional values predicted by the polar coordinate model. We propose that dishevelled encodes an intracellular protein required to respond to a wingless signal and that this interaction is essential for establishing both cell polarity and cell identity.
PMID - 12477932
The National Institutes of Health Mammalian Gene Collection (MGC) Program is a multiinstitutional effort to identify and sequence a cDNA clone containing a complete ORF for each human and mouse gene. ESTs were generated from libraries enriched for full-length cDNAs and analyzed to identify candidate full-ORF clones, which then were sequenced to high accuracy. The MGC has currently sequenced and verified the full ORF for a nonredundant set of >9,000 human and >6,000 mouse genes. Candidate full-ORF clones for an additional 7,800 human and 3,500 mouse genes also have been identified. All MGC sequences and clones are available without restriction through public databases and clone distribution networks (see http:mgc.nci.nih.gov).
PMID - 9192851
Three human genes encoding proteins homologous to Drosophila Dishevelled protein were cloned and characterized. Amino acid similarity between the different Dishevelled proteins is concentrated in three highly conserved regions. Two of these regions do not exhibit significant sequence similarity with other known proteins; the third is similar to the discs-large homology region, which was first found in a Drosophila Discs-large tumor suppressor protein (also known as GLGF or PDZ domain). We produced antibodies against human Dishevelled-2 and demonstrated that it is a phosphoprotein and can be detected in all cell lines and human embryonic tissues examined. Indirect immunofluorescence indicates that it is found throughout the cytoplasm. Our results indicate that the human dishevelled genes constitute a multigene family and that Dishevelled proteins are highly conserved among metazoans.
PMID - 9298901
Mice completely deficient for Dvl1, one of three mouse homologs of the Drosophila segment polarity gene Dishevelled, were created by gene targeting. Dvl1-deficient mice are viable, fertile, and structurally normal. Surprisingly, these mice exhibited reduced social interaction, including differences in whisker trimming, deficits in nest-building, less huddling contact during home cage sleeping, and subordinate responses in a social dominance test. Sensorimotor gating was abnormal, as measured by deficits in prepulse inhibition of acoustic and tactile startle. Thus, Dvl1 mutants may provide a model for aspects of several human psychiatric disorders. These results are consistent with an interpretation that common genetic mechanisms underlie abnormal social behavior and sensorimotor gating deficits and implicate Dvl1 in processes underlying complex behaviors.
PMID - 8288125
The Drosophila Wnt-1 homolog, wingless (wg), is involved in the signaling of patterning information in several contexts. In the embryonic epidermis, Wg protein is secreted and taken up by neighboring cells, in which it is required for maintenance of engrailed transcription and accumulation of Armadillo protein. The dishevelled (dsh) gene mediates these signaling events as well as wg-dependent induction across tissue layers in the embryonic midgut. dsh is also required for the development processes in which wg functions in adult development. Overall, cells lacking dsh are unable to adopt fates specified by Wg. dsh functions cell autonomously, indicating that it is involved in the response of target cells to the Wg signal. dsh is expressed uniformly in the embryo and encodes a novel protein with no known catalytic motifs, although it shares a domain of homology with several junction-associated proteins. Our results demonstrate that dsh encodes a specific component of Wg signaling and illustrate that Wnt proteins may utilize a novel mechanism of extracellular signal transduction.
PMID - 9039502
In this series of projects of sequencing human cDNA clones which correspond to relatively long and nearly full-length transcripts, we newly determined the sequences of 80 clones, and predicted the coding sequences of the corresponding genes, named KIAA0201 to KIAA0280. Among the sequenced clones, 68 were obtained from human immature myeloid cell line KG-1 and 12 from human brain. The average size of the clones was 5.3 kb, and that of distinct ORFs in clones was 2.8 kb, corresponding to a protein of approximately 100 kDa. Computer search against the public databases indicated that the sequences of 22 genes were unrelated to any reported genes, while the remaining 58 genes carried sequences which show some similarities to known genes. Protein motifs that matched those in the PROSITE motif database were found in 25 genes and significant transmembrane domains were identified in 30 genes. Among the known genes to which significant similarity was shown, the genes that play key roles in regulation of developmental stages, apoptosis and cell-to-cell interaction were included. Taking into account of both the search data on sequence similarity and protein motifs, at least seven genes were considered to be related to transcriptional regulation and six genes to signal transduction. When the expression profiles of the cDNA clones were examined with different human tissues, about half of the clones from brain (5 of 11) showed significant tissue-specificity, while approximately 80% of the genes from KG-1 were expressed ubiquitously.
PMID - 9344861
dishevelled (Dsh) is a member of the segment polarity gene family in Drosophila which plays an important role in the early developmental patterning processes. A human homologue of Dsh (DVL-1) has recently been described. Here, we report the cloning of a second human homologue of Dsh (called DVL-3) by cDNA library screening. The human DVL-3 gene encodes a predicted 716 amino acid protein which exhibits 98% amino acid identity with mouse Dvl-3 and 49% with Drosophila Dsh. DVL-3 was mapped to 3q27. The expression of DVL-3 mRNA was detected in 30 human cell lines and 2 primary cell cultures. DVL-3 mRNA was detected in normal human breast tissues (n = 4) and tumours (n = 25). Statistically, there was no difference in DVL-3 mRNA level between normal breast tissues and tumours. In human colorectal samples, DVL-3 was expressed equally in matched normal tissues, polyps and tumours. The data indicates that DVL-3 is widely expressed in human cells and supports the notion of a new developmental gene family for dishevelled which may have a widespread role in signal transduction.
PMID - 7958461
In the Drosophila embryo dishevelled (dsh) function is required by target cells in order to respond to wingless (wg, the homolog of Wnt-1), demonstrating a role for dsh in Wnt signal transduction. We have isolated a mouse homolog of the Drosophila dsh segment polarity gene. The 695-amino-acid protein encoded by the mouse dishevelled gene (Dvl-1) shares 50% identity (65% similarity) with dsh. Similarity searches of protein and DNA data bases revealed that Dvl-1 encodes an otherwise novel polypeptide. While no functional motifs were identified, one region of Dvl-1 was found to be similar to a domain of discs large-1 (dlg), a Drosophila tumor suppressor gene. In the embryo, Dvl-1 is expressed in most tissues, with uniformly high levels in the central nervous system. From 7.5 days postcoitum Dvl-1 is expressed throughout the developing brain and spinal cord, including those regions expressing Wnt-1 and En. Expression of Dvl-1 in adult mice was found to be widespread, with brain and testis exhibiting the highest levels. The majority of Dvl-1 expression in the adult cerebellum is in the granular cell layer, similar to the pattern seen for engrailed-2 (En-2). Throughout postnatal development of the brain Dvl-1 is highly expressed in areas of high neuronal cell density.
PMID - 7600981
Signaling factors of the Wnt proto-oncogene family are implicated in dorsal axis formation during vertebrate development, but the molecular mechanism of this process is not known. Studies in Drosophila have indicated that the dishevelled gene product is required for wingless (Wnt1 homolog) signal transduction. We demonstrate that injection of mRNA encoding a Xenopus homolog of dishevelled (Xdsh) into prospective ventral mesodermal cells triggers a complete dorsal axis formation in Xenopus embryos. Lineage tracing experiments show that cells derived from the injected blastomere contribute to anterior and dorsal structures of the induced axis. In contrast to its effect on mesoderm, overexpression of Xdsh mRNA in prospective ectodermal cells triggers anterior neural tissue differentiation. These studies suggest that Wnt signal transduction pathway is conserved between Drosophila and vertebrates and point to a role for maternal Xdsh product in dorsal axis formation and in neural induction.

ELIA_PHYCP.abs:
PMID - 7763784
Elicitins are toxic proteins secreted by Phytophthora spp. responsible for the incompatible reaction and systemic leaf necroses on tobacco. We found that Phytophthora megasperma megasperma produces two elicitin isoforms, each belonging to a different physico-chemical class, although the beta elicitin was less toxic than other beta ones. These 98 residue proteins were purified, sequenced and compared with other known elicitins. In addition to the point mutation already known to correlate with the differences in necrotic activities between alpha and beta isoforms, we found another region of the molecule likely to be involved in the regulation of the toxicity.
PMID - 2776750
The phytopathogenic fungi Phytophthora cryptogea and Phytophthora capsici cause systemic leaf necrosis on their non-host tobacco; in culture they release proteins, called cryptogein and capsicein, which elicit similar necrosis. In addition, both proteins protect tobacco against invasion by the pathogen Phytophthora nicotianac, the agent of the tobacco black shank, that is unable to produce such an elicitor. Cryptogein causes visible leaf necrosis starting at about 1 microgram/plant, whereas 50-fold as much capsicein is required for the same reaction. Capsicein induces protection even in near absence of leaf necrosis. The activities of both elicitors are eliminated upon pronase digestion. They are proteins of similar Mr (respectively 10,323 and 10,155) and their complete amino acid sequences were determined. They consist of 98 residues, with some internal repetitions of hexapeptides and heptapeptides. 85% identity was observed between both sequences: only two short terminal regions are heterologous, while the central core is entirely conserved. Secondary structure predictions, hydropathy and flexibility profiles differ only around position 15 and at the C-terminus; these modifications could play a role in the modulation of their biological activities. After a search of the sequence data bases, they appear to be novel proteins.
PMID - 9385630
Cryptogein belongs to a new family of 10-kDa proteins called elicitins. Elicitins are necrotic and signaling proteins secreted by Phytophthora spp. responsible for the incompatible reaction and systemic hypersensitive-like necroses of diverse plant species leading to resistance against fungal or bacterial plant pathogens. The solution structure of beta cryptogein from Phytophthora cryptogea fungus was determined by using multidimensional heteronuclear nuclear magnetic resonance spectroscopy. A set of 18 structures was calculated using 1360 NOE-derived distance restraints and 40 dihedral angle restraints obtained from 3JHNH alpha couplings. The RMS deviation from the mean structure is 0.87 +/- 0.14 A for backbone atoms and 1.34 +/- 0.14 A for all the non-hydrogen atoms of residues 2 to 98. The structure of beta cryptogein reveals a novel protein fold, with five helices and a double-stranded beta-sheet facing an omega-loop. One edge of the beta-sheet and the adjacent face of the omega-loop form a hydrophobic cavity. This cavity made of highly conserved residues represents a plausible binding site. Residue 13, which has been identified from directed mutagenesis and natural sequence comparison studies as a key amino acid involved in the differential control of necrosis, is surface exposed and could contribute to the binding to a ligand or a receptor. The solution structure is close to the X-ray structure, with slight differences lightly due to the crystal packing.
PMID - 1368359
Most of the phytopathogenic fungi Phytophthora secrete holoproteins (elicitins) responsible for the incompatible reaction and systemic leaf necroses on tobacco. We found that Phytophthora drechsleri produces several elicitin isoforms of various toxicity on tobacco. The CD spectra showed that their secondary structure was largely conserved, exhibiting ca 50% alpha-helix and little or no beta-structure. These 98 residue proteins were sequenced and compared with other known elicitins. Only one point mutation correlated with the differences in necrotic activities. This residue could be either an active or a regulatory site, involved in the interaction with a receptor responsible for necrosis induction.
PMID - 8664508
Elicitins, proteinaceous elicitors secreted by Phytophthora spp., act as inducers of a hypersensitive-like response in tobacco during incompatible interactions. We have isolated and cloned sequences encoding cryptogein and related isoforms from P. cryptogea that belong to the elicitin family. The isolation of a genomic clone led to the characterization of four clustered genes. Two of these genes encode distinct elicitins, and two genes would encode, if expressed, a class of highly acidic elicitins which had not been observed so far. Northern blots indicate that elicitin genes are expressed in the fungus grown in vitro, though at different levels. Southern hybridization revealed that elicitins are encoded by a multigene family in several other species of Phytophthora. Moreover, isolates of Phytophthora parasitica var. nicotianae, pathogenic to tobacco, which do not produce elicitins, possess several elicitin-encoding genes. Involvement of elicitins in plant-pathogen interactions is discussed.
PMID - 8125100
Difficulties encountered in the interpretation of two-dimensional NMR spectra of proteins exceeding roughly 100 amino acids, including resonance overlap and line broadening due to longer correlation times and/or aggregation phenomena, can be overcome by using three-dimensional 1H-NMR experiments. The improvement of spectral resolution using these experiments allows the size of molecules amenable to structure determination by NMR spectroscopy to be extended. A three-dimensional non-selective homonuclear Hartmann-Hahn/nuclear Overhauser effect spectroscopy experiment was performed on capsicein, a 10161-Da elicitin secreted by the Phytophthora capsici fungus. Sequential assignment and secondary structure determination is illustrated for beta-sheet, alpha-helix and loop structures by analysis of planar cross sections perpendicular to the omega 2 or omega 3 axis at the amide proton resonance frequencies. Cysteine pairing was established in the course of the investigation. The secondary structure topology of the molecule is composed of five helices and an antiparallel beta-sheet. Four of the helices compose two pairs running antiparallel while the last one is parallel to the beta-sheet.
PMID - 8274771
Extracellular elicitor proteins (elicitins) from Phytophthora species induce local and distal defense responses specifically in plants of the Solanaceae and Cruciferae. Based on elicitin amino acid sequences, elicitin-coding sequences from P. parasitica were amplified by the polymerase chain reaction. A genomic clone containing a complete elicitin gene, parA1, was isolated and sequenced. Elicitin was confirmed to be encoded as a precursor protein containing a 20-amino acid signal peptide that is processed before secretion. Bacterial expression of the cloned elicitin gene as a translational fusion protein containing glutathione S-transferase yielded a biologically active protein capable of inducing a hypersensitive response in tobacco, suggesting that fungus-specific postranslational modifications of elicitin are not required for its activity. Southern blot analysis indicated that elicitin genes occur as a multigene family (at least two to 10 copies) in P. parasitica, P. capsici, P. citricola, P. citrophthora, P. cryptogea, P. drechsleri, P. megasperma, and P. palmivora. Some isolates of P. parasitica that did not produce elicitins still contained elicitin-coding sequences but did not accumulate elicitin mRNA.
PMID - 8031752
The backbone 1H and 15N resonance assignments and solution secondary structure determination of capsicein, a protein of 98 residues with a molecular mass of 10161 Da, are presented. Capsicein belongs to the elicitin family, elicitor molecules having toxic and signaling properties that are secreted by Phytophthora fungi, responsible for the incompatible hypersensitive reaction of diverse plant species leading to resistance against fungal or bacterial plant pathogens. The protein was uniformly labeled with 15N to overcome spectral overlap of the proton resonances. A combination of 3D HOHAHA-HMQC and 3D NOESY-HMOC experiments allowed the identification of spin systems with through-bond correlations, which were in turn correlated by through-space connections. The sequential assignment was obtained for main- and side-chain resonances and led to the identification of all secondary structures. A 3D HMQC-NOESY-HMQC experiment was performed which characterized the NH(i)-NH(i+1) connections specific to alpha-helical structures. This proved particularly useful for the assignment of degenerate amide protons of successive residues in alpha-helical structures. The data show five alpha-helical regions comprising residues 5-18, 26-33, 44-58, 59-67, and 86-98 and a two-stranded antiparallel beta-sheet involving residues 70-75 and 80-85, packed around a hydrophobic core grouping all of the aromatic residues. The C-terminal secondary structure motifs of capsicein evoke phospholipase structural features, which suggests that elicitins might interact with the lipidic molecules of the plasma membrane.
PMID - 8994969
BACKGROUND: Elicitins form a novel class of plant necrotic proteins which are secreted by Phytophthora and Pythium fungi, parasites of many economically important crops. These proteins induce leaf necrosis in infected plants and elicit an incompatible hypersensitive-like reaction, leading to the development of a systemic acquired resistance against a range of fungal and bacterial plant pathogens. No crystal structures of this class of protein are available. The crystal structure determination of beta-cryptogein (CRY), secreted by Phytophthora cryptogea, was undertaken to identify structural features important for the necrotic activity of elicitins. RESULTS: The structure of CRY was determined using the multiwavelength anomalous diffraction technique and refined to 2.2 A resolution. The overall structure has a novel fold consisting of six alpha helices and a beak-like motif, whose sequence is highly conserved within the family, composed of an antiparallel two-stranded beta sheet and an omega loop. This motif is assumed to be a major recognition site for a putative receptor and/or ligand. Two other distinct binding sites seem to be correlated to the level of necrotic activity of elicitins. CONCLUSIONS: The determination of the crystal structure of a member of the elicitin family may make it possible to separate the activity that causes leaf necrosis from that inducing systemic acquired resistance to pathogens, making it feasible to engineer a non-toxic elicitin that only elicits plant defences. Such studies should aid the development of non-toxic agricultural pest control.
PMID - 2583277
The phytopathogenic fungi Phytophthora cinnamomi cause systemic leaf necrosis on its non-host tobacco; in culture, it secretes a protein, called cinnamomin, which elicits leaf necrosis and protects tobacco against the pathogen Phytophthora nicotianoe, in a way similar to cryptogein and different from capsicein, elicitins of known amino acid sequences. The cinnamomin sequence has been determined and compared to other elicitins. The differences in the 3 elicitin sequences were correlated to the biological activities: 2 lysines were ascribed as the key amino acids involved in the differential control of protection with respect to necrosis.

FLGK_ECOLI.abs:
PMID - 11677609
Salmonella enterica subspecies I, serovar Typhimurium (S. typhimurium), is a leading cause of human gastroenteritis, and is used as a mouse model of human typhoid fever. The incidence of non-typhoid salmonellosis is increasing worldwide, causing millions of infections and many deaths in the human population each year. Here we sequenced the 4,857-kilobase (kb) chromosome and 94-kb virulence plasmid of S. typhimurium strain LT2. The distribution of close homologues of S. typhimurium LT2 genes in eight related enterobacteria was determined using previously completed genomes of three related bacteria, sample sequencing of both S. enterica serovar Paratyphi A (S. paratyphi A) and Klebsiella pneumoniae, and hybridization of three unsequenced genomes to a microarray of S. typhimurium LT2 genes. Lateral transfer of genes is frequent, with 11% of the S. typhimurium LT2 genes missing from S. enterica serovar Typhi (S. typhi), and 29% missing from Escherichia coli K12. The 352 gene homologues of S. typhimurium LT2 confined to subspecies I of S. enterica-containing most mammalian and bird pathogens-are useful for studies of epidemiology, host specificity and pathogenesis. Most of these homologues were previously unknown, and 50 may be exported to the periplasm or outer membrane, rendering them accessible as therapeutic or vaccine targets.
PMID - 2544561
The flgH, flgI, and fliF genes of Salmonella typhimurium encode the major proteins for the L, P, and M rings of the flagellar basal body. We have determined the sequences of these genes and the flgJ gene and examined the deduced amino acid sequences of their products. FlgH and FlgI, which are exported across the cell membrane to their destinations in the outer membrane and periplasmic space, respectively, both had typical N-terminal cleaved signal-peptide sequences. FlgH is predicted to have a considerable amount of beta-sheet structure, as has been noted for other outer membrane proteins. FlgI is predicted to have an even greater amount of beta-structure. FliF, as is usual for a cytoplasmic membrane protein of a procaryote, lacked a signal peptide; it is predicted to have considerable alpha-helical structure, including an N-terminal sequence that is likely to be membrane-spanning. However, it had overall a quite hydrophilic sequence with a high charge density, especially towards its C terminus. The flgJ gene, immediately adjacent to flgI and the last gene of the flgB operon, encodes a flagellar protein of unknown function whose deduced sequence was hydrophilic and may correspond to a cytoplasmic protein. Several aspects of the DNA sequence of these genes and their surrounds suggest complex regulation of the flagellar gene system. A notable example occurs within the flgB operon, where between the end of flgG (encoding the distal rod protein of the basal body) and the start of flgH (encoding the L-ring protein) there was an unusually long noncoding region containing a potential stem-loop sequence, which could attenuate termination of transcription or stabilize part of the transcript against degradation. Another example is the interface between the flgB and flgK operons, where transcription termination of the former may occur within the coding region of the latter.
PMID - 8158647
Two mutants with defects in hook-associated protein 3 (HAP3) were isolated that exhibit impaired swimming only when they interact with a solid surface or a semisolid matrix. Motility and chemotaxis were normal in liquid media, even in media containing viscous agents, but cells failed to swarm in 0.28% agar. Mutants appeared to carry a full complement of flagella of normal configuration and length. However, filaments rotating counterclockwise close to a glass surface transformed from normal to straight, while filaments rotating clockwise transformed from curly to straight. Both transformations propagated from base to tip, as expected if torsionally induced. The mutations mapped to the middle of flgL, to structural gene for HAP3, and sequence analysis revealed the same coding change in both mutants: a substitution of cysteine for arginine 168. Our results show that the ability of a filament composed of normal flagellin subunits to resist mechanical stress depends on the structure of the protein (HAP3) to which it is attached at its base. The N-terminal sequence of HAP3 was found to be similar to the N-terminal sequence of flagellin, and the possibility that it provides a nucleation site for the C-terminal region of flagellin is discussed.
PMID - 9278503
The 4,639,221-base pair sequence of Escherichia coli K-12 is presented. Of 4288 protein-coding genes annotated, 38 percent have no attributed function. Comparison with five other sequenced microbes reveals ubiquitous as well as narrowly distributed gene families; many families of similar genes within E. coli are also evident. The largest family of paralogous proteins contains 80 ABC transporters. The genome as a whole is strikingly organized with respect to the local direction of replication; guanines, oligonucleotides possibly related to replication and recombination, and most genes are so oriented. The genome also contains insertion sequence (IS) elements, phage remnants, and many other patches of unusual composition indicating genome plasticity through horizontal transfer.
PMID - 12644504
We present the 4.8-Mb complete genome sequence of Salmonella enterica serovar Typhi strain Ty2, a human-specific pathogen causing typhoid fever. A comparison with the genome sequence of recently isolated S. enterica serovar Typhi strain CT18 showed that 29 of the 4,646 predicted genes in Ty2 are unique to this strain, while 84 genes are unique to CT18. Both genomes contain more than 200 pseudogenes; 9 of these genes in CT18 are intact in Ty2, while 11 intact CT18 genes are pseudogenes in Ty2. A half-genome interreplichore inversion in Ty2 relative to CT18 was confirmed. The two strains exhibit differences in prophages, insertion sequences, and island structures. While CT18 carries two plasmids, one conferring multiple drug resistance, Ty2 has no plasmids and is sensitive to antibiotics.
PMID - 8905232
The 718,122 base pair sequence of the Escherichia coli K-12 genome corresponding to the region from 12.7 to 28.0 minutes on the genetic map is described. This region contains at least 681 potential open reading frames, of which 277 (41%) have been previously identified, 147 (22%) are homologous to other known genes, 139 (20%) are identical or similar to the hypothetical genes registered in databases, and the remaining 118 (17%) do not show a significant similarity to any other gene. In this region, we assigned a cluster of cit genes encoding multienzyme citrate lyase, two clusters of fimbrial genes and a set of lysogenic phage genes encoding integrase, excisionase and repressor in the e14 genetic element. In addition, a new valine tRNA gene, designated valZ, and a family of long directly repeated sequences, LDR-A, -B and -C, were found.
PMID - 2193164
Within the bacterial flagellum the basal-body rod, the hook, the hook-associated proteins (HAPs), and the helical filament constitute an axial substructure whose elements share structural features and a common export pathway. We present here the amino acid sequences of the hook protein and the three HAPs of Salmonella typhimurium, as deduced from the DNA sequences of their structural genes (flgE, flgK, flgL and fliD, respectively). We compared these sequences with each other and with those for the filament protein (flagellin) and four rod proteins, which have been described previously (Joys, 1985; Homma et al., 1990; Smith & Selander, 1990). Hook protein most strongly resembled the distal rod protein (FlgG) and the proximal HAP (HAP1), which are thought to be attached to the proximal and distal ends of the hook, respectively; the similarities were most pronounced near the N and C termini. Hook protein and flagellin, which occupy virtually identical helical lattices, did not resemble each other strongly but showed some limited similarities near their termini. HAP3 and HAP2, which form the proximal and distal boundaries of the filament, showed few similarities to flagellin, each other, or the other axial proteins. With the exceptions of the N-terminal region of HAP2, and the C-terminal region of flagellin, proline residues were absent from the terminal regions of the axial proteins. Moreover, with the exception of the N-terminal region of HAP2, the terminal regions contained hydrophobic residues at intervals of seven residues. Together, these observations suggest that the axial proteins may have amphipathic alpha-helical structure at their N and C termini. In the case of the filament and the hook, the terminal regions are believed to be responsible for the quaternary interactions between subunits. We suggest that this is likely to be true of the other axial structures as well, and specifically that interaction between N-terminal and C-terminal alpha-helices may be important in the formation of the axial structures of the flagellum. Although consensus sequences were noted among some of the proteins, such as the rod, hook and HAP1, no consensus extended to the entire set of axial proteins. Thus the basis for recognition of a protein for export by the flagellum-specific pathway remains to be identified.
PMID - 2181149
The stoichiometries of components within the flagellar hook-(basal-body) complex of Salmonella typhimurium have been determined. The hook protein (FlgE), the most abundant protein in the complex, is present at approximately 130 subunits. Hook-associated protein 1 (FlgK) is present at approximately 12 subunits. The distal rod protein (FlgG) is present at approximately 26 subunits, while the proximal rod proteins (FlgB, FlgC and FlgF) are present at only approximately six subunits each. The stoichiometries of the proximal rod proteins and hook-associated protein 1 are, within experimental error, consistent with values of 5 or 6, and 11, respectively. Such values would correspond to either one or two turns of a helical structure with a basic helix of approximately 5.5 subunits per turn, which is the geometry of both the hook and the filament and, one supposes, the rod and hook-associated proteins. These stoichiometries may derive from rules for the heterologous interactions that occur when a helical structure consists of successive segments constructed from different proteins; the stoichiometries within the hook and the distal portion of the rod must, however, be set by different mechanisms. The stoichiometries for the ring proteins are approximately 26 subunits each for the M-ring protein (FliF), the P-ring protein (FlgI), and the L-ring protein (FlgH); the protein responsible for the S-ring feature is not known. The rings presumably have rotational rather than helical symmetry, in which case the stoichiometries would be directly constrained by the intersubunit bonding angle. The ring stoichiometries are discussed in light of other information concerning flagellar structure and function.
PMID - 11677608
Salmonella enterica serovar Typhi (S. typhi) is the aetiological agent of typhoid fever, a serious invasive bacterial disease of humans with an annual global burden of approximately 16 million cases, leading to 600,000 fatalities. Many S. enterica serovars actively invade the mucosal surface of the intestine but are normally contained in healthy individuals by the local immune defence mechanisms. However, S. typhi has evolved the ability to spread to the deeper tissues of humans, including liver, spleen and bone marrow. Here we have sequenced the 4,809,037-base pair (bp) genome of a S. typhi (CT18) that is resistant to multiple drugs, revealing the presence of hundreds of insertions and deletions compared with the Escherichia coli genome, ranging in size from single genes to large islands. Notably, the genome sequence identifies over two hundred pseudogenes, several corresponding to genes that are known to contribute to virulence in Salmonella typhimurium. This genetic degradation may contribute to the human-restricted host range for S. typhi. CT18 harbours a 218,150-bp multiple-drug-resistance incH1 plasmid (pHCM1), and a 106,516-bp cryptic plasmid (pHCM2), which shows recent common ancestry with a virulence plasmid of Yersinia pestis.

GAA6_CHICK.abs:
PMID - 11992121
Although many genes that predispose for epilepsy in humans have been determined, those that underlie the classical syndromes of idiopathic generalized epilepsy (IGE) have yet to be identified. We report that an Ala322Asp mutation in GABRA1, encoding the alpha1 subunit of the gamma-aminobutyric acid receptor subtype A (GABA(A)), is found in affected individuals of a large French Canadian family with juvenile myoclonic epilepsy. Compared with wildtype receptors, GABA(A) receptors that contain the mutant subunit show a lesser amplitude of GABA-activated currents in vitro, indicating that seizures may result from loss of function of this inhibitory ligand-gated channel. Our results confirm that mutation of GABRA1 predisposes towards a common idiopathic generalized epilepsy syndrome in humans.
PMID - 1710013
We report the sequence of a complementary DNA (cDNA) that encodes the chicken GABAA receptor alpha 1 subunit, which is extremely homologous to mammalian alpha 1 subunits. The distribution of alpha 1 subunit transcripts is shown to correlate mainly, but not completely, with the previously-reported pattern of benzodiazepine type I (BZI) binding sites in the avian brain. These results suggest that the alpha 1 subunit may not necessarily be restricted to receptors having BZI pharmacology.
PMID - 10602120
Rett syndrome (RTT) is an X-linked dominant neurodevelopmental disorder that affects females. Exclusion mapping studies using a new family with maternal inheritance of RTT defined Xq28 as the candidate region for the RTT gene. Six candidate genes were selected for mutation analysis based on their established expression patterns and known functions in the CNS. These are: Glutamate receptor subunit 3 (GLUR3), GABA receptor subunit alpha 3 (GABRA3), GABA receptor subunit e1 (GABRE1), Vacuolar ATPase subunit 1 (VATPS1, XAP3), the human homologue of plexin 3-SEX (XAP6) and the Synaptobrevin-like protein (SYBL1). Major rearrangements involving these genes were excluded by Southern analysis. No disease-causing mutations were found, but several single-nucleotide polymorphisms (SNPs) were detected. These SNPs will be useful in future linkage analysis and whole-genome association studies for other diseases. The genomic characterization of GLUR3 and GABRA3 will allow mutational analysis of these genes as candidates for other X-linked neurological disorders mapping to Xq25-Xq26 and Xq28.
PMID - 1846404
gamma-Aminobutyric acidA (GABAA) receptors are multisubunit ligand-gated ion channels which mediate neuronal inhibition by GABA and are composed of at least four subunit types (alpha, beta, gamma, and delta). The gamma 2-subunit appears to be essential for benzodiazepine modulation of GABAA receptor function. In cloning murine gamma 2-subunits, we isolated cDNAs encoding forms of the subunit that differ by the insertion of eight amino acids. LLRMFSFK, in the major intracellular loop between proposed transmembrane domains M3 and M4. The two forms of the gamma 2-subunit are generated by alternative splicing, as demonstrated by cloning and partial sequencing of the corresponding gene. The eight-amino-acid insertion encodes a potential consensus serine phosphorylation site for protein kinase C. These results suggest a novel mechanism for the regulation of the GABAA receptor by protein phosphorylation.
PMID - 2556293
Cloned cDNA encoding the bovine alpha 4 subunit of the GABAA receptor has been isolated. The predicted 51 amino acid long mature protein contains an exceptionally long intracellular domain and shares 53-56% sequence similarity to the previously characterized alpha 1, alpha 2 and alpha 3 subunits. Co-expression of alpha 4 and beta 1 in Xenopus oocytes resulted in the formation of GABA-gated chloride channels with expected pharmacology, although no benzodiazepine potentiation was observed. Northern analysis indicates that a 4 kb alpha 4 mRNA is expressed in the calf cerebellum, cortex and hippocampus but is barely detectable in the rat brain.
PMID - 3037384
Amino-acid sequences derived from complementary DNAs encoding the alpha- and beta-subunits of the GABA/benzodiazepine receptor from bovine brain show homology with other ligand-gated receptor subunits, suggesting that there is a super-family of ion-channel-containing receptors. Co-expression of the in vitro-generated alpha-subunit and beta-subunit RNAs in Xenopus oocytes produces a functional receptor and ion channel with the pharmacological properties characteristic of the GABAA receptor.
PMID - 9339354
We have isolated and sequenced a novel human gene (GABRE) of the GABAA neurotransmitter receptor family. A cDNA sequence of the gene coding for a 506 amino acid protein was identified, representing a member of a putative new class (epsilon) of the GABAA receptor. The gene is transcribed at least at low level in several different tissues, with the highest levels being detected in adult heart and placenta. Alternative splicing of GABRE transcripts isolated from different tissues was observed at multiple positions of the gene, yielding an unusually complex variety of cDNA variants. The structure of the 5' region of most cDNAs is compatible with expression of protein sequence epsilon only in adult brain, whereas in other tissues, the majority of transcripts code for truncated protein sequences. The GABRE gene extends over 14 kb and is clustered together with the alpha 3 and the putative beta 4 GABAA receptor subunit genes in an approximately 0.8-Mb interval in chromosome band Xq28, located in the candidate regions of two different neurologic diseases. Based on features of conservation of protein sequences, gene structure, and genomic organization of GABAA receptor gene clusters, we propose that the epsilon and gamma subunit genes have a common ancestor and that GABAA receptor gene clusters in the human genome have diverged by multiple duplication events of an ancestral gene cluster containing one each alpha, beta, and gamma/epsilon precursor gene.
PMID - 2561977
Two cDNAs (alpha 1 and alpha 4) from rat brain cDNA libraries encode isoforms of the alpha subunit of the GABA/benzodiazepine receptor, which differ at 30% of their amino acid residues. Northern blot analysis and in situ hybridization histochemistry show that alpha 1 and alpha 4 mRNAs have distinct sizes and distinct regional and cellular distributions in rat brain: both mRNAs are found in the cortex and hippocampus; however, only the alpha 1 mRNA is detected in the cerebellum. We injected RNA transcribed from alpha 1 and alpha 4 cDNAs into Xenopus oocytes, together with an RNA for a rat beta subunit. We obtained GABA-dependent inward currents that were reversibly blocked by picrotoxin. Picrotoxin alone, applied to oocytes producing the alpha and beta polypeptides, elicited an outward current. We suggest that these polypeptides together produce GABA-gated ion channels that can also open spontaneously.
PMID - 2847710
A cDNA clone of an alpha subunit of the human GABA-A receptor has been isolated. The human clone (pCLL800) contains 1055 nucleotides in an open reading frame and 260 nucleotides in the 5' non-coding region. The 351 amino acid sequence of this human alpha subunit shows 97% homology with its bovine counterpart. Hybridization of pCLL800 to Northern blots shows a 3.9/4.3 Kb RNA doublet in human cortex, rat whole brain, cortex, hippocampus, midbrain, olfactory bulb and cerebellum. Developmental studies show that the levels of the rat alpha mRNA increase between one and three weeks of age in a manner similar to the development of the benzodiazepine binding sites.
PMID - 1647983
We have utilized a polymerase chain reaction (PCR) strategy to identify a novel subunit, gamma 3, of the GABAA receptor. The gamma 3 cDNA encodes a mature protein of 450 amino acids that contains structural features typically conserved among subunits of the GABAA receptor family. The gamma 3 subunit shares approximately 66% sequence identity with the gamma 2 subunit but only 38% and 29% with alpha 1 and beta 1 subunits, respectively. Localization of the gamma 3 mRNA indicates that it is widely distributed throughout the mouse brain in a pattern similar to that observed for mRNAs encoding the gamma 2 subunits.
PMID - 1660002
The gamma 3-subunit of the GABAA-receptor in rat brain has been identified by molecular cloning. When co-expressed with the alpha 5- and beta 2-subunits in transfected cells a high potency for GABA (Ka = 4.9 +/- 1.2 microM) and a strong cooperativity in gating the channel (H = 1.9 +/- 0.2) was observed. The GABA response was potentiated in the presence of flunitrazepam and reduced by beta CCM. An analogous bi-directional modulation of the GABA response was observed with diazepam and DMCM as tested with the subunit combinations alpha 1 beta 2 gamma 3 and alpha 3 beta 2 gamma 3 expressed in Xenopus oocytes. Since the benzodiazepine receptor ligands were virtually inactive in the absence of the gamma 3-subunit, as tested with the alpha 3 beta 2- and alpha 5 beta 2-subunit combinations, the gamma 3-subunit is a prerequisite for the benzodiazepine receptor sensitivity of the expressed GABAA-receptors. The gamma 3-subunit could functionally replace the gamma 2-subunit with regard to the bi-directional allosteric drug modulation.
PMID - 8780005
The gamma-aminobutyric acid type A receptor cDNAs encoding the alpha 6 subunit homologues from chicken and goldfish have been cloned and sequenced. These proteins exhibit 83 and 75% identity, respectively, to the rat alpha 6 polypeptide. In situ hybridization has demonstrated that, as in mammals, the avian and teleost fish alpha 6 subunit genes are predominantly expressed in cerebellar granule cells. Correspondingly, flunitrazepam-non-displaceable binding of [3H]Ro 15-4513 (a benzodiazepine partial inverse agonist), which is a major characteristic of gamma-aminobutyric acid type-A receptors that contain the alpha 6 polypeptide, is also mainly found for cerebellar granule cells of fish and chick. The conservation of this expression pattern suggests that gamma-aminobutyric acid type A receptors possessing the alpha 6 subunit are of fundamental importance for cerebellar function and that the corresponding gene regulatory elements, e.g., granule cell-specific enhancers, have also been conserved.
PMID - 2842688
When gamma-aminobutyric acid (GABA), the major inhibitory neurotransmitter in vertebrate brain, binds to its receptor it activates a chloride channel. Neurotransmitter action at the GABAA receptor is potentiated by both benzodiazepines and barbiturates which are therapeutically useful drugs (reviewed in ref. 1). There is strong evidence that this receptor is heterogeneous. We have previously isolated complementary DNAs encoding an alpha- and a beta-subunit and shown that both are needed for expression of a functional GABAA receptor. We have now isolated cDNAs encoding two additional GABAA receptor alpha-subunits, confirming the heterogeneous nature of the receptor/chloride channel complex and demonstrating a molecular basis for it. These alpha-subunits are differentially expressed within the CNS and produce, when expressed with the beta-subunit in Xenopus oocytes, receptor subtypes which can be distinguished by their apparent sensitivity to GABA. Highly homologous receptor subtypes which differ functionally seem to be a common feature of brain receptors.
PMID - 8632757
A cDNA encoding the human gamma-aminobutyric acidA (GABAA) receptor alpha 6 subunit has been cloned and sequenced. The deduced amino acid sequence of this cDNA shows 91.4% identity with the published rat alpha 6 subunit. In situ hybridization histochemistry reveals the alpha 6 mRNA to be located within the granule cell layer of the human cerebellar cortex. Recombinant human alpha 6 beta gamma 2S GABAA receptors have been expressed in both stably transfected cells and Xenopus oocytes, and the pharmacology of the benzodiazepine binding site has been determined. The recombinant receptor has a diazepam-insensitive pharmacology, with negligible affinity for a number of classic benzodiazepines. A number of compounds that bind to the benzodiazepine site potentiated the GABA response of alpha 6 beta 2 gamma 2 receptors. Most importantly, the classic benzodiazepine antagonist ethyl-8-fluoro-5,6-dihydro-5-methyl-6-oxo-4H-imidazo[1,5-a] [1,4]benzodiazepine-3-carboxylate (Ro 15-1788) and the partial inverse agonist ethyl-8-azido-5,6-dihydro-5-methyl-6-oxo-4H-imidazo[1,5-a] [1,4]benzodiazepine-3-carboxylate (Ro 15-4513) both acted as agonists at the alpha 6 containing receptor. This observation demonstrates definitively that efficacy of benzodiazepine compounds cannot be generalized across receptor subtypes and may also help explain some of the behavioral effects that have been reported for these compounds.
PMID - 1356407
Gamma-aminobutyric acid (GABA), the major inhibitory neurotransmitter in brain, opens chloride channels through actions on GABAA receptors. We now report base and amino acid sequences of the alpha 1, alpha 2, and alpha 3 subunits from GABAA receptors of audiogenic seizure-prone (DBA/2J) and -resistant (C57BL/6J) inbred strains of mice. Inbreeding had fixed different alleles of the alpha 1 subunit in the two strains, giving five base differences in the cDNAs. None of these affected amino acid sequence, but one did create a NsiI restriction site potentially useful in mapping genomic DNA. No base or amino acid sequence differences between the strains were detected for the other two subunits. Northern blots revealed no apparent strain differences in message levels for these three subunits in whole brains of the mice at 3 weeks of age, the peak of seizure susceptibility in DBA/2J, but did reveal distinct regional and developmental patterns of expression among the subunits in mouse brain.
PMID - 2166916
Benzodiazepines are widely prescribed anxiolytics and anticonvulsants which bind with high affinity to sites on the GABAA receptor/Cl- channel complex and potentiate the effect of the neurotransmitter GABA (gamma-aminobutyric acid). The heterogeneity of benzodiazepine recognition sites in the central nervous system was revealed by studies showing different classes of GABAA receptor subunits (classes alpha, beta and gamma) and variant subunits in these classes, particularly in the alpha-class. Expression of recombinant subunits produces functional receptors; when certain alpha-variants are coexpressed with beta- and gamma-subunits the resulting receptors have pharmacological properties characteristic of GABAA-benzodiazepine type I or type II receptors. The alpha-variants are differentially expressed in the central nervous system and can be photoaffinity-labelled with benzodiazepines. Here we report a novel alpha-subunit (alpha 6) of cerebellar granule cells. We show that recombinant receptors composed of alpha 6, beta 2 and gamma 2 subunits bind with high affinity to the GABA agonist [3H]muscimol and the benzodiazepine [3H]Ro15-4513 but not the other benzodiazepines or beta-carboniles. The same distinctive pharmacology is observed with GABAA receptors from rat cerebellum immunoprecipitated by an antiserum specific for the alpha 6 subunit. We conclude that this alpha-subunit is part of a cerebellar receptor subtype, selective for Ro15-4513, an antagonist of alcohol-induced motor incoordination and ataxia.
PMID - 7607683
We demonstrated previously that an alpha 1-beta 2-gamma 2 gene cluster of the gamma-aminobutyric acid (GABAA) receptor is located on human chromosome 5q34-q35 and that an ancestral alpha-beta-gamma gene cluster probably spawned clusters on chromosomes 4, 5, and 15. Here, we report that the alpha 4 gene (GABRA4) maps to human chromosome 4p14-q12, defining a cluster comprising the alpha 2, alpha 4, beta 1, and gamma 1 genes. The existence of an alpha 2-alpha 4-beta 1-gamma 1 cluster on chromosome 4 and an alpha 1-alpha 6-beta 2-gamma 2 cluster on chromosome 5 provides further evidence that the number of ancestral GABAA receptor subunit genes has been expanded by duplication within an ancestral gene cluster. Moreover, if duplication of the alpha gene occurred before duplication of the ancestral gene cluster, then a heretofore undiscovered subtype of alpha subunit should be located on human chromosome 15q11-q13 within an alpha 5-alpha x-beta 3-gamma 3 gene cluster at the locus for Angelman and Prader-Willi syndromes.
PMID - 8719414
A cDNA encoding the gamma 3 subunit of the human GABAA receptor has been obtained by molecular cloning. Its deduced amino acid sequence shows a high level of sequence identity with the published mouse and rat sequences (96%). The ligand binding pharmacology of the benzodiazepine site formed by stably-expressed human alpha 5 beta 3 gamma 2S and alpha 5 beta 3 gamma 3 GABAA receptor subtypes have been compared for a number of ligands, Benzodiazepine site ligands were found to be either non-selective or gamma 2-selective, with the exception of CL218,872, which was found to be 10-fold selective for the alpha 5 beta 3 gamma 3-containing subtype Two benzodiazepine site ligands. Ro15-4513 and FG8205 were more efficacious at alpha 5 beta 3 gamma 3 receptors than alpha 5 beta 3 gamma 2 receptors expressed in Xenopus oocytes, CL218,872, which is a partial agonist at alpha 1 containing receptors, had no intrinsic activity at either alpha 5 beta 3 gamma 2 or alpha 5 beta 3 gamma 3, alpha 1 beta 2 gamma 2S and alpha 1 beta 2 gamma 3 human GABAA receptors were also expressed in Xenopus oocytes and their benzodiazepine pharmacology investigated. Both the EC50 and efficacy of benzodiazepine site ligands were influenced by the type of gamma subunit coexpressed with alpha 1 and beta 2.
PMID - 9039914
A common feature of general anaesthetic agents is their ability to potentiate neuronal inhibition through GABA(A) (gamma-aminobutyric acid) receptors. At concentrations relevant to clinical anaesthesia, these agents cause a dramatic stimulation of the chloride currents that are evoked by the binding of the natural ligand, GABA. Although there is widespread evidence that the sensitivity of GABA(A) receptors to anaesthetic agents is heterogeneous, the structural basis of these differences is largely unknown. Variations in subunit composition can have profound effects on the sensitivity of GABA(A) receptors to modulatory agents such as benzodiazepines. However, strict subunit specificity has not been demonstrated for the potentiating effects of anaesthetic agents. Here we describe a new class of human GABA(A) receptor subunit (epsilon) that can assemble with alpha- and beta-subunits and confer an insensitivity to the potentiating effects of intravenous anaesthetic agents. The epsilon-subunit also abolishes the normal outward rectification of recombinant receptors in which it assembles. The expression pattern of this subunit in the brain suggests a new target for manipulation of neuronal pathways within the basal ganglia.
PMID - 1849552
A cDNA from a rat hippocampal cDNA library encodes an isoform of the alpha polypeptide of the gamma-aminobutyric acid (GABA)/benzodiazepine (BZ) receptor. Its deduced amino acid sequence is 96% identical to that of the alpha 2 polypeptide of the bovine GABAA receptor. The polypeptide has features shared by all previously reported GABAA receptor alpha polypeptides and shares 71-76% identity with previously described rat alpha polypeptides. Most of the differences lie in the presumed extracellular and intracellular domains. On Northern blots, the alpha 2 cDNA detects two mRNAs, which are found in cortex, hippocampus, and striatum, brain regions enriched in pharmacologically defined "BZ type II" receptors. Other workers have previously shown that the alpha polypeptides of the GABAA receptor largely determine the BZ binding properties of reconstituted receptors. The distribution of alpha 2 mRNAs in rat brain suggests that the alpha 2 subunit may indeed be involved in the BZ type II receptors.
PMID - 9527017
Gamma-aminobutyric acid (GABA)A receptors are the sites of action for many antiepileptic drugs such as benzodiazepines and barbiturates. We report the results of molecular cloning of the gamma1-subunit from seizure prone DBA/2J and resistant C57BL/6J inbred mice, and analyses of nucleotide sequences and expression of the gamma1-subunit messenger RNA (mRNA) in DBA/2 and C57BL/6 inbred mice. The mouse gamma1-subunit complementary DNA (cDNA) shares 98% similarity with that of the rat at the level of amino acid sequence. Northern blot hybridization indicates that the gamma1-subunit mRNA is expressed predominantly in areas other than the cerebral cortex and cerebellum and shows little change with postnatal development. No differences have been found for the subunit between DBA/2 and C57BL/6 mice either for nucleotide sequence or for level of expression of the subunit's mRNA in whole brain by Northern blots at 3 weeks of age.
PMID - 9205108
The gamma-aminobutyric acid-type A receptor alpha5 subunit gene (GABRA5) is widely expressed in brain and localized to the imprinted human chromosome 15q11-q13. A combination of cDNA library screening and 5' RACE analysis led to identification of three distinct mRNA isoforms of GABRA5 in human adult and fetal brain tissues, each of which differs only in the noncoding 5' UTR sequence. Alignment of the genomic and cDNA sequences of GABRA5 revealed that the mRNA isoforms resulted from three alternative first exons 1A, 1B, and 1C. Northern blot analysis showed that the expression of GABRA5 was not only tissue specific but region specific in brain. CAT reporter assays revealed promoter elements in the 5' proximity of each first exon. The GABRA5 promoter regions lacked TATA and CCAAT boxes but contained several other consensus transcriptional factor recognition sequences. These findings suggest that the differential exon 1 usage of GABRA5 arises as a consequence of alternative promoter activation.
PMID - 9084408
GABA-gated chloride channels are the main inhibitory neurotransmitter receptors in the CNS. Conserved domains among members of previously described GABAA receptor subunits were used to design degenerate sense and antisense oligonucleotides. A PCR product from this amplification was used to isolate a full-length cDNA. The predicted protein has many of the features shared by other members of the ligand-gated ion channel family. This channel subunit has significant amino acid identity (25-40%) with members of GABAA and GABAC receptor subunits and thus may represent a new subfamily of the GABA receptor channel. Although we cannot rule out that this clone encodes a receptor for an unidentified ligand, it was termed GABA chi. This gene is mainly expressed in placenta and in heart; however, placenta appears to express only an unspliced mRNA. In situ hybridization reveals that the GABA chi subunit mRNA is present in the electrical conduction system of the human heart. Our results suggest that novel GABA receptors expressed outside of the CNS may regulate cardiac function.
PMID - 8388764
We have established a probe order within the Angelman/Prader-Willi chromosomal regions by multicolor fluorescence in situ hybridization (FISH). The probe [locus] order extending distally from the centromere is 34[D15S9]-IR4-3R[D15S11]-189-1[D15S13]-PW71++ + [D15S63]-3-21[D15S10]-28 beta 3-H3[GABRB3]-IR10-1 [D15S12]. This order agrees with that recently reported (1) with the exception of PW71 [D15S63]. In addition, a second gamma-aminobutyric acid (GABAA) receptor, the alpha 5 subunit, has been localized within the reference map to between GABRB3 and D15S12. The locus order was further confirmed by DNA hybridization analysis of two patients, one with Angelman syndrome and one with Prader-Willi syndrome, with different unbalanced translocations and molecular extents of deletion. Our results provide a framework map of chromosome 15q11-q13 into which additional markers can be oriented and allow a further differentiation of the critical genetic regions of the two syndromes.
PMID - 1655526
A cDNA of rat brain encoding the GABAA receptor alpha 4 subunit has been cloned. Recombinant receptors composed of alpha 4, beta 2 and gamma 2 subunit bind with high affinity the GABA agonist [3H]muscimol and the benzodiazepine 'alcohol antagonist' [3H]Ro 15-4513, but fail to bind benzodiazepine agonists. The alpha 4 subunit is expressed mainly in the thalamus, as assessed by in situ hybridization histochemistry, and may participate in a major population of thalamic GABAA receptors. The alpha 4 mRNA is found at lower levels in cortex and caudate putamen, and is rare in cerebellum.
PMID - 1311098
Cloned cDNAs encoding a member of the gamma-aminobutyric acid type A receptor gamma-subunit class were isolated from rat-brain-mRNA-derived libraries. The gamma 3 mRNA is present in cortex, claustrum, caudate putamen, and some thalamic nuclei, particularly the medial geniculate nucleus, where it is the predominant gamma-subunit transcript. The gamma 3 gene is expressed at very low levels in cerebellum and hippocampus. In coexpression experiments with the alpha 1 and beta 2 subunits, gamma 3 imparts benzodiazepine binding to gamma-aminobutyric acid type A receptors and forms gamma-aminobutyric acid-gated benzodiazepine-modulated chloride channels that exhibit a larger conductance than alpha 1 beta 2 receptor channels. Furthermore, the presence of gamma 3 in place of gamma 2 in alpha 1 beta 2 gamma x receptors generates a marked decrease in the affinity of agonists while leaving the affinity of antagonists or negative modulators largely unaffected.
PMID - 2538761
Neurotransmission effected by GABA (gamma-aminobutyric acid) is predominantly mediated by a gated chloride channel intrinsic to the GABAA receptor. This heterooligomeric receptor exists in most inhibitory synapses in the vertebrate central nervous system (CNS) and can be regulated by clinically important compounds such as benzodiazepines and barbiturates. The primary structures of GABAA receptor alpha- and beta-subunits have been deduced from cloned complementary DNAs. Co-expression of these subunits in heterologous systems generates receptors which display much of the pharmacology of their neural counterparts, including potentiation by barbiturates. Conspicuously, however, they lack binding sites for, and consistent electrophysiological responses to, benzodiazepines. We now report the isolation of a cloned cDNA encoding a new GABAA receptor subunit, termed gamma 2, which shares approximately 40% sequence identity with alpha- and beta-subunits and whose messenger RNA is prominently localized in neuronal subpopulations throughout the CNS. Importantly, coexpression of the gamma 2 subunit with alpha 1 and beta 1 subunits produces GABAA receptors displaying high-affinity binding for central benzodiazepine receptor ligands.
PMID - 2167378
GABA (gamma-aminobutyric acid), the major inhibitory neurotransmitter in the vertebrate brain, mediates neuronal inhibition by opening a chloride channel integral to the GABAA receptor. This action is potentiated by both benzodiazepine and barbiturate drugs. Since the isolation of cDNAs encoding GABAA receptor alpha 1 and beta 1 subunits, a further eight subunits have been identified. These subunits show GABAA receptor heterogeneity, unpredicted from classical pharmacological studies. I now report the isolation of a mouse cDNA clone encoding a novel GABAA receptor alpha subunit. The striking feature of this subunit is its regional distribution in the mouse brain. Northern hybridization and in situ hybridization experiments demonstrate that the subunit mRNA is expressed only in cerebellar granule cells. This is the first demonstration of the exclusive presence of a neuroreceptor subtype in a single neuronal cell type.
PMID - 11003197
BACKGROUND: Behavioral genomics has made dramatic progress toward mapping quantitative trait loci (QTLs) that contain genes responsible for phenotypic differences in a variety of behavioral responses to alcohol (ethanol). We previously identified a QTL on mouse Chromosome 11 that affects genetic predisposition to acute alcohol withdrawal. Among mice derived from the C57BL/6J (B6) and DBA/2J (D2) inbred strains, this QTL (Alcw3) accounts for 12% of the genetic variability in withdrawal liability. Candidate genes within this QTL encode the gamma-aminobutyric acid type A (GABA A) receptor gamma2, alpha1, alpha6, and beta2 subunits. We recently identified a coding sequence polymorphism between the B6 and D2 strains for the GABA A receptor gamma2 subunit gene (Gabrg2). In this study, we expand our analysis to a panel of BXD strains derived from the B6 and D2 progenitor strains. These BXD strains provide 26 fixed recombinant genotypes that can be used to examine genetic correlations, for example, between a phenotype of interest and allelic variation in a candidate gene. METHODS: Gabrg2 was cloned and sequenced from the 26 BXD recombinant inbred strains. We analyzed genetic correlations between allelic variation in Gabrg2 and alcohol phenotypes previously measured in the BXD strain means. RESULTS: Allelic variation in Gabrg2 is correlated genetically with predisposition to acute alcohol withdrawal and may underlie the Alcw3 locus. In addition, Gabrg2 is associated with ethanol-conditioned taste aversion, ethanol-induced motor incoordination, and ethanol-induced hypothermia. A trend is observed for chronic ethanol withdrawal, ethanol-induced loss of righting reflex, and tolerance to ethanol-induced hypothermia and ataxia. CONCLUSIONS: Functionally relevant variation in Gabrg2, or a closely linked gene, is correlated genetically with some, but not all, behavioral responses to alcohol. The alcohol-related phenotypes associated with Gabrg2 generally may be characterized as debilitating or motivationally negative.
PMID - 11326274
Major advances in the identification of genes implicated in idiopathic epilepsy have been made. Generalized epilepsy with febrile seizures plus (GEFS+), benign familial neonatal convulsions and nocturnal frontal lobe epilepsy, three autosomal dominant idiopathic epilepsies, result from mutations affecting voltage-gated sodium and potassium channels, and nicotinic acetylcholine receptors, respectively. Disruption of GABAergic neurotransmission mediated by gamma-aminobutyric acid (GABA) has been implicated in epilepsy for many decades. We now report a K289M mutation in the GABA(A) receptor gamma2-subunit gene (GABRG2) that segregates in a family with a phenotype closely related to GEFS+ (ref. 8), an autosomal dominant disorder associating febrile seizures and generalized epilepsy previously linked to mutations in sodium channel genes. The K289M mutation affects a highly conserved residue located in the extracellular loop between transmembrane segments M2 and M3. Analysis of the mutated and wild-type alleles in Xenopus laevis oocytes confirmed the predicted effect of the mutation, a decrease in the amplitude of GABA-activated currents. We thus provide the first genetic evidence that a GABA(A) receptor is directly involved in human idiopathic epilepsy.
PMID - 2157817
A cDNA encoding a protein with 70% amino acid identity to the previously characterized gamma-aminobutyric acidA (GABAA) receptor alpha-subunits was isolated from a rat brain cDNA library by homology screening. As observed for alpha 1-, alpha 2-, and alpha 3-subunits, coexpression of this new alpha-subunit (alpha 5) with a beta- and gamma 2-subunit in cultured cells produces receptors displaying high-affinity binding sites for both muscimol, a GABA agonist, and benzodiazepines. Characteristic of GABAA/benzodiazepine type II sites, receptors containing alpha 2-, alpha 3- or alpha 5-subunits have low affinities for several type I-selective compounds. However, alpha 5-subunit-containing receptors have lower affinities for zolpidem (30-fold) and Cl 218 872 (three-fold) than measured previously using recombinantly expressed type II receptors containing either alpha 2- or alpha 3-subunits. Based on these findings, a reclassification of the GABAA/benzodiazepine receptors is warranted.
PMID - 2165521
GABAA receptors, the major synaptic targets for the neurotransmitter GABA, constitute gated chloride channels. By their allosteric, drug-induced modulation, they serve as control elements for the regulation of anxiety, vigilance, and epileptiform activity. The structural requirements of fully functional GABAA receptors in the mammalian brain have remained elusive so far. We report here on the cloning of the gamma 2-subunit cDNA of rat brain and its functional analysis by coexpression with the alpha 1- and beta 1-subunits in Xenopus oocytes, and on the sites of gene expression of the 3 subunits in the rat brain. The recombinant receptor displayed GABA-inducible currents (Imax = 6 microA; Ka = 75 microM) which were allosterically modulated by benzodiazepine receptor ligands (enhancement and inhibition by diazepam and methyl-6,7-dimethoxy-4-ethyl-beta-carboline-3-carboxylate, respectively). In the absence of GABA, pentobarbital elicited a maximal current amplitude similar to that of GABA. A minor population of channels is expressed which is open in the absence of GABA or pentobarbital. Mapping subunit gene expression by in situ hybridization histochemistry suggests that the alpha 1-, beta 1-, and gamma 2-subunits are likely receptor constituents in some neuronal populations, e.g., mitral cells of the olfactory bulb, pyramidal cells of the hippocampus, and granule cells of the dentate gyrus and cerebellum.
PMID - 1702226
Diversity of gamma-aminobutyrate type A (GABAA) receptors has recently been proposed to be achieved by assembly of receptor subtypes from a multitude of subunits (alpha 1-6, beta 1-3, gamma 1-2, and delta) encoded by different genes. Here we report a further mechanism for creating GABAA receptor diversity: alternative RNA splicing. Two forms of bovine gamma 2 subunit cDNA were isolated (gamma 2S and gamma 2L) that differed by the presence or absence of a 24-base-pair (8-amino acid) insertion in the cytoplasmic domain between the third and fourth putative membrane-spanning regions. Polymerase chain reaction from RNA demonstrated that the two forms of gamma 2 subunit are expressed in bovine, human, and rat brain. Sequencing of genomic DNA clones encoding the gamma 2 subunit demonstrated that the 24-base-pair insert is organized as a separate exon. Analysis of the sequence of the 8-amino acid insert revealed that it contains a protein kinase C consensus phosphorylation site. Expression of the large cytoplasmic loop domains of gamma 2S and gamma 2L in Escherichia coli, followed by phosphorylation of the recombinant proteins by protein kinase C, demonstrated that gamma 2L, but not gamma 2S, could be phosphorylated. Thus the two forms of gamma 2 subunit differ by the presence or absence of a protein kinase C phosphorylation site. This mechanism for creating GABAA receptor diversity may allow differential regulation of the function of receptor subtypes.
PMID - 9892355
Type-A receptors for the neurotransmitter GABA (gamma-aminobutyric acid) are ligand-gated chloride channels that mediate inhibitory neurotransmission. Each subunit of the pentameric receptor protein has ligand-binding sites in the amino-terminal extracellular domain and four membrane-spanning regions, one of which forms a wall of the ion channel. Each subunit also has a large intracellular loop that may be a target for protein kinases and be required for subcellular targeting and membrane clustering of the receptor, perhaps by anchoring the receptor to the cytoskeleton. Neurotransmitter receptors need to be positioned in high density in the cell membrane at sites postsynaptic to nerve terminals releasing that neurotransmitter. Other members of the superfamily of ligand-gated ion-channel receptors associate in postsynaptic-membrane clusters by binding to the proteins rapsyn or gephyrin. Here we identify a new cellular protein, GABA(A)-receptor-associated protein (GABARAP), which can interact with the gamma2 subunit of GABA(A) receptors. GABARAP binds to GABA(A) receptors both in vitro and in vivo, and co-localizes with the punctate staining of GABA(A) receptors on cultured cortical neurons. Sequence analysis shows similarity between GABARAP and light chain-3 of microtubule-associated proteins 1A and 1B. Moreover, the N terminus of GABARAP is highly positively charged and features a putative tubulin-binding motif. The interactions among GABA(A) receptors, GABARAP and tubulin suggest a mechanism for the targeting and clustering of GABA(A) receptors.
PMID - 2561970
Two cDNAs encoding novel GABAA receptor subunits were isolated from a rat brain library. These subunits, gamma 2 and delta, share approximately 35% sequence identity with alpha and beta subunits and form functional GABA-gated chloride channels when expressed alone in vitro. The gamma 2 subunit is the rat homolog of the human gamma 2 subunit recently shown to be important for benzodiazepine pharmacology. Cellular localization of the mRNAs encoding the gamma 2 and delta subunits in rat brain revealed that largely distinct neuronal subpopulations express the two subunits. The delta subunit distribution resembles that of the high affinity GABAA receptor labeled with [3H]muscimol; the gamma 2 subunit distribution resembles that of GABAA/benzodiazepine receptors labeled with [3H]flunitrazepam. These findings have implications for the composition of two different GABAA receptor subtypes and for information processing in networks using GABA for signaling.
PMID - 11326275
Epilepsies affect at least 2% of the population at some time in life, and many forms have genetic determinants. We have found a mutation in a gene encoding a GABA(A) receptor subunit in a large family with epilepsy. The two main phenotypes were childhood absence epilepsy (CAE) and febrile seizures (FS). There is a recognized genetic relationship between FS and CAE, yet the two syndromes have different ages of onset, and the physiology of absences and convulsions is distinct. This suggests the mutation has age-dependent effects on different neuronal networks that influence the expression of these clinically distinct, but genetically related, epilepsy phenotypes. We found that the mutation in GABRG2 (encoding the gamma2-subunit) abolished in vitro sensitivity to diazepam, raising the possibility that endozepines do in fact exist and have a physiological role in preventing seizures.
PMID - 12477932
The National Institutes of Health Mammalian Gene Collection (MGC) Program is a multiinstitutional effort to identify and sequence a cDNA clone containing a complete ORF for each human and mouse gene. ESTs were generated from libraries enriched for full-length cDNAs and analyzed to identify candidate full-ORF clones, which then were sequenced to high accuracy. The MGC has currently sequenced and verified the full ORF for a nonredundant set of >9,000 human and >6,000 mouse genes. Candidate full-ORF clones for an additional 7,800 human and 3,500 mouse genes also have been identified. All MGC sequences and clones are available without restriction through public databases and clone distribution networks (see http:mgc.nci.nih.gov).
PMID - 8719416
Benzodiazepines modulate gamma-aminobutyric acid (GABA)-evoked chloride currents through a specific binding site at the GABAA receptor-chloride channel complex. The heterogeneity of diazepam-sensitive benzodiazepine binding sites (type I and type II) has been identified by pharmacological approaches both with native receptors and recombinant receptors coexpressing alpha, beta and gamma subunits. In addition, two distinguishable diazepam-insensitive benzodiazepine sites are found, spatially distributed between cerebral cortical and cerebellar regions. Coexpression of alpha 6 with beta 2 and gamma 2L subunits creates a pharmacologically similar benzodiazepine receptor to the diazepam-insensitive site observed in cerebellum, however, there is no evidence regarding the possible subunit combination forming the DI site in cerebral tissues. Here we report the cloning of the human alpha 4 cDNA and its pharmacology by coexpression of this alpha 4 subunit with beta 2 and gamma 2L subunits. This recombinant receptor complex showed a high affinity for the previously described benzodiazepine partial agonist bretazenill, the pyrazoloquinoline compounds CGS-9895 and CGS-9896, as well as the inverse agonists DMCM (methyl 6,7-dimethoxy 4-ethyl-beta-carboline-3-carboxylate) and Ro15-4513 as determined by [3H]Ro15-4513 binding. However, it is insensitive to the benzodiazepine type I selective compounds CL218.872 (3-methyl-6-[3-(trifluoromethyl)[phenyl]-1,2,4-triazolo[4.3-b]pyridazine ) and zolpidem as well as the benzodiazepine full agonists diazepam, halazolam and midazolam. In addition, the benzodiazepine receptor ligands DMCM, beta-CCE (beta-carboline-3-carboxylate ethyl ester), Beta-CCM (beta-carboline-3-carboxylate methyl ester), FG-7142, CGS-9895 and CGS-9896 showed 7 to 10 times higher affinity for alpha 4 beta 2 gamma 2L. The pharmacology of the alpha 4 beta 2 gamma 2L receptor complex appears to resemble those of the diazepam-insensitive site found in the cerebral cortex. Our study thus suggests that this subpopulation of diazepam-insensitive GABAA receptors may be composed of alpha 4 beta 2 gamma 2L subunits.
PMID - 8391122
cDNAs encoding human alpha 2 and alpha 3 gamma-aminobutyric acidA receptor subunits have been cloned. Their deduced amino acid sequences show much sequence identity with the published bovine sequences (98.2% and 97.0% for alpha 2 and alpha 3, respectively). Human alpha 1 beta 1 gamma 2, alpha 2 beta 1 gamma 2, alpha 3 beta 1 gamma 2, and alpha 5 beta 1 gamma 2 subunit combinations were expressed in transiently transfected cells and their pharmacologies were characterized using a series of benzodiazepine (BZ) binding site ligands. Human alpha 1-containing receptors exhibited a BZ1-type pharmacology, and alpha 2-, alpha 3-, and alpha 5-containing receptors exhibited a broadly BZ2-type pharmacology. The partial inverse agonist Ro15-4513 showed an approximately 10-15-fold higher affinity for alpha 5-containing than for alpha 1-, alpha 2-, or alpha 3-containing receptors and is thus the first compound shown to have a significantly higher affinity for another receptor subtype than for alpha 1 beta 1 gamma 2.
PMID - 1848528
The murine GABAA/benzodiazepine (GABAA/BZ) receptor alpha 1 subunit cDNA has been isolated from a BALB/c mouse brain library and sequenced. The cDNA is 2665 nucleotides long with an open reading frame of 455 amino acids. It shows significant homology to the GABAA receptor alpha 1 subunit cDNA sequences of other species. Excluding deletions, the murine GABAA alpha 1 receptor exhibits 96% nucleotide and 100% amino acid sequence homology to the rat alpha 1 receptor cDNA and over 91% nucleotide and 98% amino acid sequence homology to the bovine and human alpha 1 receptor cDNAs in the protein coding region. This murine cDNA was used to locate the alpha 1 receptor subunit gene, Gabra-1, to murine Chromosome 11 between Il-3 and Rel. This assignment extends proximally the segment of mouse Chromosome 11 with known homology to human chromosome 5.
PMID - 2170110
The GABAA receptor gamma 1 subunit of human, rat and bovine origin was molecularly cloned and compared with the gamma 2 subunit in structure and function. Both gamma subunit variants share 74% sequence similarity and are prominently synthesized in often distinct areas of the central nervous system as documented by in situ hybridization. When co-expressed with alpha and beta subunits in Xenopus oocytes and mammalian cells, the gamma variants mediate the potentiation of GABA evoked currents by benzodiazepines and help generate high-affinity binding sites for these drugs. However, these sites show disparate pharmacological properties which, for receptors assembled from alpha 1, beta 1 and gamma 1 subunits, are characterized by the conspicuous loss in affinity for neutral antagonists (e.g. flumazenil) and negative modulators (e.g. DMCM). These findings reveal a pronounced effect of gamma subunit variants on GABAA/benzodiazepine receptor pharmacology.
PMID - 2153588
Two alpha subunits of the GABAA receptor in rat brain have been identified by molecular cloning. The deduced polypeptide sequences share major characteristics with other chemically gated ion channel proteins. One polypeptide represents the rat homologue of the alpha 3 subunit previously cloned from bovine brain, while the other polypeptide is a yet known subunit, termed alpha 5. When coexpressed with the beta 1 subunit in Xenopus oocytes the receptors containing the alpha 5 subunit revealed a higher sensitivity to GABA than receptors expressed from alpha 1 + beta 1 subunits or alpha 3 + beta 1 subunits (Ka = 1 microM, 13 microM and 14 microM, respectively). The alpha 5 subunit was expressed only in a few brain areas such as cerebral cortex, hippocampal formation and olfactory bulb granular layer as shown by in situ hybridization histochemistry. Since the mRNA of the alpha 5 subunit was colocalized with the alpha 1 and alpha 3 subunits only in cerebral cortex and in the hippocampal formation the alpha 5 subunit may be part of distinct GABAA receptors in neuronal populations within the olfactory bulb.
PMID - 8397108
We have isolated a cDNA, from the chicken, that encodes a fourth type of gamma subunit of the vertebrate brain GABAA receptor. The mature polypeptide (which we name gamma 4) displays 67%, 69% and 70% identity, respectively, to the rat gamma 1, gamma 2 and gamma 3 subunits. In the developing chicken brain, the gamma 4-subunit mRNA is first detected at embryonic day 13; the transcript level then increases progressively during embryogenesis. In situ hybridization reveals that the gamma 4-subunit mRNA is abundant in several brain regions, including the ectostriatum, nucleus rotundus and hyperstriatum ventrale, which are involved in visual processing and learning.
PMID - 2465923
The deduced amino acid sequences of cDNA clones encoding human GABAA receptor alpha 1 and beta 1 subunits are presented. The human subunits display very high levels of sequence identity with the corresponding bovine receptor subunits. The cloned human GABAA receptor subunits induce the formation of GABA-gated chloride channels when expressed in mammalian cells.
PMID - 12466851
Only a small proportion of the mouse genome is transcribed into mature messenger RNA transcripts. There is an international collaborative effort to identify all full-length mRNA transcripts from the mouse, and to ensure that each is represented in a physical collection of clones. Here we report the manual annotation of 60,770 full-length mouse complementary DNA sequences. These are clustered into 33,409 'transcriptional units', contributing 90.1% of a newly established mouse transcriptome database. Of these transcriptional units, 4,258 are new protein-coding and 11,665 are new non-coding messages, indicating that non-coding RNA is a major component of the transcriptome. 41% of all transcriptional units showed evidence of alternative splicing. In protein-coding transcripts, 79% of splice variations altered the protein product. Whole-transcriptome analyses resulted in the identification of 2,431 sense-antisense pairs. The present work, completely supported by physical clones, provides the most comprehensive survey of a mammalian transcriptome so far, and is a valuable resource for functional genomics.

HH1R_BOVIN.abs:
PMID - 8917588
From pharmacological studies using histamine antagonists and agonists, it has been demonstrated that histamine modulates many physiological functions of the hypothalamus, such as arousal state, locomotor activity, feeding, and drinking. Three kinds of receptors (H1, H2, and H3) mediate these actions. To define the contribution of the histamine H1 receptors (H1R) to behavior, mutant mice lacking the H1R were generated by homologous recombination. In brains of homozygous mutant mice, no specific binding of [3H]pyrilamine was seen. [3H]Doxepin has two saturable binding sites with higher and lower affinities in brains of wild-type mice, but H1R-deficient mice showed only the weak labeling of [3H]doxepin that corresponds to lower-affinity binding sites. Mutant mice develop normally, but absence of H1R significantly increased the ratio of ambulation during the light period to the total ambulation for 24 hr in an accustomed environment. In addition, mutant mice significantly reduced exploratory behavior of ambulation and rearings in a new environment. These results indicate that through H1R, histamine is involved in circadian rhythm of locomotor activity and exploratory behavior as a neurotransmitter.
PMID - 12142541
Bphs controls Bordetella pertussis toxin (PTX)-induced vasoactive amine sensitization elicited by histamine (VAASH) and has an established role in autoimmunity. We report that congenic mapping links Bphs to the histamine H1 receptor gene (Hrh1/H1R) and that H1R differs at three amino acid residues in VAASH-susceptible and -resistant mice. Hrh1-/- mice are protected from VAASH, which can be restored by genetic complementation with a susceptible Bphs/Hrh1 allele, and experimental allergic encephalomyelitis and autoimmune orchitis due to immune deviation. Thus, natural alleles of Hrh1 control both the autoimmune T cell and vascular responses regulated by histamine after PTX sensitization.
PMID - 8294913
An intronless DNA encoding the guinea pig H1 receptor was cloned from a genomic library using probes derived from the bovine H1 receptor. It encodes a protein of 488 amino acids with a calculated molecular mass of 55,619 daltons compared with a size of 56-68 kDa for the photoaffinity-labeled receptor as determined by sodium dodecyl sulfate-polyacrylamide gel electrophoresis analysis. The protein displays a 66% homology with the bovine receptor. Stable expression of the H1 receptor, characterized by the appearance of [3H]mepyramine binding sites with a pharmacology similar to that of the native H1 receptor, was obtained following transfection of Chinese hamster ovary cells. Southern blot analysis, using a variety of restriction enzymes, did not provide any evidence of multiple H1 isoreceptors. Northern blot analysis of a variety of guinea pig peripheral or cerebral tissues identified, in most cases, a single transcript of 3.3 kb, but also, in some tissues, a second transcript of 3.7 kb, possibly generated by the use of different promoter or polyadenylation sites or corresponding to a transcript from a distinct gene. In situ hybridization studies showed the highly contrasted cerebral expression of H1-receptor gene transcripts, which was compared with autoradiographic receptor localization. This allowed the identification of some major cell populations expressing the H1 receptor, e.g., Purkinje cells in cerebellum or pyramidal cells in the hippocampal complex.
PMID - 8280179
A human histamine H1 receptor gene lacking introns was isolated by screening a human genomic library with a bovine histamine H1 receptor probe. The deduced protein of 487 amino acids showed characteristic properties of G-protein-coupled receptors. The coding region was subcloned into the expression vector pSVL (Pharmacia), and the resulting construct transfected into COS-7 cells. Binding studies with [3H]pyrilamine on membranes from transfected cells revealed saturable specific binding with a KD of 1.2 nM and a Bmax of 3400 fmol/mg protein. Binding affinities of histamine and known histamine antagonists were similar to those for histamine H1 receptors in guinea-pig cerebellum. In transfected COS-7 cells, histamine induced inositol phosphate formation, that was inhibitable by pyrilamine.
PMID - 12595443
In vivo intoxication with Bordetella pertussis toxin (PTX) elicits a variety of physiological responses including a marked leukocytosis, disruption of glucose regulation, adjuvant activity, alterations in vascular function, hypersensitivity to vasoactive agents, and death. We recently identified Bphs, the locus controlling PTX-induced hypersensitivity to the vasoactive amine histamine, as the histamine H(1) receptor (Hrh1). In this study Bphs congenic mice and mice with a disrupted Hrh1 gene were used to examine the role of Bphs/Hrh1 in the genetic control of susceptibility to a number of phenotypes elicited following in vivo intoxication. We report that the contribution of Bphs/Hrh1 to the overall genetic control of responsiveness to PTX is restricted to susceptibility to histamine hypersensitivity and enhancement of antigen-specific delayed-type hypersensitivity responses. Furthermore, the genetic contribution of Bphs/Hrh1 to vasoactive amine sensitization is specific for histamine, since hypersensitivity to serotonin was unaffected by Bphs/Hrh1. Bphs/Hrh1 also did not significantly influence susceptibility to the lethal effects, the leukocytosis response, disruption of glucose regulation, and histamine-independent increases in vascular permeability associated with in vivo intoxication. Nevertheless, significant interstrain differences in susceptibility to the lethal effects of PTX and leukocytosis response were observed. These results indicate that the phenotypic variation in responsiveness to PTX reflects the genetic control of distinct intermediate phenotypes rather than allelic variation in genes controlling overall susceptibility to intoxication.
PMID - 7678492
A rat histamine H1 receptor gene which lacked introns was isolated from a rat genomic library using recently cloned bovine histamine H1 receptor cDNA [Yamashita et al., Proc. Natl. Acad. Sci. USA, 88, 11515-11519 (1991)]. The receptor protein deduced from this isolated gene was composed of 486 amino acids and showed characteristic properties of G protein-coupled receptors. At the 5'-flanking region of the receptor gene, we have located potential TATA box sequences and consensus sequences for the glucocorticoid response element and AP-2 element. After being subcloned into a mammalian expression vector, the isolated gene was transfected to C6 glioma cells. These cells showed significant binding toward [3H]mepyramine. The binding was inhibited by H1 antagonists or histamine. The mode of this binding was comparable to the binding of membranes derived from rat tissues toward [3H]mepyramine. Northern blot analysis detected a 3.0 kb nucleotide band for histamine H1 receptor mRNAs from rat brain and small intestine when these mRNAs were hybridized with the isolated rat H1 gene. The present results demonstrate the isolation of the rat histamine H1 receptor gene.
PMID - 1722337
A functional cDNA clone for the histamine H1 receptor was isolated from a cDNA library of bovine adrenal medulla by a combination of molecular cloning in an expression vector and electrophysiological assay in Xenopus oocytes. The H1 receptor cDNA encodes a protein of 491 amino acids (Mr 55,954) with seven putative transmembrane domains, illustrating the similarity to other receptors that couple with guanine nucleotide-binding regulatory proteins (G protein-coupled receptors). The sequence homology between the H1 and H2 receptors is not higher than that between the histamine H1 and m1-muscarinic receptors. The cloned receptor protein expressed in COS-7 cells bound specifically to [3H]mepyramine, an H1 receptor antagonist, and this binding was displaced by H1 receptor antagonists and histamine with affinities comparable with those in membranes of bovine adrenal medulla. H1 receptor mRNA was shown to be expressed in brain and in peripheral tissues, including lung, small intestine, and adrenal medulla. This investigation discloses the molecular nature of the H1 receptor--a receptor that mediates diverse neuronal and peripheral actions of histamine and that may be of therapeutic importance in allergy.
PMID - 8812432
We report here the molecular cloning of a mouse histamine H1 receptor gene. The protein deduced from the nucleotide sequence is composed of 488 amino acid residues with characteristic properties of GTP binding protein-coupled receptors. Our results suggest that the mouse histamine H1 receptor gene is a single locus, and no related sequences were detected. Interspecific backcross analysis indicated that the mouse histamine H1 receptor gene (Hrh1) is located in the central region of mouse Chromosome 6 linked to microphthalmia (Mitfmi), ras-related fibrosarcoma oncogene 1 (Raf1), and ret proto-oncogene (Ret) in a region of homology with human chromosome 3p.
PMID - 8282735
The histamine H1 receptor gene was isolated from a guinea-pig gene library. The gene contains no introns and encodes a protein of 488 amino acid residues. The structure of the guinea-pig histamine H1 receptor is predicted to contain seven putative transmembrane regions, which are similar to those of receptors coupling with GTP binding proteins. Although the third intracellular domain, the predicted binding site for the GTP binding protein, showed only 50% identity with those of the bovine and rat H1 receptors, the expressed guinea-pig H1 receptor was fully able to bind with [3H]mepyramine. Northern blot analysis indicated that the cerebrum, cerebellum, lung, adrenal, intestine, and heart expressed 3.3 kb guinea-pig H1 receptor mRNA. Expression of histamine H1 mRNA of guinea-pig peripheral organs was greater than that of rat organs, suggesting the high sensitivity of guinea-pig organs as to histamine is due to the contents of histamine H1 receptor mRNA. In addition, the lung, adrenal, intestine, and heart expressed 3.9 kb mRNA. In situ hybridization showed that the hippocampus, cerebral cortex, thalamus, and granular layer of the cerebellum each contained a large amount of histamine H1 receptors. Southern blot analysis showed that there was another gene quite similar to the cloned histamine H1 receptor gene.
PMID - 8003029
The human histamine H1 receptor gene, an intron-lacking gene, was isolated with bovine H1 receptor cDNA [Yamashita, M., Fukui, H., Sugama, K., Horio, Y., Ito, S., Mizuguchi, H., and Wada, H. (1991) Proc. Natl. Acad. Sci. USA 88, 11515-11519] used as a probe. The receptor protein deduced from the nucleotide sequence of this gene was composed of 487 amino acid residues with a calculated Mr of 55,781 and possessed characteristic properties of GTP binding protein-coupled receptors. At the 5' flanking region of the human H1 receptor gene, we located potential TATA box and CACCC sequences, AP1 binding site-like sequences, glucocorticoid responsive element-like sequences, and other binding sequences for inducers. Northern blot analysis showed that H1 receptor mRNAs visualized as two bands with 3.0-kilobase and 3.5-kilobase nucleotides were expressed in peripheral tissues such as placenta, lung, skeletal muscle, and kidney and that only one species with 3.5-kilobase nucleotides was present in the brain. H1 receptor mRNA was most abundant in the placenta. The human H1 receptor gene was mapped to the chromosome 3p25 by the fluorescence in situ hybridization method.
PMID - 12466851
Only a small proportion of the mouse genome is transcribed into mature messenger RNA transcripts. There is an international collaborative effort to identify all full-length mRNA transcripts from the mouse, and to ensure that each is represented in a physical collection of clones. Here we report the manual annotation of 60,770 full-length mouse complementary DNA sequences. These are clustered into 33,409 'transcriptional units', contributing 90.1% of a newly established mouse transcriptome database. Of these transcriptional units, 4,258 are new protein-coding and 11,665 are new non-coding messages, indicating that non-coding RNA is a major component of the transcriptome. 41% of all transcriptional units showed evidence of alternative splicing. In protein-coding transcripts, 79% of splice variations altered the protein product. Whole-transcriptome analyses resulted in the identification of 2,431 sense-antisense pairs. The present work, completely supported by physical clones, provides the most comprehensive survey of a mammalian transcriptome so far, and is a valuable resource for functional genomics.

HUGA_VESVU.abs:
PMID - 8828537
Three known allergens of yellow jacket (Vespula vulgaris) venom are antigen 5, hyaluronidase, and phospholipase. Yellow jacket antigen 5 has been previously cloned and expressed in bacteria; it contains 204 amino acid residues, and it has 69% and 60% sequence identities with the homologous proteins of white-faced hornet (Dolichovespula maculata) and wasp (Polistes annularis), respectively. These studies are now extended to yellow jacket hyaluronidase and phospholipase; they contain 331 and 300 amino acid residues, respectively, and they show 92% and 67% sequence identity with their homologs of white-faced hornet. Tests with the natural and the recombinant vespid allergens in mice indicate partial antigenic cross-reactivity of their homologous proteins at both B- and T-cell levels. There is greater cross-reactivity among hornet and yellow jacket allergens than that among hornet or yellow jacket and wasp allergens. The order of cross-reaction of the three vespid allergens is hyaluronidase > antigen 5 > phospholipase. The continuous (linear) B-cell epitopes of vespid allergens show greater cross-reactivity than their discontinuous epitopes do. The discontinuous B-cell epitopes are immunodominant for all vespid allergens. The low degree of cross-reactivity of the immunodominant discontinuous B-cell epitopes of vespid allergens should be taken into consideration in selection of venoms for immunotherapy of patients with sensitivity to multiple vespids.
PMID - 7876212
White face hornet (Dolichovespula maculata) venom has three known protein allergens which induce IgE response in susceptible people. They are antigen 5, phospholipase A1, and hyaluronidase, also known as Dol m 5, 1, and 2, respectively. We have cloned Dol m 2, a protein of 331 residues. When expressed in bacteria, a mixture of recombinant Dol m 2 and its fragments was obtained. The fragments were apparently generated by proteolysis of a Met-Met bond at residue 122, as they were not observed for a Dol m 2 mutant with a Leu-Met bond. Dol m 2 has 56% sequence identity with the honey bee venom allergen hyaluronidase and 27% identity with PH-20, a human sperm protein with hyaluronidase activity. A common feature of hornet venom allergens is their sequence identity with other proteins in our environment. We showed previously the sequence identity of Dol m 5 with a plant protein and a mammalian testis protein and of Dol m 1 with mammalian lipases. In BALB/c mice, Dol m 2 and bee hyaluronidase showed cross-reactivity at both antibody and T cell levels. These findings are relevant to some patients' multiple sensitivity to hornet and bee stings.
PMID - 7682712
The venom of honeybees, Apis mellifera, contains several biologically active peptides and two enzymes, one of which is a hyaluronidase. By using degenerate oligonucleotides derived from the amino-terminal sequence of this hyaluronidase reported by others, clones encoding the precursor for this enzyme could be isolated from a cDNA library prepared from venom glands of worker bees. The deduced amino acid sequence showed that bee venom hyaluronidase is a polypeptide composed of 349 amino acids containing four cysteines and three potential sites for N-glycosylation. The sequence of the precursor also indicated that the conversion of the pro-enzyme to the end product must involve cleavage of a Thr-Pro bond, a most unusual processing reaction. The mRNA encoding hyaluronidase could also be detected in testes from drones. Expression of the cloned cDNA in Escherichia coli yielded a 41-kDa polypeptide that had hyaluronidase activity. Interestingly, the hyaluronidase from bee venom glands exhibited significant homology to PH-20, a membrane protein of guinea pig sperm involved in sperm-egg adhesion. These structural data support the long-held view that hyaluronidases play a role in fertilization.
PMID - 10998264
A method has been developed which allows the analysis of glycoproteins separated by SDS-PAGE. The procedure, though applicable to N-glycosylated glycoproteins of any origin, is particularly devised for glycoproteins potentially containing fucose in alpha1,3-linkage to the reducing GlcNAc as may be found in plants and invertebrates, e.g., insects and parasitic helminths. Starting with an established procedure for mass spectrometric peptide mapping, the analysis of N-glycans by matrix-assisted laser desorption/ionization mass spectrometry involved the use of peptide:N-glycosidase A, a triphasic microcolumn for sample cleanup, and a new matrix mixture consisting of 2,5-dihyhydroxybenzoic acid, 1-hydroxyisoquinoline, and arabinosazone. The method was tested on proteins with N-glycans of known structure, i.e., as horseradish peroxidase, zucchini ascorbate oxidase, soybean agglutinin, honeybee venom hyaluronidase, bovine ribonuclease B, and bovine fetuin. An electrophoretic band corresponding to 4 microg of glycoprotein was generally sufficient to allow detection of the major N-glycan species. As an additional benefit, a peptide mass map is generated which serves to identify the analyzed protein. The method was applied to glycoprotein allergens whose glycan structures were unknown. Ara h 1 and Ole e 1, major allergens from peanut and olive pollen, respectively, contained mainly xylosylated N-glycans with the composition Man(3(-4))XylGlcNAc(2) in the case of Ara h 1 and GlcNAc(1-2)Man(3)XylGlcNAc(2) in the case of Ole e 1 where also some GlcNAc(0-2)Man(3)XylFucGlcNAc(2) was found.

KBF2_HUMAN.abs:
PMID - 11239468
Processing of the nf(kappa)b2 gene product p100 to generate p52 is an important step in NF-kappaB regulation. We show that this step is negatively regulated by a processing-inhibitory domain (PID) within p100 and positively regulated by the NF-kappaB-inducing kinase (NIK). While the PID suppresses the constitutive processing of p100, NIK induces p100 processing by stimulating site-specific phosphorylation and ubiquitination of this precursor protein. Further, a natural mutation of the gene encoding NIK in alymphoplasia (aly) mice cripples the function of NIK in p100 processing, causing a severe defect in p52 production. These data suggest that NIK is a specific kinase regulating p100 processing and explain why the aly and nf(kappa)b2 knockout mice exhibit similar immune deficiencies.
PMID - 9450761
The NF-kappaB p50/p65 heterodimer is the classical member of the Rel family of transcription factors which regulate diverse cellular functions such as immune response, cell growth, and development. Other mammalian Rel family members, including the proteins p52, proto-oncoprotein c-Rel, and RelB, all have amino-terminal Rel-homology regions (RHRs). The RHR is responsible for the dimerization, DNA binding and cytosolic localization of these proteins by virtue of complex formation with inhibitor kappaB proteins. Signal-induced removal of kappaB inhibitors allows translocation of dimers to the cell nucleus and transcriptional regulation of kappaB DNA-containing genes. NF-kappaB specifically recognizes kappaB DNA elements with a consensus sequence of 5'-GGGRNYYYCC-3' (R is an unspecified purine; Y is an unspecified pyrimidine; and N is any nucleotide). Here we report the crystal structure at 2.9 A resolution of the p50/p65 heterodimer bound to the kappaB DNA of the intronic enhancer of the immunoglobulin light-chain gene. Our structure reveals a 5-base-pair 5' subsite for p50, and a 4-base-pair 3' subsite for p65. This structure indicates why the p50/p65 heterodimer interface is stronger than that of either homodimer. A comparison of this structure with those of other Rel dimers reveals that both subunits adopt variable conformations in a DNA-sequence-dependent manner. Our results explain the different behaviour of the p50/p65 heterodimer with heterologous promoters.
PMID - 9865694
IkappaBalpha regulates the transcription factor NF-kappaB through the formation of stable IkappaBalpha/NF-kappaB complexes. Prior to induction, IkappaBalpha retains NF-kappaB in the cytoplasm until the NF-kappaB activation signal is received. After activation, NF-kappaB is removed from gene promoters through association with nuclear IkappaBalpha, restoring the preinduction state. The 2.3 A crystal structure of IkappaBalpha in complex with the NF-kappaB p50/p65 heterodimer reveals mechanisms of these inhibitory activities. The presence of IkappaBalpha allows large en bloc movement of the NF-kappaB p65 subunit amino-terminal domain. This conformational change induces allosteric inhibition of NF-kappaB DNA binding. Amino acid residues immediately preceding the nuclear localization signals of both NF-kappaB p50 and p65 subunits are tethered to the IkappaBalpha amino-terminal ankyrin repeats, impeding NF-kappaB from nuclear import machinery recognition.
PMID - 1339305
A cDNA corresponding to the 2.6 kb NF-kappa B mRNA species present in a variety of lymphoid cell lines has been molecularly cloned. The deduced 607 amino acid sequence is identical to the sequence of the C-terminal region of 110 kd NF-kappa B protein. A 70 kd protein can be identified in lymphoid cells using antibodies raised against the C-terminal region of p110 NF-kappa B. Comparison of the two-dimensional tryptic peptide maps of the 70 kd protein expressed in cells and the in vitro translated product encoded by the cDNA display extensive homology. The 70 kd protein expressed in bacteria prevents sequence-specific DNA binding of p50-p65 NF-kappa B heterodimer, p50 homodimer, and c-rel. p70 also interferes with transactivation by c-rel and prevents its nuclear translocation. The 70 kd protein, predominantly found in lymphoid cells, is a new member of the I kappa B family of proteins and is referred to as I kappa B gamma.
PMID - 9529257
The NFkappaB1 gene encodes two functionally distinct proteins termed p50 and p105. p50 corresponds to the N terminus of p105 and with p65 (RelA) forms the prototypical NF-kappaB transcription factor complex. In contrast, p105 functions as a Rel-specific inhibitor (IKB) and has been proposed to be the precursor of p50. Our studies now demonstrate that p50 is generated by a unique cotranslational processing event involving the 26S proteasome, whereas cotranslational folding of sequences near the C terminus of p50 abrogates proteasome processing and leads to p105 production. These results indicate that p105 is not the precursor of p50 and reveal a novel mechanism of gene regulation that ensures the balanced production and independent function of the p50 and p105 proteins.
PMID - 9950430
The transcription factor NF-kappaB is composed of homodimeric and heterodimeric complexes of Rel/NF-kappaB-family polypeptides, which include Rel-A, c-Rel, Rel-B, NF-kappaB/p50 and NF-kappaB2/p52 . The NF-kappaB1 gene encodes a larger precursor protein, p105, from which p50 is produced constitutively by proteasome-mediated removal of the p105 carboxy terminus. The p105 precursor also acts as an NFkappaB-inhibitory protein, retaining associated p50, c-Rel and Rel-A proteins in the cytoplasm through its carboxy terminus. Following cell stimulation by agonists, p105 is proteolysed more rapidly and released Rel subunits translocate into the nucleus. Here we show that TPL-2 , which is homologous to MAP-kinase-kinase kinases in its catalytic domain, forms a complex with the carboxy terminus of p105. TPL-2 was originally identified, in a carboxy-terminal-deleted form, as an oncoprotein in rats and is more than 90% identical to the human oncoprotein COT. Expression of TPL-2 results in phosphorylation and increased degradation of p105 while maintaining p50 production. This releases associated Rel subunits or p50-Rel heterodimers to generate active nuclear NF-kappaB. Furthermore, kinase-inactive TPL-2 blocks the degradation of p105 induced by tumour-necrosis factor-alpha. TPL-2 is therefore a component of a new signalling pathway that controls proteolysis of NF-kappaB1 p105.
PMID - 9865693
The inhibitory protein, IkappaBalpha, sequesters the transcription factor, NF-kappaB, as an inactive complex in the cytoplasm. The structure of the IkappaBalpha ankyrin repeat domain, bound to a partially truncated NF-kappaB heterodimer (p50/ p65), has been determined by X-ray crystallography at 2.7 A resolution. It shows a stack of six IkappaBalpha ankyrin repeats facing the C-terminal domains of the NF-kappaB Rel homology regions. Contacts occur in discontinuous patches, suggesting a combinatorial quality for ankyrin repeat specificity. The first two repeats cover an alpha helically ordered segment containing the p65 nuclear localization signal. The position of the sixth ankyrin repeat shows that full-length IkappaBalpha will occlude the NF-kappaB DNA-binding cleft. The orientation of IkappaBalpha in the complex places its N- and C-terminal regions in appropriate locations for their known regulatory functions.
PMID - 1531086
A Rel-related, mitogen-inducible, kappa B-binding protein has been cloned as an immediate-early activation gene of human peripheral blood T cells. The cDNA has an open reading frame of 900 amino acids capable of encoding a 97-kDa protein. This protein is most similar to the 105-kDa precursor polypeptide of p50-NF-kappa B. Like the 105-kDa precursor, it contains an amino-terminal Rel-related domain of about 300 amino acids and a carboxy-terminal domain containing six full cell cycle or ankyrin repeats. In vitro-translated proteins, truncated downstream of the Rel domain and excluding the repeats, bind kappa B sites. We refer to the kappa B-binding, truncated protein as p50B by analogy with p50-NF-kappa B and to the full-length protein as p97. p50B is able to form heteromeric kappa B-binding complexes with RelB, as well as with p65 and p50, the two subunits of NF-kappa B. Transient-transfection experiments in embryonal carcinoma cells demonstrate a functional cooperation between p50B and RelB or p65 in transactivation of a reporter plasmid dependent on a kappa B site. The data imply the existence of a complex family of NF-kappa B-like transcription factors.
PMID - 7510259
NF-kappa B is a transcription factor composed of the p50 and p65 subunits. Recent works identified another human gene which encodes a molecule related to the p50 subunit, termed p50B, p49 or lyt-10. Here, we isolated the cDNA clones encoding chicken p50B/p97 (Lyt-10). The deduced amino acid sequence of the precursor protein, p97, shows conservation of the overall structure, and 86% identity in the Rel homology domain (RHD) and 56% identity in the ankyrin repeat domain (ARD) to human p50B/p97. Expression of this gene is highest in the chicken spleen.
PMID - 1533881
Active NF-kappa B-like transcription complexes are multimers consisting of one or two members of a family of proteins related to the c-Rel proto-oncoprotein. We have isolated a chicken cDNA encoding p105, the precursor protein for the p50 subunit of NF-kappa B. Sequence analysis shows that chicken p105 is approximately 70% identical to the mouse and human p105 proteins, containing the Rel homology domain in its N-terminal 370 amino acids and several ankyrinlike repeats in the C-terminal portion of the protein. The Rel homology domain is particularly highly conserved between chicken and mammalian p50, and an in vitro-synthesized, truncated chicken p105 protein, containing sequences that correspond to the predicted p50 protein, bound to a consensus kappa B site in an electrophoretic mobility shift assay. In v-Rel-transformed chicken spleen cells, v-Rel is found in high-molecular-weight complexes which include cellular proteins of approximately 124 kDa (p124) and 115 kDa (p115). Here we report that in vitro-produced p105 comigrates with p124 from v-Rel-transformed spleen cells and that p105 and p124 appear to be identical by partial proteolytic mapping with V8 protease. Furthermore, both p105 and p50 can complex directly with v-Rel and chicken c-Rel in vitro. However, in vitro association with p105 by v-Rel does not necessarily correlate with transformation, since one nontransforming v-Rel mutant can associate with p105 in vitro.
PMID - 1992489
The DNA binding subunit of nuclear factor kappa B (NF-kappa B), a B-cell protein that interacts with the immunoglobulin kappa light-chain gene enhancer, has been purified from nuclei of human HL-60 cells stimulated with tumor necrosis factor alpha (TNF alpha), and internal peptide sequences were obtained. Overlapping cDNA clones were isolated and sequenced. The encoded open reading frame of about 105 kDa contained at its N-terminal half all six tryptic peptide sequences, suggesting that the 51-kDa NF-kappa B protein is processed from a 105-kDa precursor. An in vitro synthesized protein containing most of the N-terminal half of the open reading frame bound specifically to an NF-kappa B binding site. This region also showed high homology to a domain shared by the Drosophila dorsal gene and the avian and mammalian rel (proto)oncogene products. The level of the 3.8-kilobase mRNA was strongly increased after stimulation with TNF alpha or phorbol ester. Thus, both factors not only activate NF-kappa B protein, as described previously, but also induce expression of the gene encoding the DNA-binding subunit of NF-kappa B.
PMID - 7969179
The NF-kappa B1 subunit of the transcription factor NF-kappa B is derived by proteolytic cleavage from the N terminus of a 105-kDa precursor protein. The C terminus of p105NF-kappa B1, like those of I kappa B proteins, contains ankyrin-related repeats that inhibit DNA binding and nuclear localization of the precursor and confer I kappa B-like properties upon p105NF-kappa B1. Here we report the characterization of two novel NF-kappa B1 precursor isoforms, p84NF-kappa B1 and p98NF-kappa B1, that arise by alternate splicing within the C-terminal coding region of murine nfkb1. p98NF-kappa B1, which lacks the 111 C-terminal amino acids (aa) of p105NF-kappa B1, has a novel 35-aa C terminus encoded by an alternate reading frame of the gene. p84NF-kappa B1 lacks the C-terminal 190 aa of p105NF-kappa B1, including part of ankyrin repeat 7. RNA and protein analyses indicated that the expression of p84NF-kappa B1 and p98NF-kappa B1 is restricted to certain tissues and that the phorbol myristate acetate-mediated induction of p84NF-kappa B1 and p105NF-kappa B1 differs in a cell-type-specific manner. Both p84NF-kappa B1 and p98NF-kappa B1 are found in the nuclei of transfected cells. Transient transfection analysis revealed that p98NF-kappa B1, but not p105NF-kappa B1 or p84NF-kappa B1, acts as a transactivator of NF-kappa B-regulated gene expression and that this is dependent on sequences in the Rel homology domain required for DNA binding and on the novel 35 C-terminal aa of this isoform. In contrast to previous findings, which indicated that p105NF-kappa B1 does not bind DNA, all of the NF-kappa B1 precursors were found to specifically bind with low affinity to a highly restricted set of NF-kappa B sites in vitro, thereby raising the possibility that certain of the NF-kappa B1 precursor isoforms may directly modulate gene expression.
PMID - 9384558
BACKGROUND: Members of the rel/NFkappaB family of transcription factors play a vital role in the regulation of rapid cellular responses, such as those required to fight infection or react to cellular stress. Members of this family of proteins form homo- and heterodimers with differing affinities for dimerization. They share a structural motif known as the rel homology region (RHR), the C-terminal one third of which mediates protein dimerization. Crystal structures of the rel/NFkappaB family members p50 and p65 in their DNA-bound homodimeric form have been solved. These structures showed that the residues from the dimerization domains of both p50 and p65 participate in DNA binding and that the DNA-protein and protein dimerization surfaces form one continuous overlapping interface. We desired to investigate the contribution of DNA to NFkappaB dimerization and to identify the mechanism for the selective association of rel/NFkappaB family peptides into transcriptionally active dimers. RESULTS: We report here the crystal structures of the dimerization domains of murine p50 and p65 at 2.2 A and 2.0 A resolution, respectively. A comparison of these two structures suggests that conservative amino acid changes at three positions are responsible for the differences in their dimer interfaces. The presence of the target DNA does not change the dimer interface of either protein in any significant manner. CONCLUSIONS: These two structures suggest that the rel/NFkappaB family of transcription factors use only a few conservative changes in their amino acid sequences to form a host of dimers with varying affinities for dimerization. Amino acids at positions corresponding to 254, 267, and 307 of murine p50, function as primary determinants for the observed differences in dimerization affinity. The DNA-contacting charged amino acid sidechains from the dimerization domains are held in a similar conformation in both the DNA-bound and free states, therefore, no major structural rearrangement is required to bring these residues into contact with the DNA.
PMID - 2203531
The major determinant in the transcriptional control of class I genes of the major histocompatibility complex is an enhancer sequence located around -170 from the transcription start site, which binds a factor named KBF1. We have isolated a complementary cDNA coding for KBF1 and identified the DNA binding and dimerization domain of the protein. Because KBF1 and the transcription factor NF-kappa B bind to similar sequences, we investigated the relationship between these two molecules. It appeared that KBF1 is, by all criteria used, identical to the 50 kd DNA binding subunit of NF-kappa B. KBF1 (and therefore p50) also displays extensive amino acid sequence homology with the v-rel oncogene and the Drosophila maternal morphogen dorsal. In vitro experiments suggest functional homologies between KBF1 and v-rel.
PMID - 11094166
It has been shown that the molecular mechanism by which cytokines and glucocorticoids mutually antagonize their functions involves a mutual glucocorticoid receptor (GR)/nuclear factor-kappa B (NF-kappa B) transrepression. Here we report a role for the nuclear receptor coactivator RAC3, in modulating NF-kappa B transactivation. We found that RAC3 functions as a coactivator by binding to the active form of NF-kappa B and that overexpression of RAC3 restores GR-dependent transcription neglecting GR/NF-kappa B transrepression. The competition between GR and NF-kappa B for binding to RAC3 may represent a general mechanism by which both transcription factors mutually antagonize their activity.
PMID - 8036016
The NF-kappa b/Rel and I kappa B proteins are important regulators of lymphocyte activation and gene expression. We have identified a rearrangement of the NFKB-2 gene in the HUT 78 human cutaneous T-cell leukemia (CTCL) line, cDNA and genomic DNA sequence predicted the presence of a truncated 80 kD NFKB-2 precursor protein (p80HT), instead of the normal p100 protein. No wild-type allele was identified. Elevated levels of two aberrantly sized RNAs were detected, and high levels of p80HT and processed p52 protein were present in HUT 78 cell nuclei. The p52 protein bound to a palindromic kappa B DNA motif, however p80HT did not. Rearrangement of the NFKB-2 gene was also detected in DNA from two patients with CTCL. Rearrangement and overexpression of the NFKB-2 gene may contribute to the genesis of a subset of T-cell malignancies.
PMID - 7830764
The structure of a large fragment of the p50 subunit of the human transcription factor NF-kappa B, bound as a homodimer to DNA, reveals that the Rel-homology region has two beta-barrel domains that grip DNA in the major groove. Both domains contact the DNA backbone. The amino-terminal specificity domain contains a recognition loop that interacts with DNA bases; the carboxy-terminal dimerization domain bears the site of I-kappa B interaction. The folds of these domains are related to immunoglobulin-like modules. The amino-terminal domain also resembles the core domain of p53.
PMID - 8825636
The NFKB1 gene encodes three proteins of the NF-kappa B/Rel and I kappa B families: p105, p50, and (in mouse) I kappa B-gamma. We determined the complete genomic structure of human NFKB1. NFKB1 spans 156 kb and has 24 exons with introns varying between 40,000 and 323 bp in length. Although NFKB2, which encodes p100 and p52, also has 24 exons and has a comparable exon-intron structure, it is 20 times shorter (8 kb; Fracchiola et al. (1993) Oncogene 8, 2839-2845) than NFKB1. We propose that the long size of NFKB1 is important for transient activation of NF-kappa B complexes containing p50. I kappa B-gamma corresponds to the carboxyl-terminal half of p105. DNA sequence analysis showed that the 3'-end of human intron 11 and the 5'-end of exon 12 of NFKB1 are colinear with the 5'-untranslated region of mouse I kappa B-gamma cDNA. I kappa B-gamma is thus likely to be generated by transcription starting within intron 11 and not by alternative splicing of the mouse mRNA encoding p105 and p50.
PMID - 7969113
NF-kappa B is an inducible transcription factor complex which regulates the expression of a variety of genes which are involved in the immune, inflammatory, and acute-phase responses. The maintenance of NF-kappa B activity in stimulated cells requires ongoing protein synthesis, suggesting several modes of regulation. In this report, we have characterized the transcriptional regulation of one family member, NF-kappa B2. The genomic structure and sequence of NF-kappa B2 revealed the presence of two promoters and at least four kappa B regulatory elements, which mediate responsiveness to phorbol myristate acetate and tumor necrosis factor alpha. Similar to other NF-kappa B family members, NF-kappa B2 is positively autoregulated. In contrast to other family members, we find that kappa B elements in the NFKB2 promoter can also mediate transcriptional repression in the absence of NF-kappa B. We identified a nuclear complex which binds specifically to a subset of kappa B-related sites but not to the canonical kappa B element. Because of its putative inhibitory or repressive effect, this binding activity has been termed Rep-kappa B. This mechanism of repressing basal NF-kappa B2 transcription in an inactivated state enables the cell to tightly control NF-kappa B2 activity. These data demonstrate that a novel mode of kappa B-dependent regulation is mediated by specific kappa B sites in the NFKB2 promoter.
PMID - 8230480
The Rel/NF-kappa B family of proteins includes several interacting cellular transcription factors and the v-Rel oncoprotein of the avian Rev-T retrovirus. We report the isolation of a chicken cDNA for the NF-kappa B p52 precursor protein p100. Full-length p100 only weakly binds DNA in vitro; removal of the ankyrin-like repeats generates C-terminally truncated p100 proteins (like p52) that have an increased ability to bind an oligonucleotide containing a kappa B site. In addition, we show that chicken p100 is identical to a protein previously designated p115, which is found in a complex with v-Rel in transformed chicken spleen cells. Furthermore, p100 and v-Rel can form a complex when synthesized in vitro. Using cDNAs for chicken NF-kappa B p105, NF-kappa B p100, c-Rel, and v-Rel, we show that one of the complexes in v-Rel-transformed spleen cells can be reconstituted in vitro.
PMID - 10469655
The NF-kappaB precursor p105 has dual functions: cytoplasmic retention of attached NF-kappaB proteins and generation of p50 by processing. It is poorly understood whether these activities of p105 are responsive to signalling processes that are known to activate NF-kappaB p50-p65. We propose a model that p105 is inducibly degraded, and that its degradation liberates sequestered NF-kappaB subunits, including its processing product p50. p50 homodimers are specifically bound by the transcription activator Bcl-3. We show that TNFalpha, IL-1beta or phorbolester (PMA) trigger rapid formation of Bcl-3-p50 complexes with the same kinetics as activation of p50-p65 complexes. TNF-alpha-induced Bcl-3-p50 formation requires proteasome activity, but is independent of p50-p65 released from IkappaBalpha, indicating a pathway that involves p105 proteolysis. The IkappaB kinases IKKalpha and IKKbeta physically interact with p105 and inducibly phosphorylate three C-terminal serines. p105 is degraded upon TNF-alpha stimulation, but only when the IKK phospho-acceptor sites are intact. Furthermore, a p105 mutant, lacking the IKK phosphorylation sites, acts as a super-repressor of IKK-induced NF-kappaB transcriptional activity. Thus, the known NF-kappaB stimuli not only cause nuclear accumulation of p50-p65 heterodimers but also of Bcl-3-p50 and perhaps further transcription activator complexes which are formed upon IKK-mediated p105 degradation.
PMID - 9384586
The crystal structure of human NF-kappaB p52 in its specific complex with the natural kappaB DNA binding site MHC H-2 has been solved at 2.1 A resolution. Whereas the overall structure resembles that of the NF-kappaB p50-DNA complex, pronounced differences are observed within the 'insert region'. This sequence segment differs in length between different Rel proteins. Compared with NF-kappaB p50, the compact alpha-helical insert region element is rotated away from the core of the N-terminal domain, opening up a mainly polar cleft. The insert region presents potential interaction surfaces to other proteins. The high resolution of the structure reveals many water molecules which mediate interactions in the protein-DNA interface. Additional complexity in Rel protein-DNA interaction comes from an extended interfacial water cavity that connects residues at the edge of the dimer interface to the central DNA bases. The observed water network might acount for differences in binding specificity between NF-kappaB p52 and NF-kappaB p50 homodimers.
PMID - 8087845
We demonstrate an essential role for the proteasome complex in two proteolytic processes required for activation of the transcription factor NF-kappa B. The p105 precursor of the p50 subunit of NF-kappa B is processed in vitro by an ATP-dependent process that requires proteasomes and ubiquitin conjugation. The C-terminal region of p105 is rapidly degraded, leaving the N-terminal p50 domain. p105 processing can be blocked in intact cells with inhibitors of the proteasome or in yeast with proteasome mutants. These inhibitors also block the activation of NF-kappa B and the rapid degradation of I kappa B alpha induced by tumor necrosis factor alpha. Thus, the ubiquitin-proteasome pathway functions not only in the complete degradation of polypeptides, but also in the regulated processing of precursors into active proteins.
PMID - 7530332
The 2.3-A crystal structure of the transcription factor NK-kappa B p50 homodimer bound to a palindromic kappa B site reveals that the Rel homology region folds into two distinct domains, similar to those in the immunoglobulin superfamily. The p50 dimer envelopes an undistorted B-DNA helix, making specific contacts along the 10-base-pair kappa B recognition site mainly through loops connecting secondary structure elements in both domains. The carboxy-terminal domains form a dimerization interface between beta-sheets using residues that are strongly conserved in the Rel family.
PMID - 12477932
The National Institutes of Health Mammalian Gene Collection (MGC) Program is a multiinstitutional effort to identify and sequence a cDNA clone containing a complete ORF for each human and mouse gene. ESTs were generated from libraries enriched for full-length cDNAs and analyzed to identify candidate full-ORF clones, which then were sequenced to high accuracy. The MGC has currently sequenced and verified the full ORF for a nonredundant set of >9,000 human and >6,000 mouse genes. Candidate full-ORF clones for an additional 7,800 human and 3,500 mouse genes also have been identified. All MGC sequences and clones are available without restriction through public databases and clone distribution networks (see http:mgc.nci.nih.gov).
PMID - 8398903
The p50 subunit of NF-kappa B is derived from the amino terminus of a 105 kilodalton precursor. The p105 carboxyl terminus, which contains ankyrin-like repeats, a feature of I kappa B molecules, regulates the cytoplasmic retention of p105 and inhibits DNA binding by the precursor. Here, we describe an I kappa B protein identical to the carboxyl-terminal region of p105. Probes spanning the COOH terminus but not the rel homology domain of p105 hybridize to a distinct 2.6-kilobase mRNA expressed in a wide range of murine tissues. The nucleotide sequence of complementary DNA clones for this transcript, in vitro translation, and immune precipitation of metabolically labeled cell lysates establish that it encodes a 70 kilodalton protein that corresponds to the COOH-terminal 607 amino acids of p105. p70 suppresses p65 and p75c-rel mediated transactivation of reporter genes under the control of NF-kappa B elements and in vitro can prevent DNA binding of p50 and p75c-rel homodimers to NF-kappa B sites. The ability of p70 to stably associate with p49 and p65 in vitro, but not inhibit DNA binding by these proteins, suggests that the specific inhibitory properties of this I kappa B may reflect its relative affinity for different rel targets. p70 phosphorylated by protein kinase A fails to inhibit DNA binding by p50 or the c-rel protein, and sequencing of radiolabeled p70 tryptic phosphopeptides establishes that protein kinase A phosphorylates serine residue 576 of p70. This finding suggests that the inhibitory activity of p70 can be regulated by signaling via the adenylate cyclase pathway.
PMID - 1876189
The transcription factor NF-kappa B is a protein complex which comprises a DNA-binding subunit and an associated transactivation protein (of relative molecular masses 50,000 (50K) and 65K, respectively). Both the 50K and 65K subunits have similarity with the rel oncogene and the Drosophila maternal effect gene dorsal. The 50K DNA-binding subunit was previously thought to be a unique protein, derived from the 105K gene product (p105). We now report the isolation of a complementary DNA that encodes an alternative DNA-binding subunit of NF-kappa B. It is more similar to p105 NF-kappa B than other family members and defines a new subset of rel-related genes. It is synthesized as approximately 100K protein (p100) that is expressed in different cell types, contains cell cycle motifs and, like p105, must be processed to generate a 50K form. A 49K product (p49) can be generated independently from an alternatively spliced transcript; it has specific kappa B DNA-binding activity and can form heterodimers with other rel proteins. In contrast to the approximately 50K protein derived from p105, p49 acts in synergy with p65 to stimulate the human immunodeficiency virus (HIV) enhancer in transiently transfected Jurkat cells. p49/p100 NF-kappa B could therefore be important in the regulation of HIV and other kappa B-containing genes.
PMID - 2203532
The DNA binding subunit of the transcription factor NF-kappa B, p50, has been cloned. p50 appears to be synthesized as a larger protein that is then processed to its functional size. Sequence analysis reveals remarkable homology for over 300 amino acids at the amino-terminal end to the oncogene v-rel, its cellular homolog c-rel, and the Drosophila maternal effect gene dorsal. This establishes NF-kappa B as a member of the rel family of proteins, all of which display nuclear-cytosolic translocation. Protein sequence from the p65 polypeptide has established that it is not encoded in the same mRNA as p50. However, p65 appears homologous to c-rel, suggesting that c-rel may form heterodimers with p50 and rel may include a homodimerization motif.
PMID - 11297557
The p105 precursor protein of NF-kappaB1 acts as an NF-kappaB inhibitory protein, retaining associated Rel subunits in the cytoplasm of unstimulated cells. Tumor necrosis factor alpha (TNFalpha) and interleukin-1alpha (IL-1alpha) stimulate p105 degradation, releasing associated Rel subunits to translocate into the nucleus. By using knockout embryonic fibroblasts, it was first established that the IkappaB kinase (IKK) complex is essential for these pro-inflammatory cytokines to trigger efficiently p105 degradation. The p105 PEST domain contains a motif (Asp-Ser(927)-Gly-Val-Glu-Thr), related to the IKK target sequence in IkappaBalpha, which is conserved between human, mouse, rat, and chicken p105. Analysis of a panel of human p105 mutants in which serine/threonine residues within and adjacent to this motif were individually changed to alanine established that only serine 927 is essential for p105 proteolysis triggered by IKK2 overexpression. This residue is also required for TNFalpha and IL-1alpha to stimulate p105 degradation. By using a specific anti-phosphopeptide antibody, it was confirmed that IKK2 overexpression induces serine 927 phosphorylation of co-transfected p105 and that endogenous p105 is also rapidly phosphorylated on this residue after TNFalpha or IL-1alpha stimulation. In vitro kinase assays with purified proteins demonstrated that both IKK1 and IKK2 can directly phosphorylate p105 on serine 927. Together these experiments indicate that the IKK complex regulates the signal-induced proteolysis of NF-kappaB1 p105 by direct phosphorylation of serine 927 in its PEST domain.

OTX2_BRARE.abs:
PMID - 7720578
In this paper we study Xotx2, a Xenopus homeobox gene related to orthodenticle, a gene expressed in the developing head of Drosophila. The murine cognate, Otx2, is first expressed in the entire epiblast of prestreak embryos and later in very anterior regions of late-gastrulae, including the neuroectoderm of presumptive fore- and mid-brain. In Xenopus, RNase protection experiments reveal that Xotx2 is expressed at low levels throughout early development from unfertilized egg to late blastula, when its expression level significantly increases. Whole-mount in situ hybridization shows a localized expression in the dorsal region of the marginal zone at stage 9.5. At stage 10.25 Xotx2 is expressed in dorsal bottle cells and in cells of the dorsal deep zone fated to give rise to prechordal mesendoderm, suggesting a role in the specification of very anterior structures. In stage 10.5 gastrulae, Xotx2 transcripts start to be detectable also in presumptive anterior neuroectoderm, where they persist in subsequent stages. Various treatments of early embryos cause a general reorganization of Xotx2 expression. In particular, retinoic acid treatment essentially abolishes Xotx2 expression in neuroectoderm. Microinjection of Xotx2 mRNA in 1-, 2- and 4-cell stage embryos causes the appearance of secondary cement glands and partial secondary axes in embryos with reduced trunk and tail structures. The presence of the Xotx2 homeodomain is required to produce these effects. In particular, this homeodomain contains a specific lysine residue at position 9 of the recognition helix. Microinjected transcripts of Xotx2 constructs containing a homeodomain where this lysine is substituted by a glutamine or a glutamic acid residue fail to cause these effects.
PMID - 7898305
Three zebrafish otx homeoproteins containing a homeodomain homologous to that of the Drosophila orthodenticle head gap gene product have been identified by cloning and sequencing of cDNAs. The zebrafish otx2 homeoprotein shares high amino-acid sequence identity with the mouse Otx2 homeoprotein, whereas the zebrafish otx1 and otx3 homeoproteins exhibit moderate homology with the mouse Otx1 and Otx2 homeoproteins. Three otx homeoprotein mRNAs show different spatio-temporal expression patterns during zebrafish embryogenesis as revealed by Northern blot and whole mount in situ hybridization analyses. Large amounts of the otx1 homeoprotein mRNA are found in fertilized uncleaving eggs. The otx3 homeoprotein mRNA appears in the embryonic shield, the site of the organizer. In the developing brain, three zebrafish otx mRNAs are distributed in the diencephalon and the midbrain, but their fine expression patterns are different. These results suggest that three zebrafish otx homeoproteins, alone or in combination, may play roles in very early embryogenesis, gastrulation, and the development and subdivision of the diencephalon and the midbrain.
PMID - 8613727
Dissociated primary cultures from rat telencephalon at different developmental stages were used to study the effect of basic fibroblast growth factor (FGF2) on Otx2, Dlx1, and Emx1, three homeobox genes expressed in different regions of the developing mammalian forebrain. At embryonic day (E)13.5. the regional pattern of expression of Otx1, Otx2, Dlx1, Dlx2, Dlx5, and Emx1 is maintained in primary culture, suggesting that cells are already committed to a regional identity at this stage. In these cultures, Otx2 is expressed by precursor cells, whereas Dlx1 and Emx1 are predominantly expressed by postmitotic cells. We found that FGF2 increased Otx2 expression within precursor cells and the total number of Otx2-expressing cells. This effect was gene-specific, dose-dependent, and temporally regulated, with larger effects at earlier stages of development (E11.5). At E13.5, the effect of FGF2 on Otx2 expression was restricted to the basal telencephalon. Our results suggest that a restricted population of neuroblasts respond to FGF2 in a temporally regulated fashion by proliferating and increasing Otx2 expression. This interaction between FGF2 and Otx2 may be important for the regulation of neurogenesis in the forebrain.
PMID - 8235591
The Spermann organizer induces neural tissue from dorsal ectoderm and dorsalizes lateral and ventral mesoderm in Xenopus. The secreted factor noggin, which is expressed in the organizer, can mimic the dorsalizing signal of the organizer. Data are presented showing that noggin directly induces neural tissue, that it induces neural tissue in the absence of dorsal mesoderm, and that it acts at the appropriate stage to be an endogenous neural inducing signal. Noggin induces cement glands and anterior brain markers, but not hindbrain or spinal cord markers. Thus, noggin has the expression pattern and activity expected of an endogenous neural inducer.
PMID - 7893604
To analyze the molecular mechanism of pattern formation in the anteriormost regions of the zebrafish embryo, we isolated two zebrafish sequences, zOtx1 and zOtx2, related to the Drosophila orthodenticle (otd) and two murine Otx genes. zOtx1 and zOtx2 encode predicted gene products which are 82% and 94% identical to the corresponding mouse proteins. Transcripts of both zebrafish genes appear abruptly at high levels in a triangular patch at the animal pole of the mid-gastrula, a region which contains cells fated to become midbrain and forebrain. Between 9 and 14 h of development, zOtx transcripts disappear from forebrain regions in a manner characteristic for each gene, and from 14 to 24 h, particular regions of the forebrain and midbrain express one or both genes. The posterior limit of expression of both genes in 10-30-h embryos forms a sharp boundary at the posterior border of the midbrain. As in the mouse, the early expression patterns of the zOtx genes are consistent with a role in defining midbrain and forebrain territories. However, there are a number of interesting differences between the forebrain and midbrain regions which express the genes in the two species.
PMID - 7931541
Within the cerebral and cerebellar cortices, neurons are organized in layers that segregate neurons with distinctive morphologies and axonal connections, and areas or regions that correspond to distinct functional domains. To explore the molecular underpinnings of pattern formation in layered regions of the CNS, we have characterized the patterns of expression of two homeodomain genes, Otx1 and Otx2, by in situ hybridization during embryonic and postnatal development in the rat. Otx1 and Otx2 are vertebrate homologs of the Drosophila gap gene orthodenticle, and are expressed during the development of the murine CNS (Simeone et al., 1992). Here we report that Otx1 mRNA is expressed in a subpopulation of neurons within cortical layers 5 and 6 during postnatal and adult life. This gene is also expressed by the precursors of deep-layer neurons within the developing cerebral ventricular zone, but is apparently downregulated by the progenitors of upper-layer neurons; Otx1 is never expressed by the neurons of layers 1-4. The spatial and temporal patterns suggest that Otx1 may play a role in the specification or differentiation of neurons in the deep layers of the cerebral cortex. Within the cerebellum, mRNAs for Otx1 and Otx2 are found within the external granular layer (EGL), but in three spatially distinct domains. During postnatal development, Otx1 is expressed within anterior cerebellar lobules; Otx2 mRNA is localized posteriorly, and a region of overlap in mid-cerebellum defines a third domain in which both genes are expressed. The boundaries of Otx1 and Otx2 expression correspond to the major functional boundaries of the cerebellum, and define the vestibulocerebellum, spinocerebellum, and pontocerebellum, respectively. Spatially restricted patterns of hybridization are observed early in postnatal life, at times that correspond roughly to the invasion of spinal and pontine afferents into the cerebellum (Arsénio-Nunes and Sotelo, 1985; Mason, 1987). These results raise the possibility that Otx1 and Otx2 play a role in cerebellar regionalization during early development.
PMID - 1353865
Insight into the genetic control of the identity of specific regions along the body axis of vertebrates has resulted primarily from the study of vertebrate homologues of regulatory genes operating in the Drosophila trunk, but little is known about the development of most anterior regions of the body either in flies or vertebrates. Three Drosophila genes have been identified that are important in controlling the development of the head, two of which, empty spiracles and orthodenticle, have been cloned and shown to contain a homeobox. We previously cloned and characterized Emx1 and Emx2, two mouse genes related to empty spiracles that are expressed in restricted regions of the developing forebrain, including the presumptive cerebral cortex and olfactory bulbs. Here we report the identification of Otx1 and Otx2, which are related to orthodenticle. We have compared the expression domains of the four genes in the developing rostral brain of mouse embryos at a developmental stage, day 10 post coitum, when they are all expressed. Otx2 is expressed in every dorsal and most ventral regions of telencephalon, diencephalon and mesencephalon. The Otx1 expression domain is similar to that of Otx2, but contained within it. The Emx2 expression domain is comprised of dorsal telencephalon and small diencephalic regions, both dorsally and ventrally. Finally, Emx1 expression is exclusively confined to the dorsal telencephalon. Thus at the time when regional specification of major brain regions takes place, the expression domains of the four genes seem to be continuous regions contained within each other in the sequence Emx1 less than Emx2 less than Otx1 less than Otx2.
PMID - 12477932
The National Institutes of Health Mammalian Gene Collection (MGC) Program is a multiinstitutional effort to identify and sequence a cDNA clone containing a complete ORF for each human and mouse gene. ESTs were generated from libraries enriched for full-length cDNAs and analyzed to identify candidate full-ORF clones, which then were sequenced to high accuracy. The MGC has currently sequenced and verified the full ORF for a nonredundant set of >9,000 human and >6,000 mouse genes. Candidate full-ORF clones for an additional 7,800 human and 3,500 mouse genes also have been identified. All MGC sequences and clones are available without restriction through public databases and clone distribution networks (see http:mgc.nci.nih.gov).
PMID - 8101484
We studied the expression of two vertebrate homeobox genes, Otx1 and Otx2, related to orthodenticle, a gene expressed in the developing head of Drosophila. Both genes are expressed in restricted regions of the developing rostral brain including the presumptive cerebral cortex and olfactory bulbs. The expression patterns of the two genes in diencephalon suggest that they both have a role in establishing the boundary between presumptive dorsal and ventral thalamus. They are also expressed in regions of the developing olfactory, auricolar and ocular system, including the covering of the optic nerve. Otx1 expression is detectable from day 8 of gestation in telencephalic, diencephalic and mesencephalic regions. From day 10.5 of gestation its expression extends to some metencephalic areas. Otx2 appears to be already expressed in the epiblast of prestreak embryos. It persists in the entire embryonic ectoderm for some time after the onset of gastrulation. In midstreak embryos its expression appears progressively restricted to the anterior embryonic ectoderm corresponding to presumptive fore- and mid-brain. In early midgestation embryos it is expressed in telencephalic, diencephalic and mesencephalic regions but from day 11.75 of gestation its expression disappears from dorsal telencephalon and is confined to diencephalic and mesencephalic regions. Otx2 is one of the earliest genes expressed in the epiblast and immediately afterwards is expressed in anterior neuroectoderm, demarcating rostral brain regions even before headfold formation. Its gene product contains a homeodomain of the bicoid class and is able to recognize and transactivate a bicoid target sequence.

PENK_XENLA.abs:
PMID - 1721992
Enkephalins are opiate peptides found in a variety of tissues including brain and pituitary. In brain, they function as neurotransmitters, neuromodulators and neurohormones. Recent studies show that proenkephalin mRNA is expressed early in development both in mammals and the amphibian, suggesting that enkephalins may play a unique role in embryogenesis. In order to characterize factors which regulate the onset and patterning of expression of this gene in adult and developing frog embryos, the proenkephalin A gene was cloned from Xenopus laevis. The clones have been characterized by DNA sequencing and restriction endonuclease mapping. The gene is made up of three exons which span approximately 12 kb. Exon I encodes the 5' untranslated region of the mRNA. Exon II contains the signal peptide and the N terminus of the mature protein. Biologically active opioid peptides are generated from exon III. Comparison to mammalian proenkephalin genomic sequence indicated that nucleotide sequences of the 5' flanking region, noncoding exon I and exon II were not well conserved but exon III was highly conserved. Primer extension and RNase protection assay analyses of the RNA transcripts revealed two major 5' ends. The putative TATA box, CAAT box, CRE and Pit 1 elements have been identified on this gene by sequence homology to published consensus sequences. To assay for sequences that could potentially regulate Xenopus proenkephalin expression, we transfected constructs that contained upstream genomic sequences linked to the CAT reporter gene into various eukaryotic cell lines. The expression of the fusion gene constructs were detected and could be induced 10- to 30-fold upon treatment with forskolin.(ABSTRACT TRUNCATED AT 250 WORDS)
PMID - 7057924
The nucleotide sequence of a complete cDNA copy of enkephalin precursor mRNA from human phaeochromocytoma is reported. The corresponding amino acid sequence shows that the precursor is 267 amino acids long and contains six interspersed Met-enkephalin sequences and one Leu-enkephalin sequence. Five of the seven enkephalins are flanked on both sides by pairs of basic amino acid residues. The precursors does not contain the sequences of the opioid peptides, dynorphin, alpha-neo-endorphin or beta-endorphin.
PMID - 12477932
The National Institutes of Health Mammalian Gene Collection (MGC) Program is a multiinstitutional effort to identify and sequence a cDNA clone containing a complete ORF for each human and mouse gene. ESTs were generated from libraries enriched for full-length cDNAs and analyzed to identify candidate full-ORF clones, which then were sequenced to high accuracy. The MGC has currently sequenced and verified the full ORF for a nonredundant set of >9,000 human and >6,000 mouse genes. Candidate full-ORF clones for an additional 7,800 human and 3,500 mouse genes also have been identified. All MGC sequences and clones are available without restriction through public databases and clone distribution networks (see http:mgc.nci.nih.gov).
PMID - 6547769
The structures of the genes coding for the opioid peptide precursors proopiomelanocortin, proenkephalin (proenkephalin A) and prodynorphin (proenkephalin B), are known for some mammalian species. To gain insight into the evolutionary history of these precursors, we have examined the proenkephalin gene in the South African clawed toad, Xenopus laevis, which diverged from the principal line of vertebrate evolution some 350 Myr ago. The human proenkephalin gene consists of four exons, of which the main exon (exon IV) contains all known biologically active peptides--six Met-enkephalin sequences and one Leu-enkephalin sequence. We report here the primary structures of the putative main exons of two proenkephalin genes in X. laevis, each of which codes for seven Met-enkephalin sequences but no Leu-enkephalin, indicating that Met-enkephalin preceded Leu-enkephalin in the evolution of the proenkephalin gene. The organization of the main exons of the toad genes is remarkably similar to that of the human gene and conserved regions provide evidence for functionally significant structures. We also detect a polymorphism in one of the toad proenkephalin genes, mapping 1.5 kilobases (kb) 5' of the main exon; it is caused by an insertion/deletion of a 1-kb repetitive sequence which has the characteristics of a transposable element.
PMID - 9126357
Mass spectrometry of fragments produced by limited proteolytic digestion of pro-enkephalin was used to locate the disulfide bridges in synenkephalin (pro-enkephalin 1-73), a domain which contains sorting information for targeting the pro-neuropeptide to the granules of the regulated secretory pathway in neuroendocrine cells. Mass spectrometric analysis was optimized by using chemicals that gave low interference with the ionization and desorption processes, and computer software which simplified the identification of all possible disulfide-linked peptide fragments. Three disulfide bridges between Cys2-Cys24, Cys6-Cys28, and Cys9-Cys41 were identified. Protein conformational prediction of synenkephalin1-42 shows beta-turns which facilitate the formation of these disulfide bonds.

PRO2_HUMAN.abs:
PMID - 7758455
The biochemical characteristics of a new human profilin isoform are described. We refer to this recently described isoform as profilin II (isoelectric point 5.9) in comparison to profilin I (pI 8.4). We expressed both isoforms in bacteria and compared their actin-binding properties, binding to poly(L-proline), affinities for phosphatidylinositol 4,5-bisphosphate [PtdIns(4,5)P2], and their effects on nucleotide exchange on actin. Profilin I and profilin II have similar affinities for PtdIns(4,5)P2 and poly(L-proline), and both accelerate nucleotide exchange on monomeric actin to the same extent. However, the affinity of profilin I for monomeric actin is about five times higher than the affinity of profilin II for actin. Potential structural differences of profilin I and profilin II that might explain the difference in actin binding are discussed.
PMID - 7601111
We purified profilin from bovine brain and were able to separate the two isoforms present in this tissue. Since functional characteristics for profilin II are lacking, we assayed the actin, the phosphatidylinositol 4,5-bisphosphate and the poly(L-proline) binding properties of this isoform. Profilin II binds actin with a similar affinity to that of profilin I, although it inhibits actin polymerization more strongly than profilin I under non-equilibrium conditions. Profilin II also binds the anionic phospholipid phosphatidylinositol 4,5-bisphosphate. Profilin II binds to poly(L-proline) more strongly than does profilin I; this is especially evident at more acidic pH values. This difference is explained by an amino acid exchange in the carboxy-terminal part of the protein which has been implicated in poly(L-proline) binding [Björkegren, C., Rozycki, M., Schutt, C., Lindberg, U. & Karlsson, R. (1993) FEBS Lett. 333, 123-126; Metzler, W., Bell, A., Ernst, E., Lavoie, T. & Mueller, L. (1994) J. Biol. Chem. 369, 4620-4625].
PMID - 12477932
The National Institutes of Health Mammalian Gene Collection (MGC) Program is a multiinstitutional effort to identify and sequence a cDNA clone containing a complete ORF for each human and mouse gene. ESTs were generated from libraries enriched for full-length cDNAs and analyzed to identify candidate full-ORF clones, which then were sequenced to high accuracy. The MGC has currently sequenced and verified the full ORF for a nonredundant set of >9,000 human and >6,000 mouse genes. Candidate full-ORF clones for an additional 7,800 human and 3,500 mouse genes also have been identified. All MGC sequences and clones are available without restriction through public databases and clone distribution networks (see http:mgc.nci.nih.gov).
PMID - 8365484
We have isolated a 1.7 kbp cDNA encoding a 140 amino acid protein (15.1 kDa, pI 5.91) with a high sequence similarity (62%) to human profilin (profilin I). We have termed this variant profilin II. Northern blot analysis showed that profilin II is highly expressed in brain, skeletal muscle and kidney and less strongly in heart, placenta, lung and liver. In addition, three different transcript lengths were detected. Only one transcript of profilin I was found. The expression level of this was low in brain and skeletal muscle, medium in heart and high in placenta, lung, liver and kidney.
PMID - 11027290
We deduced the structure of the mouse profilin II gene. It contains five exons that can generate four different transcripts by alternative splicing. Two transcripts encode different profilin II isoforms (designated IIa and IIb) that have similar affinities for actin but different affinities for polyphosphoinositides and proline-rich sequences. Profilins IIa and IIb are also present in humans, suggesting that all mammals have three profilin isoforms. Profilin I is the major form in all tissues, except in the brain, where profilin IIa is most abundant. Profilin IIb appears to be a minor form, and its expression is restricted to a limited number of tissues, indicating that the alternative splicing is tightly regulated. Western blotting and whole-mount in situ hybridization show that, in contrast to the expression of profilin I, the expression level of profilin IIa is developmentally regulated. In situ hybridization of adult brain sections reveals overlapping expression patterns of profilins I and IIa.
PMID - 10600384
Human profilins are multifunctional, single-domain proteins which directly link the actin microfilament system to a variety of signalling pathways via two spatially distinct binding sites. Profilin binds to monomeric actin in a 1:1 complex, catalyzes the exchange of the actin-bound nucleotide and regulates actin filament barbed end assembly. Like SH3 domains, profilin has a surface-exposed aromatic patch which binds to proline-rich peptides. Various multidomain proteins including members of the Ena/VASP and formin families localize profilin:actin complexes through profilin:poly-L-proline interactions to particular cytoskeletal locations (e.g. focal adhesions, cleavage furrows). Humans express a basic (I) and an acidic (II) isoform of profilin which exhibit different affinities for peptides and proteins rich in proline residues. Here, we report the crystallization and X-ray structure determination of human profilin II to 2.2 A. This structure reveals an aromatic extension of the previously defined poly-L-proline binding site for profilin I. In contrast to serine 29 of profilin I, tyrosine 29 in profilin II is capable of forming an additional stacking interaction and a hydrogen bond with poly-L-proline which may account for the increased affinity of the second isoform for proline-rich peptides. Differential isoform specificity for proline-rich proteins may be attributed to the differences in charged and hydrophobic residues in and proximal to the poly-L-proline binding site. The actin-binding face remains nearly identical with the exception of five amino acid differences. These observations are important for the understanding of the functional and structural differences between these two classes of profilin isoforms.
PMID - 11034907
Profilins are a conserved family of proteins participating in actin dynamics and cell motility. In the mouse, two profilin genes are known. Profilin I is expressed universally at high levels, while profilin II is expressed mainly in the brain. Here we describe the occurrence of two mouse profilin II isoforms, A and B, which are derived by alternative splicing. They are identical through residue 107 of the protein, but then have distinct C-terminal sequences. Profilin IIA binds to poly-L-proline and actin with high affinity similar to profilin I. Profilin IIB on the other hand does not bind to actin and the affinity for poly-L-proline is greatly diminished. However, tubulin was found to bind to GST-profilin IIB, and in vivo GFP-profilin IIB was recruited to spindles and asters during mitosis in HeLa cells. Our results indicate unexpected diversity in the functions of the profilin family of proteins, and suggest that in mouse profilin IIA is intimately involved in actin dynamics, while profilin IIB associates with other cytoskeletal components.

RBS_TOBAC.abs:
PMID - 3478552
We have cloned and sequenced all five members of the gene family for the small subunit (rbcS) of ribulose-1,5-bisphosphate carboxylase/oxygenase from tomato, Lycopersicon esculentum cv. VFNT LA 1221 cherry line. Two of the five genes, designated Rbcs-1 and Rbcs-2, are present as single genes at individual loci. Three genes, designated Rbcs-3A, Rbcs-3B and Rbcs-3C, are organized in a tandem array within 10 kb at a third independent locus. The Rbcs-2 gene contains three introns; all the other members of the tomato gene family contain two introns. The coding sequence of Rbcs-1 differs by 14.0% from that of Rbcs-2 and by 13.3% from that of Rbcs-3 genes. Rbcs-2 shows 10.4% divergence from Rbcs-3. The exon and intron sequences of Rbcs-3A are identical to those of Rbcs-3C, and differ by 1.9% from those of Rbcs-3B. Nucleotide sequence analysis suggests that the five rbcS genes encode four different precursors, and three different mature polypeptides. S1 nuclease mapping of the 5' end of rbcS mRNAs revealed that the mRNA leader sequences vary in length from 8 to 75 nucleotides. Northern analysis using gene-specific oligonucleotide probes from the 3' non-coding region of each gene reveals a four to five-fold difference among the five genes in maximal steady-state mRNA levels in leaves.
PMID - 3557127
We have isolated and sequenced two cDNA clones (LESS5 and LESS17) encoding the small subunit of ribulose-1,5-bisphosphate carboxylase of tomato (Lycopersicon esculentum). At the nucleotide level, the protein-coding regions of these genes are 85% conserved, while the untranslated 3' regions are only 55% conserved. Comparison with rbcS genes from other species of Solanaceae suggests that the tomato LESS5 gene, the Nicotiana tabacum NTSS23 gene and the Petunia hybrida SSU8 gene are orthologous members of the rbcS gene family. In addition, the tomato gene LESS17, and the Petunia hybrida gene SSU611, may also be orthologous, since their untranslated 3' regions are related. There is a large difference between the two tomato rbcS genes in the frequency of the CG dinucleotide. This difference may reflect different levels of methylation, and therefore expression, of the tomato genes. Many of the differences involving the CG dinucleotide can be represented as transitions between C and T on the noncoding strand. Such changes are consistent with observations that methylated cytosines are hot-spots for transitions.
PMID - 8299958
We determined the nucleotide sequences of five members of the rbcS gene family encoding the small subunit (SSU) of ribulose-1,5-bisphosphate carboxylase/oxygenase (Rubisco) of potato. The genomic organization, structure and expression of the genes is compared to the features of the rbcS genes in tomato. Within the two species, Lycopersicon esculentum and Solanum tuberosum, both members of the Solanaceae, the rbcS genes share more interspecific sequence identity, especially in the 5'- and 3'-untranslated regions and the intron sequences, than within one species. However, the expression data of orthologous rbcS genes containing highly identical regulatory cis-acting elements were found to be different, suggesting that the simple finding of these motifs does not automatically imply similar transcriptional and/or post-transcriptional gene regulation.
PMID - 8425051
DNase I footprinting assays were used to map sites of DNA-protein interaction in the promoter regions of three of the five genes encoding the small subunit of ribulose-1,5-bisphosphate carboxylase (rbcS) in tomato. Organ-specific differences in DNase I protection patterns were observed using nuclear extracts derived from cotyledons, leaves, young fruit, mature fruit, and roots of tomato, implying that organ-specific transcription of these genes is controlled at the level of DNA-protein interaction. The three genes, designated rbcS1, -2 and -3A are similarly expressed in cotyledons of dark-grown seedlings, in immature tomato fruit, and in leaves under conditions of water stress. These three genes share at least three DNA sequence motifs, including the G-box sequence, which are apparently not present in the other two tomato rbcS genes. We find protection of one or more of these sequences in the aforementioned organs, indicating that the corresponding DNA-binding proteins could function in directing differential expression of the genes, although functional studies would be required to establish this point. While most of the DNase I protections encompass previously identified conserved sequence motifs and their flanking sequence, we also observe protection of additional sequences, many of which occur in the region of the transcription start site.
PMID - 4000958
We have cloned and sequenced a gene for the small subunit (SS) of ribulose bis-phosphate carboxylase-oxygenase from Nicotiana tabacum. The tobacco gene is most closely related to the SS genes from the dicots soybean and pea, and less so to the monocots wheat and Lemna; the deduced amino acid sequence of the mature protein is in all cases more closely conserved than is its chloroplast transit sequence. Unlike the genomic sequences of the two monocots, which have one intron, and the two other dicots, which have two introns, the tobacco gene has three introns. The third tobacco intron lies within a highly conserved region of the protein. Its position coincides with the boundary of a 12 amino acid insertion in the SS genes of higher plants, relative to those of blue green algae. The 5' flanking end of the gene carries 67 bp inverted repeats, which flank a series of eight direct repeats; the direct repeats themselves each carry inverted repeats. The 3' untranslated end of this gene differs by only 2 bp from that of an N. sylvestris SS gene.
PMID - 1512238
The structure of the unactivated form of ribulose-1,5-bisphosphate carboxylase/oxygenase was refined at a resolution of 2.0 A to an R-factor of 17.1%. The previous model (Chapman et al., 1988) was extensively rebuilt, and the small subunit was retraced. The refined model consists of residues 22-63 and 69-467 of the large subunit and the complete small subunit. A striking feature of the model is that several loops have very high B-factors, probably representing mobile regions of the molecule. An examination of the intersubunit contacts shows that the L8S8 hexadecamer is composed of four L2 dimers. The dominant contacts between these L2 dimers are formed by the small subunits. This suggests that the small subunits may be essential for maintaining the integrity of the L8S8 structure. The active site shows differences between the unactivated form and the quaternary complex. In particular, Lys334 has moved out of the active site by about 10A. This residue lies on loop 6 of the alpha beta barrel, which is a particularly mobile loop. The site of ribulose-1,5-bisphosphate carboxylase/oxygenase activation is well ordered in the absence of the carbamylation of Lys201 and Mg2+ binding. The residues are held poised by a network of hydrogen bonds. In the unactivated state, the active site is accessible to substrate binding.
PMID - 3012537
The nuclear gene sequences encoding RBCS, the small subunit of ribulose-1,5-bisphosphate carboxylase/oxygenase (EC 4.1.1.39) from several plants show extensive interspecific divergence but little intraspecific divergence, suggesting that these genes are evolving in concert within a genome. In this study, the nucleotide sequences of two tomato (Lycopersicon esculentum) RBCS genes and a cDNA clone containing the entire coding region of a third tomato RBCS gene were determined. The three genes, designated Rbcs-1, Rbcs-2A, and Rbcs-3A, each belong to a different one of the three RBCS loci in the tomato genome. The nucleotide sequence of Rbcs-1 differs from that of Rbcs-2A and Rbcs-3A by 13.9% and 13.1%, respectively. Rbcs-2A and Rbcs-3A differ from each other by 10.7%. A recently published RBCS gene sequence from tobacco (Nicotiana tabacum) [Mazur, B. J. & Chui, C.-F. (1985) Nucleic Acids Res. 13, 2373-2386] differs by 10.6% and 11.3% from Rbcs-2A and Rbcs-3A, respectively, and by 15.0% from Rbcs-1. Thus the tobacco gene seems to be phylogenetically as closely related to the tomato genes Rbcs-2A and Rbcs-3A as the latter two are to each other, and more closely related to them than Rbcs-1 is. However, the mature part of the polypeptide encoded by the tobacco RBCS gene differs by five and six amino acids from the corresponding region in the polypeptides encoded by Rbcs-2A and Rbcs-3A, respectively, while these two tomato RBCS polypeptides differ from each other in the mature part by a single amino acid. Rbcs-1, whose nucleotide sequence shows higher divergence from both the tobacco RBCS gene and Rbcs-2A and Rbcs-3A, encodes a polypeptide whose mature part differs by eight amino acids from the corresponding region in the tobacco polypeptide but only by three and four amino acids from the corresponding regions of Rbcs-2A- and Rbcs-3A-encoded polypeptides, respectively. Thus, it appears that in the tomato selection has maintained near uniformity of the coding information in the portion of the RBCS genes encoding the mature polypeptides.
PMID - 3684569
Genomic clones containing three genes for the small subunit (SSU) of ribulose bisphosphate carboxylase were isolated from tobacco. Detailed analysis was performed on two of these clones to give a clearer picture of this multigene family in tobacco. This analysis demonstrated that one of the clones contained a pseudogene that was unusual in that it was transcriptionally active. This is the first transcriptionally active pseudogene that has been reported in plants. In addition, another clone was found to contain coding sequences which are 100% homologous to a previously-cloned tobacco SSU gene (Mazur, B.J. and Chiu, C-F. [1985] Nuc. Acids Res. 13, 2372-2386), indicating that gene duplication and/or gene conversion may have played a role in the evolution of the tobacco SSU family.
PMID - 6549380
The combination of cDNA and RNA sequencing techniques has enabled determination of the complete sequence of one of the mRNAs coding for the precursor of the small subunit of ribulose bisphosphate carboxylase of Nicotiana sylvestris. In this 898-nucleotide-long mRNA, 540 nucleotides code for the entire 180-amino-acid-long precursor polypeptide consisting of the 57-amino acid-long transit peptide and the 123-amino-acid-long mature protein, while 60 and 195 nucleotides belong to the 5' and 3' noncoding flanking regions, respectively. The 5' end, which is very rich in AG residues, contains several direct and indirect repeated sequences, and a possible hairpin structure. The 3' end, terminated by a 103-nucleotide-long poly-A tail, is very rich in AU residues but does not contain the classical polyadenylation signal sequence.
PMID - 3422467
Five genes of the rbcS gene family of Solanum tuberosum (potato) were studied. One of these is a cDNA clone; the other four are located on two genomic clones representing two different chromosomal loci containing one (locus 1) and three genes (locus 2), respectively. The intron/exon structure of the three genes in locus 2 is highly conserved with respect to size and position. These genes contain two introns, whereas the gene from locus 1 contains three introns. Although in most cases the amino acid sequences in the transit peptide part of different rbcS genes from the same species varied considerably more than the corresponding mature amino acid sequences, one exception found in tomato and potato indicates that the transit peptide of rbcS could have a special function. A comparison of the rbcS genes of higher plants with those of prokaryotes offers suggestive evidence that introns first served as spacer material in the process of exon shuffling and then were removed stepwise during the evolution of higher plants.

SEP7_HUMAN.abs:
PMID - 10731132
The fly Drosophila melanogaster is one of the most intensively studied organisms in biology and serves as a model system for the investigation of many developmental and cellular processes common to higher eukaryotes, including humans. We have determined the nucleotide sequence of nearly all of the approximately 120-megabase euchromatic portion of the Drosophila genome using a whole-genome shotgun sequencing strategy supported by extensive clone-based sequence and a high-quality bacterial artificial chromosome physical map. Efforts are under way to close the remaining gaps; however, the sequence is of sufficient accuracy and contiguity to be declared substantially complete and to support an initial analysis of genome structure and preliminary gene annotation and interpretation. The genome encodes approximately 13,600 genes, somewhat fewer than the smaller Caenorhabditis elegans genome, but with comparable functional diversity.
PMID - 9022087
Glycoprotein (GP) Ib is a major component of the platelet membrane receptor for von Willebrand factor, designated the GP Ib-IX-V complex. GP Ib is composed of two subunits (GP Ib(alpha) and GP Ib(beta)) each synthesized from separate genes. The 206 amino acid precursor of GP Ib(beta) is synthesized from a 1.0-kb mRNA expressed by megakaryocytes and was originally characterized from cDNA clones of human erythroleukemic (HEL) cell mRNA, a cell line exhibiting megakaryocytic-like properties. The cell line CHRF-288-11 also exhibits megakaryocytic-like properties, but synthesizes two related GP Ib(beta) mRNA species of 3.5 and 1.0 kb. We performed cDNA cloning experiments to identify the origin of the 3.5-kb transcript and determine its relationship to the 1.0-kb GP Ib(beta) mRNA found in megakaryocytes, platelets, and HEL cells. Our cloning experiments demonstrate that the longer transcript results from a nonconsensus polyadenylation recognition sequence, 5'AACAAT3', within a separate gene located upstream to the platelet GP Ib(beta) gene. In the absence of normal polyadenylation the more 5' gene uses the polyadenylation site within its 3' neighbor, the platelet GP Ib(beta) gene. This newly identified 5' gene contains an open reading frame encoding 369 amino acids with a high degree of sequence similarity to an expanding family of GTP-binding proteins.
PMID - 8697812
We have isolated a novel human cDNA that encodes a protein homologous to murine H5 and Diff6, and to yeast CDC10, and mapped it to chromosome region 2q37 by fluorescent in situ hybridization. Its transcript has an open reading frame of 1,218 nucleotides encoding 406 amino acids. The deduced peptide sequence contained conserved domains rich in basic residues, GXXGXGKS--DXXG--TKXD, a motif of the GTPase superfamily. Different polyA sites accounted for generation of two transcripts. The major type, 3.5 kb long, was expressed ubiquitously in all human tissues examined, but a 2.0-kb alternative transcript lacking any long AU-rich element in the 3' non-coding region was expressed abundantly only in testis, heart and skeletal muscle.
PMID - 9520435
At what biological levels are data from single-celled organisms akin to a Rosetta stone for multicellular ones? To examine this question, we characterized a saturation-mutagenized 67-kb region of the Drosophila genome by gene deletions, transgenic rescues, phenotypic dissections, genomic and cDNA sequencing, bio-informatic analysis, reverse transcription-PCR studies, and evolutionary comparisons. Data analysis using cDNA/genomic DNA alignments and bio-informatic algorithms revealed 12 different predicted proteins, most of which are absent from bacterial databases, half of which are absent from Saccharomyces cerevisiae, and nearly all of which have relatives in Caenorhabditis elegans and Homo sapiens. Gene order is not evolutionarily conserved; the closest relatives of these genes are scattered throughout the yeast, nematode, and human genomes. Most gene expression is pleiotropic, and deletion studies reveal that a morphological phenotype is seldom observed when these genes are removed from the genome. These data pinpoint some general bottlenecks in functional genomics, and they reveal the acute emerging difficulties with data transferability above the levels of genes and proteins, especially with complex human phenotypes. At these higher levels the Rosetta stone analogy has almost no applicability. However, newer transgenic technologies in Drosophila and Mus, combined with coherency pattern analyses of gene networks, and synthetic neural modeling, offer insights into organismal function. We conclude that industrially scaled robogenomics in model organisms will have great impact if it can be realistically linked to epigenetic analyses of human variation and to phenotypic analyses of human diseases in different genetic backgrounds.
PMID - 8152419
Morphogenesis in the yeast Saccharomyces cerevisiae consists primarily of bud formation. Certain cell division cycle (CDC) genes, CDC3, CDC10, CDC11, CDC12, are known to be involved in events critical to the pattern of bud growth and the completion of cytokinesis. Their products are associated with the formation of a ring of neck filaments that forms at the region of the mother cell-bud junction during mitosis. Morphogenesis in Candida albicans, a major fungal pathogen of humans, consists of both budding and the formation of hyphae. The latter is thought to be related to the pathogenesis and invasiveness of C. albicans. We have isolated and characterized C. albicans homologs of the S. cerevisiae CDC3 and CDC10 genes. Both C. albicans genes are capable of complementing defects in the respective S. cerevisiae genes. RNA analysis of one of the genes suggests that it is a regulated gene, with higher overall expression levels during the hyphal phase than in the yeast phase. Not surprisingly, DNA sequence analysis reveals that the proteins share extensive homology at the amino acid level with their respective S. cerevisiae counterparts. Related genes are also found in other species of Candida and, more importantly, in filamentous fungi such as Aspergillus nidulans and Neurospora crassa. A database search revealed significant sequence similarity with two peptides, one from Drosophila and one from mouse, suggesting strong evolutionary conservation of function.
PMID - 10591208
Knowledge of the complete genomic DNA sequence of an organism allows a systematic approach to defining its genetic components. The genomic sequence provides access to the complete structures of all genes, including those without known function, their control elements, and, by inference, the proteins they encode, as well as all other biologically important sequences. Furthermore, the sequence is a rich and permanent source of information for the design of further biological studies of the organism and for the study of evolution through cross-species sequence comparison. The power of this approach has been amply demonstrated by the determination of the sequences of a number of microbial and model organisms. The next step is to obtain the complete sequence of the entire human genome. Here we report the sequence of the euchromatic part of human chromosome 22. The sequence obtained consists of 12 contiguous segments spanning 33.4 megabases, contains at least 545 genes and 134 pseudogenes, and provides the first view of the complex chromosomal landscapes that will be found in the rest of the genome.
PMID - 9169871
The yeast Saccharomyces cerevisiae is the pre-eminent organism for the study of basic functions of eukaryotic cells. All of the genes of this simple eukaryotic cell have recently been revealed by an international collaborative effort to determine the complete DNA sequence of its nuclear genome. Here we describe some of the features of chromosome XII.
PMID - 11322766
An expression sequence tag identified in a screen for genes upregulated by retinoic acid induced neuronal differentiation of the human teratocarcinoma cell line Ntera2/D1 was found in close genomic proximity to a region of high sequence homology to the septin subfamily of GTPase genes. We could show that the tag corresponds to the 3' untranslated region of this novel gene named septin 3 and cloned three isoforms A (2191 bp), B (4378 bp), and C (1896 bp) from human Ntera2/D1 cDNA. We present the genomic localization and organization on chromosome 22q13.2, a chromosomal hot spot for translocations implicated in leukemia. Interestingly, MSF the closest paralog of septin 3 is a fusion partner in a therapy-related acute myeloid leukemia. Quantitative PCR confirmed the upregulation of the putative septin by neuronal differentiation and northern blotting showed only one band corresponding to sep3B with a neurospecific expression pattern in adult human tissues.
PMID - 11859360
We have sequenced and annotated the genome of fission yeast (Schizosaccharomyces pombe), which contains the smallest number of protein-coding genes yet recorded for a eukaryote: 4,824. The centromeres are between 35 and 110 kilobases (kb) and contain related repeats including a highly conserved 1.8-kb element. Regions upstream of genes are longer than in budding yeast (Saccharomyces cerevisiae), possibly reflecting more-extended control regions. Some 43% of the genes contain introns, of which there are 4,730. Fifty genes have significant similarity with human disease genes; half of these are cancer related. We identify highly conserved genes important for eukaryotic cell organization including those required for the cytoskeleton, compartmentation, cell-cycle control, proteolysis, protein phosphorylation and RNA splicing. These genes may have originated with the appearance of eukaryotic life. Few similarly conserved genes that are important for multicellular organization were identified, suggesting that the transition from prokaryotes to eukaryotes required more new genes than did the transition from unicellular to multicellular organization.
PMID - 8181057
We have identified a Drosophila gene, peanut (pnut), that is related in sequence to the CDC3, CDC10, CDC11, and CDC12 genes of S. cerevisiae. These genes are required for cytokinesis, and their products are present at the bud neck during cell division. We find that pnut is also required for cytokinesis: in pnut mutants, imaginal tissues fail to proliferate and instead develop clusters of large, multinucleate cells. Pnut protein is localized to the cleavage furrow of dividing cells during cytokinesis and to the intercellular bridge connecting postmitotic daughter cells. In addition to its role in cytokinesis, pnut displays genetic interactions with seven in absentia, a gene required for neuronal fate determination in the compound eye, suggesting that pnut may have pleiotropic functions. Our results suggest that this class of proteins is involved in aspects of cytokinesis that have been conserved between flies and yeast.
PMID - 12747765
The ability of the immune system to recognize structurally altered, amplified or aberrantly expressed proteins can be used to identify molecules of etiologic relevance to cancer and to define targets for cancer immunotherapy. In the current study, ninety-four distinct antigens reactive with serum IgG from breast cancer patients were identified by immunoscreening breast cancer-derived cDNA expression libraries (SEREX). A serological profile was generated for each antigen on the basis of reactivity with allogeneic sera from normal individuals and cancer patients, and mRNA expression profiles for coding sequences were assembled based upon the tissue distribution of expressed sequence tags, Northern blots and real-time RT-PCR. Forty antigens reacted exclusively with sera from cancer patients. These included well-characterized tumor antigens, e.g. MAGE-3, MAGE-6, NY-ESO-1, Her2neu and p53, as well as newly-defined breast cancer antigens, e.g. kinesin 2, TATA element modulatory factor 1, tumor protein D52 and MAGE D, and novel gene products, e.g. NY-BR-62, NY-BR-75, NY-BR-85, and NY-BR-96. With regard to expression profiles, two of the novel gene products, NY-BR-62 and NY-BR-85, were characterized by a high level of testicular mRNA expression, and were overexpressed in 60% and 90% of breast cancers, respectively. In addition, mRNA encoding tumor protein D52 was overexpressed in 60% of breast cancer specimens, while transcripts encoding SNT-1 signal adaptor protein were downregulated in 70% of these cases. This study adds to the growing list of breast cancer antigens defined by SEREX and to the ultimate objective of identifying the complete repertoire of immunogenic gene products in human cancer (the cancer immunome).
PMID - 12537569
BACKGROUND: A collection of sequenced full-length cDNAs is an important resource both for functional genomics studies and for the determination of the intron-exon structure of genes. Providing this resource to the Drosophila melanogaster research community has been a long-term goal of the Berkeley Drosophila Genome Project. We have previously described the Drosophila Gene Collection (DGC), a set of putative full-length cDNAs that was produced by generating and analyzing over 250,000 expressed sequence tags (ESTs) derived from a variety of tissues and developmental stages. RESULTS: We have generated high-quality full-insert sequence for 8,921 clones in the DGC. We compared the sequence of these clones to the annotated Release 3 genomic sequence, and identified more than 5,300 cDNAs that contain a complete and accurate protein-coding sequence. This corresponds to at least one splice form for 40% of the predicted D. melanogaster genes. We also identified potential new cases of RNA editing. CONCLUSIONS: We show that comparison of cDNA sequences to a high-quality annotated genomic sequence is an effective approach to identifying and eliminating defective clones from a cDNA collection and ensure its utility for experimentation. Clones were eliminated either because they carry single nucleotide discrepancies, which most probably result from reverse transcriptase errors, or because they are truncated and contain only part of the protein-coding sequence.
PMID - 9889007
We assigned two human expressed sequence tags (ESTs), WI-15444 and SGC32067, homologous to mouse brain protein h5, to the critical region for Meckel syndrome (MKS) on 17q22-q23. For the sequence analyses in MKS patients, we isolated the corresponding human gene, PNUTL2, by analyzing an Image cDNA clone that contained these ESTs. Based on sequence homologies, the gene belongs to an expanding family of GTP-binding proteins, septins, that are involved in cytokinesis. In Northern analysis, PNUTL2 is ubiquitously expressed as a 1.7-kb transcript in adult and fetal tissues with particularly high expression in the heart, liver, and adrenal gland. Mutation analysis using sequencing of RT-PCR products and Northern blot analysis in MKS patients exclude PNUTL2 as the gene for MKS.
PMID - 8590280
In this series of projects regarding the accumulation of sequence information of unidentified human genes, we newly deduced the sequences of 40 full-length cDNA clones of human cell line KG-1, and predicted the coding sequences of the corresponding genes, named KIAA0121 to 0160. The results of a computer search of public databases indicated that the sequences of 13 genes were unrelated to any reported genes, while the remaining 27 genes carried sequences which showed some similarities to known genes. Obvious unique sequences noted were as follows. A stretch of triplet repeats was contained in each of three genes: These were GAG(Glu) in KIAA0122 and KIAA0147, and TCC(Ser) in KIAA0150. A stretch of 10 amino acid-residues was repeated 21 times in KIAA0139, and a homologous sequence of 76-78 nucleotides was found repeated 6 times in the untranslated region of KIAA0125. Northern hybridization analysis demonstrated that 13 genes were expressed in a cell- or tissue-specific manner. Although a vast number of expressed sequence tags (ESTs) have been registered for comprehensive analysis of cDNA clones, our sequence data indicated that their distribution is very unbalanced: e.g. while no EST hit 7 genes, 85 ESTs fell in a single gene.
PMID - 8791410
The septins are a novel family of proteins that were first recognized in yeast as proteins associated with the neck filaments. Recent work has shown that septins are also present in other fungi, insects, and vertebrates. Despite the apparent differences in modes of cytokinesis amongst species, septins appear to be essential for this process in both fungal and animal cells. The septins also appear to be involved in various other aspects of the organization of the cell surface.
PMID - 9203580
The mouse Nedd5 gene encodes a 41.5-kD GTPase similar to the Saccharomyces and Drosophila septins essential for cytokinesis. Nedd5 accumulates near the contractile ring from anaphase through telophase, and finally condenses into the midbody. Microinjection of anti-Nedd5 antibody interferes with cytokinesis, giving rise to binucleated cells. In interphase and postmitotic cells, Nedd5 localizes to fibrous or granular structures depending on the growth state of the cell. The Nedd5-containing fibers are disrupted by microinjection of GTPgammaS and by Nedd5 mutants lacking GTP-binding activity, implying that GTP hydrolysis is required for its assembly. The Nedd5-containing fibers also appear to physically contact actin bundles and focal adhesion complexes and are disrupted by cytochalasin D, C3 exoenzyme, and serum starvation, suggesting a functional interaction with the actin-based cytoskeletal systems in interphase cells.
PMID - 12537572
BACKGROUND: The recent completion of the Drosophila melanogaster genomic sequence to high quality and the availability of a greatly expanded set of Drosophila cDNA sequences, aligning to 78% of the predicted euchromatic genes, afforded FlyBase the opportunity to significantly improve genomic annotations. We made the annotation process more rigorous by inspecting each gene visually, utilizing a comprehensive set of curation rules, requiring traceable evidence for each gene model, and comparing each predicted peptide to SWISS-PROT and TrEMBL sequences. RESULTS: Although the number of predicted protein-coding genes in Drosophila remains essentially unchanged, the revised annotation significantly improves gene models, resulting in structural changes to 85% of the transcripts and 45% of the predicted proteins. We annotated transposable elements and non-protein-coding RNAs as new features, and extended the annotation of untranslated (UTR) sequences and alternative transcripts to include more than 70% and 20% of genes, respectively. Finally, cDNA sequence provided evidence for dicistronic transcripts, neighboring genes with overlapping UTRs on the same DNA sequence strand, alternatively spliced genes that encode distinct, non-overlapping peptides, and numerous nested genes. CONCLUSIONS: Identification of so many unusual gene models not only suggests that some mechanisms for gene regulation are more prevalent than previously believed, but also underscores the complex challenges of eukaryotic gene prediction. At present, experimental data and human curation remain essential to generate high-quality genome annotations.
PMID - 12477932
The National Institutes of Health Mammalian Gene Collection (MGC) Program is a multiinstitutional effort to identify and sequence a cDNA clone containing a complete ORF for each human and mouse gene. ESTs were generated from libraries enriched for full-length cDNAs and analyzed to identify candidate full-ORF clones, which then were sequenced to high accuracy. The MGC has currently sequenced and verified the full ORF for a nonredundant set of >9,000 human and >6,000 mouse genes. Candidate full-ORF clones for an additional 7,800 human and 3,500 mouse genes also have been identified. All MGC sequences and clones are available without restriction through public databases and clone distribution networks (see http:mgc.nci.nih.gov).
PMID - 9385360
A Drosophila-related expressed sequence tag (DRES) with sequence similarity to the peanut gene has previously been localized to human chromosome 22q11. We have isolated the cDNA corresponding to this DRES and show that it is a novel member of the family of septin genes, which encode proteins with GTPase activity thought to interact during cytokinesis. The predicted protein has P-loop nucleotide binding and GTPase motifs. The gene, which we call PNUTL1, maps to the region of 22q11.2 frequently deleted in DiGeorge and velo-cardio-facial syndromes and is particularly highly expressed in the brain. The mouse homologue, Pnutl1, maps to MMU16 adding to the growing number of genes from the DiGeorge syndrome region that map to this chromosome.
PMID - 8037772
We isolated a novel human cDNA, termed hCDC10, whose predicted product showed a high degree of homology to the CDC10 protein of Saccharomyces cerevisiae. This cDNA contained an open reading frame of 1254 nucleotides encoding 418 amino acids, which included a GTP-binding motif, GX4GKS--DX2G--KXD. The predicted peptide sequence also revealed partial amino-acid identity (40-50%) with Diff 6 in Drosophila and with H5 in mouse. Each of these sequence homologues, including Saccharomyces cerevisiae CDC10, contains the GTP-binding motif. Northern blot analyses indicated that the hCDC10 gene is expressed ubiquitously in normal tissues.
PMID - 11511094
Expression changes in subsets of genes occur in the course of altering cell fates, i.e., aging, cell death, and carcinogenesis. These changes simultaneously provide the good candidate as a biomarker for monitoring cancer. We have identified a novel human septin family gene, Bradeion, from adult brain cDNA library by a monoclonal antibody CE5. Northern blot and in situ hybridization analysis showed that Bradeion has two distinct transcripts, approximately 2.2 and 1.7 kb length (alpha and beta, respectively) mainly in brain and slightly in heart, and no expression in any fetal organs. Haplotype analysis placed the gene location at 17q23. The gene contains GTPase motifs highly conserved in the septin family genes that are essential for cytokinesis and cell separation. The transcript of beta form lacks a hydrophobic region, which suggests that this form arises from a single Bradeion gene through unique RNA splicing. Interestingly, this brain-specific Bradeion gene is also expressed in two human cancers, colorectal cancer and malignant melanoma. Ectopic expression of normal Bradeion alpha and beta transcripts were confirmed both in patients' tumor samples and in in vitro cultured human cancer cell lines. Thus the Bradeion provides valuable tools as a tumor-specific and selective marker.
PMID - 2174398
Lymphocyte adhesion to high endothelial venule cells in lymphoid organs of mice is mediated by several cell-surface glycoproteins, one of which, gp90MEL-14, is detected by the MEL-14 monoclonal antibody (mAb). The MEL-14 mAb was used to select two variants of the EL4 cell line, EL4MEL-14-hi and EL4-MEL-14-lo, that have disparate cell surface expression of this adhesion receptor. A cDNA library constructed from EL4MEL-14-hi mRNA was enriched for sequences present at higher levels in EL4MEL-14-hi cells than EL4MEL-14-lo cells. Quantitative analysis of candidate differential clones by RNA probe protection methods identified five clones whose steady-state mRNA levels were increased in the EL4MEL-14-hi cells. One of these clones, DIFF6, is derived from an RNA whose expression level is higher in several cell lines producing high amounts of MEL-14-reactive gp90, and absent or present at lower levels in several cell lines expressing low levels of this glycoprotein. However, DIFF6 does not encode gp90MEL-14. The nucleotide sequence of this clone predicts a relatively hydrophilic protein characteristic of a cytoplasmic or nuclear protein. Present experiments indicate that expression of gp90MEL-14, a cell-surface-adhesion receptor molecule, may be coregulated with additional cytoplasmic or nuclear factors.
PMID - 12466851
Only a small proportion of the mouse genome is transcribed into mature messenger RNA transcripts. There is an international collaborative effort to identify all full-length mRNA transcripts from the mouse, and to ensure that each is represented in a physical collection of clones. Here we report the manual annotation of 60,770 full-length mouse complementary DNA sequences. These are clustered into 33,409 'transcriptional units', contributing 90.1% of a newly established mouse transcriptome database. Of these transcriptional units, 4,258 are new protein-coding and 11,665 are new non-coding messages, indicating that non-coding RNA is a major component of the transcriptome. 41% of all transcriptional units showed evidence of alternative splicing. In protein-coding transcripts, 79% of splice variations altered the protein product. Whole-transcriptome analyses resulted in the identification of 2,431 sense-antisense pairs. The present work, completely supported by physical clones, provides the most comprehensive survey of a mammalian transcriptome so far, and is a valuable resource for functional genomics.
PMID - 9611266
Septins are a family of highly conserved filament-forming proteins that have been shown to mediate cytokinesis and cytoskeletal organization in fungi and Drosophila. The gene encoding the human septin family member HCDCREL-1 has been shown to be transcribed from a locus immediately adjacent to that of the platelet glycoprotein (GP) Ib b. The HCDCREL-1 gene possesses a non-consensus polyadenylation signal that apparently is not efficiently utilized, resulting in the expression of a readthrough transcript also containing the platelet GPIb beta coding region. As a first step in understanding the regulation and function of HCDCREL-1, we have analyzed the structure of this gene and characterized its expression in a variety of human cells. Our results indicate that the gene is expressed at high levels in platelets and neural tissue, and is transcriptionally complex.

STRP_STREQ.abs:
PMID - 8746458
The streptokinase gene of the Streptococcus pyogenes strain CIP 56.57 was cloned and sequenced. This sequence coding for a 441 amino acid protein is well conserved among streptococcus species: there are two very conserved domains separated by a more variable region.
PMID - 6760891
The complete amino acid sequence of streptokinase has been determined by automated Edman degradation of its cyanogen bromide and proteolytic fragments. The protein consists of 415 amino acid residues. Sequence microheterogeneity was found at two positions. The NH2-terminal 245 residues of streptokinase are homologous to the sequences of several serine proteases including bovine trypsin and Streptomyces griseus proteases A and B. The sequence alignment suggests that the active-site histidine-57 has changed to a glycine in streptokinase. The other active-site residues, aspartyl-102 and serine-195, are, however, present at the expected positions. Streptokinase also contains internal sequence homology between the NH2-terminal 173 residues and a COOH-terminal 162-residue region between residues 254 and 415. Moderate homology in predicted secondary structures also exists between these two regions. Although streptokinase is not a protease, these observations suggest that it has evolved from a serine protease by gene duplication and fusion. A COOH-terminal region of about 80 residues is apparently deleted from the second half of the duplicated structures. These observations further suggest that the three-dimensional structure of streptokinase likely contains two independently folded domains, each homologous to serine proteases.
PMID - 2989113
The entire nucleotide sequence of a cloned 2568-bp PstI fragment from the genome of Streptococcus equisimilis H46A encoding the streptokinase gene (skc) has been determined. The longest open reading frame comprises 1320 bp which code for streptokinase. The protein is synthesized with a 26-amino acid residue N-terminal extension having properties characteristic of a signal peptide. Comparison of the deduced amino acid sequence with the available amino acid sequence of a commercial streptokinase reveals minor primary structure differences. The nucleotide sequencing of skc does not support the hypothesis that the gene has evolved by duplication and fusion, as suggested by internal twofold amino acid homologies of its product. Furthermore, the skc gene sequence shows no extended regions homologous to the staphylokinase gene. Upstream from the skc gene, the putative skc promoter and the ribosome-binding site sequence have been identified; downstream from the coding region, inverted repeat sequences thought to function as transcription terminators have been detected.
PMID - 11296296
The 1,852,442-bp sequence of an M1 strain of Streptococcus pyogenes, a Gram-positive pathogen, has been determined and contains 1,752 predicted protein-encoding genes. Approximately one-third of these genes have no identifiable function, with the remainder falling into previously characterized categories of known microbial function. Consistent with the observation that S. pyogenes is responsible for a wider variety of human disease than any other bacterial species, more than 40 putative virulence-associated genes have been identified. Additional genes have been identified that encode proteins likely associated with microbial "molecular mimicry" of host characteristics and involved in rheumatic fever or acute glomerulonephritis. The complete or partial sequence of four different bacteriophage genomes is also present, with each containing genes for one or more previously undiscovered superantigen-like proteins. These prophage-associated genes encode at least six potential virulence factors, emphasizing the importance of bacteriophages in horizontal gene transfer and a possible mechanism for generating new strains with increased pathogenic potential.

TGR3_RAT.abs:
PMID - 1657407
The rat TGF-beta type III receptor cDNA has been cloned by overexpression in COS cells. The encoded receptor is an 853 amino acid protein with a large N-terminal extracellular domain containing at least one site for glycosaminoglycan addition, a single hydrophobic transmembrane domain, and a 41 amino acid cytoplasmic tail with no obvious signaling motif. Introduction of the cDNA into COS cells and L6 myoblasts induces expression of a heterogenously glycosylated 280-330 kd protein characteristic of the type III receptor that binds TGF-beta 1 specifically. In L6 myoblasts lacking the endogenous type III receptor, expression of the recombinant receptor leads to an increase in the amount of ligand bound and cross-linked to surface type II TGF-beta receptors. This indicates that the type III receptor may regulate the ligand-binding ability or surface expression of the type II receptor.
PMID - 7894484
Hereditary haemorrhagic telangiectasia (HHT) is an autosomal dominant disorder characterized by multisystemic vascular dysplasia and recurrent haemorrhage. Linkage for some families has been established to chromosome 9q33-q34. In the present study, endoglin, a transforming growth factor beta (TGF-beta) binding protein, was analysed as a candidate gene for the disorder based on chromosomal location, expression pattern and function. We have identified mutations in three affected individuals: a C to G substitution converting a tyrosine to a termination codon, a 39 base pair deletion and a 2 basepair deletion which creates a premature termination codon. We have identified endoglin as the HHT gene mapping to 9q3 and have established HHT as the first human disease defined by a mutation in a member of the TGF-beta receptor complex.
PMID - 8370410
Endoglin is an homodimeric membrane antigen with capacity to bind transforming growth factor-beta (TGF-beta) and whose expression is up-regulated on myeloid cells upon differentiation to macrophages. We have isolated full-length cDNA clones from a lambda gt 10 library, prepared from phorbol 12-myristate 13-acetate-differentiated HL60 cells by screening with an endoglin-specific cDNA probe from endothelial cells. Sequencing of the largest clone (3073 bp), revealed that the leader sequence contains 25 residues and that the 586 amino acids of the extracellular and transmembrane domains were identical to those described for endothelial endoglin. However, the cytoplasmic tail encoded by this cDNA clone contains only 14 amino acids as opposed to the 47 residues previously reported, suggesting the existence of two alternative endoglin variants. The expression of these isoforms was demonstrated by polymerase chain reaction analyses on endothelial cells, myelomonocytic cell lines HL-60 and U-937, and placenta. Independent cDNA constructs corresponding to both forms were transfected into mouse fibroblasts leading to the expression of two distinct endoglin molecules. Both forms were shown to bind TGF-beta 1 and, when overexpressed in transfected mouse fibroblasts, to form disulfide-linked homodimers, indicating that the cysteine residues present in the extracellular domain are responsible for the dimerization.
PMID - 8294451
Human endoglin is a dimeric protein that binds transforming growth factor-beta (TGF-beta). A porcine cDNA clone for endoglin was obtained from a porcine uterus cDNA library. The deduced sequence of the primary translated product of endoglin consists of 643 amino acids with a high sequence identity (96%) to human endoglin in the transmembrane and intracellular domains, but with a lower sequence similarity (66%) in the extracellular domain. In contrast to human endoglin, porcine endoglin has no Arg-Gly-Asp tripeptide in its sequence. Antibodies, raised against a peptide corresponding to the intracellular domain of porcine endoglin, immunoprecipitated an 84-kDa protein under reducing condition and a 130-kDa protein under nonreducing condition in porcine aortic endothelial cells. Porcine endoglin bound TGF-beta 1 and -beta 3 efficiently, but TGF-beta 2 less efficiently. Endoglin was found to be coimmunoprecipitated with TGF-beta receptors type I and/or II by the endoglin antibodies or by TGF-beta receptor II antibodies in the presence of ligand. Thus, endoglin and TGF-beta receptors I and/or II most likely formed a heteromeric receptor complex. Endoglin was phosphorylated on serine residue(s), which did not change after stimulation by TGF-beta 1. These results revealed that endoglin is a phosphorylated protein which forms a heteromeric complex with signaling receptors for TGF-beta.
PMID - 10545596
ENDOGLIN codes for a homodimeric membrane glycoprotein that interacts with receptors for members of the TGF-beta superfamily and is the gene mutated in the autosomal dominant vascular disorder hereditary hemorrhagic telangiectasia type 1 (HHT1). We recently demonstrated that functional endoglin was expressed at half levels on human umbilical vein endothelial cells (HUVECs) and peripheral blood activated monocytes from HHT1 patients. Two types of mutant protein were previously analyzed, the product of an exon 3 skip which was expressed as a transient intracellular species and prematurely truncated proteins that were undetectable in patient samples. Here we report the analysis of four proteins resulting from point mutations, with missense codons G52V and C53R in exon 2, W149C in exon 4 and L221P in exon 5. Metabolic labeling of activated monocytes from confirmed, clinically affected patients revealed reduced expression of fully processed normal endoglin in all cases. Pulse-chase analysis with HUVECs from a newborn with the C53R substitution indicated that mutant endoglin remained intracellular as a precursor form and did not impair processing of the normal protein. Biotinylation of cell surface proteins, metabolic labeling and pulse-chase analysis revealed that none of the engineered missense mutants was significantly expressed at the surface of COS-1 transfectants. Thus, these four HHT1 missense mutations lead to transient intracellular species which cannot interfere with normal endoglin function. These data suggest that haploinsufficiency, leading to reduced levels of one of the major surface glyco-proteins of vascular endothelium, is the predominant mechanism underlying the HHT1 phenotype.
PMID - 8125301
The rat monoclonal antibody, MJ7/18, which reacts selectively with the endothelium of blood vessels in mouse was used to screen a cDNA library derived from a transformed mouse brain endothelial cell line. The sequence of a cDNA encoding the cell surface MJ7/18 antigen revealed homology to human endoglin, a homodimeric transforming growth factor-beta (TGF-beta)-binding cell-surface glycoprotein expressed predominantly on vascular endothelial cells. Northern blot analysis shows a 3.4-kb single transcript of the mouse endoglin. The mouse endoglin is a type-I integral membrane protein of 653 amino acids (aa). The human and mouse sequences display 71% aa sequence identity with almost identical transmembrane and cytoplasmic domains. Like its human counterpart, mouse endoglin displays significant sequence homology to the type-III TGF-beta receptor in two extracellular domains, as well as striking similarity in the transmembrane and cytoplasmic regions. One of the extracellular regions of homology with TGF-beta receptor III represents a truncated version of a homology unit defining a novel gene family including uromodulin, the pancreatic granule protein gp2, and zona pellucida receptors for sperm. However, unlike its human counterpart, mouse endoglin does not contain an RGD tripeptide which has been suggested as a ligand of integrins.
PMID - 9245986
To identify mutations that cause hereditary hemorrhagic telangiectasia (HHT, or Rendu-Osler-Weber syndrome), clinical evaluations and genetic studies were performed on 32 families. Linkage studies in four of eight families indicated an endoglin (ENG) gene mutation. ENG sequences of affected members of the four linked families and probands from the 24 small families were screened for mutations, by Southern blot analyses and by cycle sequencing of PCR-amplified DNA. Seven novel mutations were identified in eight families. Two mutations (a termination codon in exon 4 and a large genomic deletion extending 3' of intron 8) did not produce a stable ENG transcript in lymphocytes. Five other mutations (two donor splice-site mutations and three deletions) produce altered mRNAs that are predicted to encode markedly truncated ENG proteins. Mutations in other families are predicted to lie in ENG-regulatory regions or in one of the additional genes that may cause HHT. These data suggest that the molecular mechanism by which ENG mutations cause HHT is haploinsufficiency. Furthermore, because the clinical manifestation of disease in these eight families was similar, we hypothesize that phenotypic variation of HHT is not related to a particular ENG mutation.
PMID - 10625079
Hereditary hemorrhagic telangiectasia (HHT) is a dominantly inherited vascular disorder that is heterogeneous in terms of age of onset and clinical manifestations. Endoglin is the gene mutated in HHT1, which is associated with a higher prevalence of pulmonary arteriovenous malformations than HHT2, where ALK-1 is the mutated gene. Endoglin is constitutively expressed on endothelial cells and inducible on peripheral blood activated monocytes so that protein levels can be measured by metabolic labeling and immunoprecipitation. We report the analysis of umbilical vein endothelial cells in 28 newborns from 24 families with a clinical diagnosis of HHT. Reduced levels of endoglin were observed in umbilical vein endothelial cells in 15/28 subjects and in activated monocytes of all clinically affected relatives tested, suggesting that these individuals had HHT1. No mutant protein was expressed at the cell surface in any of these cases, and a transient intracellular species was seen in samples of only two families, supporting a haploinsufficiency model. Quantitative multiplex PCR fragment analysis was established for the endoglin gene and revealed six mutations that were confirmed by automated DNA sequencing. An additional 10 mutations were identified in newborns by sequencing all exons. Of the 16 mutations, 10 were novel, three had been independently identified in related families, and three were previously known. Our data confirm that endoglin levels correlate with the presence or absence of mutation in HHT1 families, allowing the early identification of affected newborns that should be screened clinically to avoid serious complications of this disorder, such as cerebral arteriovenous malformations.
PMID - 1333192
Full-length cDNAs for the transforming growth factor-beta (TGF-beta) type III receptors were isolated from porcine uterus and human placenta cDNA libraries. The human TGF-beta type III receptor coding region encodes a protein of 849 amino acids with a single transmembrane domain and a short stretch of the intracellular domain. Potential glycosaminoglycan attachment sites were found in the extracellular domain. The overall amino acid sequence identities with those of the porcine and rat TGF-beta type III receptors were 83% and 81%, respectively. A high degree of sequence conservation was observed in the transmembrane and intracellular domains, which also have sequence similarity with human endoglin. In addition, two portions with 29 and 52 amino acids in the extracellular domain were found to be substantially similar with human endoglin.
PMID - 1657406
We describe the primary structure of rat betaglycan, a polymorphic membrane-anchored proteoglycan with high affinity for transforming growth factor-beta (TGF-beta). As deduced from its cDNA sequence, the 853 amino acid core protein of betaglycan has an extracellular domain with clustered sites for potential attachment of glycosaminoglycan chains. These chains are dispensable for TGF-beta binding to the core protein. The transmembrane region and the short cytoplasmic tail of betaglycan are very similar to these regions in human endoglin, an endothelial cell membrane glycoprotein involved in intercellular recognition. The ectodomain of betaglycan can be released as a soluble proteoglycan; a potential cleavage site near the transmembrane region is identical to the highly regulated cleavage site of the membrane-anchored transforming growth factor-alpha precursor. The unique features of betaglycan suggest important roles in cell interaction with TGF-beta.
PMID - 9157574
Hereditary hemorrhagic telangiectasia (HHT) is an autosomal dominant disorder characterized by multisystem vascular dysplasia and recurrent hemorrhage. Recent investigation has mapped one of the responsible genes for HHT to chromosome 9q33-q34; subsequently, nine different mutations have been identified in the endoglin gene, which encodes a transforming growth factor beta (TGF-beta) binding protein, in nine unrelated families with HHT. We examined the endoglin gene in a Japanese patient with HHT and her family members. Using PCR-SSCP analysis followed by sequencing, we identified a C to A missense mutation in exon 4 which changed an Ala160 codon(GCT) to an Asp160 codon (GAT). Since this mutation destroys one of three Fnu4H 1 sites in exon 4, the Fnu4H I digestion patterns of the PCR-amplified exon 4 fragments from each family member were analyzed. In affected members, the restriction patterns were all consistent with a phenotype of HHT. PCR-amplified exon 4 fragments from 150 normal individuals were also analyzed by allele-specific oligonucleotide hybridization analysis. As a result, the mutation was not found in any of them. We conclude that the C to A mutation in exon 4 of the endoglin gene in this proband is responsible for the occurrence of HHT in this family.
PMID - 8194490
Endoglin is an integral membrane glycoprotein predominantly expressed on human endothelial cells and recently shown to bind transforming growth factor-beta 1 (TGF beta 1) with high affinity. We now report the cloning and sequencing of a full-length murine endoglin complementary DNA of 2902 base pairs which hybridizes specifically with a single messenger RNA (mRNA) species. The polypeptide of 653 amino acids has an overall identity of 72% with human and porcine endoglin. The transmembrane and cytoplasmic domains of all three proteins differ by two to four amino acids and are 70% identical to the corresponding regions of the TGF beta binding protein, betaglycan. Relative levels of murine endoglin mRNA were estimated by polymerase chain reaction and found to be high in ovary and uterus, intermediate in heart and muscle, and low in placenta and spleen. In situ hybridization and immunofluorescence confirmed that murine endoglin, like its human counterpart, is present in blood vessels and capillaries in all tissues examined. In addition, the stromal cells in the connective tissue of intestine, stomach, heart, muscle, uterus, ovary, and testis were strongly and specifically reactive with complementary RNA probes and with a polyclonal antibody to endoglin; epithelial cell layers were distinctly unreactive. This distribution is similar to that of extracellular TGF beta 1, particularly in heart and uterus, and suggests that endoglin on stromal fibroblast-like cells might be regulating access of TGF beta 1 to the signaling receptor complex. NCTC-2071 fibroblasts in culture were shown to express high levels of endoglin mRNA by polymerase chain reaction. After chemical cross-linking with [125I]TGF beta 1 and immunoprecipitation with the polyclonal antihuman endoglin serum, a radiolabeled band of mol wt 180,000 corresponding to dimeric endoglin was observed under nonreducing conditions, whereas a single band of mol wt 90,000 was seen under reducing conditions. Thus murine fibroblast endoglin is capable of binding TGF beta 1. Future studies should establish the specialized role of endoglin in the TGF beta receptor complex of endothelial and stromal cells.
PMID - 1692830
Endoglin is a major glycoprotein of human vascular endothelium. As observed with monoclonal antibody 44G4, the distribution of endoglin is restricted to endothelial cells in all tissues except bone marrow. cDNA clones were isolated from an endothelial cell lambda gt11 cDNA library using a rabbit antibody prepared against endoglin purified from placenta. Eleven antibody-positive and cross-hybridizing clones were obtained; reactivity with endothelial cell 3.4-kilobase mRNA transcript was observed. The N-terminal sequence of placental endoglin was determined and found within the deduced protein sequence, thus confirming the identity of the cDNA and revealing a partial signal peptide. Endoglin is a type I integral membrane protein of Mr = 68,051 with an extracellular region of 561 amino acids, a hydrophobic transmembrane domain, and a 47-residue cytoplasmic tail. There are four potential N-linked glycosylation sites in the N-terminal domain and a probable O-glycan domain rich in Ser and Thr residues proximal to the membrane-spanning domain. Data base searches revealed that endoglin is a novel protein. The sequence contains an RGD tripeptide (374-376), the first identified on a surface protein of endothelium. The presence of RGD, a key recognition structure in cellular adhesion, suggests a critical role for endoglin in the binding of endothelial cells to integrins and/or other RGD receptors.
PMID - 9554745
Hereditary Hemorrhagic Telangiectasia (HHT) is an autosomal dominant disorder characterized by multisystemic vascular dysplasia and recurrent hemorrhage from the sites of vascular lesions. Two genes have been identified for HHT. Endoglin, a TGF-beta binding protein which maps to chromosome 9q3, is the gene for HHT1. The type and location of most of the previously described mutations in the endoglin (ENG) gene suggested a dominant-negative model of receptor-complex dysfunction for the molecular basis of this disorder. In this article we describe 11 novel ENG mutations in HHT kindreds, which include missense and splice-site mutations. Two identical missense mutations in unrelated families disrupt the start codon of the gene. In addition, some frameshift and nonsense mutations lead to very low or undetectable levels of transcript from the mutant allele. These combined data suggest that the nature of most ENG mutations is to create a null (nonfunctional) allele, and that there is no requirement for the synthesis of a truncated endoglin protein in the pathogenesis of HHT.
PMID - 7864874
The zona pellucida composed of three or four glycoproteins plays important roles in fertilization. Our previous study showed that porcine ZP1, one of the major glycoproteins of porcine zona pellucida, was divided into two components (porcine ZP4 and ZP2), and suggested it was a homologue of mouse ZP2. In this paper we report the cloning of a cDNA for porcine ZP1 and its genomic organization. The deduced amino acid sequence of porcine ZP1 shared a 54% and 63% identity with those of mouse and human ZP2, respectively. Genomic organization of porcine ZP1 was also similar to that of mouse ZP2. The transcript of porcine ZP1 gene was detected only in growing oocytes.
PMID - 7841460
Full length zona pellucida cDNAs from cat, dog and pig that are homologous to the ZP2/rc75 genes from mouse, human and rabbit, a full length zona pellucida cDNA from cat and a gene and full length cDNA from human that are homologous to the rc55/ZP3 alpha genes from rabbit and pig, and full length zona pellucida cDNAs from cat, cow, dog, pig and rabbit that are homologous to the ZP3 genes from mouse, hamster, human and marmoset have been cloned and characterized. The members of these gene families are herein referred to as ZPA, ZPB and ZPC genes to avoid the confusion that currently exists in the zona pellucida of nomenclature. This report is the first to describe the presence all three major zona pellucida genes within individual mammalian species. Within the ZPA, ZPB and ZPC gene families, the DNA and deduced amino acid sequences are highly homologous to each other, and are most homologous between members of the same order within the class mammalia. These results imply that all or most mammalian species express the ZPA, ZPB and ZPC proteins, which form the zona pellucida layer surrounding the oocyte.

VFUS_VACCC.abs:
PMID - 2795717
Comprehensive comparisons of genome organizations for poxviruses of different genera have not previously been reported. Here we have made such a comparison by cross-hybridizing genome fragments from capripoxvirus KS-1 and vaccinia virus WR (VV). This showed that a 100- to 115-kilobase (kb) centrally placed section is essentially colinear in organization in the two viruses and that a small region has translocated between the ends of one or other of the genomes during their divergence. No cross-hybridization could be detected between VV DNA and the respective left- and right-hand terminal 8 and 25 kb of capripoxvirus DNA or between capripoxvirus DNA and the respective left- and right-hand terminal 38 and 35 kb of VV DNA. By using the cross-hybridization data, a 4-kb fragment of KS-1 DNA was identified, which corresponds to the regions of the cowpox virus and VV genomes containing genes for the orthopoxvirus A-type inclusion body protein ("ATI"). The sequence of the KS-1 DNA fragment contains homologs of genes which are on either side of the orthopoxvirus ATI genes but contains no homolog of the ATI gene itself. Overall, these results show that the pattern of genomic conservation and variation between two poxvirus genera reflects the pattern within the orthopoxvirus genus but that, as observed previously, individual genes may not be present in genomic regions which are otherwise conserved in organization.
PMID - 2219722
The complete DNA sequence of the genome of vaccinia virus has been determined. The genome consisted of 191,636 bp with a base composition of 66.6% A + T. We have identified 198 "major" protein-coding regions and 65 overlapping "minor" regions, for a total of 263 potential genes. Genes encoded by the virus were located by examination of DNA sequence characteristics and compared with existing vaccinia virus mapping analyses, sequence data, and transcription data. These genes were found to be compactly organized along the genome with relatively few regions of noncoding sequences. Whereas several similarities to proteins of known function were discerned, the function of the majority of proteins encoded by these open reading frames is as yet undetermined.
PMID - 2033392
The nucleotide sequence of a region of DNA 30 kb from the right end of the orf virus genome has been determined. Examination of the sequences revealed an open reading frame encoding a 10K peptide with significant amino acid homology to the 14K 'fusion' protein reported in vaccinia virus. The orf virus sequence has a 31% identity with the vaccinia virus protein, but a higher level of homology of core predicted residues. The secondary structure of both proteins is also similar. The occurrence of the TAAAT sequence upstream from the initiation codon indicates that the sequence is likely to be transcribed late in infection.
PMID - 1856205
The gene rpo35, encoding a subunit of the vaccinia virus DNA-dependent RNA polymerase, was identified, and its RNA and protein products were characterized. An Mr 35,000 polypeptide, which bound antibody to the purified RNA polymerase, was synthesized in reticulocyte lysates programmed with viral mRNA that hybridized to a 2,300-base pair segment of the viral genome. Determination of the sequence of the DNA segment revealed four potential protein coding regions, none of which had evident similarity to any described RNA polymerase subunit of prokaryotes or eukaryotes. One open reading frame that could encode a 35,400-Da protein was identified as rpo35 on the basis of mRNA hybridization, cell-free translation, and immunoprecipitation. The identification was confirmed by sequencing tryptic peptides of the authentic Mr 35,000 RNA polymerase subunit. Antiserum to the purified recombinant protein, expressed in bacteria, reacted specifically with a Mr 35,000 polypeptide that was detected starting 2 h after virus infection and that co-sedimented with RNA polymerase purified from virions. RNA analyses indicated that the 5'-end of an early transcript started 25 nucleotides upstream of rpo35, which is consistent with the location of an early promoter consensus sequence.
PMID - 2822962
A library of rabbit poxvirus DNA fragments contained in the expression cloning vector lambda gt11 was screened with monoclonal antibodies that react specifically against a 14-kilodalton envelope protein of vaccinia virus and rabbit poxvirus. The 14-kilodalton protein appears to play an important role in virus penetration at the level of cell fusion; it also elicits neutralizing antibodies, and it forms covalently linked trimers on the surface of virions and in infected cells (Rodriguez et al., J. Virol. 56:482-488, 1985; Rodriguez et al., J. Virol. 61:395-404, 1987). Two recombinant bacteriophages expressing beta-galactosidase fusion proteins were isolated. Restriction enzyme analysis and hybridization studies mapped the 14-kilodalton encoding sequences in the middle of vaccinia virus HindIII A DNA fragment. Nucleotide sequence analysis revealed an open reading frame (ATG) preceded by a characteristic TAA sequence of late genes. The sequence spans 330 nucleotides and codes for a protein with a molecular weight of 12,500 and an isoelectric point of 6.3. There are two small hydrophobic regions, one at the C terminus (11 amino acids) and the other at the N terminus (5 amino acids). The protein contains two cysteines for oligomer formation and one glycosylation site. Inspection of the deduced amino acid sequence of the 14-kilodalton protein revealed consensus sites with the hemagglutinin precursor of influenza A virus and with adenylate kinase and cytochrome c of various species.
PMID - 2389560
The mechanism by which the large-size poxviruses enter animal cells is not known. In this investigation we show that acid pH treatment of wild-type vaccinia virus-infected cells triggers strong fusion of cells in culture, with an optimum at pH 4.8. We have identified the virus-induced fusion protein as a 14-kDa envelope protein, based on the ability of a 14-kDa specific monoclonal antibody (mAbC3) to block vaccinia virus-induced fusion-from-within and fusion-from-without. We provide genetic evidence for a role of the 14-kDa protein in cell fusion, since insertion of the 14-kDa encoding gene into the genome of nonfusogenic mutant viruses generates heterozygous viruses that now acquire acid pH-dependent fusion activity. DNA sequence analyses of the 14-kDa encoding gene of the mutant viruses, 65-16 and 101-14, reveal N-terminal deletions of 46 and 10 amino acids, respectively. These deletions remove a small hydrophobic region at the N-terminus of the 14-kDa protein and prevent fusion. Our findings demonstrate that vaccinia virus can induce strong fusion of cells in culture at acid pH implying some entry of the virus by endocytosis, that the 14-kDa virus envelope protein is the fusogenic protein, and that the N-terminal proximal region is involved in fusion.
PMID - 8384129
Analysis of variola virus nucleotide sequence revealed proteins belonging to several families which provide the virus with the possibility of overcoming the barriers of specific and non-specific host defence against viral infection. The complement-binding proteins, lymphokine-binding proteins, and serine protease inhibitors can be assigned to this type, as can the proteins providing the orthopoxviruses with resistance to interferon. The revealed differences between the genes (proteins) of variola and vaccinia viruses under study are discussed.

VGLG_IHNV.abs:
PMID - 2741347
A cDNA copy of the mRNA for the glycoprotein G of Chandipura virus, a rhabdovirus, has been cloned, sequenced, and expressed in mammalian cells. The deduced amino acid sequence of G shows that the encoded protein is a typical transmembrane glycoprotein of 524 amino acids containing a cleavable amino-terminal signal peptide, two potential N-linked glycosylation sites, a hydrophobic membrane anchor domain near the carboxy terminus, and a cytoplasmic domain at the carboxy terminus. Somewhat unusual is the appearance of two charged amino acid residues, aspartate and arginine, within the putative membrane anchor sequence. Expression of the G gene in COS cells resulted in production of a glycosylated protein of mol wt 71,000 which was recognized by anti-Chandipura antibodies. Like the viral G protein, the expressed G contained covalently linked palmitic acid. However, unlike its vesicular stomatitis virus (Indiana serotype) counterpart, the Chandipura G protein has no potential palmitate-accepting cysteine residue within its cytoplasmic domain. Thus, the covalent attachment of fatty acid to this molecule may occur at one or both of the cysteines within the membrane anchor domain. The G protein was intracellularly transported to the cell surface and could induce cell fusion at low pH, showing that the expressed G protein was biologically active.
PMID - 1413521
A 3789 nucleotide region of the bovine ephemeral fever virus (BEFV) genome, located 1.65 kb downstream of the N gene, has been cloned and sequenced. The region contains two long open reading frames (ORFs) which are bounded by putative consensus (AACAGG) and polyadenylation (CATG[A]7) sequences and are separated by an intergenic region of 53 nucleotides. Discrete mRNAs corresponding to each ORF have been identified. The first ORF encodes a polypeptide comprising 623 residues which was identified by peptide sequencing as the virion G protein. The deduced amino acid sequence of the G protein includes putative signal and transmembrane domains and five potential glycosylation sites. The second ORF encodes a polypeptide of 586 amino acids which also has characteristics of a rhabdovirus glycoprotein, including putative signal and transmembrane domains and eight potential glycosylation sites, and appears to correspond to a 90-kDa nonstructural glycoprotein (GNS) identified in BEFV-infected cells (Walker et al. [1991] J. Gen. Virol. 72, 67-74). A database search indicated that both the G and GNS proteins share significant amino acid sequence homology with other rhabdovirus G proteins and with each other. Highest homology scores for each protein were with sigma virus and vesicular stomatitis virus serotypes.
PMID - 3459163
Rabies cDNA clones, obtained by "walking along the genome" using two successive DNA primers, have allowed the sequence determination of the genes encoding the N, M1, M2, G, and the beginning of the L protein as well as the rabies intergenic regions. Start and stop transcription signals located at the border of each gene encoding a protein have been identified and are similar to the corresponding signals from vesicular stomatitis virus (VSV) and Sendai virus. Except for limited stretches of the nucleoprotein, there is no homology between corresponding structural proteins of these three viruses. Rabies intergenic regions are variable both in length and sequence. Evidence for the existence of a remnant protein gene in the 423 nucleotide long G-L intergenic region is presented. This finding is discussed in terms of the evolution of unsegmented negative-strand RNA viruses.
PMID - 6268840
The complete nucleotide sequences of the vesicular stomatitis virus mRNA's encoding the glycoprotein (G) and the matrix protein (M) have been determined from cDNA clones that contain the complete coding sequences from each mRNA. The G protein mRNA is 1,665 nucleotides long, excluding polyadenylic acid, and encodes a protein of 511 amino acids including a signal peptide of 16 amino acids. G protein contains two large hydrophobic domains, one in the signal peptide and the other in the transmembrane segment near the COOH terminus. Two sites of glycosylation are predicted at amino acid residues 178 and 335. The close correspondence of the positions of these sites with the reported timing of the addition of the two oligosaccharides during synthesis of G suggests that glycosylation occurs as soon as the appropriate asparagine residues traverse the membrane of the rough endoplasmic reticulum. The mRNA encoding the vesicular stomatitis virus M protein is 831 nucleotides long, excluding polyadenylic acid, and encodes a protein of 229 amino acids. The predicted M protein sequence does not contain any long hydrophobic or nonpolar domains that might promote membrane association. The protein is rich in basic amino acids and contains a highly basic amino terminal domain. Details of construction of the nearly full-length cDNA clones are presented.
PMID - 6298453
The nucleotide sequence of the mRNA encoding the glycoprotein from the New Jersey serotype of vesicular stomatitis virus (VSV) was determined from a cDNA clone containing the entire coding region. The sequence of 12 5'-terminal noncoding nucleotides present in the mRNA but not in the cDNA clone was determined from a primer extended to the 5' terminus of the mRNA. The mRNA is 1,573 nucleotides long (excluding polyadenylic acid) and encodes a protein of 517 amino acids. Only six nucleotides occur between the translation termination codon and the polyadenylic acid. Short homologies between the untranslated termini of this mRNA and the mRNAs of the Indiana serotype were found. The predicted protein sequence was compared with that of the glycoprotein of the Indiana serotype of VSV and with the glycoprotein of rabies virus, using a computer program which determines optimal alignment. An amino acid identity of 50.9% was found for the two VSV serotypes. Approximately 20% identity was found between the rabies virus and VSV New Jersey glycoproteins. The positions and sizes of the transmembrane domains, the signal sequences, and the glycosylation sites are identical in both VSV serotypes. Two of five serine residues which were possible esterification sites for palmitate in the glycoprotein from the Indiana serotype are changed to glycine residues in the glycoprotein from the New Jersey serotype. Because the glycoprotein of the New Jersey serotype does not contain esterified palmitate, we suggest that one or both of these residues are the probable esterification sites in the glycoprotein from the Indiana serotype.
PMID - 2139267
Complementary DNA spanning the entire genome of the attenuated rabies virus strain SAD B19 which is used for oral immunization of foxes in Europe was cloned and sequenced. The viral genome comprises 11,928 nucleotides and encodes the five viral proteins N, NS, M, G, and L. Deduced protein sequences are highly similar to those of the pathogenic PV strain, homologies ranging from 90.6% for the M to 98.6% for the L protein. The five cistrons are separated by intergenic regions of 2, 5, 5, and 24 nucleotides, respectively. The G transcription stop/polyadenylation consensus signal in SAD B19 is destroyed by a deletion of three A residues. The strong conservation of both noncoding and coding nucleotide sequences indicates a high selective pressure on the primary structure of rabies virus genomic RNA.
PMID - 3005478
BHK-21 cells readily produce tumours in athymic nude mice, but BHK-21 cells persistently infected with wild-type vesicular stomatitis virus (VSV) do not. However, rare persistently infected virus-shedding tumours (VSV-P tumour cells) were independently derived by in vivo selection on three different occasions. Cloned viruses isolated from each of these (VSV-P virus mutants) carried mutations determining the VSV-P phenotype because they all allowed growth of virus-shedding tumours in nude mice when they were used to persistently infect normal (unselected) BHK-21 cells. Treatment of nude mice with anti-asialo-GM1 allowed BHK cells persistently infected with wild-type VSV to form tumours, and BHK cells persistently infected with VSV-P were resistant to natural killer (NK) cells in vitro; this implicates NK cells in the in vivo rejection of persistently infected tumours and in the selection of the VSV-P variant. In this paper, we have sequenced the glycoprotein (G protein), matrix (M) and non-structural (NS) proteins of three independently derived VSV-P type mutants to find mutations associated with in vivo passage of persistently infected nude mouse tumours and with resistance to NK cells. We found extensive mutation in the G protein of VSV-P but relatively few mutations in the M and NS proteins. This suggests but does not prove a role for the G protein in NK cell killing of infected cells.
PMID - 1954257
A cDNA clone encoding for the glycoprotein of the viral haemorrhagic scepticaemia virus, a fish rhabdovirus, has been sequenced. The cDNA was 2035 bp long and contained two open reading frames (ORF). A 1523 bp ORF corresponded to the glycoprotein and was adjacent, on its 5' side, to an incomplete 372 bp ORF. Although the protein encoded by this ORF displayed no similarity with other rhabdovirus proteins, it was supposed that the cDNA had been reverse-transcribed from a readthrough mRNA encoding successively for the M2 and the G proteins.
PMID - 2822842
Sigma virus, the hereditary agent of a CO2-induced paralysis of Drosophila, is classified as a rhabdovirus on a molecular basis. We have purified its genome which after 32P-labelling was used as a probe to detect mRNAs in infected cells. A cDNA copy of the entire coding region of the glycoprotein mRNA was cloned. Nucleotide and deduced amino acid sequences were determined and compared to previously known sequences of other rhabdovirus glycoproteins to determine the relatedness of Sigma virus to other viruses of this group.
PMID - 2985803
DNA sequences were determined for three cDNA clones encoding vesicular stomatitis virus glycoproteins from the tsO45 mutant (which encodes a glycoprotein that exhibits temperature-sensitive cell-surface transport), the wild-type parent strain, and a spontaneous revertant of tsO45. The DNA sequence analysis showed that as many as three amino acid changes could be responsible for the transport defect. By recombining the cDNA clones in vitro and expressing the recombinants in COS cells, we were able to trace the critical lesion in tsO45 to a single substitution of a polar amino acid (serine) for a hydrophobic amino acid (phenylalanine) in a hydrophobic domain. We suggest that this nonconservative substitution may block protein transport by causing protein denaturation at the nonpermissive temperature. Comparison of the predicted glycoprotein sequences from two vesicular stomatitis virus strains suggests a possible basis for the differential carbohydrate requirement in transport of the two glycoproteins.
PMID - 3033264
The nucleotide sequence of the mRNA encoding the glycoprotein of infectious hematopoietic necrosis virus was determined from a cDNA clone containing the entire coding region. The G-protein cDNA is 1,609 nucleotides long (excluding the polyadenylic acid) and encodes a protein of 508 amino acids. The predicted amino acid sequence was compared with that of the glycoprotein of the Indiana and New Jersey serotypes of vesicular stomatitis virus and with the glycoprotein of rabies virus, using a computer program which determined optimal alignment. An amino acid identity of approximately 20% was found between infectious hematopoietic necrosis virus and the two vesicular stomatitis virus serotypes and between infectious hematopoietic necrosis virus and rabies virus. The positions and sizes of the signal sequence and transmembrane domain and the possible glycosylation sites were determined.
PMID - 2168974
We report the entire glycoprotein (G) gene nucleotide sequences of 26 vesicular stomatitis virus Indiana serotype (VSV IND) type 1 isolates from North and Central America. These sequences are also compared with partial G gene sequences of VSV IND type 2 (Cocal) and type 3 (Alagoas) viruses and the complete G gene sequences of the more distantly related VSV New Jersey (NJ) and Chandipura viruses. Phylogenetic analysis of the G gene sequences by maximum parsimony revealed four major lineages or subtypes within the classical VSV IND (type 1) viruses, each with a distinct geographic distribution. A high degree of VSV genetic diversity was found in Central America, with several virus subtypes of both VSV IND and NJ serotypes existing in this mainly enzootic disease region. Nineteen percent sequence variation but no deletions or insertions were evident within the 5' noncoding and the coding regions of the VSV IND type 1 G genes. In addition to numerous base substitutions, the 3' noncoding regions of these viruses also contained numerous base insertions and deletions. This resulted in striking variation in G gene sizes, with gene lengths ranging from 1,652 to 1,868 nucleotides. As the VSV IND type 1 subtypes have diverged from the common ancestor with the NJ subtypes, their G mRNAs have accumulated more 3' noncoding sequence inserts, ranging up to 303 nucleotides in length. These primarily consist of an imprecise reiteration of the sequence UUUUUAA, apparently generated by a unique polymerase stuttering error. Analysis of the deduced amino acid sequence differences among VSV IND type 1 viruses revealed numerous substitutions within defined antigenic epitopes, suggesting that immune selection may play a role in the evolution of these viruses.
PMID - 1736537
The sequence of the glycoprotein gene of a street rabies virus was determined directly using fragments of a rabid dog brain after PCR amplification. Compared with that of the prototype strain CVS, this sequence displayed 10% divergence in overall amino acid composition. However only 6% divergence was noted in the ectodomain suggesting that structural constraints are exerted on this portion of the glycoprotein. A human strain isolated on cell culture from the saliva of a patient with clinical rabies had only five amino acid differences with the canine isolate, an indication of their close relatedness. These differences could have originated during transmission from dog to dog, or from dog to man, or during isolation on cell culture; they are nonetheless indicative of a genetic evolution of street rabies virus. This evolution was further evidenced by the selection of cell-adapted variants which displayed new amino acid substitutions in the glycoprotein. One of them concerned antigenic site III where arginine at position 333 was replaced by glutamine. As expected this substitution conferred resistance to a site IIIa monoclonal antibody (MAb), but surprisingly did not abolish neurovirulence for adult mice. However, a decrease in the neurovirulence of the cell-adapted variant in the presence of a site IIIa specific MAb was noted, suggesting that neurovirulence was due to a subpopulation neutralizable by the MAb. Simultaneous presence of both the parental and variant sequences was indeed evidenced in the brain of a mouse inoculated with the cell-adapted variant; during multiplication in the mouse brain, the frequency of the parental sequence rose from less than 10% to nearly 50%, indicating the selective advantage conferred by arginine 333 in nervous tissue. Altogether these results were suggestive of an intrinsic heterogeneity of street rabies virus. This heterogeneity was further demonstrated by the sequencing of molecular clones of the glycoprotein gene, which revealed that only one-third of the viral genomes present in the brain of a rabid dog had the consensus sequence. Two-thirds of the clones analyzed displayed from one to three amino acid substitutions. Such heterogeneous populations have been referred to as quasispecies, a concept which implies heterogeneous populations kept together in a dynamic equilibrium. This equilibrium could be rapidly displaced, giving the virus the capacity to adapt easily to new environmental conditions.