1. Processing of an anglerfish somatostatin precursor to a hydroxylysine-containing somatostatin 28
J Spiess, B D Noe Proc Natl Acad Sci U S A. 1985 Jan;82(2):277-81. doi: 10.1073/pnas.82.2.277.
A novel 28-residue somatostatin (SS) has been isolated from anglerfish pancreatic islets and characterized by complete Edman degradation, peptide mapping, and amino acid analysis. The primary structure of this anglerfish SS-28 (aSS-28) containing hydroxylysine (Hyl) was established to be H-Ser-Val-Asp-Ser-Thr-Asn-Asn-Leu-Pro-Pro-Arg-Glu-Arg-Lys-Ala-Gly-Cys- Lys-Asn-Phe-Tyr-Trp-Hyl-Gly-Phe-Thr-Ser-Cys-OH. This sequence (with the exception of hydroxylysine-23, which is replaced by lysine) is identical to the sequence of the COOH-terminal 28 residues of prepro-SS II predicted on the basis of cDNA analysis [Hobart, P., Crawford, R., Shen, L., Pictet, R. & Rutter, W. J. (1980) Nature (London) 288, 137-141]. This is the first instance in which hydroxylysine (to date characteristically observed in collagen or collagen-like structures) has been found in a potential regulatory peptide. Chromatographic characterization of peptides, radiolabeled in islet culture, revealed that aSS-28 contained 10-12% of the radioactivity incorporated into the 8000- to 1000-dalton SS-like polypeptides, whereas 88-90% of this radioactivity was detected in anglerfish SS-14. It appears probable that aSS-28 represents the predominant primary cleavage product derived from prepro-SS II by cleavage at the COOH-terminal side of a single arginine. Based on knowledge of the collagen biosynthesis, it is speculated that hydroxylation may take place as an early post-translational event.
2. Statistical analysis of atomic contacts at RNA-protein interfaces
M Treger, E Westhof J Mol Recognit. 2001 Jul-Aug;14(4):199-214. doi: 10.1002/jmr.534.
Forty-five crystals of complexes between proteins and RNA molecules from the Protein Data Bank have been statistically surveyed for the number of contacts between RNA components (phosphate, ribose and the four bases) and amino acid side chains. Three groups of complexes were defined: the tRNA synthetases; the ribosomal complexes; and a third group containing a variety of complexes. The types of atomic contacts were a priori classified into ionic, neutral H-bond, C-H...O H-bond, or van der Waals interaction. All the contacts were organized into a relational database which allows for statistical analysis. The main conclusions are the following: (i) in all three groups of complexes, the most preferred amino acids (Arg, Asn, Ser, Lys) and the less preferred ones (Ala, Ile, Leu, Val) are the same; Trp and Cys are rarely observed (respectively 15 and 5 amino acids in the ensemble of interfaces); (ii) of the total number of amino acids located at the interfaces 22% are hydrophobic, 40% charged (positive 32%, negative 8%), 30% polar and 8% are Gly; (iii) in ribosomal complexes, phosphate is preferred over ribose, which is preferred over the bases, but there is no significant preference in the other two groups; (iv) there is no significant prevalence of a base type at protein-RNA interfaces, but specifically Arg and Lys display a preference for phosphate over ribose and bases; Pro and Asn prefer bases over ribose and phosphate; Met, Phe and Tyr prefer ribose over phosphate and bases. Further, Ile, Pro, Ser prefer A over the others; Leu prefers C; Asp and Gly prefer G; and Asn prefers U. Considering the contact types, the following conclusions could be drawn: (i) 23% of the contacts are via potential H-bonds (including CH...O H-bonds and ionic interactions), 72% belong to van der Waals interactions and 5% are considered as short contacts; (ii) of all potential H-bonds, 54% are standard, 33% are of the C-H...O type and 13% are ionic; (iii) the Watson-Crick sites of G, O6(G) and principally N2(G) and the hydroxyl group O2' is more often involved in H-bonds than expected; the protein main chain is involved in 32% and the side chains in 68% of the H-bonds; considering the neutral and ionic H-bonds, the following couples are more frequent than expected-base A-Ser, base G-Asp/Glu, base U-Asn. The RNA CH groups interact preferentially with oxygen atoms (62% on the main chain and 19% on the side chains); (iv) the bases are involved in 38% of all H-bonds and more than 26% of the H-bonds have the H donor group on the RNA; (v) the atom O2' is involved in 21% of all H-bonds, a number greater than expected; (vi) amino acids less frequently in direct contact with RNA components interact frequently via their main chain atoms through water molecules with RNA atoms; in contrast, those frequently observed in direct contact, except Ser, use instead their side chain atoms for water bridging interactions.
3. Amino acid sequence of an anti-tumor protein from Rana pipiens oocytes and early embryos. Homology to pancreatic ribonucleases
W Ardelt, S M Mikulski, K Shogen J Biol Chem. 1991 Jan 5;266(1):245-51.
Rana pipiens oocytes and early embryos contain large amounts of a basic protein with antiproliferative/cytotoxic activity against several tumor cell lines in vitro (Darzynkiewicz, Z., Carter, S. P., Mikulski, S. M., Ardelt, W., and Shogen, K. (1988) Cell Tissue Kinet. 21, 169-182; Mikulski, S.M., Viera, A., Ardelt, W., Menduke, H., and Shogen, K. (1990) Cell Tissue Kinet. 23, 237-246), as well as antitumor activity in vivo (Mikulski, S. M., Ardelt, W., Shogen, K., Bernstein, E. H., and Menduke, H. (1990) J. Natl. Cancer Inst. 82, 151-153). The protein, provisionally named P-30 Protein, was purified to homogeneity from early embryos and characterized. It is a single-chain protein consisting of 104 amino acid residues in the following sequence: less than Glu1-Asp-Trp-Leu-Thr-Phe-Gln-Lys-Lys-His-Ile-Thr-Asn-Thr- Arg15-Asp-Val-Asp-Cys-Asp-Ans-Ile-Met-Ser-Thr-Asn-Leu-Phe-His-C ys30-Lys-Asp-Lys - Asn-Thr-Phe-Ile-Tyr-Ser-Arg-Pro-Glu-Pro-Val-Lys45-Ala-Ile-Cys-Lys- Gly-Ile-Ile- Ala-Ser-Lys-Asn-Val-Leu-Thr-Thr60-Ser-Glu-Phe-Tyr-Leu-Ser-Asp -Cys-Asn-Val-Thr-Ser-Arg-Por-Cys75-Lys-Tyr-Lys-Leu-Lys-Lys-Ser-Thr -Asn-Lys-Phe- Cys-Val-Thr-Cys90-Glu-Asn-Gln-Ala-Pro-Val-His-Phe-Val-Gly-Val-Gly- Ser-Cys104-OH . Its molecular weight calculated from the sequence is 11,819. The sequence homology clearly indicates that the protein belongs to the superfamily of pancreatic ribonuclease. It is also demonstrated that it indeed exhibits a ribonucleolytic activity against highly polymerized RNA and that this activity seems to be essential for its antiproliferative/cytotoxic effects.