

We have previously described a system, PROTEAN-1, which is able to use multiple sources of uncertain data to produce estimates of structure. PROTEAN-1 is based on a constraint-satisfaction paradigm and is summarized below. PROTEAN is able to model the structure of a large ribonucleoprotein complex (the central domain of the 30S ribosomal subunit) from a combination of constraints derived from sequence analysis and experiments. The ease with which the data analysis protocols can be changed, and the consequent effects on the structural model can be gauged, makes PROTEAN a good program for exploring the structural implications of sparse and noisy data. The underlying constraint satisfaction engine can guarantee that the resulting structures satisfy all the constraints provided, and that they provide an upper bound on the set of structures that are compatible with the data.
The ribosome is the site of translation of messenger RNA into
protein. It is composed of two subunits. In prokaryotes, the large subunit
is called 50S and the small subunit is called 30S. The 30S subunit consists
of a single strand of RNA (the 16S rRNA, 1542 bases), and 21 proteins ranging
in molecular weight from 9 kD to 61 kD. The 30S subunit can exist independently
from the 50S subunit and be reconstituted from its components in vitro.
The 30S subunit is the site of translation initiation. An understanding
of its structure is useful for understanding the mechanisms of protein
translation, and perhaps for designing drugs to modify these mechanisms.
We have created a model of the 16S rRNA structure using hydroxyl
data and a computational method based on constraint satisfaction to define
the allowable conformational space for each secondary structural element
in the context of protein positions. A description of the work and coordinates
of the model are available here
.
We published a paper in J. Mol. Graphics describing the program,
proteand
which is designed to display various representations of structural uncertainty
for macromolecules, including overlapping stick drawings, ellipsoids of
uncertainty, and secondary structure accessible volumes. These representations
are closely related to our methodology for computing the structure of ribosomal
RNA. The program runs on Silicon Graphics (SGI) machines and sample input
files and binary executable code are available in ftp://ftp-smi.stanford.edu/pub/altman/tar.proteand.
The initial model of the central domain of the 16S ribosomal RNA
structure is contained as a sample input to the display program, proteanD.
With John
Gennari and Mark
Musen of the Knowledge Modeling Group at Stanford Section on Medical
Informatics, we have investigated the use of reusable constraint satisfaction
methods in the context of ribosome modeling. The propose-and-revise methodology
can be extended to the ribosome problem, although at considerable loss
of problem solving efficiency. See references below.