Biomolecular structure

Biomolecular structure is the intricate folded, three-dimensional shape that is formed by a

hairpin loops

, bulges, and internal loops for nucleic acids. The terms primary, secondary, tertiary, and quaternary structure were introduced by Kaj Ulrik Linderstrøm-Lang in his 1951 Lane Medical Lectures at Stanford University.

Primary structure

The primary structure of a

nucleotides

.

The

3' end

. The nucleic acid sequence refers to the exact sequence of nucleotides that comprise the whole molecule. Often, the primary structure encodes sequence motifs that are of functional importance. Some examples of such motifs are: the C/D^[1] and H/ACA boxes^[2] of

Shine-Dalgarno sequence,^[3]

the Kozak consensus sequence^[4] and the RNA polymerase III terminator.^[5]

Secondary structure

The secondary structure of a protein is the pattern of hydrogen bonds in a biopolymer. These determine the general three-dimensional form of local segments of the biopolymers, but does not describe the global structure of specific atomic positions in three-dimensional space, which are considered to be tertiary structure. Secondary structure is formally defined by the hydrogen bonds of the biopolymer, as observed in an atomic-resolution structure. In proteins, the secondary structure is defined by patterns of hydrogen bonds between backbone amine and carboxyl groups (sidechain–mainchain and sidechain–sidechain hydrogen bonds are irrelevant), where the DSSP definition of a hydrogen bond is used.

The secondary structure of a nucleic acid is defined by the hydrogen bonding between the nitrogenous bases.

For proteins, however, the hydrogen bonding is correlated with other structural features, which has given rise to less formal definitions of secondary structure. For example, helices can adopt backbone dihedral angles in some regions of the Ramachandran plot; thus, a segment of residues with such dihedral angles is often called a helix, regardless of whether it has the correct hydrogen bonds. Many other less formal definitions have been proposed, often applying concepts from the differential geometry of curves, such as curvature and torsion. Structural biologists solving a new atomic-resolution structure will sometimes assign its secondary structure by eye and record their assignments in the corresponding Protein Data Bank (PDB) file.

The

stem loops. There are many secondary structure elements of functional importance to biological RNA. Famous examples include the Rho-independent terminator stem loops and the transfer RNA (tRNA) cloverleaf. There is a minor industry of researchers attempting to determine the secondary structure of RNA molecules. Approaches include both experimental and computational methods (see also the List of RNA structure prediction software

).

Tertiary structure

The

primary structure (its sequence of amino acids or nucleotides

).

Quaternary structure

Further information: Protein quaternary structure and Nucleic acid quaternary structure

The protein quaternary structure ^[a] refers to the number and arrangement of multiple protein molecules in a multi-subunit complex.
For nucleic acids, the term is less common, but can refer to the higher-level organization of DNA in chromatin,^[7] including its interactions with histones, or to the interactions between separate RNA units in the ribosome^[8]^[9] or spliceosome.

Structure determination

Further information: Protein structure and Nucleic acid structure determination

Structure probing is the process by which biochemical techniques are used to determine biomolecular structure.^[10] This analysis can be used to define the patterns that can be used to infer the molecular structure, experimental analysis of molecular structure and function, and further understanding on development of smaller molecules for further biological research.^[11] Structure probing analysis can be done through many different methods, which include chemical probing, hydroxyl radical probing, nucleotide analog interference mapping (NAIM), and in-line probing.^[10]
paracrystals with a significant degree of disorder (over 20%),^[17]^[18]
and the structure is not tractable using only the standard analysis.

In contrast, the standard analysis, involving only Fourier transforms of Bessel functions^[19] and DNA molecular models, is still routinely used to analyze A-DNA and Z-DNA X-ray diffraction patterns.^[20]

Structure prediction

Biomolecular structure prediction is the prediction of the three-dimensional structure of a protein from its amino acid sequence, or of a nucleic acid from its nucleobase (base) sequence. In other words, it is the prediction of secondary and tertiary structure from its primary structure. Structure prediction is the inverse of biomolecular design, as in rational design, protein design, nucleic acid design, and biomolecular engineering.

Protein structure prediction is one of the most important goals pursued by bioinformatics and theoretical chemistry. Protein structure prediction is of high importance in medicine (for example, in drug design) and biotechnology (for example, in the design of novel enzymes). Every two years, the performance of current methods is assessed in the Critical Assessment of protein Structure Prediction (CASP) experiment.

There has also been a significant amount of

secondary structure or intra-molecular base-pairing interactions of the molecule. This is shown by the high conservation of base pairings

across diverse species.

Secondary structure of small nucleic acid molecules is determined largely by strong, local interactions such as

base stacking. Summing the free energy for such interactions, usually using a nearest-neighbor method, provides an approximation for the stability of given structure.^[21] The most straightforward way to find the lowest free energy structure would be to generate all possible structures and calculate the free energy for them, but the number of possible structures for a sequence increases exponentially with the length of the molecule.^[22] For longer molecules, the number of possible secondary structures is vast.^[21]

Sequence covariation methods rely on the existence of a data set composed of multiple homologous RNA sequences with related but dissimilar sequences. These methods analyze the covariation of individual base sites in evolution; maintenance at two widely separated sites of a pair of base-pairing nucleotides indicates the presence of a structurally required hydrogen bond between those positions. The general problem of pseudoknot prediction has been shown to be NP-complete.^[23]

Design

Biomolecular design can be considered the inverse of structure prediction. In structure prediction, the structure is determined from a known sequence, whereas, in protein or nucleic acid design, a sequence that will form a desired structure is generated.

Other biomolecules

Other biomolecules, such as polysaccharides, polyphenols and lipids, can also have higher-order structure of biological consequence.

Notes

distributive numbers, and follows binary and ternary; while quartary is derived from Latin ordinal numbers
, and follows secondary and tertiary. However, quaternary is standard in biology.

References

PMID 9649444
.

PMID 9106664
.

S2CID 4162567
.

PMID 3313277
.

S2CID 9982829
.

doi:10.1351/goldbook.T06282

S2CID 35930758
.

PMID 6206780
.

PMID 11296253
.

^
ISBN 978-90-901323-4-1
.

ISBN 978-0-87969-589-7
.

doi:10.1107/s0365110x53001939
.

S2CID 4268222
.

S2CID 4280080
.

PMID 7441761
.

S2CID 189888972
.

^ Hosemann R, Bagchi RN (1962). Direct analysis of diffraction by matter. Amsterdam/New York: North-Holland.

doi:10.1107/s0567739478001540
.

^ "Bessel functions and diffraction by helical structures". planetphysics.org.^{[permanent dead link]}

^ "X-Ray Diffraction Patterns of Double-Helical Deoxyribonucleic Acid (DNA) Crystals". planetphysics.org. Archived from the original on 24 July 2009.

^
PMID 16500677
.

S2CID 189885784
.

PMID 11108471
.

v
t
e
Biomolecular structure
Protein structure

Primary

Secondary

Tertiary

Quaternary

Determination

Prediction

Design

Thermodynamics

Nucleic acid structure

Primary

Secondary

Tertiary

Quaternary

Determination

Prediction

Design

Thermodynamics

See also

Protein

Protein domain

Protein engineering

Proteasome

Nucleic acid

DNA

RNA

Structural motif

Nucleic acid double helix

Retrieved from "https://en.wikipedia.org/w/index.php?title=Biomolecular_structure&oldid=1178888325"

[7] stributive numbers, and follows binary and ternary; while quartary is derived from Latin ordinal numbers
, and follows secondary and tertiary. However, quaternary is standard in biology.

[1] PMID 9649444
.

[2] PMID 9106664
.

[3] S2CID 4162567
.

[Kozak1987-4] PMID 3313277
.

[pmid6263489-5] S2CID 9982829
.

[6] doi:10.1351/goldbook.T06282

[8] S2CID 35930758
.

[9] PMID 6206780
.

[10] PMID 11296253
.

[Teunissen1979-11] 
ISBN 978-90-901323-4-1
.

[12] ISBN 978-0-87969-589-7
.

[13] :10.1107/s0365110x53001939
.

[NatFranGos-14] S2CID 4268222
.

[NatWilk-15] S2CID 4280080
.

[16] PMID 7441761
.

[17] S2CID 189888972
.

[18] Hosemann R, Bagchi RN (1962). Direct analysis of diffraction by matter. Amsterdam/New York: North-Holland.

[19] :10.1107/s0567739478001540
.

[20] "Bessel functions and diffraction by helical structures". planetphysics.org.^{[permanent dead link]}

[21] "X-Ray Diffraction Patterns of Double-Helical Deoxyribonucleic Acid (DNA) Crystals". planetphysics.org. Archived from the original on 24 July 2009.

[Mathews06-22] 
PMID 16500677
.

[Zuker84-23] S2CID 189885784
.

[Lyngso00-24] PMID 11108471
.

[1]

[2]

[3]

[4]

[5]

[a]

[7]

[8]

[9]

[10]

[11]

[17]

[18]

[19]

[20]

[21]

[22]

[23]