Nucleic acids are biopolymers, macromolecules, essential to all known forms of life. They are composed of nucleotides, which are the monomers made of three components: a 5-carbon sugar, a phosphate group and a nitrogenous base. The two main classes of nucleic acids are deoxyribonucleic acid (DNA) and ribonucleic acid (RNA). If the sugar is ribose, the polymer is RNA; if the sugar is the ribose derivative deoxyribose, the polymer is DNA.
Nucleic acids are naturally occurring chemical compounds that serve as the primary information-carrying molecules in cells and make up the genetic material. Nucleic acids are found in abundance in all living things, where they create, encode, and then store information of every living cell of every life-form on Earth. In turn, they function to transmit and express that information inside and outside the cell nucleus to the interior operations of the cell and ultimately to the next generation of each living organism. The encoded information is contained and conveyed via the nucleic acid sequence, which provides the 'ladder-step' ordering of nucleotides within the molecules of RNA and DNA. They play an especially important role in directing protein synthesis.
Strings of nucleotides are bonded to form helical backbones—typically, one for RNA, two for DNA—and assembled into chains of base-pairs selected from the five primary, or canonical, nucleobases, which are: adenine, cytosine, guanine, thymine, and uracil. Thymine occurs only in DNA and uracil only in RNA. Using amino acids and the process known as protein synthesis, the specific sequencing in DNA of these nucleobase-pairs enables storing and transmitting coded instructions as genes. In RNA, base-pair sequencing provides for manufacturing new proteins that determine the frames and parts and most chemical processes of all life forms.
Nucleic acid was first discovered by Friedrich Miescher in 1869 at the University of Tübingen, Germany. He gave its first name as nuclein. In the early 1880s Albrecht Kossel further purified the substance and discovered its highly acidic properties. He later also identified the nucleobases. In 1889 Richard Altmann created the term nucleic acid – at that time DNA and RNA were not differentiated. In 1938 Astbury and Bell published the first X-ray diffraction pattern of DNA.
Experimental studies of nucleic acids constitute a major part of modern biological and medical research, and form a foundation for genome and forensic science, and the biotechnology and pharmaceutical industries.
Occurrence and nomenclature
The term nucleic acid is the overall name for DNA and RNA, members of a family of biopolymers, and is synonymous with polynucleotide. Nucleic acids were named for their initial discovery within the nucleus, and for the presence of phosphate groups (related to phosphoric acid). Although first discovered within the nucleus of eukaryotic cells, nucleic acids are now known to be found in all life forms including within bacteria, archaea, mitochondria, chloroplasts, and viruses (There is debate as to whether viruses are living or non-living). All living cells contain both DNA and RNA (except some cells such as mature red blood cells), while viruses contain either DNA or RNA, but usually not both. The basic component of biological nucleic acids is the nucleotide, each of which contains a pentose sugar (ribose or deoxyribose), a phosphate group, and a nucleobase. Nucleic acids are also generated within the laboratory, through the use of enzymes (DNA and RNA polymerases) and by solid-phase chemical synthesis. The chemical methods also enable the generation of altered nucleic acids that are not found in nature, for example peptide nucleic acids.
Molecular composition and size
Nucleic acids are generally very large molecules. Indeed, DNA molecules are probably the largest individual molecules known. Well-studied biological nucleic acid molecules range in size from 21 nucleotides (small interfering RNA) to large chromosomes (human chromosome 1 is a single molecule that contains 247 million base pairs).
In most cases, naturally occurring DNA molecules are double-stranded and RNA molecules are single-stranded. There are numerous exceptions, however—some viruses have genomes made of double-stranded RNA and other viruses have single-stranded DNA genomes, and, in some circumstances, nucleic acid structures with three or four strands can form.
Nucleic acids are linear polymers (chains) of nucleotides. Each nucleotide consists of three components: a purine or pyrimidine nucleobase (sometimes termed nitrogenous base or simply base), a pentose sugar, and a phosphate group which makes the molecule acidic. The substructure consisting of a nucleobase plus sugar is termed a nucleoside. Nucleic acid types differ in the structure of the sugar in their nucleotides–DNA contains 2'-deoxyribose while RNA contains ribose (where the only difference is the presence of a hydroxyl group). Also, the nucleobases found in the two nucleic acid types are different: adenine, cytosine, and guanine are found in both RNA and DNA, while thymine occurs in DNA and uracil occurs in RNA.
The sugars and phosphates in nucleic acids are connected to each other in an alternating chain (sugar-phosphate backbone) through phosphodiester linkages. In conventional nomenclature, the carbons to which the phosphate groups attach are the 3'-end and the 5'-end carbons of the sugar. This gives nucleic acids directionality, and the ends of nucleic acid molecules are referred to as 5'-end and 3'-end. The nucleobases are joined to the sugars via an N-glycosidic linkage involving a nucleobase ring nitrogen (N-1 for pyrimidines and N-9 for purines) and the 1' carbon of the pentose sugar ring.
Non-standard nucleosides are also found in both RNA and DNA and usually arise from modification of the standard nucleosides within the DNA molecule or the primary (initial) RNA transcript. Transfer RNA (tRNA) molecules contain a particularly large number of modified nucleosides.
Double-stranded nucleic acids are made up of complementary sequences, in which extensive Watson-Crick base pairing results in a highly repeated and quite uniform Nucleic acid double-helical three-dimensional structure. In contrast, single-stranded RNA and DNA molecules are not constrained to a regular double helix, and can adopt highly complex three-dimensional structures that are based on short stretches of intramolecular base-paired sequences including both Watson-Crick and noncanonical base pairs, and a wide range of complex tertiary interactions.
Nucleic acid molecules are usually unbranched and may occur as linear and circular molecules. For example, bacterial chromosomes, plasmids, mitochondrial DNA, and chloroplast DNA are usually circular double-stranded DNA molecules, while chromosomes of the eukaryotic nucleus are usually linear double-stranded DNA molecules. Most RNA molecules are linear, single-stranded molecules, but both circular and branched molecules can result from RNA splicing reactions. The total amount of pyrimidines in a double-stranded DNA molecule is equal to the total amount of purines. The diameter of the helix is about 20Å.
One DNA or RNA molecule differs from another primarily in the sequence of nucleotides. Nucleotide sequences are of great importance in biology since they carry the ultimate instructions that encode all biological molecules, molecular assemblies, subcellular and cellular structures, organs, and organisms, and directly enable cognition, memory, and behavior. Enormous efforts have gone into the development of experimental methods to determine the nucleotide sequence of biological DNA and RNA molecules, and today hundreds of millions of nucleotides are sequenced daily at genome centers and smaller laboratories worldwide. In addition to maintaining the GenBank nucleic acid sequence database, the National Center for Biotechnology Information (NCBI, https://www.ncbi.nlm.nih.gov) provides analysis and retrieval resources for the data in GenBank and other biological data made available through the NCBI web site.
Deoxyribonucleic acid (DNA) is a nucleic acid containing the genetic instructions used in the development and functioning of all known living organisms. The chemical DNA was first discovered in 1869,but its genetic inheritance was not demonstrated until 1943. The DNA segments carrying this genetic information are called genes. Likewise, other DNA sequences have structural purposes or are involved in regulating the use of this genetic information. Along with RNA and proteins, DNA is one of the three major macromolecules that are essential for all known forms of life. DNA consists of two long polymers of simple units called nucleotides, with backbones made of sugars and phosphate groups joined by ester bonds. These two strands run in opposite directions to each other and are, therefore, anti-parallel. Attached to each sugar is one of four types of molecules called nucleobases (informally, bases). It is the sequence of these four nucleobases along the backbone that encodes information. This information is read using the genetic code, which specifies the sequence of the amino acids within proteins. The code is read by copying stretches of DNA into the related nucleic acid RNA in a process called transcription. Within cells, DNA is organized into long structures called chromosomes. During cell division these chromosomes are duplicated in the process of DNA replication, providing each cell its own complete set of chromosomes. Eukaryotic organisms (animals, plants, fungi, and protists) store most of their DNA inside the cell nucleus and some of their DNA in organelles, such as mitochondria or chloroplasts. In contrast, prokaryotes (bacteria and archaea) store their DNA only in the cytoplasm. Within the chromosomes, chromatin proteins such as histones compact and organize DNA. These compact structures guide the interactions between DNA and other proteins, helping control which parts of the DNA are transcribed.
Ribonucleic acid (RNA) functions in converting genetic information from genes into the amino acid sequences of proteins. The three universal types of RNA include transfer RNA (tRNA), messenger RNA (mRNA), and ribosomal RNA (rRNA). Messenger RNA acts to carry genetic sequence information between DNA and ribosomes, directing protein synthesis and carries instructions from DNA in the nucleus to ribosome . Ribosomal RNA reads the DNA sequence, and catalyzes peptide bond formation. Transfer RNA serves as the carrier molecule for amino acids to be used in protein synthesis, and is responsible for decoding the mRNA. In addition, many other classes of RNA are now known.
Artificial nucleic acid
Artificial nucleic acid analogues have been designed and synthesized by chemists, and include peptide nucleic acid, morpholino- and locked nucleic acid, glycol nucleic acid, and threose nucleic acid. Each of these is distinguished from naturally occurring DNA or RNA by changes to the backbone of the molecules.
- Comparison of nucleic acid simulation software
- History of biochemistry
- History of molecular biology
- History of RNA biology – Aspect of history of a biological field of study
- Molecular biology – Branch of biology which studies biological activities at molecular level
- Nucleic acid methods
- Nucleic acid metabolism – Process
- Nucleic acid structure – Biomolecular structure of nucleic acids such as DNA and RNA
- Nucleic acid thermodynamics – Study of how temperature affects the nucleic acid structure
- Oligonucleotide synthesis – Chemical synthesis of relatively short fragments of nucleic acids with defined chemical structure
- Quantification of nucleic acids
- "Nucleic Acid". Genome.gov. Retrieved 1 January 2022.
- "What is DNA". What is DNA. Linda Clarks. Retrieved 6 August 2016.
- Bill Bryson, A Short History of Nearly Everything, Broadway Books, 2015.p. 500.
- Dahm R (January 2008). "Discovering DNA: Friedrich Miescher and the early years of nucleic acid research". Human Genetics. 122 (6): 565–81. doi:10.1007/s00439-007-0433-0. PMID 17901982. S2CID 915930.
- "BIOdotEDU". www.brooklyn.cuny.edu. Retrieved 1 January 2022.
- Cox M, Nelson D (2008). Principles of Biochemistry. Susan Winslow. p. 288. ISBN 9781464163074.
- "DNA Structure". What is DNA. Linda Clarks. Retrieved 6 August 2016.
- Lander ES, Linton LM, Birren B, Nusbaum C, Zody MC, Baldwin J, et al. (February 2001). "Initial sequencing and analysis of the human genome" (PDF). Nature. 409 (6822): 860–921. Bibcode:2001Natur.409..860L. doi:10.1038/35057062. PMID 11237011.
- Venter JC, Adams MD, Myers EW, Li PW, Mural RJ, Sutton GG, et al. (February 2001). "The sequence of the human genome". Science. 291 (5507): 1304–51. Bibcode:2001Sci...291.1304V. doi:10.1126/science.1058040. PMID 11181995.
- Budowle B, van Daal A (April 2009). "Extracting evidence from forensic DNA analyses: future molecular biology directions". BioTechniques. 46 (5): 339–40, 342–50. doi:10.2144/000113136. PMID 19480629.
- Elson D (1965). "Metabolism of Nucleic Acids (Macromolecular DNA and RNA)". Annual Review of Biochemistry. 34: 449–86. doi:10.1146/annurev.bi.34.070165.002313. PMID 14321176.
- Dahm R (January 2008). "Discovering DNA: Friedrich Miescher and the early years of nucleic acid research". Human Genetics. nih.gov. 122 (6): 565–81. doi:10.1007/s00439-007-0433-0. PMID 17901982. S2CID 915930.
- Brock TD, Madigan MT (2009). Brock biology of microorganisms. Pearson / Benjamin Cummings. ISBN 978-0-321-53615-0.
- Hardinger, Steven; University of California, Los Angeles (2011). "Knowing Nucleic Acids" (PDF). ucla.edu.
- Mullis, Kary B. The Polymerase Chain Reaction (Nobel Lecture). 1993. (retrieved December 1, 2010) http://nobelprize.org/nobel_prizes/chemistry/laureates/1993/mullis-lecture.html
- Verma S, Eckstein F (1998). "Modified oligonucleotides: synthesis and strategy for users". Annual Review of Biochemistry. 67: 99–134. doi:10.1146/annurev.biochem.67.1.99. PMID 9759484.
- Gregory SG, Barlow KF, McLay KE, Kaul R, Swarbreck D, Dunham A, et al. (May 2006). "The DNA sequence and biological annotation of human chromosome 1". Nature. 441 (7091): 315–21. Bibcode:2006Natur.441..315G. doi:10.1038/nature04727. PMID 16710414.
- Todorov TI, Morris MD (April 2002). National Institutes of Health. "Comparison of RNA, single-stranded DNA and double-stranded DNA behavior during capillary electrophoresis in semidilute polymer solutions". Electrophoresis. nih.gov. 23 (7–8): 1033–44. doi:10.1002/1522-2683(200204)23:7/8<1033::AID-ELPS1033>3.0.CO;2-7. PMID 11981850.
- Margaret Hunt; University of South Carolina (2010). "RN Virus Replication Strategies". sc.edu.
- McGlynn P, Lloyd RG (August 1999). "RecG helicase activity at three- and four-strand DNA structures". Nucleic Acids Research. 27 (15): 3049–56. doi:10.1093/nar/27.15.3049. PMC 148529. PMID 10454599.
- Stryer, Lubert; Berg, Jeremy Mark; Tymoczko, John L. (2007). Biochemistry. San Francisco: W.H. Freeman. ISBN 978-0-7167-6766-4.
- Rich A, RajBhandary UL (1976). "Transfer RNA: molecular structure, sequence, and properties". Annual Review of Biochemistry. 45: 805–60. doi:10.1146/annurev.bi.45.070176.004105. PMID 60910.
- Watson JD, Crick FH (April 1953). "Molecular structure of nucleic acids; a structure for deoxyribose nucleic acid". Nature. 171 (4356): 737–8. Bibcode:1953Natur.171..737W. doi:10.1038/171737a0. PMID 13054692. S2CID 4253007.
- Ferré-D'Amaré AR, Doudna JA (1999). "RNA folds: insights from recent crystal structures". Annual Review of Biophysics and Biomolecular Structure. 28: 57–73. doi:10.1146/annurev.biophys.28.1.57. PMID 10410795.
- Alberts, Bruce (2008). Molecular biology of the cell. New York: Garland Science. ISBN 978-0-8153-4105-5.
- Gilbert, Walter G. 1980. DNA Sequencing and Gene Structure (Nobel Lecture) http://nobelprize.org/nobel_prizes/chemistry/laureates/1980/gilbert-lecture.html
- Sanger, Frederick. 1980. Determination of Nucleotide Sequences in DNA (Nobel Lecture) http://nobelprize.org/nobel_prizes/chemistry/laureates/1980/sanger-lecture.html
- NCBI Resource Coordinators (January 2014). "Database resources of the National Center for Biotechnology Information". Nucleic Acids Research. 42 (Database issue): D7-17. doi:10.1093/nar/gkt1146. PMC 3965057. PMID 24259429.
- Wolfram Saenger, Principles of Nucleic Acid Structure, 1984, Springer-Verlag New York Inc.
- Bruce Alberts, Alexander Johnson, Julian Lewis, Martin Raff, Keith Roberts, and Peter Walter Molecular Biology of the Cell, 2007, ISBN 978-0-8153-4105-5. Fourth edition is available online through the NCBI Bookshelf: link
- Jeremy M Berg, John L Tymoczko, and Lubert Stryer, Biochemistry 5th edition, 2002, W H Freeman. Available online through the NCBI Bookshelf: link
- Astrid Sigel; Helmut Sigel; Roland K. O. Sigel, eds. (2012). Interplay between Metal Ions and Nucleic Acids. Metal Ions in Life Sciences. Vol. 10. Springer. doi:10.1007/978-94-007-2172-2. ISBN 978-94-007-2171-5. S2CID 92951134.