16S ribosomal RNA

16

Shine-Dalgarno sequence

and provides most of the SSU structure.
The genes coding for it are referred to as 16S rRNA genes and are used in reconstructing
bacterium.^[4]

Functions

Like the large (23S) ribosomal RNA, it has a structural role, acting as a scaffold defining the positions of the ribosomal proteins.

The
protein synthesis^[5]

Interacts with 23S, aiding in the binding of the two ribosomal subunits (
30S
)

Stabilizes correct codon-anticodon pairing in the A-site by forming a hydrogen bond between the N1 atom of adenine residues 1492 and 1493 and the 2′OH group of the mRNA backbone.

Structure

SSU Ribosomal RNA, bacteria and archaea. From Woese 1987.^[6]

Universal primers

The 16S rRNA gene is used for
annealing of "universal" primers.^[10] Mitochondrial and chloroplastic rRNA are also amplified.^[11]

The most common primer pair was devised by Weisburg et al. (1991)[7] and is currently referred to as 27F and 1492R; however, for some applications shorter amplicons may be necessary, for example for 454 sequencing with titanium chemistry the primer pair 27F-534R covering V1 to V3.^[12] Often 8F is used rather than 27F. The two primers are almost identical, but 27F has an M instead of a C. AGAGTTTGATCMTGGCTCAG compared with 8F.^[13]

Primer name Sequence (5′–3′) Ref.

8F AGA GTT TGA TCC TGG CTC AG ^[14]^[15]

27F AGA GTT TGA TCM TGG CTC AG ^[13]

336R ACT GCT GCS YCC CGT AGG AGT CT ^[16]

337F GAC TCC TAC GGG AGG CWG CAG ^[17]

518R GTA TTA CCG CGG CTG CTG G

533F GTG CCA GCM GCC GCG GTA A

785F GGA TTA GAT ACC CTG GTA

806R GGA CTA CVS GGG TAT CTA AT ^[18]^[19]

907R CCG TCA ATT CCT TTR AGT TT

928F TAA AAC TYA AAK GAA TTG ACG GG ^[16]

1100F YAA CGA GCG CAA CCC

1100R GGG TTG CGC TCG TTG

U1492R GGT TAC CTT GTT ACG ACT T ^[14]^[15]

1492R CGG TTA CCT TGT TAC GAC TT ^[20]

PCR and NGS applications

In addition to highly conserved primer binding sites, 16S rRNA gene sequences contain hypervariable regions that can provide species-specific signature sequences useful for identification of bacteria.^[21]^[22] As a result, 16S rRNA gene sequencing has become prevalent in medical microbiology as a rapid and cheap alternative to phenotypic methods of bacterial identification.^[23] Although it was originally used to identify bacteria, 16S sequencing was subsequently found to be capable of reclassifying bacteria into completely new species,^[24] or even genera.^[7]^[25] It has also been used to describe new species that have never been successfully cultured.^[26]^[27] With
gut flora.^[28]

Hypervariable regions

The bacterial 16S gene contains nine hypervariable regions (V1–V9), ranging from about 30 to 100
small ribosomal subunit.^[29] The degree of conservation varies widely between hypervariable regions, with more conserved regions correlating to higher-level taxonomy and less conserved regions to lower levels, such as genus and species.^[30] While the entire 16S sequence allows for comparison of all hypervariable regions, at approximately 1,500 base pairs long it can be prohibitively expensive for studies seeking to identify or characterize diverse bacterial communities.^[30] These studies commonly utilize the Illumina platform, which produces reads at rates 50-fold and 12,000-fold less expensive than 454 pyrosequencing and Sanger sequencing, respectively.^[31] While cheaper and allowing for deeper community coverage, Illumina sequencing only produces reads 75–250 base pairs long (up to 300 base pairs with Illumina MiSeq), and has no established protocol for reliably assembling the full gene in community samples.^[32] Full hypervariable regions can be assembled from a single Illumina run, however, making them ideal targets for the platform.^[32]

While 16S hypervariable regions can vary dramatically between bacteria, the 16S gene as a whole maintains greater length homogeneity than its eukaryotic counterpart (
CDC-watched pathogens tested, including anthrax.^[35]

While 16S hypervariable region analysis is a powerful tool for bacterial taxonomic studies, it struggles to differentiate between closely related species.[34] In the families Enterobacteriaceae, Clostridiaceae, and Peptostreptococcaceae, species can share up to 99% sequence similarity across the full 16S gene.^[36] As a result, the V4 sequences can differ by only a few nucleotides, leaving reference databases unable to reliably classify these bacteria at lower taxonomic levels.^[36] By limiting 16S analysis to select hypervariable regions, these studies can fail to observe differences in closely related taxa and group them into single taxonomic units, therefore underestimating the total diversity of the sample.^[34] Furthermore, bacterial genomes can house multiple 16S genes, with the V1, V2, and V6 regions containing the greatest intraspecies diversity.^[8] While not the most precise method of classifying bacterial species, analysis of the hypervariable regions remains one of the most useful tools available to bacterial community studies.^[36]

Promiscuity of 16S rRNA genes

Under the assumption that evolution is driven by
null mutant of E. coli as host, growth of the mutant strain was shown to be complemented by foreign 16S rRNA genes that were phylogenetically distinct from E. coli at the phylum level.^[37]^[38] Such functional compatibility was also seen in Thermus thermophilus.^[39] Furthermore, in T. thermophilus, both complete and partial gene transfer was observed. Partial transfer resulted in spontaneous generation of apparently random chimera between host and foreign bacterial genes. Thus, 16S rRNA genes may have evolved through multiple mechanisms, including vertical inheritance and horizontal gene transfer; the frequency of the latter may be much higher than previously thought.^[40]

16S ribosomal databases

The 16S rRNA gene is used as the standard for classification and identification of microbes, because it is present in most microbes and shows proper changes.[41] Type strains of 16S rRNA gene sequences for most bacteria and archaea are available on public databases, such as NCBI. However, the quality of the sequences found on these databases is often not validated. Therefore, secondary databases that collect only 16S rRNA sequences are widely used. The most frequently used databases are listed below:

MIMt

MIMt is a compact non-redundant 16S database for a rapid metagenomic samples identification. It is composed of 39.940 full 16S sequences belonging to 17,625 well classified bacteria and archaea species. All sequences were obtained from complete genomes deposited in NCBI and for each of the sequences full taxonomic hierarchy is provided. It contains no redundancy, so only one representative for each species was considered avoiding same sequences from differente strains, isolates or patovars resulting in a very fast tool for microorganisms identification, compatible with any classification software (QIIME, Mothur, DADA, etc).^[42]

EzBioCloud

EzBioCloud database, formerly known as EzTaxon, consists of a complete hierarchical taxonomic system containing 62,988 bacteria and archaea species/phylotypes which includes 15,290 valid published names as of September 2018. Based on the phylogenetic relationship such as maximum-likelihood and OrthoANI, all species/subspecies are represented by at least one 16S rRNA gene sequence. The EzBioCloud database is systematically curated and updated regularly which also includes novel candidate species. Moreover, the website provides bioinformatics tools such as ANI calculator, ContEst16S and 16S rRNA DB for QIIME and Mothur pipeline.^[43]^^

Ribosomal Database Project

The Ribosomal Database Project (RDP) is a curated database that offers ribosome data along with related programs and services. The offerings include phylogenetically ordered alignments of ribosomal RNA (rRNA) sequences, derived phylogenetic trees, rRNA secondary structure diagrams and various software packages for handling, analyzing and displaying alignments and trees. The data are available via ftp and electronic mail. Certain analytic services are also provided by the electronic mail server.^[44] Due to its large size the RDP database is often used as the basis for bioinformatic tool development and creating manually curated databases.^[45]

SILVA

LSU) ribosomal RNA (rRNA) sequences for all three domains of life as well as a suite of search, primer-design and alignment tools (Bacteria, Archaea and Eukarya).^[46]

GreenGenes

GreenGenes is a quality controlled, comprehensive 16S rRNA gene reference database and taxonomy based on a de novo phylogeny that provides standard operational taxonomic unit sets. Beware that it utilizes taxonomic terms proposed from phylogenetic methods applied years ago between 2012 and 2013. Since then, a variety of novel phylogenetic methods have been proposed for Archaea and Bacteria.[47]^[48]

References

S2CID 1024446
.

^
PMID 270744.

PMID 2112744
.

PMID 17071787
.

S2CID 22941368
.

PMID 2439888
.

^
PMID 1987160
.

^
PMID 14612235
.

PMID 28855596
.

PMID 26156036
.

PMID 33004967
.

^ "Human Microbiome Project DACC - Home". www.hmpdacc.org. Archived from the original on 2010-10-30.

^ ^a ^b "Primers, 16S ribosomal DNA - François Lutzoni's Lab". lutzonilab.net. Archived from the original on 2012-12-27.

^
PMID 1854644
.

^
ISBN 978-90-481-9038-6
.

^
PMID 8975607. Archived
(PDF) from the original on 2011-07-15.

PMID 34721319
.

S2CID 27232975
.

PMID 22267877
.

PMID 16751487
.

PMID 20923781
.

PMID 10383862
.

PMID 15489351
.

PMID 19395563
.

PMID 9542103
.

PMID 7520119
.

PMID 8899989
.

PMID 25226019
.

PMID 6462918
.

^
PMID 27000765
.

PMID 21460107
.

^
PMID 27688981
.

PMID 8811093
.

^
PMID 23460914
.

^
PMID 17391789
.

^
PMID 27148170
.

PMID 23112186
.

PMID 28855596
.

PMID 31375780
.

PMID 31375780
.

S2CID 21895693
.

^ "MIMt - (Mass Identification of Metagenomics tests)". mimt.bu.biopolis.pt. Retrieved 11 February 2024.

^ Yoon, S. H., Ha, S. M., Kwon, S., Lim, J., Kim, Y., Seo, H. and Chun, J. (2017). Introducing EzBioCloud: A taxonomically united database of 16S rRNA and whole genome assemblies. Int J Syst Evol Microbiol. 67:1613–1617

^ Larsen N, Olsen GJ, Maidak BL, McCaughey MJ, Overbeek R, Macke TJ, Marsh TL, Woese CR. (1993) The ribosomal database project. Nucleic Acids Res. Jul 1;21(13):3021-3.

PMID 26450747
.

^ Elmar Pruesse, Christian Quast, Katrin Knittel, Bernhard M. Fuchs, Wolfgang Ludwig, Jörg Peplies, Frank Oliver Glöckner (2007) Nucleic Acids Res. SILVA: a comprehensive online resource for quality checked and aligned ribosomal RNA sequence data compatible with ARB. December; 35(21): 7188–7196.

PMID 16820507
.

PMID 22134646
.

External links

University of Washington Laboratory Medicine: Molecular Diagnosis | Bacterial Sequencing

MIMt 16S database

The Ribosomal Database Project Archived 2020-08-19 at the Wayback Machine

Ribosomes and Ribosomal RNA: (rRNA)

SILVA rRNA database

Greengenes: 16S rDNA data and tools

EzBioCloud

50S
):
30S
):
16S
50S
):
30S
):
16S
60S
):
40S
):
18S
Mitochondrial (55S)
Large (28S):
MT-RNR2, 16S
MT-tRNA^Val

Small (39S):
50S
):
30S
):
16S
Ribosomal proteins
(See article table)

Retrieved from "https://en.wikipedia.org/w/index.php?title=16S_ribosomal_RNA&oldid=1212133039"

[Schluenzen-1] S2CID 1024446
.

[woese1977-2] 
PMID 270744.

[Woese_1990-3] PMID 2112744
.

[pmid17071787-4] PMID 17071787
.

[5] S2CID 22941368
.

[6] PMID 2439888
.

[Weisburg-7] 
PMID 1987160
.

[pmid14612235-8] 
PMID 14612235
.

[9] PMID 28855596
.

[Jay-10] PMID 26156036
.

[11] PMID 33004967
.

[12] "Human Microbiome Project DACC - Home". www.hmpdacc.org. Archived from the original on 2010-10-30.

[:5-13] "Primers, 16S ribosomal DNA - François Lutzoni's Lab". lutzonilab.net. Archived from the original on 2012-12-27.

[Eden-14] 
PMID 1854644
.

[James_G-15] 
ISBN 978-90-481-9038-6
.

[Weirdner-16] 
PMID 8975607. Archived
(PDF) from the original on 2011-07-15.

[17] PMID 34721319
.

[18] S2CID 27232975
.

[19] PMID 22267877
.

[HC_Jiang-20] PMID 16751487
.

[21] PMID 20923781
.

[22] PMID 10383862
.

[23] PMID 15489351
.

[24] PMID 19395563
.

[25] PMID 9542103
.

[26] PMID 7520119
.

[27] PMID 8899989
.

[pmid25226019-28] PMID 25226019
.

[pmid6462918-29] PMID 6462918
.

[:0-30] 
PMID 27000765
.

[31] PMID 21460107
.

[:1-32] 
PMID 27688981
.

[pmid8811093-33] PMID 8811093
.

[:2-34] 
PMID 23460914
.

[:3-35] 
PMID 17391789
.

[:4-36] 
PMID 27148170
.

[37] PMID 23112186
.

[38] PMID 28855596
.

[39] PMID 31375780
.

[40] PMID 31375780
.

[pmid25118885-41] S2CID 21895693
.

[42] "MIMt - (Mass Identification of Metagenomics tests)". mimt.bu.biopolis.pt. Retrieved 11 February 2024.

[43] Yoon, S. H., Ha, S. M., Kwon, S., Lim, J., Kim, Y., Seo, H. and Chun, J. (2017). Introducing EzBioCloud: A taxonomically united database of 16S rRNA and whole genome assemblies. Int J Syst Evol Microbiol. 67:1613–1617

[44] Larsen N, Olsen GJ, Maidak BL, McCaughey MJ, Overbeek R, Macke TJ, Marsh TL, Woese CR. (1993) The ribosomal database project. Nucleic Acids Res. Jul 1;21(13):3021-3.

[45] PMID 26450747
.

[46] Elmar Pruesse, Christian Quast, Katrin Knittel, Bernhard M. Fuchs, Wolfgang Ludwig, Jörg Peplies, Frank Oliver Glöckner (2007) Nucleic Acids Res. SILVA: a comprehensive online resource for quality checked and aligned ribosomal RNA sequence data compatible with ARB. December; 35(21): 7188–7196.

[47] PMID 16820507
.

[48] PMID 22134646
.

[1]

[4]

[5]

[6]

[10]

[11]

[12]

[13]

[14]

[15]

[16]

[17]

[18]

[19]

[20]

[21]

[22]

[23]

[24]

[7]

[25]

[26]

[27]

[28]

[29]

[30]

[31]

[32]

[35]

[36]

[34]

[8]

[37]

[38]

[39]

[40]

[42]

[43]

[44]

[45]

[46]

[48]