DNA binding site

DNA binding sites are a type of
DNA binding sites can thus be defined as short DNA sequences (typically 4 to 30 base pairs long, but up to 200 bp for recombination sites) that are specifically bound by one or more DNA-binding proteins or protein complexes. Some binding sites have been reported to have the potential to undergo rapid evolutionary change.[2]
Types of DNA binding sites
DNA binding sites can be categorized according to their biological function. On this basis, one can distinguish transcription factor-binding sites, restriction sites, and recombination sites. Some authors have proposed that binding sites could also be classified according to their most convenient mode of representation.
History and main experimental techniques
The existence of something akin to DNA binding sites was suspected from the experiments on the biology of the
Databases
Due to the diverse nature of the experimental techniques used in determining binding sites and to the patchy coverage of most organisms and transcription factors, there is no central database (akin to
There are, however, several private and public databases that compile experimentally reported, and sometimes computationally predicted, binding sites for different transcription factors in different organisms. Below is a non-exhaustive table of available databases:
Name | Organisms | Source | Access | URL |
---|---|---|---|---|
PlantRegMap | 165 plant species (e.g., Arabidopsis thaliana, Oryza sativa, Zea mays, etc.) | Expert curation and projection | Public | [1] |
JASPAR | Vertebrates, Plants, Fungi, Flies, and Worms | Expert curation with literature support | Public | [2] |
CIS-BP | All Eukaryotes | Experimentally derived motifs and predictions | Public | [3] |
CollecTF | Prokaryotes | Literature curation | Public | [4] |
RegPrecise | Prokaryotes | Expert curation | Public | [5] |
RegTransBase | Prokaryotes | Expert/literature curation | Public | [6] |
RegulonDB | Escherichia coli | Expert curation | Public | [7] Archived 2017-05-07 at the Wayback Machine |
PRODORIC | Prokaryotes | Expert curation | Public | [8] Archived 2007-05-16 at the Wayback Machine |
TRANSFAC | Mammals | Expert/literature curation | Public/Private | [9] Archived 2008-10-23 at the Wayback Machine |
TRED | Human, Mouse, Rat | Computer predictions, manual curation | Public | [10] |
DBSD | Drosophila species | Literature/Expert curation | Public | [11] |
HOCOMOCO | Human, Mouse | Literature/Expert curation | Public | [12],[13] |
MethMotif | Human, Mouse | Expert curation | Public | [14] Archived 2019-10-29 at the Wayback Machine |
Representation of DNA binding sites
A collection of DNA binding sites, typically referred to as a DNA binding motif, can be represented by a
Position | 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 | 9 | 10 | 11 | 12 | 13 | 14 | 15 | 16 |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
A | 1 | 0 | 1 | 5 | 32 | 5 | 35 | 23 | 34 | 14 | 43 | 13 | 34 | 4 | 52 | 3 |
C | 50 | 1 | 0 | 1 | 5 | 6 | 0 | 4 | 4 | 13 | 3 | 8 | 17 | 51 | 2 | 0 |
G | 0 | 0 | 54 | 15 | 5 | 5 | 12 | 2 | 7 | 1 | 1 | 3 | 1 | 0 | 1 | 52 |
T | 5 | 55 | 1 | 35 | 14 | 40 | 9 | 27 | 11 | 28 | 9 | 32 | 4 | 1 | 1 | 1 |
Sum | 56 | 56 | 56 | 56 | 56 | 56 | 56 | 56 | 56 | 56 | 56 | 56 | 56 | 56 | 56 | 56 |
PSFM for the transcriptional repressor
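The table above lists raw counts, with every column summing to 56. As a minimal sketch of how such counts might be turned into relative frequencies, a consensus sequence, and a per-position information content, the following Python example can serve as a starting point. The function names are illustrative, and the information content assumes a uniform background of 0.25 per base, which is an assumption and not something stated in the article.

```python
import math

BASES = "ACGT"

# Counts copied from the table above (16 positions, 56 sites per column).
COUNTS = {
    "A": [1, 0, 1, 5, 32, 5, 35, 23, 34, 14, 43, 13, 34, 4, 52, 3],
    "C": [50, 1, 0, 1, 5, 6, 0, 4, 4, 13, 3, 8, 17, 51, 2, 0],
    "G": [0, 0, 54, 15, 5, 5, 12, 2, 7, 1, 1, 3, 1, 0, 1, 52],
    "T": [5, 55, 1, 35, 14, 40, 9, 27, 11, 28, 9, 32, 4, 1, 1, 1],
}


def to_frequencies(counts):
    """Divide each count by its column total to get relative frequencies."""
    length = len(counts["A"])
    totals = [sum(counts[b][i] for b in BASES) for i in range(length)]
    return {b: [counts[b][i] / totals[i] for i in range(length)] for b in BASES}


def consensus(counts):
    """Pick the most frequent base at each position."""
    length = len(counts["A"])
    return "".join(max(BASES, key=lambda b: counts[b][i]) for i in range(length))


def information_content(freqs):
    """Bits per position, assuming a uniform background (maximum is 2 bits)."""
    length = len(freqs["A"])
    result = []
    for i in range(length):
        # Terms with zero frequency contribute nothing to the entropy.
        entropy = -sum(
            freqs[b][i] * math.log2(freqs[b][i]) for b in BASES if freqs[b][i] > 0
        )
        result.append(2.0 - entropy)
    return result


freqs = to_frequencies(COUNTS)
print(consensus(COUNTS))  # CTGTATATATATACAG
print([round(x, 2) for x in information_content(freqs)])
```

Positions dominated by a single base, such as position 2, come out close to 2 bits, while more variable positions in the middle of the motif carry less information.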
Computational search and discovery of binding sites
In
More complex methods for binding site search and motif discovery rely on base stacking and other interactions between DNA bases, but because of the small sample sizes typically available for binding sites in DNA, their potential has not yet been fully realized. An example of such a tool is ULPB.[29]
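For contrast with the interaction-aware methods mentioned above, the sketch below shows a simpler scheme that treats each position as independent: a log-odds scoring matrix is built from counts and slid along a sequence. It is not an implementation of ULPB or of any stacking-based method. The toy count matrix, the pseudocount of 1, the uniform background of 0.25, and the function names are all assumptions made for illustration.

```python
import math

BASES = "ACGT"

# Hypothetical toy counts for a 4 bp motif (9 sites per column).
TOY_COUNTS = {
    "A": [8, 0, 0, 1],
    "C": [0, 9, 1, 0],
    "G": [1, 0, 8, 0],
    "T": [0, 0, 0, 8],
}


def build_pssm(counts, pseudocount=1.0, background=0.25):
    """Log2-odds matrix; the pseudocount avoids log(0) for unseen bases."""
    length = len(counts["A"])
    pssm = []
    for i in range(length):
        total = sum(counts[b][i] for b in BASES) + 4 * pseudocount
        pssm.append(
            {
                b: math.log2(((counts[b][i] + pseudocount) / total) / background)
                for b in BASES
            }
        )
    return pssm


def scan(sequence, pssm):
    """Score every window of motif width; returns (start, score) pairs."""
    width = len(pssm)
    sequence = sequence.upper()
    hits = []
    for start in range(len(sequence) - width + 1):
        window = sequence[start:start + width]
        # Skip windows containing ambiguous symbols such as N.
        if any(ch not in BASES for ch in window):
            continue
        score = sum(pssm[i][ch] for i, ch in enumerate(window))
        hits.append((start, score))
    return hits


pssm = build_pssm(TOY_COUNTS)
hits = scan("TTACGTGG", pssm)
best_start, best_score = max(hits, key=lambda hit: hit[1])
print(best_start, round(best_score, 2))
```

Because the score is a plain sum over positions, this approach cannot capture dependencies between neighbouring bases, which is the limitation the more complex methods try to address.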
See also
- DNA binding protein
- Binding site
- Transcriptional regulation
References
- PMID 15178741.
- S2CID 21535866.
- PMID 10812473.
- PMID 9210460.
- PMID 10781547.
- ISBN 978-0-387-23919-4.
- PMID 14145311.
- S2CID 19804795.
- PMID 4587255.
- S2CID 4204720.
- PMID 1055366.
- PMID 17053094.
- S2CID 42489892.
- "A hot road to new drugs". Phys.org. February 24, 2010.
- PMID 20981028.
- PMID 15130839.
- PMID 11861919.
- PMID 3525846.
- PMID 19210776.
- PMID 7784221.
- PMID 2014171.
- PMID 18566768.
- S2CID 205157795.
- PMID 2919167.
- S2CID 3040614.
- PMID 15728117.
- PMID 20736340.
- PMID 18047721.
- PMID 16477324.
- PMID 20439311.
External links
- ENCODE threads Explorer Transcription factor motifs in Nature
- Manually Curated TF Binding Motifs for 157 plant species Archived 2016-10-19 at the Wayback Machine