Base pair

You don't need to be Editor-In-Chief to add or edit content to WikiDoc. You can begin to add to or edit text on this WikiDoc page by clicking on the edit button at the top of this page. Next enter or edit the information that you would like to appear here. Once you are done editing, scroll down and click the Save page button at the bottom of the page.

(Redirected from Kilobase)
Jump to: navigation, search

Editor-In-Chief: C. Michael Gibson, M.S., M.D. [1] Phone:617-525-6884

Please Take Over This Page and Apply to be Editor-In-Chief for this topic: There can be one or more than one Editor-In-Chief. You may also apply to be an Associate Editor-In-Chief of one of the subtopics below. Please mail us [2] to indicate your interest in serving either as an Editor-In-Chief of the entire topic or as an Associate Editor-In-Chief for a subtopic. Please be sure to attach your CV and or biographical sketch.

In molecular biology, two nucleotides on opposite complementary DNA or RNA strands that are connected via hydrogen bonds are called a base pair (often abbreviated bp). In the canonical Watson-Crick base pairing, adenine (A) forms a base pair with thymine (T), as does guanine (G) with cytosine (C) in DNA. In RNA, thymine is replaced by uracil (U). Non-Watson-Crick base pairing with alternate hydrogen bonding patterns also occur, especially in RNA; common such patterns are Hoogsteen base pairs. Pairing is also the mechanism by which codons on messenger RNA molecules are recognized by anticodons on transfer RNA during protein translation. Some DNA- or RNA-binding enzymes can recognize specific base pairing patterns that identify particular regulatory regions of genes.

The size of an individual gene or an organism's entire genome is often measured in base pairs because DNA is usually double-stranded. Hence, the number of total base pairs is equal to the number of nucleotides in one of the strands (with the exception of non-coding single-stranded regions of telomeres). The haploid human genome (23 chromosomes) is estimated to be about 3 billion base pairs long and to contain 20,000-25,000 distinct genes.[1]

Examples

The following DNA sequences illustrate pair double-stranded patterns. By convention, the top strand is written from the 5' end to the 3' end; thus the bottom strand is written 3' to 5'.

A base-paired DNA sequence:
ATCGAT
TAGCTA
The corresponding base-paired RNA sequence, in which uracil is substituted for thymine:
AUCGAU
UAGCUA

Length measurements

The following abbreviations are commonly used to describe the length of a DNA/RNA molecule:

  • bp = base pair(s)
  • kb (= kbp) = kilo base pairs = 1,000 bp
  • Mb = mega base pairs = 1,000,000 bp
  • Gb = giga base pairs = 1,000,000,000 bp

In case of single stranded DNA/RNA we talk about nucleotides, abbreviated nt (or knt, Mnt, Gnt), rather than base pairs, as they are not paired. For distinction between units of computer storage and bases kbp, Mbp, Gbp etc may be used for disambiguation.

The Centimorgan is also often used to imply distance along a chromosome, but the number of base-pairs it corresponds to varies widely. In the Human genome, it is about 1 million base pairs[1] [1].

Hydrogen bonding and stability

A GC base pair demonstrating three intermolecular hydrogen bonds
A GC base pair demonstrating three intermolecular hydrogen bonds
An AT base pair demonstrating two intermolecular hydrogen bonds
An AT base pair demonstrating two intermolecular hydrogen bonds

Hydrogen bonding is the chemical mechanism that underlies the base-pairing rules described above. Appropriate geometrical correspondence of hydrogen bond donors and acceptors allows only the "right" pairs to form stably. The GC base pair has three hydrogen bonds, whereas the AT base pair has only two; as a consequence, the GC pair is more stable.

The larger nucleic acids, adenine and guanine, are members of a class of doubly-ringed chemical structures called purines; the smaller nucleic acids, cytosine and thymine (and uracil), are members of a class of singly-ringed chemical structures called pyrimidines. Purines are only complementary with pyrimidines: pyrimidine-pyrimidine pairings are energetically unfavorable because the molecules are too far apart for hydrogen bonding to be established; purine-purine pairings are energetically unfavorable because the molecules are too close, leading to electrostatic repulsion. The only other possible pairings are GT and AC; these pairings are mismatches because the pattern of hydrogen donors and acceptors do not correspond. (It should be noted that the GU pairing, with two hydrogen bonds, does occur fairly often in RNA but rarely in DNA.)

Paired DNA and RNA molecules are comparatively stable at room temperature but the two nucleotide strands will separate above a melting point that is determined by the length of the molecules, the extent of mispairing (if any), and the GC content. Higher GC content results in higher melting temperatures; it is therefore unsurprising that the genomes of extremophile organisms such as Thermus thermophilus are particularly GC-rich. Conversely, regions of a genome that need to separate frequently - for example, the promoter regions for often-transcribed genes - are comparatively GC-poor (for example, see TATA box). GC content and melting temperature must also be taken into account when designing primers for PCR reactions.

Base stacking

Base stacking interactions between the pi orbitals of the bases' aromatic rings also contribute to stability, and again GC stacking interactions with adjacent bases tend to be more favorable. (Note, though, that a GC stacking interaction with the next base pair is geometrically different from a CG interaction.) Base stacking effects are especially important in the secondary structure of RNA; for example, RNA stem-loop structures are stabilized by base stacking in the loop region.

Further information: pi stacking

Base analogs and intercalators

Chemical analogs of nucleotides can take the place of proper nucleotides and establish non-canonical base-pairing, leading to errors (mostly point mutations) in DNA replication and DNA transcription. One common mutagenic base analog is 5-bromouracil, which resembles thymine but can base-pair to guanine in its enol form.

Other chemicals, known as DNA intercalators, fit into the gap between adjacent bases on a single strand and induce frameshift mutations by "masquerading" as a base, causing the DNA replication machinery to skip or insert additional nucleotides at the intercalated site. Most intercalators are large polyaromatic compounds and are known or suspected carcinogens. Examples include ethidium bromide and acridine.

See also

External links

  • DAN - webserver version of the EMBOSS tool for calculating melting temperatures

Cited references


General references

  • Watson JD, Baker TA, Bell SP, Gann A, Levine M, Losick R. (2004). Molecular Biology of the Gene. 5th ed. Pearson Benjamin Cummings: CSHL Press. See esp. ch. 6 and 9.


ca:Parell de bases de:Basenpaarfr:Paire de bases id:Pasangan basa it:Coppia di basi hu:Bázispár nl:Basepaar ja:塩基対simple:Base pair fi:Emäspari sv:Baspar vi:Nguyên tắc bổ sung uk:Пара основ


WikiDoc Help Menu

Quick Start..

Editing basics

Advanced editing

Communicating your edits

Help Videos You Can Watch


Acknowledgement and Attribution Regarding Sources of Content

Some of the initial content on this page may be incorporated in part from copyleft sources in the public domain including wikis such as Wikipedia and AskDrWiki. Drug information for patients came from the The National Library of Medicine. Infectious disease information may have come from the Centers for Disease Control (CDC). Differential Diagnoses are drawn from clinicians as well as an amalgamation of 3 sources: 1.The Disease Database; 2. Kahan, Scott, Smith, Ellen G. In A Page: Signs and Symptoms. Malden, Massachusetts: Blackwell Publishing, 2004:3; 3. Sailer, Christian, Wasner, Susanne. Differential Diagnosis Pocket. Hermosa Beach, CA: Borm Bruckmeir Publishing LLC, 2002:7 .

Personal tools
In other languages