Dystrophin

You don't need to be Editor-In-Chief to add or edit content to WikiDoc. You can begin to add to or edit text on this WikiDoc page by clicking on the edit button at the top of this page. Next enter or edit the information that you would like to appear here. Once you are done editing, scroll down and click the Save page button at the bottom of the page.

Jump to: navigation, search


Dystrophin (muscular dystrophy, Duchenne and Becker types)
Image:PBB Protein DMD image.jpg
PDB rendering based on 1dxx.
Available structures:

The Protein Data Bank (PDB) is a repository for 3-D structural data of proteins and nucleic acids. These data, typically obtained by X-ray crystallography or NMR spectroscopy, are submitted by biologists and biochemists from around the world, are released into the public domain, and can be accessed for free.

Contents

History

Founded in 1971 by Drs. Edgar Meyer and Walter Hamilton Brookhaven National Laboratory, management of the Protein Data Bank was transferred in 1998 to members of the Research Collaboratory for Structural Bioinformatics (RCSB).

The Worldwide Protein Data Bank (wwPDB) consists of organizations that act as deposition, data processing and distribution centers for PDB data. The founding members are RCSB PDB (USA), MSD-EBI (Europe) and PDBj (Japan). The BMRB (USA) group joined the wwPDB in 2006. The mission of the wwPDB is to maintain a single Protein Data Bank Archive of macromolecular structural data that is freely and publicly available to the global community.

The PDB is a key resource in structural biology and is critical to more recent work in structural genomics.

Countless derived databases and projects have been developed to integrate and classify the PDB in terms of protein structure, protein function and protein evolution.

Growth

When the PDB was originally founded it contained just 7 protein structures. Since then it has undergone an approximate exponential growth in the number of structures, which does not show any sign of falling off.

The growth rate of the PDB has been the subject of fairly extensive analysis.

Contents

As of 26 September, 2006, the database contained 39,051 released atomic coordinate entries (or "structures"), 35,767 of that proteins, the rest being nucleic acids, nucleic acid-protein complexes, and a few other molecules. About 5,000 new structures are released each year. Data are stored in the mmCIF format specifically developed for the purpose.

Note that the database stores information about the exact location of all atoms in a large biomolecule (although, usually without the hydrogen atoms, as their positions are more of a statistical estimate); if one is only interested in sequence data, i.e. the list of amino acids making up a particular protein or the list of nucleotides making up a particular nucleic acid, the much larger databases from Swiss-Prot and the International Nucleotide Sequence Database Collaboration should be used.

Statistics

As of 11 September, 2007, the "PDB Holdings List" at RCSB reported the following statistics:

Proteins Nucleic Acids Protein/NA complexes Other Total
X-ray diffraction 36223 983 1684 24 38914
NMR 5665 781 134 7 6587
Electron microscopy 105 10 38 0 153
Other 80 4 4 2 90
Total 42073 1778 1860 33 45744

Note that theoretical models are no longer accepted in the PDB.

22461 structures in the PDB have a structure factor file. 3138 structures in the PDB have an NMR restraint file.

The current breakdown of holdings is updated weekly.

File format

Through the years the PDB file format has undergone many, many changes and revisions. Its original format was dictated by the width of computer punch cards.

This legacy format has caused many problems with the format, and consequently there are 'clean-up' projects;

The MMDB uses ASN.1 (and an XML conversion of this format). The wwPDB members RCSB PDB, MSD-EBI, and PDBj are working together to make the data uniform across the archive. Some believe this to be desirable; others argue that, without a universal repository of information (i.e., a common dictionary), it is not possible to draw comparisons.

Each structure published in PDB receives a four-character alphanumeric identifier, its PDB ID. This should not be used as an identifier for biomolecules, since often several structures for the same molecule (in different environments or conformations) are contained in PDB with different PDB IDs.

If a biologist submits structure data for a protein or nucleic acid, wwPDB staff reviews and annotates the entry. The data are then automatically checked for plausibility. The source code for this validation software has been released for free. The main data base accepts only experimentally derived structures, and not theoretically predicted ones (see protein structure prediction).

Various funding agencies and scientific journals now require scientists to submit their structure data to PDB.

Viewing the data

The structural data can be used to visualize the biomolecules with appropriate software, such as VMD, RasMol, PyMOL, Jmol, MDL Chime, QuteMol, web browser VRML plugin or any web-based software designed to visualize and analyse the protein structures such as STING. A recent desktop software addition is Sirius. The RCSB PDB website also contains resources for education, structural genomics, and related software.

References

Printed

  • H.M. Berman, K. Henrick, H. Nakamura (2003): Announcing the worldwide Protein Data Bank. Nature Structural Biology 10 (12), p. 980 PMID 14634627.
  • H.M. Berman, J. Westbrook, Z. Feng, G. Gilliland, T.N. Bhat, H. Weissig, I.N. Shindyalov, P.E. Bourne: The Protein Data Bank. Nucleic Acids Research, 28 pp. 235-242 (2000). PMID 10592235
  • Bernstein FC, Koetzle TF, Williams GJ, Meyer Jr EF, Brice MD, Rodgers JR, Kennard O, Shimanouchi T, Tasumi M. The Protein Data Bank: a computer-based archival file for macromolecular structures. J Mol Biol 1977;112:535-542. PMID 875032.
  • E.F. Meyer “The First Years of the Protein Data Bank“, Protein Science 6:1591-1597 (1997)
  • Sussman, JL, Lin, D, Jiang, J, Manning, NO, Prilusky, J, Ritter, O & Abola, EE. Protein data bank (PDB): a database of 3D structural information of biological macromolecules. Acta Cryst 1998; D54:1078-1084. PMID 10089483.

Online

Other external links

Links to enzyme database data

  • [1] The best mapping is provided by Kim Henrick's group at EBI as part of the MSD SIFTS initiative.
  • [2] PDB provide a mapping on their beta site, but it is at the whole PDB level not chain level.
  • [3] Search at BRENDA enzyme database portal.
  • [4] PDBSProtEC:

Molecular graphic visualisation tools

The Protein Data Bank (PDB) is a repository for 3-D structural data of proteins and nucleic acids. These data, typically obtained by X-ray crystallography or NMR spectroscopy, are submitted by biologists and biochemists from around the world, are released into the public domain, and can be accessed for free.

History

Founded in 1971 by Drs. Edgar Meyer and Walter Hamilton Brookhaven National Laboratory, management of the Protein Data Bank was transferred in 1998 to members of the Research Collaboratory for Structural Bioinformatics (RCSB).

The Worldwide Protein Data Bank (wwPDB) consists of organizations that act as deposition, data processing and distribution centers for PDB data. The founding members are RCSB PDB (USA), MSD-EBI (Europe) and PDBj (Japan). The BMRB (USA) group joined the wwPDB in 2006. The mission of the wwPDB is to maintain a single Protein Data Bank Archive of macromolecular structural data that is freely and publicly available to the global community.

The PDB is a key resource in structural biology and is critical to more recent work in structural genomics.

Countless derived databases and projects have been developed to integrate and classify the PDB in terms of protein structure, protein function and protein evolution.

Growth

When the PDB was originally founded it contained just 7 protein structures. Since then it has undergone an approximate exponential growth in the number of structures, which does not show any sign of falling off.

The growth rate of the PDB has been the subject of fairly extensive analysis.

Contents

As of 26 September, 2006, the database contained 39,051 released atomic coordinate entries (or "structures"), 35,767 of that proteins, the rest being nucleic acids, nucleic acid-protein complexes, and a few other molecules. About 5,000 new structures are released each year. Data are stored in the mmCIF format specifically developed for the purpose.

Note that the database stores information about the exact location of all atoms in a large biomolecule (although, usually without the hydrogen atoms, as their positions are more of a statistical estimate); if one is only interested in sequence data, i.e. the list of amino acids making up a particular protein or the list of nucleotides making up a particular nucleic acid, the much larger databases from Swiss-Prot and the International Nucleotide Sequence Database Collaboration should be used.

Statistics

As of 11 September, 2007, the "PDB Holdings List" at RCSB reported the following statistics:

Proteins Nucleic Acids Protein/NA complexes Other Total
X-ray diffraction 36223 983 1684 24 38914
NMR 5665 781 134 7 6587
Electron microscopy 105 10 38 0 153
Other 80 4 4 2 90
Total 42073 1778 1860 33 45744

Note that theoretical models are no longer accepted in the PDB.

22461 structures in the PDB have a structure factor file. 3138 structures in the PDB have an NMR restraint file.

The current breakdown of holdings is updated weekly.

File format

Through the years the PDB file format has undergone many, many changes and revisions. Its original format was dictated by the width of computer punch cards.

This legacy format has caused many problems with the format, and consequently there are 'clean-up' projects;

The MMDB uses ASN.1 (and an XML conversion of this format). The wwPDB members RCSB PDB, MSD-EBI, and PDBj are working together to make the data uniform across the archive. Some believe this to be desirable; others argue that, without a universal repository of information (i.e., a common dictionary), it is not possible to draw comparisons.

Each structure published in PDB receives a four-character alphanumeric identifier, its PDB ID. This should not be used as an identifier for biomolecules, since often several structures for the same molecule (in different environments or conformations) are contained in PDB with different PDB IDs.

If a biologist submits structure data for a protein or nucleic acid, wwPDB staff reviews and annotates the entry. The data are then automatically checked for plausibility. The source code for this validation software has been released for free. The main data base accepts only experimentally derived structures, and not theoretically predicted ones (see protein structure prediction).

Various funding agencies and scientific journals now require scientists to submit their structure data to PDB.

Viewing the data

The structural data can be used to visualize the biomolecules with appropriate software, such as VMD, RasMol, PyMOL, Jmol, MDL Chime, QuteMol, web browser VRML plugin or any web-based software designed to visualize and analyse the protein structures such as STING. A recent desktop software addition is Sirius. The RCSB PDB website also contains resources for education, structural genomics, and related software.

References

Printed

  • H.M. Berman, K. Henrick, H. Nakamura (2003): Announcing the worldwide Protein Data Bank. Nature Structural Biology 10 (12), p. 980 PMID 14634627.
  • H.M. Berman, J. Westbrook, Z. Feng, G. Gilliland, T.N. Bhat, H. Weissig, I.N. Shindyalov, P.E. Bourne: The Protein Data Bank. Nucleic Acids Research, 28 pp. 235-242 (2000). PMID 10592235
  • Bernstein FC, Koetzle TF, Williams GJ, Meyer Jr EF, Brice MD, Rodgers JR, Kennard O, Shimanouchi T, Tasumi M. The Protein Data Bank: a computer-based archival file for macromolecular structures. J Mol Biol 1977;112:535-542. PMID 875032.
  • E.F. Meyer “The First Years of the Protein Data Bank“, Protein Science 6:1591-1597 (1997)
  • Sussman, JL, Lin, D, Jiang, J, Manning, NO, Prilusky, J, Ritter, O & Abola, EE. Protein data bank (PDB): a database of 3D structural information of biological macromolecules. Acta Cryst 1998; D54:1078-1084. PMID 10089483.

Online

Other external links

Links to enzyme database data

  • [5] The best mapping is provided by Kim Henrick's group at EBI as part of the MSD SIFTS initiative.
  • [6] PDB provide a mapping on their beta site, but it is at the whole PDB level not chain level.
  • [7] Search at BRENDA enzyme database portal.
  • [8] PDBSProtEC:

Molecular graphic visualisation tools

The Protein Data Bank (PDB) is a repository for 3-D structural data of proteins and nucleic acids. These data, typically obtained by X-ray crystallography or NMR spectroscopy, are submitted by biologists and biochemists from around the world, are released into the public domain, and can be accessed for free.

History

Founded in 1971 by Drs. Edgar Meyer and Walter Hamilton Brookhaven National Laboratory, management of the Protein Data Bank was transferred in 1998 to members of the Research Collaboratory for Structural Bioinformatics (RCSB).

The Worldwide Protein Data Bank (wwPDB) consists of organizations that act as deposition, data processing and distribution centers for PDB data. The founding members are RCSB PDB (USA), MSD-EBI (Europe) and PDBj (Japan). The BMRB (USA) group joined the wwPDB in 2006. The mission of the wwPDB is to maintain a single Protein Data Bank Archive of macromolecular structural data that is freely and publicly available to the global community.

The PDB is a key resource in structural biology and is critical to more recent work in structural genomics.

Countless derived databases and projects have been developed to integrate and classify the PDB in terms of protein structure, protein function and protein evolution.

Growth

When the PDB was originally founded it contained just 7 protein structures. Since then it has undergone an approximate exponential growth in the number of structures, which does not show any sign of falling off.

The growth rate of the PDB has been the subject of fairly extensive analysis.

Contents

As of 26 September, 2006, the database contained 39,051 released atomic coordinate entries (or "structures"), 35,767 of that proteins, the rest being nucleic acids, nucleic acid-protein complexes, and a few other molecules. About 5,000 new structures are released each year. Data are stored in the mmCIF format specifically developed for the purpose.

Note that the database stores information about the exact location of all atoms in a large biomolecule (although, usually without the hydrogen atoms, as their positions are more of a statistical estimate); if one is only interested in sequence data, i.e. the list of amino acids making up a particular protein or the list of nucleotides making up a particular nucleic acid, the much larger databases from Swiss-Prot and the International Nucleotide Sequence Database Collaboration should be used.

Statistics

As of 11 September, 2007, the "PDB Holdings List" at RCSB reported the following statistics:

Proteins Nucleic Acids Protein/NA complexes Other Total
X-ray diffraction 36223 983 1684 24 38914
NMR 5665 781 134 7 6587
Electron microscopy 105 10 38 0 153
Other 80 4 4 2 90
Total 42073 1778 1860 33 45744

Note that theoretical models are no longer accepted in the PDB.

22461 structures in the PDB have a structure factor file. 3138 structures in the PDB have an NMR restraint file.

The current breakdown of holdings is updated weekly.

File format

Through the years the PDB file format has undergone many, many changes and revisions. Its original format was dictated by the width of computer punch cards.

This legacy format has caused many problems with the format, and consequently there are 'clean-up' projects;

The MMDB uses ASN.1 (and an XML conversion of this format). The wwPDB members RCSB PDB, MSD-EBI, and PDBj are working together to make the data uniform across the archive. Some believe this to be desirable; others argue that, without a universal repository of information (i.e., a common dictionary), it is not possible to draw comparisons.

Each structure published in PDB receives a four-character alphanumeric identifier, its PDB ID. This should not be used as an identifier for biomolecules, since often several structures for the same molecule (in different environments or conformations) are contained in PDB with different PDB IDs.

If a biologist submits structure data for a protein or nucleic acid, wwPDB staff reviews and annotates the entry. The data are then automatically checked for plausibility. The source code for this validation software has been released for free. The main data base accepts only experimentally derived structures, and not theoretically predicted ones (see protein structure prediction).

Various funding agencies and scientific journals now require scientists to submit their structure data to PDB.

Viewing the data

The structural data can be used to visualize the biomolecules with appropriate software, such as VMD, RasMol, PyMOL, Jmol, MDL Chime, QuteMol, web browser VRML plugin or any web-based software designed to visualize and analyse the protein structures such as STING. A recent desktop software addition is Sirius. The RCSB PDB website also contains resources for education, structural genomics, and related software.

References

Printed

  • H.M. Berman, K. Henrick, H. Nakamura (2003): Announcing the worldwide Protein Data Bank. Nature Structural Biology 10 (12), p. 980 PMID 14634627.
  • H.M. Berman, J. Westbrook, Z. Feng, G. Gilliland, T.N. Bhat, H. Weissig, I.N. Shindyalov, P.E. Bourne: The Protein Data Bank. Nucleic Acids Research, 28 pp. 235-242 (2000). PMID 10592235
  • Bernstein FC, Koetzle TF, Williams GJ, Meyer Jr EF, Brice MD, Rodgers JR, Kennard O, Shimanouchi T, Tasumi M. The Protein Data Bank: a computer-based archival file for macromolecular structures. J Mol Biol 1977;112:535-542. PMID 875032.
  • E.F. Meyer “The First Years of the Protein Data Bank“, Protein Science 6:1591-1597 (1997)
  • Sussman, JL, Lin, D, Jiang, J, Manning, NO, Prilusky, J, Ritter, O & Abola, EE. Protein data bank (PDB): a database of 3D structural information of biological macromolecules. Acta Cryst 1998; D54:1078-1084. PMID 10089483.

Online

Other external links

Links to enzyme database data

  • [9] The best mapping is provided by Kim Henrick's group at EBI as part of the MSD SIFTS initiative.
  • [10] PDB provide a mapping on their beta site, but it is at the whole PDB level not chain level.
  • [11] Search at BRENDA enzyme database portal.
  • [12] PDBSProtEC:

Molecular graphic visualisation tools

Identifiers
Symbol(s) DMD; BMD; CMD3B; DXS142; DXS164; DXS206; DXS230; DXS239; DXS268; DXS269; DXS270; DXS272
External IDs OMIM: 300377 MGI94909 Homologene20856
RNA expression pattern

Image:PBB GE DMD 203881 s at tn.png

Image:PBB GE DMD 208086 s at tn.png

More reference expression data

Orthologs
Human Mouse
Entrez 1756 13405
Ensembl ENSG00000198947 ENSMUSG00000045103
Uniprot P11532 Q3TWL4
Refseq NM_000109 (mRNA)
NP_000100 (protein)
NM_007868 (mRNA)
NP_031894 (protein)
Location Chr X: 31.05 - 33.27 Mb Chr X: 79.39 - 81.45 Mb
Pubmed search [13] [14]

Dystrophin is a rod-shaped cytoplasmic protein, and a vital part of a protein complex that connects the cytoskeleton of a muscle fiber to the surrounding extracellular matrix through the cell membrane. This complex is variously known as the costamere or the dystrophin-associated protein complex. Many muscle proteins, such as α-dystrobrevin, syncoilin, synemin, sarcoglycan, dystroglycan, and sarcospan, colocalize with dystrophin at the costamere.

As of 2007 dystrophin has the longest gene known, at locus Xp21. The primary transcript measures 2.4 megabases (thus the gene comprises 0.008% of the human genome), and takes 16 hours to transcribe. The 79 exons[1] code for a protein of over 3500 amino acid residues.

Pathology

Its deficiency is one of the root causes of muscular dystrophy. It was first identified in 1987 by Louis M. Kunkel [1], after the 1986 discovery of the mutated gene that causes Duchenne muscular dystrophy (DMD) [1].

Normal tissue contains small amounts of dystrophin (about 0.002% of total muscle protein), but its absence leads to both DMD and fibrosis, a condition of muscle hardening. A different mutation of the same gene causes defective dystrophin, leading to Becker's muscular dystrophy (BMD).

References

Further reading

  • Roberts RG, Gardner RJ, Bobrow M (1994). "Searching for the 1 in 2,400,000: a review of dystrophin gene point mutations.". Hum. Mutat. 4 (1): 1-11. doi:10.1002/humu.1380040102. PMID 7951253.
  • Tinsley JM, Blake DJ, Zuellig RA, Davies KE (1994). "Increasing complexity of the dystrophin-associated protein complex.". Proc. Natl. Acad. Sci. U.S.A. 91 (18): 8307-13. PMID 8078878.
  • Blake DJ, Weir A, Newey SE, Davies KE (2002). "Function and genetics of dystrophin and dystrophin-related proteins in muscle.". Physiol. Rev. 82 (2): 291-329. doi:10.1152/physrev.00028.2001. PMID 11917091.
  • Röper K, Gregory SL, Brown NH (2003). "The 'spectraplakins': cytoskeletal giants with characteristics of both spectrin and plakin families.". J. Cell. Sci. 115 (Pt 22): 4215-25. PMID 12376554.
  • Muntoni F, Torelli S, Ferlini A (2003). "Dystrophin and mutations: one gene, several proteins, multiple phenotypes.". Lancet neurology 2 (12): 731-40. PMID 14636778.
  • Haenggi T, Fritschy JM (2006). "Role of dystrophin and utrophin for assembly and function of the dystrophin glycoprotein complex in non-muscle tissue.". Cell. Mol. Life Sci. 63 (14): 1614-31. doi:10.1007/s00018-005-5461-0. PMID 16710609.

External links

de:Dystrophin fr:Dystrophine it:Distrofina he:דיסטרופין nl:Dystrofinesv:Dystrofin

Personal tools
In other languages