Trendspotting in the Protein Data Bank
The Protein Data Bank (PDB) was established in 1971 as a repository for the three dimensional structures of biological macromolecules. Since then, more than 85,000 biological macromolecule structures have been determined and made available in the PDB archive. Through analysis of the corpus of data, it is possible to identify trends that can be used to inform us about the future of structural biology and to plan the best ways to improve the management of the ever-growing amount of PDB data. âº The number of entries in the PDB archive is predicted to increase 1.5-fold between now and the end of 2017. âº The structures deposited in the PDB are growing in complexity. âº Three-dimensional electron microscopy is emerging as an important method for determining structures of large macromolecular assemblies. âº Protein-nucleic acid complexes are responsible for strong growth in the number of nucleic acids in the PDB.