The position of this chapter at the beginning of this book reflects the belief that an understanding of biological databases is the first step toward being able to perform robust and accurate bioinformatic analyses. The reader is very strongly encouraged to take the time to understand the structure of the data found within these databases, as the basis for finding sequence data of interest and performing the more advanced analyses described in the chapters that follow.
Acknowledgments
The author thanks Rolf Apweiler for the use of material from the third edition of this book.
Internet Resources
Further Reading
1 Bairoch, A. (2000). Serendipity in bioinformatics: the tribulations of a Swiss bioinformatician through exciting times! Bioinformatics. 16: 48–64. A personal narrative conveying the early history of the development of sequence databases and related software tools, events that set the groundwork for the modern bioinformatics landscape.
2 Green, E.D., Rubin, E.M., and Olson, M.V. (2017). The future of DNA sequencing. Nature. 550: 179–181. An insightful perspective regarding the next several decades of the application of DNA sequencing methodologies in novel contexts and the implications of those applications to issues of data storage and data sharing.
3 Rigden, D.J. and Fernández, X.M. (2018). The 2018 Nucleic Acids Research database issue and the online molecular biology database collection. Nucleic Acids Res. 46: D1–D7. The 25th overview of the annual database issue published by Nucleic Acids Research, capturing the wide variety of publicly available bioinformatic databases available to the community. This overview is updated yearly, and the individual papers describing these database resources are freely available through the Nucleic Acids Research web site.
References
1 Apweiler, R. (2001). Functional information in Swiss-Prot: the basis for large-scale characterization of protein sequences. Briefings Bioinf. 2: 9–18.
2 Bairoch, A. (2000). Serendipity in bioinformatics: the tribulations of a Swiss bioinformatician through exciting times! Bioinformatics. 16: 48–64.
3 Baxevanis, A.D. and Bateman, A. (2015). The importance of biological databases in biological discovery. Curr. Protoc. Bioinf. 50: 1.1.1–1.1.8.
4 Benson, D.A., Cavanaugh, M., Clark, K. et al. (2018). GenBank. Nucleic Acids Res. 46: D41–D47.
5 Cook, C.E., Bergman, M.T., Cochrane, G. et al. (2018). The European Bioinformatics Institute in 2017: data coordination and integration. Nucleic Acids Res. 46: D21–D29.
6 Dayhoff, M.O., Eck, R.V., Chang, M.A., and Sochard, M.R. (1965). Atlas of Protein Sequence and Structure. Silver Spring, MD: National Biomedical Research Foundation.
7 Gene