Happy Birthday, Open Data
The received wisdom is that open source begat open access, which begat open data, and in broad outline that's true enough. But in one respect it's quite wrong: the first, and arguably most important open data store was set up fully 25 years ago, and is still going from strength to strength:For a quarter century, GenBank has helped advance scientific discovery worldwide. The nucleic acid sequence database was established by the National Institutes of Health (NIH) in 1982. Since its creation, the GenBank database has grown at an exponential rate. Amazing as it may seem, in 1984, the entirety of GenBank’s data was published in a two volume hardcover book. Today, if the current contents of GenBank’s database were printed, it would fill more than 300 pickup trucks with paper.
Unveiled at the onset of the “Information Age”, GenBank has continued to evolve and incorporate technological innovations. The GenBank database has remained on the cutting edge of technology and illustrates the dynamic changes over the past 25 years in quantity and speed with which information is shared.
GenBank joined with sequence databases in Europe and Japan to form the International Nucleotide Sequence Database Collaboration. GenBank was one of the earliest bioinformatics community projects on the Internet promoting open access communications among bioscientists. In 1992, the GenBank project transitioned to the newly created National Center for Biotechnology Information (NCBI) within NIH where it resides today.