DNA for data storage

Researchers have successfully stored information in synthetic DNA and then sequenced the DNA to read the data. Nick Goldman and his colleagues from the European Bioinformatics Institute (EBI) encoded all of Shakespeare's sonnets, an audio clip of Martin Luther King's "I have a dream" speech, Watson and Crick's paper on DNA's structure, a photo of the EBI, and an explanation of their data conversion technique. Last year, Harvard molecular geneticist George Church encoded a book he had written in DNA, but EBI's breakthroughs are in the way the data is encoded and its error-correction. From the abstract of their scientific paper published at Nature:
We encoded computer files totalling 739 kilobytes of hard-disk storage and with an estimated Shannon information10 of 5.2 × 106 bits into a DNA code, synthesized this DNA, sequenced it and reconstructed the original files with 100% accuracy. Theoretical analysis indicates that our DNA-based storage scheme could be scaled far beyond current global information volumes and offers a realistic technology for large-scale, long-term and infrequently accessed digital archiving. In fact, current trends in technological advances are reducing DNA synthesis costs at a pace that should make our scheme cost-effective for sub-50-year archiving within a decade.
"Synthetic double-helix faithfully stores Shakespeare's sonnets" (Thanks, Mike Pescovitz!)

Fossils in storage: How do you sort through the backlog?

Yesterday, I posted about Pegomastax africanus, a parrot-like dinosaur whose fossil was discovered not in a remote waste in some far corner of the world, but in a rock that had sat in storage at Harvard University for 50 years.

In the post, I tried to explain why something like that could happen. The simple fact of the matter: A successful archaeological or paleontological dig will produce far more material than the original scientists have time (or money) to sort through, process, and examine. So lots of stuff ends up sitting in storage.

That led BoingBoing reader Matt Fedorko to some interesting speculation:


"...This seems like a perfect opportunity to exploit 3D scanning technology to put the shapes of fossils, at least, into some kind of digital storage area where other researchers could look at a dig's haul and start to work with them spatially, or beside any of the other data that is collected in the field or logged during the cataloging procedure."

Now, Charles Q. Choi, a journalist who wrote about the discovery of Pegomastax africanus, says that Matt's idea isn't all that far-fetched. In fact, scientists already do something like this with the fossils that do get closely examined.

Read the rest