Want to read Slashdot from your mobile device? Point it at m.slashdot.org and keep reading!

 



Forgot your password?
typodupeerror
×
Cloud Data Storage Databases Digital Microsoft Windows News Technology Hardware Science

Microsoft Buys Into DNA Data Storage (ieee.org) 81

the_newsbeagle writes: More than 2.5 exabytes of data is created every day, and some experts estimate that 90% of all data in the world today was created in the last two years. Clearly, storing all this data is becoming an issue. One idea is DNA data storage, in which digital files are converted into the genetic code of four nucleotides (As, Cs, Gs, and Ts). Microsoft just announced that it's testing out this idea, getting synthetic bio company Twist Bioscience to produce 10 million strands of DNA that encode some mystery file the company provided. Using DNA for long-term data storage is attractive because it's durable and efficient. For example, scientists can read the genome from a woolly mammoth hair dating from 20,000 years ago.
This discussion has been archived. No new comments can be posted.

Microsoft Buys Into DNA Data Storage

Comments Filter:
  • by Anonymous Coward

    Or an infectious virus?

    • Don't "run" it? If ribosomes can even read it and successfully create proteins from the code that is. I think you need specific tags to tell them where to start and stop anyhow, so if those aren't in the DNA nothing happens.
  • DNA isn't durable, it is duplicated on a massive scale. This why it is possible to read DNA from a mammoth hair - originally there were millions of copies in that hair, couple of these copies survived this long.
    • If you read the article, it appears they propose preserving the DNA strands artificially.

      The long-term stability of data encoded in DNA was reported in February 2015, in an article by researches from ETH Zurich. By adding redundancy via Reed–Solomon error correction coding and by encapsulating the DNA within silica glass spheres via Sol-gel chemistry, the researchers predict error-free information recovery after up to 1 million years at -18 C and 2000 years if stored at 10 C.

      Other than a having certain coolness factor in using nature's own data encoding scheme, it seems like it would make a lot more sense to etch data into crystals or glass using lasers, or other such solid state data storage that's currently being researched - essentially bypassing the "natural" encoding and jumping straight to their proposed long-term storage medium as the storage method itself. But what the hell do I kn

      • it seems like it would make a lot more sense to etch data into crystals or glass using lasers, or other such solid state data storage that's currently being researched

        The DNA can store millions of times more data per unit volume. Each nucleotide (2 bits) is 0.33 nm, and they can be packed in 3D structures. Laser etching on sapphire is dozens of nm wide, and is inherently 2D.

        • True, but on the other hand, I would think etching technology certainly has the potential of becoming more efficient than it is now, reducing that current advantage. I recall a story here a while back about some new "5-dimensional" etching techniques (three spacial dimensions plus two additional properties per point) that could show promise in the future regarding improved density:

          http://www.gizmag.com/superman... [gizmag.com]

      • I was just thinking of a whole new definition for BSOD.
    • We don't actually read the DNA like you think either from that mammoth hair. All we get are DNA snipets. We then use computers to look at all those and attempts to put them back into some kind of order that seems to make sense based on all the other DNA snipets we have seen before. In other words, it all works because we have this huge catalogue of DNA that we have looked at previously. That all falls apart once you have complete randomness. You would never be able to tell what piece comes next.
      • That all falls apart once you have complete randomness. You would never be able to tell what piece comes next.

        An obvious solution would be to use standard "start" and "stop" codons, and encode the track ID at the beginning of each DNA strand. So the data could be random, but the meta-data would not be random.

    • Just put the code on bit torrent, there will be millions of copies in less than a day!

  • How long (Score:5, Funny)

    by BarbaraHudson ( 3785311 ) <barbara.jane.hud ... minus physicist> on Wednesday April 27, 2016 @08:14PM (#52001779) Journal
    How long until they start checking people's DNA and say "we have data that looks like a section of your DNA. We have copyrighted it, and you can no longer reproduce it - not in offspring, and not in your own cells. You can either stop (by killing yourself) or take a monthly subscription to license it. Have a nice life or drop dead - your call."
    • More Important Worry (Score:4, Interesting)

      by Roger W Moore ( 538166 ) on Wednesday April 27, 2016 @09:18PM (#52002043) Journal
      Actually what I would be more worried about is how long will it be before someone's computer file turns out encode into a real virus and we have some new, nasty disease on our hands simply because some holiday photo produces the right DNA sequence for a new variant of Ebola.
      • Actually what I would be more worried about is how long will it be before someone's computer file turns out encode into a real virus and we have some new, nasty disease on our hands simply because some holiday photo produces the right DNA sequence for a new variant of Ebola.

        You should find something new to worry about. You could run every computer till the heat death of the universe, and it is unlikely any of them would just randomly produce a sequence for a viable pathogen.

    • by Afty0r ( 263037 )
      Yeah errrr.... never because that's not remotely realistic. Calm down.
    • Digital RNA Management: Only authorized copies permitted.

  • by Anonymous Coward

    DNA storage is easily damaged by viruses and bacteria. This won't work. Even the mammoth hair is contaminated by viruses and bacteria so the genome really isn't intact.

  • by Edis Krad ( 1003934 ) on Wednesday April 27, 2016 @08:24PM (#52001821)
    Symantec will go into the pharmaceutical business? ;)
  • by Doubting Sapien ( 2448658 ) on Wednesday April 27, 2016 @08:31PM (#52001861)
    Ancient mammoth DNA didn't persevere in casual ambient conditions. They were only able to retrieve genetic material because the animal's corpse had been preserved by permafrost. DNA storage of actual data would require cooling solutions an order of magnitude more intense than what is currently used to keep a data center running. Most people don't realize how much nucleic acid digesting enzymes are in our normal environment. A great deal of the sticky slimy residue generated copiously by our bodies are the chewed up DNA remnants of microbial organisms that our immune system keeps in check. This is to say nothing of the difficulty involved with reading/writing of said data. You DON'T want to go down that rabbit hole.
    • by Anonymous Coward

      This. If this is Twist's business model for increasing the market volume for DNA synthesis, they are in big trouble. They are already being sued by Agilent for the sketchy way in which the Twist was founded (by a few former Agilent employees). And getting from $0.10 to $0.02 / bp will be difficult while maintaining an OK profit margin, particularly when they will have competitors (Gen9, GenScript, GeneWiz, etc). Synthesized DNA has become a commodity. Selling a commodity is absolutely no fun.

    • by Zak3056 ( 69287 )

      So, the article claims 2000 year data life at 10C, and 1,000,000 year data life at -18C. That doesn't exactly sound like something that requires "cooling solutions and order of magnitude more intense that what is currently used to keep a data center running" especially since those temperatures can be localized to the storage device, rather than the general environment.

    • by chihowa ( 366380 )

      DNA isn't chewed up by enzymes that are commonly found in the environment. You're thinking of RNA. The information stored in DNA is mainly destroyed by UV or other ionizing radiation.

      DNA is extremely stable in the environment at room temperature. It's very common in labs to store DNA at room temperature dried onto filter paper. I have some plasmids on paper that I inherited from my advisor that she inherited from her advisor and they survived just fine. 20k years is a bit extreme, but even just storing your

  • by Anonymous Coward

    how will they get the files to have sex?

    • by MrKaos ( 858439 )

      how will they get the files to have sex?

      By

      fsck

      ing the filesystem. I'm not sure that fsck works on FAT filesystems though, you might have to force it.

    • Put one data sheet on top of another and play some Barry White music?

      Why are you looking at me like that? That's how we were taught in Health Class.

  • by NotQuiteReal ( 608241 ) on Wednesday April 27, 2016 @11:17PM (#52002585) Journal
    "90% of all data in the world today was created in the last two years"

    And most of it will not be readable 100 years from now, nor will it be missed.
  • Now we'll be able to eat the storage medium in order to survive the apocalypse, which wasn't possible with hard drives. Add to this a few corpsicles of frozen rich dead people and you are set for a while.
  • Imagine the latency.
  • One of my recent jobs has been considering the requirements of storing genetic sequences digitally.... I guess now we'll just put the tissue sample in a box.

  • When did DNA become durable? Thats news to me.

    Glass is durable.

    Rock is durable.

    DNA breaks down fairly quickly.

    It may be durable in comparison to your dirt cheap commodity hard drives ... but it also isn't dirt cheap or commodity.

  • Given it's Microsoft, all of our DNA source code will become, proprietary. Shortly after this there will be a GNU licensed version released that's several versions behind and less user friendly.

  • If this goes to production int he future, people will have to fend off their data from 2 types of viruses. Software and physical.

A committee takes root and grows, it flowers, wilts and dies, scattering the seed from which other committees will bloom. -- Parkinson

Working...