Bacteria Data Storage

|

The idea of storing data in bacteria had occurred about a decade. The consideration, even the simplest bacteria have long strands of DNA that can store data encryption.

In addition, naturally, the bacteria is much more resistant to damage than any electronic storage media. He was able to withstand the various kinds of disasters that can destroy the hard drive.

Natural reproduction of bacteria also can be used to create duplication of data and maintain the integrity of information stored. It also makes data retrieval process can be conducted more easily.

Guided by this thought, a group of researchers from The Chinese University of Hong Kong looking at how to save data into the DNA of bacteria. It was not difficult.

In bacteria, there are four bases of DNA that can be used to create DNA strands namely Adenine (A), Cytosine (C), Guanine (G), and thymine (T). That is, the storage will use the base four numeral system.

In his report, as quoted from the i09, December 9, 2010 researchers gave the example to change the word "iGEM" into the code that is ready to be stored in DNA.

They use an ASCII table to convert each letter into a numeric value for example i = 105, G = 71, and so on. This figure is then converted into base 4 numbering 105 to 1221 ie, 71 to 0113 and beyond.

Figures 4 base is then converted into the DNA system that uses the code A, T, C and G where A replaces the number 0, T in place of 1, C replaces item 2, and G substitute the number 3. So, the word iGEM stored in DNA as ATCTATTGATTTATGT.

After the raw data is ready, researchers say, some algorithms can be used to eliminate repetitive or redundant information. This is not just to save space, the number of repetitions in the DNA strands of DNA and potentially harmful biological bacteria. Means, the use of algorithms that would solve two problems at once.

The problem, DNA strands are not long enough to store complex information such as photos or books. The best solution is to break data into small pieces and spread it on a different cell.

To succeed, researchers create a system that allows the data fragments are identified and then organized into the correct sequence. For that, they make a three-part structure for the entire DNA of the header, message, and checksum.

The header is a series along 8 sections which are divided into four zones namely the information level, region, area, and districts that allows each part is returned into the proper sequence.

After the message that carries the actual data is delivered, the repetition of the header checksum provides a useful starting to control the mutation that might occur in bacteria in question.

Once the information is encrypted and placed in many different cells in the bacteria, how to retrieve the data owner data stored by the bacteria in question?

A Decrypter will take the DNA and run it on a technology called next-generation high-throughput sequencing, or NGS.

This type of sequencing can analyze and compare many copies of the same sequence and use the highest mode to find out where the correct database and data which has undergone a change. After that, the compression algorithm will be reversed to return the raw data into its original shape.

The final step was to reconstruct fragments of data in the correct sequence for the DNA sequence can be translated back into data that can be used.

Until this stage, data is stored and experienced encryption. People who want to read the data require a formula that knowing the correct sequence of headers and checksums. Without this formula, the data that he had not be used.

0 komentar:

Post a Comment

 

©2009 Technology | Template Blue by TNB