Abstract
Recent results of corpus-based linguistics demonstrate that context-appropriate sentences can be generated by a stochastic constraint satisfaction process. Exploiting the similarity of constraint satisfaction and DNA self-assembly, we explore a DNA assembly model of sentence generation. The words and phrases in a language corpus are encoded as DNA molecules to build a language model of the corpus. Given a seed word, the new sentences are constructed by a parallel DNA assembly process based on the probability distribution of the word and phrase molecules. Here, we present our DNA code word design and report on successful demonstration of their feasibility in wet DNA experiments of a small scale.
Original language | English |
---|---|
Pages (from-to) | 51-56 |
Number of pages | 6 |
Journal | BioSystems |
Volume | 106 |
Issue number | 1 |
DOIs | |
State | Published - Oct 2011 |
Bibliographical note
Funding Information:This work was supported in part by the Ministry of Education, Science, and Technology through NRF (KRF-2008-314-D00377, 2010-0017734, 0421-20110032, 2010K001137, 2010-0020821, 2011-0000331, 2011-0001643), the Ministry of Knowledge and Economy through KEIT (IITA-2009-A1100-0901-1639), the BK21-IT Program, and the Korea Student Aid Foundation (KOSAF) (No. S2-2009-000-01116-1).
Keywords
- Constraint satisfaction
- DNA language model
- DNA self-assembly
- Sentence generation