TY - GEN
T1 - An augmented block code model for protein translation using free energy based distance decoders
AU - Al Bataineh, Mohammad
AU - Huang, Lun
AU - Acedo, Alicia Fuente
AU - Atkin, Guillermo
AU - Menhart, Nick
PY - 2010
Y1 - 2010
N2 - Informational analysis of genetic sequences has revealed the existence of significant analogies between the genetic process and information processing systems used in the field of communications engineering. By analyzing key elements involved in the process of gene expression, we have developed several communications and coding theory based models for the process of translation [1-5]. A previous research investigated the use of coding theory based models that quantitatively describe the behavior of the ribosome during translation initiation in prokaryotic organisms [1]. In this paper we have investigated an augmented block code model with modified criteria and assumptions. We have also employed several minimum distance decoders to verify the proposed modified model based on the free energies involved in the binding between the ribosome and the mRNA sequence. The key biological elements considered in forming the investigated model are: the last 13 bases of the 3' end of the 16S rRNA molecule, the common features of bacterial ribosomal binding sites (such as the existence and location of the Shine-Dalgarno sequence), the energies involved in the rRNA-mRNA interaction, and RNA/DNA base-pairing principles. The model was tested on five different E. coli bacterial genomes. The obtained results prove the validity and significance of the model in clearly distinguishing four different test groups of gene predictions. Two of them are based on well known gene finder softwares (e.g. GeneMark [2] and Glimmer [3]).
AB - Informational analysis of genetic sequences has revealed the existence of significant analogies between the genetic process and information processing systems used in the field of communications engineering. By analyzing key elements involved in the process of gene expression, we have developed several communications and coding theory based models for the process of translation [1-5]. A previous research investigated the use of coding theory based models that quantitatively describe the behavior of the ribosome during translation initiation in prokaryotic organisms [1]. In this paper we have investigated an augmented block code model with modified criteria and assumptions. We have also employed several minimum distance decoders to verify the proposed modified model based on the free energies involved in the binding between the ribosome and the mRNA sequence. The key biological elements considered in forming the investigated model are: the last 13 bases of the 3' end of the 16S rRNA molecule, the common features of bacterial ribosomal binding sites (such as the existence and location of the Shine-Dalgarno sequence), the energies involved in the rRNA-mRNA interaction, and RNA/DNA base-pairing principles. The model was tested on five different E. coli bacterial genomes. The obtained results prove the validity and significance of the model in clearly distinguishing four different test groups of gene predictions. Two of them are based on well known gene finder softwares (e.g. GeneMark [2] and Glimmer [3]).
KW - Block codes
KW - Coding theory
KW - Communications
KW - Gene expression
KW - Translation initiation
UR - http://www.scopus.com/inward/record.url?scp=78649299583&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=78649299583&partnerID=8YFLogxK
U2 - 10.1109/EIT.2010.5612184
DO - 10.1109/EIT.2010.5612184
M3 - Conference contribution
AN - SCOPUS:78649299583
SN - 9781424468751
T3 - 2010 IEEE International Conference on Electro/Information Technology, EIT2010
BT - 2010 IEEE International Conference on Electro/Information Technology, EIT2010
T2 - 2010 IEEE International Conference on Electro/Information Technology, EIT2010
Y2 - 20 May 2010 through 22 May 2010
ER -