Identification of Transcriptional promoter sequence based on statistical filter bank model

Lun Huang, Mohammad Al Bataineh, Alicia Fuente Acedo, G. E. Atkin, Xiangyu Deng, Wei Zhang

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Abstract

This paper describes a new approach for locating transcription related signals, such as promoter sequence in nucleic acid sequences. Transcription Factor (TF) and corresponding polymerase binding to their DNA target site is a fundamental regulatory interaction. The most common model used to represent TF and polymerase binding specificities is a position weight matrix (PWM) [1], which assumes independence between binding positions. However, in many cases, this simplifying assumption does not hold. In this paper, we present a statistical filter model based on Chi-Square (Χ2) distance [2], which is a statistical distance metric between the profiles of component vectors. It is a novel statistical method for modeling TF-DNA and polymerase-DNA interactions. Our approach also uses a generalized correlation algorithm to evaluate the combination coefficients for the filter bank. Simulation results show that the proposed approach identifies promoter sequences better than the PWM model method and Chi-Square (Χ2) distance model.

Original languageEnglish
Title of host publication2010 IEEE International Conference on Electro/Information Technology, EIT2010
DOIs
Publication statusPublished - 2010
Externally publishedYes
Event2010 IEEE International Conference on Electro/Information Technology, EIT2010 - Normal, IL, United States
Duration: May 20 2010May 22 2010

Publication series

Name2010 IEEE International Conference on Electro/Information Technology, EIT2010

Conference

Conference2010 IEEE International Conference on Electro/Information Technology, EIT2010
Country/TerritoryUnited States
CityNormal, IL
Period5/20/105/22/10

Keywords

  • Chi-square distance
  • Transcription factor binding sites
  • Transcriptional promoter sequence

ASJC Scopus subject areas

  • Computer Networks and Communications
  • Information Systems

Fingerprint

Dive into the research topics of 'Identification of Transcriptional promoter sequence based on statistical filter bank model'. Together they form a unique fingerprint.

Cite this