TY - GEN
T1 - Investigation of the effects of imputation methods for gene regulatory networks modelling using dynamic bayesian networks
AU - Lim, Sin Yi
AU - Mohamad, Mohd Saberi
AU - Chai, Lian En
AU - Deris, Safaai
AU - Chan, Weng Howe
AU - Omatu, Sigeru
AU - Corchado, Juan Manuel
AU - Sjaugi, Muhammad Farhan
AU - Zainuddin, Muhammad Mahfuz
AU - Rajamohan, Gopinathaan
AU - Ibrahim, Zuwairie
AU - Yusof, Zulkifli Md
N1 - Publisher Copyright:
© Springer International Publishing Switzerland 2016.
PY - 2016
Y1 - 2016
N2 - DNA microarray technology plays an important role in advancing the analysis of gene expression and gene functions. However, gene expression data often contain missing values, which cause problems as most of the analysis methods of gene expression data require a complete matrix. Several missing value imputation methods have been developed to overcome the problems. In this paper, effects of the missing value imputation methods in modeling of gene regulatory network are investigated. Three missing value imputation methods are used, which are k-Nearest Neighbor (kNN), Iterated Local Least Squares (ILLsimpute), and Fixed Rank Approximation Algorithm (FRAA). Dataset used in this paper is E. coli. The results suggest that the performance of each missing value imputation method is influenced by the percentage and distribution of the missing values in the dataset, which subsequently affect the modeling of gene regulatory network using Dynamic Bayesian network.
AB - DNA microarray technology plays an important role in advancing the analysis of gene expression and gene functions. However, gene expression data often contain missing values, which cause problems as most of the analysis methods of gene expression data require a complete matrix. Several missing value imputation methods have been developed to overcome the problems. In this paper, effects of the missing value imputation methods in modeling of gene regulatory network are investigated. Three missing value imputation methods are used, which are k-Nearest Neighbor (kNN), Iterated Local Least Squares (ILLsimpute), and Fixed Rank Approximation Algorithm (FRAA). Dataset used in this paper is E. coli. The results suggest that the performance of each missing value imputation method is influenced by the percentage and distribution of the missing values in the dataset, which subsequently affect the modeling of gene regulatory network using Dynamic Bayesian network.
KW - Artificial intelligence
KW - Bioinformatics
KW - Dynamic bayesian network
KW - Gene expression
KW - Gene expression data
KW - Gene regulatory network
KW - Imputation methods
KW - Missing values
UR - http://www.scopus.com/inward/record.url?scp=84975521332&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=84975521332&partnerID=8YFLogxK
U2 - 10.1007/978-3-319-40162-1_45
DO - 10.1007/978-3-319-40162-1_45
M3 - Conference contribution
AN - SCOPUS:84975521332
SN - 9783319401614
T3 - Advances in Intelligent Systems and Computing
SP - 413
EP - 421
BT - Distributed Computing and Artificial Intelligence, 13th International Conference
A2 - García-García, Julián A.
A2 - Semalat, Ali
A2 - Bocewicz, Grzegorz
A2 - Sitek, Paweł
A2 - Bajo, Javier
A2 - Omatu, Sigeru
A2 - Nielsen, Izabela
PB - Springer Verlag
T2 - 13th International Conference on Distributed Computing and Artificial Intelligence, DCAI 2016
Y2 - 1 June 2016 through 3 June 2016
ER -