TY - JOUR
T1 - Identification of informative genes and pathways using an improved penalized support vector machine with a weighting scheme
AU - Chan, Weng Howe
AU - Mohamad, Mohd Saberi
AU - Deris, Safaai
AU - Zaki, Nazar
AU - Kasim, Shahreen
AU - Omatu, Sigeru
AU - Corchado, Juan Manuel
AU - Al Ashwal, Hany
N1 - Publisher Copyright:
© 2016 Elsevier Ltd
PY - 2016/10/1
Y1 - 2016/10/1
N2 - Incorporation of pathway knowledge into microarray analysis has brought better biological interpretation of the analysis outcome. However, most pathway data are manually curated without specific biological context. Non-informative genes could be included when the pathway data is used for analysis of context specific data like cancer microarray data. Therefore, efficient identification of informative genes is inevitable. Embedded methods like penalized classifiers have been used for microarray analysis due to their embedded gene selection. This paper proposes an improved penalized support vector machine with absolute t-test weighting scheme to identify informative genes and pathways. Experiments are done on four microarray data sets. The results are compared with previous methods using 10-fold cross validation in terms of accuracy, sensitivity, specificity and F-score. Our method shows consistent improvement over the previous methods and biological validation has been done to elucidate the relation of the selected genes and pathway with the phenotype under study.
AB - Incorporation of pathway knowledge into microarray analysis has brought better biological interpretation of the analysis outcome. However, most pathway data are manually curated without specific biological context. Non-informative genes could be included when the pathway data is used for analysis of context specific data like cancer microarray data. Therefore, efficient identification of informative genes is inevitable. Embedded methods like penalized classifiers have been used for microarray analysis due to their embedded gene selection. This paper proposes an improved penalized support vector machine with absolute t-test weighting scheme to identify informative genes and pathways. Experiments are done on four microarray data sets. The results are compared with previous methods using 10-fold cross validation in terms of accuracy, sensitivity, specificity and F-score. Our method shows consistent improvement over the previous methods and biological validation has been done to elucidate the relation of the selected genes and pathway with the phenotype under study.
KW - Artificial intelligence
KW - Bioinformatics
KW - Informative genes
KW - Pathway-based microarray analysis
KW - Penalized support vector machine
KW - Penalty function
KW - Weighting scheme
UR - http://www.scopus.com/inward/record.url?scp=84981329854&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=84981329854&partnerID=8YFLogxK
U2 - 10.1016/j.compbiomed.2016.08.004
DO - 10.1016/j.compbiomed.2016.08.004
M3 - Article
C2 - 27522238
AN - SCOPUS:84981329854
SN - 0010-4825
VL - 77
SP - 102
EP - 115
JO - Computers in Biology and Medicine
JF - Computers in Biology and Medicine
ER -