An effective pre-processing phase for gene expression classification

Choon Sen Seah, Shahreen Kasim, Mohd Farhan Md Fudzee, Mohd Saberi Mohamad, Rd Rohmat Saedudin, Rohayanti Hassan, Mohd Arfian Ismail, Rodziah Atan

Research output: Contribution to journal › Article › peer-review

6 Citations (Scopus)


A raw dataset prepared by researchers contains a large amount of information. Whether that information is useful depends entirely on the requirements and purpose of the study. In machine learning, data pre-processing is the initial stage, and it must ensure that the dataset is suitable for the task at hand. In significant directed random walk (sDRW), the data pre-processing stage has three steps. First, unwanted attributes and missing values are removed and the data are properly arranged; second, the expression values are normalized; lastly, a filtering method is applied. The first two steps are carried out with the Bioconductor package, while the last step is performed within sDRW.
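
The three steps named in the abstract can be illustrated with a minimal sketch. It is not the authors' Bioconductor/sDRW pipeline: it uses plain Python, and the abstract does not say which normalization or filter is applied, so the per-gene z-score and the Welch t-statistic ranking below are assumptions chosen for illustration. The function names (`clean`, `normalize`, `filter_genes`), the toy gene and sample IDs, and the `top_k` parameter are all hypothetical.

```python
import math
from statistics import mean, stdev


def clean(rows, sample_ids):
    """Step 1: keep only expression columns, drop genes with missing
    values, and arrange the result by gene ID."""
    cleaned = {}
    for row in rows:
        values = [row.get(s) for s in sample_ids]
        if any(v is None for v in values):
            continue  # gene has at least one missing measurement
        cleaned[row["gene"]] = [float(v) for v in values]
    return dict(sorted(cleaned.items()))


def normalize(expr):
    """Step 2 (assumed method): z-score each gene across samples."""
    out = {}
    for gene, values in expr.items():
        mu, sd = mean(values), stdev(values)
        out[gene] = [(v - mu) / sd if sd > 0 else 0.0 for v in values]
    return out


def welch_t(a, b):
    """Welch t-statistic between two groups of at least two values each."""
    var_a = stdev(a) ** 2 / len(a)
    var_b = stdev(b) ** 2 / len(b)
    denom = math.sqrt(var_a + var_b)
    return (mean(a) - mean(b)) / denom if denom > 0 else 0.0


def filter_genes(expr, labels, top_k):
    """Step 3 (assumed method): keep the top_k genes ranked by the
    absolute t-statistic between class 1 and class 0 samples."""
    scores = {}
    for gene, values in expr.items():
        case = [v for v, y in zip(values, labels) if y == 1]
        ctrl = [v for v, y in zip(values, labels) if y == 0]
        scores[gene] = abs(welch_t(case, ctrl))
    keep = sorted(scores, key=scores.get, reverse=True)[:top_k]
    return {g: expr[g] for g in sorted(keep)}


# Toy data: "description" is an unwanted attribute, G1 has a missing value.
rows = [
    {"gene": "G2", "description": "x", "s1": 1.0, "s2": 1.2, "s3": 5.0, "s4": 5.4},
    {"gene": "G1", "description": "y", "s1": 2.0, "s2": None, "s3": 2.1, "s4": 2.0},
    {"gene": "G3", "description": "z", "s1": 3.0, "s2": 3.4, "s3": 3.1, "s4": 3.3},
]
sample_ids = ["s1", "s2", "s3", "s4"]
labels = [0, 0, 1, 1]  # 0 = control, 1 = case

cleaned = clean(rows, sample_ids)
normalized = normalize(cleaned)
selected = filter_genes(normalized, labels, top_k=1)
print(list(selected))
```

In this toy run, G1 is dropped for its missing value, and G2 is retained by the filter because its expression differs most between the two classes. In practice the first two steps would be handled by the Bioconductor tooling the abstract mentions.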

Original language: English
Pages (from-to): 1223-1227
Number of pages: 5
Journal: Indonesian Journal of Electrical Engineering and Computer Science
Issue number: 3
Publication status: Published - 2018
Externally published: Yes


Keywords

  • Bioconductor
  • Data pre-processing
  • Gene expression dataset
  • Significant directed random walk

ASJC Scopus subject areas

  • Signal Processing
  • Information Systems
  • Hardware and Architecture
  • Computer Networks and Communications
  • Control and Optimization
  • Electrical and Electronic Engineering

