A Survey of Machine Learning Approaches Applied to Gene Expression Analysis for Cancer Prediction

Mahmood Khalsan, Lee Machado, EMAN SALIH AL-SHAMERY, Suraj Ajit, Karen Anthony, Mu Mu, Michael Opoku Agyeman*

*Corresponding author for this work

Research output: Contribution to JournalArticlepeer-review


Machine learning approaches are powerful techniques commonly employed for developing cancer prediction models using associated gene expression and mutation data. Our survey provides a comprehensive review of recent cancer studies that have employed gene expression data from several cancer types (breast, lung, kidney, ovarian, liver, central nervous system and gallbladder) for survival prediction,tumor identification and stratification. We also provide an overview of biomarker studies that are associated with these cancer types. The survey captures multiple aspects of machine learning associated cancer studies,including cancer classification, cancer prediction, identification of biomarker genes, microarray, and RNA-Seq data.We discuss the technical issues with current cancer prediction models and the corresponding measurement tools for determining the activity levels of gene expression between cancerous tissues and noncancerous tissues. Additionally, we investigate how identifying putative biomarker gene expression patterns can aid in predicting future risk of cancer and inform the provision of personalized treatment.
Original languageEnglish
JournalIEEE Access
Early online date18 Mar 2022
Publication statusPublished - 18 Mar 2022


  • Biomarker
  • Cancer prediction
  • Deep learning
  • Feature Selection
  • Machine Learning
  • Microarray
  • RNA-Seq


Dive into the research topics of 'A Survey of Machine Learning Approaches Applied to Gene Expression Analysis for Cancer Prediction'. Together they form a unique fingerprint.

Cite this