Machine Learning Approaches for Cancer Detection

Full Text (PDF, 446KB), PP.45-55

Views: 0 Downloads: 0

Author(s)

Ayush Sharma 1 Sudhanshu Kulshrestha 1 Sibi B Daniel 1

1. Department of Computer Science, Jaypee Institute of Information Technology, Noida - 201301, Uttar Pradesh, India

* Corresponding author.

DOI: https://doi.org/10.5815/ijem.2018.02.05

Received: 1 Nov. 2017 / Revised: 22 Dec. 2017 / Accepted: 8 Jan. 2018 / Published: 8 Mar. 2018

Index Terms

Support Vector Machine, Artificial Neural Network, Cancer, Accuracy, Machine Learning

Abstract

Accurate prediction of cancer can play a crucial role in its treatment. The procedure of cancer detection is incumbent upon the doctor, which at times can be subjected to human error and therefore leading to erroneous decisions. Using machine learning techniques for the same can prove to be beneficial. Many classification algorithms such as Support Vector Machines (SVM) and Artificial Neural Networks (ANN) are proven to produce good classification accuracies. The following study models data sets for breast, liver, ovarian and prostate cancer using the aforementioned algorithms and compares them. The study covers data from condition of organs, which is called standard data and from gene expression data as well. This research has shown that SVM classifier can obtain better performance for classification in comparison to the ANN classifier.

Cite This Paper

Ayush Sharma, Sudhanshu Kulshrestha, Sibi B Daniel,"Machine Learning Approaches for Cancer Detection", International Journal of Engineering and Manufacturing(IJEM), Vol.8, No.2, pp.45-55, 2018. DOI: 10.5815/ijem.2018.02.05

Reference

[1]National Cancer Institute (NCI), Cancer – A Comprehensive Guide http://cancer.gov/about-cancer, Accesed on March, 2016.

[2]Konstantina Kourou, Themis P. Exarchos, Michalis V. Karamouzis, Machine learning applications in cancer prognosis and prediction, accessed on August, 2016.Author(s) Profile

[3]Ahrim Youn, Richard Simon, Identifying cancer driver genes in tumour genome sequencing studies, Bioinformatics. 2011 Jan 15; 27(2): 175–181.

[4]UCI Machine Learning Repository of data sets for machine learning, http://archive.ics.uci.edu/ml/, Accessed at July, 2016.

[5]Data Catlog, Nation Cancer Institute R&D Resources, https://www.cancer.gov/research/resources/data-catalog, Accessed July 2016.

[6]Ismail Saritas, Prediction of Breast Cancer using Artificial Neural Networks, Springer Science+Business Media, LLC 2011, Published on 12 August, 2011.

[7]AS Ren J. 2012. ANN vs. SVM: Which One Performs Better in Classification of MCCs in Mammogram Imaging. Knowledge-Based Systems. 26: 144–153. 10 (1), January 2012.

[8]I.S. Subashini T. S., V. Ramalingam, and S. Palanivel. 2009. Breast Mass Classification Based on Cytological Patterns using RBFNN and SVM. Expert Systems with Applications. 36: 5284–5290.

[9]Ojha, Varun Kumar; Abraham, Ajith; Snášel, Václav (2017-04-01). "Metaheuristic design of feedforward neural networks: A review of two decades of research".

[10]Bottaci, Leonardo. "Artificial Neural Networks Applied to Outcome Prediction for Colorectal Cancer Patients in Separate Institutions".

[11]Cortes, C.; Vapnik, V. (1995). "Support-vector networks". Machine Learning. 20 (3): 273–297.

[12]O. L. Mangasarian and W. H. Wolberg: "Cancer diagnosis via linear programming", SIAM News, Volume 23, Number 5, September 1990, pp 1 & 18.

[13]William H. Wolberg and O.L. Mangasarian: "Multisurface method of pattern separation for medical diagnosis applied to breast cytology", Proceedings of the National Academy of Sciences, U.S.A., Volume 87, December 1990, pp 9193-9196

[14]A Pär Stattin, Sigrid Carlsson, Benny Holmström, Andrew Vickers, Jonas Hugosson, Hans Lilja, Håkan Jonsson, Pro Prostate Cancer Mortality in Areas With High and Low Prostate Cancer Incidence https://academic.oup.com/jnci/article/doi/10.1093/jnci/dju007/1745564/Prostate-Cancer-Mortality-in-Areas-With-High-and.

[15]Liu Y., and Y. F. Zheng. 2004. FS_SFS: A Novel Feature Selection Method for Support Vector Machines. IEEE International Conference on Acoustic, Speech, and Signal Processing. 5: 797–800.

[16]Keyvanfard F., M. A. Shoorehdeli, and M. Teshnehlab. 2011. Feature Selection and Classification of Breast Cancer on Dynamic Magnetic Resonance Imaging using ANN and SVM. American Journal of Biomedical Engineering. 1: 20–25.

[17]Subashini T. S., V. Ramalingam, and S. Palanivel. 2009. Breast Mass Classification Based on Cytological Patterns using RBFNN and SVM. Expert Systems with Applications. 36: 5284–5290.

[18]Ren J. 2012. ANN vs. SVM: Which One Performs Better in Classification of MCCs in Mammogram Imaging. Knowledge-Based Systems. 26: 144–153.

[19]Altman, N. S. (1992). "An introduction to kernel and nearest-neighbour nonparametric regression". The American Statistician. 46 (3): 175–185.