International Journal of Information Technology and Computer Science(IJITCS)

ISSN: 2074-9007 (Print), ISSN: 2074-9015 (Online)

Published By: MECS Press

IJITCS Vol.7, No.3, Feb. 2015

Feature Selection in Detection of Adverse Drug Reactions from the Health Improvement Network (THIN) Database

Full Text (PDF, 524KB), PP.68-85

Views:77   Downloads:4


Yihui Liu, Uwe Aickelin

Index Terms

Adverse Drug Reactions, Feature Matrix, Feature Selection, Atorvastatin, Alendronate, Metoclopramide


Adverse drug reaction (ADR) is widely concerned for public health issue. ADRs are one of most common causes to withdraw some drugs from market. Prescription event monitoring (PEM) is an important approach to detect the adverse drug reactions. The main problem to deal with this method is how to automatically extract the medical events or side effects from high-throughput medical events, which are collected from day to day clinical practice. In this study we propose a novel concept of feature matrix to detect the ADRs. Feature matrix, which is extracted from big medical data from The Health Improvement Network (THIN) database, is created to characterize the medical events for the patients who take drugs. Feature matrix builds the foundation for the irregular and big medical data. Then feature selection methods are performed on feature matrix to detect the significant features. Finally the ADRs can be located based on the significant features. The experiments are carried out on three drugs: Atorvastatin, Alendronate, and Metoclopramide. Major side effects for each drug are detected and better performance is achieved compared to other computerized methods. The detected ADRs are based on computerized methods, further investigation is needed.

Cite This Paper

Yihui Liu, Uwe Aickelin,"Feature Selection in Detection of Adverse Drug Reactions from the Health Improvement Network (THIN) Database", International Journal of Information Technology and Computer Science(IJITCS), vol.7, no.3, pp.68-85, 2015. DOI: 10.5815/ijitcs.2015.03.10


[1]G. Severino and M.D. Zompo, “Adverse drug reactions: role of pharmacogenomics,” Pharmacological Research, 49: 363-373, 2004.

[2]Y. Qian, X. Ye, W. Du, J. Ren, Y. Sun, H. Wang, B. Luo, Q. Gao, M. Wu, and J. He, “A computerized system for signal detection in spontaneous reporting system of Shanghai China,” Pharmacoepidemiology and Drug Safety, 18: 154-158, 2009.

[3]K.S. Park and O. Kwon, “The state of adverse event reporting and signal generation of dietary supplements in Korea,” Regulatory Toxicology and Pharmacology, 57: 74-77, 2010.

[4]A. Farcas, A. Sinpetrean, C. Mogosan, M. Palage, O. Vostinaru, M. Bojita, D. Dumitrascu, “Adverse drug reactions detected by stimulated spontaneous reporting in an internal medicine department in Romania,” European Journal of Internal Medicine, 21: 453-457, 2010.

[5]A. Szarfman , S.G. Machado and R.T. O'Neill, “Use of screening algorithms and computer systems to efficiently signal higher-than-expected combinations of drugs and events in the US FDA’s spontaneous reports database,” Drug Saf., 25: 381-392, 2002.

[6]R. Kasliwal, L.V. Wilton, V. Cornelius, B. Aurich-Barrera and S.A.W. Shakir, “Safety profile of Rosuvastatin-results of a prescription-event monitoring study of 11680 patients,” Drug Safety, 30: 157-170, 2007.

[7]R.D. Mann, K. Kubota, G. Pearce and L. Wilton, “Salmeterol: a study by prescription-event monitoring in a UK cohort of 15,407 patients,” J Clin Epidemio, 49: 247-250, 1996.

[8]N.S. Rawson, G.L. Pearce and W.H. Inman, “Prescription-event monitoring: methodology and recent progress,” Journal of Clinical Epidemiology, 43: 509-522, 1990.

[9]B. Aurich-Barrera, L. Wilton, D. Brown and S. Shakir, “Paediatric postmarketing pharmacovigilance using prescription-event monitoring: comparison of the adverse event profile of lamotrigine prescribed to children and adults in England,” Drug Saf., 33: 751-763, 2010.

[10]T.J. Hannan, “Detecting adverse drug reactions to improve patient outcomes,” International Journal of Medical Informatics, 55: 61-64, 1999.

[11]R.H. Meyboom, M. Lindquist, A.C. Egberts and I.R. Edwards, “Signal detection and follow-up in pharmacovigilance,” Drug Saf., 25: 459-465, 2002.

[12]A. Bate, M. Lindquist, I.R. Edwards, S. Olsson, R. Orre, A. Lansner and R.M. De Freitas, “A Bayesian neural network method for adverse drug reaction signal generation,” Eur J Clin Pharmacol, 54: 315-321, 1998.

[13]R. Orre, A. Lansner, A. Bate and M. Lindquist, “Bayesian neural networks with confidence estimations applied to data mining,” Computational Statistics & Data Analysis, 34: 473-493, 2000.

[14]M. Hauben and A. Bate, “Decision support methods for the detection of adverse events in post-marketing data,” Drug Discovery Today, 14: 343-357, 2009. 

[15]Y. Koh, C.W. Yap and S.C. Li, “A quantitative approach of using genetic algorithm in designing a probability scoring system of an adverse drug reaction assessment system,” International Journal of Medical Informatics, 77: 421-430, 2008.

[16]C. Henegar, C. Bousquet, A.L. Lillo-Le, P. Degoulet and M.C. Jaulent, “A knowledge based approach for automated signal generation in pharmacovigilance,” Stud Health Technol Inform, 107: 626-630, 2004

[17]C. Bousquet , C. Henegar , A.L. Louët , P. Degoulet and M.C. Jaulent, “Implementation of automated signal generation in pharmacovigilance using a knowledge-based approach,” International Journal of Medical Informatics, 74: 563-571, 2005.

[18]H. Jin, J. Chen, H. He, G.J. Williams, C. Kelman and C.M. O'Keefe, “Mining unexpected temporal associations: applications in detecting adverse drug reactions,” IEEE Trans Inf Technol Biomed, 12: 488-500, 2008.

[19]H. Jin, J. Chen, H. He, C. Kelman, D. McAullay and C.M. O'Keefe, “Signaling Potential Adverse Drug Reactions from Administrative Health Databases,” IEEE Trans. Knowl. Data Eng., 22: 839-853, 2010.

[20]G. Williams , D. Vickers , C. Rainsford , L. Gu , H. He , R. Baxter and S. Hawkins, Bias in the Queensland linked data set, CSIRO Math. Inf. Sci., 2002.

[21]J. Reps, J. Garibaldi, U. Aickelin, D. Soria, J. Gibson and R. Hubbard, “Comparison of algorithms that detect drug side effects using electronic healthcare databases,” Soft Computing, 2013

[22]Y. Liu, U. Aickelin, Jan. Feyereisl and L. Durrant, “Wavelet feature extraction and genetic algorithm for biomarker detection in colorectal cancer data,” Knowledge-Based Systems, 37: 502-514, 2013.

[23]H. Uguz, “A two-stage feature selection method for text categorization by using information gain, principal component analysis and genetic algorithm,” Knowledge-Based Systems, 24: 1024-1032, 2011.

[24]E. Kreyszig, Introductory Mathematical Statistics, John Wiley, New York, 1970.

[25]B. Rosner, Fundamentals of Biostatistics, Duxbury press, 2000.




[29]A.B. Fernández, R.H. Karas, A.A. Alsheikh-Ali and P.D. Thompson, “Statins and interstitial lung disease: a systematic review of the literature and of food and drug administration adverse event reports,” Chest, 134: 824-830, 2008.

[30]N. Sattar, D. Preiss, and H.M. Murray, et al., “Statins and risk of incident diabetes: a collaborative meta-analysis of randomized statins trials,” Lancet, 375: 735-742, 2010.

[31]D. Preiss, S.R.K. Seshasai, and P. Welsh, et al., “Risk of incident diabetes with intensive-dose compared with moderate-dose statin therapy,” JAMA, 305: 2556-2564, 2011.



[34]N. Miura, N. Mizuno, R. Aoyama, W. Kitagawa, H. Yamada, K. Nishikawa and H. Imai, “Massive proteinuria and acute renal failure after oral bisphosphonate (alendronate) administration in a patient with focal segmental glomerulosclerosis,” Clin Exp Nephrol, 13: 85-88, 2009.






[40]R. Parimal and R. Nallaswamy ,"Feature selection using a novel particle swarm optimization and It’s Variants," I.J. Information Technology and Computer Science, 5: 16-24, 2012

[41]S. Chittineni and R. Bhogapathi ,"Determining contribution of features in clustering multidimensional data using neural network," I.J. Information Technology and Computer Science, 10: 29-36, 2012.