Skip to main content
UOIT Home
Please use this identifier to cite or link to this item: http://hdl.handle.net/10155/318

Issue Date:  5
Title: Data mining occurrences of infectious diseases with SNOMED CT
Authors: Ciolko, Ewelina
Publisher : University of Ontario Institute of Technology
Degree : Master of Health Sciences (MHSc)
Department : Health Informatics
Supervisor : Lu, Fletcher
Keywords: SNOMED CT
Data mining
World Health Organization
Infectious diseases
Simple CART theory
Naive Bayes
Best Fit Trees
World health statistics
Abstract: Synonyms within SNOMED CT’s structure give meaning to the clinical terminology. The hypothesis in this thesis is that the number of synonyms of a disease within SNOMED CT can be used to predict the number of occurrences of an infectious disease reported on by the World Health Organization (WHO). Using simple Classification and Regression (CART), Bayes theory, and Best Fit trees, prediction algorithms are created based on the number of synonyms in infectious disease terms of SNOMED CT, the number of those diseases world-wide, the region of occurrence of the disease, and the year of occurrence of the disease. The results of experiments predict the number of occurrences of a disease correctly 67% of the time by using Simple Cart method; Bayes and Best Fit Trees each produce the correct number of occurrences 61% of the time.
Appears in Collections:Faculty of Health Sciences - Master Theses
Electronic Theses and Dissertations (Public)

Files in This Item:

File Description SizeFormat
Ciolk_Ewelina.pdf293.93 kBAdobe PDFView/Open

Items in e-scholar@UOIT are protected by copyright, with all rights reserved, unless otherwise indicated.