Repository logo
Article

Unsupervised labeling of data for supervised learning and its application to medical claims prediction

creativeworkseries.issn1508-2806
dc.contributor.authorNgufor, Che G.
dc.contributor.authorWojtusiak, Janusz
dc.date.available2017-09-19T07:27:53Z
dc.date.issued2013
dc.descriptionBibliogr. s. 213-214.
dc.description.abstractThe task identifying changes and irregularities in medical insurance claim payments is a difficult process of which the traditional practice involves querying historical claims databases and flagging potential claims as normal or abnormal. Because what is considered as normal payment is usually unknown and may change over time, abnormal payments often pass undetected, only to be discovered when the payment period has passed. This paper presents the problem of on-line unsupervised learning from data streams when the distribution that generates the data changes or drifts over time. Automated algorithms for detecting drifting concepts in a probability distribution of the data are presented. The idea behind the presented drift detection methods is to transform the distribution of the data within a sliding window into a more convenient distribution. Then, a test statistics p-value at a given significance level can be used to infer the drift rate, adjust the window size and decide on the status of the drift. The detected concepts drifts are used to label the data, for subsequent learning of classification models by a supervised learner. The algorithms were tested on several synthetic and real medical claims data sets.en
dc.description.placeOfPublicationKraków
dc.description.versionwersja wydawniczapl
dc.identifier.doihttps://doi.org/10.7494/csci.2013.14.2.191
dc.identifier.eissn2300-7036
dc.identifier.issn1508-2806
dc.identifier.nukatdd2013319089pl
dc.identifier.urihttps://repo.agh.edu.pl/handle/AGH/49106
dc.language.isoeng
dc.publisherWydawnictwa AGH
dc.relation.ispartofComputer Science
dc.rightsAttribution 4.0 International
dc.rights.accessotwarty dostęp
dc.rights.urihttps://creativecommons.org/licenses/by/4.0/legalcode
dc.subjectunsupervised learningen
dc.subjectconcept driften
dc.subjectmedical claimsen
dc.titleUnsupervised labeling of data for supervised learning and its application to medical claims predictionen
dc.title.relatedComputer Science
dc.typeartykuł
dspace.entity.typePublication
publicationissue.issueNumberNo. 2
publicationissue.paginationpp. 191-214
publicationvolume.volumeNumberVol. 14
relation.isJournalIssueOfPublication6f6fcb81-b78b-4fb6-9d5c-8d45219aa90e
relation.isJournalIssueOfPublication.latestForDiscovery6f6fcb81-b78b-4fb6-9d5c-8d45219aa90e
relation.isJournalOfPublication020291ee-249b-4dcf-98a3-276a2f7981aa

Files

Original bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
csci.2013.14.2.191.pdf
Size:
271.15 KB
Format:
Adobe Portable Document Format