dc.contributor.author | Braithwaite, B | |
dc.contributor.author | Paananen, J | |
dc.contributor.author | Taipale, H | |
dc.contributor.author | Tanskanen, A | |
dc.contributor.author | Tiihonen, J | |
dc.contributor.author | Hartikainen, S | |
dc.contributor.author | Tolppanen, A-M | |
dc.date.accessioned | 2020-06-16T10:03:35Z | |
dc.date.available | 2020-06-16T10:03:35Z | |
dc.date.issued | 2020 | |
dc.identifier.uri | https://erepo.uef.fi/handle/123456789/8191 | |
dc.description.abstract | Objective
To study the feasibility of evaluating feature importance with Shapley Values and ensemble methods in the context of pharmacoepidemiology and medication safety.
Methods
We detected medications associated with Alzheimer's disease (AD) by examining the additive feature attribution with combined approach of Gradient Boosting and Shapley Values in the Medication use and Alzheimer's disease (MEDALZ) study, a nested case-control study of 70,719 verified AD cases in Finland. Our methodological approach is to do binary classification using Gradient boosting (an ensemble of weak classifiers) in a supervised learning manner. Then we apply Shapley Values (from cooperative game theory) to analyze how feature combinations affect the classification result. Medication use with a five to one year time-window before AD diagnosis was ascertained from Prescription register.
Results
Antipsychotics with low or medium dose, antidepressants with medium to high dose, and cardiovascular medications with medium to high dose were identified as the contributing features for separating cases with AD from controls. Medium to high amount of irregularity in the purchase pattern were an indicating feature for separating AD cases from controls. The similarity of medication purchases between AD cases and controls made the feature evaluation challenging.
Conclusions
The combined approach of Gradient Boosting and feature evaluation with Shapley Values identified features that were consistent with findings from previous hypothesis-driven studies. Additionally, the results from the additive feature attribution identified new candidates for future studies on AD risk factors. Our approach also shows promise for studies based on observational studies, where feature identification and interactions in populations are of interest; and the applicability of using Shapley Values for evaluating feature relevance in pattern recognition tasks. | |
dc.language.iso | englanti | |
dc.publisher | Elsevier BV | |
dc.relation.ispartofseries | International journal of medical informatics | |
dc.relation.uri | http://dx.doi.org/10.1016/j.ijmedinf.2020.104142 | |
dc.rights | CC BY-NC-ND 4.0 | |
dc.subject | pattern recognition | |
dc.subject | knowledge organization | |
dc.subject | feature attribution | |
dc.subject | cooperative game theory | |
dc.subject | Shapley value | |
dc.subject | Gradient boosting | |
dc.subject | Alzheimer's disease | |
dc.subject | nested case-control study | |
dc.subject | pharmacoepidemiology | |
dc.title | Detection of medications associated with Alzheimer's disease using ensemble methods and cooperative game theory | |
dc.description.version | published version | |
dc.contributor.department | School of Pharmacy, Activities | |
dc.contributor.department | School of Medicine / Biomedicine,School of Medicine / Clinical Medicine | |
uef.solecris.id | 71269915 | en |
dc.type.publication | Tieteelliset aikakauslehtiartikkelit | |
dc.relation.doi | 10.1016/j.ijmedinf.2020.104142 | |
dc.description.reviewstatus | peerReviewed | |
dc.publisher.country | Alankomaat | |
dc.relation.articlenumber | 104142 | |
dc.relation.issn | 1386-5056 | |
dc.relation.volume | 141 | |
dc.rights.accesslevel | openAccess | |
dc.type.okm | A1 | |
uef.solecris.openaccess | Hybridijulkaisukanavassa ilmestynyt avoin julkaisu | |
dc.rights.copyright | © 2020 The Authors | |
dc.type.displayType | article | en |
dc.type.displayType | artikkeli | fi |
dc.rights.url | https://creativecommons.org/licenses/by-nc-nd/4.0/ | |