Development and Lessons Learned of Clinical Data Warehouse based on Common Data Model for Drug Surveillance

약물부작용 감시를 위한 공통데이터모델 기반 임상데이터웨어하우스 구축

  • Mi Jung Rho (College of Health Science, Dankook University)
  • 노미정 (단국대학교 공공.보건과학대학)
  • Received : 2023.01.25
  • Accepted : 2023.08.05
  • Published : 2023.09.30

Abstract

Purposes: It is very important to establish a clinical data warehouse based on a common data model to offset the different data characteristics of each medical institution and for drug surveillance. This study attempted to establish a clinical data warehouse for Dankook university hospital for drug surveillance, and to derive the main items necessary for development. Methodology/Approach: This study extracted the electronic medical record data of Dankook university hospital tracked for 9 years from 2013 (2013.01.01. to 2021.12.31) to build a clinical data warehouse. The extracted data was converted into the Observational Medical Outcomes Partnership Common Data Model (Version 5.4). Data term mapping was performed using the electronic medical record data of Dankook university hospital and the standard term mapping guide. To verify the clinical data warehouse, the use of angiotensin receptor blockers and the incidence of liver toxicity were analyzed, and the results were compared with the analysis of hospital raw data. Findings: This study used a total of 670,933 data from electronic medical records for the Dankook university clinical data warehouse. Excluding the number of overlapping cases among the total number of cases, the target data was mapped into standard terms. Diagnosis (100% of total cases), drug (92.1%), and measurement (94.5%) were standardized. For treatment and surgery, the insurance EDI (electronic data interchange) code was used as it is. Extraction, conversion and loading were completed. R language-based conversion and loading software for the process was developed, and clinical data warehouse construction was completed through data verification. Practical Implications: In this study, a clinical data warehouse for Dankook university hospitals based on a common data model supporting drug surveillance research was established and verified. The results of this study provide guidelines for institutions that want to build a clinical data warehouse in the future by deriving key points necessary for building a clinical data warehouse.

Keywords

Acknowledgement

본 연구는 2022년도 한국의약품안전관리원 용역비 지원에 의해 수행되었음

References

  1. The National Patient-Centered Clinical Research Network [cited 2023 July 31]. Available from https://pcornet.org/
  2. Fleurence R, Curtis L, Califf R, Platt R, Selby J, Brown J, Launching PCORnet, a national patient-centered clinical research network. J Am Med Inform Assoc 2014; 21(4) 578-582. https://doi.org/10.1136/amiajnl-2014-002747
  3. McTigue K, Wellman R, Nauman E, Anau J, Coley R, Odor A, Comparing the 5-year diabetes outcomes of sleeve gastrectomy and gastric bypass: the National Patient-Centered Clinical Research Network (PCORNet) Bariatric Study. JAMA surg 2020; 155(5): e200087-e200087. https://doi.org/10.1001/jamasurg.2020.0087
  4. The Biobanking and BioMolecular Resources Research Infrastructure [cited 2023 July 31]. Available from https://www.bbmri-eric.eu/
  5. Litton J. Launch of an infrastructure for health research: BBMRI-ERIC. Biopreserv Biobank 2018; 16(3) : 233-241. https://doi.org/10.1089/bio.2018.0027
  6. Vuorio E. Networking biobanks throughout Europe: the development of BBMRI-ERIC. Biobanking of Human Biospecimens. Springer, Cham, 2017. pp. 137-153.
  7. The European infrastructure for life science data [cited 2023 July 31]. Available from https://elixir-europe.org/
  8. Harrow J, Drysdale R, Smith A, Repo S, Lanfear J, Blomberg N. ELIXIR: providing a sustainable infrastructure for life science data at European Scale. Bioinformatics 2021; 37(16: 2506-2511. https://doi.org/10.1093/bioinformatics/btab481
  9. Crosswell LC, Thornton JM. ELIXIR: a distributed infrastructure for European biological data. Trends Biotechnol 2012; 30(5): 241-2. https://doi.org/10.1016/j.tibtech.2012.02.002
  10. Curtis LH, Weiner MG, Boudreau DM, CooperWO, Daniel GW, Nair VP. et al. Design considerations, architecture, and use of the MiniSentinel distributed data system. Pharmacoepidemiol Drug Saf 2012; 21: 23-31. https://doi.org/10.1002/pds.2336
  11. Sentinel Common Data Model, [cited 2023 January 22]. Available from https://www.sentinelinitiative.org/methods-data-tools/sentinel-common-data-model
  12. PCORnet Common Data Model, [cited 2023 January 22]. Available from https://pcornet.org/data/
  13. Prats-Uribe A, Sena AG, Lai LYH, Alghoul H, Alser O, Alshammari TM, et al. Use of repurposed and adjuvant drugs in hospital patients with covid-19: multinational network cohort study. Bmj 2021; 373.
  14. Reisinger SJ, Ryan PB, O'Hara DJ, Powell GE, Painter JL, Pattishall EN, et al. Development and evaluation of a common data model enabling active drug safety surveillance using disparate healthcare databases. J Am Med Inform Assoc 2010; 17(6): 652-662. https://doi.org/10.1136/jamia.2009.002477
  15. Drug Studies [cited 2023 January 22]. Available from https://www.sentinelinitiative.org/assess-ments/drugs
  16. Trifiro G, Coloma PM, Rijnbeek PR, Romio S, Mosseveld B, Weibel D, et al. Combining multiple healthcare databases for postmarketing drug and vaccine safety surveillance: why and how?. J Intern Med 2014; 275(6): 551-561. https://doi.org/10.1111/joim.12159
  17. Reisinger S, Powell G, Dreyfus B, Schneider G, Use of common data model to meaningfully compare treatment patterns for depression among disparate databases. Value Health 2013; 16(3): A24.
  18. PCORnet CDM 6.0 [cited 2023 January 22]. Available from https://pcornet.org/wp-content/uploads/2022/01/PCORnet-Common-Data-Model-v60-2020_10_221.pdf
  19. Juarez D, Schmidt EE, Stahl-Toyota S, uckert F, Lablans M, A generic method and implementation to evaluate and improve data quality in distributed research networks. Methods Inf Med 2019; 58(02/03): 086-093. https://doi.org/10.1055/s-0039-1693685
  20. Qualls LG, Phillips TA, Hammill BG, Topping J, Louzao DM, Brown JS, et al. Evaluating foundational data quality in the national patient-centered clinical research network (PCORnet®). Egems 2018;6(1).
  21. Marsolo KA, Brown JS, Hernandez AF, Hammill BG, Raman SR, Syat B, et al. Considerations for using distributed research networks to conduct aspects of randomized trials. Contemp. Clin. Trials Commun 2020; 17: 100515.
  22. Panozzo C. Distributed research networks and applications to pragmatic randomized trials. Pragmatic Randomized Clinical Trials. Academic Press, 2021. pp. 237-256.
  23. Oh S, Sung M, Rhee Y, Hong N, Park YR. Evaluation of the Privacy Risks of Personal Health Identifiers and Quasi-Identifiers in a Distributed Research Network: Development and Validation Study. JMIR Med Inform 2021;9(5): e24940.
  24. 조도연. HIRA ISSUE: 코로나 19의 세계적 유행에 대응하는 국제협력 연구플랫폼. 건강보험심사평가원 심사평가연구소: .14 2020.
  25. You S, Rho Y, Bikdeli B, Kim J, Siapos A, Weaver J, et al. Association of ticagrelor vs clopidogrel with net adverse clinical events in patients with acute coronary syndrome undergoing percutaneous coronary intervention. Jama 2020; 324(16): 1640-1650. https://doi.org/10.1001/jama.2020.16167
  26. Williams R, Markus A, Yang C, Salles T, Falconer T, Jonnagaddala J, et al. Seek COVER: Development and validation of a personalized risk calculator for COVID-19 outcomes in an international network. MedRxiv 2020: 05.
  27. Kim H, Yoon J, Kwak M, Cha J. Gastrointestinal and nongastrointestinal complications of esophagogastroduodenoscopy and colonoscopy in the real world: A nationwide standard cohort using the common data model database. Gut Liver 2021; 15(4): 569.
  28. Rho Y, Son Y, Lee Y, Kim J, Lee H, You S, et al. COVID-19 international collaborative research by the health insurance review and assessment service using its nationwide real-world data: database, outcomes, and implications. J Prev Med Public Health 2021; 54(1): 8.
  29. Son N, Kim B, Chung S, Han S. Korean Pharmacovigilance System Based on EHR-CDM. MedInfo 2019; 264:1592-3.
  30. Choi W, Yang YS, Chang DJ, Chung YW, Kim H, Ko SJ, et al. Association between the use of allopurinol and risk of increased thyroid-stimulating hormone level. Sci Rep 2021;11:1-10. https://doi.org/10.1038/s41598-020-79139-8
  31. OMOP CDM 5.4 [cited 2023 January 22]. Available from http://ohdsi.github.io/CommonDataModel/
  32. Nunez M. Clinical syndromes and consequences of antiretroviral-related hepatotoxicity. Hepatology 2010; 52(3): 1143-1155. https://doi.org/10.1002/hep.23716
  33. Yu Y, Zong N, Wen A, Liu S, Stone DJ, Knaack D, et al. Developing an ETL tool for converting the PCORnet CDM into the OMOP CDM to facilitate the COVID-19 data integration. J Biomed Inform 2022; 127: 104002.
  34. Park R. The Distributed Research Network, Observational Health Data Sciences and Informatics, and the South Korean Research Network. Korean J Med 2019; 94(4) : 309-314. https://doi.org/10.3904/kjm.2019.94.4.309
  35. Byeon SJ, Park HA, Kang DY, Choi IY, Choi JH, Surveillance of Adverse Drug Reactions to First-line Antituberculosis Drugs Using Common Data Model. Journal of Pharmacoepidemiology and Risk Management 2021; 13(2): 78-86. https://doi.org/10.56142/2021.13.2.78
  36. 하재호, 이은영, 이동윤, 조용혁, 이혜림, 박범희, 외. 우울증 환자에서 항우울제 단독요법과 항정신병제와 항우울제의 병용요법의 자살위험비교: 정신건강의학 공통 데이터 모델을 활용한 파일럿 연구. J Korean Neuropsychiatr Assoc 2020; 59(3) : 243-249. https://doi.org/10.4306/jknpa.2020.59.3.243
  37. Juarez D, Schmidt EE, Stahl-Toyota S, uckert F, Lablans M, A generic method and implementation to evaluate and improve data quality in distributed research networks. Methods Inf Med 2019; 58(02/03): 086-093. https://doi.org/10.1055/s-0039-1693685
  38. Kang B, Yoon J, Kim HY, Jo SJ, Lee Y, Kam HJ, Deep-learning-based automated terminology mapping in OMOP-CDM. J Am Med Inform Assoc 2021; 28(7): 1489-1496. https://doi.org/10.1093/jamia/ocab030
  39. Ahmadi N, Peng Y, Wolfien M, Zoch M, Sedlmayr M, OMOP CDM Can Facilitate Data-Driven Studies for Cancer Prediction: A Systematic Review. Int J Mol Sci 2022; 23(19): 11834.
  40. 안윤애, 조한진. 공통데이터모델 기반의 임상의사결정 지원시스템에 관한 연구. 한국융합학회논문지 2019; 10(11): 117-124. https://doi.org/10.15207/JKCS.2019.10.11.117
  41. Rho MJ, Kim SR, Park SH, Jang KS, Park BJ, Hong JY, et al. Common data model for decision support system of adverse drug reaction to extract knowledge from multi-center database. Inf. Technol. Manag 2016; 17(1): 57-66. https://doi.org/10.1007/s10799-015-0240-6