• Title/Summary/Keyword: Medical Big Data

Validation of Administrative Big Database for Colorectal Cancer Searched by International Classification of Disease 10th Codes in Korean: A Retrospective Big-cohort Study

  • Hwang, Young-Jae;Kim, Nayoung;Yun, Chang Yong;Yoon, Hyuk;Shin, Cheol Min;Park, Young Soo;Son, Il Tae;Oh, Heung-Kwon;Kim, Duck-Woo;Kang, Sung-Bum;Lee, Hye Seung;Park, Seon Mee;Lee, Dong Ho
    • Journal of Cancer Prevention / v.23 no.4 / pp.183-190 / 2018
  • Background: As the number of big-cohort studies increases, validation becomes increasingly important. We aimed to validate an administrative database in which cases were categorized as colorectal cancer (CRC) by International Classification of Disease (ICD) 10th codes. Methods: A big cohort was collected from the Clinical Data Warehouse using ICD 10th codes from May 1, 2003 to November 30, 2016 at Seoul National University Bundang Hospital. The patients in the study group had been diagnosed with cancer and were recorded under the ICD 10th code of CRC by the National Health Insurance Service. Subjects with codes of inflammatory bowel disease or tuberculosis colitis were selected for the control group. To assess the accuracy of registered CRC codes (C18-21), the chart, imaging results, and pathologic findings were examined by two reviewers. Sensitivity, specificity, positive predictive value (PPV), and negative predictive value (NPV) for CRC were calculated. Results: A total of 6,780 subjects with CRC and 1,899 control subjects were enrolled. Of these patients, 22 subjects did not have evidence of CRC by colonoscopy, computed tomography, magnetic resonance imaging, or positron emission tomography. The sensitivity and specificity of hospitalization data for identifying CRC were 100.00% and 98.86%, respectively. PPV and NPV were 99.68% and 100.00%, respectively. Conclusions: The big-cohort database using the ICD 10th code for CRC appears to be accurate.
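
The reported accuracy figures can be approximately reproduced from a 2x2 table. The sketch below assumes, which the abstract does not state explicitly, that the 22 subjects without evidence of CRC are the only false positives among the 6,780 CRC-coded subjects and that none of the 1,899 controls had CRC. The function name `diagnostic_metrics` is illustrative.

```python
def diagnostic_metrics(tp, fp, fn, tn):
    """Return sensitivity, specificity, PPV and NPV (as percentages) from 2x2 counts."""
    return {
        "sensitivity": 100 * tp / (tp + fn),
        "specificity": 100 * tn / (tn + fp),
        "ppv": 100 * tp / (tp + fp),
        "npv": 100 * tn / (tn + fn),
    }


# Assumed 2x2 table, inferred from the abstract rather than reported in it.
coded_crc = 6780    # subjects carrying a CRC code (C18-21)
no_evidence = 22    # coded subjects without evidence of CRC (treated as false positives)
controls = 1899     # control subjects (assumed to be all true negatives)

metrics = diagnostic_metrics(
    tp=coded_crc - no_evidence, fp=no_evidence, fn=0, tn=controls
)
for name, value in metrics.items():
    print(f"{name}: {value:.2f}%")
```

Under these assumptions, sensitivity, PPV, and NPV agree with the reported 100.00%, 99.68%, and 100.00%. Specificity lands within about 0.01 percentage points of the reported 98.86%, so the authors' exact counts may differ slightly from this reconstruction.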

A Study on the Application of Natural Language Processing in Health Care Big Data: Focusing on Word Embedding Methods (보건의료 빅데이터에서의 자연어처리기법 적용방안 연구: 단어임베딩 방법을 중심으로)

  • Kim, Hansang;Chung, Yeojin
    • Health Policy and Management / v.30 no.1 / pp.15-25 / 2020
  • While healthcare data sets include extensive information about patients, many researchers face difficulties in analyzing them because of intrinsic characteristics such as heterogeneity, longitudinal irregularity, and noise. In particular, since the majority of medical history information is recorded as text codes, the use of such information has been limited by the high dimensionality of the explanatory variables. To address this problem, recent studies have applied word embedding techniques, originally developed for natural language processing, and obtained positive results in terms of dimensionality reduction and prediction accuracy. This paper reviews deep learning-based natural language processing techniques (word embedding) and summarizes research cases that have used them in the health care field. Finally, we propose a research framework for applying deep learning-based natural language processing to the analysis of domestic health insurance data.

Analysis of interest in non-face-to-face medical counseling of modern people in the medical industry (의료 산업에 있어 현대인의 비대면 의학 상담에 대한 관심도 분석 기법)

  • Kang, Yooseong;Park, Jong Hoon;Oh, Hayoung;Lee, Se Uk
    • Journal of the Korea Institute of Information and Communication Engineering / v.26 no.11 / pp.1571-1576 / 2022
  • This study analyzes modern people's interest in non-face-to-face medical counseling in the medical industry. Big data was collected from two social platforms: Naver Knowledge iN (지식인), a platform that allows users to receive medical counseling from experts, and YouTube. A data set was built from each platform with a total of eight search terms: the top five keywords of telephone counseling ("internal medicine", "general medicine", "department of neurology", "department of mental health", and "pediatrics") plus "specialist", "medical counseling", and "health information". Afterwards, pre-processing steps such as morpheme classification, disease extraction, and normalization were performed on the crawled data. The data was visualized with word clouds, line graphs, quarterly graphs, and bar graphs of disease frequency, all based on word frequency. A sentiment classification model was constructed for the YouTube data only, and the performance of GRU-based and BERT-based models was compared.

Design of Health Warning Model on the Basis of CRM by use of Health Big Data (의료 빅데이터를 활용한 CRM 기반 건강예보모형 설계)

  • Lee, Sangwon;Shin, Seong-Yoon
    • Journal of the Korea Institute of Information and Communication Engineering / v.20 no.8 / pp.1460-1465 / 2016
  • High costs threaten the sustainability of the national health-guarantee system. Despite research by the national center for disease control and prevention on health care dynamics with its auditing systems, there are still limitations in time, sample, and target diseases. Against this backdrop, with a huge volume of complete data, many technologies could be applied to preliminary health forecasting and to expanding its range of target diseases. With structured data from the national health insurance and unstructured data from social network services, we attempted to design a model to predict disease. The model can enhance national health and maximize social benefit by providing a health warning service. The model can also curb the increase in national health costs and predict disease occurrence in a timely manner based on Big Data analysis. We reviewed related medical prediction cases and performed an experiment with a pilot project to verify the proposed model.

IoT-based Digital Life Care Industry Trends

  • Kim, Young-Hak
    • International journal of advanced smart convergence / v.8 no.3 / pp.87-94 / 2019
  • IoT-based services are being released in response to the aging population and the growing demand for well-being. In addition to medical device companies, companies with ideas, ranging from global ICT companies to startups, are accelerating their market entry. The areas where these services are most commonly applied are health/medical, life/safety, city/energy, automotive, and transportation. Furthermore, expanding IoT technology convergence into the area of life care services contributes greatly to the development of service models in the public sector. It also provides an important opportunity for IoT-related companies to open up new markets. By addressing the problems of life care services that are still insufficient, we provide opportunities to pursue the common interests of both users and workers and to improve the quality of life. In order to establish IoT-based digital life care services, it is necessary to develop convergence technologies using cloud computing systems, big data analytics, medical information, and smart healthcare infrastructure.

A Study on Reliability Analysis According to the Number of Training Data and the Number of Training (훈련 데이터 개수와 훈련 횟수에 따른 과도학습과 신뢰도 분석에 대한 연구)

  • Kim, Sung Hyeock;Oh, Sang Jin;Yoon, Geun Young;Kim, Wan
    • Korean Journal of Artificial Intelligence / v.5 no.1 / pp.29-37 / 2017
  • With the growing use of big data and advances in hardware, the range of problems that can be handled has expanded rapidly, and machine learning such as deep learning has become a very versatile technology. In this paper, the MNIST data set is used as experimental data, and the cross-entropy function is used as the loss function for evaluating the efficiency of machine learning. We applied a gradient descent optimization algorithm (the steepest descent method) to minimize the value of the loss function, and updated the weights and biases via backpropagation. In this way, we analyze the optimal reliability value corresponding to the number of training iterations and the optimal reliability value without overfitting. Comparing the onset of overfitting as the amount of data changes for a given number of training iterations, we obtained 92%, the optimal reliability value without overfitting, when the number of training iterations was 1110.
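
The training loop described in this abstract (cross-entropy loss minimized by gradient descent, with weights and biases updated from the gradients) can be sketched as follows. This is a minimal illustration, not the authors' code: it uses a single softmax layer with hand-derived gradients in place of full backpropagation, and a four-point toy data set instead of MNIST. All names, the learning rate, and the step count are assumptions.

```python
import math


def softmax(logits):
    """Numerically stable softmax over a list of logits."""
    m = max(logits)
    exps = [math.exp(v - m) for v in logits]
    total = sum(exps)
    return [e / total for e in exps]


def forward(W, b, x):
    """Class probabilities of a single softmax layer for one input vector."""
    logits = [sum(w * v for w, v in zip(W[k], x)) + b[k] for k in range(len(b))]
    return softmax(logits)


def train(xs, ys, n_classes, lr=0.5, steps=300):
    """Full-batch gradient descent on the mean cross-entropy loss."""
    n, n_features = len(xs), len(xs[0])
    W = [[0.0] * n_features for _ in range(n_classes)]
    b = [0.0] * n_classes
    losses = []
    for _ in range(steps):
        grad_W = [[0.0] * n_features for _ in range(n_classes)]
        grad_b = [0.0] * n_classes
        loss = 0.0
        for x, y in zip(xs, ys):
            p = forward(W, b, x)
            loss += -math.log(p[y])  # cross-entropy for the true class
            for k in range(n_classes):
                err = p[k] - (1.0 if k == y else 0.0)  # d(loss)/d(logit_k)
                grad_b[k] += err
                for j in range(n_features):
                    grad_W[k][j] += err * x[j]
        # Gradient descent update of weights and biases.
        for k in range(n_classes):
            b[k] -= lr * grad_b[k] / n
            for j in range(n_features):
                W[k][j] -= lr * grad_W[k][j] / n
        losses.append(loss / n)
    return W, b, losses


# Toy, linearly separable data (hypothetical; stands in for MNIST).
xs = [[0.0, 0.1], [0.2, 0.0], [0.9, 1.0], [1.0, 0.8]]
ys = [0, 0, 1, 1]

W, b, losses = train(xs, ys, n_classes=2)
predictions = [max(range(2), key=lambda k: forward(W, b, x)[k]) for x in xs]
print(f"first loss: {losses[0]:.4f}, last loss: {losses[-1]:.4f}")
print("predictions:", predictions)
```

Tracking the loss on a held-out set alongside `losses` would be the natural way to look for the overfitting point the abstract refers to. That step is omitted here to keep the sketch short.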

Fatigue, Personality Traits, Learning Strategies, and Academic Achievement in Graduate-entry Medical Students (의학전문대학원 학생의 피로, 성격특성, 학습전략과 학업성취도의 관계)

  • Hwang, In Cheol;Park, Kwi Hwa;Yim, Jun;Kim, Jin Joo;Ko, Kwang Pil;Bae, Seung Min;Kyung, Sun Young
    • The Journal of the Korea Contents Association / v.16 no.4 / pp.231-240 / 2016
  • The purpose of this study is to investigate the relationship among fatigue, personality, learning strategies, and academic achievement of medical students. A total of 146 students from year 1 to year 4 at one medical school participated in this study. Students completed questionnaires on fatigue, Big Five personality traits (Neuroticism, Extraversion, Openness, Agreeableness, Conscientiousness), and learning strategies. Academic achievement was measured by GPA. The data were analyzed by t-test and stepwise multiple regression. Students' fatigue differed by grade, with students in lower grades scoring higher than those in higher grades, but personality traits and learning strategies did not differ significantly by grade. The factors that affect academic achievement also differed by grade. In lower grades, neuroticism, extraversion, and rehearsal affected students' academic achievement. In higher grades, conscientiousness and extraversion had an effect on academic achievement. These results could guide the design of improvements in medical education and be useful in developing a support program for medical students.

Wellness Prediction in Diabetes Mellitus Risks Via Machine Learning Classifiers

  • Saravanakumar M, Venkatesh;Sabibullah, M.
    • International Journal of Computer Science & Network Security / v.22 no.4 / pp.203-208 / 2022
  • The occurrence of Type 2 Diabetes Mellitus (T2DM) is increasing globally. All kinds of Diabetes Mellitus are estimated to affect over 415 million adults worldwide. It was the seventh leading cause of death worldwide, with an estimated 1.6 million deaths directly caused by diabetes during 2016. Over 90% of diabetes cases are T2DM, with most people having at least one other chronic condition in the UK. To evaluate current applications of Big Data (BD) to diabetes care and their future potential, it is necessary to carry out an in-depth review of the main theoretical literature. Long-term progress in medicine and, in particular, in the field of "Diabetology" is strongly tied to a series of changes and innovations. Medical and healthcare data from varied sources, such as diagnosis and treatment plans, help healthcare workers gain insight into the development of the diabetes care measures available to them. Apache Spark provides the "Resilient Distributed Dataset (RDD)", a fundamental data structure distributed across a cluster of machines. Machine Learning (ML) offers a noteworthy approach for building elegant and automatic algorithms. An ML library consisting of common ML algorithms, such as Support Vector Classification and Random Forest, is investigated in this proposed work using Python code in Jupyter Notebook, and the models achieve significant accuracy.

A Study on the Status of Medical Equipment and Radiological Technologists using Big Data for Health Care: Based on Data for 2020-2021 (보건의료 빅데이터를 활용한 의료장비 및 방사선사 인력 현황 연구 : 2020-2021년 자료를 기준으로)

  • Jang, Hyon-Chol
    • Journal of the Korean Society of Radiology / v.15 no.5 / pp.667-673 / 2021
  • As we enter the era of the 4th industrial revolution, the scope of work of radiological technologists is expected to expand further with the innovation and advancement of radiation medical technology. In this study, the current status of medical equipment and radiological technologists was identified, and basic data were provided for planning how to nurture talent in the field of radiation medical technology in the era of the 4th industrial revolution, as well as for career and employment counseling. Data from the second quarter of 2020 and the second quarter of 2021 were analyzed using health and medical big data. Comparing the status of medical equipment by type in 2021 with 2020, C-Arm X-ray examination equipment increased by 5.83% to 6,638 units, followed by MRI examination equipment (1,811 units, up 5.29%), angiography equipment (725 units, up 5.22%), general X-ray examination equipment (21,557 units, up 3.99%), CT examination equipment (2,136 units, up 3.03%), and breast examination equipment (3,425 units, up 3.00%). The total number of radiological technologists in 2021 was 29,038, an increase of 2.73% over 2020. By region, the increase in radiological technologists was highest in the Gyeonggi region at 5.96%, followed by the Gangwon region at 5.66% and the Chungnam region at 3.81%. As the amount of medical equipment and the number of radiological technologists increase, universities need to nurture qualified radiological technologists by building specialized knowledge and practical competency through the development of courses on understanding and using customized artificial intelligence and big data applicable to the medical radiation technology field in the era of the 4th industrial revolution. At the level of the association, active policies are thought to be needed to create new jobs and improve employment.
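
The abstract reports 2021 counts together with percentage increases over 2020, so the 2020 baselines can be approximated by reversing the percentage change. The sketch below is illustrative arithmetic only: the helper name `previous_count` is hypothetical, and because the reported percentages are rounded to two decimals, the implied 2020 figures are estimates, not values from the paper.

```python
def previous_count(current, pct_increase):
    """Back out the earlier count from the current count and a percentage increase."""
    return current / (1 + pct_increase / 100)


# (2021 count, reported % increase over 2020), as given in the abstract.
reported = {
    "C-Arm X-ray": (6638, 5.83),
    "MRI": (1811, 5.29),
    "Angiography": (725, 5.22),
    "General X-ray": (21557, 3.99),
    "CT": (2136, 3.03),
    "Breast examination": (3425, 3.00),
    "Radiological technologists": (29038, 2.73),
}

# Implied 2020 counts, rounded to whole units (approximate by construction).
implied_2020 = {
    name: round(previous_count(count, pct)) for name, (count, pct) in reported.items()
}
for name, count_2020 in implied_2020.items():
    print(f"{name}: about {count_2020} in 2020")
```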

Epidemiology of trigeminal neuralgia: an electronic population health data study in Korea

  • Lee, Cheol-Hyeong;Jang, Ho-Yeon;Won, Hyung-Sun;Kim, Ja-Sook;Kim, Yeon-Dong
    • The Korean Journal of Pain / v.34 no.3 / pp.332-338 / 2021
  • Background: Trigeminal neuralgia (TN) is one of the most painful disorders in the orofacial region, and many patients suffer from this disease. For the effective management of TN, fundamental epidemiologic data on the target population are essential. Thus, this study was performed to clarify the epidemiological characteristics of TN in the Korean population. This is the first national study to investigate the prevalence of TN in Korean patients. Methods: From 2014 to 2018, population-based medical data for 51,276,314 subscribers to the National Health Insurance Service of Korea were used for this study. Results: The incidence of TN was 100.21 per 100,000 person-years in 2018 in Korea, and the male to female ratio was 1:2.14. The age group of 51-59 years had the highest prevalence of TN. Constant increases in medical cost, regional imbalance, and differences in prescription patterns by medical specialty were observed in the management of TN. Conclusions: The results of this study will not only help to study the characteristics of TN, but also serve as an important basis for the effective management of TN in Korea.
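
To make the incidence figure concrete, the sketch below shows how a rate per 100,000 person-years relates to a case count. It assumes, and the abstract does not state this, that each of the 51,276,314 subscribers contributes one person-year in 2018. The implied case count is therefore a back-calculation for illustration, not a number reported by the authors, and the function names are hypothetical.

```python
def incidence_per_100k(new_cases, person_years):
    """Incidence rate expressed per 100,000 person-years."""
    return new_cases / person_years * 100_000


def implied_cases(rate_per_100k, person_years):
    """Number of cases implied by a rate per 100,000 person-years."""
    return rate_per_100k * person_years / 100_000


subscribers = 51_276_314  # from the abstract
reported_rate = 100.21    # per 100,000 person-years in 2018, from the abstract

# Assumption: one person-year of observation per subscriber in 2018.
cases = implied_cases(reported_rate, subscribers)

# A male-to-female ratio of 1:2.14 means women make up 2.14 of every 3.14 patients.
female_share = 2.14 / (1 + 2.14)

print(f"implied cases in 2018: about {cases:,.0f}")
print(f"female share of patients: about {female_share:.1%}")
```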