• Title/Summary/Keyword: disease prediction

Search Result 552, Processing Time 0.025 seconds

Prediction of Quantitative Traits Using Common Genetic Variants: Application to Body Mass Index

  • Bae, Sunghwan;Choi, Sungkyoung;Kim, Sung Min;Park, Taesung
    • Genomics & Informatics
    • /
    • v.14 no.4
    • /
    • pp.149-159
    • /
    • 2016
  • With the success of the genome-wide association studies (GWASs), many candidate loci for complex human diseases have been reported in the GWAS catalog. Recently, many disease prediction models based on penalized regression or statistical learning methods were proposed using candidate causal variants from significant single-nucleotide polymorphisms of GWASs. However, there have been only a few systematic studies comparing existing methods. In this study, we first constructed risk prediction models, such as stepwise linear regression (SLR), least absolute shrinkage and selection operator (LASSO), and Elastic-Net (EN), using a GWAS chip and GWAS catalog. We then compared the prediction accuracy by calculating the mean square error (MSE) value on data from the Korea Association Resource (KARE) with body mass index. Our results show that SLR provides a smaller MSE value than the other methods, while the numbers of selected variables in each model were similar.

Prediction of Intravenous Immunoglobulin Nonresponse Kawasaki Disease in Korea (한국인에서 면역글로불린-저항성 가와사키병 환자의 예측)

  • Choi, Myung Hyun;Park, Chung Soo;Kim, Dong Soo;Kim, Ki Hwan
    • Pediatric Infection and Vaccine
    • /
    • v.21 no.1
    • /
    • pp.29-36
    • /
    • 2014
  • Purpose: The objective of this study was to find the predictors and generate a prediction scoring model of nonresponse to intravenous immunoglobulin in patients with Kawasaki disease. Methods: We examined 573 children diagnosed with KD at the Severance Children's Hospital between January 2009 and december 2012. We retrospectively reviewed their medical records. These patients were divided into 2 groups; the experimental group (N=433) and the validation group (N=140). Each group were divided into 2 groups the intravenous immunoglobulin nonresponders and the responders. Multivariate logistic regression analysis identified predictive factors of intravenous immunoglobulin nonresponders which make predictive scoring model. We practice internal validation and external validation. Results: Multivariate logistic regression analysis identified male, cervical lymphadenopathy, changes of the extremities, platelet, total bilirubin, alkaline phophatase, lactate dehydrogenase, C-reactive protein as significant predictors for nonresponse to intravenous immunoglobulin. We generated prediction score assigning 1 point for (1) male, (2) cervical lymphadenopathy, (3) changes of the extremities, (4) platelet (${\leq}368,000/mm^3$), (5) total bilirubin (${\geq}0.4mg/dL$), (6) alkaline phophatase (${\geq}227IU/L$), (7) lactate dehydrogenase (${\geq}268IU/L$), (8) C-reactive protein (>77.1 mg/dL). Using a cut-off point of 4 and more with this prediction score, we could identify the intravenous immunoglobulin nonresponder group. Sensitivity and specificity were 52.5% and 82.4% in experimental group and 37.8% and 81.8% in validation group, respectively. Conclusion: Our predictive scoring models had high specificity and low sensitivity in Korean patients. Therefore it is useful in predicting nonresponse to intravenous immunoglobulin with Kawasaki disease.

Prediction Model of User Physical Activity using Data Characteristics-based Long Short-term Memory Recurrent Neural Networks

  • Kim, Joo-Chang;Chung, Kyungyong
    • KSII Transactions on Internet and Information Systems (TIIS)
    • /
    • v.13 no.4
    • /
    • pp.2060-2077
    • /
    • 2019
  • Recently, mobile healthcare services have attracted significant attention because of the emerging development and supply of diverse wearable devices. Smartwatches and health bands are the most common type of mobile-based wearable devices and their market size is increasing considerably. However, simple value comparisons based on accumulated data have revealed certain problems, such as the standardized nature of health management and the lack of personalized health management service models. The convergence of information technology (IT) and biotechnology (BT) has shifted the medical paradigm from continuous health management and disease prevention to the development of a system that can be used to provide ground-based medical services regardless of the user's location. Moreover, the IT-BT convergence has necessitated the development of lifestyle improvement models and services that utilize big data analysis and machine learning to provide mobile healthcare-based personal health management and disease prevention information. Users' health data, which are specific as they change over time, are collected by different means according to the users' lifestyle and surrounding circumstances. In this paper, we propose a prediction model of user physical activity that uses data characteristics-based long short-term memory (DC-LSTM) recurrent neural networks (RNNs). To provide personalized services, the characteristics and surrounding circumstances of data collectable from mobile host devices were considered in the selection of variables for the model. The data characteristics considered were ease of collection, which represents whether or not variables are collectable, and frequency of occurrence, which represents whether or not changes made to input values constitute significant variables in terms of activity. The variables selected for providing personalized services were activity, weather, temperature, mean daily temperature, humidity, UV, fine dust, asthma and lung disease probability index, skin disease probability index, cadence, travel distance, mean heart rate, and sleep hours. The selected variables were classified according to the data characteristics. To predict activity, an LSTM RNN was built that uses the classified variables as input data and learns the dynamic characteristics of time series data. LSTM RNNs resolve the vanishing gradient problem that occurs in existing RNNs. They are classified into three different types according to data characteristics and constructed through connections among the LSTMs. The constructed neural network learns training data and predicts user activity. To evaluate the proposed model, the root mean square error (RMSE) was used in the performance evaluation of the user physical activity prediction method for which an autoregressive integrated moving average (ARIMA) model, a convolutional neural network (CNN), and an RNN were used. The results show that the proposed DC-LSTM RNN method yields an excellent mean RMSE value of 0.616. The proposed method is used for predicting significant activity considering the surrounding circumstances and user status utilizing the existing standardized activity prediction services. It can also be used to predict user physical activity and provide personalized healthcare based on the data collectable from mobile host devices.

Cat Behavior Pattern Analysis and Disease Prediction System of Home CCTV Images using AI (AI를 이용한 홈CCTV 영상의 반려묘 행동 패턴 분석 및 질병 예측 시스템 연구)

  • Han, Su-yeon;Park, Dea-woo
    • Proceedings of the Korean Institute of Information and Commucation Sciences Conference
    • /
    • 2022.05a
    • /
    • pp.165-167
    • /
    • 2022
  • The proportion of cat cats among companion animals has been increasing at an average annual rate of 25.4% since 2012. Cats have strong wildness compared to dogs, so they have a characteristic of hiding diseases well. Therefore, when the guardian finds out that the cat has a disease, the disease may have already worsened. Symptoms such as anorexia (eating avoidance), vomiting, diarrhea, polydipsia, and polyuria in cats are some of the symptoms that appear in cat diseases such as diabetes, hyperthyroidism, renal failure, and panleukopenia. It will be of great help in treating the cat's disease if the owner can recognize the cat's polydipsia (drinking a lot of water), polyuria (a large amount of urine), and frequent urination (urinating frequently) more quickly. In this paper, 1) Efficient version of DeepLabCut for posture prediction running on an artificial intelligence server, 2) yolov4 for object detection, and 3) LSTM are used for behavior prediction. Using artificial intelligence technology, it predicts the cat's next, polyuria and frequency of urination through the analysis of the cat's behavior pattern from the home CCTV video and the weight sensor of the water bowl. And, through analysis of cat behavior patterns, we propose an application that reports disease prediction and abnormal behavior to the guardian and delivers it to the guardian's mobile and the main server system.

  • PDF

Analysis of COVID-19 Context-awareness based on Clustering Algorithm (클러스터링 알고리즘기반의 COVID-19 상황인식 분석)

  • Lee, Kangwhan
    • Journal of the Korea Institute of Information and Communication Engineering
    • /
    • v.26 no.5
    • /
    • pp.755-762
    • /
    • 2022
  • This paper propose a clustered algorithm that possible more efficient COVID-19 disease learning prediction within clustering using context-aware attribute information. In typically, clustering of COVID-19 diseases provides to classify interrelationships within disease cluster information in the clustering process. The clustering data will be as a degrade factor if new or newly processing information during treated as contaminated factors in comparative interrelationships information. In this paper, we have shown the solving the problems and developed a clustering algorithm that can extracting disease correlation information in using K-means algorithm. According to their attributes from disease clusters using accumulated information and interrelationships clustering, the proposed algorithm analyzes the disease correlation clustering possible and centering points. The proposed algorithm showed improved adaptability to prediction accuracy of the classification management system in terms of learning as a group of multiple disease attribute information of COVID-19 through the applied simulation results.

Influenza prediction models by using meteorological and social media informations (기상 및 소셜미디어 정보를 활용한 인플루엔자 예측모형)

  • Hwang, Eun-Ji;Na, Jong-Hwa
    • Journal of the Korean Data and Information Science Society
    • /
    • v.26 no.5
    • /
    • pp.1087-1095
    • /
    • 2015
  • Influenza, commonly known as "the flu", is an infectious disease caused by the influenza virus. We consider, in this paper, regression models as a prediction model of influenza disease. While most of previous researches use mainly the meteorological variables as a predictive variables, we consider social media information in the models. As a result, we found that the contributions of two-type of informations are comparable. We used the medical treatment data of influenza provided by Natioal Health Insurance Survice (NHIS) and the meteorological data provided by Korea Meteorological Administration (KMA). We collect social media information (twitter buzz amount) from Twitter. Time series model is also considered for comparison.

Development of a Diabetic Foot Ulceration Prediction Model and Nomogram (당뇨병성 발궤양 발생 위험 예측모형과 노모그램 개발)

  • Lee, Eun Joo;Jeong, Ihn Sook;Woo, Seung Hun;Jung, Hyuk Jae;Han, Eun Jin;Kang, Chang Wan;Hyun, Sookyung
    • Journal of Korean Academy of Nursing
    • /
    • v.51 no.3
    • /
    • pp.280-293
    • /
    • 2021
  • Purpose: This study aimed to identify the risk factors for diabetic foot ulceration (DFU) to develop and evaluate the performance of a DFU prediction model and nomogram among people with diabetes mellitus (DM). Methods: This unmatched case-control study was conducted with 379 adult patients (118 patients with DM and 261 controls) from four general hospitals in South Korea. Data were collected through a structured questionnaire, foot examination, and review of patients' electronic health records. Multiple logistic regression analysis was performed to build the DFU prediction model and nomogram. Further, their performance was analyzed using the Lemeshow-Hosmer test, concordance statistic (C-statistic), and sensitivity/specificity analyses in training and test samples. Results: The prediction model was based on risk factors including previous foot ulcer or amputation, peripheral vascular disease, peripheral neuropathy, current smoking, and chronic kidney disease. The calibration of the DFU nomogram was appropriate (χ2 = 5.85, p = .321). The C-statistic of the DFU nomogram was .95 (95% confidence interval .93~.97) for both the training and test samples. For clinical usefulness, the sensitivity and specificity obtained were 88.5% and 85.7%, respectively at 110 points in the training sample. The performance of the nomogram was better in male patients or those having DM for more than 10 years. Conclusion: The nomogram of the DFU prediction model shows good performance, and is thereby recommended for monitoring the risk of DFU and preventing the occurrence of DFU in people with DM.

Diamox-enhanced Brain SPECT in Cerebrovascular Diseases (뇌혈관질환에서 다이아목스부하 뇌 단일광자방출 전산화단층촬영)

  • Choi, Yun-Young
    • Nuclear Medicine and Molecular Imaging
    • /
    • v.41 no.2
    • /
    • pp.85-90
    • /
    • 2007
  • Acute event in cerebrovascular disease is the second most common cause of death in Korea following cancer, and it can also cause serious neurologic deficits. Understanding of perfusion status is important for clinical applications in management of patients with cerebrovascular diseases, and then the attacks of ischemic neurologic symptoms and the risk of acute events can be reduced. Therefore, the normal vascular anatomy of brain, various clinical applications of acetazolamide-enhanced brain perfusion SPECT, including meaning and role of assessment of vascular reserve in carotid stenosis before procedure, in pediatric Moyamoya disease before and after operation, in prediction of development of hyperperfusion syndrome before procedure, and in prediction of vasospasm and of prognosis in subarachnoid hemorrahge were reviewed in this paper.

Review of Biological Network Data and Its Applications

  • Yu, Donghyeon;Kim, MinSoo;Xiao, Guanghua;Hwang, Tae Hyun
    • Genomics & Informatics
    • /
    • v.11 no.4
    • /
    • pp.200-210
    • /
    • 2013
  • Studying biological networks, such as protein-protein interactions, is key to understanding complex biological activities. Various types of large-scale biological datasets have been collected and analyzed with high-throughput technologies, including DNA microarray, next-generation sequencing, and the two-hybrid screening system, for this purpose. In this review, we focus on network-based approaches that help in understanding biological systems and identifying biological functions. Accordingly, this paper covers two major topics in network biology: reconstruction of gene regulatory networks and network-based applications, including protein function prediction, disease gene prioritization, and network-based genome-wide association study.

Animal Infectious Diseases Prevention through Big Data and Deep Learning (빅데이터와 딥러닝을 활용한 동물 감염병 확산 차단)

  • Kim, Sung Hyun;Choi, Joon Ki;Kim, Jae Seok;Jang, Ah Reum;Lee, Jae Ho;Cha, Kyung Jin;Lee, Sang Won
    • Journal of Intelligence and Information Systems
    • /
    • v.24 no.4
    • /
    • pp.137-154
    • /
    • 2018
  • Animal infectious diseases, such as avian influenza and foot and mouth disease, occur almost every year and cause huge economic and social damage to the country. In order to prevent this, the anti-quarantine authorities have tried various human and material endeavors, but the infectious diseases have continued to occur. Avian influenza is known to be developed in 1878 and it rose as a national issue due to its high lethality. Food and mouth disease is considered as most critical animal infectious disease internationally. In a nation where this disease has not been spread, food and mouth disease is recognized as economic disease or political disease because it restricts international trade by making it complex to import processed and non-processed live stock, and also quarantine is costly. In a society where whole nation is connected by zone of life, there is no way to prevent the spread of infectious disease fully. Hence, there is a need to be aware of occurrence of the disease and to take action before it is distributed. Epidemiological investigation on definite diagnosis target is implemented and measures are taken to prevent the spread of disease according to the investigation results, simultaneously with the confirmation of both human infectious disease and animal infectious disease. The foundation of epidemiological investigation is figuring out to where one has been, and whom he or she has met. In a data perspective, this can be defined as an action taken to predict the cause of disease outbreak, outbreak location, and future infection, by collecting and analyzing geographic data and relation data. Recently, an attempt has been made to develop a prediction model of infectious disease by using Big Data and deep learning technology, but there is no active research on model building studies and case reports. KT and the Ministry of Science and ICT have been carrying out big data projects since 2014 as part of national R &D projects to analyze and predict the route of livestock related vehicles. To prevent animal infectious diseases, the researchers first developed a prediction model based on a regression analysis using vehicle movement data. After that, more accurate prediction model was constructed using machine learning algorithms such as Logistic Regression, Lasso, Support Vector Machine and Random Forest. In particular, the prediction model for 2017 added the risk of diffusion to the facilities, and the performance of the model was improved by considering the hyper-parameters of the modeling in various ways. Confusion Matrix and ROC Curve show that the model constructed in 2017 is superior to the machine learning model. The difference between the2016 model and the 2017 model is that visiting information on facilities such as feed factory and slaughter house, and information on bird livestock, which was limited to chicken and duck but now expanded to goose and quail, has been used for analysis in the later model. In addition, an explanation of the results was added to help the authorities in making decisions and to establish a basis for persuading stakeholders in 2017. This study reports an animal infectious disease prevention system which is constructed on the basis of hazardous vehicle movement, farm and environment Big Data. The significance of this study is that it describes the evolution process of the prediction model using Big Data which is used in the field and the model is expected to be more complete if the form of viruses is put into consideration. This will contribute to data utilization and analysis model development in related field. In addition, we expect that the system constructed in this study will provide more preventive and effective prevention.