• Title/Summary/Keyword: Statistical classification

Search Result 1,419, Processing Time 0.028 seconds

Projection of Cancer Incidence and Mortality From 2020 to 2035 in the Korean Population Aged 20 Years and Older

  • Youjin, Hong;Sangjun, Lee;Sungji, Moon;Soseul, Sung;Woojin, Lim;Kyungsik, Kim;Seokyung, An;Jeoungbin, Choi;Kwang-Pil, Ko;Inah, Kim;Jung Eun, Lee;Sue K., Park
    • Journal of Preventive Medicine and Public Health
    • /
    • v.55 no.6
    • /
    • pp.529-538
    • /
    • 2022
  • Objectives: This study aimed to identify the current patterns of cancer incidence and estimate the projected cancer incidence and mortality between 2020 and 2035 in Korea. Methods: Data on cancer incidence cases were extracted from the Korean Statistical Information Service from 2000 to 2017, and data on cancer-related deaths were extracted from the National Cancer Center from 2000 to 2018. Cancer cases and deaths were classified according to the International Classification of Diseases, 10th edition. For the current patterns of cancer incidence, age-standardized incidence rates (ASIRs) and age-standardized mortality rates were investigated using the 2000 mid-year estimated population aged over 20 years and older. A joinpoint regression model was used to determine the 2020 to 2035 trends in cancer. Results: Overall, cancer cases were predicted to increase from 265 299 in 2020 to 474 085 in 2035 (growth rate: 1.8%). The greatest increase in the ASIR was projected for prostate cancer among male (7.84 vs. 189.53 per 100 000 people) and breast cancer among female (34.17 vs. 238.45 per 100 000 people) from 2000 to 2035. Overall cancer deaths were projected to increase from 81 717 in 2020 to 95 845 in 2035 (average annual growth rate: 1.2%). Although most cancer mortality rates were projected to decrease, those of breast, pancreatic, and ovarian cancer among female were projected to increase until 2035. Conclusions: These up-to-date projections of cancer incidence and mortality in the Korean population may be a significant resource for implementing cancer-related regulations or developing cancer treatments.

Wafer bin map failure pattern recognition using hierarchical clustering (계층적 군집분석을 이용한 반도체 웨이퍼의 불량 및 불량 패턴 탐지)

  • Jeong, Joowon;Jung, Yoonsuh
    • The Korean Journal of Applied Statistics
    • /
    • v.35 no.3
    • /
    • pp.407-419
    • /
    • 2022
  • The semiconductor fabrication process is complex and time-consuming. There are sometimes errors in the process, which results in defective die on the wafer bin map (WBM). We can detect the faulty WBM by finding some patterns caused by dies. When one manually seeks the failure on WBM, it takes a long time due to the enormous number of WBMs. We suggest a two-step approach to discover the probable pattern on the WBMs in this paper. The first step is to separate the normal WBMs from the defective WBMs. We adapt a hierarchical clustering for de-noising, which nicely performs this work by wisely tuning the number of minimum points and the cutting height. Once declared as a faulty WBM, then it moves to the next step. In the second step, we classify the patterns among the defective WBMs. For this purpose, we extract features from the WBM. Then machine learning algorithm classifies the pattern. We use a real WBM data set (WM-811K) released by Taiwan semiconductor manufacturing company.

Spatial modeling of mortality from acute lower respiratory infections in children under 5 years of age in 2000-2017: a global study

  • Almasi, Ali;Reshadat, Sohyla;Zangeneh, Alireza;Khezeli, Mehdi;Teimouri, Raziyeh;Naderi, Samira Rahimi;Saeidi, Shahram
    • Clinical and Experimental Pediatrics
    • /
    • v.64 no.12
    • /
    • pp.632-641
    • /
    • 2021
  • Background: Over the past few decades, various goals have been defined to reduce the mortality of children caused by acute lower respiratory infections (ALRIs) worldwide. However, few spatial studies to date have reported on ALRI deaths. Purpose: We aimed to assess the spatial modeling of mortality from ALRI in children under 5 years of age during 2000-2017 using a global data. Methods: The data on the mortality of children under 5 years old caused by ALRI were initially obtained from the official website of the World Health Organization. The income status of their home countries was also gathered from the Country Income Groups (World Bank Classification) website and divided into 5 categories. After that, in the ArcGIS 10.6 environment, a database was created and the statistical tests and related maps were extracted. The Global Moran's I statistic, Getis-Ord Gi statistic, and geographically weighted regression were used for the analyses. In this study, higher z scores indicated the hot spots, while lower z scores indicated the cold spots. Results: In 2000-2017, child mortality showed a downward trend from 17.6 per 100,000 children to 8.1 and had a clustered pattern. Hot spots were concentrated in Asia in 2000 but shifted toward African countries by 2017. A cold spot that formed in Europe in 2007 showed an ascending trend by 2017. Based on the results of geographically weighted regression test, the regions identified as the hot spots of mortality from ALRI in children under 5 years old were among the middle-income countries (R2=0.01, adjusted R2=8.77). Conclusion: While the total number of child deaths in 2000-2017 has decreased, the number of hot spots has increased among countries. This study also concluded that, during the study period, Central and Western Africa countries became the main new hot spots of deaths from ALRI.

BEEF MEAT TRACEABILITY. CAN NIRS COULD HELP\ulcorner

  • Cozzolino, D.
    • Proceedings of the Korean Society of Near Infrared Spectroscopy Conference
    • /
    • 2001.06a
    • /
    • pp.1246-1246
    • /
    • 2001
  • The quality of meat is highly variable in many properties. This variability originates from both animal production and meat processing. At the pre-slaughter stage, animal factors such as breed, sex, age contribute to this variability. Environmental factors include feeding, rearing, transport and conditions just before slaughter (Hildrum et al., 1995). Meat can be presented in a variety of forms, each offering different opportunities for adulteration and contamination. This has imposed great pressure on the food manufacturing industry to guarantee the safety of meat. Tissue and muscle speciation of flesh foods, as well as speciation of animal derived by-products fed to all classes of domestic animals, are now perhaps the most important uncertainty which the food industry must resolve to allay consumer concern. Recently, there is a demand for rapid and low cost methods of direct quality measurements in both food and food ingredients (including high performance liquid chromatography (HPLC), thin layer chromatography (TLC), enzymatic and inmunological tests (e.g. ELISA test) and physical tests) to establish their authenticity and hence guarantee the quality of products manufactured for consumers (Holland et al., 1998). The use of Near Infrared Reflectance Spectroscopy (NIRS) for the rapid, precise and non-destructive analysis of a wide range of organic materials has been comprehensively documented (Osborne et at., 1993). Most of the established methods have involved the development of NIRS calibrations for the quantitative prediction of composition in meat (Ben-Gera and Norris, 1968; Lanza, 1983; Clark and Short, 1994). This was a rational strategy to pursue during the initial stages of its application, given the type of equipment available, the state of development of the emerging discipline of chemometrics and the overwhelming commercial interest in solving such problems (Downey, 1994). One of the advantages of NIRS technology is not only to assess chemical structures through the analysis of the molecular bonds in the near infrared spectrum, but also to build an optical model characteristic of the sample which behaves like the “finger print” of the sample. This opens the possibility of using spectra to determine complex attributes of organic structures, which are related to molecular chromophores, organoleptic scores and sensory characteristics (Hildrum et al., 1994, 1995; Park et al., 1998). In addition, the application of statistical packages like principal component or discriminant analysis provides the possibility to understand the optical properties of the sample and make a classification without the chemical information. The objectives of this present work were: (1) to examine two methods of sample presentation to the instrument (intact and minced) and (2) to explore the use of principal component analysis (PCA) and Soft Independent Modelling of class Analogy (SIMCA) to classify muscles by quality attributes. Seventy-eight (n: 78) beef muscles (m. longissimus dorsi) from Hereford breed of cattle were used. The samples were scanned in a NIRS monochromator instrument (NIR Systems 6500, Silver Spring, MD, USA) in reflectance mode (log 1/R). Both intact and minced presentation to the instrument were explored. Qualitative analysis of optical information through PCA and SIMCA analysis showed differences in muscles resulting from two different feeding systems.

  • PDF

Study on Countermeasures Against Increasing New Drugs (신종 마약류 증가에 따른 대응방안)

  • Jaehun Shin
    • Journal of the Society of Disaster Information
    • /
    • v.19 no.2
    • /
    • pp.270-279
    • /
    • 2023
  • Purpose: The purpose of this study is to examine the new drugs that recently have shown rapid increase and provide solutions to eradicate them. Method: This study used the relevant preceding studies, statistics, and overseas materials to identify the new drug problems and suggest solutions. Result: Compared to the past, the numbers of criminals detected for the administration, distribution, and production of drugs are rapidly increasing. According to the statistical data on drugs in 2021, the number of drug-related cases decreased compared to the previous year. However, there are concerns because the amount of detected drugs increased more than three times, and the age group of drugrelated criminals are getting younger. Such results are largely affected by the spread of new drugs. In particular, it is deemed to be affected by the spread of new drugs, such as fentanyl, yaba, khat, kratom, etc., as well as the new psychoactive drugs and hemp-related materials. Conclusion: In response to spread of new drugs, this study suggests simplifying the temporary classification of drugs, enforcing control of foreign drug users, strengthening the cooperation with relevant institutions, such as Korea Customs Service and the Ministry of Food and Drug Safety, and intensifying the punishment on the drug users in order to strengthen the countermeasure against the new drugs.

Comparison of Machine Learning Models to Predict the Occurrence of Ground Subsidence According to the Characteristics of Sewer (하수관로 특성에 따른 지반함몰 발생 예측을 위한 기계학습 모델 비교)

  • Lee, Sungyeol;Kim, Jinyoung;Kang, Jaemo;Baek, Wonjin
    • Journal of the Korean GEO-environmental Society
    • /
    • v.23 no.4
    • /
    • pp.5-10
    • /
    • 2022
  • Recently, ground subsidence has been continuously occurring in downtown areas, threatening the safety of citizens. Various underground facilities such as water and sewage pipelines and communication pipelines are buried under the road. It is reported that the cause of ground subsidence is the deterioration of various facilities and the reckless development of the underground. In particular, it is known that the biggest cause of ground subsidence is the aging of sewage pipelines. As an existing study related to this, several representative factors of sewage pipelines were selected and a study to predict the risk of ground subsidence through statistical analysis has been conducted. In this study, a data SET was constructed using the characteristics of OO city's sewage pipe characteristics and ground subsidence data, The data set constructed from the characteristics of the sewage pipe of OO city and the location of the ground subsidence was used. The goal of this study was to present a classification model for the occurrence of ground subsidence according to the characteristics of sewage pipes through machine learning. In addition, the importance of each sewage pipe characteristic affecting the ground subsidence was calculated.

A Study on Efficient AI Model Drift Detection Methods for MLOps (MLOps를 위한 효율적인 AI 모델 드리프트 탐지방안 연구)

  • Ye-eun Lee;Tae-jin Lee
    • Journal of Internet Computing and Services
    • /
    • v.24 no.5
    • /
    • pp.17-27
    • /
    • 2023
  • Today, as AI (Artificial Intelligence) technology develops and its practicality increases, it is widely used in various application fields in real life. At this time, the AI model is basically learned based on various statistical properties of the learning data and then distributed to the system, but unexpected changes in the data in a rapidly changing data situation cause a decrease in the model's performance. In particular, as it becomes important to find drift signals of deployed models in order to respond to new and unknown attacks that are constantly created in the security field, the need for lifecycle management of the entire model is gradually emerging. In general, it can be detected through performance changes in the model's accuracy and error rate (loss), but there are limitations in the usage environment in that an actual label for the model prediction result is required, and the detection of the point where the actual drift occurs is uncertain. there is. This is because the model's error rate is greatly influenced by various external environmental factors, model selection and parameter settings, and new input data, so it is necessary to precisely determine when actual drift in the data occurs based only on the corresponding value. There are limits to this. Therefore, this paper proposes a method to detect when actual drift occurs through an Anomaly analysis technique based on XAI (eXplainable Artificial Intelligence). As a result of testing a classification model that detects DGA (Domain Generation Algorithm), anomaly scores were extracted through the SHAP(Shapley Additive exPlanations) Value of the data after distribution, and as a result, it was confirmed that efficient drift point detection was possible.

[Retracted]Analysis of Slope Safety by Tension Wire Data ([논문철회]지표변위계를 활용한 비탈면 안정성 예측)

  • Lee, Seokyoung;Jang, Seoyong;Kim, Taesoo;Han, Heuisoo
    • Journal of the Korean GEO-environmental Society
    • /
    • v.16 no.4
    • /
    • pp.5-12
    • /
    • 2015
  • Civil engineers have taken the numerous slope monitoring data for an engineering project subjected to hazard potential of slide. However, the topics on how to deal with and draw out proper information from the data related to the slope behavior have not been widely discussed. Recently, several researchers had installed the real-time monitoring system to cope with slope failure; however they are mainly focused on the hardware system installation. Therefore, this study tries to show how the measured data could be grouped and connected each other. The basic idea of analyzing method studied in this paper came from the clustering, which is the part of data mining analysis. Therefore, at the base of classification of time series data, the authors suggest three mathematical data analyzing methods; Average Index of different displacement ($AD_{i,j}$), Difference of average relative displacement ($\overline{RD}_{i,j}$) and Coordinate system of average and relative displacement ($\overline{RD}$, AD). These analyzing methods are based on the statistical method and failure mechanism of slope. Therefore they showed clustering relationships of the similar parts of the slope which makes the same sliding mechanism.

A Review of Korean Medicine Treatment for Managing the Thoracolumbar Compression Fractures: A Retrospective Observational Study (흉요추 압박골절 치료에 대한 한의복합치료 고찰: 후향적 관찰 연구)

  • Min-Jin Cho;Jiyun Lee;Myeong-Jong Lee;Hojun Kim;Kyungsun Han
    • Journal of Korean Medicine Rehabilitation
    • /
    • v.33 no.4
    • /
    • pp.109-124
    • /
    • 2023
  • Objectives This study aims to find out effect of Korean medicine treatment on managing thoracolumbar compression fractures through retrospective observational study. Methods Among hospitalized patients at the Department of Korean Medicine Rehabilitation from January 1st, 2018 to February 28th, 2023, a total of 24 inpatients who were diagnosed with thoracolumbar compression fractures and received Korean medicine treatment were included in this study. Numeric rating scale (NRS) was used for pain assessment and clinical variables such as sex, symptoms, age, thoracolumbar injury classification and severity (TLICS) scores were collected. For subgroup analysis to analyze factors affecting treatment response, we divided patients into responders and non-responders according to NRS change. For statistical analysis, we compared before/after hospitalization and analyzed distinct features between two groups. Results Most of the patients were in their 70s and 83.33% were female. Average hospitalization period was 24.54±11.91 days. All patients had back pain as their chief complaint and only 2 patients received surgeries. In TLICS, only 1 patient got score 6, which represented surgery indication. After Korean medicine treatment, NRS of almost every patient got lower significantly at the time of discharge (3.02±1.93) than admission (5.52±1.95). Comparing two groups, responders had lower NRS at the time of discharge and TLICS score of them were lower than non-responders. Conclusions Our results show that Korean medicine treatment for thoracolumbar compression fractures was effective in reducing pain. There were distinct clinical features such as age, past history, surgeries between those with significant improvement in pain scores and those who did not.

Natural variation of functional components between Korean maize types (국내 옥수수 품종에 따른 기능성 성분의 자연 변이 분석)

  • Jung-Won Jung;Myeong-Ji Kim;Imran Muhammad;Eun-Ha Kim;Soo-Yun Park;Tae-Young Oh;Young-Sam Go;Moon-Jong Kim;Sang-Gu Lee;Seonwoo Oh;Hyoun-Min Park
    • Journal of Applied Biological Chemistry
    • /
    • v.66
    • /
    • pp.484-491
    • /
    • 2023
  • Maize is one of the major crops consumed in worldwide, which nutrients accounts for a large amount of starch, but also functional components, and phenolic acid is known to have a high content. Maize is divided into waxy maize, sweet maize, and normal maize with its shape and use, therefore there is also a difference in nutritional composition. This study was conducted to analyze the content of functional components according to the type of maize and to produce natural variation data in consideration of environmental factors. 3 shapes of maize (waxy maize, sweet maize, and normal maize) samples cultivated in 3 regions (Suwon, Daegu, and Hongcheon) were analyzed using HPLC and GC-TOF-MS. Comparing with type through ANOVA, multivariate statistical analysis, Pearson correlation analysis, 28 components, including carotenoids and tocopherols, showed significant differences among a total of 32 components (p <0.05), 15 of them showed very significant differences (p <0.001). When comparing with regions, 15 components showed significant differences and only vanillate, syringate, C23-ol of them showed most significant differences (p <0.001). As a result of principal component analysis, cluster classification was distinguished by shape than by region, with α-carotene, cholesterol for waxy maize, vanillate and stigmasterol for sweet maize, lutein and β-carotene for normal maize had a great effect on cluster formation. It suggests that the content of functional components is more affected by genetic factors than environmental factors.