• Title/Summary/Keyword: random data analysis

Search Result 1,741, Processing Time 0.04 seconds

Development and Verification of and Single Nucleotide Polymorphism Markers toDetermine Country of Origin of Korean and Chinese Scapharca subcrenata (한국산과 중국산 새꼬막(Scapharca subcrenata)의 원산지 판별을 위한 SNP 마커의 개발 및 검증)

  • Seong Seok Choi;Seung Hyun Yoo;Yong Bae Seo;Jong Oh Kim;Ik Jung Kwon;So Hee Bae;Gun Do Kim
    • Journal of Life Science
    • /
    • v.33 no.12
    • /
    • pp.1025-1035
    • /
    • 2023
  • In this study, we analyzed SNPs that appear between Korean and Chinese Scapharca subcrenata using the nucleotide sequence data of S. subcrenata analyzed by genotyping by sequencing (GBS). To distinguish the country of origin for S. subcrenata in Korean and Chinese, we developed a primer set as single nucleotide polymorphism (SNP) markers for quantitative real-time PCR (qPCR) analysis and validated by sequencing SNPs. A total of 180 samples of S. subcrenata were analyzed by genotyping by sequencing, and 15 candidate SNPs were selected. SNP marker selection for country of origin were identified through real-time qPCR. Insertion 1 and SNP 21 markers showed the most distinct separation between the sequence types as well as the country of origin through qPCR, with the observed amplification patterns matching the expected outcomes.. Additionally, in a blind test conducted by mixing samples of S. subcrenata at random, Insertion 1 showed 74% accuracy, 52% sensitivity, and 96% specificity, and SNP 21 showed 86% accuracy, 79% sensitivity, and 93% specificity. Therefore, the two SNP markers developed are expected to be useful in verifying the authenticity of the country of origin of S. subcrenata when used independently or in combination.

Trends of Dental Caries Prevalence in Children Under 14-Year-Old Using a Health Insurance Database (건강보험 데이터를 이용한 14세 이하 소아청소년의 치아 우식 유병률 경향성)

  • Seongeun Mo;Jaegon Kim;Daewoo Lee;Yeonmi Yang
    • Journal of the korean academy of Pediatric Dentistry
    • /
    • v.50 no.4
    • /
    • pp.409-420
    • /
    • 2023
  • The purpose of this study is to analyze trends in the prevalence of dental caries and demand for dental caries treatment among children under 14 years old using Health Insurance Review and Assessment data. The analysis was conducted using treatment records from a random sample of approximately 1 million pediatric patients from a population that included all children and adolescents for each year from 2011 to 2020. In this study, the number of children diagnosed with K02 dental caries and the number of children receiving dental caries treatment across all ages have increased. However, the number of children aged 10 to 14 who received pulp treatment or extraction has decreased. In the National Survey of Children's Oral Health, the decay-missing-filled teeth index for 5- and 12-year-olds has stagnated or increased slightly, but the percentage of the population with active dental caries has decreased. Accessibility and local environments for dental caries treatment have generally improved compared to the past, but preventive dental care has stagnated over the past decade. Therefore, it is necessary to evaluate the effectiveness of oral health programs implemented in Korea to promote and prevent dental caries among children.

Retrieval of Hourly Aerosol Optical Depth Using Top-of-Atmosphere Reflectance from GOCI-II and Machine Learning over South Korea (GOCI-II 대기상한 반사도와 기계학습을 이용한 남한 지역 시간별 에어로졸 광학 두께 산출)

  • Seyoung Yang;Hyunyoung Choi;Jungho Im
    • Korean Journal of Remote Sensing
    • /
    • v.39 no.5_3
    • /
    • pp.933-948
    • /
    • 2023
  • Atmospheric aerosols not only have adverse effects on human health but also exert direct and indirect impacts on the climate system. Consequently, it is imperative to comprehend the characteristics and spatiotemporal distribution of aerosols. Numerous research endeavors have been undertaken to monitor aerosols, predominantly through the retrieval of aerosol optical depth (AOD) via satellite-based observations. Nonetheless, this approach primarily relies on a look-up table-based inversion algorithm, characterized by computationally intensive operations and associated uncertainties. In this study, a novel high-resolution AOD direct retrieval algorithm, leveraging machine learning, was developed using top-of-atmosphere reflectance data derived from the Geostationary Ocean Color Imager-II (GOCI-II), in conjunction with their differences from the past 30-day minimum reflectance, and meteorological variables from numerical models. The Light Gradient Boosting Machine (LGBM) technique was harnessed, and the resultant estimates underwent rigorous validation encompassing random, temporal, and spatial N-fold cross-validation (CV) using ground-based observation data from Aerosol Robotic Network (AERONET) AOD. The three CV results consistently demonstrated robust performance, yielding R2=0.70-0.80, RMSE=0.08-0.09, and within the expected error (EE) of 75.2-85.1%. The Shapley Additive exPlanations(SHAP) analysis confirmed the substantial influence of reflectance-related variables on AOD estimation. A comprehensive examination of the spatiotemporal distribution of AOD in Seoul and Ulsan revealed that the developed LGBM model yielded results that are in close concordance with AERONET AOD over time, thereby confirming its suitability for AOD retrieval at high spatiotemporal resolution (i.e., hourly, 250 m). Furthermore, upon comparing data coverage, it was ascertained that the LGBM model enhanced data retrieval frequency by approximately 8.8% in comparison to the GOCI-II L2 AOD products, ameliorating issues associated with excessive masking over very illuminated surfaces that are often encountered in physics-based AOD retrieval processes.

Studies on Genetic Diversity and Phylogenetic Relationships of Korean Native Chicken using the Microsatellite Marker (Microsatellite Marker를 활용한 한국 토종닭 품종의 유전적 다양성 및 유연관계 분석)

  • Seo, Joo Hee;Oh, Jea-Don;Lee, Jun-Heon;Seo, Dongwon;Kong, Hong Sik
    • Korean Journal of Poultry Science
    • /
    • v.42 no.1
    • /
    • pp.15-26
    • /
    • 2015
  • In this study, genotyping was executed by using 27 microsatellite markers for genetic diversity of 469 Korean Native Chickens [20 population, each population is 24 samples but Hanhyup A line is 13 samples). in total 469 samples were collected from National Institute of Animal Science (Korean Native Chicken (NR, NY, NG, NL and NW), Ogye (NO), Leghorn F,K (NF and NK), Black and Brown cormish (NH and NS), Rhode Island Red C, D (NC and ND), Total is 12 populations] and Hanhyup [H line (HH), F line (HF), G line (HG), V line (HV), S line (HS), W line (HW), Y line (HY), A line (HA), total is 8 populations]. [The allele number were observed 5 (ADL0268) to 20 (MCW0127) each markers. Observed heterozygostiy ($H_{obs}$), expected heterozygosity ($H_{exp}$), polymorphism Information Content (PIC) were observed 0.359 to 0.677, 0.668 to 0.881 and 0.646 to 0.869, respectively. Using these markers, the calculated the heterozygote deficit within chicken line ($F_{is}$) value each population from mean 0.117. Phylogenetic tree showing the genetic relationship among 20 population using standard genetic distance calculated from 27 microsatellite markers. genetic distances revealed the closest (0.175) between NC and ND. on the other hand, Farthest genetic distances (0.710) revealed between NF and HV. STRUCTURE analysis and Principal Components Analysis (PCA) showed that results of similar phylogenetic tree. The expected probability of identity values on random individuals (Total population and only Hanhyup line) was estimated at $8.80{\times}10^{-83}$ and $3.87{\times}10^{-117}$, respectively. In conclusion, This study shows the useful data that be utilized as a basic data of Korean Native Chicken breeding and development for commercial chicken industry to meet the consumer's demand.

The Effect of Objective and Subjective Social Isolation and Interpersonal Conflict Type on the Probability of Cognitive Impairment by Age Group in Old Age (노년기 연령집단별 객관적·주관적 사회적 고립과 대인관계갈등 유형이 인지기능에 미치는 영향)

  • Lee, Sang Chul
    • 한국노년학
    • /
    • v.38 no.4
    • /
    • pp.811-835
    • /
    • 2018
  • Social relations and cognitive function in old age are closely related to each other, and social relation is classified into structural characteristics and qualitative characteristics reflecting cognitive and emotional evaluation. The concept of social isolation is the focus of attention in relation to the social relations of old age. Social isolation has a multidimensional theoretical structure that is divided into objective dimension such as social network, type of furniture, social participation, and subjective dimension such as lack of perceived social support and loneliness. There is also a close relationship between cognitive function and interpersonal conflict in old age. In this study, we examined the effect of subjective social isolation, which shows the structural characteristics of social relations, and subjective social isolation and interpersonal conflict on the dementia occurrence by age group in the elderly. The data were analyzed by applying a random effect panel logit model using 1,740 panel data from the first year to the third year of KSHAP. The results of the analysis are summarized as follows. First, the cognitive impairment increased sharply with age. Objective and subjective social isolation were both U-shaped distribution with an inflection point of 80 years old. Second, the main effect on the probability of cognitive impairment was statistically significant with objective and subjective social isolation, but the type of interpersonal conflict did not appear to be significant. Third, the results of two-way interaction effect analysis on the probability of cognitive impairment are as follows. The relationship between subjective social isolation and the probability of occurrence of cognitive impairment was significantly different according to the level of conflict with spouse. In addition, the higher the subjective social isolation, the higher the probability of cognitive impairment in the elderly(over 85) than in the young-old(65~74). In addition, as the level of conflict with spouses increases, the probability of cognitive impairment of the oldest-old(aged 85 or older) is drastically lower than that of the young-old(aged 65~74). Based on the results of this study, policy and practical implications for reducing the cognitive impairment of the elderly age group were suggested, and limitations of the study and suggestions for future research were discussed.

Label Embedding for Improving Classification Accuracy UsingAutoEncoderwithSkip-Connections (다중 레이블 분류의 정확도 향상을 위한 스킵 연결 오토인코더 기반 레이블 임베딩 방법론)

  • Kim, Museong;Kim, Namgyu
    • Journal of Intelligence and Information Systems
    • /
    • v.27 no.3
    • /
    • pp.175-197
    • /
    • 2021
  • Recently, with the development of deep learning technology, research on unstructured data analysis is being actively conducted, and it is showing remarkable results in various fields such as classification, summary, and generation. Among various text analysis fields, text classification is the most widely used technology in academia and industry. Text classification includes binary class classification with one label among two classes, multi-class classification with one label among several classes, and multi-label classification with multiple labels among several classes. In particular, multi-label classification requires a different training method from binary class classification and multi-class classification because of the characteristic of having multiple labels. In addition, since the number of labels to be predicted increases as the number of labels and classes increases, there is a limitation in that performance improvement is difficult due to an increase in prediction difficulty. To overcome these limitations, (i) compressing the initially given high-dimensional label space into a low-dimensional latent label space, (ii) after performing training to predict the compressed label, (iii) restoring the predicted label to the high-dimensional original label space, research on label embedding is being actively conducted. Typical label embedding techniques include Principal Label Space Transformation (PLST), Multi-Label Classification via Boolean Matrix Decomposition (MLC-BMaD), and Bayesian Multi-Label Compressed Sensing (BML-CS). However, since these techniques consider only the linear relationship between labels or compress the labels by random transformation, it is difficult to understand the non-linear relationship between labels, so there is a limitation in that it is not possible to create a latent label space sufficiently containing the information of the original label. Recently, there have been increasing attempts to improve performance by applying deep learning technology to label embedding. Label embedding using an autoencoder, a deep learning model that is effective for data compression and restoration, is representative. However, the traditional autoencoder-based label embedding has a limitation in that a large amount of information loss occurs when compressing a high-dimensional label space having a myriad of classes into a low-dimensional latent label space. This can be found in the gradient loss problem that occurs in the backpropagation process of learning. To solve this problem, skip connection was devised, and by adding the input of the layer to the output to prevent gradient loss during backpropagation, efficient learning is possible even when the layer is deep. Skip connection is mainly used for image feature extraction in convolutional neural networks, but studies using skip connection in autoencoder or label embedding process are still lacking. Therefore, in this study, we propose an autoencoder-based label embedding methodology in which skip connections are added to each of the encoder and decoder to form a low-dimensional latent label space that reflects the information of the high-dimensional label space well. In addition, the proposed methodology was applied to actual paper keywords to derive the high-dimensional keyword label space and the low-dimensional latent label space. Using this, we conducted an experiment to predict the compressed keyword vector existing in the latent label space from the paper abstract and to evaluate the multi-label classification by restoring the predicted keyword vector back to the original label space. As a result, the accuracy, precision, recall, and F1 score used as performance indicators showed far superior performance in multi-label classification based on the proposed methodology compared to traditional multi-label classification methods. This can be seen that the low-dimensional latent label space derived through the proposed methodology well reflected the information of the high-dimensional label space, which ultimately led to the improvement of the performance of the multi-label classification itself. In addition, the utility of the proposed methodology was identified by comparing the performance of the proposed methodology according to the domain characteristics and the number of dimensions of the latent label space.

A Study on Clinical Variables Contributing to Differentiation of Delirium and Non-Delirium Patients in the ICU (중환자실 섬망 환자와 비섬망 환자 구분에 기여하는 임상 지표에 관한 연구)

  • Ko, Chanyoung;Kim, Jae-Jin;Cho, Dongrae;Oh, Jooyoung;Park, Jin Young
    • Korean Journal of Psychosomatic Medicine
    • /
    • v.27 no.2
    • /
    • pp.101-110
    • /
    • 2019
  • Objectives : It is not clear which clinical variables are most closely associated with delirium in the Intensive Care Unit (ICU). By comparing clinical data of ICU delirium and non-delirium patients, we sought to identify variables that most effectively differentiate delirium from non-delirium. Methods : Medical records of 6,386 ICU patients were reviewed. Random Subset Feature Selection and Principal Component Analysis were utilized to select a set of clinical variables with the highest discriminatory capacity. Statistical analyses were employed to determine the separation capacity of two models-one using just the selected few clinical variables and the other using all clinical variables associated with delirium. Results : There was a significant difference between delirium and non-delirium individuals across 32 clinical variables. Richmond Agitation Sedation Scale (RASS), urinary catheterization, vascular catheterization, Hamilton Anxiety Rating Scale (HAM-A), Blood urea nitrogen, and Acute Physiology and Chronic Health Examination II most effectively differentiated delirium from non-delirium. Multivariable logistic regression analysis showed that, with the exception of vascular catheterization, these clinical variables were independent risk factors associated with delirium. Separation capacity of the logistic regression model using just 6 clinical variables was measured with Receiver Operating Characteristic curve, with Area Under the Curve (AUC) of 0.818. Same analyses were performed using all 32 clinical variables;the AUC was 0.881, denoting a very high separation capacity. Conclusions : The six aforementioned variables most effectively separate delirium from non-delirium. This highlights the importance of close monitoring of patients who received invasive medical procedures and were rated with very low RASS and HAM-A scores.

The Knowledge and Attitude on Breast Feeding of Female University Students (모유수유에 대한 여대생의 지식 및 태도)

  • Kim, Sung-Hee;Choi, Euy-Soon
    • Women's Health Nursing
    • /
    • v.7 no.1
    • /
    • pp.93-106
    • /
    • 2001
  • The purpose of this study is to provide the basic data in order to develop of some educational programs for increasing breast feeding by studying the female university student's knowledge and attitude on breast feeding, who will become a mother in future. The respondents of this research were selected at random for 462 female students at the university in Seoul and Kyongki area, and it was the period collected the data from Oct 28, 2000 to Nov 8, 2000. The method of study distributed the measuring tools of knowledge with 33 items and the tools of measurement with 20 items on the attitude of breast feeding to the respondents directly, and collected them. The data were analyzed to use SPSS program. Unpaired t-test, ANOVA, Pearson correlation coefficient and Multiple regression analysis were used for the calculation of difference between groups and the results were as follows ; 1. The breast feeding was 50.6% in the period of lactation for the respondents and the nuclear families were 81.7% in family constituent unit. In the future the wisher of breast feeding was 91.5%, the medical personnel was a major informer who enjoyed their best confidence, Besides the respond-ents responded that the proper period for education of the breast feeding was in a high school. 2. The level of Knowledge on breast feeding. The respondents's knowledge on breast feeding was average $16.40{\pm}4.59$ points on the basis of 33 points and On the merits and demerits ratio of breast feeding has shown highest but there was low in the field of such a concrete and practical plan as the estimate of breast feeding and the method and mindfulness for breast feeding. The higher grader, the college of the natural science showed significantly the higher points in the knowledge degree by respondents's characters and in such cases the persons of breast feeding or the informed of breast feeding by a medical personnel or the women of strong will for breast feeding action in the future. 3. The Attitude on breast feeding. There was relatively shown a positive attitude of the total average $60.50{\pm}7.59$ points and the average evaluation $3.04{\pm}.36$ points in the attitude on breast feeding. The attitude by each factors has the highest points in the practical action aspect but the lowest in the emotional aspect. The attitude on breast feeding by respondents's characters significantly showed a positive attitude in such cases the persons of breast feeding or the informed of breast feeding or the women of strong will for breast feeding action in the future. 4. Relation to knowledge and attitude on breast feeding. There was shown a correlation of definition in the relation to knowledge and attitude on breast feeding, 5.Factors which have an effect on knowledge and attitude on breast feeding. The factors which have an effect on knowledge of breast feeding were attitudes on breast feeding, graders, the college of natural science and the informed of breast feeding. Also the factors which have an effect on attitude on breast feeding were on will and knowledge on breast feeding, a large family, the informed of breast feeding. In conclusion, it will have to enforce a systematic education on the method of a practical breast feeding enlarged by a medical personnel and professional early enough as the information provision on breast feeding enables one to increase knowledge and attitude on it, besides it has relations with their practical will.

  • PDF

Service Quality, Customer Satisfaction and Customer Loyalty of Mobile Communication Industry in China (중국이동통신산업중적복무질량(中国移动通信产业中的服务质量), 고객만의도화고객충성도(顾客满意度和顾客忠诚度))

  • Zhang, Ruijin;Li, Xiangyang;Zhang, Yunchang
    • Journal of Global Scholars of Marketing Science
    • /
    • v.20 no.3
    • /
    • pp.269-277
    • /
    • 2010
  • Previous studies have shown that the most important factor affecting customer loyalty in the service industry is service quality. However, on the subject of whether service quality has a direct or indirect effect on customer loyalty, scholars' views apparently vary. Some studies suggest that service quality has a direct and fundamental influence on customer loyalty (Bai and Liu, 2002). However, others have shown that service quality not only directly affects customer loyalty, it also has an indirect impact on customer loyalty by influencing customer satisfaction and perceived value (Cronin, Brady, and Hult, 2000). Currently, there are few domestic articles that specifically address the relationship between service quality and customer loyalty in the mobile communication industry. Moreover, research has studied customer loyalty as a whole variable, rather than breaking it down further into multiple dimensions. Based on this analysis, this paper summarizes previous study results, establishes an effect mechanism model among service quality, customer satisfaction, and customer loyalty in the mobile communication industry, and presents a statistical test on model assumptions by using customer investigation data from Heilongjiang Mobile Company. It provides theoretical guidance for mobile service management based on the discussion of the hypothesis test results. For data collection, the sample comprised mobile users in Harbin city, and the survey was taken by random sampling. Out of a total of 300 questionnaires, 276 (92.9%) were recovered. After excluding invalid questionnaires, 249 remained, for an effective rate of 82.6 percent for the study. Cronbach's ${\alpha}$ coefficient was adapted to assess the scale reliability, and validity testing was conducted on the questionnaire from three aspects: content validity, construct validity. and convergent validity. The study tested for goodness of fit mainly from the absolute and relative fit indexes. From the hypothesis testing results, overall, four assumptions have not been supported. The ultimate affective relationship of service quality, customer satisfaction, and customer loyalty is demonstrated in Figure 2. On the whole, the service quality of the communication industry not only has a direct positive significant effect on customer loyalty, it also has an indirect positive significant effect on customer loyalty through service quality; the affective mechanism and extent of customer loyalty are different, and are influenced by each dimension of service quality. This study used the questionnaires of existing literature from home and abroad and tested them in empirical research, with all questions adapted to seven-point Likert scales. With the SERVQUAL scale of Parasuraman, Zeithaml, and Berry (1988), or PZB, as a reference point, service quality was divided into five dimensions-tangibility, reliability, responsiveness, assurance, and empathy-and the questions were simplified down to nineteen. The measurement of customer satisfaction was based mainly on Fornell (1992) and Wang and Han (2003), ending up with four questions. Based on the study’s three indicators of price tolerance, first choice, and complaint reaction were used to measure attitudinal loyalty, while repurchase intention, recommendation, and reputation measured behavioral loyalty. The collection and collation of literature data produced a model of the relationship among service quality, customer satisfaction, and customer loyalty in mobile communications, and China Mobile in the city of Harbin in Heilongjiang province was used for conducting an empirical test of the model and obtaining some useful conclusions. First, service quality in mobile communication is formed by the five factors mentioned earlier: tangibility, reliability, responsiveness, assurance, and empathy. On the basis of PZB SERVQUAL, the study designed a measurement scale of service quality for the mobile communications industry, and obtained these five factors through exploratory factor analysis. The factors fit basically with the five elements, indicating the concept of five elements of service quality for the mobile communications industry. Second, service quality in mobile communications has both direct and indirect positive effects on attitudinal loyalty, with the indirect effect being produced through the intermediary variable, customer satisfaction. There are also both direct and indirect positive effects on behavioral loyalty, with the indirect effect produced through two intermediary variables: customer satisfaction and attitudinal loyalty. This shows that better service quality and higher customer satisfaction will activate the attitudinal to service providers more active and show loyalty to service providers much easier. In addition, the effect mechanism of all dimensions of service quality on all dimensions of customer loyalty is different. Third, customer satisfaction plays a significant intermediary role among service quality and attitudinal and behavioral loyalty, indicating that improving service quality can boost customer satisfaction and make it easier for satisfied customers to become loyal customers. Moreover, attitudinal loyalty plays a significant intermediary role between service quality and behavioral loyalty, indicating that only attitudinally and behaviorally loyal customers are truly loyal customers. The research conclusions have some indications for Chinese telecom operators and others to upgrade their service quality. Two limitations to the study are also mentioned. First, all data were collected in the Heilongjiang area, so there might be a common method bias that skews the results. Second, the discussion addresses the relationship between service quality and customer loyalty, setting customer satisfaction as mediator, but does not consider other factors, like customer value and consumer features, This research will be continued in the future.

Real Option Analysis to Value Government Risk Share Liability in BTO-a Projects (손익공유형 민간투자사업의 투자위험분담 가치 산정)

  • KU, Sukmo;LEE, Sunghoon;LEE, Seungjae
    • Journal of Korean Society of Transportation
    • /
    • v.35 no.4
    • /
    • pp.360-373
    • /
    • 2017
  • The BTO-a projects is the types, which has a demand risk among the type of PPP projects in Korea. When demand risk is realized, private investor encounters financial difficulties due to lower revenue than its expectation and the government may also have a problem in stable infrastructure operation. In this regards, the government has applied various risk sharing policies in response to demand risk. However, the amount of government's risk sharing is the government's contingent liabilities as a result of demand uncertainty, and it fails to be quantified by the conventional NPV method of expressing in the text of the concession agreement. The purpose of this study is to estimate the value of investment risk sharing by the government considering the demand risk in the profit sharing system (BTO-a) introduced in 2015 as one of the demand risk sharing policy. The investment risk sharing will take the form of options in finance. Private investors have the right to claim subsidies from the government when their revenue declines, while the government has the obligation to pay subsidies under certain conditions. In this study, we have established a methodology for estimating the value of investment risk sharing by using the Black - Scholes option pricing model and examined the appropriateness of the results through case studies. As a result of the analysis, the value of investment risk sharing is estimated to be 12 billion won, which is about 4% of the investment cost of the private investment. In other words, it can be seen that the government will invest 12 billion won in financial support by sharing the investment risk. The option value when assuming the traffic volume risk as a random variable from the case studies is derived as an average of 12.2 billion won and a standard deviation of 3.67 billion won. As a result of the cumulative distribution, the option value of the 90% probability interval will be determined within the range of 6.9 to 18.8 billion won. The method proposed in this study is expected to help government and private investors understand the better risk analysis and economic value of better for investment risk sharing under the uncertainty of future demand.