• Title/Summary/Keyword: latent class model

Feature selection for text data via topic modeling (토픽 모형을 이용한 텍스트 데이터의 단어 선택)

  • Woosol, Jang;Ye Eun, Kim;Won, Son
    • The Korean Journal of Applied Statistics / v.35 no.6 / pp.739-754 / 2022
  • Text data usually consist of many variables, some of which are closely correlated. Such multicollinearity often results in inefficient or inaccurate statistical analysis. For supervised learning, one can select features by examining the relationship between target variables and explanatory variables. For unsupervised learning, however, target variables are absent, so the feature selection procedures of supervised learning cannot be used. In this study, we propose a word selection procedure that employs topic models to find latent topics. We substitute topics for the target variables and select terms that show high relevance to each topic. Applying the procedure to real data, we found that the proposed word selection procedure can give a clear topic interpretation by removing high-frequency words prevalent in various topics. In addition, we observed that, when the selected variables are fed to classifiers such as naïve Bayes classifiers and support vector machines, the proposed feature selection procedure gives results comparable to those obtained by using class label information.
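The idea of selecting words by their relevance to a topic can be sketched as follows. This is only an illustration under stated assumptions: the abstract does not say which relevance measure the authors use, so the score below (a weighted mix of log p(w | topic) and log lift) is a stand-in, and `select_words`, `lam`, `top_n`, and the toy probabilities are all hypothetical. The topic-word distributions are assumed to come from an already fitted topic model.

```python
import math

def select_words(topic_word, word_freq, lam=0.2, top_n=2):
    """Return the top_n highest-scoring words for each topic.

    topic_word: list of dicts, topic_word[k][w] = p(w | topic k)
    word_freq:  dict, word_freq[w] = marginal probability p(w)
    lam:        weight on in-topic probability versus lift (assumed value)
    """
    selected = []
    for dist in topic_word:
        # Assumed relevance score: lam*log p(w|k) + (1-lam)*log(p(w|k)/p(w)).
        scores = {
            w: lam * math.log(p) + (1 - lam) * math.log(p / word_freq[w])
            for w, p in dist.items()
            if p > 0
        }
        ranked = sorted(scores, key=scores.get, reverse=True)
        selected.append(ranked[:top_n])
    return selected

# Toy topic-word distributions (made up for illustration).
topic_word = [
    {"the": 0.50, "goal": 0.25, "match": 0.20, "vote": 0.03, "party": 0.02},
    {"the": 0.50, "goal": 0.03, "match": 0.02, "vote": 0.25, "party": 0.20},
]
# Marginal word probabilities, assuming the two topics are equally likely.
word_freq = {w: (topic_word[0][w] + topic_word[1][w]) / 2 for w in topic_word[0]}

selected = select_words(topic_word, word_freq)
print(selected)
```

In this toy setting the word "the" has the highest probability in both topics but zero lift, so with a small `lam` it is outranked by the topic-specific words, which mirrors the removal of high-frequency words prevalent in various topics.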

Market Segmentation to Identify Forest Recreation Welfare Consumers (산림휴양복지 수요자에 대한 시장 세분화 연구)

  • Seung Yeon Byun;Seong Yoon Heo;Ja-choon Koo
    • Journal of Korean Society of Forest Science / v.112 no.2 / pp.248-257 / 2023
  • Because of various societal changes, such as the recent improvement in income levels and the extension of the flexible work system, the demand for forest recreation activities and their use patterns are changing. Accordingly, it is necessary to identify the characteristics of each type through the segmentation of the overall forest recreation and welfare markets and to plan differentiated policies for each market type. This study classifies users of forest recreation and welfare activities into four types (i.e., passive usage type, ordinary type, active lover type, and indifferent type) using Latent Class Analysis and examines their demographic and socioeconomic characteristics to explain the differences between the groups. Three policy implications were derived from the results: 1) the group experiencing forest recreation welfare is subdivided; 2) the socioeconomic characteristics that distinguish the groups undertaking forest recreation activities were identified; and 3) the policy targets and characteristics that can increase the experience of forest recreation welfare were identified. This study is insightful in that it suggests differentiated policies for each group and proposes policy measures to move users toward the desirable group.
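For readers unfamiliar with Latent Class Analysis, a minimal sketch of fitting a latent class model to binary survey items with the EM algorithm is given below. It is a generic textbook illustration, not the software or model specification used in the study: the function name `fit_lca`, the deterministic initialisation, the two-class setting, and the toy responses are all assumptions made for this example.

```python
import math

def fit_lca(data, n_classes=2, n_iter=200):
    """Fit a latent class model to binary items with EM.

    data: list of equal-length lists of 0/1 responses.
    Returns (weights, probs, history), where probs[c][j] is
    P(item j = 1 | class c) and history holds the log-likelihood
    evaluated at the start of each iteration.
    """
    n_items = len(data[0])
    weights = [1.0 / n_classes] * n_classes
    # Deterministic, evenly spaced starting values (an assumption).
    probs = [[(c + 1) / (n_classes + 1)] * n_items for c in range(n_classes)]
    history = []
    for _ in range(n_iter):
        # E-step: posterior class membership for each respondent.
        post, loglik = [], 0.0
        for row in data:
            joint = []
            for c in range(n_classes):
                p = weights[c]
                for j, x in enumerate(row):
                    p *= probs[c][j] if x else 1.0 - probs[c][j]
                joint.append(p)
            total = sum(joint)
            loglik += math.log(total)
            post.append([p / total for p in joint])
        history.append(loglik)
        # M-step: update class sizes and item-response probabilities.
        for c in range(n_classes):
            nc = sum(r[c] for r in post)
            weights[c] = nc / len(data)
            for j in range(n_items):
                num = sum(r[c] * row[j] for r, row in zip(post, data))
                # Clamp so that later logs and products stay finite.
                probs[c][j] = min(max(num / nc, 1e-6), 1.0 - 1e-6)
    return weights, probs, history

# Toy responses: 50 mostly-"yes" and 50 mostly-"no" respondents.
data = (
    [[1, 1, 1, 1]] * 20 + [[1, 1, 1, 0]] * 10 + [[1, 1, 0, 1]] * 10
    + [[1, 0, 1, 1]] * 5 + [[0, 1, 1, 1]] * 5
    + [[0, 0, 0, 0]] * 20 + [[0, 0, 0, 1]] * 10 + [[0, 0, 1, 0]] * 10
    + [[0, 1, 0, 0]] * 5 + [[1, 0, 0, 0]] * 5
)
weights, probs, history = fit_lca(data)
print(weights)
```

In practice the number of classes is chosen by comparing fitted models with different `n_classes` on criteria such as BIC, and each respondent is assigned to the class with the highest posterior probability.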

A Study on Political Attitude Estimation of Korean OSN Users (온라인 소셜네트워크를 통한 한국인의 정치성향 예측 기법의 연구)

  • Wijaya, Muhammad Eka;Ahn, Heejune
    • Journal of Korea Society of Industrial Information Systems / v.21 no.4 / pp.1-11 / 2016
  • Recently, numerous studies have been conducted to estimate human personality from online social activities. This paper develops a comprehensive model for political attitude estimation that leverages users' Facebook Like information. We designed a Facebook crawler that efficiently collects data while overcoming the difficulties of crawling Ajax-enabled Facebook pages. We show that category-level selection can reduce the complexity of data analysis by exploiting the sparsity of the huge like-attitude matrix. In the context of Korean Facebook users, only 28 criteria (3% of the total) can estimate the political polarity of a user with high accuracy (AUC of 0.82).
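As a reminder of what an AUC figure measures, the sketch below computes the area under the ROC curve as the probability that a randomly chosen positive case receives a higher score than a randomly chosen negative case. The scores and labels are made-up toy values and the helper `auc` is hypothetical; nothing here reproduces the paper's data or model.

```python
def auc(scores, labels):
    """Area under the ROC curve via pairwise comparison.

    scores: predicted scores, higher means more likely positive.
    labels: 1 for positive cases, 0 for negative cases.
    """
    pos = [s for s, y in zip(scores, labels) if y == 1]
    neg = [s for s, y in zip(scores, labels) if y == 0]
    if not pos or not neg:
        raise ValueError("need at least one positive and one negative case")
    wins = 0.0
    for p in pos:
        for n in neg:
            if p > n:
                wins += 1.0   # positive ranked above negative
            elif p == n:
                wins += 0.5   # ties count as half
    return wins / (len(pos) * len(neg))

# Toy example: 3 positive and 3 negative users.
scores = [0.9, 0.8, 0.7, 0.6, 0.4, 0.3]
labels = [1, 1, 0, 1, 0, 0]
value = auc(scores, labels)
print(round(value, 3))
```

A value of 0.5 corresponds to random ranking and 1.0 to perfect separation, so an AUC of 0.82 means a positive case outranks a negative one in about 82% of pairs.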

A Comparison Study on Satisfied Customer Reclassification Methods for Customer Satisfaction Management (고객만족경영을 위한 만족고객 재분류 방법의 비교 연구)

  • Song, Ki-Jeong;Seo, Kwang-Kyu
    • Journal of Digital Convergence / v.11 no.1 / pp.139-144 / 2013
  • This paper is an exploratory study on improving customer satisfaction surveys to resolve practical problems. It is a natural phenomenon that, as the customer satisfaction index rises, the ratio of satisfied customers rises too. However, the practical usefulness of customer satisfaction surveys for improving customer satisfaction decreases because of structural limitations in how their data are analyzed. To cope with these problems, we compare three satisfied customer reclassification methods: attribute complex scores, the satisfaction/dissatisfaction dimension, and latent class analysis models. The case study results show that each reclassification method has merits and demerits, and the methods are expected to serve as groundwork for revitalizing customer satisfaction surveys as well as improving customer satisfaction management.

An Exploratory Study on International Undergraduate Students' Satisfaction with Life of Studying Abroad -Focusing on Multidimensional Approach- (외국인 학부 유학생의 유학생활만족에 관한 탐색적 연구 -다차원적 접근을 중심으로-)

  • Hwang, Dongjin
    • The Journal of the Korea Contents Association / v.21 no.6 / pp.415-424 / 2021
  • Life as a student abroad includes not only school life but also areas such as economy, social relationships, and culture, so the level of satisfaction in each area can differ from one individual to another. Based on this awareness, this study analyzes satisfaction with life of studying abroad from a multidimensional perspective. A latent class analysis was applied to identify subgroups, and a multinomial logistic regression model was applied to verify the factors influencing group classification. The results can be summarized in two points. First, there were subgroups showing different satisfaction with life of studying abroad. The subgroups showed different levels of satisfaction in five areas (housing, economy, social relationships, study, and culture), which could not be discerned in a single dimension. Second, group classification was influenced in a complex way by academic, psychological/emotional, and environmental factors. In particular, the predictive factors had different influences on each sub-factor. Based on these results, this study seeks practical and policy-level suggestions for improving foreign students' satisfaction with life of studying abroad.

Predictors of Latent Class of Longitudinal Medical Expenses of Older People and the Effects on Subjective Health (노인 의료비 변화궤적의 잠재계층 유형: 예측요인과 주관적 건강에 대한 영향)

  • Song, Si Young;Jun, Hey Jung;Choi, Bo Mi
    • 한국노년학 / v.39 no.3 / pp.467-484 / 2019
  • The purpose of this study is to explore latent classes of longitudinal medical expenses of older people and to analyze their predictors and their effects on subjective health. Among participants of the Korean Health Panel, the sample of this study includes 1,119 people who are 65 years old or older and reported their medical expenses for nine consecutive years. The analyses were conducted in three steps. First, a Growth Mixture Model (GMM) was applied to find distinct subgroups showing similar patterns in medical expenses. The results showed four groups, classified as the high medical expenditure maintenance group, medical expenditure increase group, low medical expenditure maintenance group, and medical expenditure reduction group. Second, multinomial logistic regression found that the presence of a spouse, economic participation, the number of chronic diseases, and the type of health insurance were significant predictors of latent classes in medical expenses. In particular, the greater the number of chronic diseases, the higher the likelihood of belonging to the high medical expenditure maintenance group. In addition, medical benefit recipients are more likely to belong to the low medical expenditure maintenance and medical expenditure reduction groups. Third, multiple regression analysis revealed that older people in the groups with low or decreasing expenses reported better subjective health than people with higher expenses. This study is meaningful in that it explores the heterogeneity in longitudinal medical expenses among older people, its predictors, and its associations with health outcomes. The results of this research provide background information for establishing public health policy for older people.

Effects of Consistency Criterion for Scoring on the Reliability and the Validity of Polygraph Test for Crime Suspects (범죄 용의자의 거짓말탐지검사의 신뢰도와 타당도에 대한 일관성 채점기준의 효과)

  • Han, Yu-Hwa;Jeong, Je-Young;Park, Kwang-Bai
    • Science of Emotion and Sensibility / v.12 no.4 / pp.557-564 / 2009
  • For scoring polygraph charts, the Prosecutors' Office of the Republic of Korea uses a consistency criterion in which an elevated signal on one physiological channel is scored as a deceptive response only if the signal is also elevated on other channels. In the current study, the effects of this scoring criterion on reliability and accuracy (validity) of polygraph scores were assessed. Polygraph tests on 26 suspects were evaluated twice by the same examiners. The examiners used the consistency criterion in the first evaluation. In the second evaluation, the examiners were prevented from using the criterion; the signals from each physiological channel were separated and randomly arranged before they were rescored by the same examiner. Reliability was assessed by the variation among the scores for each suspect. Accuracy was assessed by establishing a standard, based on a Latent Class Analysis model, using the results of polygraph tests on each of 182 additional suspects. Reliability and accuracy were both improved by the use of the consistency criterion which therefore was recommended.

A Study of the Relation of Stress to Oral Parafunctional Habits of Male High School Students (일부 지역 남자 고등학생들의 스트레스와 구강악습관과의 관련성 연구)

  • Jung, Yu Yeon;Hong, Jin Tae
    • Journal of dental hygiene science / v.13 no.4 / pp.471-479 / 2013
  • This study examines stress among male high school students and the relationship between stress, by academic and family economic level, and oral parafunctional habits. It aims to emphasize the need for education on oral parafunctional habits and to provide basic data for properly maintaining the health of the oral and maxillofacial region. From May to July 2013, a self-administered survey was conducted among first- and second-grade students, selected by convenience sampling, at two high schools in Chungnam, Korea. The results were as follows: 1) Among the five areas of stress, school life stress was the highest at 2.11 points and home problem stress was the lowest at 1.51 points; 2) by grade, stress was higher in grade 2 than in grade 1 in all areas, with significant differences in school life (2.21) (p<0.01), interpersonal relationships (p<0.01), and personal problems (p<0.05); 3) with stress treated as a latent variable, the tests of association between the five areas of stress and oral parafunctional habits were all significant (p<0.001). The correlation between stress and oral parafunctional habits was weak and negative (-0.30), whereas the stress areas of school life, personal problems, environmental problems, and interpersonal relationships showed very strong correlations above 0.7; 4) in the fit tests of the model of stress, academic level, and family economic level, the goodness of fit index, adjusted goodness of fit index, and normed fit index all exceeded 0.9, and the root mean square residual and root mean square error of approximation were both estimated below 0.1, indicating a good model fit. From this study, it can be concluded that stress and oral parafunctional habits are correlated.

A longitudinal analysis of high school students' dropping out: Focusing on the change pattern of dropout, changes in school violence and school counseling. (전국 고등학교 학생의 학업중단에 대한 종단적 분석 -학업중단 변화양상에 따른 유형탐색, 학교폭력 및 학교상담의 변화추이를 중심으로-)

  • Kwon, Jae-Ki;Na, Woo-Yeol
    • Journal of the Korean Society of Child Welfare / no.59 / pp.209-234 / 2017
  • This study viewed schools as a cause of students dropping out and posited that high school dropout would vary depending on the characteristics and influencing factors of the school from which students were dropping out. Therefore, focusing on schools, we longitudinally investigated the change patterns of school dropout across high schools in the country and the types of change in high school dropout. In addition, we predicted the general characteristics of schools according to dropout type, looked at the changes in the major factors affecting school dropout (i.e., school violence and school counseling), and reviewed schools' long-term efforts and outcomes in relation to school dropout. For this purpose, KERIS EDSS's "Secondary School Information Disclosure Data" were used. The final model included data collected over five years (2012-2016) from high schools across the country. The results were as follows. First, to examine the longitudinal change patterns of high school dropout, a latent growth model analysis was conducted, which revealed that the dropout rate decreased over time. Second, growth mixture modeling was used to explore school types according to their dropout change patterns. The results showed three types: the "remaining in school" type, the "gradually decreasing school dropout" type, and the "increasing school dropout" type. Third, multinomial logistic regression was conducted to predict the general characteristics of schools by type. The results showed that public schools, vocational schools, and schools with a large number of students below the basic level in Korean, English, and mathematics were more likely to belong to the "increasing school dropout" type. Further, the larger the total number of students, the higher the probability of belonging to the "remaining in school" type or the "gradually decreasing school dropout" type. Lastly, growth mixture modeling was used to analyze the trends in school violence and school counseling according to the three types, with a focus on the "gradually decreasing school dropout" type. For this type, the number of school violence cases and the number of offenders gradually decreased over time. In addition, in terms of school counseling, the number of professional counselors placed in schools increased every year and peer counseling was continuously promoted, which may account for the "gradually decreasing school dropout" type.