• Title/Summary/Keyword: latent variable score

Search Result 10, Processing Time 0.023 seconds

A Stagewise Approach to Structural Equation Modeling (구조식 모형에 대한 단계적 접근)

  • Lee, Bora;Park, Changsoon
    • The Korean Journal of Applied Statistics
    • /
    • v.28 no.1
    • /
    • pp.61-74
    • /
    • 2015
  • Structural equation modeling (SEM) is a widely used in social sciences such as education, business administration, and psychology. In SEM, the latent variable score is the estimate of the latent variable which cannot be observed directly. This study uses stagewise structural equation modeling(stagewise SEM; SSEM) by partitioning the whole model into several stages. The traditional estimation method minimizes the discrepancy function using the variance-covariance of all observed variables. This method can lead to inappropriate situations where exogenous latent variables may be affected by endogenous latent variables. The SSEM approach can avoid such situations and reduce the complexity of the whole SEM in estimating parameters.

Estimating Average Causal Effect in Latent Class Analysis (잠재범주분석을 이용한 원인적 영향력 추론에 관한 연구)

  • Park, Gayoung;Chung, Hwan
    • The Korean Journal of Applied Statistics
    • /
    • v.27 no.7
    • /
    • pp.1077-1095
    • /
    • 2014
  • Unlike randomized trial, statistical strategies for inferring the unbiased causal relationship are required in the observational studies. Recently, new methods for the causal inference in the observational studies have been proposed such as the matching with the propensity score or the inverse probability treatment weighting. They have focused on how to control the confounders and how to evaluate the effect of the treatment on the result variable. However, these conventional methods are valid only when the treatment variable is categorical and both of the treatment and the result variables are directly observable. Research on the causal inference can be challenging in part because it may not be possible to directly observe the treatment and/or the result variable. To address this difficulty, we propose a method for estimating the average causal effect when both of the treatment and the result variables are latent. The latent class analysis has been applied to calculate the propensity score for the latent treatment variable in order to estimate the causal effect on the latent result variable. In this work, we investigate the causal effect of adolescents delinquency on their substance use using data from the 'National Longitudinal Study of Adolescent Health'.

Latent causal inference using the propensity score from latent class regression model (잠재범주회귀모형의 성향점수를 이용한 잠재변수의 원인적 영향력 추론 연구)

  • Lee, Misol;Chung, Hwan
    • The Korean Journal of Applied Statistics
    • /
    • v.30 no.5
    • /
    • pp.615-632
    • /
    • 2017
  • Unlike randomized trial, statistical strategies for inferring the unbiased causal relationship are required in the observational studies. The matching with the propensity score is one of the most popular methods to control the confounders in order to evaluate the effect of the treatment on the outcome variable. Recently, new methods for the causal inference in latent class analysis (LCA) have been proposed to estimate the average causal effect (ACE) of the treatment on the latent discrete variable. They have focused on the application study for the real dataset to estimate the ACE in LCA. In practice, however, the true values of the ACE are not known, and it is difficult to evaluate the performance of the estimated the ACE. In this study, we propose a method to generate a synthetic data using the propensity score in the framework of LCA, where treatment and outcome variables are latent. We then propose a new method for estimating the ACE in LCA and evaluate its performance via simulation studies. Furthermore we present an empirical analysis based on data form the 'National Longitudinal Study of Adolescents Health,' where puberty as a latent treatment and substance use as a latent outcome variable.

Discovery of Association Rules Using Latent Variables

  • Park, Hee-Chang;Cho, Kwang-Hyun
    • 한국데이터정보과학회:학술대회논문집
    • /
    • 2005.10a
    • /
    • pp.177-188
    • /
    • 2005
  • Association rule mining searches for interesting relationships among items in a given large data set. Association rules are frequently used by retail stores to assist in marketing, advertising, floor placement, and inventory control. There are three primary threshold measures in association rule; support and confidence and lift. In the case of appling real world to association rules, we have some difficulties in data interpretation because we obtain many rules. In this paper, we develop the model of association rules using latent variables for environmental survey data.

  • PDF

Discovery of Association Rules Using Latent Variables

  • Park, Hee-Chang;Cho, Kwang-Hyun
    • Journal of the Korean Data and Information Science Society
    • /
    • v.17 no.1
    • /
    • pp.149-160
    • /
    • 2006
  • Association rule mining searches for interesting relationships among items in a given large data set. Association rules are frequently used by retail stores to assist in marketing, advertising, floor placement, and inventory control. There are three primary threshold measures in association rule; support and confidence and lift. In the case of appling real world to association rules, we have some difficulties in data interpretation because we obtain many rules. In this paper, we develop the model of association rules using latent variables for environmental survey data.

  • PDF

Investigating Students' Profiles of Mathematical Modeling: A Latent Profile Analysis in PISA 2012

  • SeoJin Jeong;Jihyun Hwang;Jeong Su Ahn
    • Research in Mathematical Education
    • /
    • v.26 no.3
    • /
    • pp.235-252
    • /
    • 2023
  • We investigated the classification of learner groups for students' mathematical modeling competency and analyzed the characteristics in each profile group for each country and variable using PISA 2012 data from six countries. With a perspective on measuring sub-competency, we applied the latent profile analysis method to student achievement for mathematical modeling variables - Formulate, Employ, Interpret. The findings showed the presence of 4-6 profile groups, with the variables exhibiting high and low achievement within each profile group varying by country, and a hierarchical structure was observed in the profile group distribution in all countries, interestingly, the Formulate variable showed the largest difference between high-achieving and low-achieving profile groups. These results have significant implications. Comparison by country, variable, and profile group can provide valuable insights into understanding the various characteristics of students' mathematical modeling competency. The Formulate variable could serve as the most suitable predictor of a student's profile group and the score range of other variables. We suggest further studies to gain more detailed insights into mathematical modeling competency with different cultural contexts.

An Exploratory Study on the Introduction of Loyalty to Segmentation of Theme Park Users (주제공원 이용자의 시장세분화를 위한 충성도의 사용가능성 검토)

    • Journal of the Korean Institute of Landscape Architecture
    • /
    • v.26 no.1
    • /
    • pp.1-11
    • /
    • 1998
  • The purpose of this paper is twofold : to identify loyalty applicable to segmentation of theme park users and to find characteristics of the segments. Thetheme park was regarded as a product and Lotte World was regarded as a brand. One hundred thirty five college students were selected by nonprobability sampling for two waves thirty of data collection. Both behavioral and attitudinal dimension of loyalty were measured in the first wave by the proportion of visit of the Lotte World to 3 major theme parks for one year, including the Lotte World, and by calculating the mean score of selected 7 attitudinal items, respectively. After 14 weeks, the same respondents were asked the number of actual visits of the Lotte World. Medians of two dimensions and cluster analyses were utilized to classify the respondents into 4 categories : high, spurious, latent, and low loyalty. Then ANOVA and $$\chi$^2$ test of independence were conducted to find the difference in intention to visit the Lotte World and actual visitation of it among groups. Only intention was significantly different by the group and the mean score of intention was highest in the high loyalty group. Although no statistical difference was found in actual visitation among groups, the theory of planned behavior provided a theoretical support to conclude that the loyalty is a useful variable for segmentation of theme park users because intention is an antecedent variable to the behavior. Discriminant analyses showed that characteristics of each loyalty group can be differentiated by motivations and constraints. When median was a group classification criterion, 73.2 percent of high loyalty group was correctly classified. A few comments were suggested on data collection, and inclusion of new discriminant variables was discussed for the future research.

  • PDF

Structural Equation Model Analysis of Communication Ability by Havruta Teaching-Learning Method

  • Jae-Nam Kim;Seong-Eun Chu
    • Journal of the Korea Society of Computer and Information
    • /
    • v.28 no.10
    • /
    • pp.197-205
    • /
    • 2023
  • This study is to apply the Havruta teaching-learning method to college students' major classes and analyze the relationship between the effectiveness evaluation of communication skills and sub-factors using a structural equation model. As a result of the study, the communication ability score was different before and after Havruta teaching-learning, and it was found that after Havruta teaching-learning was higher than before Havruta teaching-learning. The path effect was found to be significant in all of the total, direct, and indirect effects among latent variables, except for the relationship between interpretation ability, role-playing ability, and goal-setting ability in the direct effect. In this study, it was found that the Havruta teaching-learning method not only improves creativity and thinking ability, but also improves self-directed learning ability. In addition, it was reconfirmed that it is a teaching-learning method that can develop social skills and communication skills as well as problem-solving skills while experiencing opinions different from one's own. As a result, research on a thorough student-centered teaching-learning method suitable for the Homo Machina era must be continued and its application in the educational field must be implemented.

Anomaly Detection for User Action with Generative Adversarial Networks (적대적 생성 모델을 활용한 사용자 행위 이상 탐지 방법)

  • Choi, Nam woong;Kim, Wooju
    • Journal of Intelligence and Information Systems
    • /
    • v.25 no.3
    • /
    • pp.43-62
    • /
    • 2019
  • At one time, the anomaly detection sector dominated the method of determining whether there was an abnormality based on the statistics derived from specific data. This methodology was possible because the dimension of the data was simple in the past, so the classical statistical method could work effectively. However, as the characteristics of data have changed complexly in the era of big data, it has become more difficult to accurately analyze and predict the data that occurs throughout the industry in the conventional way. Therefore, SVM and Decision Tree based supervised learning algorithms were used. However, there is peculiarity that supervised learning based model can only accurately predict the test data, when the number of classes is equal to the number of normal classes and most of the data generated in the industry has unbalanced data class. Therefore, the predicted results are not always valid when supervised learning model is applied. In order to overcome these drawbacks, many studies now use the unsupervised learning-based model that is not influenced by class distribution, such as autoencoder or generative adversarial networks. In this paper, we propose a method to detect anomalies using generative adversarial networks. AnoGAN, introduced in the study of Thomas et al (2017), is a classification model that performs abnormal detection of medical images. It was composed of a Convolution Neural Net and was used in the field of detection. On the other hand, sequencing data abnormality detection using generative adversarial network is a lack of research papers compared to image data. Of course, in Li et al (2018), a study by Li et al (LSTM), a type of recurrent neural network, has proposed a model to classify the abnormities of numerical sequence data, but it has not been used for categorical sequence data, as well as feature matching method applied by salans et al.(2016). So it suggests that there are a number of studies to be tried on in the ideal classification of sequence data through a generative adversarial Network. In order to learn the sequence data, the structure of the generative adversarial networks is composed of LSTM, and the 2 stacked-LSTM of the generator is composed of 32-dim hidden unit layers and 64-dim hidden unit layers. The LSTM of the discriminator consists of 64-dim hidden unit layer were used. In the process of deriving abnormal scores from existing paper of Anomaly Detection for Sequence data, entropy values of probability of actual data are used in the process of deriving abnormal scores. but in this paper, as mentioned earlier, abnormal scores have been derived by using feature matching techniques. In addition, the process of optimizing latent variables was designed with LSTM to improve model performance. The modified form of generative adversarial model was more accurate in all experiments than the autoencoder in terms of precision and was approximately 7% higher in accuracy. In terms of Robustness, Generative adversarial networks also performed better than autoencoder. Because generative adversarial networks can learn data distribution from real categorical sequence data, Unaffected by a single normal data. But autoencoder is not. Result of Robustness test showed that he accuracy of the autocoder was 92%, the accuracy of the hostile neural network was 96%, and in terms of sensitivity, the autocoder was 40% and the hostile neural network was 51%. In this paper, experiments have also been conducted to show how much performance changes due to differences in the optimization structure of potential variables. As a result, the level of 1% was improved in terms of sensitivity. These results suggest that it presented a new perspective on optimizing latent variable that were relatively insignificant.

Methodology for Classifying Hierarchical Data Using Autoencoder-based Deeply Supervised Network (오토인코더 기반 심층 지도 네트워크를 활용한 계층형 데이터 분류 방법론)

  • Kim, Younha;Kim, Namgyu
    • Journal of Intelligence and Information Systems
    • /
    • v.28 no.3
    • /
    • pp.185-207
    • /
    • 2022
  • Recently, with the development of deep learning technology, researches to apply a deep learning algorithm to analyze unstructured data such as text and images are being actively conducted. Text classification has been studied for a long time in academia and industry, and various attempts are being performed to utilize data characteristics to improve classification performance. In particular, a hierarchical relationship of labels has been utilized for hierarchical classification. However, the top-down approach mainly used for hierarchical classification has a limitation that misclassification at a higher level blocks the opportunity for correct classification at a lower level. Therefore, in this study, we propose a methodology for classifying hierarchical data using the autoencoder-based deeply supervised network that high-level classification does not block the low-level classification while considering the hierarchical relationship of labels. The proposed methodology adds a main classifier that predicts a low-level label to the autoencoder's latent variable and an auxiliary classifier that predicts a high-level label to the hidden layer of the autoencoder. As a result of experiments on 22,512 academic papers to evaluate the performance of the proposed methodology, it was confirmed that the proposed model showed superior classification accuracy and F1-score compared to the traditional supervised autoencoder and DNN model.