• Title/Summary/Keyword: 부분최소제곱모형

Search Result 22, Processing Time 0.02 seconds

Variable Selection in PLS Regression with Penalty Function (벌점함수를 이용한 부분최소제곱 회귀모형에서의 변수선택)

  • Park, Chong-Sun;Moon, Guy-Jong
    • Communications for Statistical Applications and Methods
    • /
    • v.15 no.4
    • /
    • pp.633-642
    • /
    • 2008
  • Variable selection algorithm for partial least square regression using penalty function is proposed. We use the fact that usual partial least square regression problem can be expressed as a maximization problem with appropriate constraints and we will add penalty function to this maximization problem. Then simulated annealing algorithm can be used in searching for optimal solutions of above maximization problem with penalty functions added. The HARD penalty function would be suggested as the best in several aspects. Illustrations with real and simulated examples are provided.

Utilization of R Program for the Partial Least Square Model: Comparison of SmartPLS and R (부분최소제곱모형을 위한 R 프로그램의 활용: SmartPLS와 R의 비교)

  • Kim, Yong-Tae;Lee, Sang-Jun
    • Journal of Digital Convergence
    • /
    • v.13 no.12
    • /
    • pp.117-124
    • /
    • 2015
  • As the acceptance of statistical analysis has been increased because of Big Data, the needs for an advanced second generation of statistical analysis method like Structural Equation Model are also increasing. This study suggests how R-Program, as open software, can be utilized when Partial Least Square Model, one of the SEMs, is applied to statistical analysis. R is a free software as a part of GNU projects as well as a powerful and useful tool for statistical analysis including Big Data. The study utilized R and SmartPLS, a representative statistical package of PLS-SEM, and analyzed internal consistency reliability, convergent validity, and discriminant validity of the measurement model. The study also analyzed path coefficients and moderator effects of the structural model and compared the results, respectively. The results indicated that R showed the same results with SmartPLS on the measurement model and the structural model. Therefore, the study confirmed that R could be a powerful tool that is alternative to a commercial statistical package in the future.

A Development of Statistical Model for Pavement Response Model (도로포장 반응모형에 대한 통계모형 개발)

  • Lee, Moon Sup;Park, Hee Mun;Kim, Boo Il;Heo, Tae-Young
    • Journal of Korea Society of Industrial Information Systems
    • /
    • v.17 no.5
    • /
    • pp.89-96
    • /
    • 2012
  • The Falling Weight Deflectormeter has been widely used in evaluating the structural adequacy of pavement structures. The deflections measured from the FWD are capable of estimating the stiffness of pavement layers and measuring the pavement responses in the pavement structure. The objective of paper is to develop the pavement response model using a partial least square regression technique based on the FWD deflection data. The partial least square regression method enables to solve the multicollinearity problem occurred in multiple regression model. It is also found that the pavement response model can be developed using the raw data when a partial least square regression was used.

Analysis of internet addiction in Korean adolescents using sparse partial least-squares regression (희소 부분 최소 제곱법을 이용한 우리나라 청소년 인터넷 중독 자료 분석)

  • Han, Jeongseop;Park, Soobin;Lee, onghwan
    • The Korean Journal of Applied Statistics
    • /
    • v.31 no.2
    • /
    • pp.253-263
    • /
    • 2018
  • Internet addiction in adolescents is an important social issue. In this study, sparse partial least-squares regression (SPLS) was applied to internet addiction data in Korean adolescent samples. The internet addiction score and various clinical and psychopathological features were collected and analyzed from self-reported questionnaires. We considered three PLS methods and compared the performance in terms of prediction and sparsity. We found that the SPLS method with the hierarchical likelihood penalty was the best; in addition, two aggression features, AQ and BSAS, are important to discriminate and explain latent features of the SPLS model.

Type I projection sum of squares by weighted least squares (가중최소제곱법에 의한 제1종 사영제곱합)

  • Choi, Jaesung
    • Journal of the Korean Data and Information Science Society
    • /
    • v.25 no.2
    • /
    • pp.423-429
    • /
    • 2014
  • This paper discusses a method for getting Type I sums of squares by projections under a two-way fixed-effects model when variances of errors are not equal. The method of weighted least squares is used to estimate the parameters of the assumed model. The model is fitted to the data in a sequential manner by using the model comparison technique. The vector space generated by the model matrix can be composed of orthogonal vector subspaces spanned by submatrices consisting of column vectors related to the parameters. It is discussed how to get the Type I sums of squares by using the projections into the orthogonal vector subspaces.

Study on analysis with partial least square path modeling using multiple factor analysis (다중요인분석을 이용한 부분 최소제곱 경로 모형에 대한 고찰)

  • Park, Ri-Ra;Lee, Eun-Kyung
    • The Korean Journal of Applied Statistics
    • /
    • v.31 no.3
    • /
    • pp.315-328
    • /
    • 2018
  • In this paper, we examine the methodology to predict consumer preferences using several groups of attributes of products and application to real data. In the food industry, studies are in progress to investigate the relationship between product attributes and consumer preferences; consequently, various methodologies are proposed. Among these methodologies, we consider multiple factor analysis (MFA). The result of the MFA enable the division of consumers into four clusters with similar liking and the defining of preference characteristics for each cluster. Also, using the results of multiple factor analysis, we find the partial least squares path model to predict consumer preferences through the characteristics of the product and the characteristics evaluated by consumers. We can understand the relationship between the cluster of consumers and the preferred/undesirable characteristics of products through the partial least squares path model applied to two clusters with different liking. When multiple factor analysis is used in the partial least squares path model, it is possible to investigate relationships between products and consumers by analyzing product characteristics and consumer preferences simultaneously. The results can be applied to product developments and sales which makes this methodology important and useful.

Performance Comparison of Data Mining Approaches for Prediction Models of Near Infrared Spectroscopy Data (근적외선 분광 데이터 예측 모형을 위한 데이터 마이닝 기법의 성능비교)

  • Baek, Seung Hyun
    • Journal of the Korea Safety Management & Science
    • /
    • v.15 no.4
    • /
    • pp.311-315
    • /
    • 2013
  • 본 논문에서는 주성분 회귀법과 부분최소자승 회귀법을 비교하여 보여준다. 이 비교의 목적은 선형형태를 보유한 근적외선 분광 데이터의 분석에 사용할 수 있는 적합한 예측 방법을 찾기 위해서이다. 두 가지 데이터 마이닝 방법론인 주성분 회귀법과 부분최소자승 회귀법이 비교되어 질 것이다. 본 논문에서는 부분최소자승 회귀법은 주성분 회귀법과 비교했을 때 약간 나은 예측능력을 가진 결과를 보여준다. 주성분 회귀법에서 50개의 주성분이 모델을 생성하기 위해서 사용지만 부분최소자승 회귀법에서는 12개의 잠재요소가 사용되었다. 평균제곱오차가 예측능력을 측정하는 도구로 사용되었다. 본 논문의 근적외선 분광데이터 분석에 따르면 부분최소자승회귀법이 선형경향을 가진 데이터의 예측에 가장 적합한 모델로 판명되었다.

AI Technology Analysis using Partial Least Square Regression

  • Choi, JunHyeog;Jun, Sunghae
    • Journal of the Korea Society of Computer and Information
    • /
    • v.25 no.3
    • /
    • pp.109-115
    • /
    • 2020
  • In this paper, we propose an artificial intelligence(AI) technology analysis using partial least square(PLS) regression model. AI technology is now affecting most areas of our society. So, it is necessary to understand this technology. To analyze the AI technology, we collect the patent documents related to AI from the patent databases in the world. We extract AI technology keywords from the patent documents by text mining techniques. In addition, we analyze the AI keyword data by PLS regression model. This regression model is based on the technique of partial least squares used in the advanced analyses such as bioinformatics, social science, and engineering. To show the performance of our proposed method, we make experiments using AI patent documents, and we illustrate how our research can be applied to real problems. This paper is applicable not only to AI technology but also to other technological fields. This also contributes to understanding other various technologies by PLS regression analysis.

Minimum Bias Design for Polynomial Regression (다항회귀모형에 대한 최소편의 실험계획)

  • Jang, Dae-Heung;Kim, Youngil
    • The Korean Journal of Applied Statistics
    • /
    • v.28 no.6
    • /
    • pp.1227-1234
    • /
    • 2015
  • Traditional criteria for optimum experimental designs depend on the specifications of the model; however, there will be a dilemma when we do not have perfect knowledge about the model. Box and Draper (1959) suggested one direction to minimize bias that may occur in this situation. We will demonstrate some examples with exact solutions that provide a no-bias design for polynomial regression. The most interesting finding is that a design that requires less bias should allocate design points away from the border of the design space.

Type I Analysis by Projections (사영에 의한 제1종 분석)

  • Choi, Jae-Sung
    • The Korean Journal of Applied Statistics
    • /
    • v.24 no.2
    • /
    • pp.373-381
    • /
    • 2011
  • This paper discusses how to get the sums of squares due to treatment factors when Type I Analysis is used by projections for the analysis of data under the assumption of a two-way ANOVA model. The suggested method does not need to calculate the residual sums of squares for the calculation of sums of squares. There-fore, the calculation is easier and faster than classical ANOVA methods. It also discusses how eigenvectors and eigenvalues of the projection matrices can be used to get the calculation of sums of squares. An example is given to illustrate the calculation procedure by projections for unbalanced data.