• Title/Summary/Keyword: Multivariate algorithm

Search Result 186, Processing Time 0.025 seconds

Biplot method algorithm and application in tire engineering (Biplot 이론과 타이어 제조공학에의 응용)

  • 조완현
    • The Korean Journal of Applied Statistics
    • /
    • v.9 no.2
    • /
    • pp.55-72
    • /
    • 1996
  • It is essential in modern industry that quality and procuctivity are improved continuously. To accomplish this purpose, quality control must be maintained in all parts of a company. Recently, some tire manufacture companies are beginning to show interest in quality control. They have tried to achive some results through the statistical analysis for the experimental data which has accumulated up to now and then they strive to determine the structural relationship between the design factors in tire construction and tire performance characteristics. The measurement data obtained from the construction engineering is given in multivariate form owing to the various properties found in tire design components as wll as in performance. Also it may be existed the relationship among the multimple response variables. Thus we proposes the use of the biplot graphical display as an analytic tool of data matrices with complex respects. The proposed biplots are also availalbe to understand both the underlying structure of the data and the roles played by the different components. In particular, we consider the matter of how best to use the biplots in the maltivariate analysis of variance and multiple response data. Finally we apply this method to analyze the actual data.

  • PDF

A Procedure for Indentifying Outliers in Multivariate Data (다변량 자료에서 다수 이상치 인식의 절차)

  • Yum, Joon-Keun;Park, Jong-Goo;Kim, Jong-Woo
    • Journal of Korean Society for Quality Management
    • /
    • v.23 no.4
    • /
    • pp.28-41
    • /
    • 1995
  • We consider the problem of identifying multiple outliers in linear model. The available regression diagnostic methods often do not succeed in detecting multiple outliers because of the masking and swamping effect. Recently, among the various robust estimator of reducing the effect of outliers, LMS(Least Meadian Square) estimator has been to be a suitable method proposed to expose outliers and leverage points. However, as you know it, the data analysis method with LMS estimator is to be taken the median of the squared residuals in the sample which is extracted the sample space. Then this model causes the trouble, for the number of the chosen sample is nCp, i.e. as the size of sample space n is increasing, the number is increasing fastly. And the covariance matrix may be the singular matrix, so that matrix is approching collinearity. Thus we propose a procedure ELMS for the resampling in LMS method and study the size of the effective elementary set in this algorithm.

  • PDF

Removing Non-informative Features by Robust Feature Wrapping Method for Microarray Gene Expression Data (유전자 알고리즘과 Feature Wrapping을 통한 마이크로어레이 데이타 중복 특징 소거법)

  • Lee, Jae-Sung;Kim, Dae-Won
    • Journal of KIISE:Software and Applications
    • /
    • v.35 no.8
    • /
    • pp.463-478
    • /
    • 2008
  • Due to the high dimensional problem, typically machine learning algorithms have relied on feature selection techniques in order to perform effective classification in microarray gene expression datasets. However, the large number of features compared to the number of samples makes the task of feature selection computationally inprohibitive and prone to errors. One of traditional feature selection approach was feature filtering; measuring one gene per one step. Then feature filtering was an univariate approach that cannot validate multivariate correlations. In this paper, we proposed a function for measuring both class separability and correlations. With this approach, we solved the problem related to feature filtering approach.

An integrated approach for structural health monitoring using an in-house built fiber optic system and non-parametric data analysis

  • Malekzadeh, Masoud;Gul, Mustafa;Kwon, Il-Bum;Catbas, Necati
    • Smart Structures and Systems
    • /
    • v.14 no.5
    • /
    • pp.917-942
    • /
    • 2014
  • Multivariate statistics based damage detection algorithms employed in conjunction with novel sensing technologies are attracting more attention for long term Structural Health Monitoring of civil infrastructure. In this study, two practical data driven methods are investigated utilizing strain data captured from a 4-span bridge model by Fiber Bragg Grating (FBG) sensors as part of a bridge health monitoring study. The most common and critical bridge damage scenarios were simulated on the representative bridge model equipped with FBG sensors. A high speed FBG interrogator system is developed by the authors to collect the strain responses under moving vehicle loads using FBG sensors. Two data driven methods, Moving Principal Component Analysis (MPCA) and Moving Cross Correlation Analysis (MCCA), are coded and implemented to handle and process the large amount of data. The efficiency of the SHM system with FBG sensors, MPCA and MCCA methods for detecting and localizing damage is explored with several experiments. Based on the findings presented in this paper, the MPCA and MCCA coupled with FBG sensors can be deemed to deliver promising results to detect both local and global damage implemented on the bridge structure.

Source Characterization of Suspended Particulate Matter in Taegu Area, Using Principal Component Analysis Coupled with Multiple Regression (주성분/중회귀분석을 이용한 대구지역 대기중 부유분진의 발생원별 특성평가)

  • 백성옥;황승만
    • Journal of Korean Society for Atmospheric Environment
    • /
    • v.8 no.3
    • /
    • pp.179-190
    • /
    • 1992
  • This study was carried out to characterize sources of atmospheric total suspended particulates (TSP) in urban and sub--urban areas of metropolitan taegu. The sources were tentatively identified by a multivariate technique, i.e. principal component analysis (PCA), and the source contributions to the atmospheric concentrations of TSP were further estimated by stepwise multiple regression analysis. A total of 5 sources was identified in the urban area of Taegu (soil dust resuspension, fuel combustion, secondary aerosol, traffic related aerosol, and refuge burning), while 4 sources were found to be significant in the sub--urban area as following: fuel combustion/secondary aerosol, soil dust resuspension, traffic related aerosol, and wood/agricultural burning. The largest contributor to the atmospheric TSP appeared to be the soil dust resuspension in both areas. The source apportionment of the extractable organic matter (EOM) was also carried out for the Taegu data. The EOM was determined with respect to the solvent polarity, i.e. cyclohexane (non-polar), dichloromethane (semi--polar), and acetone (polar). In addition, the source profiles for the TSP in Taegu area were estimated using a PCA-based algorithm, and the validity was evaluated tentatively by comparing the data in the literature.

  • PDF

On Statistical Estimation of Multivariate (Vector-valued) Process Capability Indices with Bootstraps)

  • Cho, Joong-Jae;Park, Byoung-Sun;Lim, Soo-Duck
    • Communications for Statistical Applications and Methods
    • /
    • v.8 no.3
    • /
    • pp.697-709
    • /
    • 2001
  • In this paper we study two vector-valued process capability indices $C_{p}$=($C_{px}$, $C_{py}$ ) and C/aub pm/=( $C_{pmx}$, $C_{pmy}$) considering process capability indices $C_{p}$ and $C_{pm}$ . First, two asymptotic distributions of plug-in estimators $C_{p}$=($C_{px}$, $C_{py}$ ) and $C_{pm}$ =) $C_{pmx}$, $C_{pmy}$) are derived.. With the asymptotic distributions, we propose asymptotic confidence regions for our indices. Next, obtaining the asymptotic distributions of two bootstrap estimators $C_{p}$=($C_{px}$, $C_{py}$ )and $C_{pm}$ =( $C_{pmx}$, $C_{pmy}$) with our bootstrap algorithm, we will provide the consistency of our bootstrap for statistical inference. Also, with the consistency of our bootstrap, we propose bootstrap asymptotic confidence regions for our indices. (no abstract, see full-text)see full-text)e full-text)

  • PDF

Soft computing-based slope stability assessment: A comparative study

  • Kaveh, A.;Hamze-Ziabari, S.M.;Bakhshpoori, T.
    • Geomechanics and Engineering
    • /
    • v.14 no.3
    • /
    • pp.257-269
    • /
    • 2018
  • Analysis of slope stability failures, as one of the complex natural hazards, is one of the important research issues in the field of civil engineering. Present paper adopts and investigates four soft computing-based techniques for this problem: Patient Rule-Induction Method (PRIM), M5' algorithm, Group Method of data Handling (GMDH) and Multivariate Adaptive Regression Splines (MARS). A comprehensive database consisting of 168 case histories is used to calibrate and test the developed models. Six predictive variables including slope height, slope angle, bulk density, cohesion, angle of internal friction, and pore water pressure ratio were considered to generate new models. The results of test studies are used for feasibility, effectiveness and practicality comparison of techniques with each other, and with the other available well-known methods in the literature. Results show that all methods not only are feasible but also result in better performance than previously developed soft computing based predictive models and tools. It is shown that M5' and PRIM algorithms are the most effective and practical prediction models.

Imputation of Multiple Missing Values by Normal Mixture Model under Markov Random Field: Application to Imputation of Pixel Values of Color Image (마코프 랜덤 필드 하에서 정규혼합모형에 의한 다중 결측값 대체기법: 색조영상 결측 화소값 대체에 응용)

  • Kim, Seung-Gu
    • Communications for Statistical Applications and Methods
    • /
    • v.16 no.6
    • /
    • pp.925-936
    • /
    • 2009
  • There very many approaches to impute missing values in the iid. case. However, it is hardly found the imputation techniques in the Markov random field(MRF) case. In this paper, we show that the imputation under MRF is just to impute by fitting the normal mixture model(NMM) under several practical assumptions. Our multivariate normal mixture model based approaches under MRF is applied to impute the missing pixel values of 3-variate (R, G, B) color image, providing a technique to smooth the imputed values.

A Study on Statistical Modeling of Spatial Land-use Change Prediction (토지이용 공간변화 예측의 통계학적 모형에 관한 연구)

  • 김의홍
    • Spatial Information Research
    • /
    • v.5 no.2
    • /
    • pp.177-183
    • /
    • 1997
  • S1he concept of a class in the land-use classification system can be equally applied to a class in the land-use-change classification. The maximum likelihood method using linear discriminant function and Markov transition matrix method were integrated to a synthetic modeling effort in order to project spatial allocation of land-use-change and quantitative assignment of that prediction as a whole. The algorithm of both the multivariate discriminant function and the Markov chain matrix were discussed and the test of synthetic model on the study area was resulted in the projection of '90 year as well as '95 year land -use classification. The accuracy and the issue of modeling improvement were discussed eventually.

  • PDF

Multivariate Analysis of EEG Signal using Intervention Models (개입모형을 이용한 EEG 신호의 다변량 분석에 관한 연구)

  • Im, Seong-Sik;Kim, Jin-Ho;Kim, Chi-Yong;Hwang, Min-Cheol
    • Journal of the Ergonomics Society of Korea
    • /
    • v.18 no.1
    • /
    • pp.13-24
    • /
    • 1999
  • The objective of the study is to discriminate EEG(electroencephalogram) due to emotional changes. Emotion was evoked by the series of auditory stimuli which were selected from the natural sounds in the sound effect collection of compact disc. Seventeen university students participated and experienced positive or negative emotions by six auditory stimuli with intermission between stimuli. Temporal EEG ($T_3$, $T_4$, $T_5$, and $T_6$) was recorded at the same time and a subjective test was performed on the eleven point scales after the experiment. The maximum and minimum scores of the EEG among six stimuli EEG were analyzed for discrimination of emotion. The EEG signals were transformed into feature objects based on scalar intervention model coefficients. Auditory stimulus was considered as intervention variable. They were classified by Discriminant Analysis for each channel. The features showed results with the best classification accuracy of 91.2 % in $T_4$ for auditory stimuli. This study could be extended to establish an algorithm which quantifies and classifies emotions evoked by auditory stimulus using time-series models.

  • PDF