• Title/Summary/Keyword: Multivariate Techniques

Search Result 216, Processing Time 0.03 seconds

Study on Rainfall Regional Frequency Analysis (강우 지역빈도해석의 적용성 연구)

  • Shin Hong Joon;Nam Woo Sung;Heo Jun Haeng;Kim Kyung Duk
    • Proceedings of the Korea Water Resources Association Conference
    • /
    • 2005.05b
    • /
    • pp.593-598
    • /
    • 2005
  • At-site analysis is not appropriate if the record length is shorter than target return period T. If the record length is longer than 27 years, then at-site analysis may be sufficient(Institute of Hydrology, 1999). However, in such a case, regional frequency analysis is recommended for purpose of comparison. Record lengths of annual maximum rainfall data in Korea are usually shorter than 50 years. It is therefore essential to apply regional frequency analysis for estimating rainfall quantiles of more than 100 years return period. In this research, regional rainfall frequency analysis is performed for hourly rainfall data of South Korea. Homogeneous regions are idntified by clusgter analysis which is a standard method of statistical multivariate analysis for dividing a data set into groups. An appropriate distribution is chosen by goodness-of-fit test. GLO is found to be an appropriate distribution as a result of goodness-of-fit measure (Hosking & Wallis, 1997). Simulation experiments are performed to check the performance of frequency analysis techniques. The effects of discordant sites on quantiles are considered.

  • PDF

NEW DEVELOPED PORTABLE NEAR INFRARED (NIR) SYSTEM USING MICROSPECTROMETER

  • Woo, Young-Ah;Ha, Tae-Kyu;Kim, Jae-Min;Kim, Hyo-Jin
    • Proceedings of the Korean Society of Near Infrared Spectroscopy Conference
    • /
    • 2001.06a
    • /
    • pp.1123-1123
    • /
    • 2001
  • In recent years, a miniature spectrometer has been extensively developed due to the marriage of fiber optics and semiconductor detector array. This type of miniature spectrometer has advantages of low price and robustness due to the capability of mass production and no moving parts are required such as lenses, mirrors and scanning monochromator. These systems are ideal for use in teaching labs, process monitoring and field analyses. A portable near infrared (NIR) system has been developed for qualitative and quantitative analysis. This system includes a tungsten halogen lamp for light source, a fiber optics connected a light source, and a sample module to the microspectrometer, The size of spectrometer can be as small as 2.5 cm x 1.5 cm x 0.1 cm. Wavelength ranges can be chosen as 360-800 nm, 800-1100 nm and 1100-1900 nm depending on the type of detector. The software consists of various tools for multivariate analysis and pattern recognition techniques. To evaluate the system, long and short-term stability, wavelength accuracy, and stray light have been investigated and compared with conventional scanning type NIR spectrometer. This developed system can be sufficiently used for quantitative and qualitative analysis for various samples such as agricultural product, herbal medicine, food, petroleum, and pharmaceuticals, etc.

  • PDF

Removing Non-informative Features by Robust Feature Wrapping Method for Microarray Gene Expression Data (유전자 알고리즘과 Feature Wrapping을 통한 마이크로어레이 데이타 중복 특징 소거법)

  • Lee, Jae-Sung;Kim, Dae-Won
    • Journal of KIISE:Software and Applications
    • /
    • v.35 no.8
    • /
    • pp.463-478
    • /
    • 2008
  • Due to the high dimensional problem, typically machine learning algorithms have relied on feature selection techniques in order to perform effective classification in microarray gene expression datasets. However, the large number of features compared to the number of samples makes the task of feature selection computationally inprohibitive and prone to errors. One of traditional feature selection approach was feature filtering; measuring one gene per one step. Then feature filtering was an univariate approach that cannot validate multivariate correlations. In this paper, we proposed a function for measuring both class separability and correlations. With this approach, we solved the problem related to feature filtering approach.

Prediction of High Level Ozone Concentration in Seoul by Using Multivariate Statistical Analyses (다변량 통계분석을 이용한 서울시 고농도 오존의 예측에 관한 연구)

  • 허정숙;김동술
    • Journal of Korean Society for Atmospheric Environment
    • /
    • v.9 no.3
    • /
    • pp.207-215
    • /
    • 1993
  • In order to statistically predict $O_3$ levels in Seoul, the study used the TMS (telemeted air monitoring system) data from the Department of Environment, which have monitored at 20 sites in 1989 and 1990. Each data in each site was characterized by 6 major criteria pollutants ($SO_2, TSP, CO, NO_2, THC, and O_3$) and 2 meteorological parameters, such as wind speed and wind direction. To select proper variables and to determine each pollutant's behavior, univariate statistical analyses were extensively studied in the beginning, and then various applied statistical techniques like cluster analysis, regression analysis, and expert system have been intensively examined. For the initial study of high level $O_3$ prediction, the raw data set in each site was separated into 2 group based on 60 ppb $O_3$ level. A hierarchical cluster analysis was applied to classify the group based on 60 ppb $O_3$ into small calsses. Each class in each site has its own pattern. Next, multiple regression for each class was repeatedly applied to determine an $O_3$ prediction submodel and to determine outliers in each class based on a certain level of standardized redisual. Thus, a prediction submodel for each homogeneous class could be obtained. The study was extended to model $O_3$ prediction for both on-time basis and 1-hr after basis. Finally, an expect system was used to build a unified classification rule based on examples of the homogenous classes for all of sites. Thus, a concept of high level $O_3$ prediction model was developed for one of $O_3$ alert systems.

  • PDF

Factors Affecting the Intention to Use Digital Banking in Vietnam

  • NGUYEN, Oanh Thi
    • The Journal of Asian Finance, Economics and Business
    • /
    • v.7 no.3
    • /
    • pp.303-310
    • /
    • 2020
  • The study aims to evaluate the factors affecting the intention to use digital banking in Vietnam. Multivariate data analysis techniques (Cronbach's Alpha test, Confirmatory Factor Analysis, Structure equation model) are used for the survey data collected from 201 customers who have access to digital banking. The analysis results show that: (1) attitude towards the service, perceived usefulness has a positive impacts on the intention to use; (2) convenience does not affect the intention to use digital banking services; (3) perceived usefulness factor has a positive effect on the attitude towards the service; (4) The perceived risk has a negative impact on attitude towards the service; (5) trust has no effect on the attitude towards the service; (6) ease of use has a positive impact on perceived usefulness; (7) trust has a positive effect on perceived risk. From the results of this study, perceived usefulness has a positive effect on attitude and intention to use the service. Therefore, it is necessary to enhance the sense of the usefulness of customers through media advertising and consulting so that customers fully understand the benefits brought about by using digital banking services. Perceived risk has a negative impact on attitude towards the service.

Milling tool wear forecast based on the partial least-squares regression analysis

  • Xu, Chuangwen;Chen, Hualing
    • Structural Engineering and Mechanics
    • /
    • v.31 no.1
    • /
    • pp.57-74
    • /
    • 2009
  • Power signals resulting from spindle and feed motor, present a rich content of physical information, the appropriate analysis of which can lead to the clear identification of the nature of the tool wear. The partial least-squares regression (PLSR) method has been established as the tool wear analysis method for this purpose. Firstly, the results of the application of widely used techniques are given and their limitations of prior methods are delineated. Secondly, the application of PLSR is proposed. The singular value theory is used to noise reduction. According to grey relational degree analysis, sample variable is filtered as part sample variable and all sample variables as independent variables for modelling, and the tool wear is taken as dependent variable, thus PLSR model is built up through adapting to several experimental data of tool wear in different milling process. Finally, the prediction value of tool wear is compare with actual value, in order to test whether the model of the tool wear can adopt to new measuring data on the independent variable. In the new different cutting process, milling tool wear was predicted by the methods of PLSR and MLR (Multivariate Linear Regression) as well as BPNN (BP Neural Network) at the same time. Experimental results show that the methods can meet the needs of the engineering and PLSR is more suitable for monitoring tool wear.

The Evaluation of Long-Term Generation Portfolio Considering Uncertainty (불확실성을 고려한 장기 전원 포트폴리오의 평가)

  • Chung, Jae-Woo;Min, Dai-Ki
    • Journal of the Korean Operations Research and Management Science Society
    • /
    • v.37 no.3
    • /
    • pp.135-150
    • /
    • 2012
  • This paper presents a portfolio model for a long-term power generation mix problem. The proposed portfolio model evaluates generation mix by considering the tradeoffs between the expected cost for power generation and its variability. Unlike conventional portfolio models measuring variance, we introduce Conditional Value-at-Risk (CVaR) in designing the variability with aims to considering events that are enormously expensive but are rare such as nuclear power plant accidents. Further, we consider uncertainties associated with future electricity demand, fuel prices and their correlations, and capital costs for power plant investments. To obtain an objective generation by each energy source, we employ the sample average approximation method that approximates the stochastic objective function by taking the average of large sample values so that provides asymptotic convergence of optimal solutions. In addition, the method includes Monte Carlo simulation techniques in generating random samples from multivariate distributions. Applications of the proposed model and method are demonstrated through a case study of an electricity industry with nuclear, coal, oil (OCGT), and LNG (CCGT) in South Korea.

Assessment of Educational Conditions for 28 National Universities in South Korea

  • Jeong, Dong-Bin
    • Asian Journal of Business Environment
    • /
    • v.7 no.1
    • /
    • pp.25-29
    • /
    • 2017
  • Purpose - In this paper, we categorize and segment the 28 national universities in South Korea and measure the degree of dissimilarity (or similarity) between pairs of ones by using dissimilarity distance matrix and cluster analysis, respectively, based on the seven quantitative evaluation of educational conditions (percentage of small-scale courses, percentage of lecture by the faculty, collection of books per student, material purchase per student, percentage of building capacity, percentage of real estate capacity and rate of accommodation) in 2015. In addition, multidimensional scaling (MDS) techniques can obtain visual representation for exploring patterns of proximities among 28 national universities based on seven attributes of educational conditions. Research design, data, and methodology - This work is carried out by the 2015 Announcement of University Information, which is provided by Ministry of Education in South Korea and utilized by multivariate analyses with CLUSTER, PROXIMITIES and ALSCAL modules in IBM SPSS 23.0. Results - We make certain that 28 national universities can be categorized into five clusters which have similar traits by applying two-stage cluster analysis. MDS is utilized to perform positioning of grouped places of cluster and 28 national universities joining every cluster. Conclusions - Both types and traits of each national university can be relatively assessed and practically utilized for each university competitiveness based on underlying results.

Imputation of Multiple Missing Values by Normal Mixture Model under Markov Random Field: Application to Imputation of Pixel Values of Color Image (마코프 랜덤 필드 하에서 정규혼합모형에 의한 다중 결측값 대체기법: 색조영상 결측 화소값 대체에 응용)

  • Kim, Seung-Gu
    • Communications for Statistical Applications and Methods
    • /
    • v.16 no.6
    • /
    • pp.925-936
    • /
    • 2009
  • There very many approaches to impute missing values in the iid. case. However, it is hardly found the imputation techniques in the Markov random field(MRF) case. In this paper, we show that the imputation under MRF is just to impute by fitting the normal mixture model(NMM) under several practical assumptions. Our multivariate normal mixture model based approaches under MRF is applied to impute the missing pixel values of 3-variate (R, G, B) color image, providing a technique to smooth the imputed values.

Application of Metabolomics to Quality Control of Natural Product Derived Medicines

  • Lee, Kyung-Min;Jeon, Jun-Yeong;Lee, Byeong-Ju;Lee, Hwanhui;Choi, Hyung-Kyoon
    • Biomolecules & Therapeutics
    • /
    • v.25 no.6
    • /
    • pp.559-568
    • /
    • 2017
  • Metabolomics has been used as a powerful tool for the analysis and quality assessment of the natural product (NP)-derived medicines. It is increasingly being used in the quality control and standardization of NP-derived medicines because they are composed of hundreds of natural compounds. The most common techniques that are used in metabolomics consist of NMR, GC-MS, and LC-MS in combination with multivariate statistical analyses including principal components analysis (PCA) and partial least squares-discriminant analysis (PLS-DA). Currently, the quality control of the NP-derived medicines is usually conducted using HPLC and is specified by one or two indicators. To create a superior quality control framework and avoid adulterated drugs, it is necessary to be able to determine and establish standards based on multiple ingredients using metabolic profiling and fingerprinting. Therefore, the application of various analytical tools in the quality control of NP-derived medicines forms the major part of this review. $Veregen^{(R)}$ (Medigene AG, Planegg/Martinsried, Germany), which is the first botanical prescription drug approved by US Food and Drug Administration, is reviewed as an example that will hopefully provide future directions and perspectives on metabolomics technologies available for the quality control of NP-derived medicines.