• 제목/요약/키워드: Multivariate Tools

검색결과 48건 처리시간 0.022초

Applications of NMR spectroscopy based metabolomics: a review

  • Yoon, Dahye;Lee, Minji;Kim, Siwon;Kim, Suhkmann
    • 한국자기공명학회논문지
    • /
    • 제17권1호
    • /
    • pp.1-10
    • /
    • 2013
  • Metabolomics is the study which detects the changes of metabolites level. Metabolomics is a terminal view of the biological system. The end products of the metabolism, metabolites, reflect the responses to external environment. Therefore metabolomics gives the additional information about understanding the metabolic pathways. These metabolites can be used as biomarkers that indicate the disease or external stresses such as exposure to toxicant. Many kinds of biological samples are used in metabolomics, for example, cell, tissue, and bio fluids. NMR spectroscopy is one of the tools of metabolomics. NMR data are analyzed by multivariate statistical analysis and target profiling technique. Recently, NMR-based metabolomics is a growing field in various studies such as disease diagnosis, forensic science, and toxicity assessment.

High-resolution 1H NMR Spectroscopy of Green and Black Teas

  • Jeong, Ji-Ho;Jang, Hyun-Jun;Kim, Yongae
    • 대한화학회지
    • /
    • 제63권2호
    • /
    • pp.78-84
    • /
    • 2019
  • High-resolution $^1H$ NMR spectroscopic technique has been widely used as one of the most powerful analytical tools in food chemistry as well as to define molecular structure. The $^1H$ NMR spectra-based metabolomics has focused on classification and chemometric analysis of complex mixtures. The principal component analysis (PCA), an unsupervised clustering method and used to reduce the dimensionality of multivariate data, facilitates direct peak quantitation and pattern recognition. Using a combination of these techniques, the various green teas and black teas brewed were investigated via metabolite profiling. These teas were characterized based on the leaf size and country of cultivation, respectively.

Fused inverse regression with multi-dimensional responses

  • Cho, Youyoung;Han, Hyoseon;Yoo, Jae Keun
    • Communications for Statistical Applications and Methods
    • /
    • 제28권3호
    • /
    • pp.267-279
    • /
    • 2021
  • A regression with multi-dimensional responses is quite common nowadays in the so-called big data era. In such regression, to relieve the curse of dimension due to high-dimension of responses, the dimension reduction of predictors is essential in analysis. Sufficient dimension reduction provides effective tools for the reduction, but there are few sufficient dimension reduction methodologies for multivariate regression. To fill this gap, we newly propose two fused slice-based inverse regression methods. The proposed approaches are robust to the numbers of clusters or slices and improve the estimation results over existing methods by fusing many kernel matrices. Numerical studies are presented and are compared with existing methods. Real data analysis confirms practical usefulness of the proposed methods.

로지스틱 회귀분석을 통한 청년 우울감의 다변량 분석 및 영향 요인 연구 (Multivariate Analysis and Determinants of Youth Depression through Logistic Regression)

  • Seong Eum LEE
    • Journal of Korea Artificial Intelligence Association
    • /
    • 제1권2호
    • /
    • pp.7-13
    • /
    • 2023
  • In this paper, Depression is a mental disorder characterized by a lack of enthusiasm and feelings of sadness, which significantly impairs daily functioning. In 2018, there was an increase in book sales in the essay genre, particularly the popularity of "healing essays." This trend is seen as challenging the negative image and prejudices associated with depression. In 2021, a significant rise in the proportion of 20-year-old patients with depression is attributed to factors like job-related stress, interpersonal issues, and financial burdens. Additionally, there is a strong correlation between depression and suicidal thoughts, particularly among individuals who have experienced feelings of depression. Despite the increasing prevalence of depression among young adults, research in this area is lacking. To address this gap, statistical tools such as logistic regression and chi-squared tests are employed. The analysis reveals various independent variables associated with feelings of depression, shedding light on the relationships between these factors.

Principles of Multivariate Data Visualization

  • Huh, Moon Yul;Cha, Woon Ock
    • Communications for Statistical Applications and Methods
    • /
    • 제11권3호
    • /
    • pp.465-474
    • /
    • 2004
  • Data visualization is the automation process and the discovery process to data sets in an effort to discover underlying information from the data. It provides rich visual depictions of the data. It has distinct advantages over traditional data analysis techniques such as exploring the structure of large scale data set both in the sense of number of observations and the number of variables by allowing great interaction with the data and end-user. We discuss the principles of data visualization and evaluate the characteristics of various tools of visualization according to these principles.

다변량통계기법을 이용한 지하저장시설 주변의 지하수질 변동에 관한 연구 (Use of Multivariate Statistical Approaches for Decoding Chemical Evolution of Groundwater near Underground Storage Caverns)

  • 이정훈
    • 한국지구과학회지
    • /
    • 제35권4호
    • /
    • pp.225-236
    • /
    • 2014
  • 다변량통계기법은 수리지구화학 자료의 분석 및 해석에 많이 이용되어 왔다. 본 연구에서 대응분석과 주성분분석을 동시에 사용하여 인위적인 활동에 의한 지하수의 특징을 살펴보았다. 본 연구의 목적은 NETPATH 프로그램 속의 WATEQ4F를 이용하여 지하수 화학성분의 분화를 계산하고 이를 다변량통계기법을 이용하여 지구화학적인 정보를 추출하는 것이다. 연구지역은 한반도의 남동쪽에 위치한 울산의 LPG 저장시설이다. 본 연구지역에서는 다른 저장시설에서 관찰되는 초염기성의 조성을 가지는 지하수가 관찰되었다. 이러한 인위적인 영향에 의한 높은 pH를 가지는 지하수로 인해 Al의 분화특징과 탄산염의 침전을 유발할 수 있다. 본 연구에서는 연구지역에 지하수에 영향을 주는 두 인위적인 요소(세정작용와 시멘트영향)에 의해서 수리지구화학적인 특징과 상이 어떻게 변하는 가에 초점을 두었다. 이전 연구결과와 두 통계분석을 통해 제시된 결과를 비교하여 지구화학적인 정보를 이용한 주성분분석과 대응분석인 수리지구화학 연구에서 기초연구로 활용될 수 있음을 알 수 있다.

Estimation of Brain Connectivity during Motor Imagery Tasks using Noise-Assisted Multivariate Empirical Mode Decomposition

  • Lee, Ki-Baek;Kim, Ko Keun;Song, Jaeseung;Ryu, Jiwoo;Kim, Youngjoo;Park, Cheolsoo
    • Journal of Electrical Engineering and Technology
    • /
    • 제11권6호
    • /
    • pp.1812-1824
    • /
    • 2016
  • The neural dynamics underlying the causal network during motor planning or imagery in the human brain are not well understood. The lack of signal processing tools suitable for the analysis of nonlinear and nonstationary electroencephalographic (EEG) hinders such analyses. In this study, noise-assisted multivariate empirical mode decomposition (NA-MEMD) is used to estimate the causal inference in the frequency domain, i.e., partial directed coherence (PDC). Natural and intrinsic oscillations corresponding to the motor imagery tasks can be extracted due to the data-driven approach of NA-MEMD, which does not employ predefined basis functions. Simulations based on synthetic data with a time delay between two signals demonstrated that NA-MEMD was the optimal method for estimating the delay between two signals. Furthermore, classification analysis of the motor imagery responses of 29 subjects revealed that NA-MEMD is a prerequisite process for estimating the causal network across multichannel EEG data during mental tasks.

Value at Risk of portfolios using copulas

  • Byun, Kiwoong;Song, Seongjoo
    • Communications for Statistical Applications and Methods
    • /
    • 제28권1호
    • /
    • pp.59-79
    • /
    • 2021
  • Value at Risk (VaR) is one of the most common risk management tools in finance. Since a portfolio of several assets, rather than one asset portfolio, is advantageous in the risk diversification for investment, VaR for a portfolio of two or more assets is often used. In such cases, multivariate distributions of asset returns are considered to calculate VaR of the corresponding portfolio. Copulas are one way of generating a multivariate distribution by identifying the dependence structure of asset returns while allowing many different marginal distributions. However, they are used mainly for bivariate distributions and are not widely used in modeling joint distributions for many variables in finance. In this study, we would like to examine the performance of various copulas for high dimensional data and several different dependence structures. This paper compares copulas such as elliptical, vine, and hierarchical copulas in computing the VaR of portfolios to find appropriate copula functions in various dependence structures among asset return distributions. In the simulation studies under various dependence structures and real data analysis, the hierarchical Clayton copula shows the best performance in the VaR calculation using four assets. For marginal distributions of single asset returns, normal inverse Gaussian distribution was used to model asset return distributions, which are generally high-peaked and heavy-tailed.

Non-Destructive Sorting Techniques for Viable Pepper (Capsicum annuum L.) Seeds Using Fourier Transform Near-Infrared and Raman Spectroscopy

  • Seo, Young-Wook;Ahn, Chi Kook;Lee, Hoonsoo;Park, Eunsoo;Mo, Changyeun;Cho, Byoung-Kwan
    • Journal of Biosystems Engineering
    • /
    • 제41권1호
    • /
    • pp.51-59
    • /
    • 2016
  • Purpose: This study examined the performance of two spectroscopy methods and multivariate classification methods to discriminate viable pepper seeds from their non-viable counterparts. Methods: A classification model for viable seeds was developed using partial least square discrimination analysis (PLS-DA) with Fourier transform near-infrared (FT-NIR) and Raman spectroscopic data in the range of $9080-4150cm^{-1}$ (1400-2400 nm) and $1800-970cm^{-1}$, respectively. The datasets were divided into 70% to calibration and 30% to validation. To reduce noise from the spectra and compare the classification results, preprocessing methods, such as mean, maximum, and range normalization, multivariate scattering correction, standard normal variate, and $1^{st}$ and $2^{nd}$ derivatives with the Savitzky-Golay algorithm were used. Results: The classification accuracies for calibration using FT-NIR and Raman spectroscopy were both 99% with first derivative, whereas the validation accuracies were 90.5% with both multivariate scattering correction and standard normal variate, and 96.4% with the raw data (non-preprocessed data). Conclusions: These results indicate that FT-NIR and Raman spectroscopy are valuable tools for a feasible classification and evaluation of viable pepper seeds by providing useful information based on PLS-DA and the threshold value.

건설투자(建設投資)의 단기예측모형(短期豫測模型) 비교(比較) (Short-term Construction Investment Forecasting Model in Korea)

  • 김관영;이창수
    • KDI Journal of Economic Policy
    • /
    • 제14권1호
    • /
    • pp.121-145
    • /
    • 1992
  • 본고(本稿)에서는 현재의 경제상황을 잘 반영하는 건설투자활동(建設投資活動)의 단기예측모형(短期豫測模型)을 정립하고자 먼저 관련 시계열자료의 안정성(安定性) 여부(與否)와 순환성(循環性), 계절성(季節性)의 특성을 살펴본 후 여러 단기모형의 예측력(豫測力), 정합성(整合性), 설명력(說明力)을 비교 검토했다. 단위근(單位根) 검정(檢定)과 자기상관계수(自己相關係數) 스펙트랄 밀도함수 분석의 결과, 건설관련 시계열자료들이 대체로 단위근(單位根)을 갖지 않음으로써 안정적이고 주기적인 순환변동을 하고 있으며, 시차변수의 설명력이 높은 특성을 나타내었다. 또한 건설투자자료의 특성이 선행지표(先行指標)인 건축허가연면적(建築許可延面積) 및 건설수주액(建設受注額)과 아주 유사하여 건설투자 단기예측에 있어서 두 지표 사이의 시차관계(時差關係) 파악이 중요함을 알 수 있었다. 제(第)III장(章)에서는 단변량(單變量) 시계열모형(時系列模型)으로 ARIMA모형(模型)과 승법선형추세예측모형(乘法線型趨勢豫測模型)을, 다변량(多變量) 시계열모형(時系列模型)으로는 첫째, 선행지표(先行指標)를 이용한 1차자기회귀모형(次自己回歸模型), VAR모형(模型), 둘째 GNP자료를 이용한 거시경제모형의 단순한 축약형모형(縮約型模型)과 VAR모형(模型)을 제시하고 이들을 비교 평가하였다. 이에 따르면 단변량 시계열모형보다는 다변량 시계열모형이 시간이 경과할수록 예측오차(豫測誤差)가 커지지 않는다는 점에서 우수한 것으로 나타났으며, 다변량모형 중에서도 벡터자기회귀모형이 여타 모형보다 절대예측오차평균(絶對豫測誤差平均), 평균자승근(平均自乘根) 퍼센트 오차(誤差), 결정계수(決定係數) 등 모든 면에서 우수한 것으로 평가되었다. 이는 최근 건설투자가 추세에서 벗어난 급증세를 지속하고 있음을 고려할 때 타당한 결론이라 생각된다.

  • PDF