• 제목/요약/키워드: Vector data model

검색결과 1,180건 처리시간 0.029초

L1-norm regularization을 통한 SGMM의 state vector 적응 (L1-norm Regularization for State Vector Adaptation of Subspace Gaussian Mixture Model)

  • 구자현;김영관;김회린
    • 말소리와 음성과학
    • /
    • 제7권3호
    • /
    • pp.131-138
    • /
    • 2015
  • In this paper, we propose L1-norm regularization for state vector adaptation of subspace Gaussian mixture model (SGMM). When you design a speaker adaptation system with GMM-HMM acoustic model, MAP is the most typical technique to be considered. However, in MAP adaptation procedure, large number of parameters should be updated simultaneously. We can adopt sparse adaptation such as L1-norm regularization or sparse MAP to cope with that, but the performance of sparse adaptation is not good as MAP adaptation. However, SGMM does not suffer a lot from sparse adaptation as GMM-HMM because each Gaussian mean vector in SGMM is defined as a weighted sum of basis vectors, which is much robust to the fluctuation of parameters. Since there are only a few adaptation techniques appropriate for SGMM, our proposed method could be powerful especially when the number of adaptation data is limited. Experimental results show that error reduction rate of the proposed method is better than the result of MAP adaptation of SGMM, even with small adaptation data.

Cointegration Analysis with Mixed-Frequency Data of Quarterly GDP and Monthly Coincident Indicators

  • Seong, Byeongchan
    • 응용통계연구
    • /
    • 제25권6호
    • /
    • pp.925-932
    • /
    • 2012
  • The article introduces a method to estimate a cointegrated vector autoregressive model, using mixed-frequency data, in terms of a state-space representation of the vector error correction(VECM) of the model. The method directly estimates the parameters of the model, in a state-space form of its VECM representation, using the available data in its mixed-frequency form. Then it allows one to compute in-sample smoothed estimates and out-of-sample forecasts at their high-frequency intervals using the estimated model. The method is applied to a mixed-frequency data set that consists of the quarterly real gross domestic product and three monthly coincident indicators. The result shows that the method produces accurate smoothed and forecasted estimates in comparison to a method based on single-frequency data.

Estimating global solar radiation using wavelet and data driven techniques

  • Kim, Sungwon;Seo, Youngmin
    • 한국수자원학회:학술대회논문집
    • /
    • 한국수자원학회 2015년도 학술발표회
    • /
    • pp.475-478
    • /
    • 2015
  • The objective of this study is to apply a hybrid model for estimating solar radiation and investigate their accuracy. A hybrid model is wavelet-based support vector machines (WSVMs). Wavelet decomposition is employed to decompose the solar radiation time series into approximation and detail components. These decomposed time series are then used as inputs of support vector machines (SVMs) modules in the WSVMs model. Results obtained indicate that WSVMs can successfully be used for the estimation of daily global solar radiation at Champaign and Springfield stations in Illinois.

  • PDF

Chatbot Design Method Using Hybrid Word Vector Expression Model Based on Real Telemarketing Data

  • Zhang, Jie;Zhang, Jianing;Ma, Shuhao;Yang, Jie;Gui, Guan
    • KSII Transactions on Internet and Information Systems (TIIS)
    • /
    • 제14권4호
    • /
    • pp.1400-1418
    • /
    • 2020
  • In the development of commercial promotion, chatbot is known as one of significant skill by application of natural language processing (NLP). Conventional design methods are using bag-of-words model (BOW) alone based on Google database and other online corpus. For one thing, in the bag-of-words model, the vectors are Irrelevant to one another. Even though this method is friendly to discrete features, it is not conducive to the machine to understand continuous statements due to the loss of the connection between words in the encoded word vector. For other thing, existing methods are used to test in state-of-the-art online corpus but it is hard to apply in real applications such as telemarketing data. In this paper, we propose an improved chatbot design way using hybrid bag-of-words model and skip-gram model based on the real telemarketing data. Specifically, we first collect the real data in the telemarketing field and perform data cleaning and data classification on the constructed corpus. Second, the word representation is adopted hybrid bag-of-words model and skip-gram model. The skip-gram model maps synonyms in the vicinity of vector space. The correlation between words is expressed, so the amount of information contained in the word vector is increased, making up for the shortcomings caused by using bag-of-words model alone. Third, we use the term frequency-inverse document frequency (TF-IDF) weighting method to improve the weight of key words, then output the final word expression. At last, the answer is produced using hybrid retrieval model and generate model. The retrieval model can accurately answer questions in the field. The generate model can supplement the question of answering the open domain, in which the answer to the final reply is completed by long-short term memory (LSTM) training and prediction. Experimental results show which the hybrid word vector expression model can improve the accuracy of the response and the whole system can communicate with humans.

벡터오차수정모형을 이용한 유럽 탄소배출권가격 분석 (The analysis of EU carbon trading and energy prices using vector error correction model)

  • 부기덕;정기호
    • Journal of the Korean Data and Information Science Society
    • /
    • 제22권3호
    • /
    • pp.401-412
    • /
    • 2011
  • 본 연구는 벡터오차수정모형을 이용하여 유럽 탄소배출권 현물가격의 일간 시계열자료를 분석한다. 내생변수로는 탄소배출권가격 이외에 오일가격, 천연가스가격, 전력가격, 석탄가격 등 모두 5개 변수를 고려하며, 분석기간은 유럽 배출권가격의 왜곡이 발생한 제1단계 기간 (2005~2007년)을 피해 제2단계 기간 (2008년 4월 21일~2010년 3월 31일)을 대상으로 하였다. 시계열변수의 안정성 및 공적분 검정 결과, 모든 변수들이 단위근을 갖으며 또한 공적분 벡터가 존재하는 것으로 나타나서 분석모형으로서 벡터자기회귀모형 대신에 벡터오차수정모형을 채택하였다. 분석결과, (1) 오일, 천연가스, 전력 등의 가격이 배출권가격에 대해 원인으로 작용하는 그랜저인과관계가 존재하였다. (2) 충격 반응분석에서 배출권가격은 오일가격의 외생적 충격에 대해 가장 크게 반응하였고, 석탄가격의 충격에 대해서는 초기 상승 후 하락, 전력가격과 천연가스가격의 충격에 대해서는 초기 상승 후 음 (-)으로 감소하는 반응을 보였다. (3) 예측오차 분산분해 분석에서 배출권가격에 대해 가장 큰 영향을 주는 요인은 초기 (3기)에는 오일가격>석탄가격>천연가스가격>전력가격의 순이었으나 이후 (20기)에는 전력가격>오일가격>석탄가격>천연가스가격의 순으로 나타났다.

최소제곱 서포트벡터기계를 이용한 시장점유율 자료 분석 (Analysis of market share attraction data using LS-SVM)

  • 박혜정
    • Journal of the Korean Data and Information Science Society
    • /
    • 제20권5호
    • /
    • pp.879-886
    • /
    • 2009
  • 본 논문에서는 시장점유율을 추정할 때 최소제곱 서포트벡터기계를 적용하여 보통최소제곱과 최소제곱 서포트벡터기계의 성능을 비교하고자 한다. 최소제곱 서포트벡터기계는 커널 함수를 사용함으로 고차원의 특징 공간에서 선형회귀로 재구성함으로 비선형 회귀문제까지도 해결할 수 있는 장점을 가지고 있다. 그래서 본 논문에서는 비모수 기법인 최소제곱 서포트벡터기계를 이용하여 시장점유율 모형을 추정하고자 한다. 최소제곱 서포트벡터기계를 기반으로 한 모형 추정은 시장점유율 유인모형을 해결하기 위한 좋은 대안이 된다. 최소제곱 서포트벡터기계의 성능을 평가하기 위해 비교 실험에서는 한국 자동차 시장에서 차량 판매량을 이용하여 브랜드별 시장점유율 모형을 추정하였다.

  • PDF

비선형 평균 일반화 이분산 자기회귀모형의 추정 (Estimation of nonlinear GARCH-M model)

  • 심주용;이장택
    • Journal of the Korean Data and Information Science Society
    • /
    • 제21권5호
    • /
    • pp.831-839
    • /
    • 2010
  • 최소제곱 서포트벡터기계는 비선형회귀분석과 분류에 널리 쓰이는 커널기법이다. 본 논문에서는 금융시계열자료의 평균 및 변동성을 추정하기 위하여 평균의 추정 방법으로는 가중최소제곱 서포트벡터기계, 변동성의 추정 방법으로는 최소제곱 서포트벡터기계를 사용하는 비선형 평균 일반화 이분산 자기회귀모형을 제안한다. 제안된 모형은 선형 일반화 이분산 자기회귀모형 및 선형 평균 일반화 이분산 자기회귀모형보다 더 나은 추정 능력을 가진다는 것을 실제자료의 추정을 통하여 보였다.

Estimating Hydrodynamic Coefficients of Real Ships Using AIS Data and Support Vector Regression

  • Hoang Thien Vu;Jongyeol Park;Hyeon Kyu Yoon
    • 한국해양공학회지
    • /
    • 제37권5호
    • /
    • pp.198-204
    • /
    • 2023
  • In response to the complexity and time demands of conventional methods for estimating the hydrodynamic coefficients, this study aims to revolutionize ship maneuvering analysis by utilizing automatic identification system (AIS) data and the Support Vector Regression (SVR) algorithm. The AIS data were collected and processed to remove outliers and impute missing values. The rate of turn (ROT), speed over ground (SOG), course over ground (COG) and heading (HDG) in AIS data were used to calculate the rudder angle and ship velocity components, which were then used as training data for a regression model. The accuracy and efficiency of the algorithm were validated by comparing SVR-based estimated hydrodynamic coefficients and the original hydrodynamic coefficients of the Mariner class vessel. The validated SVR algorithm was then applied to estimate the hydrodynamic coefficients for real ships using AIS data. The turning circle test wassimulated from calculated hydrodynamic coefficients and compared with the AIS data. The research results demonstrate the effectiveness of the SVR model in accurately estimating the hydrodynamic coefficients from the AIS data. In conclusion, this study proposes the viability of employing SVR model and AIS data for accurately estimating the hydrodynamic coefficients. It offers a practical approach to ship maneuvering prediction and control in the maritime industry.

Weighted Support Vector Machines for Heteroscedastic Regression

  • Park, Hye-Jung;Hwang, Chang-Ha
    • Journal of the Korean Data and Information Science Society
    • /
    • 제17권2호
    • /
    • pp.467-474
    • /
    • 2006
  • In this paper we present a weighted support vector machine(SVM) and a weighted least squares support vector machine(LS-SVM) for the prediction in the heteroscedastic regression model. By adding weights to standard SVM and LS-SVM the better fitting ability can be achieved when errors are heteroscedastic. In the numerical studies, we illustrate the prediction performance of the proposed procedure by comparing with the procedure which combines standard SVM and LS-SVM and wild bootstrap for the prediction.

  • PDF

Support Vector Machine을 이용한 지능형 신용평가시스템 개발 (Development of Intelligent Credit Rating System using Support Vector Machines)

  • 김경재
    • 한국정보통신학회논문지
    • /
    • 제9권7호
    • /
    • pp.1569-1574
    • /
    • 2005
  • In this paper, I propose an intelligent credit rating system using a bankruptcy prediction model based on support vector machines (SVMs). SVMs are promising methods because they use a risk function consisting of the empirical error and a regularized term which is derived from the structural risk minimization principle. This study examines the feasibility of applying SVM in Predicting corporate bankruptcies by comparing it with other data mining techniques. In addition. this study presents architecture and prototype of intelligeht credit rating systems based on SVM models.