Search | Korea Science

Singing Voice Synthesis Using HMM Based TTS and MusicXML (HMM 기반 TTS와 MusicXML을 이용한 노래음 합성)

Khan, Najeeb Ullah;Lee, Jung-Chul
- Journal of the Korea Society of Computer and Information, v.20 no.5, pp.53-63, 2015
Singing voice synthesis is the generation of a song by a computer, given its lyrics and musical notes. Hidden Markov models (HMMs) have proven to be the models of choice for text-to-speech synthesis. HMMs have also been used in singing voice synthesis research; however, training HMMs for singing voice synthesis requires a huge database. In addition, commercially available singing voice synthesis systems use piano roll notation, and would need to adopt easy-to-read standard music notation to be suitable for singing learning applications. To overcome these problems, we use a speech database to train context-dependent HMMs for singing voice synthesis. Pitch and duration control methods are devised to modify the parameters of the HMMs trained on speech so that they can serve as the synthesis units for the singing voice. This work describes a singing voice synthesis system that uses a MusicXML-based music score editor as the front-end interface for entering the notes and lyrics to be synthesized, and an HMM-based text-to-speech synthesis system as the back-end synthesizer. A perceptual test shows the feasibility of the proposed system.
https://doi.org/10.9708/jksci.2015.20.5.053

Effects of Financial College Tuition Support by Korean Parents using a Hierarchical Bayes Model (계층적 베이즈 모형을 이용한 대학등록금에 대한 부모님의 경제적 지원 영향 분석)

Oh, Man-Suk;Oh, Hyun Sook;Oh, Min Jung
- The Korean Journal of Applied Statistics, v.26 no.2, pp.267-280, 2013
College tuition is a significant economic, social, and political issue in Korea. We conduct a Bayesian analysis of a hierarchical model to identify the factors related to college tuition, based on survey data collected by Statistics Korea. A binary response variable is defined according to whether more than 70% of tuition costs are supported by parents, and a hierarchical probit model is constructed with areas as groups. A set of explanatory variables is selected from a factor analysis of the variables available in the survey. A Markov chain Monte Carlo algorithm is used to estimate the parameters. The analysis shows that income and stress are significantly related to college tuition support from parents. Parents with high income tend to support their children's college tuition, and students with parental financial support tend to be less mentally stressed; this suggests that the economic status of parents significantly affects the mental health of college students. Gender, a healthy lifestyle, and college satisfaction are not significant factors. Comparing areas in terms of the degree of correlation between stress or income and tuition support from parents, students in Kangwon-do are the most mentally stressed when parental support is limited; in addition, the positive correlation between parental support and income is stronger in big cities than in provincial areas.
https://doi.org/10.5351/KJAS.2013.26.2.267

Survival Analysis for White Non-Hispanic Female Breast Cancer Patients

Khan, Hafiz Mohammad Rafiqullah;Saxena, Anshul;Gabbidon, Kemesha;Stewart, Tiffanie Shauna-Jeanne;Bhatt, Chintan
- Asian Pacific Journal of Cancer Prevention, v.15 no.9, pp.4049-4054, 2014
Background: Race and ethnicity are significant factors in predicting survival time of breast cancer patients. In this study, we applied advanced statistical methods to predict the survival of White non-Hispanic female breast cancer patients who were diagnosed between 1973 and 2009 in the United States (U.S.). Materials and Methods: Demographic data from the Surveillance Epidemiology and End Results (SEER) database were used for this study. Nine states were randomly selected from 12 U.S. cancer registries. A stratified random sampling method was used to select 2,000 female breast cancer patients from these nine states. We compared four types of advanced statistical probability models to identify the best-fit model for the White non-Hispanic female breast cancer survival data. Three model building criteria were used to measure and compare the goodness of fit of the models: the Akaike Information Criterion (AIC), the Bayesian Information Criterion (BIC), and the Deviance Information Criterion (DIC). In addition, we used a novel Bayesian method and the Markov Chain Monte Carlo technique to determine the posterior density function of the parameters. After evaluating the model parameters, we selected the model with the lowest DIC value. Using this Bayesian method, we derived the predictive survival density for future survival time and its related inferences. Results: The analytical sample of White non-Hispanic women included 2,000 breast cancer cases from the SEER database (1973-2009). The majority of cases were married (55.2%), the mean age at diagnosis was 63.61 years (SD = 14.24), and the mean survival time was 84 months (SD = 35.01). After comparing the four statistical models, results suggested that the exponentiated Weibull model (DIC = 19818.220) was a better fit for White non-Hispanic females' breast cancer survival data. This model predicted the survival times (in months) for White non-Hispanic women after implementation of precise estimates of the model parameters. Conclusions: By using modern model building criteria, we determined that the data best fit the exponentiated Weibull model. We incorporated precise estimates of the parameters into the predictive model and evaluated the survival inference for the White non-Hispanic female population. This method of analysis will assist researchers in drawing scientific and clinical conclusions when assessing survival time of breast cancer patients.
https://doi.org/10.7314/APJCP.2014.15.9.4049
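The abstract above ranks candidate survival models by information criteria. As a rough illustration of how AIC and BIC are computed from a fitted model's maximized log-likelihood, here is a minimal sketch. The function names and the log-likelihood values are hypothetical and are not taken from the paper; only the sample size of 2,000 comes from the abstract. DIC is not covered, since it requires posterior samples.

```python
import math


def aic(log_likelihood, n_params):
    """Akaike Information Criterion: 2k - 2 ln L (lower is better)."""
    return 2 * n_params - 2 * log_likelihood


def bic(log_likelihood, n_params, n_obs):
    """Bayesian Information Criterion: k ln n - 2 ln L (lower is better)."""
    return n_params * math.log(n_obs) - 2 * log_likelihood


def best_by(scores):
    """Return the name of the model with the lowest criterion value."""
    return min(scores, key=scores.get)


# Hypothetical fits: (maximized log-likelihood, number of parameters).
# These numbers are invented for illustration only.
fits = {"model_A": (-9950.0, 2), "model_B": (-9905.0, 3)}
n_obs = 2000  # sample size reported in the abstract

aic_scores = {name: aic(ll, k) for name, (ll, k) in fits.items()}
bic_scores = {name: bic(ll, k, n_obs) for name, (ll, k) in fits.items()}
print(best_by(aic_scores), best_by(bic_scores))
```

Both criteria penalize extra parameters, and BIC penalizes them more heavily for large samples, so the two can disagree when models differ in size.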

Non-Simultaneous Sampling Deactivation during the Parameter Approximation of a Topic Model

Jeong, Young-Seob;Jin, Sou-Young;Choi, Ho-Jin
- KSII Transactions on Internet and Information Systems (TIIS), v.7 no.1, pp.81-98, 2013
Since Probabilistic Latent Semantic Analysis (PLSA) and Latent Dirichlet Allocation (LDA) were introduced, many revised or extended topic models have appeared. Because the likelihood of these models is intractable, training any topic model requires an approximation algorithm such as variational approximation, Laplace approximation, or Markov chain Monte Carlo (MCMC). Although these approximation algorithms perform well, training a topic model is still computationally expensive given the large amount of data it requires. In this paper, we propose a new method, called non-simultaneous sampling deactivation, for efficient approximation of the parameters of a topic model. In traditional approximation algorithms, every random variable is sampled or obtained using a single predefined burn-in period; our method is instead based on the observation that the random variable nodes in a topic model converge over different periods. During the iterative approximation process, the proposed method allows each random variable node to be terminated, or deactivated, once it has converged. Therefore, compared to traditional approaches, in which every node is usually deactivated concurrently, the proposed method improves inference efficiency in terms of time and memory. We do not propose a new approximation algorithm, but a new process applicable to existing approximation algorithms. Through experiments, we show the time and memory efficiency of the method, and discuss the tradeoff between the efficiency of the approximation process and parameter consistency.
https://doi.org/10.3837/tiis.2013.01.006

Statistical Characteristics and Stochastic Modeling of Water Quality Data at the Influent of Daejeon Wastewater Treatment Plant (대전시 공공하수처리시설 유입수 수질자료의 통계적 특성 및 추계학적 모의)

Pak, Gijung;Jung, Minjae;Lee, Hansaem;Kim, Deokwoo;Yoon, Jaeyong;Paik, Kyungrock
- Journal of Korean Society on Water Environment, v.28 no.1, pp.38-49, 2012
In this study, we analyze the statistical characteristics of influent water quality at the Daejeon wastewater treatment plant and apply a stochastic model for data generation. The analysis uses influent water quality data from 2003 to 2008, excluding 2006. Among the water quality variables, we find strong correlations between BOD and T-N; T-N and T-P; BOD and T-P; $COD_{Mn}$ and T-P; and BOD and $COD_{Mn}$. We also find that different water quality variables follow different theoretical probability distribution functions, which also depend on whether the seasonal cycle is removed. Finally, we generate influent water quality data using the multi-season first-order Markov model (Thomas-Fiering model). With model parameters calibrated for the period 2003~2005, the data generated for 2007~2008 show good agreement with the observed data in general. However, BOD and T-N are underestimated by the stochastic model. This is mainly due to the statistical difference in the observed data themselves between the two periods, 2003~2005 and 2007~2008. Therefore, we expect that the stochastic model can be applied with more confidence when the data follow a stationary pattern.
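For readers unfamiliar with the Thomas-Fiering model named in the abstract above, the following is a minimal sketch of the standard multi-season lag-1 recursion, x_{j+1} = m_{j+1} + b_j (x_j - m_j) + z s_{j+1} sqrt(1 - r_j^2) with b_j = r_j s_{j+1} / s_j, where z is a standard normal variate. The function name and all parameter values are hypothetical; the paper's calibrated parameters are not given in the abstract.

```python
import math
import random


def thomas_fiering(means, stds, corrs, n_years, seed=0):
    """Generate a synthetic seasonal series with a lag-1 Markov model.

    means[j], stds[j]: mean and standard deviation of season j.
    corrs[j]: lag-1 correlation between season j and the next season
              (the last season wraps around to the first).
    """
    rng = random.Random(seed)
    n = len(means)
    x = means[0]  # start the chain at the first season's mean
    series = []
    for t in range(n_years * n):
        j = t % n        # current season
        k = (j + 1) % n  # next season
        series.append(x)
        # Regression coefficient of season k on season j.
        b = corrs[j] * stds[k] / stds[j] if stds[j] > 0 else 0.0
        # Random component scaled to preserve the variance of season k.
        noise = rng.gauss(0.0, 1.0) * stds[k] * math.sqrt(1.0 - corrs[j] ** 2)
        x = means[k] + b * (x - means[j]) + noise
    return series


# Illustrative (invented) seasonal statistics for four seasons.
synthetic = thomas_fiering(
    means=[120.0, 150.0, 110.0, 130.0],
    stds=[15.0, 20.0, 12.0, 18.0],
    corrs=[0.6, 0.5, 0.7, 0.4],
    n_years=3,
)
print(len(synthetic))
```

Because the recursion only reproduces the seasonal means, standard deviations, and lag-1 correlations it is given, it implicitly assumes those statistics are stable over time, which matches the abstract's remark about stationarity.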

A Bayesian zero-inflated Poisson regression model with random effects with application to smoking behavior (랜덤효과를 포함한 영과잉 포아송 회귀모형에 대한 베이지안 추론: 흡연 자료에의 적용)

Kim, Yeon Kyoung;Hwang, Beom Seuk
- The Korean Journal of Applied Statistics, v.31 no.2, pp.287-301, 2018
It is common to encounter count data with excess zeros in various research fields such as the social sciences, natural sciences, medical science, and engineering. Such count data have mainly been modeled by the zero-inflated Poisson model and its extensions. Zero-inflated count data are also often correlated or clustered, in which case random effects should be taken into account in the model. Frequentist approaches have commonly been used to fit such data. However, a Bayesian approach has the advantages of incorporating prior information, avoiding asymptotic approximations, and allowing practical estimation of functions of the parameters. We consider a Bayesian zero-inflated Poisson regression model with random effects for correlated zero-inflated count data. We conducted simulation studies to check the performance of the proposed model. We also applied the proposed model to smoking behavior data from the Regional Health Survey (2015) of the Korea Centers for Disease Control and Prevention.
https://doi.org/10.5351/KJAS.2018.31.2.287
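As a point of reference for the abstract above, the basic zero-inflated Poisson distribution mixes a point mass at zero (with probability pi) and an ordinary Poisson count (with rate lam). The sketch below shows only this basic probability mass function and its log-likelihood; it omits the regression structure, random effects, and Bayesian estimation that the paper describes, and the function and parameter names are hypothetical.

```python
import math


def zip_pmf(k, lam, pi):
    """P(Y = k) for a zero-inflated Poisson with rate lam and zero-inflation pi."""
    if k < 0:
        return 0.0
    poisson = math.exp(-lam) * lam ** k / math.factorial(k)
    if k == 0:
        # A zero can come from the structural-zero part or from the Poisson part.
        return pi + (1.0 - pi) * poisson
    return (1.0 - pi) * poisson


def zip_log_likelihood(counts, lam, pi):
    """Log-likelihood of independent counts under the basic ZIP model."""
    return sum(math.log(zip_pmf(k, lam, pi)) for k in counts)


# Invented example data with many zeros.
counts = [0, 0, 0, 1, 0, 3, 2, 0, 0, 5]
print(zip_log_likelihood(counts, lam=2.0, pi=0.4))
```

In a regression setting, lam and pi would typically be linked to covariates (and, in the paper's case, to random effects) rather than held constant.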

An Application of Dirichlet Mixture Model for Failure Time Density Estimation to Components of Naval Combat System (디리슈레 혼합모형을 이용한 함정 전투체계 부품의 고장시간 분포 추정)

Lee, Jinwhan;Kim, Jung Hun;Jung, BongJoo;Kim, Kyeongtaek
- Journal of Korean Society of Industrial and Systems Engineering, v.42 no.4, pp.194-202, 2019
Reliability analysis of components frequently starts with the data that the manufacturer provides. If enough failure data are collected from field operations, the reliability should be recomputed and updated on the basis of the field failure data. However, when the failure time record for a component contains only a few observations, all statistical methodologies are limited. In this case, where failure records for multiple identical components are available, a valid alternative is to combine the data from each component into one data set of sufficient sample size and to use the information in the censored data. The ROK Navy has been operating multiple Patrol Killer Guided missiles (PKGs) for several years. The Korea Multi-Function Control Console (KMFCC) is one of the key components of the PKG combat system. The maintenance record for the KMFCC contains fewer than ten failure observations and a censored datum. This paper proposes a Bayesian approach with a Dirichlet mixture model to estimate the failure time density for the KMFCC. Trend tests for each component record indicated that the null hypothesis, that failure occurrence is a renewal process, is not rejected. Since the KMFCCs have been functioning under different operating environments, the failure time distribution may be a composition of a number of unknown distributions, i.e. a mixture distribution, rather than a single distribution. The Dirichlet mixture model was coded as a probabilistic program in Python using PyMC3. The Markov Chain Monte Carlo (MCMC) sampling technique employed in PyMC3 then estimated the posterior distribution of the parameters through the Dirichlet mixture model. The simulation results revealed that the mixture models provide superior fits to the combined data set compared with single models.
https://doi.org/10.11627/jkise.2019.42.4.194

Development of dam inflow simulation technique coupled with rainfall simulation and rainfall-runoff model (강우모의기법과 강우-유출 모형을 연계한 댐 유입량 자료 생성기법 개발)

Kim, Tae-Jeong;So, Byung-Jin;Ryou, Min-Suk;Kwon, Hyun-Han
- Journal of Korea Water Resources Association, v.49 no.4, pp.315-325, 2016
Generally, natural river discharge is highly regulated by hydraulic structures, and the regulated flow differs substantially from the natural inflow characteristics needed for water resources planning. Natural inflow data are therefore required for hydrologic analysis and water resources planning. This study aimed to develop an integrated model for more reliable simulation of daily dam inflow. First, a piecewise Kernel-Pareto distribution was used for the rainfall simulation model, which can more effectively reproduce the low order moments (e.g. mean and median) as well as the extremes. Second, a Bayesian Markov Chain Monte Carlo scheme was applied to the SAC-SMA rainfall-runoff model, making it possible to quantitatively assess the uncertainties associated with the model parameters. It was confirmed that the proposed modeling scheme is capable of reproducing the underlying statistical properties of discharge, and can further be used to provide a set of plausible scenarios for water budget analysis in water resources planning.
https://doi.org/10.3741/JKWRA.2016.49.4.315

Performance Improvement in Speech Recognition by Weighting HMM Likelihood (은닉 마코프 모델 확률 보정을 이용한 음성 인식 성능 향상)

권태희;고한석
- The Journal of the Acoustical Society of Korea, v.22 no.2, pp.145-152, 2003
In this paper, assuming that the score of a speech utterance is the product of the HMM log likelihood and an HMM weight, we propose a new method in which the HMM weights are adapted iteratively, as in general MCE training. The proposed method adjusts the HMM weights for better performance using a delta coefficient defined in terms of the misclassification measure. The parameter estimation and Viterbi algorithms of the conventional HMM can therefore be easily applied to the proposed model by constraining the sum of the HMM weights to the number of HMMs in an HMM set. Compared with the general segmental MCE training approach, computing time decreases because the number of parameters to estimate is reduced and gradient calculation through the optimal state sequence is avoided. To evaluate the performance of the HMM-based speech recognizer with weighted HMM likelihood, we perform Korean isolated digit recognition experiments. The experimental results show better performance than the MCE algorithm with state weighting.

Improvement of MLLR Algorithm for Rapid Speaker Adaptation and Reduction of Computation (빠른 화자 적응과 연산량 감소를 위한 MLLR알고리즘 개선)

Kim, Ji-Un;Chung, Jae-Ho
- The Journal of Korean Institute of Communications and Information Sciences, v.29 no.1C, pp.65-71, 2004
We improved the MLLR speaker adaptation algorithm by reducing the order of the HMM parameters using PCA (principal component analysis) or ICA (independent component analysis). To find a smaller set of variables with less redundancy, we adopt PCA and ICA, which give as good a representation as possible, minimize the correlations between data elements, and remove the axes with less covariance or higher-order statistical independencies. The ordinary MLLR algorithm needs more than 30 seconds of adaptation data for SD (speaker-dependent) models to achieve a higher word recognition rate than SI (speaker-independent) models, whereas the proposed algorithm needs just over 10 seconds of adaptation data. Ten components for ICA and PCA give performance similar to that of 36 components in the ordinary MLLR framework. Thus, compared with the ordinary MLLR algorithm, the total computation required for speaker adaptation is reduced to about 1/167 in the proposed MLLR algorithm.

