• Title/Abstract/Keywords: Korean speech

5,307 results (processing time: 0.028 seconds)

언어 및 인지 과제 동시수행이 발화속도에 미치는 영향 (Effects of Concurrent Linguistic or Cognitive Tasks on Speech Rate)

  • 한지연;김효정;김문정
    • 대한음성학회:학술대회논문집 / 대한음성학회 2007년도 한국음성과학회 공동학술대회 발표논문집 / pp.102-105 / 2007
  • This study was designed to examine the effects of concurrent linguistic or cognitive tasks on speech rate. Eight normal speakers repeated sentences either alone or while simultaneously performing a linguistic task or a cognitive task. The linguistic task consisted of generating verbs from nouns, and the cognitive task consisted of performing mental arithmetic. Speech rate was measured from acoustic data. A one-way ANOVA was conducted to test for differences in speech rate among the three task types. The results showed no significant difference between the sentence repetition and linguistic tasks. However, there were significant differences between the sentence repetition and cognitive tasks and between the linguistic and cognitive tasks.

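
The one-way ANOVA mentioned in the abstract above can be sketched as follows. This is a minimal illustration under stated assumptions, not the authors' analysis: the function name `one_way_anova_f` is hypothetical, and the speech-rate values are invented placeholders, not data from the study.

```python
from statistics import mean


def one_way_anova_f(groups):
    """Return (F, df_between, df_within) for a one-way ANOVA over lists of values."""
    all_values = [v for g in groups for v in g]
    grand_mean = mean(all_values)
    # Between-group sum of squares: spread of group means around the grand mean.
    ss_between = sum(len(g) * (mean(g) - grand_mean) ** 2 for g in groups)
    # Within-group sum of squares: spread of values around their own group mean.
    ss_within = sum((v - mean(g)) ** 2 for g in groups for v in g)
    df_between = len(groups) - 1
    df_within = len(all_values) - len(groups)
    f_stat = (ss_between / df_between) / (ss_within / df_within)
    return f_stat, df_between, df_within


# Hypothetical speech rates (syllables per second); NOT data from the study.
repeat_only = [5.1, 4.9, 5.3, 5.0]
with_linguistic = [5.0, 4.8, 5.2, 4.9]
with_cognitive = [4.2, 4.0, 4.4, 4.1]

f_stat, df_b, df_w = one_way_anova_f([repeat_only, with_linguistic, with_cognitive])
print(f"F({df_b}, {df_w}) = {f_stat:.2f}")
```

The F statistic alone does not say which pairs of tasks differ; a pairwise follow-up comparison would be needed for that, and the abstract does not name the one used.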

아동기 말실행증 아동의 조음교대운동 특성 (Alternating Motion Rate Characteristics in Children with Childhood Apraxia of Speech)

  • 박준범;하승희
    • 말소리와 음성과학 / Vol. 6, No. 3 / pp.33-40 / 2014
  • The purpose of the study was to examine alternating motion rate (AMR) and its variability in children with childhood apraxia of speech (CAS) compared to typically developing children. Six children with CAS aged 9-12 years and 10 age-matched children participated in the study. This study measured tokens per second and the variability of the rates during the production of /$p^{*}$a/, /$t^{*}$a/, and /$k^{*}$a/. To measure rate variability, each participant was asked to repeat each speech task three times, and the mean and standard deviation of the rates were obtained. The results revealed that the CAS group showed a slower rate than the control group only for /$k^{*}$a/. The CAS group exhibited greater AMR variability than the control group on all tasks. The results suggest that AMR variability might be a more distinctive speech feature of children with CAS than the rate itself.
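
The rate and variability measures described above can be sketched as follows. This is only an illustration: the function names and the trial values are hypothetical, and the use of the sample standard deviation (rather than the population form) is an assumption, since the abstract does not say which was used.

```python
from statistics import mean, stdev


def amr_rate(token_count, duration_s):
    """Tokens (syllable repetitions) produced per second in one trial."""
    return token_count / duration_s


def amr_summary(trials):
    """Mean rate and sample standard deviation over repeated trials.

    `trials` is a list of (token_count, duration_s) pairs, one per repetition.
    """
    rates = [amr_rate(count, duration) for count, duration in trials]
    return mean(rates), stdev(rates)


# Three hypothetical repetitions of one task by one speaker; NOT data from the study.
trials = [(10, 2.0), (12, 2.0), (14, 2.0)]
mean_rate, rate_sd = amr_summary(trials)
print(mean_rate, rate_sd)
```

A larger standard deviation across the three repetitions corresponds to the greater AMR variability reported for the CAS group.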

목소리 특성과 음성 특징 파라미터의 상관관계와 SVM을 이용한 특성 분류 모델링 (Correlation analysis of voice characteristics and speech feature parameters, and classification modeling using SVM algorithm)

  • 박태성;권철홍
    • 말소리와 음성과학 / Vol. 9, No. 4 / pp.91-97 / 2017
  • This study categorizes several voice characteristics by subjective listening assessment and investigates the correlation between voice characteristics and speech feature parameters. A model was developed to classify voice characteristics into the defined categories using the SVM algorithm. To do this, we extracted various speech feature parameters from a speech database of men in their 20s, and used ANOVA to identify parameters that were statistically significantly correlated with voice characteristics. These parameters were then applied to the proposed SVM model. The experimental results showed that some speech feature parameters are significantly correlated with the voice characteristics, and that the proposed model achieves an average classification accuracy of 88.5%.

MDVP와 Praat, Dr. Speech간의 음향학적 측정치에 관한 상관연구 (A Correlation Study among Acoustic Parameters of MDVP, Praat, and Dr. Speech)

  • 유재연;정옥란;장태엽;고도흥
    • 음성과학 / Vol. 10, No. 3 / pp.29-36 / 2003
  • The purpose of this study was to conduct a correlational analysis of $F_{0}$, Jitter, Shimmer, NHR (HNR), and NNE as estimated by three speech analysis programs: MDVP, Praat, and Dr. Speech. Thirty females and 15 males with normal voices participated in the study. We used Sound Forge 6.0 to record their voices. MDVP, Praat, and Dr. Speech were used to measure the acoustic parameters, and Pearson correlation coefficients were computed. The results were as follows. First, there was a strong correlation for $F_{0}$ and Shimmer between instruments. However, there was no correlation for Jitter between instruments. Second, Shimmer showed a stronger correlation with HNR, NHR, and NNE than Jitter did. Therefore, Shimmer was considered a more useful and sensitive parameter than Jitter for identifying dysphonic voice.

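
The Pearson correlation coefficient used to compare measurements across programs can be sketched as follows. This is a minimal illustration: the function name `pearson_r` and the values are hypothetical, not measurements from the study.

```python
from math import sqrt


def pearson_r(x, y):
    """Pearson correlation coefficient between two equal-length sequences."""
    n = len(x)
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    # Sum of products of deviations from the two means.
    covariation = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    spread_x = sqrt(sum((a - mean_x) ** 2 for a in x))
    spread_y = sqrt(sum((b - mean_y) ** 2 for b in y))
    return covariation / (spread_x * spread_y)


# Hypothetical F0 estimates (Hz) for the same five voices from two programs;
# NOT data from the study.
f0_program_a = [210.0, 198.5, 225.1, 120.3, 131.8]
f0_program_b = [209.4, 199.0, 224.6, 121.0, 131.1]
print(round(pearson_r(f0_program_a, f0_program_b), 3))
```

A coefficient close to 1 would indicate that the two programs rank and scale the voices almost identically for that parameter.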

성직자 음성의 음향학적인 비교 연구 (A Comparative Study on the Voices of Clergymen: Ministers vs. Priests)

  • 이은선;박상희;조성미;정옥란;석동일
    • 음성과학 / Vol. 10, No. 3 / pp.79-86 / 2003
  • This study compared the voices of ministers and priests. There has been a common notion that ministers are more passionate than priests in delivering their speech. Therefore, it can be assumed that ministers abuse or misuse their voices more than priests do. This study attempted an acoustic analysis of the voices of 6 ministers and 5 priests before and after their speech. We measured F0, jitter, shimmer, NNE, and HNR using Dr. Speech (Version 4.0, Tiger DRS). A t-test was performed to determine any objective differences between their voices. The results showed no significant differences between the voices of ministers and priests before or after their speech. However, there seemed to be an interesting opposite tendency between ministers and priests, although it did not reach statistical significance. That is, F0 tended to increase after the speech in ministers, whereas it tended to decrease in priests. In addition, HNR tended to decrease after the speech in priests, while it tended to increase in ministers.


직접데이터 기반의 모델적응 방식을 이용한 잡음음성인식에 관한 연구 (A Study on the Noisy Speech Recognition Based on the Data-Driven Model Parameter Compensation)

  • 정용주
    • 음성과학 / Vol. 11, No. 2 / pp.247-257 / 2004
  • There have been many research efforts to overcome the problems of speech recognition in noisy conditions. Among them, model-based compensation methods such as parallel model combination (PMC) and vector Taylor series (VTS) have been found to perform efficiently compared with earlier speech enhancement methods or feature-based approaches. In this paper, a data-driven model compensation approach that adapts the HMM (hidden Markov model) parameters for noisy speech recognition is proposed. Instead of assuming statistical approximations as in conventional model-based methods such as PMC, the statistics necessary for the HMM parameter adaptation are estimated directly using the Baum-Welch algorithm. The proposed method has shown improved results compared with PMC for noisy speech recognition.


잡음 환경에서의 음성 감정 인식을 위한 특징 벡터 처리 (Feature Vector Processing for Speech Emotion Recognition in Noisy Environments)

  • 박정식;오영환
    • 말소리와 음성과학 / Vol. 2, No. 1 / pp.77-85 / 2010
  • This paper proposes an efficient feature vector processing technique to guard a Speech Emotion Recognition (SER) system against a variety of noises. In the proposed approach, emotional feature vectors are extracted from speech processed by comb filtering. These vectors are then used to construct robust models based on feature vector classification. We modify conventional comb filtering by using speech presence probability to minimize drawbacks due to incorrect pitch estimation under background noise conditions. The modified comb filtering can correctly enhance the harmonics, which are an important factor in SER. The feature vector classification technique categorizes feature vectors as either discriminative or non-discriminative based on a log-likelihood criterion. This method can successfully select the discriminative vectors while preserving correct emotional characteristics. Thus, robust emotion models can be constructed using only such discriminative vectors. In SER experiments using an emotional speech corpus contaminated by various noises, our approach exhibited superior performance to the baseline system.

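
The basic idea behind comb filtering for harmonic enhancement can be sketched as follows. This is a conventional feedforward comb filter only; the speech-presence-probability modification described in the abstract is not modeled, and the function name `comb_filter` and the test signals are hypothetical.

```python
def comb_filter(signal, pitch_period):
    """Average each sample with the sample one pitch period earlier.

    Components that repeat every `pitch_period` samples (the pitch harmonics)
    pass unchanged, while components in anti-phase at that lag cancel.
    Samples before the first full period use 0.0 as the delayed value.
    """
    output = []
    for n, sample in enumerate(signal):
        delayed = signal[n - pitch_period] if n >= pitch_period else 0.0
        output.append(0.5 * (sample + delayed))
    return output


# A signal with period 4 is preserved once the delay line has filled.
harmonic = [1.0, 0.0, -1.0, 0.0] * 3
print(comb_filter(harmonic, 4)[4:])

# A signal that flips sign every 4 samples is cancelled.
anti_phase = [1.0, 1.0, 1.0, 1.0, -1.0, -1.0, -1.0, -1.0] * 2
print(comb_filter(anti_phase, 4)[4:])
```

The filter depends entirely on a correct `pitch_period`, which is why pitch estimation errors under background noise are the drawback the abstract sets out to reduce.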

Spline 코드북 기반의 spectral folding을 이용한 대역폭 확장 방법 (Bandwidth Expansion Method Using Spline Codebook Based Spectral Folding)

  • 박지훈;한승호;양희식;정상배;한민수
    • 대한음성학회:학술대회논문집 / 대한음성학회 2006년도 추계학술대회 발표논문집 / pp.131-134 / 2006
  • The quality of narrowband speech (0-4 kHz) can be enhanced by the bandwidth expansion technique, in which the high-band components are estimated. This paper proposes a bandwidth expansion method using spline codebook based spectral folding. For the performance evaluation, PESQ (Perceptual Evaluation of Speech Quality) scores are measured as the objective measurement. In addition, MOS (Mean Opinion Score) and preference tests are performed as the subjective measurement. The results show that our proposed method outperforms the existing spline based one.

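
The basic spectral folding step can be sketched as follows. This shows only generic folding by zero insertion, which mirrors the narrowband spectrum into the high band; the spline codebook part of the proposed method is not modeled. The function names, the 8 kHz sampling rate, and the 1 kHz test tone are assumptions chosen for illustration.

```python
import cmath
import math


def spectral_fold(narrowband):
    """Upsample by 2 by inserting a zero after every sample.

    This doubles the sampling rate and mirrors the original spectrum
    around the old Nyquist frequency into the new high band.
    """
    wideband = []
    for sample in narrowband:
        wideband.extend([sample, 0.0])
    return wideband


def dft_magnitudes(signal):
    """Magnitudes of DFT bins 0..N/2, computed directly from the definition."""
    n = len(signal)
    return [
        abs(sum(signal[t] * cmath.exp(-2j * math.pi * k * t / n) for t in range(n)))
        for k in range(n // 2 + 1)
    ]


# Assumed setup: a 1 kHz tone sampled at 8 kHz, 64 samples long.
fs_narrow = 8000
tone = [math.cos(2 * math.pi * 1000 * t / fs_narrow) for t in range(64)]

folded = spectral_fold(tone)        # now 128 samples at 16 kHz
magnitudes = dft_magnitudes(folded)
bin_hz = 2 * fs_narrow / len(folded)  # 125 Hz per bin
peaks = [k * bin_hz for k, m in enumerate(magnitudes) if m > 1.0]
print(peaks)  # frequencies (Hz) that carry energy after folding
```

In this setup the folded signal carries the original 1 kHz tone plus its mirror image at 7 kHz, that is, 8 kHz minus 1 kHz. A practical system would still need to shape the level of the mirrored band, which is the part this sketch leaves out.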

웨이브렛 변환을 이용한 음성신호의 성문폐쇄시점 검출 (Detection of Glottal Closure Instant for Voiced Speech Using Wavelet Transform)

  • 배건성
    • 음성과학 / Vol. 7, No. 3 / pp.153-165 / 2000
  • During the phonation of voiced sounds, there are instants at which the glottis opens or closes due to the periodic vibration of the vocal cords. The instant at which the glottis closes is called the glottal closure instant (GCI) or epoch. Correct detection of the GCI is one of the important problems in speech processing for pitch detection, pitch synchronous analysis, and so on. Recently, it has been shown that the local maxima of the wavelet transformed speech signal correspond to the GCIs of the speech signal. In this paper, we investigate the accuracy of GCIs estimated from this wavelet transformed speech signal. For this purpose, we compare them with the negative peak points of the differentiated EGG signal, which represent the actual GCIs of the speech signal.

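
The reference side of that comparison, locating GCIs as negative peaks of the differentiated EGG signal, can be sketched as follows. This is a simplified illustration: the function names, the threshold, the first-difference approximation of the derivative, and the synthetic EGG samples are all assumptions, and the wavelet-based estimator itself is not implemented.

```python
def differentiate(signal):
    """First difference: element j is signal[j + 1] - signal[j]."""
    return [signal[j + 1] - signal[j] for j in range(len(signal) - 1)]


def gci_from_egg(egg, threshold):
    """Sample indices where the differentiated EGG has a negative peak.

    A negative peak is a local minimum of the difference signal that lies
    below `threshold` (a negative number chosen by the user).
    """
    degg = differentiate(egg)
    instants = []
    for j in range(1, len(degg) - 1):
        if degg[j] < threshold and degg[j] < degg[j - 1] and degg[j] <= degg[j + 1]:
            instants.append(j + 1)  # difference j belongs to sample j + 1
    return instants


def timing_errors(estimates, references):
    """Signed offset (in samples) from each estimate to its nearest reference."""
    return [min((e - r for r in references), key=abs) for e in estimates]


# Synthetic EGG with two abrupt drops; NOT real data.
egg = [0, 0, 0, -5, -4, -3, -2, -1, 0, 0, 0, -5, -4, -3, -2, -1, 0, 0]
reference_gcis = gci_from_egg(egg, threshold=-2)

# Hypothetical GCI estimates from some other detector, for comparison only.
estimated_gcis = [4, 11]
print(reference_gcis, timing_errors(estimated_gcis, reference_gcis))
```

The list of signed offsets is one simple way to express how far an estimator's GCIs fall from the EGG-derived reference instants.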

A Review of Timing Factors in Speech

  • Yun, Il-Sung
    • 음성과학 / Vol. 7, No. 3 / pp.87-98 / 2000
  • Timing in speech is determined by many factors. In this paper, we introduce and discuss some factors that have generally been regarded as important in speech timing. They include stress, syllable structure, consonant insertion or deletion, tempo, lengthening at clause, phrase, and word boundaries, preconsonantal vowel shortening, compensation between segments or within phonological units (e.g., word, foot), and compression due to an increase in the number of syllables at the word or foot level. Each of them may play a crucial role in the structuring of speech timing in a language. However, some of these timing factors must interact with each other rather than act independently, and the effect of each factor on speech timing will vary from language to language. There could also well be many other factors that are unknown so far. Finding and investigating new timing factors and reinterpreting the already-known ones should enhance our understanding of timing structures in a given language or languages.
