• Title/Summary/Keyword: 피치편차 (pitch deviation)

Search Result 14, Processing Time 0.022 seconds

Recognizing Five Emotional States Using Speech Signals (음성 신호를 이용한 화자의 5가지 감성 인식)

  • Kang Bong-Seok;Han Chul-Hee;Woo Kyoung-Ho;Yang Tae-Young;Lee Chungyong;Youn Dae-Hee
    • Proceedings of the Acoustical Society of Korea Conference / autumn / pp.101-104 / 1999
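  • In this paper, we built three systems for recognizing a speaker's emotion from speech signals and compared their performance. The target emotions are joy, sadness, anger, fear, boredom, and a neutral state, and we built an emotional speech database for each emotion ourselves. Pitch and energy information served as the features for emotion recognition, and the recognition algorithms were an MLB (Maximum-Likelihood Bayes) classifier, an NN (Nearest Neighbor) classifier, and an HMM (Hidden Markov Model) classifier. The MLB and NN classifiers used statistical information as feature vectors, such as the mean, standard deviation, and maximum of pitch and energy, while the HMM classifier used temporal information, such as the delta pitch, delta-delta pitch, delta energy, and delta-delta energy of each frame. The experiments were speaker-dependent and sentence-independent, and the recognition rates were 68.9% with MLB, 66.7% with NN, and 89.30% with HMM.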

Interaction of native language interference and universal language interference on L2 intonation acquisition: Focusing on the pitch range variation (L2 억양에서 나타나는 모국어 간섭과 언어 보편적 간섭현상의 상호작용: 피치대역을 중심으로)

  • Yune, Youngsook
    • Phonetics and Speech Sciences / v.13 no.4 / pp.35-46 / 2021
  • In this study, we examined the interactive aspects between pitch reduction phenomena considered a universal language phenomenon and native language interference in the production of L2 intonation performed by Chinese learners of Korean. To investigate their interaction, we conducted an acoustic analysis using acoustic measures such as pitch span, pitch level, pitch dynamic quotient, skewness, and kurtosis. In addition, the correlation between text comprehension and pitch was examined. The analyzed material consisted of four Korean discourses containing five and seven sentences of varying difficulty. Seven Korean native speakers and thirty Chinese learners who differed in their Korean proficiency participated in the production test. The results, for differences by language, showed that Chinese had a more expanded pitch span, and a higher pitch level than Korean. The analysis between groups showed that at the beginner and intermediate levels, pitch reduction was prominent, i.e., their Korean was characterized by a compressed pitch span, low pitch level, and less sentence internal pitch variation. Contrariwise, the pitch use of advanced speakers was most similar to Korean native speakers. There was no significant correlation between text difficulty and pitch use. Through this study, we observed that pitch reduction was more pronounced than native language interference in the phonetic layer.

Tutorial on the Coordinate Transforms in Applied Geophysics (물리탐사에 유용한 좌표계 회전 정리)

  • Song, Yoonho
    • Geophysics and Geophysical Exploration / v.23 no.2 / pp.89-96 / 2020
  • This tutorial summarizes the coordinate transforms used in formulating geophysical problems. To ensure mathematical consistency, the discussion begins with the right-hand rule. The concepts of active and passive transforms are then introduced. By extending these concepts, the coordinate transform between two coordinate systems and its inverse are related through the matrix transpose. The yaw-pitch-roll rotation and the azimuth-deviation-tool face rotation transforms are described as the most frequently used schemes, and the relation between Rodrigues' rotation formula and these two transforms is explained mathematically. The "Gimbal Lock" problem inherent in yaw-pitch-roll rotation is presented schematically and derived mathematically. As a useful tool to overcome this problem, the principle and usage of the quaternion are also described.
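
Two points from this abstract, the inverse of a rotation being its transpose and the gimbal lock of yaw-pitch-roll rotation, can be illustrated with a minimal Python sketch. It assumes an active Z-Y-X convention (R = Rz(yaw) Ry(pitch) Rx(roll)) with angles in radians. All function names are hypothetical and are not taken from the tutorial, whose own conventions may differ.

```python
import math

def rot_x(a):
    """Active rotation about the x axis (roll)."""
    c, s = math.cos(a), math.sin(a)
    return [[1.0, 0.0, 0.0], [0.0, c, -s], [0.0, s, c]]

def rot_y(a):
    """Active rotation about the y axis (pitch)."""
    c, s = math.cos(a), math.sin(a)
    return [[c, 0.0, s], [0.0, 1.0, 0.0], [-s, 0.0, c]]

def rot_z(a):
    """Active rotation about the z axis (yaw)."""
    c, s = math.cos(a), math.sin(a)
    return [[c, -s, 0.0], [s, c, 0.0], [0.0, 0.0, 1.0]]

def matmul(a, b):
    """Product of two 3x3 matrices stored as nested lists."""
    return [[sum(a[i][k] * b[k][j] for k in range(3)) for j in range(3)]
            for i in range(3)]

def transpose(m):
    return [list(row) for row in zip(*m)]

def yaw_pitch_roll(yaw, pitch, roll):
    """Assumed Z-Y-X convention: R = Rz(yaw) * Ry(pitch) * Rx(roll)."""
    return matmul(rot_z(yaw), matmul(rot_y(pitch), rot_x(roll)))

def close(a, b, tol=1e-12):
    """Element-wise comparison of two 3x3 matrices."""
    return all(abs(a[i][j] - b[i][j]) < tol
               for i in range(3) for j in range(3))

IDENTITY = [[1.0, 0.0, 0.0], [0.0, 1.0, 0.0], [0.0, 0.0, 1.0]]

# The inverse transform is the transpose: R * R^T is the identity.
r = yaw_pitch_roll(0.3, 0.2, 0.1)
inverse_is_transpose = close(matmul(r, transpose(r)), IDENTITY)

# Gimbal lock: at pitch = 90 degrees only (roll - yaw) matters, so two
# different (yaw, roll) pairs with the same difference give one rotation.
locked_a = yaw_pitch_roll(0.3, math.pi / 2, 0.5)
locked_b = yaw_pitch_roll(0.1, math.pi / 2, 0.3)
gimbal_locked = close(locked_a, locked_b)
```

At a pitch of 90 degrees one rotational degree of freedom is lost, which is why a representation such as the quaternion is attractive there.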

Perceptive evaluation of Korean native speakers on the polysemic sentence final ending produced by Chinese Korean learners (KFL중국인학습자들의 한국어 동형다의 종결어미 발화문에 대한 원어민화자의 지각 평가 양상)

  • Yune, Youngsook
    • Phonetics and Speech Sciences / v.12 no.4 / pp.27-36 / 2020
  • The aim of this study is to investigate the perceptive aspects of the polysemic sentence final ending "-(eu)lgeol" produced by Chinese Korean learners. "-(Eu)lgeol" has two different meanings, that is, a guess and a regret, and these different meanings are expressed by the different prosodic features of the last syllable of "-(eu)lgeol". To examine how Korean native speakers perceive "-(eu)lgeol" sentences produced by Chinese Korean learners and the most salient prosodic variable for the semantic discrimination of "-(eu)lgeol" at the perceptive level, we performed a perceptual experiment. The analysed material consisted of four Korean sentences containing "-(eu)lgeol", of which two expressed guesses and the other two expressed regret. Twenty-five Korean native speakers participated in the perceptual experiment. Participants were asked to mark whether the "-(eu)lgeol" sentences they listened to were (1) definitely regrets, (2) probably regrets, (3) ambiguous, (4) probably guesses, or (5) definitely guesses, based on the prosodic features of the last syllable of "-(eu)lgeol". The analysed prosodic variables were sentence boundary tones, slopes of boundary tones, pitch difference between sentence-final and penultimate syllables, and pitch levels of boundary tones. The results show that all the analysed prosodic variables are significantly correlated with the semantic discrimination of "-(eu)lgeol" and that, among these variables, the pitch difference between the sentence-final and penultimate syllables plays the most salient role.

Comparison of feature parameters for emotion recognition using speech signal (음성 신호를 사용한 감정인식의 특징 파라메터 비교)

  • 김원구
    • Journal of the Institute of Electronics Engineers of Korea SP / v.40 no.5 / pp.371-377 / 2003
  • This paper compares feature parameters for emotion recognition using speech signals. For this purpose, a corpus of emotional speech data, recorded and classified by emotion through subjective evaluation, was used to build statistical feature vectors, such as the average, standard deviation, and maximum value of pitch and energy, and phonetic features such as MFCC parameters. To evaluate the performance of the feature parameters, a speaker- and context-independent emotion recognition system was constructed for the experiments. In the experiments, pitch and energy parameters and their derivatives were used as prosodic information, and MFCC parameters and their derivatives were used as phonetic information. Experimental results with a vector-quantization-based emotion recognition system showed that the system using MFCC parameters and their derivatives performed better than the one using pitch and energy parameters.

Study on Temperature Control and Optimal Design for Continuous Sterilizer (연속 살균기의 온도제어 및 최적설계에 관한 연구)

  • Park, Cheol Jae
    • Transactions of the Korean Society of Mechanical Engineers A / v.39 no.8 / pp.813-821 / 2015
  • In this paper, we analyze the problems of a batch-type sterilizer and design a continuous sterilizer to control the temperature deviation. The temperature deviation is analyzed with respect to design parameters such as nozzle diameter, hole diameter, and nozzle length. The significant temperature parameters are optimized using the response surface methodology. An experimental apparatus is developed using the optimized design parameters. Using a field test, we show that the target temperature is reached in about 7.3 minutes and the temperature deviation is improved by about $0.84^{\circ}C$. The optimized parameters from the test are equal to the analytical parameters.

Attitude Estimation of Unmanned Vehicles Using Unscented Kalman Filter (무향 칼만 필터를 이용한 무인 운송체의 자세 추정)

  • Song, Gyeong-Sub;Ko, Nak-Yong;Choi, Hyun-Seung
    • The Journal of the Korea institute of electronic communication sciences / v.14 no.1 / pp.265-274 / 2019
  • This paper describes an application of the unscented Kalman filter (UKF) to attitude estimation of an unmanned vehicle (UV) equipped with a low-cost attitude heading reference system (AHRS). The roll, pitch, and yaw required at the correction stage of the UKF are calculated from measurements of acceleration and the geomagnetic field. The roll and pitch are derived from the acceleration measurement, while the yaw is calculated from the geomagnetic field measurement. Since the geomagnetic field measurement is vulnerable to distortion by hard-iron and soft-iron effects, the calculated yaw has more uncertainty than the calculated roll and pitch. To reduce this uncertainty, the proposed method estimates the bias in the geomagnetic field measurement and compensates for it to calculate the yaw more accurately. The proposed method is verified through navigation experiments with a UV in a test pool. The results show that the proposed method yields more accurate attitude estimation and, as a result, more accurate location estimation.
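
The way roll and pitch follow from acceleration and yaw from the geomagnetic field can be sketched in Python using the conventional tilt-compensation formulas. This is an illustration only, not the paper's UKF or its bias estimator. It assumes body axes x forward, y right, z down, an accelerometer that reads (0, 0, +1) g when level and at rest, and a known constant magnetometer bias that is simply subtracted. The function names and the `bias` parameter are hypothetical.

```python
import math

def roll_pitch_from_accel(ax, ay, az):
    """Roll and pitch (radians) from a static accelerometer reading.

    Assumes the reading is (0, 0, +1) g when the vehicle is level.
    """
    roll = math.atan2(ay, az)
    pitch = math.atan2(-ax, math.hypot(ay, az))
    return roll, pitch

def yaw_from_mag(mx, my, mz, roll, pitch, bias=(0.0, 0.0, 0.0)):
    """Tilt-compensated yaw (radians) from a magnetometer reading.

    `bias` is an assumed, already known constant offset; the paper
    estimates the bias inside its filter, which is not reproduced here.
    """
    mx, my, mz = mx - bias[0], my - bias[1], mz - bias[2]
    sr, cr = math.sin(roll), math.cos(roll)
    sp, cp = math.sin(pitch), math.cos(pitch)
    # Rotate the measured field back into the horizontal plane.
    xh = mx * cp + my * sr * sp + mz * cr * sp
    yh = my * cr - mz * sr
    return math.atan2(-yh, xh)

# Example: a level vehicle heading 60 degrees from magnetic north, with a
# field of 0.3 (horizontal) and 0.5 (down) in arbitrary units.
true_yaw = math.pi / 3
field = (0.3 * math.cos(true_yaw), -0.3 * math.sin(true_yaw), 0.5)
roll, pitch = roll_pitch_from_accel(0.0, 0.0, 1.0)
yaw = yaw_from_mag(*field, roll, pitch)
```

Because the yaw depends on all three magnetometer axes, an uncorrected offset on any axis shifts the computed heading, which is consistent with the abstract's point that yaw is the least certain of the three angles.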

Speech Parameter Estimation and Enhancement Using the EM Algorithm (EM 알고리즘을 이용한 음성 파라미터 추정 및 향상)

  • Lee, Ki-Yong;Kang, Young-Tae;Lee, Byung-Gook
    • The Journal of the Acoustical Society of Korea / v.13 no.2E / pp.68-75 / 1994
  • In many applications of signal processing, we have to deal with densities which are highly non-Gaussian or which may have Gaussian shape in the middle but have potent deviations in the tails. To fight against these deviations, we consider a finite mixture distribution for the speech excitation. We utilize the EM algorithm for the estimation of speech parameters and their enhancement. Robust Kalman filtering is used in the enhancement process, and a detection/estimation technique is used for parameter estimation. Experimental results show that the proposed algorithm performs better in adverse SNR input conditions.

Study on Performance Prediction of Industrial Axial Flow Fan with Adjustable Pitch Blades (산업용 조정 피치형 축류송풍기의 성능예측에 관한 연구)

  • Koo, Jae-In;Kim, Chang-Soo;Chung, Jin-Teak;Kim, Kwang-Ho
    • 유체기계공업학회:학술대회논문집 / 2001.11a / pp.30-34 / 2001
  • In the present study, we studied a method of predicting the on-design and off-design point performance of an axial flow fan with adjustable pitch blades. Changing the stagger angle of such a fan changes its flow rate and pressure. Because of this merit, adjustable pitch fans are used in many industrial facilities. When the stagger angle is changed or the performance is estimated over a wide range of off-design conditions, the incidence angle changes greatly as the flow rate changes. Therefore, the deviation angle at the blade exit is estimated by a correlation that considers the effects of blade design and incidence angle variation. In the loss model, we used known pressure loss models for the blade boundary layer and wake, secondary flow, endwall boundary layer, and tip leakage flow. The results of the modified deviation angle model were compared with experiment to assess the usefulness of the modified model.

An Active Region Detection Method for The Speech Playback-speed Control (음성재생 속도 제어를 위한 활성화 영역 검출방법)

  • Yoo, Deok-Hyeon;Kim, Dong-Hyeok;Jeon, Joon-Hyeon
    • Journal of the Institute of Electronics Engineers of Korea SP / v.49 no.3 / pp.98-105 / 2012
  • This paper describes a new method for high-quality speech playback speed control. The proposed method provides an adaptive threshold filtering solution for detecting the active regions of a speech signal according to the playback speed. For a given playback speed, the threshold value is adaptively determined from the statistics (mean and standard deviation) of each frame of speech and is used to select only the active blocks within the current frame. To minimize the quality degradation (i.e., pitch degradation) caused by high-speed playback, the threshold filtering first eliminates relatively low-activity blocks, including voiced and unvoiced ones. Simulation results show that the proposed scheme provides playback speed control with higher quality than the SOLA (Synchronized OverLap Add) method, which uses pitch extraction of speech.
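
The idea of a per-frame threshold built from the mean and standard deviation can be sketched in Python as follows. The abstract does not give the actual rule, so this sketch makes three assumptions: block activity is measured as mean squared amplitude, the threshold is mean + k * standard deviation, and the factor `k` stands in for the dependence on playback speed. All names are hypothetical.

```python
from statistics import mean, pstdev

def block_energies(frame, block_size):
    """Mean squared amplitude of each consecutive block in a frame."""
    blocks = [frame[i:i + block_size]
              for i in range(0, len(frame), block_size)]
    return [sum(x * x for x in block) / len(block) for block in blocks]

def active_blocks(frame, block_size, k):
    """Indices of blocks whose energy reaches the adaptive threshold.

    The threshold is mean + k * std of the block energies in this frame.
    A larger k (assumed here to correspond to faster playback) keeps
    fewer blocks.
    """
    energies = block_energies(frame, block_size)
    threshold = mean(energies) + k * pstdev(energies)
    return [i for i, e in enumerate(energies) if e >= threshold]

# Toy frame of four blocks: silent, loud, quiet, fairly loud.
frame = [0.0] * 4 + [1.0] * 4 + [0.1] * 4 + [0.9] * 4
kept_slow = active_blocks(frame, block_size=4, k=0.0)
kept_fast = active_blocks(frame, block_size=4, k=1.0)
```

With `k = 0.0` the two louder blocks are kept, and with `k = 1.0` only the loudest one remains, which shows how a stricter threshold discards low-activity blocks first.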