• Title/Abstract/Keywords: pitch slope

55 search results (processing time: 0.019 s)

자유 발화 자료에서 나타나는 한국어 억양 곡선의 기울기 특성에 대한 연구 (A Study of Intonation Curve Slopes in Korean Spontaneous Speech)

  • 오재혁
    • 말소리와 음성과학 / Vol. 6, No. 1 / pp.21-30 / 2014
  • This study examines pitch slope in Korean intonation curves using spontaneous speech data. A total of 656 utterances were taken from a spoken corpus and processed with 'close-copy stylization', and the physical features of the pitch movements were then extracted. Pitch slope was calculated from the time and pitch range of each utterance. The results show that, apart from essential differences, the mean and distribution of pitch slope are similar for men and women within the range of pitch movement; the slope of pitch movement thus shows no difference between men and women. Pitch slopes between -10 and 10 account for 90% of all pitch slopes; slopes that move by time scale without curve account for 33.1%; slopes that cover half of the pitch bandwidth during the average pitch movement time account for 23.4%; and slopes that cover 100% of the pitch bandwidth in half of the average pitch movement time account for 10.4%. These results suggest that pitch slope could serve as a basis for standardizing Korean intonation.

억양의 근접복사 유형화를 이용한 감정음성의 음향분석 (An acoustical analysis of emotional speech using close-copy stylization of intonation curve)

  • 이서배
    • 말소리와 음성과학 / Vol. 6, No. 3 / pp.131-138 / 2014
  • A close-copy stylization of the intonation curve was used for an acoustical analysis of emotional speech. For the analysis, 408 utterances covering five emotions (happiness, anger, fear, neutral and sadness) were processed to extract acoustical feature values. The results show that certain pitch point features (pitch point movement time and pitch point distance within a sentence) and sentence level features (pitch range of the final pitch point, pitch range of a sentence and pitch slope of a sentence) are affected by emotion. Pitch point movement time, pitch point distance within a sentence and pitch slope of a sentence show no significant difference between male and female participants. The emotions with high arousal (happiness and anger) are consistently distinguished from the emotion with low arousal (sadness) in terms of these acoustical features. Emotions with higher arousal show a steeper sentence pitch slope, a steeper pitch slope at the end of a sentence, and a wider sentence pitch range. The analysis suggests that measurements of these acoustical features could be used to cluster and identify emotions in speech.

영어 동시발화의 자동 억양궤적 추출을 통한 음향 분석 (An acoustical analysis of synchronous English speech using automatic intonation contour extraction)

  • 이서배
    • 말소리와 음성과학 / Vol. 7, No. 1 / pp.97-105 / 2015
  • This research focuses on the intonational characteristics of synchronous English speech. Intonation contours were extracted from 1,848 utterances produced in two speaking modes (solo vs. synchronous) by 28 native speakers of English (12 women and 16 men). Synchronous speech is found to be slower than solo speech, and women are found to speak more slowly than men. The effect of speaking mode on speech rate is larger than the effect of gender. However, there is no interaction between the two factors (speaking mode vs. gender) in terms of speech rate. Analysis of pitch point features shows that synchronous speech has smaller Pt (pitch point movement time), Pr (pitch point pitch range), Ps (pitch point slope) and Pd (pitch point distance) than solo speech. There is no interaction between the two factors in terms of pitch point features. Analysis of sentence level features reveals that synchronous speech has smaller Sr (sentence level pitch range), Ss (sentence slope), MaxNr (normalized maximum pitch) and MinNr (normalized minimum pitch) but greater Min (minimum pitch) and Sd (sentence duration) than solo speech. It is also shown that the higher the Mid (median pitch), the MaxNr and the MinNr in solo speaking mode, the more they are reduced in synchronous speaking mode. Max, Min and Mid show greater speaker discriminability than the other features.

영어의 억양 유형화를 이용한 발화 속도와 남녀 화자에 따른 음향 분석 (An acoustical analysis of speech of different speaking rates and genders using intonation curve stylization of English)

  • 이서배
    • 말소리와 음성과학 / Vol. 6, No. 4 / pp.79-90 / 2014
  • Intonation curve stylization was used for an acoustical analysis of English speech. For the analysis, acoustical feature values were extracted from 1,848 utterances produced at normal and fast speech rates by 28 native speakers of English (12 women and 16 men). Men are found to speak faster than women at the normal speech rate, but no difference is found between genders at the fast speech rate. Analysis of pitch point features shows that fast speech has greater Pt (pitch point movement time), Pr (pitch point pitch range), and Pd (pitch point distance) but smaller Ps (pitch point slope) than normal speech. Men show greater Pt, Pr, and Pd than women. Analysis of sentence level features reveals that fast speech has smaller Sr (sentence level pitch range), Sd (sentence duration), and Max (maximum pitch) but greater Ss (sentence slope) than normal speech. Women show greater Sr, Ss, Sp (pitch difference between the first pitch point and the last), Sd, MaxNr (normalized Max), and MinNr (normalized Min) than men. As speech rate increases, women speak with greater Ss and Sr than men.

The Comparison of Pitch Production Between Children with Cochlear Implants and Normal Hearing Children

  • Yoo, Hyun-Soo;Ko, Do-Heung
    • 음성과학 / Vol. 15, No. 1 / pp.87-98 / 2008
  • This study compares the pitch production of children using cochlear implants (CI) with that of children with normal hearing. Twenty subjects aged six to eight participated in the study. Three kinds of sentences were read and analyzed using Visi-Pitch (KAY Elemetrics, Model 3300). There were no considerable differences between the two groups regarding pitch, mean fundamental frequency (F0) and pitch range. For the slope value of F0 and for duration, however, there were significant differences. It is therefore concluded that duration and pitch control can be crucial factors in determining intonation treatment for children with cochlear implants.


F0 변화율로 본 한국어 억양 패턴의 음향 특성 (Korean Intonation Patterns from the Viewpoint of F0 Percentage Change)

  • 이지연;이호영
    • 말소리와 음성과학 / Vol. 5, No. 1 / pp.123-130 / 2013
  • Previous research on Korean intonation has mainly focused on $F_0$ target frequencies, $F_0$ slope, and the duration of intonation patterns. This study investigated Korean intonation patterns, both boundary and phrasal tones, in relation to the $F_0$ percentage change between pitch targets. We measured the percentage change between the pitch targets of both boundary and phrasal tones. Additionally, the $F_0$ change between the preceding pitch target and the first pitch target of the boundary tone, and the $F_0$ targets of the sequence of two LH phrasal tones ('LH + LH'), were also measured. Two phrasal tones, LHLH and HLH, were compared with 'LH + LH' and the 'HLH' in the LHLH pattern respectively. We found that the percentage change between pitch targets in the phrasal tone is fixed to some extent. This helps explain why the slope of the phrasal tone is closely related to the number of syllables and the duration of the phrasal tone, as discussed in previous studies. Since we analyzed the intonation patterns using utterances from a large speech corpus, the results of this paper are expected to be useful in building a larger annotated corpus of Korean.
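The measurement described in this abstract can be illustrated with a minimal sketch. The abstract does not give the exact formula, so the definition below (change relative to the earlier target, in percent) is an assumption, and the function names and the example target values are hypothetical.

```python
def f0_percentage_change(f0_from_hz, f0_to_hz):
    """Percentage change from one pitch target to the next.

    Assumed definition: (to - from) / from * 100. The abstract does not
    state the formula, so this is only one plausible reading.
    """
    if f0_from_hz <= 0:
        raise ValueError("F0 of the starting target must be positive")
    return (f0_to_hz - f0_from_hz) * 100.0 / f0_from_hz


def percentage_changes(targets_hz):
    """Percentage change between each pair of consecutive pitch targets."""
    return [
        f0_percentage_change(a, b)
        for a, b in zip(targets_hz, targets_hz[1:])
    ]


# Hypothetical L-H-L target sequence in Hz (illustrative values only).
changes = percentage_changes([200.0, 250.0, 200.0])
```

Under this definition a rise and a fall of the same size in Hz yield different percentages, because each change is taken relative to its own starting target.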

타워크레인의 기울어짐 측정 시스템 개발 (The Development of the Slope Monitoring System(SMS) of the Tower Crane)

  • 신운철;홍용수
    • 한국안전학회지 / Vol. 25, No. 6 / pp.60-64 / 2010
  • The purpose of this study is to prevent tower crane overturning accidents during summer hurricanes. We developed the SMS to give the operator an automatic alarm within the dangerous range and to report the exact slope in real time. The slope value of the tower crane is composed of direction, pitch (front and rear), roll (right and left), and a synthesis of pitch and roll. In particular, the synthesis eliminates the effect of the wall tie or wire bracing, so this value should correctly indicate the actual slope. Further field tests of the SMS are needed. In the future, additional measurement devices could be attached to supply more alarm criteria for reviewing risk in the field.

신경망을 이용한 고립단어에서의 피치변화곡선 발생기에 관한 연구 (A Study on the Pitch Contour Generator with Neural Network in the Isolated Words)

  • 임운천;곽진구;장석왕
    • 대한음성학회:학술대회논문집 / 대한음성학회 February 1996 Conference Proceedings / pp.137-155 / 1996
  • The purpose of this paper is to generate, using a neural network, a pitch contour that reflects the phonetic environment and the number of syllables in each Korean isolated word. To do this, we analyzed a set of 513 Korean isolated words of 1 to 4 syllables and extracted the pitch contour and the duration of each phoneme in all the words. The total number of phonemes analyzed is about 3800. We then approximated the pitch contour with a first-order polynomial by regression analysis, which gave the slope, the initial pitch and the duration of each phoneme. We used these 3 parameters as the target pattern of the neural network and let the network learn the rule governing the variation of pitch and duration under the phonetic environment of each phoneme. We used strings of 7 consecutive phonemes as the input pattern so that the network could learn the effect of the phonetic environment around the center phoneme. In the learning phase, we used 3545 items (463 words) as target patterns, each containing the phonetic environment of the 3 preceding and 3 following phonemes, and the neural network showed correctness rates of 98.43%, 98.59% and 97.7% in estimating the duration, the slope and the initial pitch. In the recall phase, we tested the neural network with 251 items (50 words) that were not used as learning data and obtained good correctness rates of 97.34%, 95.45% and 96.3% in generating the duration, the slope and the initial pitch of each phoneme.
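The parameter extraction step in this abstract (a first-order polynomial fitted to each phoneme's pitch contour) can be sketched as follows. This is a minimal illustration using ordinary least squares, not the authors' code. The function names are hypothetical, and defining initial pitch as the fitted value at the first sample and duration as the span between first and last samples are both assumptions.

```python
def fit_line(times, pitches):
    """Ordinary least-squares fit of pitch = slope * time + intercept."""
    n = len(times)
    if n < 2 or n != len(pitches):
        raise ValueError("need at least two (time, pitch) samples")
    mean_t = sum(times) / n
    mean_p = sum(pitches) / n
    sxx = sum((t - mean_t) ** 2 for t in times)
    if sxx == 0:
        raise ValueError("all time stamps are identical")
    sxy = sum((t - mean_t) * (p - mean_p) for t, p in zip(times, pitches))
    slope = sxy / sxx
    intercept = mean_p - slope * mean_t
    return slope, intercept


def phoneme_parameters(times, pitches):
    """Return (slope, initial_pitch, duration) for one phoneme.

    Assumptions: initial pitch is the fitted line evaluated at the first
    sample, and duration is the span from first to last sample.
    """
    slope, intercept = fit_line(times, pitches)
    initial_pitch = intercept + slope * times[0]
    duration = times[-1] - times[0]
    return slope, initial_pitch, duration


# Illustrative samples: time in seconds, pitch in Hz.
slope, initial_pitch, duration = phoneme_parameters(
    [0.0, 0.1, 0.2], [100.0, 105.0, 110.0]
)
```

The three returned values correspond to the slope, initial pitch and duration that the abstract describes as the network's target pattern.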


학령전기아동 관련 성인의 운율 특성 (The Prosodic Characteristics of Pre-school Age Children-Related Adults)

  • 김지원;성철재
    • 말소리와 음성과학 / Vol. 6, No. 3 / pp.23-32 / 2014
  • This study presents the prosodic characteristics of 'Motherese' and 'Teacherese' (child care teachers and kindergarten teachers). 21 mothers and 24 teachers spoke to children at a child care center or kindergarten. The children were aged 4;00 to 6;11. Speech and articulation rate, number of accentual phrases (APs), number of intonational phrases (IPs), pitch-related factors (f0, pitch range, f0 standard deviation), and intonation slope (mean Absolute, f0, q-tone slope) were measured. The 2 groups produced 2 sentence types (interrogative: alternative question; declarative: coordinated sentence) in 2 situations (one with the children present, the other without children but pretending they were present). The results indicate that teachers show more noticeable prosodic characteristics than mothers do.

GPS/IMU/OBD 융합기반 ACF/IMMKF를 이용한 차량 Pitch 추정 알고리즘 (Vehicular Pitch Estimation Algorithm with ACF/IMMKF Based on GPS/IMU/OBD Data Fusion)

  • 김주원;이명수;이상선
    • 한국통신학회논문지 / Vol. 40, No. 9 / pp.1837-1845 / 2015
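  • Estimating an accurate vehicle position in urban environments requires the longitudinal velocity. This longitudinal velocity can be obtained by computing the road slope, that is, the vehicle's pitch angle. However, pitch estimation with a single sensor and algorithm cannot be expected to yield accurate values. For accurate pitch estimation, this paper uses an ACF (Adaptive Complementary Filter), composed of an AKF (Adaptive Kalman Filter) and a CF (Complementary Filter), to adjust the process noise and measurement error of the IMU (Inertial Measurement Unit) to the driving environment, and fuses the result with GPS (Global Positioning System) and OBD (Onboard Equipment) data. An IMMKF (Interactive Multiple Model Kalman Filter) is then used to optimize the filter's system model according to the road slope model and to estimate the final pitch angle suited to the driving environment.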
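For readers unfamiliar with the CF building block named in this abstract, a plain fixed-gain complementary filter for pitch can be sketched as below. This is not the paper's ACF or IMMKF, which adapt noise parameters and fuse GPS and OBD data. The function names, the gain `alpha`, and the assumption that an accelerometer-derived pitch is already available are all hypothetical.

```python
def complementary_pitch(pitch_prev, gyro_rate, accel_pitch, dt, alpha=0.98):
    """One update step of a fixed-gain complementary filter (radians).

    The gyro rate is integrated for short-term accuracy, and the
    accelerometer-derived pitch corrects long-term drift.
    alpha is a hypothetical blending gain, not a value from the paper.
    """
    gyro_pitch = pitch_prev + gyro_rate * dt
    return alpha * gyro_pitch + (1.0 - alpha) * accel_pitch


def run_filter(samples, dt, alpha=0.98, pitch0=0.0):
    """Apply the filter to a sequence of (gyro_rate, accel_pitch) samples."""
    pitch = pitch0
    history = []
    for gyro_rate, accel_pitch in samples:
        pitch = complementary_pitch(pitch, gyro_rate, accel_pitch, dt, alpha)
        history.append(pitch)
    return history


# Illustrative input: no rotation, accelerometer reports a constant 0.1 rad.
history = run_filter([(0.0, 0.1)] * 1000, dt=0.01)
```

With a constant accelerometer reading and zero gyro rate, the estimate converges toward the accelerometer value. An adaptive scheme such as the one the abstract describes would vary the weighting with driving conditions instead of fixing `alpha`.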