• Title/Summary/Keyword: pitch slope

Search Result 55, Processing Time 0.03 seconds

A Study of Intonation Curve Slopes in Korean Spontaneous Speech (자유 발화 자료에서 나타나는 한국어 억양 곡선의 기울기 특성에 대한 연구)

  • Oh, Jeahyuk
    • Phonetics and Speech Sciences / v.6 no.1 / pp.21-30 / 2014
  • This study examines the pitch slope of Korean intonation curves in spontaneous speech data. For this purpose, 656 utterances were taken from a spoken corpus and processed with 'close-copy stylization', and the physical features of the pitch movements were then extracted. The pitch slope was calculated on the basis of time and pitch range in each utterance. The results show that the average and distribution of pitch slope are similar between men and women within the range of the pitch movement, apart from essential differences; the slope of pitch movement thus shows no difference between men and women. Pitch slopes between -10 and 10 account for 90% of all pitch slopes; slopes that move along the time scale without a curve account for 33.1%; slopes that move half of the pitch bandwidth during the average pitch movement time account for 23.4%; and slopes that move 100% of the pitch bandwidth during half of the average pitch movement time account for 10.4%. These results suggest that Korean intonation could be standardized on the basis of pitch slope.
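
One way to read "calculated on the basis of time and pitch range" is as a normalized slope: the fraction of the pitch bandwidth covered, divided by the movement time expressed as a fraction of the average movement time. The sketch below follows that reading only. The abstract does not give the formula, so the function name `normalized_pitch_slope`, its parameters, and the sample values are all assumptions.

```python
def normalized_pitch_slope(f_start, f_end, t_start, t_end,
                           f_min, f_max, mean_move_time):
    """Normalized slope of one pitch movement (hypothetical definition).

    Pitch change is scaled by the utterance's pitch bandwidth (f_max - f_min),
    and elapsed time is scaled by the average pitch movement time.
    """
    bandwidth = f_max - f_min
    elapsed = t_end - t_start
    if bandwidth <= 0 or elapsed <= 0 or mean_move_time <= 0:
        raise ValueError("bandwidth, elapsed time and mean time must be positive")
    pitch_fraction = (f_end - f_start) / bandwidth
    time_fraction = elapsed / mean_move_time
    return pitch_fraction / time_fraction

# Made-up values: half of the bandwidth covered in the average movement time.
half_band = normalized_pitch_slope(150.0, 200.0, 0.0, 0.2, 100.0, 200.0, 0.2)
# Made-up values: the full bandwidth covered in half of the average time.
full_band = normalized_pitch_slope(100.0, 200.0, 0.0, 0.1, 100.0, 200.0, 0.2)
```

Under this assumed definition, the two cases named in the abstract map to slopes of 0.5 and 2 respectively; the abstract's own -10 to 10 scale may be defined differently.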

An acoustical analysis of emotional speech using close-copy stylization of intonation curve (억양의 근접복사 유형화를 이용한 감정음성의 음향분석)

  • Yi, So Pae
    • Phonetics and Speech Sciences / v.6 no.3 / pp.131-138 / 2014
  • A close-copy stylization of the intonation curve was used for an acoustical analysis of emotional speech. For the analysis, 408 utterances of five emotions (happiness, anger, fear, neutral and sadness) were processed to extract acoustical feature values. The results show that certain pitch point features (pitch point movement time and pitch point distance within a sentence) and sentence level features (pitch range of a final pitch point, pitch range of a sentence and pitch slope of a sentence) are affected by emotions. Pitch point movement time, pitch point distance within a sentence and pitch slope of a sentence show no significant difference between male and female participants. The emotions with high arousal (happiness and anger) are consistently distinguished from the emotion with low arousal (sadness) in terms of these acoustical features. Emotions with higher arousal show a steeper pitch slope of a sentence, a steeper pitch slope at the end of a sentence, and a wider pitch range of a sentence. The acoustical analysis in this study suggests that measurements of these acoustical features could be used to cluster and identify the emotions of speech.

An acoustical analysis of synchronous English speech using automatic intonation contour extraction (영어 동시발화의 자동 억양궤적 추출을 통한 음향 분석)

  • Yi, So Pae
    • Phonetics and Speech Sciences / v.7 no.1 / pp.97-105 / 2015
  • This research focuses mainly on the intonational characteristics of synchronous English speech. Intonation contours were extracted from 1,848 utterances produced in two different speaking modes (solo vs. synchronous) by 28 native speakers of English (12 women and 16 men). Synchronous speech is found to be slower than solo speech, and women are found to speak more slowly than men. The effect size of speaking mode on speech rate is greater than that of gender. However, there is no interaction between the two factors (speaking mode vs. gender) in terms of speech rate. Analysis of pitch point features shows that synchronous speech has smaller Pt (pitch point movement time), Pr (pitch point pitch range), Ps (pitch point slope) and Pd (pitch point distance) than solo speech. There is no interaction between the two factors (speaking mode vs. gender) in terms of pitch point features. Analysis of sentence level features reveals that synchronous speech has smaller Sr (sentence level pitch range), Ss (sentence slope), MaxNr (normalized maximum pitch) and MinNr (normalized minimum pitch) but greater Min (minimum pitch) and Sd (sentence duration) than solo speech. It is also shown that the higher the Mid (median pitch), the MaxNr and the MinNr are in solo speaking mode, the more they are reduced in synchronous speaking mode. Max, Min and Mid show greater speaker discriminability than the other features.

An acoustical analysis of speech of different speaking rates and genders using intonation curve stylization of English (영어의 억양 유형화를 이용한 발화 속도와 남녀 화자에 따른 음향 분석)

  • Yi, So Pae
    • Phonetics and Speech Sciences / v.6 no.4 / pp.79-90 / 2014
  • An intonation curve stylization was used for an acoustical analysis of English speech. For the analysis, acoustical feature values were extracted from 1,848 utterances produced at normal and fast speech rates by 28 native speakers of English (12 women and 16 men). Men are found to speak faster than women at a normal speech rate, but no difference is found between genders at a fast speech rate. Analysis of pitch point features shows that fast speech has greater Pt (pitch point movement time), Pr (pitch point pitch range), and Pd (pitch point distance) but smaller Ps (pitch point slope) than normal speech. Men show greater Pt, Pr, and Pd than women. Analysis of sentence level features reveals that fast speech has smaller Sr (sentence level pitch range), Sd (sentence duration), and Max (maximum pitch) but greater Ss (sentence slope) than normal speech. Women show greater Sr, Ss, Sp (pitch difference between the first pitch point and the last), Sd, MaxNr (normalized Max), and MinNr (normalized Min) than men. As speech rate increases, women speak with greater Ss and Sr than men.

The Comparison of Pitch Production Between Children with Cochlear Implants and Normal Hearing Children

  • Yoo, Hyun-Soo;Ko, Do-Heung
    • Speech Sciences / v.15 no.1 / pp.87-98 / 2008
  • This study compares the pitch production of children using cochlear implants (CI) with that of children with normal hearing. Twenty subjects from six to eight years old participated in the study. Three kinds of sentences were read and analyzed using Visi-Pitch (KAY Elemetrics, Model 3300). There were no considerable differences between the two groups regarding pitch, mean fundamental frequency (F0) and pitch range. There were, however, significant differences in the slope value of F0 and in duration. It is therefore concluded that duration and pitch control can be crucial factors in determining intonation treatment for children with cochlear implants.

Korean Intonation Patterns from the Viewpoint of F0 Percentage Change (F0 변화율로 본 한국어 억양 패턴의 음향 특성)

  • Lee, Ji Yeon;Lee, Ho-Young
    • Phonetics and Speech Sciences / v.5 no.1 / pp.123-130 / 2013
  • Previous research on Korean intonation has focused mainly on $F_0$ target frequencies, $F_0$ slope, and the duration of intonation patterns. This study investigated Korean intonation patterns, both boundary and phrasal tones, in relation to the $F_0$ percentage change between pitch targets. We measured the percentage change between the pitch targets of both boundary and phrasal tones. Additionally, we measured the $F_0$ change between the preceding pitch target and the first pitch target of the boundary tone, and the $F_0$ targets of a sequence of two LH phrasal tones ('LH + LH'). Two phrasal tones, LHLH and HLH, were compared with 'LH + LH' and with the 'HLH' in the LHLH pattern, respectively. We found that the percentage change between pitch targets in the phrasal tone is fixed to some extent. This helps explain why the slope of the phrasal tone is closely related to the number of syllables and the duration of the phrasal tone, as discussed in previous studies. Since the intonation patterns were analyzed with utterances from a large speech corpus, the results of this paper are expected to be useful in building a larger annotated corpus of Korean.
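
The percentage-change measure can be sketched in a few lines. This is a minimal illustration, not the authors' procedure: the function name `f0_percent_change`, the choice of the preceding target as the reference value, and the sample frequencies in Hz are all assumptions.

```python
def f0_percent_change(targets_hz):
    """Percentage change between each pair of consecutive F0 targets.

    Assumption: each change is expressed relative to the preceding target.
    """
    if any(f <= 0 for f in targets_hz):
        raise ValueError("F0 targets must be positive")
    return [100.0 * (b - a) / a for a, b in zip(targets_hz, targets_hz[1:])]

# Made-up pitch targets in Hz for a low-high-low sequence.
changes = f0_percent_change([200.0, 250.0, 200.0])
```

A relative measure like this does not depend on how long the movement takes. With a roughly fixed percentage change, a longer phrasal tone would simply have a shallower slope, which fits the abstract's remark about slope, syllable count, and duration.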

The Development of the Slope Monitoring System (SMS) of the Tower Crane (타워크레인의 기울어짐 측정 시스템 개발)

  • Shin, Woon-Chul;Hong, Yong-Soo
    • Journal of the Korean Society of Safety / v.25 no.6 / pp.60-64 / 2010
  • The purpose of this study is to prevent dangerous accidents in which a tower crane overturns during summer hurricanes. We developed the SMS to raise an automatic alarm for the operator when the crane enters the dangerous range and to report the exact slope in real time. The slope value of the tower crane is composed of the direction, the pitch (front and rear), the roll (right and left), and a synthesis of the pitch and roll. In particular, the synthesis eliminates the effect of the wall tie or wire bracing, so this value should correctly indicate the actual slope. Further field tests of the SMS are still needed. In the future, additional measurement devices could be attached to provide more alarm criteria for reviewing risk in the field.
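
The abstract does not say how pitch and roll are synthesized. One common geometric convention treats them as slopes of a tilted plane along two perpendicular horizontal axes, so the overall tilt is the angle whose tangent is the length of the vector of the two tangents. The sketch below uses that convention as an assumption; `combined_tilt` and `ALARM_THRESHOLD_DEG` are hypothetical names and values, not taken from the paper.

```python
import math

def combined_tilt(pitch_deg, roll_deg):
    """Combine pitch and roll into one tilt angle and a tilt direction.

    Assumed convention: pitch and roll are the slopes of a plane along two
    perpendicular horizontal axes; valid for angles strictly inside +/-90.
    Returns (tilt in degrees, direction in degrees measured from the
    front-rear axis).
    """
    tan_pitch = math.tan(math.radians(pitch_deg))
    tan_roll = math.tan(math.radians(roll_deg))
    tilt = math.degrees(math.atan(math.hypot(tan_pitch, tan_roll)))
    direction = math.degrees(math.atan2(tan_roll, tan_pitch))
    return tilt, direction

ALARM_THRESHOLD_DEG = 1.0  # hypothetical alarm limit, not from the paper

tilt, direction = combined_tilt(3.0, 4.0)
alarm = tilt > ALARM_THRESHOLD_DEG
```

For small angles the combined tilt is close to the Euclidean length of the two angles, so a pitch of 3 degrees and a roll of 4 degrees give a tilt just under 5 degrees.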

A Study on the Pitch Contour Generator with Neural Network in the Isolated Words (신경망을 이용한 고립단어에서의 피치변화곡선 발생기에 관한 연구)

  • Lim Unchun;Kwak Jingu;Chang Sokwang
    • Proceedings of the KSPS conference / 1996.02a / pp.137-155 / 1996
  • The purpose of this paper is to generate, using a neural network, a pitch contour that is affected by the phonetic environment and the number of syllables in each Korean isolated word. To do this, we analyzed a set of 513 Korean isolated words consisting of 1-4 syllables and extracted the pitch contour and the duration of each phoneme in all the words. The total number of phonemes analyzed is about 3800. We then approximated the pitch contour with a 1st-order polynomial by regression analysis and obtained the slope, the initial pitch and the duration of each phoneme. We used these 3 parameters as the target pattern of the neural network and let the network learn the rule of the variation of pitch and duration as affected by the phonetic environment of each phoneme. We used strings of 7 consecutive phonemes as the input pattern so that the network could learn the effect of the phonetic environment around the center phoneme. In the learning phase, we used 3545 items (463 words) as target patterns, which contained the phonetic environment of the 3 preceding and 3 following phonemes, and the neural network showed correctness rates of 98.43%, 98.59% and 97.7% in the estimation of the duration, the slope and the initial pitch. In the recall phase, we tested the performance of the neural network with 251 items (50 words) that were not used as learning data and obtained good correctness rates of 97.34%, 95.45% and 96.3% in the generation of the duration, the slope and the initial pitch of each phoneme.
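
The first-order regression step can be illustrated as follows. This is a minimal sketch using ordinary least squares, which is assumed here because the abstract only says "regression analysis". The function names, the sampling times, and the F0 values are hypothetical.

```python
def fit_line(times, f0):
    """Ordinary least-squares fit of f0 = slope * t + intercept."""
    n = len(times)
    if n < 2 or n != len(f0):
        raise ValueError("need at least two (time, f0) pairs of equal length")
    mean_t = sum(times) / n
    mean_f = sum(f0) / n
    sxx = sum((t - mean_t) ** 2 for t in times)
    if sxx == 0:
        raise ValueError("time values must not all be identical")
    sxy = sum((t - mean_t) * (f - mean_f) for t, f in zip(times, f0))
    slope = sxy / sxx
    return slope, mean_f - slope * mean_t

def phoneme_parameters(times, f0):
    """Return (slope, initial pitch, duration) for one phoneme's F0 samples."""
    slope, intercept = fit_line(times, f0)
    initial_pitch = slope * times[0] + intercept  # fitted value at onset
    duration = times[-1] - times[0]
    return slope, initial_pitch, duration

# Made-up F0 samples (Hz) at 10 ms intervals for one phoneme.
slope, initial_pitch, duration = phoneme_parameters(
    [0.00, 0.01, 0.02, 0.03], [120.0, 122.0, 124.0, 126.0]
)
```

Three numbers per phoneme, as in the abstract, give the network a compact target from which a piecewise-linear pitch contour can be rebuilt.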

The Prosodic Characteristics of Pre-school Age Children-Related Adults (학령전기아동 관련 성인의 운율 특성)

  • Kim, Jiwon;Seong, Cheoljae
    • Phonetics and Speech Sciences / v.6 no.3 / pp.23-32 / 2014
  • This study presents the prosodic characteristics of 'Motherese' and 'Teacherese' (child care teachers and kindergarten teachers). 21 mothers and 24 teachers spoke to children in a child care center or kindergarten. The children were aged 4;00 to 6;11. Speech and articulation rate, number of accentual phrases (APs), number of intonational phrases (IPs), pitch-related factors (f0, pitch range, f0 standard deviation), and intonation slope (mean absolute, f0, and q-tone slope) were measured. The 2 groups produced 2 sentence types (interrogative: alternative question; declarative: coordinated sentence) in 2 situations (one with the children present, the other without the children but pretending that they were present). The results indicate that teachers show more noticeable prosodic characteristics than mothers do.

Vehicular Pitch Estimation Algorithm with ACF/IMMKF Based on GPS/IMU/OBD Data Fusion (GPS/IMU/OBD 융합기반 ACF/IMMKF를 이용한 차량 Pitch 추정 알고리즘)

  • Kim, Ju-won;Lee, Myung-su;Lee, Sang-sun
    • The Journal of Korean Institute of Communications and Information Sciences / v.40 no.9 / pp.1837-1845 / 2015
  • The longitudinal velocity is necessary for accurate vehicular positioning in urban environments. To acquire the longitudinal velocity, the pitch angle, which corresponds to the road slope, must be calculated. However, a very accurate pitch cannot be obtained with only one sensor and one algorithm. In this paper, therefore, the process noise and position estimation error of the IMU are adjusted to the driving environment, and GPS and OBD data are fused with an ACF, which consists of an AKF and a CF. The final pitch angle appropriate for the driving environment is then estimated by an IMMKF in order to optimize the system model according to road slope models.
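
To give a feel for the CF part of this fusion, the sketch below shows one update step of a plain fixed-gain complementary filter, assuming CF stands for complementary filter. It is not the paper's ACF or IMMKF, which adapt to the driving environment. The function name, the gain `alpha = 0.98`, the 10 ms step, and the constant reference pitch are assumptions for illustration only.

```python
def complementary_filter(pitch_prev, gyro_rate, ref_pitch, dt, alpha=0.98):
    """One step of a fixed-gain complementary filter (angles in radians).

    pitch_prev: previous pitch estimate
    gyro_rate:  pitch rate from the gyroscope (rad/s)
    ref_pitch:  slowly varying reference pitch from another source
                (hypothetical, e.g. derived from accelerometer or GPS data)
    alpha:      weight on the integrated gyro path (assumed value)
    """
    gyro_pitch = pitch_prev + gyro_rate * dt  # short-term: integrate gyro
    return alpha * gyro_pitch + (1.0 - alpha) * ref_pitch  # long-term: pull to reference

# Made-up scenario: stationary gyro, constant reference pitch of 0.1 rad.
pitch = 0.0
for _ in range(500):
    pitch = complementary_filter(pitch, gyro_rate=0.0, ref_pitch=0.1, dt=0.01)
```

The gyro path tracks fast changes while the reference path removes slow drift. An adaptive scheme would tune this balance to the driving environment instead of fixing `alpha`.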