• 제목/요약/키워드: Speech Register

검색결과 22건 처리시간 0.02초

언어 사용력(Speech Register)원리를 활용한 유아의 교육용 로봇 인식 (Applying the Speech Register Principle to young children`s Perception of the Intelligent Service Robot)

  • 현은자;이하원;연혜민
    • 한국콘텐츠학회논문지
    • /
    • 제12권10호
    • /
    • pp.532-540
    • /
    • 2012
  • 본 연구의 목적은 유아교육기관에서 로봇을 경험한 만 5세 유아들의 로봇 인식을 조사하는 것이다. 연구의 이론적 배경은 언어 사용력(Speech Register) 이론이었으며 로봇, 친구, 인형을 대상으로 말투(Speech Tone) 비교를 하였다. 2-3년간 로봇을 경험한 3군데 유치원의 만 5세 유아 50명을 대상으로 인간에게 어울리는 자연스러운 말투와 단조로운 말투로 그림책을 읽어준 후 어떤 말투가 사람에게 어울리는 말투인지 유아들에게 질문하여 확인하였다. 그리고 인간에게 어울리는 말투가 로봇, 친구, 인형 중 어떤 대상에게 적합한지 선택하게 하였고 그 이유를 분석하였다. 연구결과, 사람에게 어울리는 말투는 86%의 유아가 인형보다 로봇에게, 74% 유아는 로봇보다 친구에게 그리고 68%유아는 인형보다 친구에게 어울린다고 인식하였다. 즉, 유아들은 로봇을 인공물보다는 인간에게 가까운 존재로서 중간자적 혼성물(hybrid beings)로 인식하고 있었다. 그리고 인식 기제는 인간 고유성(human uniqueness)인 인지적 특성이 반영된 결과였다. 그러므로 생물과 무생물의 이분법적 존재론적 인식 분류는 혼성물을 포함한 분류방식으로 대치할 필요성이 있음을 제안한다.

음성장애 성인 여성과 정상음성 성인 여성 간 진성구와 가성구의 음성범위프로파일 비교 (Comparison of voice range profiles of modal and falsetto register in dysphonic and non-dysphonic adult women)

  • 김재옥;이승진
    • 말소리와 음성과학
    • /
    • 제14권4호
    • /
    • pp.67-75
    • /
    • 2022
  • 본 연구는 53명의 음성장애 성인 여성과 53명의 정상음성 성인 여성의 진성구와 가성구 음역대 및 진성구-가성구 성구전환이 발생하는 구간을 비교하였다. 최고기본주파수(F0MAX), 최고음성강도(IMAX), 기본주파수범위(F0RANGE) 및 음성강도범위(IRANGE) 모두 음성장애군이 정상음성군에 비해 유의하게 낮았다. 또한 두 집단 모두 F0MAX와 F0RANGE가 진성구에 비해 가성구가 유의하게 높았다. IMAX와 IRANGE는 정상음성군에서 가성구가 진성구보다 유의하게 높았으나 음성장애군은 진성구와 가성구 간에 차이가 없었다. F0MIN과 IMIN은 두 집단 간에 통계적으로 유의한 차이는 없었다. 진성구-가성구 성구전환이 발생하는 지점의 주파수는 음성장애군은 378.86Hz(F4#), 정상음성군은 557.79Hz(C5#)로 음성장애군이 유의하게 낮았다. 본 연구를 통해 음성장애가 있는 성인 여성의 진성구와 가성구 음역대 모두 정상음성 성인 여성에 비해 감소하여 성대가 고주파수에서 진동하기 어렵다는 것을 알 수 있다. 본 연구결과는 성인 여성 음성장애 화자의 진성구와 가성구의 VRP를 함께 살펴봄으로써 음성장애가 미치는 음향학적 특성을 파악하기 위한 기초자료로 사용될 수 있을 것이다.

Long Term Average Spectrum Characteristics of Head and Chest Register Sounds of Western Operatic Singers - Possibility of a Second Singer's Formant-

  • Jin, Sung-Min;Kwon, Young-Kyung;Song, Yun-Kyung
    • 음성과학
    • /
    • 제10권2호
    • /
    • pp.99-109
    • /
    • 2003
  • The purpose of this study was to analyze and compare head register with chest register of singers acoustically. Fifteen healthy tenor major students were participated. Fifteen healthy untrained adults were chosen as the control group for this study. Long term average (LTA) power spectrum using the Fast Fourier transform (FFT) algorithm and Linear predictive coding (LPC) filter response were made with /a/ sustained in both head (G4, 392 Hz) and chest registers (C3, 131 Hz). Statistical analysis was performed using the Mann-Whitney test. In the LTA power spectrum, head register of singers increased in the level of energy gain within the frequency of 2.2-3.4 kHz (p<0.01), and 7.5-8.4 kHz (p<0.01, p<0.05). Chest register of singers increased in the frequency of 2.2-3.1 kHz (p<0.01), 7.8-8.4 kHz (p<0.05) and around 9.6 kHz (p<0.01). The LTA power spectrum revealed a peak of acoustic energy around 2,500 Hz, known as the singer's formant and another peak of acoustic energy around 8,000 Hz in the singer's voice.

  • PDF

CELP 보코더의 성능 개선 및 실시간 구현 (Performance improvement and Realtime implementation in CELP Coder)

  • 정창경
    • 한국음향학회:학술대회논문집
    • /
    • 한국음향학회 1994년도 제11회 음성통신 및 신호처리 워크샵 논문집 (SCAS 11권 1호)
    • /
    • pp.199-204
    • /
    • 1994
  • In this paper, we researched abut CELP speech coding algorithm using efficlent pseudo-stochastic block codes, adaptive-codebook and improved fixed-gain codebook. The pseudo-stochastic block codes refer to stochastically populated block codes in which the adjacent codewords in an innovation codebook are non-independent. The adaptive-codebook was made with previous prediction speech data by storage-shift register. This CELP coding algorithm enables the coding of toll quality speech at bit rates from 4.8kbits/s to 9.6 kbits/s. This algorithm was realized TMS320C30 microprocessor in realtime.

  • PDF

Energy-Efficient Approximate Speech Signal Processing for Wearable Devices

  • Park, Taejoon;Shin, Kyoosik;Kim, Nam Sung
    • ETRI Journal
    • /
    • 제39권2호
    • /
    • pp.145-150
    • /
    • 2017
  • As wearable devices are powered by batteries, they need to consume as little energy as possible. To address this challenge, in this article, we propose a synergistic technique for energy-efficient approximate speech signal processing (ASSP) for wearable devices. More specifically, to enable the efficient trade-off between energy consumption and sound quality, we synergistically integrate an approximate multiplier and a successive approximate register analog-to-digital converter using our enhanced conversion algorithm. The proposed ASSP technique provides ~40% lower energy consumption with ~5% higher sound quality than a traditional one that optimizes only the bit width of SSP.

장모음 인식장치 설계 제작 (Design and Manufacture of a Device for the Recognition of Long Vowels)

  • 구용회
    • 전자공학회논문지T
    • /
    • 제35T권3호
    • /
    • pp.9-14
    • /
    • 1998
  • 장모음 음성인식을 전자회로로 수행하였다. 레벨 압축은 음성 파형을 직렬 펄스로 변화시킬 수 있었다 이 펄스들로 모음을 구별하는 정보가 된다. 펄스의 샘풀링은 한 단위로 모음의 피치 직렬신호를 얻어지는 레지스터 에 의해서 이루어진다. 샘풀링 펄스에 의한 시간제어 펄스는 음성파형의 첩두치 펄스에 의해 발진되어 진다. 이 레지스터에 있는 병렬 데이터는 만약 OO이면 OO이다는 규칙으로 이루처지는 의지결정 회로의 뜻에 따라 음성 심볼이 인식되어진다.

  • PDF

Countertenor 1인의 Modal Register와 Falsetto Register에서의 공기역학적 변화 및 전기성문파형의 변화 연구 (Analysis of Phonatory Aerodynamic & Electroglottography of a Countertenor)

  • 남도현;최성희;최재남;최홍식
    • 대한후두음성언어의학회지
    • /
    • 제17권1호
    • /
    • pp.43-48
    • /
    • 2006
  • Background and Objectives: Countertenors who can produce higher vocal pitch like female classical singer's voice and use both modal and falsetto register. This study was conducted to study phonatory characteristics between modal and falsetto register of the countertenor. Materials and Methods: A male countertenor who had 8 years of experience was examined using a videostroboscopy and his voice was analyzed using aerodynamic measures; fundamental frequency(F0), Mean air flow rate(MFR), intensity(SLP), subglottal air pressure(Psub) with phonatory function analyzer(Nagashima) and acoustic measures; jitter, shimmer, HNR, closed quotient(CQ) using a Electro-glottography(EGG) of Lx. Speech Studio(Laryngoscope, Ltd, UK) and voice range profile of CSL(Kay elemetrics). Results: In the stroboscopy finding, the longitudinal length of vocal folds was increased at the falsetto register and the upper margin of vocal folds vibrated with incomplete closure of true vocal folds. In aerodynamic analysis, intensity was same at the modal and falsetto register. However, MFR, Psub, MPT were higher at the falsetto register. In the electroglottographic analysis, closed quotient(CQ) at the modal register was high and also much higher at the high-pitch falsetto than at the loud falsetto. In the VRP, intensity was similar though F0 was different between modal and falsetto register. Conclusion: It implied that countertenor could produce powerful voice quality by increasing of respiratory pressure and respiratory volume though glottal closure was incomplete. In addition, no change of EGG waveform, similar voice range with alto was observed.

  • PDF

남성성악가의 Vocal Register Transition(Passaggio)시 공기역학적 변화와 EGG의 변화 연구 (Analysis of Phonatory Aerodynamic & E.G.G. during Passaggio of the Trained Male Singers)

  • 남도현;최성희;최재남;최홍식
    • 대한후두음성언어의학회지
    • /
    • 제15권1호
    • /
    • pp.21-26
    • /
    • 2004
  • Vocal Register Transition(Passaggio) is one of the most important vocal technique for classically trined male singers(tenor). Passaggio is that it bridges the chest register to head register without a noticeable voice break. Vocalist gest the feeling that voice is not locked a particular register. The purpose of this study was to clarify the difference between easy($B_3$) tone and non passaggio(F#_4$) & passaggio(F#_4$). We selected 6 trained singers(tenor), who had more than 12.6 years of experience and were well trained in passaggio technique. Simulataneous measurement was performed frequency(F0), mean flow rate(MFR), intensity(I), and subglottal pressure(Psub) using a phonatory function analyzer(Nagashima) and Closed Quotient(CQ), Jitter, Shimmer, NHR a Electro-glottography(EGG) of Lx. Speech Studio(Laryngogrph Lt, London, UK) and vocal efficiency was calculated by Carroll's method. For the tenor, target tone/a/was measured in three conditions : 1) easy phonation : $B_3$, 2) high tone without passaggio : F#_4$, 3) high tone with passaggio : F#_4$). The results revealed that F0 of the target tones between non-passaggio group and passaggio group were not significantly different though higher is F0, higher is subglottal pressure. And also CQ, MFR, Psub were increased in passagio than nonpssagio but these values were not statistically different. This study concluded that passaggio is the vocal technique to make the same quality of tone between chest register and head register in tenor.

  • PDF

Dual MAC를 이용한 음성 부호화기용 DSP Core 설계에 관한 연구 (Design of a dedicated DSP core for speech coder using dual MACs)

  • 박주현
    • 한국음향학회:학술대회논문집
    • /
    • 한국음향학회 1995년도 제12회 음성통신 및 신호처리 워크샵 논문집 (SCAS 12권 1호)
    • /
    • pp.137-140
    • /
    • 1995
  • In the paper, CDMA's vocoder algorithm, QCELP, was analyzed. And, 16-bit programmable DSP core for QCELP was designed. When it is used two MACs in DSP, we can implement low-power DSP and estimate decrease of parameter computation speed. Also, we implemented in FIFO memory using register file to increase the access time of the data. This DSP was designed using logic synthesis tool, COMPASS, by top-down design methodology. Therefore, it is possible to cope with rapid change at mobile communication market.

  • PDF

성악다들의 목소리에 대한 Long Term Average Spectrum 분석 -$2^{nd}$ Singer's Formant의 존재 가능성에 대하여- (Long Term Average Spectrum Characteristics of Head and Chest Register Sounds of Western Operatic Singers : Extended Study)

  • 반재호;권영경;진성민
    • 대한후두음성언어의학회지
    • /
    • 제15권1호
    • /
    • pp.31-36
    • /
    • 2004
  • Background and Objectives : It has been shown that the epilaryngeal tube in the human airway is responsible for vocal ring, or the singer's formant. In previous study, authors showed that in trained tenors, besides the conventional singer's formant in the region of ,5500Hz, another energy peak was observed in the region of 8,000Hz. This peak was interpreted as the second resonance of the epilarynx tube. Singers in other voice categories who produce vocal ring are assumed to have the same peak, but no measurements have as yet been made. Materials and Methods : Fifteen tenors, fourteen baritones, seven sopranos and five mezzo sopranos attending the music college, department of vocal music who could reliably produce the head and chest registers were chosen for this study. Each subject was asked to produce an/ah/sound for at least three seconds for the head register sound(tenors ; G4, barions ; E4 sopranos ; F5 and mezzosopranos ; C5) and for the chest register sound (tenors ; C3, baritones ; D3, sopranos ; D4 and Mezzosoprano ; A3). The sound data was analyzed using the Fast Fourier Transform (FFT)-based power spectrum, Long term average(LTA) power spectrum using the FFT algorithm of the Computerized Speech Lab (CSL, Kay elemetrics, Model 4300B, USA). Statistical analysis was performed using the Mann-Whitney test of the Statistical Package for Social sciences(SPSS). Results : For head register sounds, a significant increase was seen in the 2,200-3,400Hz region(p<0.05) and the Similar to the head register sounds, there was a significant increase in energy in the four trained singer group compared with the untrained group in the 2,200-3,100Hz region(p<0.05), the 7,800-8,400Hz region(p<0.05) for the chest register sounds. Conclusions : When good vocal production was made for the head and chest registers, an energy peak was observed near 2,500Hz, a frequency already known as the "singer's formant', in all subjects in the study group. Another region of increased energy was observed around 8,000Hz that had not been noticed previously. The authors believe this region to be the second singer's formant.

  • PDF