• Title/Summary/Keyword: Pause

The Modeling of Pause Duration For Text-To-Speech Synthesis System (TTS 시스템을 위한 휴지기간 모델링)

  • Chung Jihye;Lee Yanhee
    • Proceedings of the Acoustical Society of Korea Conference / spring / pp.83-86 / 2000
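  • This paper proposes a pause-location extraction and pause-duration prediction model to improve the naturalness of synthesized speech from a speech synthesis system that uses non-uniform units. The proposed pause-duration prediction model uses CART (Classification And Regression Trees), a tree-based modeling technique. To this end, we built a sentence speech database of 400 sentences in total, containing 6,220 word-phrase (eojeol) boundaries and uttered by a single male speaker, and determined the optimal tree from this database by V-fold cross-validation. In the evaluation, pause extraction accuracy was 81% overall: 83% for boundaries where a pause exists and 80% for boundaries where no pause exists. The multiple correlation coefficient between actual and predicted pause durations was 0.84, and accuracy within an error range of 20 ms was 88%. In addition, a listening test on synthesized speech with the predicted pause durations applied showed that it was broadly similar to natural speech.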

Prosodic Break Index Estimation using LDA and Tri-tone Model (LDA와 tri-tone 모델을 이용한 운율경계강도 예측)

  • 강평수;엄기완;김진영
    • The Journal of the Acoustical Society of Korea / v.18 no.7 / pp.17-22 / 1999
  • In this paper we propose a new mixed method of LDA and a tri-tone model to predict Korean prosodic break indices (PBI) for a given utterance. PBI can be used as an important cue to syntactic discontinuity in continuous speech recognition (CSR). The model consists of three steps. In the first step, PBI is predicted from syllable and pause duration information through linear discriminant analysis (LDA). In the second step, syllable tone information is used to estimate PBI: the syllable tones are coded by vector quantization (VQ) and PBI is estimated by a tri-tone model. In the last step, the two PBI predictors are integrated by a weight factor. The proposed method was tested on 200 literal style spoken sentences. The experimental results showed 72% accuracy.
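The final integration step can be illustrated with a minimal sketch. The abstract does not give the combination formula, so this assumes a simple linear interpolation of per-class scores; the function name `combine_pbi`, the score lists, and the default weight are hypothetical.

```python
def combine_pbi(lda_scores, tritone_scores, weight=0.5):
    """Blend two per-class PBI score lists and pick the best class.

    Assumption: scores are combined as
    weight * lda + (1 - weight) * tritone.
    The abstract only says the predictors are "integrated by a weight factor".
    """
    if not 0.0 <= weight <= 1.0:
        raise ValueError("weight must lie in [0, 1]")
    if len(lda_scores) != len(tritone_scores):
        raise ValueError("both predictors must score the same classes")
    combined = [
        weight * a + (1.0 - weight) * b
        for a, b in zip(lda_scores, tritone_scores)
    ]
    # Index of the highest combined score is the predicted break index.
    best = max(range(len(combined)), key=combined.__getitem__)
    return best, combined


# Hypothetical scores for break indices 0, 1, 2 at one boundary.
best, combined = combine_pbi([0.2, 0.7, 0.1], [0.6, 0.3, 0.1], weight=0.5)
```

With `weight=1.0` only the LDA predictor is used and with `weight=0.0` only the tri-tone predictor, so the weight can be tuned on held-out data.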

Perspective Coherence in Simultaneous Interpreting - with Reference to German-Korean Interpreting - (동시통역과 시각적 응집성 - 독한 통역을 중심으로 -)

  • Ahn In-Kyoung
    • Koreanische Zeitschrift fur Deutsche Sprachwissenschaft / v.9 / pp.169-193 / 2004
  • In simultaneous interpreting, if the syntactic structures of the source language and the target language are very different, interpreters have to wait before being able to reformulate the source text segments into a meaningful utterance in the target language. It is inevitable to adapt the target language structure to that of the source language so as not to unduly increase the memory load and to minimize the pause. While such adaptation enables simultaneous interpreting, it damages the perspective coherence of the text. Discovering when such perspective coherence is impaired, and how the problem can be relieved, will enable interpreters to enhance their performance. This paper analyses the reasons for perspective coherence damage by looking at some examples of German-Korean simultaneous interpreting.

The implementation of home-server for intelligent Personal Video Recorder (지능형 PVR의 원격제어를 위한 홈 서버 구현)

  • Son, Kang-Sun;Oh, Young-Ho;Kim, Dae-Jin
    • Proceedings of the KIEE Conference / 2004.11c / pp.414-416 / 2004
  • The intelligent PVR (Personal Video Recorder) is an enhanced PVR that provides viewers with some advanced features in addition to the pause, instant replay, search and skip forward functions found in conventional PVRs. By embedding a home server into a PVR, it is possible for an intelligent PVR to provide a powerful web-based management user interface constructed using HTML, graphics and other features common to web browsers. When applied to other embedded systems, web technologies offer graphical user interfaces which are user-friendly, inexpensive, cross-platform and network-ready. The purpose of this paper is to introduce the implementation of an intelligent PVR that is controlled over the internet. We present the architecture of a home server with a simple but powerful web-based network interconnection.

Reverse Trick Mode Algorithms of MPEG Bit Stream (MPEG Bit Stream에서의 Reverse Trick Mode 알고리즘)

  • 신성욱;이동호
    • Proceedings of the IEEK Conference / 2001.09a / pp.689-692 / 2001
  • A PVR (personal video recorder) for DTV with a built-in large-capacity hard disk needs to support not only recording and playback of the received broadcast stream but also various additional functions. One of these is trick mode play, such as fast-forward play, reverse play, and pause, which is familiar to users of conventional analog VCRs. However, because MPEG video is compressed by exploiting the correlation between pictures, a frame cannot be reproduced independently unless it is an intra frame, which makes trick mode play difficult to implement. Reverse trick mode is especially difficult because the last frame of the original stream must be displayed first, which requires an entire GOP to be decoded. This paper introduces several algorithms for implementing reverse trick mode and analyzes their system complexity, memory usage, and performance.
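The difficulty described above, that a whole GOP must be decoded before its last frame can be shown, can be sketched as follows. This is a simplified illustration and not one of the paper's algorithms: frames are hypothetical `(type, id)` tuples, every GOP is assumed to start at an I-frame and be closed, display order is assumed equal to stream order (no B-frame reordering), and decoding is stood in for by copying the list.

```python
def split_into_gops(frames):
    """Group frames into GOPs, starting a new GOP at every I-frame."""
    gops = []
    for frame in frames:
        if frame[0] == "I" or not gops:
            gops.append([])
        gops[-1].append(frame)
    return gops


def reverse_play(frames):
    """Yield frames in reverse display order, one GOP at a time."""
    for gop in reversed(split_into_gops(frames)):
        # Stand-in for forward-decoding the whole GOP into a frame buffer;
        # a real decoder must do this before the last frame is available.
        decoded = list(gop)
        yield from reversed(decoded)


# Hypothetical stream of two GOPs.
stream = [("I", 0), ("P", 1), ("P", 2), ("I", 3), ("P", 4)]
reversed_frames = list(reverse_play(stream))
```

In this naive scheme the buffer must hold one full decoded GOP, which is the memory cost that reverse trick mode algorithms try to reduce.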

Prevention of Diapause in Bivoltine Eggs of the Silkworm, Bombyx mori, L., through a Cross with the Race KS-10 as Male Parent

  • Mundkur, Rajendra;Murthy, Mallesha;Mahadevappa;Raghuraman, R.;Bongale, U.D.
    • International Journal of Industrial Entomology and Biomaterials / v.9 no.1 / pp.107-109 / 2004
  • The present investigation reports a phenomenon hitherto unknown in tropical sericulture, wherein the diapause nature of bivoltine eggs is overcome through a cross with a non-diapausing race of the silkworm, Bombyx mori, L. Eggs of the bivoltine silkworm Bombyx mori, L. generally do not hatch under tropical conditions. To prevent diapause, they are subjected to acid treatment or a low temperature hibernation schedule. A race developed at KSSRDI is found to prevent the diapause nature of bivoltine eggs when crossed as male parent, without any acid treatment or hibernation schedule. This phenomenon, reported here for the first time, is unique and opens up an interesting area of research in silkworm genetics with commercial implications for the industry.

The proficiency-based and integrated teaching of High School English reading and listening based on sense group and utterance restructuring (의미군과 발화의 재구조에 의한 고등학교 영어 읽기와 듣기의 수준별 통합 지도)

  • Lee, Sun-Beom
    • Proceedings of the KSPS conference / 2004.05a / pp.245-249 / 2004
  • The aim of this paper is to show the possibilities of the proficiency-based and integrated teaching of High School English reading and listening based on sense group and utterance restructuring. The proficiency-based and integrated listening and reading activities proceed in the following stages. Step 1: students fill in the blanks with strong or weak sounding words according to their abilities. Step 2: students speak along (track) based on restructuring and post-lexical phenomena while listening to the sentence. Step 3: students read and directly understand the passage, in which the differentiated places where a native speaker of English would be at all likely to pause have been marked. Students need to listen to spoken English so that they recognize words in written and spoken form. They must be familiar with suprasegmental features, stress and rhythm, and post-lexical phenomena during reading activities.

Instantaneous torque control of induction motor with line-to-line voltage modulation on the variable load. (가변 부하시 선간전압 변조방식을 이용한 유도전동기의 순시 Torque 제어)

  • No, J.H.;Woo, J.I.;Lee, H.W.
    • Proceedings of the KIEE Conference / 1991.07a / pp.645-648 / 1991
  • In conventional sinusoidal wave PWM control, torque oscillation is a problem on account of harmonic components. This paper deals with the choice of a line-to-line voltage modulation method, which is effective in using the DC (direct current) source voltage and in controlling harmonic oscillation, and with a pattern that reduces switching loss through a 1/3 pause of the switching interval. It also deals with the valid realization of harmonic component control and high torque response on variable load, by simulation and experiments, to compensate for the problems that occur when line-to-line modulation is applied to a PWM inverter.

A study on the Suprasegmental Parameters Exerting an Effect on the Judgment of Goodness or Badness on Korean-spoken English (한국인 영어 발음의 좋음과 나쁨 인지 평가에 영향을 미치는 초분절 매개변수 연구)

  • Kang, Seok-Han;Rhee, Seok-Chae
    • Phonetics and Speech Sciences / v.3 no.2 / pp.3-10 / 2011
  • This study investigates the role of suprasegmental features with respect to the intelligibility of Korean-spoken English judged by Korean and English raters as being good or bad. It has been hypothesized that Korean raters would have different evaluations from English native raters and that the effect may vary depending on the types of suprasegmental factors. Four Korean and four English native raters, respectively, took part in the evaluation of 14 Korean subjects' English speaking. The subjects read a given paragraph. The results show that the evaluation for 'intelligibility' is different for the two groups and that the difference comes from their perception of L2 English suprasegmentals.

Hand gesture recognition for player control

  • Shi, Lan Yan;Kim, Jin-Gyu;Yeom, Dong-Hae;Joo, Young-Hoon
    • Proceedings of the KIEE Conference / 2011.07a / pp.1908-1909 / 2011
  • Hand gesture recognition has been widely used in virtual reality and HCI (Human-Computer Interaction) systems, and is a challenging and interesting subject in the vision-based area. The existing approaches for vision-driven interactive user interfaces resort to technologies such as head tracking, face and facial expression recognition, eye tracking and gesture recognition. The purpose of this paper is to combine the finite state machine (FSM) and the gesture recognition method in order to control Windows Media Player with commands such as play/pause, next, previous, and volume up/down.
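The pairing of an FSM with recognized gestures can be sketched as a small transition table. This is an illustrative sketch only: the gesture labels (`open_palm`, `swipe_right`, and so on), the two states, and the class name `PlayerFSM` are hypothetical, and the returned command strings stand in for whatever calls actually drive the player.

```python
# (state, gesture) -> (next state, player command); all names are hypothetical.
TRANSITIONS = {
    ("paused", "open_palm"): ("playing", "play"),
    ("playing", "open_palm"): ("paused", "pause"),
    ("playing", "swipe_right"): ("playing", "next"),
    ("playing", "swipe_left"): ("playing", "previous"),
    ("playing", "hand_up"): ("playing", "volume_up"),
    ("playing", "hand_down"): ("playing", "volume_down"),
    ("paused", "hand_up"): ("paused", "volume_up"),
    ("paused", "hand_down"): ("paused", "volume_down"),
}


class PlayerFSM:
    """Finite state machine that maps recognized gestures to commands."""

    def __init__(self, state="paused"):
        self.state = state

    def on_gesture(self, gesture):
        # Gestures with no transition from the current state are ignored.
        next_state, command = TRANSITIONS.get(
            (self.state, gesture), (self.state, None)
        )
        self.state = next_state
        return command


fsm = PlayerFSM()
commands = [fsm.on_gesture(g) for g in ["open_palm", "swipe_right", "open_palm"]]
```

Keeping the state in the FSM means the same gesture can toggle play and pause, and gestures that make no sense in the current state (for example `swipe_right` while paused) are dropped rather than forwarded to the player.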
