• Title/Summary/Keyword: quality of pitch

Search Result 331, Processing Time 0.024 seconds

A Study on Multi-Pulse Speech Coding Method by using Individual Pitch Pulses (개별 피치펄스를 이용한 멀티펄스 음성부호화 방식에 관한 연구)

  • 이시우
    • Journal of the Korea Institute of Information and Communication Engineering
    • /
    • v.8 no.5
    • /
    • pp.977-982
    • /
    • 2004
  • In this paper, I propose a new method of Multi-Pulse Coding(IP-MPC) use individual pitch pulses in order to accommodate the changes in each pitch interval and reduce pitch errors. The extraction rate of individual pitch pulses was 85% for female voice and 96% for male voice respectively. 1 evaluate the MPC by using pitch information of autocorrelation method and the IP-MPC by using individual pitch pulses. As a result, I knew that synthesis speech of the IP-MPC was better in speech quality than synthesis speech of the MPC.

Pitch Modification based on a Voice Source Model (음원 모델에 기초한 합성음의 피치 조절)

  • Choi, Yong-Jin;Yeo, Su-Jin;Kim, Jin-Young;Sung, Koeng-Mo
    • Speech Sciences
    • /
    • v.3
    • /
    • pp.132-147
    • /
    • 1998
  • Previously developed methods for pitch modification have not been based on the voice source model. Therefore, the synthesized speech often sounds unnatural although it may be highly intelligible. The purpose of this paper is to analyze the alteration of a voice source signal with pitch period and to establish the pitch-modification rule based on the result of this analysis. We examine the alteration of the interval of closing phase, closed phase and open phase using the excitation waveform as the pitch increases. In comparison to the previous methods which performed directly on the speech signal, the pitch modification method based on a voice source model shows high intelligibility and naturalness. This study might benefit the application to the speaker identification and the voice color conversion. Therefore the proposed method will provide high quality synthetic speech.

  • PDF

An Algorithm to Reduce the Pitch Computational Complexity Using Modified Delta Searching in G.723.1 Vocoder (CELP 보코더에서 델타 피치 검색 방법 개선에 대한 연구)

  • Min, So-Yeon;Bae, Myung-Jin
    • Speech Sciences
    • /
    • v.11 no.3
    • /
    • pp.165-172
    • /
    • 2004
  • In this paper, we propose the computational complexity reduction methods of delta pitch search that is used in G.723.1 vocoder. In order to decrease the computational complexity in delta pitch search the characteristic of proposed algorithms is as the following. First, scheme to reduce the computational complexity in delta pitch search uses NAMDF. Developed the second scheme is the skipping technique of lags in pitch searching by using the threshold value. By doing so, we can reduce the computational amount of pitch searching more than 64% with negligible quality degradation.

  • PDF

A Study on the Pitch Detection of Speech Harmonics by the Peak-Fitting (음성 하모닉스 스펙트럼의 피크-피팅을 이용한 피치검출에 관한 연구)

  • Kim, Jong-Kuk;Jo, Wang-Rae;Bae, Myung-Jin
    • Speech Sciences
    • /
    • v.10 no.2
    • /
    • pp.85-95
    • /
    • 2003
  • In speech signal processing, it is very important to detect the pitch exactly in speech recognition, synthesis and analysis. If we exactly pitch detect in speech signal, in the analysis, we can use the pitch to obtain properly the vocal tract parameter. It can be used to easily change or to maintain the naturalness and intelligibility of quality in speech synthesis and to eliminate the personality for speaker-independence in speech recognition. In this paper, we proposed a new pitch detection algorithm. First, positive center clipping is process by using the incline of speech in order to emphasize pitch period with a glottal component of removed vocal tract characteristic in time domain. And rough formant envelope is computed through peak-fitting spectrum of original speech signal infrequence domain. Using the roughed formant envelope, obtain the smoothed formant envelope through calculate the linear interpolation. As well get the flattened harmonics waveform with the algebra difference between spectrum of original speech signal and smoothed formant envelope. Inverse fast fourier transform (IFFT) compute this flattened harmonics. After all, we obtain Residual signal which is removed vocal tract element. The performance was compared with LPC and Cepstrum, ACF. Owing to this algorithm, we have obtained the pitch information improved the accuracy of pitch detection and gross error rate is reduced in voice speech region and in transition region of changing the phoneme.

  • PDF

Analyzing the element of emotion recognition from speech (음성으로부터 감성인식 요소분석)

  • 심귀보;박창현
    • Journal of the Korean Institute of Intelligent Systems
    • /
    • v.11 no.6
    • /
    • pp.510-515
    • /
    • 2001
  • Generally, there are (1)Words for conversation (2)Tone (3)Pitch (4)Formant frequency (5)Speech speed, etc as the element for emotional recognition from speech signal. For human being, it is natural that the tone, vice quality, speed words are easier elements rather than frequency to perceive other s feeling. Therefore, the former things are important elements fro classifying feelings. And, previous methods have mainly used the former thins but using formant is good for implementing as machine. Thus. our final goal of this research is to implement an emotional recognition system based on pitch, formant, speech speed, etc. from speech signal. In this paper, as first stage we foun specific features of feeling angry from his words when a man got angry.

  • PDF

Performance Assessment of Several Established Pitch Detection Algorithms in Voices of Benign Vocal Fold Lesions (양성후두 질환 음성에 대한 여러 기존 피치검출 알고리즘의 성능 평가)

  • Jang, Seung-Jin;Choi, Seong-Hee;Kim, Hyo-Min;Choi, Hong-Shik;Yoon, Young-Ro
    • Proceedings of the IEEK Conference
    • /
    • 2007.07a
    • /
    • pp.407-408
    • /
    • 2007
  • Robust pitch estimation is an important study in many areas of speech processing. In voice pathology, diverse statistics extracted form pitch were commonly used to test voice quality. In this study, we compared several established pitch detection algorithms (PDAs) for verification of adequacy of the PDAs. In the database of total pathological voices of 99 and normal voices of 30, an analysis of errors related with pitch detection was evaluated between pathological and normal voices, or among the types of pathological voices such as benign vocal fold lesions; polyp, nodule, and cysts. Consequently, it is required to survey the severity of tested voice in order to obtain accurate pitch estimates.

  • PDF

Evaluation of the Availability of the Wide Coverage Volume Axial Mode in Pediatric Cardiac CT (소아 심장 CT검사 시 Wide Coverage Volume Axial Mode의 유용성 평가)

  • Park, Ki Seok;Kim, Dong Hyun
    • Journal of the Korean Society of Radiology
    • /
    • v.13 no.5
    • /
    • pp.683-689
    • /
    • 2019
  • We are trying to evaluate usefulness of Wide Coverage Volume Axial Mode by comparing and analyzing for exposure doses to patients and video quality extracted from two materials. First material is a computer tomography scanning of paediatric cardiac made by using High Pitch Mode, which is designed for diagnosing Congenital cardiac diseases. Second material is computer tomography scanning of paediatric cardiac made by using Wide Coverage Volume Axial Mode. When we did computer tomography scanning of paediatric cardiac, we used High Pitch Mode and Wide Coverage Volume Axial Mode to 50 patients of each, overall 100 patients. Also, we compared exposure doses to patients using videos got from each protocol. Then we compared video quality by calculating SNR and CNR by setting ROI of each. Not only exposure doses to patients were reduced by 13.07 %, but also SNR and CNR were improved when testing used Wide Coverage Volume Axial Mode rather than using High Pitch Mode. Wide Coverage Volume Axial Mode reduced testing time by using high-speed scanner. Furthermore, we can find out that Wide Coverage Volume Axial Mode is an useful method through improving video quality and reducing exposure doses to patients than using High Pitch Mode from ASiR-V, which is low-dose technology.

A Study on the Design and Validation of Automatic Pitch Rocker System for Altitude, Speed and Deep Stall Recovery (항공기의 고도, 속도 및 깊은 실속의 회복을 위한 자동회복장치 설계 및 검증에 관한 연구)

  • Kim, Chong-Sup
    • Journal of Institute of Control, Robotics and Systems
    • /
    • v.15 no.2
    • /
    • pp.240-248
    • /
    • 2009
  • Modem version of supersonic jet fighter aircraft must have been guaranteed appropriate controllability and stability in HAoA(High Angle of Attack). The HAoA flight control law have two parts, one is control law of departure prevention and the other is control law of departure recovery support. The control laws of departure prevention for advanced jet trainer consist of HAoA limiter, roll command limiter and rudder fader. The control laws of departure recovery support are consist of yaw-rate limiter and MPO(Manual Pitch Override) mode. The guideline of pitch rocking using MPO mode is simple, but operating skill of pitch rocking is very difficult by the pilot with inexperience of departure situation. Therefore, automatic deep stall recovery system is necessary. The system called the "Automatic Pitch Rocker System" or APRS, provided a pilot initiated automatic maneuver capable of an aircraft recoveries in situations of deep stall, speed and altitude. This paper addresses the design and validation for APRS to recovery of an deep stall without manual pitch rocking by the pilot. Also, this system is designed to recovery of speed, attitude and altitude after deep stall recovery using ATCS (Automatic Thrust Control System) and autopilot. Finally, this system is verified by real-time pilot evaluation using HQS (Handling Quality Simulator).

Image Quality and Dose Assessment According to Examination Mode during Head CT Examination (두부 CT 검사 시 검사 모드에 따른 화질 및 선량평가)

  • Gang, Heon-Hyo;Choi, Woo-Jeon;Kim, Dong-Hyun
    • Journal of the Korean Society of Radiology
    • /
    • v.15 no.4
    • /
    • pp.437-444
    • /
    • 2021
  • To evaluate the usefulness of Volume Axial Mode by comparing analyzing the exposure dose of the patients and the quality of each images from CT images obtained from high pitch mode using the local phantom or volume axial mode to determine the usefulness of he volume axial mode in diagnosing the head and cervical disease in adults. High Pitch Mode, Helical Mode, and Volume axial Mode as adult phantom were tested according to 70 kVp, 80 kVp, and 100 kVp tube voltages during an adult frontal CT scans. The equipment used was GE's Revolution (GE Healthcare, Wisconsin USA) model and iMED X-ray Phantom. The exposure dose of phantom was compared using the images obtained from each protocol, and the image quality was compared by calculating SNR and CNR by setting ROI on each image. When examined using Volume Axial Mode, the exposure dose of phantom was measured 17.12% lower than Helical Mode, 5.35% lower than High Pitch Mode, and both SNR and CNR were improved. Volume Axial Mode is a useful test that reduces investigation time without table movement using high speed rotary scanner, and in which exposure dose is reduced and image quality is improved by acquiring images in a short time of 0.28 seconds of phantom than using High Pitch Mode and Helical Mode. In addition, the fast testing time of Volume Axial Mode can be seen as the biggest advantage CT scans of emergency patients or patients with physical discomfort.

A 4800 BPS LPS Vocoder with Improved Exitation (개선된 여기신호의 4800BPS LPC 보코우터)

  • 은종관;성원용
    • The Journal of the Acoustical Society of Korea
    • /
    • v.1 no.1
    • /
    • pp.54-59
    • /
    • 1982
  • We present an improved 4800 bps LPC vocoder system that virtually eleminates the buzzy effect from synthetic speech. Excitation signal in the new system is formed by adding high-pass filtered pitch pulses or random noise to a baseband residual signal that has been coded by pitch predictive PCM. Since the baseband residual is used as a part of excitation, the system is also robust to V/UV and pitch errors. According to our informal listening tests, the synthetic speech of the new system does not have the buzzy effect. As a result the vocoder speech quality is more natural than that of a conventioinal LPC vocoder.

  • PDF