• Title/Summary/Keyword: Speech reconstruction

VoIP Receiver Structure for Enhancing Speech Quality Based on Telematics (텔레메틱스 기반의 VoIP 음성 통화품질 향상을 위한 수신단 구조)

  • Kim, Hyoung-Gook;Seo, Kwang-Duk
    • The Journal of The Korea Institute of Intelligent Transport Systems / Vol. 11, No. 3 / pp. 48-54 / 2012
  • The quality of real-time voice communication over telematics-based Internet Protocol networks is affected by network impairments such as delay, jitter, and packet loss. To address this issue, this paper proposes a receiver-based method for enhancing VoIP speech quality. The proposed method delivers high-quality voice through playout control and signal reconstruction, which consist of concealment of lost packets, adaptive playout-buffer scheduling using active jitter estimation, and smooth interpolation between two signals in a transition region. The proposed algorithm achieves higher Perceptual Evaluation of Speech Quality (PESQ) scores and lower buffering delay than the reference algorithm.
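
The abstract does not say how the active jitter estimation works. As a rough illustration only, the sketch below uses the running interarrival-jitter estimator in the style of RFC 3550 and a hypothetical rule that sizes the playout delay from it. The function names, `base_delay`, and `safety_factor` are assumptions for illustration and are not taken from the paper.

```python
def interarrival_jitter(send_times, recv_times):
    """Running jitter estimate in the style of RFC 3550.

    Both lists hold per-packet timestamps in the same unit (e.g. ms).
    Returns the jitter estimate after each packet from the second onward.
    """
    jitter = 0.0
    history = []
    for i in range(1, len(send_times)):
        # Change in transit time between consecutive packets.
        d = (recv_times[i] - recv_times[i - 1]) - (send_times[i] - send_times[i - 1])
        # First-order smoothing with gain 1/16.
        jitter += (abs(d) - jitter) / 16.0
        history.append(jitter)
    return history


def playout_delay(jitter, base_delay=20.0, safety_factor=4.0):
    """Hypothetical rule: buffer a base delay plus a multiple of the jitter."""
    return base_delay + safety_factor * jitter


# Packets sent every 20 ms; the third one arrives 16 ms late.
send = [0.0, 20.0, 40.0, 60.0]
recv = [50.0, 70.0, 106.0, 110.0]
estimates = interarrival_jitter(send, recv)
print(estimates, playout_delay(estimates[-1]))
```

A larger `safety_factor` trades extra buffering delay for fewer late packets, which is the tension the abstract describes between PESQ and buffering delay.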

A Low-Delay MDCT/IMDCT

  • Lee, Sangkil;Lee, Insung
    • ETRI Journal / Vol. 35, No. 5 / pp. 935-938 / 2013
  • This letter presents an algorithm for achieving a low delay in the modified discrete cosine transform (MDCT) and inverse MDCT (IMDCT). Conventional implementations of the MDCT and IMDCT require a 50% overlap-add (OLA) for perfect reconstruction. The OLA process incurs an algorithmic delay of one frame length. A reduced-overlap window and MDCT/IMDCT phase shifting are used to reduce this algorithmic delay. The performance of the proposed algorithm is evaluated by applying the low-delay MDCT to the G.729.1 speech codec.
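
To make the conventional baseline concrete, here is a minimal sketch of the standard MDCT/IMDCT with a sine window and 50% overlap-add. It shows only the conventional scheme that the letter sets out to improve, not the proposed low-delay variant. The function names and the direct O(N²) summation are illustrative choices.

```python
import math


def sine_window(frame_len):
    """Sine window; satisfies w[n]^2 + w[n + N]^2 = 1 for frame_len = 2N."""
    return [math.sin(math.pi * (n + 0.5) / frame_len) for n in range(frame_len)]


def mdct(frame):
    """Direct MDCT: 2N input samples -> N coefficients."""
    n_half = len(frame) // 2
    return [
        sum(frame[n] * math.cos(math.pi / n_half * (n + 0.5 + n_half / 2) * (k + 0.5))
            for n in range(2 * n_half))
        for k in range(n_half)
    ]


def imdct(coeffs):
    """Direct IMDCT: N coefficients -> 2N time-aliased samples (scale 2/N)."""
    n_half = len(coeffs)
    return [
        2.0 / n_half * sum(coeffs[k] * math.cos(math.pi / n_half * (n + 0.5 + n_half / 2) * (k + 0.5))
                           for k in range(n_half))
        for n in range(2 * n_half)
    ]


def analysis_synthesis(signal, n_half):
    """Window, transform, inverse transform, window again, and overlap-add."""
    window = sine_window(2 * n_half)
    output = [0.0] * len(signal)
    for start in range(0, len(signal) - 2 * n_half + 1, n_half):
        frame = [signal[start + n] * window[n] for n in range(2 * n_half)]
        rebuilt = imdct(mdct(frame))
        for n in range(2 * n_half):
            output[start + n] += rebuilt[n] * window[n]
    return output


signal = [math.sin(0.3 * n) + 0.1 * n for n in range(32)]
output = analysis_synthesis(signal, 4)
# Samples covered by two overlapping frames are recovered; the edges are not.
print(max(abs(output[n] - signal[n]) for n in range(4, 28)))
```

A sample is reconstructed only once the following frame has been decoded and added, which is why the overlap contributes algorithmic delay.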

The Performance Improvement of G.729 PLC in Situation of Consecutive Frame Loss (연속적인 프레임 손실 상황에서의 G.729 PLC 성능개선)

  • Hong, Seong-Hoon;Kim, Jin-Woo;Bae, Myung-Jin
    • The Journal of the Acoustical Society of Korea / Vol. 29, No. 1 / pp. 34-40 / 2010
  • As the internet has spread, a variety of internet-based services have become available. One of these is the internet phone, whose use is growing because of its cost advantage. However, its speech quality is degraded because it uses packet switching, whereas the conventional telephone uses circuit switching. Although vocoders use a PLC (Packet Loss Concealment) algorithm, such algorithms are weak against consecutive packet loss. In this paper, we propose methods to reduce the degradation of speech quality under consecutive packet loss, using the PLC algorithms used in advanced G.729 and G.711. The proposed methods are LP (Linear Prediction) parameter interpolation, excitation signal reconstruction, and excitation signal gain reconstruction. As a result, the proposed method improves performance by about 11%.
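
The abstract names LP parameter interpolation but gives no details. One plausible reading, sketched here purely as an assumption, is to fill a run of lost frames by linearly interpolating a parameter vector between the last good frame and the next good frame. This requires the next good frame to be buffered already at the receiver. The function name and the plain-list representation are hypothetical.

```python
def interpolate_lost_frames(prev_params, next_params, num_lost):
    """Linearly interpolate parameter vectors for num_lost consecutive lost frames.

    prev_params: parameters of the last correctly received frame.
    next_params: parameters of the next correctly received frame
                 (assumed to be available in the receive buffer).
    """
    if len(prev_params) != len(next_params):
        raise ValueError("parameter vectors must have the same length")
    frames = []
    for i in range(1, num_lost + 1):
        t = i / (num_lost + 1)  # position of the lost frame between the two good frames
        frames.append([(1 - t) * p + t * q for p, q in zip(prev_params, next_params)])
    return frames


# Three lost frames between two received parameter vectors.
print(interpolate_lost_frames([0.0, 1.0], [4.0, 5.0], 3))
```

In a real codec the interpolation would typically be done in a domain such as line spectral frequencies so that the resulting filter stays stable; that detail is outside this sketch.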

Reconstruction of Pharyngolaryngeal Defects with the Ileocolon Free Flap: A Comprehensive Review and How to Optimize Outcomes

  • Escandon, Joseph M.;Santamaria, Eric;Prieto, Peter A.;Duarte-Bateman, Daniela;Ciudad, Pedro;Pencek, Megan;Langstein, Howard N.;Chen, Hung-Chi;Manrique, Oscar J.
    • Archives of Plastic Surgery / Vol. 49, No. 3 / pp. 378-396 / 2022
  • Several reconstructive methods have been reported to restore the continuity of the aerodigestive tract following resection of pharyngeal and hypopharyngeal cancers. However, high complication rates have been reported after voice prosthesis insertion. In this setting, the ileocolon free flap (ICFF) offers a tubularized flap for reconstruction of the hypopharynx while providing a natural phonation tube. Herein, we systematically reviewed the current evidence on the use of the ICFF for reconstruction of the aerodigestive tract. A systematic literature search was conducted across PubMed MEDLINE, Web of Science, ScienceDirect, Scopus, and Ovid MEDLINE(R). Data on the technical considerations and surgical and functional outcomes were extracted. Twenty-one studies were included. The mean age and follow-up were 54.65 years and 24.72 months, respectively. An isoperistaltic or antiperistaltic standard ICFF, patch flap, or chimeric seromuscular-ICFF can be used depending on the patient's needs. The seromuscular chimeric flap is useful to augment the closure of the distal anastomotic site. The maximum phonation time, frequency, and sound pressure level (dB) were higher with ileal segments of 7 to 15 cm. The incidence of postoperative leakage ranged from 0 to 13.3%, and the majority occurred at the coloesophageal junction. The revision rate of the microanastomosis ranged from 0 to 16.6%. The ICFF provides a reliable and versatile alternative for reconstruction of medium-sized defects of the aerodigestive tract. Its three-dimensional configuration and functional anatomy encourage early speech and deglutition without a prosthetic valve and with minimal donor-site morbidity.

A Study on the Diphone Recognition of Korean Connected Words and Eojeol Reconstruction (한국어 연결단어의 이음소 인식과 어절 형성에 관한 연구)

  • ;Jeong, Hong
    • The Journal of the Acoustical Society of Korea / Vol. 14, No. 4 / pp. 46-63 / 1995
  • This thesis describes an unlimited-vocabulary connected speech recognition system using Time Delay Neural Networks (TDNN). The recognition unit is the diphone, which includes the transition section between two phonemes, and there are 329 diphone units. Recognition of Korean connected speech consists of three parts: feature extraction from the input speech signal, diphone recognition, and post-processing. In the feature extraction stage, diphone intervals are extracted from the input speech signal, and feature vectors of 16th-order filter-bank coefficients are then calculated for each frame in the diphone interval. Diphone recognition has a three-stage hierarchical structure and is carried out using 30 Time Delay Neural Networks. In particular, the structure of the TDNN is modified to increase the recognition rate. In the post-processing stage, misrecognized diphone strings are corrected using the phoneme transition probability and the phoneme confusion probability, and the eojeols (Korean words or phrases) are then formed by combining the recognized diphones.

Improvement of phonetic function using modified two-flap palatoplasty and velar myoplasty : Report of a case (변형 피판 구개성형술 및 구개내 근육성형술의 언어기능의 개선 : 증례보고)

  • Yi, Ho;Myoung, Hoon;Choi, Jin-Young;Lee, Jong-Ho;Choung, Pil-Hoon;Kim, Myung-Jin;Seo, Byoung-Moo
    • Korean Journal of Cleft Lip And Palate / Vol. 9, No. 2 / pp. 79-84 / 2006
  • Cleft palate is one of the most devastating congenital facial deformities and is frequently accompanied by cleft lip. In many cases, it causes phonetic and swallowing difficulties even after surgical intervention. Among the surgical methods, Veau-Wardill-Kilner pushback palatoplasty (V-Y reposition) is widely used in most cleft palate cases. It is designed to lengthen the palate posteriorly and thereby overcome speech and swallowing problems, but a broad postoperative palatal scar might interfere with normal maxillary growth. If the velar muscles are not reoriented, speech recovery may be incomplete. In this case report, the modified two-flap palatoplasty with minimal pushback was successfully applied to a 21-month-old girl with an incomplete cleft palate extending to the posterior third of the hard palate. Speech evaluation confirmed that functional reconstruction of the cleft palate was achieved.

Wavelet-based Algorithm for Signal Reconstruction (신호 복원을 위한 웨이브렛기반 알고리즘)

  • Bae, Sang-Bum;Kim, Nam-Ho
    • Journal of the Korea Institute of Information and Communication Engineering / Vol. 11, No. 1 / pp. 150-156 / 2007
  • Noise arises from several causes when a signal is processed. It produces errors during data transmission and lowers the recognition rate of image and speech data. Accordingly, a variety of methods for reconstructing a signal after removing such noise have been studied. Recently, the wavelet transform, which offers time-frequency localization and supports multiresolution analysis, has been applied in many fields of technology, and threshold-based and correlation-based methods have been proposed for removing noise. However, conventional methods mistake a lot of noise for edges and cannot remove additive white Gaussian noise (AWGN) and impulse noise at the same time. Therefore, in this paper we propose a new wavelet-based algorithm for reconstructing signals degraded by noise and compare it with conventional methods.
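
For context, the conventional threshold-based approach mentioned in the abstract can be sketched as follows: a single-level Haar wavelet transform, soft thresholding of the detail coefficients, and the inverse transform. This illustrates the baseline family of methods, not the algorithm proposed in the paper. The choice of the Haar wavelet, the single decomposition level, and the threshold value are assumptions.

```python
import math

SCALE = 1.0 / math.sqrt(2.0)


def haar_forward(signal):
    """Single-level Haar transform of an even-length signal."""
    approx = [(signal[i] + signal[i + 1]) * SCALE for i in range(0, len(signal), 2)]
    detail = [(signal[i] - signal[i + 1]) * SCALE for i in range(0, len(signal), 2)]
    return approx, detail


def haar_inverse(approx, detail):
    """Inverse of haar_forward."""
    output = []
    for a, d in zip(approx, detail):
        output.extend([(a + d) * SCALE, (a - d) * SCALE])
    return output


def soft_threshold(coeffs, threshold):
    """Shrink each coefficient toward zero by threshold; small ones become zero."""
    return [math.copysign(max(abs(c) - threshold, 0.0), c) for c in coeffs]


def denoise(signal, threshold):
    """Threshold only the detail coefficients, then reconstruct."""
    approx, detail = haar_forward(signal)
    return haar_inverse(approx, soft_threshold(detail, threshold))


noisy = [1.0, 1.2, 5.0, 4.8]
print(denoise(noisy, 0.5))
```

Because thresholding treats every large detail coefficient as signal, an impulse survives it while a genuine edge may be blurred, which is the limitation of conventional methods that the abstract points out.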

IMPROVING THE SPEECH INTELLIGIBILITY IN AN AIR-TRAFFIC CONTROL ROOM

  • Pavuza, Franz G.;Beszedics, Geza W.;Pichler, Heinrich
    • Proceedings of the Acoustical Society of Korea Conference / Acoustical Society of Korea, 1994, Fifth Western Pacific Regional Acoustics Conference, Seoul, Korea / pp. 912-918 / 1994
  • Poor speech intelligibility in an air traffic control room frequently results from many quite different causes and occasionally leads to complaints from the controller personnel. The paper describes a sequence of successful tasks performed in a local control room. The initial measurements included an investigation of the background noise (caused by fans, air conditioning, computer and radar equipment) and performance checks of the electronic audio and communication equipment with respect to audio transmission behavior. The spectral composition of the noise, as well as the characteristics of the audio communication path between the controllers and the pilots (which showed a loss of spectral information in the audio band due to built-in notch filters for the suppression of control tones), required adaptations of the amplitude behavior of the amplifiers through user-adjustable tone controls. The radar console fans, which contributed significantly to the overall noise floor of the room, underwent a substantial reconstruction in which the tight mounting was replaced with an elastic double suspension, reducing the noise level by 50%. Finally, a possible source of premature fatigue of the controllers during their working hours was found in strong spectral components of the noise above the audio band, radiated by numerous video monitors in the control room through vibrating components excited by the line frequency of the video signal.

Acoustical Analysis of Phonological Reduction in Conversational Japanese (일본어 회화문에 나타난 축약형의 음운론적 해석과 음향음성학적 분석)

  • Choi, Young-Sook
    • Speech Sciences / Vol. 8, No. 4 / pp. 229-241 / 2001
  • Using eighteen texts from various genres of present-day Japanese, I collected phonologically reduced forms frequently observed in conversational Japanese and classified them in search of a unified explanation of the phonological phenomena. I found 7,516 cases of reduced forms, which I divided into 43 categories according to the types of phonological change they have undergone. The general tendency is that deletion and fusion of a phoneme or an entire syllable take place frequently, resulting in a decrease in the number of syllables. From a morphosyntactic point of view, phonological reduction often occurs at NP and VP morpheme boundaries. The following findings are drawn from phonetic observations of reduction. (1) Vowels are more easily deleted than consonants. (2) Bilabials ([m], [b], and [w]) are the most likely candidates for deletion. (3) In a concatenation of vowels, closed vowels are absorbed into open vowels, or two adjacent vowels come to create another vowel, in which case reconstruction of the original sequence is not always predictable. (4) Alveolars are palatalized under the influence of front vowels. (5) Regressive assimilation takes place in a syllable starting with [r], changing the entire syllable into a phonological choked sound or a syllabic nasal, depending on the voicing of the following phoneme.

LONG-TERM ANALYSIS OF RECONSTRUCTED TEMPOROMANDIBULAR JOINT AND MANDIBLE USING FREE FIBULAR FLAP (비골 피판을 이용한 하악 및 하악과두 재건의 장기간 임상적 평가)

  • Ahn, Kang-Min;Chung, Hun-Jong;Ryom, Hak-Ryol;Kim, Hang-Jin;Kim, Yoon-Tae;Hwang, Soon-Jung;Myoung, Hoon;Kim, Myung-Jin;Kim, Soung-Min;Jahng, Jeong-Won;Lee, Jong-Ho
    • Journal of the Korean Association of Oral and Maxillofacial Surgeons / Vol. 31, No. 5 / pp. 409-416 / 2005
  • Purpose of study: The temporomandibular joint (TMJ) plays a key functional role in mastication and contributes to normal deglutition and speech as well as cosmesis. When a large portion of the mandible including the condylar head is resected, it is very difficult to reconstruct it as a functional unit. In this retrospective study, we present the functional, radiographic, and cosmetic results of temporomandibular joint reconstruction using the free fibular flap. Patients and Methods: A total of 12 patients (M:F = 6:6) who underwent condylar reconstruction with the fibular flap were interviewed and examined by radiographs and Bio-PAK®. The mean follow-up period was 47.7 ± 20.0 months and the average age was 38.7 ± 15.3 years. Remodeling of the condyle and function of the TMJ were evaluated, and facial contour was judged subjectively. Results: All flaps were viable and no immediate postoperative complications occurred. One patient showed decreased mouth opening, so interpositional gap arthroplasty was performed. The resorption rates of the reconstructed fibula were minimal, and the condylar heads had changed into dome-shaped neocondyles after 2 years. All patients had a normal diet and no speech difficulty was reported. Nine patients were satisfied with their facial contour, but three patients complained about depression of the cheek. Conclusion: Reconstruction of the TMJ with the free fibular flap was a reliable method and a very effective means of restoring mandibular function. The functional and morphologic results were excellent, with few complications.