Search | Korea Science

Design of Wideband Speech Coder Compatible with CS-ACELP (CS-ACELP와 호환성을 갖는 광대역 음성 부호화기 설계)

김동주;이인성
- The Journal of the Acoustical Society of Korea
- /
- v.19 no.4
- /
- pp.52-57
- /
- 2000
In this paper, we designed the 16 Kbps speech coder that has compatibility with CS-ACELP algorithm(G.729). The speech signal is sampled at rate of 16 KHz, divided into two narrowband signal by QMF filterbank, and decimated to rate of 8 KHz. The lower-band signal is encoded by CS-ACELP and the upper-band signal is encoded by Adaptive Transform Coding(ATC) algorithm. At the receiver, two band signals are synthesized by decoder of CS-ACELP and ATC, respectively. The reconstructed output is obtained by passing the QMF synthesis bank. The proposed wideband coder is evaluated with ITU-T G.722 coder through the Mean Opinion Score(MOS) test.
PDF

Tone Quality Improvement Algorithm using Intelligent Estimation of Noise Pattern (잡음 패턴의 지능적 추정을 통한 음질 개선 알고리즘)

Seo, Joung-Kook;Cha, Hyung-Tai
- Journal of the Korean Institute of Intelligent Systems
- /
- v.15 no.2
- /
- pp.230-235
- /
- 2005
In this paper, we propose an algorithm that improves a tone quality of a noisy audio signal in order to enhance a performance of perceptual filter using intelligent estimation of noise pattern from a band degraded by additive noise. The proposed method doesn't use the estimated noise which is obtained from silent range. Instead new estimated noise according to the power of signal and effect of noise variation is considered for each frame. So the noisy audio signal is enhanced by the method which controls a estimation of noise Pattern effectively in a noise corruption band. To show the performance of the proposed algorithm, various input signals which had a different signal-to-noise ratio(SNR) such as $5\cal{dB},\;10\cal{dB},\;15\cal{dB}\;and\;20\cal{dB}$ were used to test the proposed algorithm. we carry out SSNR and NMR of objective measurement and MOS test of subjective measurement. An approximate improvement of $7.4\cal{dB},\;6.8\cal{dB},\;5.7\cal{dB},\;5.1\cal{dB}$ in SSNR and $15.7\cal{dB},\;15.5\cal{dB},\;15.2\cal{dB},\;14.8\cal{dB}$ in NMR is achieved with the input signals, respectively. And we confirm the enhancement of tone quality in terms of mean opinion score(MOS) test which is result of subjective measurement.
https://doi.org/10.5391/JKIIS.2005.15.2.230 인용 PDF KSCI

Audio Stream Delivery Using AMR(Adaptive Multi-Rate) Coder with Forward Error Correction in the Internet (인터넷 환경에서 FEC 기능이 추가된 AMR음성 부호화기를 이용한 오디오 스트림 전송)

김은중;이인성
- The Journal of Korean Institute of Communications and Information Sciences
- /
- v.26 no.12A
- /
- pp.2027-2035
- /
- 2001
In this paper, we present an audio stream delivery using the AMR (Adaptive Multi-Rate) coder that was adopted by ETSI and 3GPP as a standard vocoder for next generation IMT-2000 service in which includes combined sender (FEC) and receiver reconstruction technique in the Internet. By use of the media-specific FEC scheme, the possibility to recover lost packets can be much increased due to the addition of repair data to a main data stream, by which the contents of lost packets can be recovered. The AMR codec is based on the code-excited linear predictive (CELP) coding model. So we use a frame erasure concealment for CELP-based coders. The proposed scheme is evaluated with ITU-T G.729 (CS-ACELP) coder and AMR - 12.2 kbit/s through the SNR (Signal to Noise Ratio) and the MOS (Mean Opinion Score) test. The proposed scheme provides 1.1 higher in Mean Opinion Score value and 5.61 dB higher than AMR - 12.2 kbit/s in terms of SNR in 10% packet loss, and maintains the communicab1e quality speech at frame erasure rates lop to 20%.
PDF

Video Quality Metric Using One-Dimensional Histograms of Motion Vectors (움직임 벡터의 1차원 히스토그램을 이용한 비디오 화질 평가 척도)

Han, Ho-Sung;Kim, Dong-O;Park, Bae-Hong;Sim, Dong-Gyu
- Journal of the Institute of Electronics Engineers of Korea SP
- /
- v.45 no.2
- /
- pp.21-28
- /
- 2008
This paper proposes a novel reduced-reference assessment method for video quality assessment, in which one-dimensional (1-D) histograms of motion vectors (MVs) are used as features of videos. The proposed method is more efficient than the conventional methods in view of computation time, because the proposed quality metric decodes MVs directly from video stream in the parsing process instead of reconstructing the distorted video at the receiver. Moreover, in view of data size, the propose method is efficient because a sender transmits 1-D histograms of MVs accumulated over whole input video sequences. Here, we use 1-D histograms of MVs accumulated over the whole video sequences, which is different from the conventional methods that assessed each image independently. For testing the similarity between histograms, we use histogram intersection and histogram difference methods. We compare the proposed method with the conventional methods for 52 video clips, which are coded under varying bit rate, image size, and frame rate. Experimental results show that the proposed method is more efficient than the conventional methods and that the proposed method is more similar to the mean opinion score (MOS) than conventional algorithms.
PDF KSCI

Recoverable Oil Contents and Quality Evaluation of Reconstitute Orange Juice by Electronic Nose (전자코를 이용한 오렌지주스의 Recoverable Oil 함량 및 품질평가)

Lee Seung-Youp;Park Jong-Dae
- Food Science and Preservation
- /
- v.12 no.4
- /
- pp.361-366
- /
- 2005
An electronic nose equipped with metal oxide sensor(MOS) was used for investigating the quality of reconstitute orange juice added different recoverable oil(cold pressed valencia oil) contents during 21 days of storage at $4^{\circ}C$. Quality changes in orange juice was described in terms of the sensitivity(${\Delta}R_{gas}/R_{air}$) of the sensors. Principal component analysis(PCA) was carried out using data obtained from twelve metal oxide sensors. The flavor of orange juice contained with the different recoverable oil contents($0.01\%{\sim}0.05\%$) was separated in PCA plot, in which the first principal component score was correlated with the content of recoverable oil. As storage periods prolonged, no significantly different sensitivity score of orange juice was observed in electronic nose. The content of recoverable oil in orange juice was reduced rapidly within 14 days, and then the decreasing ratio was slow on the next 7 days during storage at $4^{\circ}C$. The sensory score for overall and orange flavor of orange juice added $0.03\%$ recoverable oil was decreased during the 14 days and then rapidly dropped next 7 days of storage at $4^{\circ}C$.
PDF KSCI

Effect of Electroacupuncture on Quality of Life of Patients with Urinary Incontinence (요실금(尿失禁) 환자의 삶의 질에 대한 전침치료 효과)

Ko, Young-Jin;Kim, Kyung-Tai;Kim, Eun-Jung;Woo, Hyun-Su;Kim, Chang-Hwan
- Journal of Acupuncture Research
- /
- v.23 no.1
- /
- pp.63-70
- /
- 2006
Objectives : This study was designed to evaluated the effect of electroacupuncture on Quality of life of patients with urinary incontinence Methods : Subjects were voluntarily recruited by newspapers and internet. Electroacupuncture was performed three times a week for 3 weeks. Acupuncture point for EA group was B32, Electrical stimulation frequency was 2Hz, duration 20 minutes, and intensity was up to pain threshold according to patients. The patients's symptoms were assessed before, after 3 weeks of treatment by QOL item of International Prostate Symptom Score(IPSS), Medical Outcomes Study(MOS) 36-Item Short-Form Health Survey(SF-36). Results : QOL score of IPSS were significantly improved after 3 weeks(p<0.05) compared to the pre-treatment. There were significant changes in Social functioning(SF), role-physical(RP), role emotional(RE), mental health(MH), bodily pain(BP) score of SF-36 after 3 weeks(p<0.05), but there were no significant changes in physical functioning(PF), vitality(VT), general health(GH) score of SF-36. Conclusion : This study suggests that electroacupuncture treatments can be applicable to improve symptoms in patients with urinary incontinence.
PDF

Proposed Assessment for Quality of Experience of Live IPTV in Home Environments

Jeong, Jongpil;Choi, Jae-Young
- International journal of advanced smart convergence
- /
- v.4 no.1
- /
- pp.18-30
- /
- 2015
As the speed of networks that subscribers can use has greatly increased, demand for high-quality broadcast content, such as from Internet Protocol Television (IPTV) and Video on Demand (VoD), is likewise increasing. Therefore, while broadcasters are increasing content and channels, they are striving to improve consumer quality of experience (QoE) to differentiate themselves from competitors, including by producing higher physical-quality content. Recently, subjective measurement methods have been internationally standardized as the most reliable approach for measuring and evaluating IPTV QoE. However, a majority of these methods are performed in experimental environments and are based on the extremely brief viewing period of approximately ten seconds using original reference videos. It is actually difficult to apply standard evaluation methods based on a ten-second viewing interval to assess real broadcast watching of IPTV or other services that involve a longer time (i.e., more than thirty minutes). In this paper, we therefore propose a method that accommodates actual viewing environments. Using the mean opinion score, we experimentally analyze the effects of evaluation interval changes under actual conditions in which IPTV service is provided. In addition, we propose improvements by applying the results into actual live broadcast IPTV service and by analyzing consumer service QoE.
https://doi.org/10.7236/IJASC.2015.4.1.18 인용 PDF KSCI

An end-to-end synthesis method for Korean text-to-speech systems (한국어 text-to-speech(TTS) 시스템을 위한 엔드투엔드 합성 방식 연구)

Choi, Yeunju;Jung, Youngmoon;Kim, Younggwan;Suh, Youngjoo;Kim, Hoirin
- Phonetics and Speech Sciences
- /
- v.10 no.1
- /
- pp.39-48
- /
- 2018
A typical statistical parametric speech synthesis (text-to-speech, TTS) system consists of separate modules, such as a text analysis module, an acoustic modeling module, and a speech synthesis module. This causes two problems: 1) expert knowledge of each module is required, and 2) errors generated in each module accumulate passing through each module. An end-to-end TTS system could avoid such problems by synthesizing voice signals directly from an input string. In this study, we implemented an end-to-end Korean TTS system using Google's Tacotron, which is an end-to-end TTS system based on a sequence-to-sequence model with attention mechanism. We used 4392 utterances spoken by a Korean female speaker, an amount that corresponds to 37% of the dataset Google used for training Tacotron. Our system obtained mean opinion score (MOS) 2.98 and degradation mean opinion score (DMOS) 3.25. We will discuss the factors which affected training of the system. Experiments demonstrate that the post-processing network needs to be designed considering output language and input characters and that according to the amount of training data, the maximum value of n for n-grams modeled by the encoder should be small enough.
https://doi.org/10.13064/KSSS.2018.10.1.039 인용 PDF KSCI

Noise Reduction Using the Standard Deviation of the Time-Frequency Bin and Modified Gain Function for Speech Enhancement in Stationary and Nonstationary Noisy Environments

Lee, Soo-Jeong;Kim, Soon-Hyob
- The Journal of the Acoustical Society of Korea
- /
- v.26 no.3E
- /
- pp.87-96
- /
- 2007
In this paper we propose a new noise reduction algorithm for stationary and nonstationary noisy environments. Our algorithm classifies the speech and noise signal contributions in time-frequency bins, and is not based on a spectral algorithm or a minimum statistics approach. It relies on calculating the ratio of the standard deviation of the noisy power spectrum in time-frequency bins to its normalized time-frequency average. We show that good quality can be achieved for enhancement speech signal by choosing appropriate values for ${\delta}_t\;and\;{\delta}_f$. The proposed method greatly reduces the noise while providing enhanced speech with lower residual noise and somewhat higher mean opinion score (MOS), background intrusiveness (BAK) and signal distortion (SIG) scores than conventional methods.
PDF KSCI

A Study on a Improvement of the Speech Quality by Spectrum Analysis with Variable Window in CELP Vocoder (가변 윈도우 스펙트럼 분석을 이용한 CELP 부호화기의 음질 향상에 관한 연구)

나덕수;민소연;배명진
- Proceedings of the IEEK Conference
- /
- 2000.06d
- /
- pp.106-109
- /
- 2000
There have been proposed two types of low bit rate vocoder upto now : One is MBE type using the spectrum modeling and another is CELP type using the hybrid coding method. CELP type vocoder has mainly studied between them. Specially, much of intensity is concentrated in CELP vocoder due to the emergence of Internet Phone and PCS in a domestic. In order to improve the speech quality in CELP vocoder, in this paper, we proposed a new spectrum analysis algorithm with variable window, In CELP vocoder, the spectrum of the synthesised speech signal is distorted because the fixed size windows is used for spectrum analysis. So we have measured the spectral leakage and in order to minimize the spectral leakage have adjusted the window size. Applying this method G.723.1 ACELP, we can get SD(Spectral Distortion) reduction 0.084(dB), residual energy reduction 6.3% and MOS(Mean Opinion Score) improvement 0.1.
PDF

Search Result 117, Processing Time 0.028 seconds

이메일무단수집거부

이용약관

제 1 장 총칙

제 2 장 이용계약의 체결

제 3 장 계약 당사자의 의무

제 4 장 서비스의 이용

제 5 장 계약 해지 및 이용 제한

제 6 장 손해배상 및 기타사항

Detail Search

Image Search (β)