Search | Korea Science

Voice Activity Detection in Noisy Environment using Speech Energy Maximization and Silence Feature Normalization (음성 에너지 최대화와 묵음 특징 정규화를 이용한 잡음 환경에 강인한 음성 검출)

Ahn, Chan-Shik;Choi, Ki-Ho
- Journal of Digital Convergence
- /
- v.11 no.6
- /
- pp.169-174
- /
- 2013
Speech recognition, the problem of performance degradation is the difference between the model training and recognition environments. Silence features normalized using the method as a way to reduce the inconsistency of such an environment. Silence features normalized way of existing in the low signal-to-noise ratio. Increase the energy level of the silence interval for voice and non-voice classification accuracy due to the falling. There is a problem in the recognition performance is degraded. This paper proposed a robust speech detection method in noisy environments using a silence feature normalization and voice energy maximize. In the high signal-to-noise ratio for the proposed method was used to maximize the characteristics receive less characterized the effects of noise by the voice energy. Cepstral feature distribution of voice / non-voice characteristics in the low signal-to-noise ratio and improves the recognition performance. Result of the recognition experiment, recognition performance improved compared to the conventional method.
https://doi.org/10.14400/JDPM.2013.11.6.169 인용 PDF

A Phase-related Feature Extraction Method for Robust Speaker Verification (열악한 환경에 강인한 화자인증을 위한 위상 기반 특징 추출 기법)

Kwon, Chul-Hong
- Journal of the Korea Institute of Information and Communication Engineering
- /
- v.14 no.3
- /
- pp.613-620
- /
- 2010
Additive noise and channel distortion strongly degrade the performance of speaker verification systems, as it introduces distortion of the features of speech. This distortion causes a mismatch between the training and recognition conditions such that acoustic models trained with clean speech do not model noisy and channel distorted speech accurately. This paper presents a phase-related feature extraction method in order to improve the robustness of the speaker verification systems. The instantaneous frequency is computed from the phase of speech signals and features from the histogram of the instantaneous frequency are obtained. Experimental results show that the proposed technique offers significant improvements over the standard techniques in both clean and adverse testing environments.
https://doi.org/10.6109/jkiice.2010.14.3.613 인용 PDF KSCI

Robust Detection Technique for Abandoned Objects to Overcome Visual Occlusion (시각적 가려짐을 극복하는 강인한 유기물 탐지 기법)

Kim, Won
- The Journal of the Institute of Internet, Broadcasting and Communication
- /
- v.10 no.6
- /
- pp.23-29
- /
- 2010
Nowadays it is required to design intelligent visual surveillance systems which automatically detect abandoned objects in public places to strengthen the social safety. Already recognized abandoned objects can be occluded partially or fully by surrounding people in public places after the first recognition. To improve an essential recognition performance index PAT, the system should overcome the occlusion problems. In this research, a design scheme is newly proposed to construct the robust detection system which is comprised of multiple stages considering the occlusion problem. To show the feasibilities of the proposed system, the evaluation was tried for the prepared image streams including 6 various situations and the experimental results show 96% and 75% in PAT performance for intrusion and abandoning events, respectively. Finally in spite of full occlusions by multiple persons, the proposed system shows the capability to continuously recognize the abandoned object after complex occlusions disappear.
PDF KSCI

A frame structure of modified ATSC system for terrestial 3D HDTV broadcasting (지상파 3D HDTV 방송을 위한 수정된 ATSC 전송 시스템의 프레임 구조에 대한 연구)

Oh, Jong-Gyu;Kim, Joon-Tae
- Proceedings of the Korean Society of Broadcast Engineers Conference
- /
- 2010.11a
- /
- pp.257-259
- /
- 2010
본 논문에서는 지상파 3D HDTV 방송 서비스를 제공하기 위해 수정된 ATSC (Advanced Television Systems Committee) 전송 시스템 [2]을 위한 시변다중경로채널에 강인한 프레임 구조를 제안하고 성능을 측정하였다. 수정된 ATSC 전송 시스템 [2]은 기존 ATSC 전송 시스템[1]의 채널 부호화부를 수정하고, 변조 성상도를 증가 시키면서 적정한 수준의 TOV (Threshold of Visibility)에서의 전송 용량 증대 가능성을 확인하였다. 이를 토대로, 증가된 전송 데이터 전송률에 대한 순수 데이터 전송률을 최대한 보장하면서 시변다중경로채널에서 효율적으로 채널을 추정하고 복구하기 위해, ISI (Inter Symbol Interference)를 방지하기 위한 프레임 헤더의 보호구간에 알려진 PN (Pseudorandom Noise) 심벌을 삽입하였다. PN 심벌을 보호 구간에 이용할 경우 시간 영역에서 채널 임펄스 응답 (CIR: Channel Impulse Response)을 추정하여, 주파수 영역에서의 채널 보상을 가능케 하여 정확한 채널 추정 및 보상을 수행할 수 있다. 또한 수신기의 속도에 따른 다양한 최대 도플러 주파수가 존재하는 채널에 강인한 프레임 구조들을 제안하였다. 컴퓨터 시뮬레이션을 통해 수정된 ATSC 전송 시스템에 제안된 프레임 구조를 적용하여 TU (Typical Urban)-6 채널에서의 SER (Symbol Error Rate) 성능을 측정하였다.
PDF

A Design of Optimal Satellite-Tracking Control System with Two-Degree-of Freedom for Communication Antenna Equipments (통신안테나 설비의 2자유도 체상 위상 추적 제어 시스템의 설계)

Hwang, Chang-Sun;Hwang, Hyun-Joon;Kim, Dong-Wan;Kim, Mun-Soo;Jeong, Ho-Seong
- The Proceedings of the Korean Institute of Illuminating and Electrical Installation Engineers
- /
- v.11 no.3
- /
- pp.97-105
- /
- 1997
The aim of this paper is to introduce a design technique of the Two-Degree-of-Freedom(TDF) satellite-tracking control system which has not only the robust stability for a unstructured uncertainty but also the robust performance for a structured uncertainty. This TDF system which can design the feedforward controller KI and the feedback one K independently is designed by , $\mu$-synthesis. The effectiveness of this TDF system is verified and compared with the One-Degree-of -Freedom(ODF) satellitetracking control system by computer simulation.
PDF

A Method of Constructing Robust Descriptors Using Scale Space Derivatives (스케일 공간 도함수를 이용한 강인한 기술자 생성 기법)

Park, Jongseung;Park, Unsang
- Journal of KIISE
- /
- v.42 no.6
- /
- pp.764-768
- /
- 2015
Requirement of effective image handling methods such as image retrieval has been increasing with the rising production and consumption of multimedia data. In this paper, a method of constructing more effective descriptor is proposed for robust keypoint based image retrieval. The proposed method uses information embedded in the first order and second order derivative images, in addition to the scale space image, for the descriptor construction. The performance of multi-image descriptor is evaluated in terms of the similarities in keypoints with a public domain image database that contains various image transformations. The proposed descriptor shows significant improvement in keypoint matching with minor increase of the length.
https://doi.org/10.5626/JOK.2015.42.6.764 인용 KSCI

Audio Fingerprint Binarization by Minimizing Hinge-Loss Function (경첩 손실 함수 최소화를 통한 오디오 핑거프린트 이진화)

Seo, Jin Soo
- The Journal of the Acoustical Society of Korea
- /
- v.32 no.5
- /
- pp.415-422
- /
- 2013
This paper proposes a robust binary audio fingerprinting method by minimizing hinge-loss function. In the proposed method, the type of fingerprints is binary, which is conducive in reducing the size of fingerprint DB. In general, the binarization of features for fingerprinting deteriorates the performance of fingerprinting system, such as robustness and discriminability. Thus it is necessary to minimize such performance loss. Since the similarity between two audio clips is represented by a hinge-like function, we propose a method to derive a binary fingerprinting by minimizing a hinge-loss function. The derived hinge-loss function is minimized by using the minimal loss hashing. Experiments over thousands of songs demonstrate that the identification performance of binary fingerprinting can be improved by minimizing the proposed hinge loss function.
https://doi.org/10.7776/ASK.2013.32.5.415 인용 PDF KSCI

An Optimized Multi-Bit Digital FSK Receiver Robust to CFO for Long-Range WPAN Applications (광역 WPAN 응용을 위한 주파수 오차에 강인한 최적 다중비트 디지털 FSK 수신기)

Oh, Mi-Kyung;Choi, Sangsung
- The Journal of Korean Institute of Communications and Information Sciences
- /
- v.39A no.1
- /
- pp.43-49
- /
- 2014
This paper proposes an optimized multi-bit digital FSK receiver robust to large carrier frequency offset (CFO) toward recently emerging long-range WPAN standards. Due to a short preamble length and strict BER requirements, we design a simple multi-bit digital demodulator combined with CFO estimator which guarantees the target BER performance. Simulation results verify that the proposed FSK receiver achieves CFO-free BER performance with the short preamble and satisfies the BER requirement by the recent WPAN applications.
https://doi.org/10.7840/kics.2014.39A.1.43 인용 PDF KSCI

Speech Estimators Based on Generalized Gamma Distribution and Spectral Gain Floor Applied to an Automatic Speech Recognition (잡음에 강인한 음성인식을 위한 Generalized Gamma 분포기반과 Spectral Gain Floor를 결합한 음성향상기법)

Kim, Hyoung-Gook;Shin, Dong;Lee, Jin-Ho
- The Journal of The Korea Institute of Intelligent Transport Systems
- /
- v.8 no.3
- /
- pp.64-70
- /
- 2009
This paper presents a speech enhancement technique based on generalized Gamma distribution in order to obtain robust speech recognition performance. For robust speech enhancement, the noise estimation based on a spectral noise floor controled recursive averaging spectral values is applied to speech estimation under the generalized Gamma distribution and spectral gain floor. The proposed speech enhancement technique is based on spectral component, spectral amplitude, and log spectral amplitude. The performance of three different methods is measured by recognition accuracy of automatic speech recognition (ASR).
PDF

Auto tonal detection method robust to interference for passive sonar (간섭 소음에 강인한 수동 소나 자동 토널 탐지 기법)

Kang, Tae-Su;Kim, Dong Gwan;Choi, Chang-Ho
- The Journal of the Acoustical Society of Korea
- /
- v.36 no.4
- /
- pp.229-237
- /
- 2017
In this paper we propose an auto tonal detection method which exploits short term stationary when targets located in a detection beam area and then additional methods are proposed in order to reduce the computational complexity of the proposed method. The proposed method is adaptive to input signals and robust against interference caused by multiple targets because it compares an expected value of input signals with a threshold value which are estimated from a single beam while signals are keep stationary. The performances of the proposed methods are evaluated using by simulated data and acquired data from real ocean. The proposed method has shown better performance than conventional CFAR (Constant False Alarm Rate) methods.
https://doi.org/10.7776/ASK.2017.36.4.229 인용 PDF KSCI

Search Result 1,408, Processing Time 0.032 seconds

이메일무단수집거부

이용약관

제 1 장 총칙

제 2 장 이용계약의 체결

제 3 장 계약 당사자의 의무

제 4 장 서비스의 이용

제 5 장 계약 해지 및 이용 제한

제 6 장 손해배상 및 기타사항

Detail Search

Image Search (β)