• Title/Summary/Keyword: 음성적 거리 (phonetic distance)

Perceptual and Adaptive Quantization of Line Spectral Frequency Parameters (선 스펙트럼 주파수의 청각 적응 부호화)

  • 한우진;김은경;오영환
    • The Journal of the Acoustical Society of Korea / v.19 no.8 / pp.68-77 / 2000
  • Line spectral frequency (LSF) parameters are widely used in low bit-rate speech coding because they represent the short-time speech spectrum efficiently. Whereas most conventional quantization methods rely on a weighted Euclidean distance measure, this paper proposes a new distance measure for quantizing LSF parameters based on the masking properties of the human ear. The proposed method derives the perceptual distance measure from the definition of the noise-to-mask ratio (NMR), which corresponds closely to the distortion actually perceived by the human ear, and uses it to quantize LSF parameters. In addition, we propose an adaptive bit allocation scheme that reduces the average bit rate by allocating to the LSF parameters the minimum number of bits that still maintains perceptual transparency for a given speech frame. For performance evaluation, we report the ratio of perceptually transparent frames and the corresponding average bit rates for the conventional and proposed methods. By combining the proposed distance measure with the adaptive bit allocation scheme, the proposed system requires only 770 bps to obtain 95.5% perceptually transparent frames, while the conventional systems produce 89.9% even at 1800 bps.
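For orientation, the conventional baseline that the abstract contrasts with, a weighted Euclidean distance used for nearest-codeword search, can be sketched as follows. This is a minimal illustration, not the paper's method: the weighting rule (larger weights for closely spaced LSFs), the function names, and the toy codebook are all assumptions, and the proposed NMR-based measure is not reproduced here because the abstract does not define it.

```python
import math


def spacing_weights(lsf, lo=0.0, hi=math.pi):
    """Assumed weighting: emphasize LSFs that lie close to their neighbours.

    lsf must be strictly increasing and lie strictly between lo and hi.
    """
    padded = [lo] + list(lsf) + [hi]
    return [
        1.0 / (padded[i] - padded[i - 1]) + 1.0 / (padded[i + 1] - padded[i])
        for i in range(1, len(padded) - 1)
    ]


def weighted_euclidean(lsf, candidate, weights):
    """Sum of weighted squared differences between two LSF vectors."""
    return sum(w * (a - b) ** 2 for w, a, b in zip(weights, lsf, candidate))


def quantize(lsf, codebook):
    """Return the index of the codeword closest to lsf under the weighted distance."""
    weights = spacing_weights(lsf)
    return min(
        range(len(codebook)),
        key=lambda k: weighted_euclidean(lsf, codebook[k], weights),
    )
```

A perceptual measure such as the one the abstract proposes would replace `weighted_euclidean` while leaving the search loop in `quantize` unchanged.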

A Study on Voice Communication Quality Improvement of Intercom System for KUH (한국형 기동헬기 내부통화장치의 통화품질 향상에 관한 연구)

  • Kim, Young Mok;Chang, Joong Jin;Jun, Byung Kyu;Kim, Chang Young;Jeong, Jin Woong
    • Journal of the Korean Society for Aeronautical & Space Sciences / v.41 no.12 / pp.1002-1010 / 2013
  • The Intercom System (ICS) of the Korean Utility Helicopter (KUH) is essential equipment for pilots performing flight missions; together with the VHF-FM radio set and the U/VHF-AM radio set, it makes up the KUH communication system. It provides pilots and crew with internal communication, external communication, and audible alarms. It supports volume control and selection between two communication modes, normal and backup. This paper summarizes pilot comments from flight tests, classified by cause of occurrence, and the troubleshooting process for each comment. It also describes the design improvements derived from troubleshooting and presents the flight-test verification results.

Phoneme-Boundary-Detection and Phoneme Recognition Research using Neural Network (음소경계검출과 신경망을 이용한 음소인식 연구)

  • 임유두;강민구;최영호
    • Proceedings of the Korean Institute of Information and Communication Sciences Conference / 1999.11a / pp.224-229 / 1999
  • In speech recognition, research falls into two categories: developing phoneme-level recognition systems and improving the efficiency of word-level recognition systems. A reasonable phoneme-level recognition system should detect phoneme boundaries appropriately and thereby improve recognition ability. Traditional LPC methods detect phoneme boundaries using the Itakura-Saito method, which measures the distance between the LPC coefficients of standard phoneme data and those of the target speech frame. MFCC methods, which treat spectral transitions as phoneme boundaries, lack adaptability. In this paper, we present a new speech recognition system that uses the autocorrelation method for phoneme boundary detection and a multi-layer feed-forward neural network for recognition. The proposed system outperforms the traditional methods in adaptability, and a further advantage is that the feature-extraction part is independent of the recognition process. The results show that a frame-unit phoneme recognition system is feasible to implement.

Design of QPSK Ultrasonic Transceiver For Underwater Communication (수중 통신을 위한 QPSK 초음파 송수신기의 설계)

  • Cho Nai-Hyun;Kim Duk-Yung;Kim Yong-Deuk;Chung Yun-Mo
    • Journal of the Institute of Electronics Engineers of Korea SC / v.43 no.3 s.309 / pp.51-59 / 2006
  • In this paper, we propose an ultrasonic transceiver system based on QPSK modulation for underwater communication. The transmitter sends a still image at a level of 187 dB re $1\,\mu\mathrm{Pa}/\mathrm{V}$ at 1 m by driving an ultrasonic sensor through a power amplifier. The receiver digitizes the signal at a 100 kHz sampling frequency, then demodulates and decodes the image sent from the transmitter over the underwater channel. We show that the image recovered at the receiver is almost identical to the original. The maximum detection distance of the proposed system is approximately 1.17 km. To cope with transmission loss, this paper proposes, implements, and analyzes the important parameters of the sensors and circuits used in the system. Most underwater communication work has focused on audio signal transmission, whereas this paper presents an efficient underwater communication system for still image transmission.
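To make the modulation step concrete, the sketch below maps bit pairs to QPSK symbols and recovers them by sign decisions. It is an illustration only: the Gray-coded bit-to-symbol convention, the baseband (carrier-free) representation, and the function names are assumptions, since the abstract does not describe the transceiver's actual mapping, carrier frequency, or pulse shaping.

```python
import math


def qpsk_modulate(bits):
    """Map each pair of bits to one unit-energy complex symbol.

    Assumed convention: bit 0 -> positive axis, bit 1 -> negative axis,
    first bit on the real (I) axis, second bit on the imaginary (Q) axis.
    """
    if len(bits) % 2:
        raise ValueError("QPSK needs an even number of bits")
    scale = 1 / math.sqrt(2)
    return [
        complex((1 - 2 * bits[i]) * scale, (1 - 2 * bits[i + 1]) * scale)
        for i in range(0, len(bits), 2)
    ]


def qpsk_demodulate(symbols):
    """Recover bits by deciding the sign of each symbol's I and Q parts."""
    bits = []
    for s in symbols:
        bits.append(0 if s.real >= 0 else 1)
        bits.append(0 if s.imag >= 0 else 1)
    return bits
```

Because each decision depends only on the sign of one axis, a received symbol tolerates noise as long as it stays in the transmitted quadrant, which is one reason QPSK is a common choice for lossy channels.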

Study on U-City Infra Based Realtime Children Anti-abduction System (U-City Infra 기반 실시간 어린이 유괴방지 시스템 연구)

  • Jo, Byung-Wan;Jun, Woo-Hyun;Lee, Kay-Sam;Park, Jung-Hoon;Yoon, Kwang-Won;Lee, Kyung-Soo
    • Proceedings of the Computational Structural Engineering Institute Conference / 2009.04a / pp.467-470 / 2009
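  • This paper presents a real-time anti-abduction system built on ubiquitous-city infrastructure. With rapid social development, violent crime is increasing, and particularly heinous crimes such as child abduction are rising every year. To prevent such abductions, positioning technologies based on GPS (Global Positioning System) and on mobile base stations are currently in use. However, positioning alone makes it difficult to assess the situation accurately when danger arises; given statistics that 44% of abducted children die within 1 hour and 74% within 3 hours, existing systems are limited in their ability to protect children's lives. In this study, we built ubiquitous-city infrastructure and applied a lower-power, accurate positioning system using the ISM-band CSS (Chirp Spread Spectrum) scheme of IEEE 802.15.4a, which can measure distance using RF alone in a WPAN (Wireless Personal Area Network) environment. The location information obtained through CSS is fused with intelligent CCTV so that the CCTV automatically focuses on the terminal's location. So that the integrated city operations center can assess the situation accurately and dispatch quickly, the system continuously reports the terminal's location to agents' PDAs and mobile phones, and uses nearby media-board displays and voice warnings to enable an appropriate police response and help from bystanders.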

VRmeeting : Distributed Virtual Environment Supporting Real Time Video Chatting on WWW (VRmeeting: 웹상에서 실시간 화상 대화 지원 분산 가상 환경)

  • Jung, Heon-Man;Tak, Jin-Hyun;Lee, Sei-Hoon;Wang, Chang-Jong
    • Proceedings of the Korea Information Processing Society Conference / 2000.10a / pp.715-718 / 2000
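  • Multi-user distributed virtual environment systems support text-based chat and TTS for communication among participants, and add animation functions to avatars, the participants' proxies, so that they can express gestures, facial expressions, and emotions in support of non-verbal communication. However, avatar animation is limited in its ability to convey participants' intentions and emotions, so an environment that supports free meeting and conversation is needed. Including participants' faces and voices in the virtual space addresses this problem by enabling clearer and more realistic communication and emotional expression. In this paper, we design a distributed virtual environment system with real-time video chat that maximizes participants' communication and emotional expression and provides free meeting and conversation in a multi-user virtual environment formed over a computer network. The designed system has an architecture that can optimize system load by dynamically controlling the volume of events according to the distance between participants and their gaze direction.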

Smart Portable Navigation System Development and Implementation of 1:N service for Visually impaired person (Smart Portable Navigation System 개발 및 1:N 서비스 구현)

  • Kim, Jae-Kyung;Seo, Jae-Gil;Kim, Young-Kil
    • Journal of the Korea Institute of Information and Communication Engineering / v.16 no.11 / pp.2424-2430 / 2012
  • Current navigation systems for visually impaired people have a short, limited communication range and cannot receive enough information from the visually impaired person to provide direct assistance. In addition, because paths are dangerous and incomplete for visually impaired people, moving with a white cane remains inconvenient and dangerous. To solve this problem, we implement communication that sends and receives video, voice, and location information between the visually impaired person's Smart Portable Navigation System and the assistant's PC.

Design of Network Architecture in Underground Structure Field Information Based on VI-GNSS (VI-GNSS 지하구조물 현장정보 네트워크 아키텍쳐 설계)

  • Jeon, Heung-Soo;Jang, Yong-Gu;Oh, Chang-Kyun;Kim, Min-Koan
    • Journal of the Korean Association of Geographic Information Studies / v.18 no.1 / pp.64-73 / 2015
  • Recently, integrating IT with construction technology has been in demand to improve field management and to prevent and respond promptly to safety accidents at construction sites. A construction-site support system is also needed to secure worker safety, smooth work instruction, construction efficiency, and related goals. In this research, data standardization and a network architecture for data and sound information were designed for data transmission and management between systems, in order to construct the USFSS based on integrated VI-GNSS technology. In the data transmission stability tests of each system constructed this way, around 98% stability was obtained between the worker and transfer-vehicle systems inside the underground structure and the field server system, and around 100% between the field server system and the control system. In the sound transmission stability test, around 99% reliability was obtained at a reference distance of 1 km for sound transmission over a wireless FRS system from the underground structure construction site to a field office near the site.

Augmented Reality based Museum Guidance System Selective Viewing (증강현실을 이용한 선택적 가이드 시스템 -관람자의 관심에 따라 박물관 관람을 안내 하는 가이드 시스템)

  • Park, Joon-Suk;Lee, Dong-Hyun;Park, Jun
    • 한국HCI학회:학술대회논문집 / 2008.02a / pp.45-48 / 2008
  • Using these systems, additional information on the paintings and exhibits may be provided in the form of text, image, speech, and video. However, at museums and exhibitions, many tourists are often interested in exhibits of some particular style, authors, or coteries. The proposed Augmented Reality based guidance system guides users to exhibits of interest for selective viewing. Users are informed of the location of the next exhibit of interest and given additional multimedia information on it. Such information is shown in the Augmented Reality view on the user's display device. The proposed system is composed of an Ultra-Mobile PC (UMPC), an inertial tracker, and a camera. At the start, the user selects his or her exhibit preferences from the menu, and the system then begins guiding by showing the relative orientation, the distance, and a visual cue for finding the next exhibit. When the user finds the matching visual cue and places it within a matching box on the display screen, the system provides multimedia information on the exhibit. According to a preliminary user test, the proposed system is convenient and useful for navigating large-scale exhibitions.

Front-End Processing for Speech Recognition in the Telephone Network (전화망에서의 음성인식을 위한 전처리 연구)

  • Jun, Won-Suk;Shin, Won-Ho;Yang, Tae-Young;Kim, Weon-Goo;Youn, Dae-Hee
    • The Journal of the Acoustical Society of Korea / v.16 no.4 / pp.57-63 / 1997
  • In this paper, we study efficient feature vector extraction and front-end processing to improve the performance of a speech recognition system, using the KT (Korea Telecommunication) database collected through various telephone channels. First, we compare the recognition performance of feature vectors known to be robust to noise and environmental variation, and verify the performance gain obtained with weighted cepstral distance measures. The experimental results show that the recognition rate increases with both PLP (Perceptual Linear Prediction) and MFCC (Mel Frequency Cepstral Coefficient) features compared with the LPC cepstrum used in the KT recognition system. Among cepstral distance measures, weighted functions such as RPS (Root Power Sums) and BPL (Band-Pass Lifter) improve recognition. Spectral subtraction decreases the recognition rate because of the distortion it introduces. However, RASTA (RelAtive SpecTrAl) processing, CMS (Cepstral Mean Subtraction), and SBR (Signal Bias Removal) enhance recognition performance. In particular, the CMS method is simple yet yields a large improvement in recognition. Finally, we compare the performance of modified CMS methods for real-time implementation and suggest an improved method that prevents performance degradation.
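Since the abstract singles out CMS as simple yet effective, a minimal sketch of the basic (non-real-time) form may help: the per-coefficient mean over an utterance is subtracted from every frame. The function name and the list-of-lists frame layout are assumptions made for illustration; the modified real-time variants compared in the paper are not described in the abstract and are not reproduced here.

```python
def cepstral_mean_subtraction(frames):
    """Subtract the utterance-level mean of each cepstral coefficient.

    frames: list of equal-length lists, one cepstral vector per frame.
    Returns a new list of mean-normalized cepstral vectors.
    """
    if not frames:
        return []
    n_frames = len(frames)
    dim = len(frames[0])
    # Mean of each coefficient across all frames of the utterance.
    mean = [sum(frame[k] for frame in frames) / n_frames for k in range(dim)]
    return [[frame[k] - mean[k] for k in range(dim)] for frame in frames]
```

A fixed linear channel, such as a particular telephone line, adds an approximately constant offset to every cepstral vector, so two recordings that differ only by such an offset normalize to the same features. The full-utterance mean requires the whole utterance in advance, which is presumably why real-time variants need a modified estimate.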
