• Title/Summary/Keyword: 음성다중연구 (voice multiplex research)

Search Result 149, Processing Time 0.025 seconds

ETRI New Technology: Multifunction ISDN PC Card Technology (ETRI신기술-다기능 ISDN PC 카드 기술)

  • Electronics and Telecommunications Research Institute
    • Electronics and Telecommunications Trends
    • /
    • v.14 no.4 s.58
    • /
    • pp.129-130
    • /
    • 1999
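  • A PC card for N-ISDN access, developed to popularize and promote N-ISDN, that supports both circuit and packet communication. It acts as an NT (Network Terminator) and a TA (Terminal Adapter) and enables multiple simultaneous communications from a PC. It lets users access various ISDN services during a voice call and connect to the Internet at 64 kbps. It also supports narrowband ISDN communication functions, such as data communication over a packet network reached at 64 kbps through ISDN without a leased line.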


Incorporation of IMM-based Feature Compensation and Uncertainty Decoding (IMM 기반 특징 보상 기법과 불확실성 디코딩의 결합)

  • Kang, Shin-Jae;Han, Chang-Woo;Kwon, Ki-Soo;Kim, Nam-Soo
    • The Journal of Korean Institute of Communications and Information Sciences
    • /
    • v.37 no.6C
    • /
    • pp.492-496
    • /
    • 2012
  • This paper presents a decoding technique for speech recognition that uses uncertainty information from a feature compensation method to improve recognition performance in low-SNR conditions. Traditional feature compensation algorithms have difficulty estimating clean feature parameters in adverse environments because they focus on point estimation of the desired features. Point estimation degrades recognition performance when incorrectly estimated features enter the decoder. In this paper, we pass the uncertainty information from a well-known feature compensation method, IMM, to the recognition engine. The applied technique shows better performance on the Aurora-2 DB.
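
The abstract does not give the decoding rule, so the following is only a minimal sketch of one common form of uncertainty decoding: if the error of the compensated feature is assumed Gaussian, marginalizing it adds the estimate's variance to the variance of each acoustic-model Gaussian. The function names and the two-dimensional example values are hypothetical, and the paper's actual formulation may differ.

```python
import numpy as np

def log_likelihood_point(x_hat, mean, var):
    """Diagonal-Gaussian log-likelihood that treats the estimate as exact."""
    return -0.5 * np.sum(np.log(2 * np.pi * var) + (x_hat - mean) ** 2 / var)

def log_likelihood_uncertain(x_hat, est_var, mean, var):
    """Same likelihood with the estimate's variance added to the model variance."""
    return log_likelihood_point(x_hat, mean, var + est_var)

# Hypothetical example: the first dimension is badly estimated and
# the front end reports a large variance for it.
mean = np.array([0.0, 0.0])
var = np.array([1.0, 1.0])
x_hat = np.array([3.0, 0.0])
est_var = np.array([8.0, 0.0])

point_score = log_likelihood_point(x_hat, mean, var)
uncertain_score = log_likelihood_uncertain(x_hat, est_var, mean, var)
```

In this sketch the unreliable dimension is penalized less under the inflated variance, so a single bad point estimate has less influence on the decoder's score.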

Performance Evaluation of the Dynamic Slot Assignment Protocol using Collision Resolution Algorithm (충돌해결기법을 적용한 동적슬롯할당 프로토콜의 성능 개선)

  • 강경훈;임석구;김수중
    • Journal of the Korean Institute of Telematics and Electronics S
    • /
    • v.36S no.5
    • /
    • pp.1-11
    • /
    • 1999
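  • As wired networks expand to ISDN/B-ISDN and services evolve from voice toward integrated multimedia such as data and video, subscriber service demands in wireless networks are expected to grow to a level comparable with wired networks. Wireless ATM networks are expected to be introduced first in private networks, expand to public networks based on microcells, and ultimately evolve into a transparent integrated wired/wireless network. A wireless ATM network needs a multiple access protocol that can efficiently provide diverse application services over limited radio resources. This paper first introduces the single and multiple dynamic slot assignment (DSA) schemes being studied in MBS and analyzes their problems through simulations with various source traffic. It then proposes two collision resolution algorithms that address the shortcomings of the DSA protocol and compares their performance for homogeneous and heterogeneous traffic. With the proposed algorithms, the overall delay performance of the system improved markedly as the number of connected virtual channels increased.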


Comparative Analysis of Written Language and Colloquial Language for Information Communication of Multi-Modal Interface Environment (다중 인터페이스 환경에서의 문자언어와 음성언어의 차이에 관한 비교 연구)

  • Choi, In-Hwan;Lee, Kun-Pyo
    • Archives of design research
    • /
    • v.19 no.2 s.64
    • /
    • pp.91-98
    • /
    • 2006
  • Product convergence and complex application environments raise the need for multi-modal interfaces that let users interact with products through various human senses. Vision has been used far more than any other sense for traditional, general information gathering, but in a future built on digital network technology, practical use of the other senses will be desired for more convenient and rational use of information appliances. Hearing, whose practical potential alongside vision is higher than ever, is expected to be used more broadly and in more varied ways. Against this background, this study examined the characteristics of written and colloquial language and compared male and female reactions to each. To this end, a literature review of the components of the language system was performed. The particular characteristics of vision and hearing were then reviewed, and an appropriate experiment was planned and carried out. The experimental results were examined with an objective analysis method. The main results are as follows: first, the reaction time for written language is shorter than for colloquial language; second, there is a partial difference between male and female reactions to the two stimuli; third, there is no selection bias between sight and hearing. Building on this study, continued development of broad and diverse studies of the various senses is needed.


Speech Recognition Model Based on CNN using Spectrogram (스펙트로그램을 이용한 CNN 음성인식 모델)

  • Won-Seog Jeong;Haeng-Woo Lee
    • The Journal of the Korea institute of electronic communication sciences
    • /
    • v.19 no.4
    • /
    • pp.685-692
    • /
    • 2024
  • In this paper, we propose a new CNN model to improve the recognition performance of voice commands. The method applies a short-time Fourier transform (STFT) to the input signal to obtain a spectrogram image, then improves command recognition through supervised learning with a CNN model. The input signal is Fourier transformed over each short time section to produce the spectrogram image, and multi-class classification is learned with a CNN deep learning model. Converting the time-domain voice signal to the frequency domain expresses its characteristics well, and training on the resulting spectrogram images classifies commands effectively. To verify the performance of the proposed speech recognition system, a simulation program was written with the TensorFlow and Keras libraries and a simulation experiment was performed. The experiment confirmed that the proposed deep learning algorithm achieves an accuracy of 92.5%.
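
As a rough illustration of the STFT step described above, the sketch below computes a magnitude spectrogram with NumPy only. The frame length, hop size, Hann window, and 1 kHz test tone are assumptions chosen for the example; the abstract does not state the paper's parameters, and the CNN stage is not shown.

```python
import numpy as np

def spectrogram(signal, frame_len=256, hop=128):
    """Magnitude spectrogram: rows are frequency bins, columns are time frames."""
    window = np.hanning(frame_len)
    n_frames = 1 + (len(signal) - frame_len) // hop
    # Slice the signal into overlapping, windowed frames.
    frames = np.stack([signal[i * hop : i * hop + frame_len] * window
                       for i in range(n_frames)])
    # One real FFT per frame, then transpose to (frequency, time).
    return np.abs(np.fft.rfft(frames, axis=1)).T

fs = 8000                                  # assumed sampling rate in Hz
t = np.arange(fs) / fs                     # one second of samples
tone = np.sin(2 * np.pi * 1000 * t)        # 1 kHz test tone
spec = spectrogram(tone)                   # shape (129, 61)
peak_bin = int(np.argmax(spec.mean(axis=1)))  # bin 32 = 1000 Hz / (8000 / 256)
```

A two-dimensional array like `spec` is what would be fed to an image-style CNN classifier, typically after log scaling and normalization.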

Enhancement of Authentication Performance based on Multimodal Biometrics for Android Platform (안드로이드 환경의 다중생체인식 기술을 응용한 인증 성능 개선 연구)

  • Choi, Sungpil;Jeong, Kanghun;Moon, Hyeonjoon
    • Journal of Korea Multimedia Society
    • /
    • v.16 no.3
    • /
    • pp.302-308
    • /
    • 2013
  • In this research, we explore a personal authentication system based on multimodal biometrics for mobile computing environments. We selected face and speaker recognition to implement the multimodal biometrics system. For face recognition, we detect the face with the Modified Census Transform (MCT). The detected face is pre-processed by an eye detection module based on the k-means algorithm, and the face is then recognized with the Principal Component Analysis (PCA) algorithm. For speaker recognition, we extract features using voice end-point detection and Mel Frequency Cepstral Coefficients (MFCC), then verify the speaker with the Dynamic Time Warping (DTW) algorithm. The proposed multimodal system improves the verification rate by combining the two biometrics described above. We implemented the system on Android using a Galaxy S hoppin. The proposed system achieves a reduced false acceptance ratio (FAR) of 1.8%, an improvement over the single-biometric systems using the face and the voice (4.6% and 6.7%, respectively).
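
The DTW matching step can be sketched as follows. This is the textbook dynamic-programming recurrence with a Euclidean frame distance, not necessarily the variant used in the paper, and the one-dimensional "feature" sequences are hypothetical stand-ins for MFCC frames.

```python
import numpy as np

def dtw_distance(a, b):
    """Accumulated DTW cost between two feature sequences (frames x dims)."""
    n, m = len(a), len(b)
    cost = np.full((n + 1, m + 1), np.inf)
    cost[0, 0] = 0.0
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            d = np.linalg.norm(a[i - 1] - b[j - 1])   # local frame distance
            # Extend the cheapest of the three allowed predecessor paths.
            cost[i, j] = d + min(cost[i - 1, j], cost[i, j - 1], cost[i - 1, j - 1])
    return cost[n, m]

# Hypothetical sequences: `slow` is a time-stretched copy of `template`.
template = np.array([[0.0], [1.0], [2.0], [1.0], [0.0]])
slow = np.array([[0.0], [0.0], [1.0], [1.0], [2.0], [2.0], [1.0], [0.0]])
other = np.array([[2.0], [2.0], [2.0], [2.0], [2.0]])

same_speaker_cost = dtw_distance(template, slow)    # 0.0
impostor_cost = dtw_distance(template, other)       # 6.0
```

A verifier would accept the claimed identity when the cost falls below a threshold; raising or lowering that threshold trades false acceptances against false rejections.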

Multiplier Using CRT and Overlapped Multiple-bit Scanning Method (CRT와 중첩다중비트 주사기법을 접목한 승산기)

  • 김우완;장상동
    • Journal of KIISE:Computer Systems and Theory
    • /
    • v.30 no.12
    • /
    • pp.749-755
    • /
    • 2003
  • Digital signal processing hardware based on the residue number system (RNS) is currently considered an important approach to high-speed, low-cost hardware realization. This research designs and implements a method for converting from a specific residue number system with moduli of the form $(2^k-1, 2^k, 2^k+1)$ to a weighted number system. It then simulates the implementation using an overlapped multiple-bit scanning method in the CRT conversion process. The simulation shows that the CRT method adopted in this research performs arithmetic operations faster than traditional approaches, owing to the advantages of parallel processing and carry-free arithmetic.
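
To make the conversion concrete, here is a small sketch of textbook Chinese Remainder Theorem reconstruction for the moduli set $(2^k-1, 2^k, 2^k+1)$. It illustrates only the arithmetic, not the paper's overlapped multiple-bit scanning hardware; the function names and the choice $k=4$ are assumptions for the example, and `pow(x, -1, m)` requires Python 3.8 or later.

```python
def moduli_set(k):
    """Pairwise coprime moduli (2^k - 1, 2^k, 2^k + 1)."""
    return (2**k - 1, 2**k, 2**k + 1)

def to_residues(x, k):
    """Weighted (ordinary) integer -> residue representation."""
    return tuple(x % m for m in moduli_set(k))

def crt_to_integer(residues, k):
    """Residue representation -> weighted integer via the CRT."""
    moduli = moduli_set(k)
    M = moduli[0] * moduli[1] * moduli[2]      # dynamic range
    total = 0
    for r, m in zip(residues, moduli):
        Mi = M // m
        total += r * Mi * pow(Mi, -1, m)       # pow(..., -1, m): modular inverse
    return total % M

k = 4                                          # moduli (15, 16, 17), range 4080
a, b = 37, 91
ra, rb = to_residues(a, k), to_residues(b, k)
# Multiplication is carry-free: each residue channel works independently.
product_residues = tuple((x * y) % m for x, y, m in zip(ra, rb, moduli_set(k)))
product = crt_to_integer(product_residues, k)  # 3367 == 37 * 91
```

The per-channel multiplication is what gives RNS its parallelism; the CRT step is the comparatively expensive conversion that the paper aims to speed up.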

A Study on the Improvement of Normalized Channel Equalization for the Asynchronous DS-CDMA System (비동기 DS-CDMA 시스템에서 정규화된 채널 등화 개선에 관한 연구)

  • 박노진;강철호
    • The Journal of Korean Institute of Communications and Information Sciences
    • /
    • v.26 no.6B
    • /
    • pp.736-745
    • /
    • 2001
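  • Next-generation mobile communication systems require reliable transmission of high-speed multimedia data and must provide reliable voice, data, and video services in a variety of radio environments. However, the multiple access technology that supports broadband wireless access generates inter-symbol interference (ISI) or multiple access interference (MAI) signals that degrade performance in DS-CDMA (Direct Sequence Code Division Multiple Access) systems. Adaptive blind equalization is used to mitigate this interference, but the most widely used adaptive blind equalizer, the Constant Modulus Algorithm (CMA), exhibits ill-convergence when used without proper initialization. This paper proposes a new blind equalization scheme (Modified NCMA) based on the existing NCMA algorithm to improve channel efficiency, and evaluates it by computer simulation and performance analysis in a multi-user asynchronous DS-CDMA environment. For spreading gains of 31 and 127, the squared error (SE) improvement of the proposed equalizer is about 17 dB with 10 users in the cell, and about 20 dB and 15 dB when the number of users is increased to 15 and 25, respectively, for an overall average squared-error improvement of about 17.3 dB.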
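
The abstract does not specify the NCMA or Modified NCMA update, so the sketch below shows only the plain Constant Modulus Algorithm that those variants build on, with the center-spike initialization commonly used to avoid ill-convergence. The real-valued BPSK symbols, two-tap channel, tap count, and step size are all assumptions made for the example.

```python
import numpy as np

def cma_equalize(x, n_taps=7, mu=0.01, radius=1.0):
    """Adapt real-valued equalizer taps with the constant modulus algorithm."""
    w = np.zeros(n_taps)
    w[n_taps // 2] = 1.0                       # center-spike initialization
    for n in range(len(x) - n_taps + 1):
        u = x[n:n + n_taps]                    # current input window
        y = w @ u                              # equalizer output
        w -= mu * (y * y - radius) * y * u     # stochastic gradient step
    return w

def modulus_error(w, x, radius=1.0):
    """Mean squared deviation of the output power from the target modulus."""
    y = np.correlate(x, w, mode="valid")
    return np.mean((y * y - radius) ** 2)

rng = np.random.default_rng(0)
symbols = rng.choice([-1.0, 1.0], size=5000)                 # BPSK source
received = np.convolve(symbols, [1.0, 0.4])[:len(symbols)]   # assumed ISI channel

w_init = np.zeros(7)
w_init[3] = 1.0
w_final = cma_equalize(received)
error_before = modulus_error(w_init, received)
error_after = modulus_error(w_final, received)
```

Because the update uses only the output's modulus and never the transmitted symbols, the equalizer is blind; normalized variants rescale the step size, which this sketch leaves fixed.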


Effects of PSK Modulation Methods in Underwater Acoustic Communication (PSK 변조방식이 수중통신에 미치는 영향에 관한 연구)

  • Cho, Jin-Soo;Jung, Seung-Back;Shim, Tae-Bo
    • The Journal of the Acoustical Society of Korea
    • /
    • v.26 no.7
    • /
    • pp.366-374
    • /
    • 2007
  • In underwater wireless communication, the need for long-distance communication using high frequencies is overtaking that for short-range ultrasonic communication, and demand for transmitting and receiving varied data such as voice and high-resolution images is also increasing. In this work, we study how the choice of digital modulation method affects real underwater communication. Simulation shows that, among the PSK-based modulation schemes considered (BPSK, QPSK, MSK, GMSK), only GMSK shows a significant performance difference. The test conditions simulate the oceanographic conditions along the 207-survey line, 15 km south of Busan, with the SNR kept at 35 dB or below. The simulated tests cover both image transmission ($3{\times}10^5$ pixels, 4 bits per pixel) and voice communication ($10^{-2}$ BER, channel capacity of 1 kbps). The results show a gain of about 7 seconds in transmission time for image transmission, where the channel capacities for BPSK, QPSK, and MSK and for GMSK were 65 kbps and 45 kbps, respectively, and a gain of about 8 km in distance for voice communication.

A Study on Transmit Diversity of Repeaters for 1x EV-DO Networks (1x EV-DO 서비스망을 위한 이동통신 중계기의 송신 다이버스티에 관한 연구)

  • 김선근;이영섭;김기문
    • Journal of the Korea Institute of Information and Communication Engineering
    • /
    • v.8 no.4
    • /
    • pp.761-766
    • /
    • 2004
  • Rayleigh fading due to multipath degrades mobile service quality, especially for high-data-rate mobile services such as 1xEVDO and W-CDMA. A field test showed that the download data rate of 1xEVDO is seriously affected by Rayleigh fading. To reduce the effect of Rayleigh fading, transmit diversity was implemented in an RF repeater. In the field test, the transmit diversity function roughly doubled the data rate compared with a repeater without transmit diversity. As high-data-rate services grow in importance, transmit diversity will become an important repeater function.