• Title/Summary/Keyword: voice data

Search Result 1,260, Processing Time 0.028 seconds

The Study on Internet Voice Conference using MGCP and IP-Multicast (MGCP와 IP-Multicast를 이용한 Internet Voice Conference에 관한 연구)

  • Lee, Song-Ho;Choe, Gyeong-Sam;Lee, Jong-Su
    • Proceedings of the KIEE Conference
    • /
    • 2001.11c
    • /
    • pp.130-133
    • /
    • 2001
  • VoIP(voice over internet protocol) technology is based on IP protocol. The IP protocol can be involved in two types of communication: unicasting and multicasting. Unicasting is the communication between one sender and one receiver. It is one-to-one communication. Multicasting is one-to-many communication. So that, many receivers can get same data from one sender simultaneously. and, the different protocol are proposed for VoIP; H.323, SIP and MGCP. MGCP is perfect server-client protocol, so MGCP is very attractive VoIP protocol to ISP. This paper uses MGCP and offers modified MGCP for conference call. So that, Modified MGCP is compatible to MGCP, and supports conference call using IP-multicast.

  • PDF

Multipoint VoIP of End-point Mixing in Various Environments (다양한 환경에서 단말혼합 방법의 다자간 VoIP 운용)

  • Kim, Do-Yun;Park, Eun-Sung;Lee, Sung-Min;Seong, Dong-Su;Lee, Keon-Bae
    • Proceedings of the IEEK Conference
    • /
    • 2009.05a
    • /
    • pp.16-18
    • /
    • 2009
  • VoIP(Voice over IP) is the technology to transport voice and video over IP networks such as Internet. Today, VoIP technology is viewed as the right choice for provide voice, video, and data communication over next generation network. We are sure that the multipoint VoIP will help enhancing the various application services in ubiquitous environment. The paper shows multipoint VoIP system implemented with end-point mixing model and introduces various embedded systems such as UFC(Ubiquitous Fashionable Computer), tourist guide terminal and industrial terminal which use the multipoint VoIP.

  • PDF

Voice Recognition Elevator for Handicapped People (장애인을 위한 음성인식 엘리베이터)

  • Oh, Yong-Jae;Kim, Jeong-Rae;Chung, Ik-Joo
    • Journal of Industrial Technology
    • /
    • v.33 no.A
    • /
    • pp.55-60
    • /
    • 2013
  • In this paper, we proposed an efficient method for implementing a voice recognition elevator. Unlike the existing ones, the proposed system is based on the bluetooth communication and smartphones equipped with the google speech recognition software, which makes it possible that the speech recognition capability can be added to the previously installed elevators. In order to improve the recognition accuracy, instead of using the result of the google recognizer, we built a web server where the user data are accumulated and they are used for recognition error correction.

  • PDF

Policy and Managerial Issues of Voice over Internet Protocol(VoIP) (인터넷전화의 정책 및 경영이슈측면에서의 이용자분석)

  • Kim, Ji-Hee;Sung, Yoon-Young;Kweon, O-Sang;Kim, Jin-Ki
    • Journal of Information Technology Applications and Management
    • /
    • v.14 no.4
    • /
    • pp.221-233
    • /
    • 2007
  • Which factors should influence consumer consideration to subscribe to Voice over Internet Protocol (VoIP)? Policy issues, managerial concerns, and demographic variables are possible factors. This paper discusses policy and managerial issues regarding VoIP adoption. A model that explains VoIP adoption is proposed and tested. This study analyzes a survey of 750 prospective VoIP users in Korea. The testing is accompanied by logistic regression and discriminant analysis. The results show that trust in VoIP, relative comparison of Quality to fixed service, numbering plan, satisfactions of call Quality and customer services on both fixed and mobile services have impacts on the adoption of VoIP. Implications for VoIP providers and policy makers are presented.

  • PDF

Improvement of Speech/Music Classification Based on RNN in EVS Codec for Hearing Aids (EVS 코덱에서 보청기를 위한 RNN 기반의 음성/음악 분류 성능 향상)

  • Kang, Sang-Ick;Lee, Sang Min
    • Journal of rehabilitation welfare engineering & assistive technology
    • /
    • v.11 no.2
    • /
    • pp.143-146
    • /
    • 2017
  • In this paper, a novel approach is proposed to improve the performance of speech/music classification using the recurrent neural network (RNN) in the enhanced voice services (EVS) of 3GPP for hearing aids. Feature vectors applied to the RNN are selected from the relevant parameters of the EVS for efficient speech/music classification. The performance of the proposed algorithm is evaluated under various conditions and large speech/music data. The proposed algorithm yields better results compared with the conventional scheme implemented in the EVS.

A Threshold Adaptation based Voice Query Transcription Scheme for Music Retrieval (음악검색을 위한 가변임계치 기반의 음성 질의 변환 기법)

  • Han, Byeong-Jun;Rho, Seung-Min;Hwang, Een-Jun
    • The Transactions of The Korean Institute of Electrical Engineers
    • /
    • v.59 no.2
    • /
    • pp.445-451
    • /
    • 2010
  • This paper presents a threshold adaptation based voice query transcription scheme for music information retrieval. The proposed scheme analyzes monophonic voice signal and generates its transcription for diverse music retrieval applications. For accurate transcription, we propose several advanced features including (i) Energetic Feature eXtractor (EFX) for onset, peak, and transient area detection; (ii) Modified Windowed Average Energy (MWAE) for defining multiple small but coherent windows with local threshold values as offset detector; and finally (iii) Circular Average Magnitude Difference Function (CAMDF) for accurate acquisition of fundamental frequency (F0) of each frame. In order to evaluate the performance of our proposed scheme, we implemented a prototype music transcription system called AMT2 (Automatic Music Transcriber version 2) and carried out various experiments. In the experiment, we used QBSH corpus [1], adapted in MIREX 2006 contest data set. Experimental result shows that our proposed scheme can improve the transcription performance.

Terminal-Assisted Hybrid MAC Protocol for Differentiated QoS Guarantee in TDMA-Based Broadband Access Networks

  • Hong, Seung-Eun;Kang, Chung-Gu;Kwon, O-Hyung
    • ETRI Journal
    • /
    • v.28 no.3
    • /
    • pp.311-319
    • /
    • 2006
  • This paper presents a terminal-assisted frame-based packet reservation multiple access (TAF-PRMA) protocol, which optimizes random access control between heterogeneous traffic aiming at more efficient voice/data integrated services in dynamic reservation TDMA-based broadband access networks. In order to achieve a differentiated quality-of-service (QoS) guarantee for individual service plus maximal system resource utilization, TAF-PRMA independently controls the random access parameters such as the lengths of the access regions dedicated to respective service traffic and the corresponding permission probabilities, on a frame-by-frame basis. In addition, we have adopted a terminal-assisted random access mechanism where the voice terminal readjusts a global permission probability from the central controller in order to handle the 'fair access' issue resulting from distributed queuing problems inherent in the access network. Our extensive simulation results indicate that TAF-PRMA achieves significant improvements in terms of voice capacity, delay, and fairness over most of the existing medium access control (MAC) schemes for integrated services.

  • PDF

A study on IP mobility for data service between heterogeneous networks based on IMS (IMS 기반의 이종망간 데이터 이동성기술 적용방안 연구)

  • Kim, Tae-Wan;Kim, Hee-Dong;Nan, Sung-Yong;Sung, Min-Mo
    • 한국정보통신설비학회:학술대회논문집
    • /
    • 2008.08a
    • /
    • pp.304-308
    • /
    • 2008
  • The standardization for Seamless IP Mobility between heterogeneous networks is being progressed according to each specific purpose of the organizations such as ITU-T (International Telecommunications Union - Telecommunication), 3GPP(3rd Generation Partnership Project), IETF(Internet Engineering Task Force), and IEEE(Institute of Electrical and Electronics Engineers). [1] Specially VCC (Voice Call Continuity) for seamless voice continuity between heterogeneous networks using IMS (IP Multimedia Subsystem) of the next generation platform is being introduced into a few companies, and also MIH (Media Independent Handover) and MIP (Mobile IP) for IP mobility between heterogeneous networks are being customized by their internal situation. This article describes the idea to support IP Mobility between heterogeneous networks in the network which IMS platform is deployed and only supports the mobility for the voice by the type of AS (Application Server).

  • PDF

A study for maximum channelizing by FIR filter in voice band (음성대역에서 FIR필터에 의한 최대 채널화에 관한 연구)

  • Kim, Seong-Cheol;Park, Kyung-Ho
    • Journal of the Korea Institute of Information and Communication Engineering
    • /
    • v.11 no.8
    • /
    • pp.1472-1477
    • /
    • 2007
  • Users are offered by the multimedia service of various information on current information-oriented society. The digitize became essential that process of various data is not to selected. Also, Filter technology is required to use the lacking frequency resources efficiently. This paper designs FIR digital band-pass filter of the voice band by narrow band pass filter md verify the characteristics of filter to use by the DSP practice SET.

Robust Speech Recognition Algorithm of Voice Activated Powered Wheelchair for Severely Disabled Person (중증 장애우용 음성구동 휠체어를 위한 강인한 음성인식 알고리즘)

  • Suk, Soo-Young;Chung, Hyun-Yeol
    • The Journal of the Acoustical Society of Korea
    • /
    • v.26 no.6
    • /
    • pp.250-258
    • /
    • 2007
  • Current speech recognition technology s achieved high performance with the development of hardware devices, however it is insufficient for some applications where high reliability is required, such as voice control of powered wheelchairs for disabled persons. For the system which aims to operate powered wheelchairs safely by voice in real environment, we need to consider that non-voice commands such as user s coughing, breathing, and spark-like mechanical noise should be rejected and the wheelchair system need to recognize the speech commands affected by disability, which contains specific pronunciation speed and frequency. In this paper, we propose non-voice rejection method to perform voice/non-voice classification using both YIN based fundamental frequency(F0) extraction and reliability in preprocessing. We adopted a multi-template dictionary and acoustic modeling based speaker adaptation to cope with the pronunciation variation of inarticulately uttered speech. From the recognition tests conducted with the data collected in real environment, proposed YIN based fundamental extraction showed recall-precision rate of 95.1% better than that of 62% by cepstrum based method. Recognition test by a new system applied with multi-template dictionary and MAP adaptation also showed much higher accuracy of 99.5% than that of 78.6% by baseline system.