• Title/Summary/Keyword: voice data

Search Result 1,250, Processing Time 0.029 seconds

Analysis of AMR Compressed Bit Stream for Insertion of Voice Data in QR Code (QR 코드에 음성 데이터 삽입을 위한 AMR 압축 비트열 분석)

  • Oh, Eun-ju;Cho, Hyun-ji;Jung, Hyeon-ah;Bae, Joung-eun;Yoo, Hoon
    • Proceedings of the Korean Institute of Information and Commucation Sciences Conference
    • /
    • 2018.10a
    • /
    • pp.490-492
    • /
    • 2018
  • This paper presents an analysis of the AMR speech data as a basic work to study the technique of inputting and transmitting AMR voice data which is widely used in the public cell phone. AMR consists of HEADER and Speech Data, and it is transmitted in bit format and has 8 bit-rate modes in total. HEADER contains mode information of Speech Data, and the length of Speech Data differs depending on the mode. We chose the best mode which is best to input into QR code and analyzed that mode. It is a goal to show a higher compression ratio for voice data by the analysis and experiments. This analysis shows improvement in that it can transmit voice data more effectively.

  • PDF

Comparison of vowel pitch results among several commercial voice analysis programs (각종 음성분석 상용 프로그램의 모음 기본주기 분석 결과 비교)

  • Nam, Ki-Chang;Lee, Seung-Hoon;Choi, Jai-Nam;Choi, Hong-Shik;Nam, Do-Hyun;Kim, Deok-Won
    • Proceedings of the KIEE Conference
    • /
    • 2005.05a
    • /
    • pp.54-56
    • /
    • 2005
  • Analysis of the voice and its corresponding studies are examined from the recording of the voice through microphone and various calculation processes of the signals by using computer. Voice analyser include data acquisition and analyzing program. Since oath program uses different voice signal processing algorithm, thorough understanding of the operation is essential. In this study, analysis result of patient voice were compared by using four different voice analysis programs such as MDVP, Praat, TF32, and the program developed in this study. Pitch, jitter and shimmer were selected as comparison analysis factors. As a result, pitch, jitter and shimmer showed different result since each program uses different pitch computation algorithm.

  • PDF

Acoustic Characteristics of the Voices of Korean Normal Adults by Gender on MDVP (성별에 따른 한국 정상 성인 음성의 음향학적 평가 기준치)

  • Kim, Jae-Ock
    • Phonetics and Speech Sciences
    • /
    • v.1 no.4
    • /
    • pp.147-157
    • /
    • 2009
  • The purpose of the study is to develop the normal voice database and to analyze the acoustic characteristics of Korean adults' voices by gender using MDVP. Eight categories in the 34 parameters of MDVP were analyzed in the voices of 170 Korean normal adults taken from /a/ vowel. Among them, Fundamental Frequency Parameters and Frequency Perturbation Parameters were significantly different by gender. In addition, Fundamental Frequency Parameters of our data were remarkably different from the data suggested in the MDVP program which currently used in clinics. Therefore, the data obtained from the current study can be effectively used for the diagnosis of voice disorders of Korean adults as the standard parameter values of MDVP.

  • PDF

Priority-based Reservation Code Multiple Access (P-RCMA) Protocol (우선순위 기반의 예약 코드 다중 접속 (P-RCMA) 프로토콜)

  • 정의훈
    • The Journal of Korean Institute of Communications and Information Sciences
    • /
    • v.29 no.2A
    • /
    • pp.187-194
    • /
    • 2004
  • We propose priority-based reservation code multiple access (P-RCMA) which can enhance voice traffic quality of the previous RCMA. The proposed protocol maintains two power levels and consider traffic characteristics in contending shared available codes to transmit packets. P-RCMA gives priority to the voice request packets rather than data packets by capture effect at the receiver part of base station. We show numerical results from EPA (equilibrium point analysis) analysis and simulation study in terms of voice packet dropping probability and average data packet transmission delay.

A study on traffic analysis in voice/data mixed PCS system (음성/데이타 복합서비스 PCS시스템의 트래픽 분석)

  • 김영일;진용욱
    • Journal of the Korean Institute of Telematics and Electronics B
    • /
    • v.33B no.6
    • /
    • pp.136-148
    • /
    • 1996
  • In this paper, we analyze the traffic characteristics in microcell and macrocell overlaid PCS system which process voice and dta calls separately each others. in this system, data calls are delayed in queue when all of channels are occupied, while voice calls are bolcked in that case. For this, we calculated inter-microcell handoff area dwelling time distribution and handoff area dwelling time distribution between microcell and macrocell. We analyze traffic performance using this results. We used M/M/C/K model, and analyzed traffic performance of macrocell with handoff area variation of microcell.

  • PDF

Implement PAMD for discriminate human and ARS (수화자(受話者) 구별을 위한 PAMD 구현)

  • 서봉수
    • Proceedings of the IEEK Conference
    • /
    • 2003.11a
    • /
    • pp.61-64
    • /
    • 2003
  • In this paper, we implement PAMD(Positive Answering Machine Detection) for discrimination human and ARS. We are used Grunt detection, Glitch Noise detection and Tone detection for PAMD. It distinguishes voice signals from ring-back tone and glitch noise respectively. And as a second step, it judges whether human responses or ARS responses after integrating pattern changes like initial response period, the number of voice data, each time of voice data period and glitch noise. The accuracy is about 9375 in ASR and about 98% in Mobile phone.

  • PDF

Analysis of VoLTE Charge Reduction under VoLTE Growth (VoLTE 활성화에 따른 요금 인하 여력 분석)

  • Lee, Sang-Woo;Jeong, Seon-Hwa
    • The Journal of Korean Institute of Communications and Information Sciences
    • /
    • v.41 no.1
    • /
    • pp.92-100
    • /
    • 2016
  • It is informed that the Voice over LTE(VoLTE) which serves voice and message on IP networks is better in terms of economies of scale than the legacy voice service on 2G/3G circuit-switched networks because of its technological and cost efficiency. In addition, services of voice and data are running on a single LTE network and as a result VoLTE has the more economies of scope. But, there is no study about how much technology-efficiency VoLTE has compared to circuit-based voice service and how much voice charge can be reduced as VoLTE grows up. This paper analyzes empirically cost-efficiency of VoLTE against circuit-based voice service and quantifies the reduction of voice charge as 2G/3G voice traffic shifts to VoLTE. The results describe the first is that the average cost of the total voice traffic rises shortly just after the investment of LTE network for providing VoLTE but it will soon have a capacity available to reduce the charge due to VoLTE's outstanding cost efficiency on the assumption that voice traffic is fixed, and the second is that the charge can be cut to 60% of the current rate in case of all the voice traffic moves to VoLTE. The latter proves partially the validation of data-focusing pricing plan. Our results are expected to become basic data for network operators' establishing pricing strategies and for policy makers' inducing price cutting.

A Study of the Correlation between Subjective and Objective Evaluation of Voice Disorders (음성장애 주관적 평가와 객관적 평가 간의 상관성 연구)

  • Lee, Ok-Bun;Kim, So-Yeon
    • Phonetics and Speech Sciences
    • /
    • v.3 no.3
    • /
    • pp.167-172
    • /
    • 2011
  • The purpose of this study was to examine the relationship between subjective and objective evaluation in speakers with voice disorders. Subjective evaluation indicates the self-reports of voice problems by dysphonic speakers. The relating protocol is the Voice Handicap Index (VHI) and the self-awareness index of voice problems (SAIVP-14). A total of 48 individuals with voice disorders replied to the questionnaire and participated in a voice assessment. Objective evaluations included the perceptual judgement of G grade in GRBAS, acoustic measurements (jitter, shimmer, NHR) by MDVP (CSL 4400), and aerodynamic measurements (MPT, MFR, psub) by PAS (Phonatory Aerodynamic System, KayPentax, USA). Pearson and Spearman correlations were used for the analysis. In the correlation with perceptual judgement (G grade) and VHI-Total, VHI-Physical, and SAIVP-14, there was a significant correlation, but the overall correlation was poor. NHR, jitter, and shimmer were significantly correlated with overall VHI and SAIVP-14. Specifically, the correlation with shimmer was stronger compared to the other measurements. In aerodynamic measures, MFR and MPT showed a significant correlation with VHI-Total, VHI-Emotional, and SAIVP-14, but their correlation was poor. The results of this study suggested that subjective evaluation of self voice problems is meaningfully correlated with objective evaluations, but more data in the multidimensional voice assessment should be collected and analyzed for the reliability and validity of the voice handicap questionnaire.

  • PDF

Impact of Voice Activity Detection on Channel Allocation in Cellular Networks

  • Limsaksri, Wichan;Thipchaksurat, Sakchai;Varakulsiripunth, Ruttikorn
    • 제어로봇시스템학회:학술대회논문집
    • /
    • 2004.08a
    • /
    • pp.1067-1071
    • /
    • 2004
  • In this paper, the performance enhancement algorithm of channel allocation for voice and data transmission in cellular networks is proposed. The voice activity detection has been applied to dynamic channel allocation procedure to detect and separate the silence and speech among conversation periods. Hence a data user can use the silent period of an active voice channel to transmit its information. To control the selecting of channel allocation policies, the information of number of data in transmission waiting queue has been determined in order to accept the performance measurement. In the simulation results, the improvement of the performance shows via the quality of services, which are an average delay in queue, a blocking probability, and an impact of the proposed scheme is presented in the system.

  • PDF

The Development of Data Capturing Modules by Speech-Voice Recognition (음성인식에 의한 측량자료취득 모듈개발)

  • 조규전;이영진;차득기
    • Journal of the Korean Society of Surveying, Geodesy, Photogrammetry and Cartography
    • /
    • v.18 no.3
    • /
    • pp.279-285
    • /
    • 2000
  • Men's desire for the human interface, due to the development of voice processing technology of computer, and the development of intelligent MMI (Man-Machine Interface) computer technology enabled us to operate computers with our voice without using keyboards or other input systems. Especially, by obtaining field data and layout from the complicated surveying environment and applying the voice recognition technology to the actual surveying work, we can save a lot of working hours and costs. According to the result of this study, the real time Geo-Coding and graphic data-coding were possible with only 25 words by connecting the software engine which recognizes 50,000 different words and the voice recognition technology based on the super IC which recognizes 60 different words with the Total-station and the RTK-GPS.

  • PDF