• Title/Summary/Keyword: voice data

Search Result 1,250, Processing Time 0.029 seconds

Ten years of clinical experience with the patients with vocal nodule (성대결절 환자에 대한 10년간 임상 경험)

  • Lim, Hye Jin;Kim, Jeong Kyu;Choi, Chul-Hee;Choi, Seong Hee
    • Phonetics and Speech Sciences
    • /
    • v.9 no.4
    • /
    • pp.99-106
    • /
    • 2017
  • Clinical data about vocal nodules have seldom been reported, even though vocal nodules are commonly diagnosed in outpatient speech and voice clinic. This study aims to investigate clinical characteristics of the patients who are diagnosed with vocal nodules. This study analyzed the data for 10 years from the 319 patients diagnosed with vocal nodules (45 males and 274 females with the mean age of 39.4 ranging from 2 to 83) in terms of gender, age, occupation, voice change initiation pattern, change with time, throat clearing, smoking history, type of voice abuse, acoustic analysis, maximum phonation time, GRBAS, and VHI. Thirteen patients (4.08%) had unilateral vocal nodule and 306 patients (95.9%) had bilateral vocal nodule, the majority of which had a pattern of asymmetry (73.9%). The glottal closure pattern was hourglass in 72.1% of patients, posterior chink in 17.9% of patients, and irregular in 7.9% of patients. The most common occupational category was professional voice users (43.4%). The voice abuse pattern included excessive talking in 96 patients (76.8%), loud voice in 78 (62.4%) patients, and excessive singing in 17 patients (21.6%). The patients showed worse scores in G, B, and S than in R and A for the GRBAS evaluation. The most recommended treatment for vocal nodules was voice therapy. The current clinical data will be helpful for treatment planning for the patients of vocal nodule.

Design and Implementation of Voice One-Time Password(V-OTP) based User Authentication Mechanism on Smart Phone (스마트폰에서 음성 정보를 이용한 일회용 패스워드(V-OTP) 기반 사용자 인증 메커니즘 설계 및 구현)

  • Cho, Sik-Wan;Lee, Hyung-Woo
    • The KIPS Transactions:PartC
    • /
    • v.18C no.2
    • /
    • pp.79-88
    • /
    • 2011
  • It is necessary for us to enhance the security service on smart phone by using voice data on authentication procedure. In this study, a voice data based one-time password generation mechanism is designed and implemented for enhancing user authentication on smart phone. After receiving a PIN value from the server, a user inputs his/her own voice biometric data using mike device on smart phone. And then this captured a voice biometric data will be used to generate one-time token on server side after verification procedures. Based on those mutual authentication steps, a voice data based one-time password(V-OTP) will be generated by client module after receiving the one-time token from the server finally. Using proposed voice one-time password mechanism, it is possible for us to provide more secure user authentication service on smart phone.

The Interactive Voice Services based on VoiceXML (VoiceXML 기반 음성인식시스템을 이용한 서비스 개발)

  • Kim Hak-Gyoon;Kim Eun-Hyang;Kim Jae-In;Koo Myoung-Wan
    • MALSORI
    • /
    • no.43
    • /
    • pp.113-125
    • /
    • 2002
  • As there are needs to search the Web information via wire or wireless telephones, VoiceXML forum was established to develop and promote the Voice eXtensible Markup Language (VoiceXML). VoiceXML simplifies the creation of personalized interactive voice response services on the Web, and allows voice and phone access to information on Web sites, call center databases. Also, it can utilize the Web-based technologies, such as CGI(Common Gateway Interface) scripts. In this paper, we have developed the voice portal service platform based on VoiceXML called TeleGateway. It enables integration of voice services with data services using the Automatic Speech Recognition (ASR) and Text-To-Speech (TTS) engines. Also, we have showed the various services on voice portal services.

  • PDF

Analysis of Voice Color Similarity for the development of HMM Based Emotional Text to Speech Synthesis (HMM 기반 감정 음성 합성기 개발을 위한 감정 음성 데이터의 음색 유사도 분석)

  • Min, So-Yeon;Na, Deok-Su
    • Journal of the Korea Academia-Industrial cooperation Society
    • /
    • v.15 no.9
    • /
    • pp.5763-5768
    • /
    • 2014
  • Maintaining a voice color is important when compounding both the normal voice because an emotion is not expressed with various emotional voices in a single synthesizer. When a synthesizer is developed using the recording data of too many expressed emotions, a voice color cannot be maintained and each synthetic speech is can be heard like the voice of different speakers. In this paper, the speech data was recorded and the change in the voice color was analyzed to develop an emotional HMM-based speech synthesizer. To realize a speech synthesizer, a voice was recorded, and a database was built. On the other hand, a recording process is very important, particularly when realizing an emotional speech synthesizer. Monitoring is needed because it is quite difficult to define emotion and maintain a particular level. In the realized synthesizer, a normal voice and three emotional voice (Happiness, Sadness, Anger) were used, and each emotional voice consists of two levels, High/Low. To analyze the voice color of the normal voice and emotional voice, the average spectrum, which was the measured accumulated spectrum of vowels, was used and the F1(first formant) calculated by the average spectrum was compared. The voice similarity of Low-level emotional data was higher than High-level emotional data, and the proposed method can be monitored by the change in voice similarity.

Multimedia Traffic Analysis using Markov Chain Model in CDMA Mobile Communication Systems (CDMA 이동통신 시스템에서 멀티미디어 트래픽에 대한 마르코프 체인 해석)

  • 김백현;김철순;곽경섭
    • Journal of Korea Multimedia Society
    • /
    • v.6 no.7
    • /
    • pp.1219-1230
    • /
    • 2003
  • We analyze an integrated voice/data CDMA system, where the whole channels are divided into voice prioritized channels and voice non-prioritized channels. For real-time voice service, a preemptivc priority is granted in the voice prioritized channels. And, for delay-tolerant data service, the employment of buffer is considered. On the other hand, the transmission permission probability in best-effort packet-data service is controlled by estimating the residual capacity available for users. We build a 2-dimensional markov chain about prioritized-voice and stream-data services and accomplish numerical analysis in combination with packet-data traffic based on residual capacity equation.

  • PDF

A Suggestion of Efficient Method for Integrating XML and Voice XML (XML과 Voice XML의 효율적인 통합 방안 제시)

  • 장민석;홍용택
    • Proceedings of the Korean Institute of Information and Commucation Sciences Conference
    • /
    • 2001.05a
    • /
    • pp.260-264
    • /
    • 2001
  • In this paper we suggest a method for translating (or integrating) XML documents into Voice XML documents in order to provide voice communication service. In the forthcoming web environment, XML will certainly overwhelms HTML. At this situation, the method of accessing data through the more various types of terminal machines is required.'rho best way is to use Voice XML by which the data accessing method is able to change from Web to the wired or/and wireless terminal at low costs. Thus we suggest a method for integrating the XML-based system into the Voice XML-based one.

  • PDF

The Association between Duration of Self-reported Voice Problems and Voice Disorders among Adults (주관적 음성문제 인지 기간과 병인학적 음성질환과의 관계)

  • Byeon, Hae-Won
    • Phonetics and Speech Sciences
    • /
    • v.3 no.3
    • /
    • pp.125-132
    • /
    • 2011
  • Studies on the risk factors of voice disorders in Korean adults are rare. I evaluated the association between the duration of self-reported voice problem and voice disorders in Korean adults. Data were from the 2008 Korea National Health and Nutritional Examination Survey. Subjects were 3,135 people (1,310 men and 1,825 women) aged 19 years and older. Multi-nominal logistic regression analyses were used to examine the association between the duration of self-reported voice problem and voice disorders. The prevalence of self-reported voice problems was 5.9% among Korean adults. Adjusting for covariates (age, sex, education level, length of employment, tobacco consumption, alcohol consumption, thyroid disorders, pain and discomfort during the last two weeks), self-reported voice problems lasting longer than three weeks were independently associated with functional voice disorders (OR=5.30, 95% CI: 3.30-8.50) and organic voice disorders (OR=4.84, 95% CI: 1.82-12.89). Self-reported voice problems in the past three weeks were significantly associated with functional voice disorders (OR=3.64, 95% CI: 1.84-7.19), but not significantly associated with organic voice disorders. Self-reported voice problems are prevalent among adults. This study highlights that self-perception of a voice problem for more than three weeks is related to functional voice disorders and organic voice disorders.

  • PDF

Performance of cellular CDMA system for voice/data integrated service (음성/데이타 집적서비스를 위한 CDMA 셀룰러 시스템의 성능 연구)

  • 강군화;조동호
    • The Journal of Korean Institute of Communications and Information Sciences
    • /
    • v.19 no.9
    • /
    • pp.1748-1758
    • /
    • 1994
  • Recently, the demand of mobile communication is rapidly increasing. Also, not only the voice service but also the nonvoice services such as data, FAX, and image service are required. Therefore, in this paper, the voice/data intergraed service methods that will be utilized as a basic core technology of the PCS systems are proposed and their performances are analyzed and compared by computer simulation. According to the simulation results, it could be seen that the performance of voice/data integrated PR-CDMA method is better than that of voice/data integrated broadband CDMA method using a dedicated terminal or a voice/data integrated terminal. The reason is that the voice/data integrated PR-CDMA method can overcome the weak points of CDMA protocol, such as a limitation of the fixed CDMA logical channel number and a falling-off in channel utilization, by using PRMA protocol as a multiple access method that the terminals to which a CDMA logical channel is assigned compete.

  • PDF

The Influence of Noise Environment upon Voice and Data Transmission in the RF-CBTC System

  • Kim, Min-Seok;Lee, Sang-Hyeok;Lee, Jong-Woo
    • International Journal of Railway
    • /
    • v.3 no.2
    • /
    • pp.39-45
    • /
    • 2010
  • The RF-CBTC (Radio Frequency-Communication Based Train Control) System is a communication system in railroad systems. The communication method of RF-CBTC system is the wireless between the wayside device and on-board device. The wayside device collects its location and speed from each train and transmits the distance from the forwarding train to the speed-limit position to it. The on-board device controlling device controls the speed optimum for the train. In the case of the RF-CBTC system used in Korea, transmission frequency is 2.4 [GHz]. It is the range of ISM(Industrial Scientific and Medical equipment) band and transmission of voice and data is performed by CDMA (Code Division Multiple Access) method. So noises are made in the AWGN (Additive White Gaussian Noise) and fading environment. Currently, the SNR (Signal to Noise Ratio) is about 20 [dB], so due to bit errors made by noises, transmission of reliable information to the train is not easy. Also, in the case that two tracks are put to a single direction, it is needed that two trains transmit reliable voice and data to a wayside device. But, by noises, it is not easy that just a train transmits reliable information. In this paper, we estimated the BER (Bit Error Rate) related to the SNR of voice and data transmission in the environment such as AWGN and fading from the RF-CBTC system using the CDMA method. Also, we supposed the SNR which is required to meet the BER standard for voice and data transmission. By increasing the processing gain that is a ratio of chip transmission to voice and data transmission, we made possible voice and data transmission from maximally two trains to a wayside device, and demonstrated it by using Matlab program.

  • PDF

A Security-Enhanced Storing Method for the Voice Data in the Aircraft (항공기에서 보안 강화된 음성 데이터 저장 방식)

  • Cho, Seung Hoon;Suh, Jeong Bae;Moon, Yong Ho
    • IEMEK Journal of Embedded Systems and Applications
    • /
    • v.6 no.4
    • /
    • pp.255-261
    • /
    • 2011
  • In this paper, we propose a security-enhanced storing method for the voice data obtained during the flight. When an emergency occurs during flight, the flight data in the storage device such as DTS or Blackbox can be exposed to antagonist or enemy. Currently, zeroize function is embedded in these devices in order to prevent this situation. However, this could not be operated if the system is malfunctioned or the pilot is wounded in the emergency. In order to solve this problem, the voice data compressed by the ADPCM is encrypted in the proposed method composed of the AES algorithm and a reordering method. The simulation results show that the security for the voice date is further enhanced due to the proposed method.