• Title/Summary/Keyword: voice communication

검색결과 1,027건 처리시간 0.032초

기동무기체계에서의 통신을 위한 음성신호 포착 연구 (A Study of Voice signal Capture for communication in the AFV)

  • 김석봉;이성태
    • 한국군사과학기술학회지
    • /
    • 제6권1호
    • /
    • pp.81-90
    • /
    • 2003
  • In the military communication environment, it is very difficult to obtain clear voice signal due to the high level noise. The purpose of this study is to find out the best body spot to get the vocal chords signal by measuring the skin or the bone conducting vibrations of different body positions within the noise environment. Based on the experimental study, it was found out that the measurement of sound signal within the ear is the best way to get the voice which comes from the vocal chords and this method can prevent the interruption of noise. This study will give the effective voice communication method in the high noise environment and be applicable to military purpose.

센서 네트워크 기반의 다수 사용자간 Full-Duplex 음성 통신 시스템을 위한 TDMA/TDD MAC 프로토콜 설계 (A Design of TDMA/TDD MAC Protocol for Full-Duplex Multi-User Voice Communication Systems Based on Sensor Network)

  • 김지수;이재형;조성호
    • 한국통신학회논문지
    • /
    • 제38C권3호
    • /
    • pp.239-246
    • /
    • 2013
  • 기존 IEEE 802.15.4는 PHY 계층과 MAC 계층에서의 표준을 제공하며 저전력, 저대역폭, 저속 데이터 통신을 특징으로 한다. 이러한 한계점으로 인하여 IEEE 802.15.4는 센서 검출, 홈 네트워크 등의 제한된 용도로만 쓰였으나 최근 음성과 같은 멀티미디어 데이터를 전송하려는 연구가 활발히 진행되고 있다. 본 논문에서는 기존 센서 네트워크 기반 Peer-to-Peer 음성 통신의 개선을 통해 다수 사용자간의 음성 통신을 지원하기 위하여 새로운 IEEE 802.15.4 PHY 기반 TDMA/TDD MAC을 설계하고 그룹 통신을 할 수 있는 하드웨어를 개발 하였다. 또한 설계된 시스템의 성능을 평가하기 위하여 실험을 통해 Mean Opinion Score (MOS)를 측정 하였으며 이는 사인파를 사용하는 방법을 이용하여 검증하였고 본 논문에서 제안하는 시스템이 실제 환경에서 다양한 응용 솔루션으로 개발 될 수 있음을 기대하였다.

한국인의 음성질환이 삶의 질에 미치는 영향 (The Effect of Voice Disorders on Quality of Life(QOL) in the Korean)

  • 송윤경;심현섭;권기환;이경철;이용배;진성민
    • 대한후두음성언어의학회지
    • /
    • 제11권1호
    • /
    • pp.51-60
    • /
    • 2000
  • Background and Objectives : Quality of life(QOL) is a construct representing physical, mental and social well-being. QOL has been used as a device for measuring the severity of health-related condition and treatment outcomes. As the social welfare system develops, the attention to QOL increases as well. The aims of this study was to examine whether the patients with voice disorder perceived significantly more the effects of voice disorder on QOL than nonpatient group did and if any, identify the sociodemographic risk factors influencing QOL of patients. Materials and Methods : This study asked 113 adults with voice disorders who were enrolled in Voice Clinic in the Department of Otolaryngology, Kangbuk Samsung Hospital between lune 1998 and January 1999 and 111 nonpatients to complete a questionnaire designed to elicit information about the effete of voice disorders on quality of lift. The questionnaire included items concerning sociodemographic areas, voice symptoms, job, effects of voice disorders on QOL domains(work, social, psychological, physical, and communication areas), potential risk factors to exposures, familial and medical history of voice disorders. Results : The sociodemographic characteristics of the patient group are as follows : (1) 75.2% of total patient group were female and the rest were male. (2) Age of total patient group ranged from 20 to 65 years. Hoarseness was the most commonly reported complaints, followed by complaints of high note difficulties during singing and voice fatigue. The patient group perceived effects of voice disorders on the areas of work, social, psychological, physical and communication more adversely than the comparison group did (p<0.05). QOL impairments were evaluated as a function of age, gender, education, and income, controlling other independent effects. The results were that (1) age was significantly associated with work problems and (2) gender and income were significantly associated with psychological problems. Conclusions : The findings indicated that the patients with voice disorders would perceive markedly adverse effect on all QOL domains, that is, work, social, psychological, physical, communicational areas. Therefore, the results of study suggest that lurker investigations about the nature of voice disorders, the prevention, treatment, and coping strategies are needed in the future.

  • PDF

VoiceXML을 이용한 자동차 정보 안내 시스템 구현 (An Implementation of Automobile Information System using VoiceXML)

  • 양정수;김동규;김정현;노용완;홍광석
    • 융합신호처리학회 학술대회논문집
    • /
    • 한국신호처리시스템학회 2005년도 추계학술대회 논문집
    • /
    • pp.290-293
    • /
    • 2005
  • 음성 인식 기술이 발달함에 따라 음성 인식 기술을 이용한 응용의 개발이 중요한 문제로 떠오르고 있다. VoiceXML은 전화기를 통한 음성 인터페이스를 위한 XML 언어로서 손쉬운 방법으로서 음성 인터페이스를 설계, 구현할 수 있도록 만들어진 언어이다. 본 논문에서는 이를 이용해 전화를 통하여 음성으로 자동차 정보 안내 시스템을 사용할 수 있는 사용자 인터페이스를 구현한다. 구현된 시스템 및 서비스는 VoiceXML의 장점을 활용하여 원거리에서 편리하게 사용자가 자동차의 정보를 안내받고 제어할 수 있는 인터페이스 자체보다는 음성 인터페이스의 설계 및 구현에 중점을 두었다. 10인의 피실험자가 각 10회씩 총 100회를 실험한 결과 99.3%의 인식률을 보였다. 추후 차세대 자동차 텔레메틱스 서비스와 연동하면 구현되어진 시스템의 활용이 증대될 것이라 판단된다.

  • PDF

차세대 멀티미디어 음성보안 IP-PBX 시스템 개발 (Development of the Integrated Multimedia IP-PBX System)

  • 김삼택
    • 한국인터넷방송통신학회논문지
    • /
    • 제11권5호
    • /
    • pp.95-100
    • /
    • 2011
  • 차세대 IP-PBX는 음성 보안은 물론 UC(Unified Communication)를 수행하기위한 다양한 멀티미디어 기능이 요구된다. 따라서 본 논문에서는 SIP 기반의 VPN IPSec을 이용하여 터널링 기법의 음성보안 및 멀티미디어 통합 커뮤니케이션 솔루션을 개발하여 차세대 교환기인 IP-PBX와 연동하며, PC 기반의 커뮤니케이션 시스템과 PSTN 전화를 함께 사용한다. 특히, 화상회의, 개별 스위칭, 분산처리 기능을 적용함으로 차세대 IP-PBX에 임베디드화 하였고 인터넷 교환기의 성능을 측정하였다. 또한 본 차세대 IP-PBX는 소프트 폰과 연동되어 다양한 부가 서비스를 제공한다.

병적 음성과 정상 음성의 음향학적 파라미터 분포에 대한 통계적 분석 (An analysis of a statistical difference of acoustic Parameters' distribution between normal voice and pathological voice)

  • 김용주;권순복;김기련;신민철;조철우;왕수건
    • 대한전자공학회:학술대회논문집
    • /
    • 대한전자공학회 2001년도 하계종합학술대회 논문집(4)
    • /
    • pp.249-252
    • /
    • 2001
  • The most basic means of communication among humans is a voice. Without speaking of voice technologies, we found it is important and convenient to use a voice in everyday life. But. in consideration to speech recognition systems, we can't always desire a normal voice input as input signal to the system. Generally speaking. a pathological voice as against a normal which is a voice with a problem in the larynx. could be also special case of input voice. Of course, but the distortion of a speech signal by environmental effects i.e., noise or transmission channel was a raised problem. we will take up a pathological voices with laryngeal disease which is essential distortion factor in voice. Also, we are to find out the difference of acoustic parameters distribution between normal and pathological voice by a statistical method in our research.

  • PDF

Probabilistic Neural Network Based Learning from Fuzzy Voice Commands for Controlling a Robot

  • Jayawardena, Chandimal;Watanabe, Keigo;Izumi, Kiyotaka
    • 제어로봇시스템학회:학술대회논문집
    • /
    • 제어로봇시스템학회 2004년도 ICCAS
    • /
    • pp.2011-2016
    • /
    • 2004
  • Study of human-robot communication is one of the most important research areas. Among various communication media, any useful law we find in voice communication in human-human interactions, is significant in human-robot interactions too. Control strategy of most of such systems available at present is on/off control. These robots activate a function if particular word or phrase associated with that function can be recognized in the user utterance. Recently, there have been some researches on controlling robots using information rich fuzzy commands such as "go little slowly". However, in those works, although the voice command interpretation has been considered, learning from such commands has not been treated. In this paper, learning from such information rich voice commands for controlling a robot is studied. New concepts of the coach-player model and the sub-coach are proposed and such concepts are also demonstrated for a PA-10 redundant manipulator.

  • PDF

Voice Quality Criteria for Heterogenous Network Communication Under Mobile-VoIP Environments

  • Choi, Jae-Hun;Seol, Soon-Uk;Chang, Joon-Hyuk
    • The Journal of the Acoustical Society of Korea
    • /
    • 제28권3E호
    • /
    • pp.99-108
    • /
    • 2009
  • In this paper, we suggest criteria for objective measurement of speech quality in mobile VoIP (Voice over Internet Protocol) services over wireless mobile internet such as mobile WiMAX networks. This is the case that voice communication service is available under other networks. When mobile VoIP service users in the mobile internet network based on packet call up PSTN and mobile network users, but there have not been relevant quality indexes and quality standards for evaluating speech quality of mobile VoIP. In addition, there are many factors influencing on the speech quality in packet network. Especially, if the degraded speech with packet loss transfers to the other network users through the handover, voice communication quality is significantly deteriorated by the transformation of speech codecs. In this paper, we eventually adopt the Gilbert-Elliot channel model to characterize packet network and assess the voice quality through the objective speech quality method of ITU-T P. 862. 1 MOS-LQO for the various call scenario from mobile VoIP service user to PSTN and mobile network users under various packet loss rates in the transmission channel environments. Our simulation results show that transformation of speech codecs results in the degraded speech quality for different transmission channel environments when mobile VoIP service users call up PSTN and mobile network users.

다중 채널을 지원하는 Voice over Sensor Network(VoSN) Base Station 설계 (A Design of Voice Over Sensor Network (VoSN) Base Station with Multi-Channel Support)

  • 이훈재;이재형;강민수;조성호
    • 한국통신학회논문지
    • /
    • 제39C권1호
    • /
    • pp.90-96
    • /
    • 2014
  • 센서 네트워크를 위한 표준인 IEEE802.15.4는 저전력, 저속 데이터 통신이 특징으로 주로 ZigBee 네트워크와 같은 Wireless Personal Area Network (WPAN)를 구성하기 위해 사용하고 있다. 그러나 최근 센서 네트워크 기반의 음성통신과 Session Initiation Protocol (SIP)를 연동하여 장거리 및 대규모 사용자를 지원하기 위한 연구가 활발히 진행되고 있다. 본 논문에서는 센서 네트워크 기반의 음성통신과 SIP를 연동하여 다수 사용자 지원하고 기존 시스템을 하나의 통합 Base Station으로 설계하였다. 또한, 설계한 Base Station의 성능을 평가하기 위하여 사용자수 증가에 따른 Packet 수와 Delay를 측정하였다.

Real time instruction classification system

  • Sang-Hoon Lee;Dong-Jin Kwon
    • International Journal of Internet, Broadcasting and Communication
    • /
    • 제16권3호
    • /
    • pp.212-220
    • /
    • 2024
  • A recently the advancement of society, AI technology has made significant strides, especially in the fields of computer vision and voice recognition. This study introduces a system that leverages these technologies to recognize users through a camera and relay commands within a vehicle based on voice commands. The system uses the YOLO (You Only Look Once) machine learning algorithm, widely used for object and entity recognition, to identify specific users. For voice command recognition, a machine learning model based on spectrogram voice analysis is employed to identify specific commands. This design aims to enhance security and convenience by preventing unauthorized access to vehicles and IoT devices by anyone other than registered users. We converts camera input data into YOLO system inputs to determine if it is a person, Additionally, it collects voice data through a microphone embedded in the device or computer, converting it into time-domain spectrogram data to be used as input for the voice recognition machine learning system. The input camera image data and voice data undergo inference tasks through pre-trained models, enabling the recognition of simple commands within a limited space based on the inference results. This study demonstrates the feasibility of constructing a device management system within a confined space that enhances security and user convenience through a simple real-time system model. Finally our work aims to provide practical solutions in various application fields, such as smart homes and autonomous vehicles.