• 제목/요약/키워드: voice communication

검색결과 1,030건 처리시간 0.025초

Real time instruction classification system

  • Sang-Hoon Lee;Dong-Jin Kwon
    • International Journal of Internet, Broadcasting and Communication
    • /
    • 제16권3호
    • /
    • pp.212-220
    • /
    • 2024
  • A recently the advancement of society, AI technology has made significant strides, especially in the fields of computer vision and voice recognition. This study introduces a system that leverages these technologies to recognize users through a camera and relay commands within a vehicle based on voice commands. The system uses the YOLO (You Only Look Once) machine learning algorithm, widely used for object and entity recognition, to identify specific users. For voice command recognition, a machine learning model based on spectrogram voice analysis is employed to identify specific commands. This design aims to enhance security and convenience by preventing unauthorized access to vehicles and IoT devices by anyone other than registered users. We converts camera input data into YOLO system inputs to determine if it is a person, Additionally, it collects voice data through a microphone embedded in the device or computer, converting it into time-domain spectrogram data to be used as input for the voice recognition machine learning system. The input camera image data and voice data undergo inference tasks through pre-trained models, enabling the recognition of simple commands within a limited space based on the inference results. This study demonstrates the feasibility of constructing a device management system within a confined space that enhances security and user convenience through a simple real-time system model. Finally our work aims to provide practical solutions in various application fields, such as smart homes and autonomous vehicles.

Designing Health Games for Anti-Smoking Advertising Targeting College Students: The Impact of Message Types and Voice-Over

  • Yoo, Seung-Chul;Eastin, Matthew S.
    • International Journal of Contents
    • /
    • 제13권3호
    • /
    • pp.17-24
    • /
    • 2017
  • Video games are an alternative channel for public health authorities constantly striving to reach target audiences and make positive changes. The objective of this research is to address effectiveness of health games as a health promotion technology for anti-smoking communication. This study tested the impact of advertising message types and voice-over relative to anti-smoking persuasion. By using commercial-level custom-made first person shooter (FPS) games, two experimental studies demonstrated usefulness of applying health games to change a player's attitude towards smoking. In particular, an interactive in-game message with a persuasive voice over was the most effective method to change targets' attitude towards smoking. Findings of our research offer meaningful insight on health promotion research and provide possible directions for future anti-smoking communication using health games.

MGCP와 IP-Multicast를 이용한 Internet Voice Conference에 관한 연구 (The Study on Internet Voice Conference using MGCP and IP-Multicast)

  • 이송호;최경삼;이종수
    • 대한전기학회:학술대회논문집
    • /
    • 대한전기학회 2001년도 합동 추계학술대회 논문집 정보 및 제어부문
    • /
    • pp.130-133
    • /
    • 2001
  • VoIP(voice over internet protocol) technology is based on IP protocol. The IP protocol can be involved in two types of communication: unicasting and multicasting. Unicasting is the communication between one sender and one receiver. It is one-to-one communication. Multicasting is one-to-many communication. So that, many receivers can get same data from one sender simultaneously. and, the different protocol are proposed for VoIP; H.323, SIP and MGCP. MGCP is perfect server-client protocol, so MGCP is very attractive VoIP protocol to ISP. This paper uses MGCP and offers modified MGCP for conference call. So that, Modified MGCP is compatible to MGCP, and supports conference call using IP-multicast.

  • PDF

U-Sports용 음성통신 서비스 모델 제안 및 Hands-free 기기의 구현 (A Design of Voice Communication Service for U-Sports)

  • 허명선;이종덕;김재오;양윤석;안현식;정구민
    • 융합신호처리학회논문지
    • /
    • 제9권3호
    • /
    • pp.208-212
    • /
    • 2008
  • 본 논문에서는 휴대용 단말기와 Bluetooth를 이용하여 레져 스포츠를 즐기면서 음성통신을 할 수 있는 서비스 모델을 제안하고 이에 필요한 Hands-free 기기를 구현한다. 제안한 서비스 모델은 Bluetooth를 이용하여 음성 네트워크를 형성하고, 선점형 알고리듬을 이용하여 다수의 사용자들과 음성 공유를 할 수 있도록 한다. 제안하는 서비스 모델은 두 가지로 나눌 수 있다. 하나는 Hands-free 기기만을 이용하여 최대 4명의 사용자까지 음성통신을 하는 모델이고, 다른 하나는 휴대용 단말기를 마스터로 하여 최대 3명의 사용자까지 음성통신을 하는 모델이다. 두 번째 모델은 Scatternet과 Call Forwarding을 이용하여 음성통신 중에도 전화통화가 가능하도록 한다. Scatternet을 이용할 경우, 음성통신을 위한 피코넷과 휴대용 단말기와의 피코넷이 하나의 Scatternet을 형성한다. Call Forwading을 이용할 경우, 음성 네트워크를 형성하기 전에 각 휴대용 단말기의 정보를 교환하여 음성통신 중에 전화가 왔을 경우 마스터인 단말기를 통해 해당 사용자가 전화 통화를 할 수 있다.

  • PDF

선거 연설에서 대통령 후보자의 목소리 변화에 따른 유권자의 인지 변화에 대한 융합 연구 (A Interdisciplinary Study about Voice Change of the Presidential Candidate and Cognition Change of the Voters)

  • 함상우;박형우
    • 한국인터넷방송통신학회논문지
    • /
    • 제18권3호
    • /
    • pp.193-200
    • /
    • 2018
  • 공식 연설에서 연설자의 목소리는 청취자에게 다양한 영향을 미칠 수 있다. 목소리 특징에 따라 연설의 효과성과 효율성도 변화하게 된다. 대통령 선거에서도 후보자의 목소리 특징은 유권자들의 인지에 영향을 미치게 될 것이다. 그래서 우리는 보다 효과적인 후보자의 목소리가 어떤 것인지를 파악할 필요가 있을 것이다. 이 연구는 후보자가 목소리를 변화시킨다면 이 목소리에 대한 유권자의 인지도 변화할 것인지를 입증한다. 한 후보자의 변화된 목소리 특징에 따라 유권자의 인지가 변화한다면, 우리는 후보자에게 필요한 목소리가 어떤 것인지를 설명할 수 있게 될 것이다. 또한 효과적인 연설을 위해 필요한 목소리 변화 전략에 대해서도 논의할 수 있을 것이다. 우리는 이 연구를 통해 대통령 후보자의 목소리 변화에 따른 유권자의 인지 변화를 소리공학의 차원과 인지 차원으로 설명하여, 효과적인 연설을 위해 후보자에게 필요한 목소리 특징과 변화 전략을 설명한다.

압전 초음파 센서를 이용한 수중통신에 관한 연구 (A Study on the underwater communication system of ultrasonic transducer)

  • 김동현;우형관;황현석;진홍범;송준태
    • 대한전기학회:학술대회논문집
    • /
    • 대한전기학회 2000년도 하계학술대회 논문집 C
    • /
    • pp.1658-1660
    • /
    • 2000
  • Simple signs were usually exchanged as the means of underwater communications. As people recently, need more informations for underwater activities, necessities of underwater communication systems exchanging hunman voice are increased. The purpose of this paper is understanding the ordinary characteristics of underwater communication and investigating the necessary conditions for a good underwater communication system by making a basic communication module. The experiment is achieved by applying AM (Amplitude Modulation) which is mainly used for the underwater communication systems and using common ultrasonic transducers. Ultrasonic transducers usually have narrow bandwidth for transducing electrical energy to mechanical energy. For improvement of sound reconstruction, transducers need more bandwidth which covers voice's frequency range, and goof linearity characteristics in this frequency range. As underwater transmissions have many factors to distort signals. Amplitude Modulation is not a proper way for underwater communications. Using digital signal by sampling human voice should be a good way for this systems, because digital communication simplify transmitting signals.

  • PDF

CONTINUOUS DIGIT RECOGNITION FOR A REAL-TIME VOICE DIALING SYSTEM USING DISCRETE HIDDEN MARKOV MODELS

  • Choi, S.H.;Hong, H.J.;Lee, S.W.;Kim, H.K.;Oh, K.C.;Kim, K.C.;Lee, H.S.
    • 한국음향학회:학술대회논문집
    • /
    • 한국음향학회 1994년도 FIFTH WESTERN PACIFIC REGIONAL ACOUSTICS CONFERENCE SEOUL KOREA
    • /
    • pp.1027-1032
    • /
    • 1994
  • This paper introduces a interword modeling and a Viterbi search method for continuous speech recognition. We also describe a development of a real-time voice dialing system which can recognize around one hundred words and continuous digits in speaker independent mode. For continuous digit recognition, between-word units have been proposed to provide a more precise representation of word junctures. The best path in HMM is found by the Viterbi search algorithm, from which digit sequences are recognized. The simulation results show that a interword modeling using the context-dependent between-word units provide better recognition rates than a pause modeling using the context-independent pause unit. The voice dialing system is implemented on a DSP board with a telephone interface plugged in an IBM PC AT/486.

  • PDF

VoiceXML VU를 위한 Dialog 설계에 관한 연구 (A Study on Design of Dialog for VoiceXML VUI)

  • 장민석;예상후
    • 한국정보통신학회:학술대회논문집
    • /
    • 한국해양정보통신학회 2002년도 추계종합학술대회
    • /
    • pp.792-795
    • /
    • 2002
  • Nowadays the corporations related to Information & Communication field are researching more and more on VoiceXML development. VoiceXML can provide users with more efficient interface, VUI(VoiceXML User Interface) in web environment than the existing one. But more research and development for designing the Dialog have to be done for VUI to be used in efficient way. That was a main topic in "2002 VoiceXML Conference & Expo". According to the importance this paper presents VoiceXML Dialog designed for the purpose of its efficient use and the experimental result.

  • PDF

IEEE 802.15.4 표준에 적용을 위한 음성부호화 기술 (A Voice Coding Technique for Application to the IEEE 802.15.4 Standard)

  • 진진흥;강석근
    • 방송공학회논문지
    • /
    • 제13권5호
    • /
    • pp.612-621
    • /
    • 2008
  • 이용 가능한 데이터 영역과 전송전력 등 다양한 제한 요소들로 인하여 지그비 통신의 기술규격에는 음성통신에 대한 기준 사양이 포함되지 않았다. 본 논문에서는 지그비의 기반인 IEEE 802.15.4 표준에 적용하기 위한 음성부호화 기법이 제시된다. 여기서는 높은 압축율과 파형 복구능력이 우수한 파형부호기의 실현이 필수적이다. 이를 위하여 제시된 방법에서는 다단 이산 웨이블릿변환과 두 가지 펄스부호변조로 구성된 이진부호기가 사용된다. 이론적인 분석과 실내 무선 환경에서의 모의실험 결과 2단 웨이블릿변환을 적용한 경우가 압축율과 음성신호 복구능력 면에서 가장 적합한 것으로 판단된다. 직선전파경로 성분이 지배적인 경우 제시된 방법은 중간 정도의 신호 대 잡음비에서도 만족스러운 복구능력을 가진다. 따라서 제시된 음성부호화 방법은 향후 지그비를 이용한 음성통신의 표준 선정에 참고 가능한 기술이 될 수 있을 것으로 사료된다.

Convergence research on the speaker's voice perceived by listener, and suggestions for future research application

  • Hahm, SangWoo
    • International journal of advanced smart convergence
    • /
    • 제11권1호
    • /
    • pp.55-63
    • /
    • 2022
  • Although research on the leader's or speaker's voice has been continuously conducted, existing research has a single point of view. Sound analysis of voice characteristics has been studied from engineering perspectives, and leadership trait theory has been studied from a business perspective. Convergence studies on leader voice and member cognition are being attempted today. Convergence research on voice has a positive effect on refinement of voice analysis, diversification of voice use, and establishment of voice utilization strategy. This study explains the current flow of research on convergence between speaker's voice and listener's perception, and suggests a direction for the future development of voice fusion research. Furthermore, in connection with AI in the 4th industrial age, new attempts for voice research are sought. First, advances in AI focus on strategically generating the voices needed for individual situations. Second, the voice corrected in real time will support the leader and speaker to utilize the desired voice type. Third, voices through AI based on big data will affect the cognition, attitude and behavior of individual listeners who members, customers, and students in more diverse situations. The purpose and significance of this study is to suggest the way to research the leader's voice recognized by members, and to suggest a method that can be applied in various situations.