• 제목/요약/키워드: 음성인식률

Search Result 549, Processing Time 0.027 seconds

An Implementation of Cellular Phone - Automobile Interface System (휴대폰 - 자동차 인터페이스 시스템 구현)

  • Kim, Dong-Gyu;Yang, Jung-Su;Kim, Jung-Hyun;Roh, Yong-Wan;Chung, Kwang-Woo;Hong, Kwang-Seok
    • Proceedings of the Korea Institute of Convergence Signal Processing
    • /
    • 2005.11a
    • /
    • pp.268-273
    • /
    • 2005
  • 기존의 텔레메틱스 서비스는 교통정보안내 및 생활정보 등의 차량 내에서의 서비스 위주로 제공한다. 그러나 차량 밖에 사용자가 있을 때 사용자가 차량의 상태 및 기타 정보를 확인하고 제어 하는 서비스의 초기단계로 전화연결 서비스(원격진단, 원격 문열림, 도난차량 추적, 내차위치확인)가 있다. 이 서비스는 중앙센터의 안내원 연결을 통하여 사용자에게 차량의 상태 및 제어를 할 수 있는 시스템이다. 본 연구는 안내원 연결을 통하지 않고 VXML을 이용하여 인간과 자동차의 인터페이스를 제공한다. VXML기술은 전화 사용자의 필요한 정보의 요구를 음성인식하며 차량으로부터의 정보는 음성합성에 의해 말로써 사용자에게 전달된다. 구현된 기술은 주차한 차량의 상태를 사용자가 궁금할 때 언제나 직접 가보지 않고 전화로 확인가능하게 함으로써 사용자의 시간적 손실을 줄인다. 또한, 주차한 차량의 훼손이나 침입 여부를 즉시 SMS로 사용자에게 알려 대처 하도록 하였다. 10인의 피실험자가 각각 10번씩 모두 100번을 실험한 결과 응답시간이 3초 이내로 나타났고 100%의 동작 성공률을 보였다. 자동차 이용자의 차량 정보 및 서비스 이용에 편리성을 제공하고, 정보 단말기와 전화 사용자 간에 VXML을 이용한 인터페이스 기술 확보 및 무한한 잠재력을 가지고 있는 지능형 자동차의 시장에 기여할 것으로 예상된다.

  • PDF

Prediction of Domain Action Using a Neural Network (신경망을 이용한 영역 행위 예측)

  • Lee, Hyun-Jung;Seo, Jung-Yun;Kim, Hark-Soo
    • Korean Journal of Cognitive Science
    • /
    • v.18 no.2
    • /
    • pp.179-191
    • /
    • 2007
  • In a goal-oriented dialogue, spoken' intentions can be represented by domain actions that consist of pairs of a speech art and a concept sequence. The domain action prediction of user's utterance is useful to correct some errors that occur in a speech recognition process, and the domain action prediction of system's utterance is useful to generate flexible responses. In this paper, we propose a model to predict a domain action of the next utterance using a neural network. The proposed model predicts the next domain action by using a dialogue history vector and a current domain action as inputs of the neural network. In the experiment, the proposed model showed the precision of 80.02% in speech act prediction and the precision of 82.09% in concept sequence prediction.

  • PDF

The Remote HMI System Control Using the Transformed Successive State Splitting Algorithm (변형된 상태분할 알고리즘을 이용한 원격 HMI 시스템 제어)

  • Lee, Jong-Woock;Lee, Jeong-Bae;Hwang, Yeong-Seop;Nam, Ji-Eun
    • Convergence Security Journal
    • /
    • v.8 no.4
    • /
    • pp.135-143
    • /
    • 2008
  • Currently, The HMI system is being used on the network is limited in the ability. In this paper, an Industrial HMI applied the transformed state splitting algorithm. this study suggests by applying a transformed the Successive state splitting algorithm, for the modeling in the questions of the expected data. So, you can save time and reliable and precise as high as 98.15 percent repregented recognition rate. HMI system applied to the voice of industrial equipment the man can not act directly in the industry environment was able to drive devices. Optimize the performance of the engine was the voice of HMI system.

  • PDF

Analysis of IT Technology through the Trends in Home Video Game Console (가정용 게임기 동향을 통해 본 IT 기술 분석)

  • Bae, Jung-Min;Bae, Yu-Mi;Jung, Sung-Jae;Jang, Rea-Young;Sung, Kyung
    • Proceedings of the Korean Institute of Information and Commucation Sciences Conference
    • /
    • 2014.05a
    • /
    • pp.675-678
    • /
    • 2014
  • One time, Home video game console's penetration was as comparable to the personal computer's penetration, growth has slowed since the advent of smartphones, tablets and moblie devices. But game console actively introducing new IT technologies not available in the pc games and mobile games, still keeping a firm position in the relevent market. In this paper Home video game console's history, contemporary trends, and learn about trends in the company, New IT technologies applied to gaming was analyzed. Home video console market become the arena of New IT technologies according to the introduction of New IT technologies such as gesture recognition technology, speech recognition technology, media facade technology, virtual reality technology.

  • PDF

Detecting Adversarial Example Using Ensemble Method on Deep Neural Network (딥뉴럴네트워크에서의 적대적 샘플에 관한 앙상블 방어 연구)

  • Kwon, Hyun;Yoon, Joonhyeok;Kim, Junseob;Park, Sangjun;Kim, Yongchul
    • Convergence Security Journal
    • /
    • v.21 no.2
    • /
    • pp.57-66
    • /
    • 2021
  • Deep neural networks (DNNs) provide excellent performance for image, speech, and pattern recognition. However, DNNs sometimes misrecognize certain adversarial examples. An adversarial example is a sample that adds optimized noise to the original data, which makes the DNN erroneously misclassified, although there is nothing wrong with the human eye. Therefore studies on defense against adversarial example attacks are required. In this paper, we have experimentally analyzed the success rate of detection for adversarial examples by adjusting various parameters. The performance of the ensemble defense method was analyzed using fast gradient sign method, DeepFool method, Carlini & Wanger method, which are adversarial example attack methods. Moreover, we used MNIST as experimental data and Tensorflow as a machine learning library. As an experimental method, we carried out performance analysis based on three adversarial example attack methods, threshold, number of models, and random noise. As a result, when there were 7 models and a threshold of 1, the detection rate for adversarial example is 98.3%, and the accuracy of 99.2% of the original sample is maintained.

A Statistical Prediction Model of Speakers' Intentions in a Goal-Oriented Dialogue (목적지향 대화에서 화자 의도의 통계적 예측 모델)

  • Kim, Dong-Hyun;Kim, Hark-Soo;Seo, Jung-Yun
    • Journal of KIISE:Software and Applications
    • /
    • v.35 no.9
    • /
    • pp.554-561
    • /
    • 2008
  • Prediction technique of user's intention can be used as a post-processing method for reducing the search space of an automatic speech recognizer. Prediction technique of system's intention can be used as a pre-processing method for generating a flexible sentence. To satisfy these practical needs, we propose a statistical model to predict speakers' intentions that are generalized into pairs of a speech act and a concept sequence. Contrary to the previous model using simple n-gram statistic of speech acts, the proposed model represents a dialogue history of a current utterance to a feature set with various linguistic levels (i.e. n-grams of speech act and a concept sequence pairs, clue words, and state information of a domain frame). Then, the proposed model predicts the intention of the next utterance by using the feature set as inputs of CRFs (Conditional Random Fields). In the experiment in a schedule management domain, The proposed model showed the precision of 76.25% on prediction of user's speech act and the precision of 64.21% on prediction of user's concept sequence. The proposed model also showed the precision of 88.11% on prediction of system's speech act and the Precision of 87.19% on prediction of system's concept sequence. In addition, the proposed model showed 29.32% higher average precision than the previous model.

A Study on Protection of Iris and fingerprint Data Based on Digital Watermarking in Mid-Frequency Band (중간 주파수 영역에서의 디지털 워터마킹 기법에 의한 홍채 및 지문 데이터 보호 연구)

  • Jeong, Dae-Sik;Park, Kang-Ryoung
    • Journal of Korea Multimedia Society
    • /
    • v.8 no.9
    • /
    • pp.1227-1238
    • /
    • 2005
  • Recently, with the advance of network and internet technologies, it is appeared the Problem that the digital contents such as image, voice and video are illegally pirated and distributed. To protect the copyright of the digital contents, the digital watermarking technology of inserting the provider's information into the contents has been widely used. In this paper, we propose the method of applying the digital watermarking into biometric information such as fingerprint and iris in order to prevent the problem caused by steal and misuse. For that, we propose the method of inserting watermark in frequency domain, compare the recognition performance before and aster watermark inserting. Also, we experiment the robustness of proposed method against blurring attack, which is conventionally taken on biometrics data. Experimental results show that our proposed method can be used for protecting iris and fingerprint data, efficiently.

  • PDF

A SVM-based Spam Filtering System for Short Message Service (SMS) (휴대폰 SMS를 위한 SVM 기반의 스팸 필터링 시스템)

  • Joe, In-Whee;Shim, Hye-Taek
    • The Journal of Korean Institute of Communications and Information Sciences
    • /
    • v.34 no.9B
    • /
    • pp.908-913
    • /
    • 2009
  • Mobile phones became important household appliance that cannot be without in our daily lives. And the short messaging service (SMS) in these mobile phones is 1.5 to 2 times more than the voice service. However, the spam filtering functions installed in mobile phones take a method to receive specific number patterns or words and recognize spam messages when those numbers or words are present. However, this method cannot properly filters various types of spam messages currently dispatched. This paper proposes a more powerful and more adaptive spam filtering system using SVM and thesaurus. The system went through a process of isolating words from sample data through pro-processing device and integrating meanings of isolated words using a thesaurus. Then it generated characteristics of integrated words through the chi-square statistics and studied the characteristics. The proposed system is realized in a Window environment and the performance is confirmed through experiments.

OnDot: Braille Training System for the Blind (시각장애인을 위한 점자 교육 시스템)

  • Kim, Hak-Jin;Moon, Jun-Hyeok;Song, Min-Uk;Lee, Se-Min;Kong, Ki-sok
    • The Journal of the Institute of Internet, Broadcasting and Communication
    • /
    • v.20 no.6
    • /
    • pp.41-50
    • /
    • 2020
  • This paper deals with the Braille Education System which complements the shortcomings of the existing Braille Learning Products. An application dedicated to the blind is configured to perform full functions through touch gestures and voice guidance for user convenience. Braille kit is produced for educational purposes through Arduino and 3D printing. The system supports the following functions. First, the learning of the most basic braille, such as initial consonants, final consonant, vowels, abbreviations, etc. Second, the ability to check learned braille by solving step quizzes. Third, translation of braille. Through the experiment, the recognition rate of touch gestures and the accuracy of braille expression were confirmed, and in case of translation, the translation was done as intended. The system allows blind people to learn braille efficiently.

Usefulness of Chlorine Dioxide to Airborne Bacteria at a Hospital Using Biological Information (생물학적 정보를 활용한 병원에서 존재하는 공기중 부유 세균에 대한 이산화염소의 유용성)

  • Jung, Suk-Yul
    • Journal of Internet of Things and Convergence
    • /
    • v.6 no.2
    • /
    • pp.19-24
    • /
    • 2020
  • In the present study, using biological information of bacteria and biochemical information of chlorine dioxide gas, Gram-positive bacteria, e.g., Alloiococcus otitis, Erysipelothrix rhusiopathiae, Staphylococcus caprae, Staphylococcus lentus, and gram-negative bacteria, e.g., Acinetobacter baumannii complex, Aeromonas salmonicida, Brucella melitensis, Oligella ureolytica were used whether a plastic kit to release ClO2 gas could inhibit their growth. Overall, chlorine dioxide gas showed about 99% inhibition of bacterial growth, with less than 10 CFU. However, it was found that Gram positive Alloiococcus otitis and Gram negative Aeromonas salmonicida had more than about 50 CFU. When comparing the results of experiments with several bacteria, it suggested that the concentration of chlorine dioxide gas would be at least 10 ppm to 400 ppm for the bacterial inhibition. The results of this study could be used as basic data to evaluate the clinical usefulness of chlorine dioxide gas. If this study helps with prior knowledge to help clinicians to recognize and prevent the presence of micro-organisms that cause infections in hospitals, it would be helpful for activities such as patient care as a convergence field. In the future, it is considered that the research results will be the basis for rapidly inhibiting the microbes infected with patients by utilizing data of the information of the microbes that are inhibited for chlorine dioxide gas.