• Title/Summary/Keyword: computer-based speech analysis system

Search Result 35, Processing Time 0.027 seconds

Cognitive strategies-based Speaking Training system for elementary English vocabulary (초등 영어 어휘 습득을 위한 인지전략 기반의 Speaking Training System 설계 및 구현)

  • Seo, Byeong-Tae;Yang, Hae-Sool
    • Journal of Digital Convergence
    • /
    • v.13 no.4
    • /
    • pp.191-203
    • /
    • 2015
  • In foreign language, vocabulary is the most essential and fundamental elements. Traditional language learning methods that are to understand and to memorize the English contents can only be obvious limitations. In this paper, we proposed the speaking-centered learning methods based on cognitive strategies and speech recognition considering the learner characteristics. We have designed and implemented the cognitive strategy-based speaking training system for acquisition elementary English vocabulary. We were divided into control group and the experimental group and applied to the system to analyze the learning effect. The result of Analysis, the proposed system is increased motivation and achievement of learners. In addition, the proposed system is improved an academic learning participation, Project accomplish, self-interesting and leadership skills. Through this study, we expect that students improve the ability of practical skills in speaking English.

Detection of Dangerous Things to Infants through Image Analysis and Deep Learning (이미지 분석과 딥 러닝을 통한 영유아 위험물 탐지)

  • Kim, Hui-Joon;Park, Kil-Seop;Seo, Yeong-Hak;Kim, Kyung-Sup
    • Proceedings of the Korea Information Processing Society Conference
    • /
    • 2017.11a
    • /
    • pp.845-848
    • /
    • 2017
  • In this paper, we implemented a system to detect dangerous situations by recognizing the dangerous elements for infants by reading 2D images of children's houses, parks, playgrounds, and living rooms where infants are present through Faster R-CNN. We have implemented a detection model based on data that can be easily obtained from real life. Currently, machine learning is commercialized based on speech recognition and behavior data. However, this model can be applied to various service fields Respectively.

Prototype Design and Development of Online Recruitment System Based on Social Media and Video Interview Analysis (소셜미디어 및 면접 영상 분석 기반 온라인 채용지원시스템 프로토타입 설계 및 구현)

  • Cho, Jinhyung;Kang, Hwansoo;Yoo, Woochang;Park, Kyutae
    • Journal of Digital Convergence
    • /
    • v.19 no.3
    • /
    • pp.203-209
    • /
    • 2021
  • In this study, a prototype design model was proposed for developing an online recruitment system through multi-dimensional data crawling and social media analysis, and validates text information and video interview in job application process. This study includes a comparative analysis process through text mining to verify the authenticity of job application paperwork and to effectively hire and allocate workers based on the potential job capability. Based on the prototype system, we conducted performance tests and analyzed the result for key performance indicators such as text mining accuracy and interview STT(speech to text) function recognition rate. If commercialized based on design specifications and prototype development results derived from this study, it may be expected to be utilized as the intelligent online recruitment system technology required in the public and private recruitment markets in the future.

Emotion Recognition and Expression System of User using Multi-Modal Sensor Fusion Algorithm (다중 센서 융합 알고리즘을 이용한 사용자의 감정 인식 및 표현 시스템)

  • Yeom, Hong-Gi;Joo, Jong-Tae;Sim, Kwee-Bo
    • Journal of the Korean Institute of Intelligent Systems
    • /
    • v.18 no.1
    • /
    • pp.20-26
    • /
    • 2008
  • As they have more and more intelligence robots or computers these days, so the interaction between intelligence robot(computer) - human is getting more and more important also the emotion recognition and expression are indispensable for interaction between intelligence robot(computer) - human. In this paper, firstly we extract emotional features at speech signal and facial image. Secondly we apply both BL(Bayesian Learning) and PCA(Principal Component Analysis), lastly we classify five emotions patterns(normal, happy, anger, surprise and sad) also, we experiment with decision fusion and feature fusion to enhance emotion recognition rate. The decision fusion method experiment on emotion recognition that result values of each recognition system apply Fuzzy membership function and the feature fusion method selects superior features through SFS(Sequential Forward Selection) method and superior features are applied to Neural Networks based on MLP(Multi Layer Perceptron) for classifying five emotions patterns. and recognized result apply to 2D facial shape for express emotion.

Developing a New Algorithm for Conversational Agent to Detect Recognition Error and Neologism Meaning: Utilizing Korean Syllable-based Word Similarity (대화형 에이전트 인식오류 및 신조어 탐지를 위한 알고리즘 개발: 한글 음절 분리 기반의 단어 유사도 활용)

  • Jung-Won Lee;Il Im
    • Journal of Intelligence and Information Systems
    • /
    • v.29 no.3
    • /
    • pp.267-286
    • /
    • 2023
  • The conversational agents such as AI speakers utilize voice conversation for human-computer interaction. Voice recognition errors often occur in conversational situations. Recognition errors in user utterance records can be categorized into two types. The first type is misrecognition errors, where the agent fails to recognize the user's speech entirely. The second type is misinterpretation errors, where the user's speech is recognized and services are provided, but the interpretation differs from the user's intention. Among these, misinterpretation errors require separate error detection as they are recorded as successful service interactions. In this study, various text separation methods were applied to detect misinterpretation. For each of these text separation methods, the similarity of consecutive speech pairs using word embedding and document embedding techniques, which convert words and documents into vectors. This approach goes beyond simple word-based similarity calculation to explore a new method for detecting misinterpretation errors. The research method involved utilizing real user utterance records to train and develop a detection model by applying patterns of misinterpretation error causes. The results revealed that the most significant analysis result was obtained through initial consonant extraction for detecting misinterpretation errors caused by the use of unregistered neologisms. Through comparison with other separation methods, different error types could be observed. This study has two main implications. First, for misinterpretation errors that are difficult to detect due to lack of recognition, the study proposed diverse text separation methods and found a novel method that improved performance remarkably. Second, if this is applied to conversational agents or voice recognition services requiring neologism detection, patterns of errors occurring from the voice recognition stage can be specified. The study proposed and verified that even if not categorized as errors, services can be provided according to user-desired results.

Design and Implementation of YouTube-based Educational Video Recommendation System

  • Kim, Young Kook;Kim, Myung Ho
    • Journal of the Korea Society of Computer and Information
    • /
    • v.27 no.5
    • /
    • pp.37-45
    • /
    • 2022
  • As of 2020, about 500 hours of videos are uploaded to YouTube, a representative online video platform, per minute. As the number of users acquiring information through various uploaded videos is increasing, online video platforms are making efforts to provide better recommendation services. The currently used recommendation service recommends videos to users based on the user's viewing history, which is not a good way to recommend videos that deal with specific purposes and interests, such as educational videos. The recent recommendation system utilizes not only the user's viewing history but also the content features of the item. In this paper, we extract the content features of educational video for educational video recommendation based on YouTube, design a recommendation system using it, and implement it as a web application. By examining the satisfaction of users, recommendataion performance and convenience performance are shown as 85.36% and 87.80%.

A study on user defined spoken wake-up word recognition system using deep neural network-hidden Markov model hybrid model (Deep neural network-hidden Markov model 하이브리드 구조의 모델을 사용한 사용자 정의 기동어 인식 시스템에 관한 연구)

  • Yoon, Ki-mu;Kim, Wooil
    • The Journal of the Acoustical Society of Korea
    • /
    • v.39 no.2
    • /
    • pp.131-136
    • /
    • 2020
  • Wake Up Word (WUW) is a short utterance used to convert speech recognizer to recognition mode. The WUW defined by the user who actually use the speech recognizer is called user-defined WUW. In this paper, to recognize user-defined WUW, we construct traditional Gaussian Mixture Model-Hidden Markov Model (GMM-HMM), Linear Discriminant Analysis (LDA)-GMM-HMM and LDA-Deep Neural Network (DNN)-HMM based system and compare their performances. Also, to improve recognition accuracy of the WUW system, a threshold method is applied to each model, which significantly reduces the error rate of the WUW recognition and the rejection failure rate of non-WUW simultaneously. For LDA-DNN-HMM system, when the WUW error rate is 9.84 %, the rejection failure rate of non-WUW is 0.0058 %, which is about 4.82 times lower than the LDA-GMM-HMM system. These results demonstrate that LDA-DNN-HMM model developed in this paper proves to be highly effective for constructing user-defined WUW recognition system.

On the Development of Animated Tutoring Dialogue Agent for Elementary School Science Learning (초등과학 수업을 위한 애니메이션 기반 튜터링 다이얼로그 에이전트 개발)

  • Jeong, Sang-Mok;Han, Byeong-Rae;Song, Gi-Sang
    • Journal of The Korean Association of Information Education
    • /
    • v.9 no.4
    • /
    • pp.673-684
    • /
    • 2005
  • In this research, we have developed a "computer tutor" that mimics the human tutor with animated tutoring dialog agent and the agent was integrated to teaching-learning material for elementary science subject. The developed system is a natural language based teaching-learning system using one-to-one dialogue. The developed pedagogical dialogue teaching-learning system analysis student's answer then provides appropriate answer or questions after comparing the student's answer with elementary school level achievement. When the agent gives either question or answer it uses the TTS(Text-to-Speech) function. Also the agent has an animated human tutor face for providing more human like feedback. The developed dialogue interface has been applied to 64 6th grade students. The test results show that the test group's average score is higher than the control group by 10.797. This shows that unlike conventional web courseware, our approach that "ask-answer" process and the animated character, which has human tutor's emotional expression, attracts students and helps to immerse to the courseware.

  • PDF

Development of AI-based Real Time Agent Advisor System on Call Center - Focused on N Bank Call Center (AI기반 콜센터 실시간 상담 도우미 시스템 개발 - N은행 콜센터 사례를 중심으로)

  • Ryu, Ki-Dong;Park, Jong-Pil;Kim, Young-min;Lee, Dong-Hoon;Kim, Woo-Je
    • Journal of the Korea Academia-Industrial cooperation Society
    • /
    • v.20 no.2
    • /
    • pp.750-762
    • /
    • 2019
  • The importance of the call center as a contact point for the enterprise is growing. However, call centers have difficulty with their operating agents due to the agents' lack of knowledge and owing to frequent agent turnover due to downturns in the business, which causes deterioration in the quality of customer service. Therefore, through an N-bank call center case study, we developed a system to reduce the burden of keeping up business knowledge and to improve customer service quality. It is a "real-time agent advisor" system that provides agents with answers to customer questions in real time by combining AI technology for speech recognition, natural language processing, and questions & answers for existing call center information systems, such as a private branch exchange (PBX) and computer telephony integration (CTI). As a result of the case study, we confirmed that the speech recognition system for real-time call analysis and the corpus construction method improves the natural speech processing performance of the query response system. Especially with name entity recognition (NER), the accuracy of the corpus learning improved by 31%. Also, after applying the agent advisor system, the positive feedback rate of agents about the answers from the agent advisor was 93.1%, which proved the system is helpful to the agents.

A Corpus-based Analysis of EFL Learners' Use of Hedges in Cross-cultural Communication

  • Min, Su-Jung
    • English Language & Literature Teaching
    • /
    • v.16 no.4
    • /
    • pp.91-106
    • /
    • 2010
  • This study examines the use of hedges in cross-cultural communication between EFL learners in an e-learning environment. The study analyzes the use of hedges in a corpus of an interactive web with a bulletin board system through which college students of English at Japanese and Korean universities interacted with each other discussing the topics of local and global issues. It compares the use of hedges in the students' corpus to that of a native English speakers' corpus. The result shows that EFL learners tend to use relatively smaller number of hedges than the native speakers in terms of the frequencies of the total tokens. It further reveals that the learners' overuse of a single versatile high-frequency hedging item, I think, results in relative underuse of other hedging devices. This indicates that due to their small repertoire of hedges, EFL learners' overuse of a limited number of hedging items may cause their speech or writing to become less competent. Based on the result and interviews with the learners, the study also argues that hedging should be understood in its social contexts and should not be understood just as a lack of conviction or a mark of low proficiency. Suggestions were made for using computer corpora in understanding EFL learners' language difficulties and helping them develop communicative and pragmatic competence.

  • PDF