• Title/Summary/Keyword: Image to Speech

Search Result 188, Processing Time 0.03 seconds

Lip Shape Synthesis of the Korean Syllable for Human Interface (휴먼인터페이스를 위한 한글음절의 입모양합성)

  • 이용동;최창석;최갑석
    • The Journal of Korean Institute of Communications and Information Sciences
    • /
    • v.19 no.4
    • /
    • pp.614-623
    • /
    • 1994
  • Synthesizing speech and facial images is necessary for human interface that man and machine converse naturally as human do. The target of this paper is synthesizing the facial images. In synthesis of the facial images a three-dimensional (3-D) shape model of the face is used for realizating the facial expression variations and the lip shape variations. The various facial expressions and lip shapes harmonized with the syllables are synthesized by deforming the three-dimensional model on the basis of the facial muscular actions. Combications with the consonants and the vowels make 14.364 syllables. The vowels dominate most lip shapes but the consonants do a part of them. For determining the lip shapes, this paper investigates all the syllables and classifies the lip shapes pattern according to the vowels and the consonants. As the results, the lip shapes are classified into 8 patterns for the vowels and 2patterns for the consonants. In advance, the paper determines the synthesis rules for the classified lip shape patterns. This method permits us to obtain the natural facial image with the various facial expressions and lip shape patterns.

  • PDF

A Study on an Performance Improvement of FIR Digital Filter using Window Function Design Method (창함수 설계 기법을 이용한 FIR 디지털 필터의 성능 향상에 관한 연구)

  • Lee, Kyung-Hyo;Bae, Sang-Bum;Kim, Nam-Ho
    • Proceedings of the Korean Institute of Information and Commucation Sciences Conference
    • /
    • 2007.10a
    • /
    • pp.351-354
    • /
    • 2007
  • In recent years, digital processing techniques have been applied diversity of fields. Typical signal processing techniques are speech processing and image processing. And filters for the signal processing can be divided in FIR (finite impulse response) filter and IIR (infinite impulse response) filter. Compared with IIR filter, the FIR Filter has a defect of high-degree, but has a merit of stability and uses simply. Futhermore, FIR filter also has linear phase response characteristics, it is using in fields regarding wave information importantly. To FIR Filter design, the main issue is to remove the Gibbs phenomenon. Therefore, in this paper I was proposed a method using FIR digital filter applied a modified window function and the method was compared with conventional methods.

  • PDF

Performance Analyzer for Embedded AI Processor (내장형 인공지능 프로세서를 위한 성능 분석기)

  • Hwang, Dong Hyun;Yoon, Young Hyun;Han, Chang Yeop;Lee, Seung Eun
    • Journal of Internet Computing and Services
    • /
    • v.21 no.5
    • /
    • pp.149-157
    • /
    • 2020
  • Recently, as interest in artificial intelligence has increased, many studies have been conducted to implement AI processors. However, the AI processor requires functional verification as well as performance verification on whether the AI processor is suitable for the application. In this paper, We propose an AI processor performance analyzer that can verify the application performance and explore the limitations of the processor. By Using the performance analyzer, we explore the limitations of the AI processor and optimize the AI model to fit an AI processor in image recognition and speech recognition applications.

A Review on Deep Learning Platform for Artificial Intelligence (인공지능 딥러링 학습 플랫폼에 관한 선행연구 고찰)

  • Jin, Chan-Yong;Shin, Seong-Yoon;Nam, Soo-Tai
    • Proceedings of the Korean Institute of Information and Commucation Sciences Conference
    • /
    • 2019.05a
    • /
    • pp.169-170
    • /
    • 2019
  • Lately, as artificial intelligence becomes a source of global competitiveness, the government is strategically fostering artificial intelligence that is the base technology of future new industries such as autonomous vehicles, drones, and robots. Domestic artificial intelligence research and services have been launched mainly in Naver and Kakao, but their size and level are weak compared to overseas. Recently, deep learning has been conducted in recent years while recording innovative performance in various pattern recognition fields including speech recognition and image recognition. In addition, deep running has attracted great interest from industry since its inception, and global information technology companies such as Google, Microsoft, and Samsung have successfully applied deep learning technology to commercial products and are continuing research and development. Therefore, we will look at artificial intelligence which is attracting attention based on previous research.

  • PDF

Object Tracking Method using Deep Learning and Kalman Filter (딥 러닝 및 칼만 필터를 이용한 객체 추적 방법)

  • Kim, Gicheol;Son, Sohee;Kim, Minseop;Jeon, Jinwoo;Lee, Injae;Cha, Jihun;Choi, Haechul
    • Journal of Broadcast Engineering
    • /
    • v.24 no.3
    • /
    • pp.495-505
    • /
    • 2019
  • Typical algorithms of deep learning include CNN(Convolutional Neural Networks), which are mainly used for image recognition, and RNN(Recurrent Neural Networks), which are used mainly for speech recognition and natural language processing. Among them, CNN is able to learn from filters that generate feature maps with algorithms that automatically learn features from data, making it mainstream with excellent performance in image recognition. Since then, various algorithms such as R-CNN and others have appeared in object detection to improve performance of CNN, and algorithms such as YOLO(You Only Look Once) and SSD(Single Shot Multi-box Detector) have been proposed recently. However, since these deep learning-based detection algorithms determine the success of the detection in the still images, stable object tracking and detection in the video requires separate tracking capabilities. Therefore, this paper proposes a method of combining Kalman filters into deep learning-based detection networks for improved object tracking and detection performance in the video. The detection network used YOLO v2, which is capable of real-time processing, and the proposed method resulted in 7.7% IoU performance improvement over the existing YOLO v2 network and 20 fps processing speed in FHD images.

The Study on Body Language in Animation as Functional Aspects -Focusing on Mulan, Beauty and the beast, Aladdin, Sinbad- (기능론적 관점에서 본 애니메이션의 신체언어 연구 - 뮬란, 미녀와 야수, 알라딘, 신밧드를 중심으로-)

  • Chung, Mi-Ghang;Lee, Mi-Young;Kim, Sung-Hee;Kim, Jae-Ho
    • Archives of design research
    • /
    • v.20 no.1 s.69
    • /
    • pp.55-64
    • /
    • 2007
  • Non-verbal communications are important because they support and replace verbal communication. Body language of various non-verbal communications is the communication using the body. In animation, expression of body language is very important because characters play an important role in communicating the scenario. Animation has a dual communication structure, different from general communication. One is the communication between the speaker character and the hearer character, the other is the image and the audience, which includes the communication between the speaker character and the hearer character. In this study, we divide the body language from the characters into the discourse-in act and discourse-out act according to this dual structure and classify it into adaptors, emblem, illustrator, regulator, affect display by a functional approach method. Especially, the illustrator is subdivided into pragmatic speech act. Finally, this study analyzes the features of body language in animation and represents animation character's body language for an effective expression of the communications in animation.

  • PDF

A Study on the Development Trend of Artificial Intelligence Using Text Mining Technique: Focused on Open Source Software Projects on Github (텍스트 마이닝 기법을 활용한 인공지능 기술개발 동향 분석 연구: 깃허브 상의 오픈 소스 소프트웨어 프로젝트를 대상으로)

  • Chong, JiSeon;Kim, Dongsung;Lee, Hong Joo;Kim, Jong Woo
    • Journal of Intelligence and Information Systems
    • /
    • v.25 no.1
    • /
    • pp.1-19
    • /
    • 2019
  • Artificial intelligence (AI) is one of the main driving forces leading the Fourth Industrial Revolution. The technologies associated with AI have already shown superior abilities that are equal to or better than people in many fields including image and speech recognition. Particularly, many efforts have been actively given to identify the current technology trends and analyze development directions of it, because AI technologies can be utilized in a wide range of fields including medical, financial, manufacturing, service, and education fields. Major platforms that can develop complex AI algorithms for learning, reasoning, and recognition have been open to the public as open source projects. As a result, technologies and services that utilize them have increased rapidly. It has been confirmed as one of the major reasons for the fast development of AI technologies. Additionally, the spread of the technology is greatly in debt to open source software, developed by major global companies, supporting natural language recognition, speech recognition, and image recognition. Therefore, this study aimed to identify the practical trend of AI technology development by analyzing OSS projects associated with AI, which have been developed by the online collaboration of many parties. This study searched and collected a list of major projects related to AI, which were generated from 2000 to July 2018 on Github. This study confirmed the development trends of major technologies in detail by applying text mining technique targeting topic information, which indicates the characteristics of the collected projects and technical fields. The results of the analysis showed that the number of software development projects by year was less than 100 projects per year until 2013. However, it increased to 229 projects in 2014 and 597 projects in 2015. Particularly, the number of open source projects related to AI increased rapidly in 2016 (2,559 OSS projects). It was confirmed that the number of projects initiated in 2017 was 14,213, which is almost four-folds of the number of total projects generated from 2009 to 2016 (3,555 projects). The number of projects initiated from Jan to Jul 2018 was 8,737. The development trend of AI-related technologies was evaluated by dividing the study period into three phases. The appearance frequency of topics indicate the technology trends of AI-related OSS projects. The results showed that the natural language processing technology has continued to be at the top in all years. It implied that OSS had been developed continuously. Until 2015, Python, C ++, and Java, programming languages, were listed as the top ten frequently appeared topics. However, after 2016, programming languages other than Python disappeared from the top ten topics. Instead of them, platforms supporting the development of AI algorithms, such as TensorFlow and Keras, are showing high appearance frequency. Additionally, reinforcement learning algorithms and convolutional neural networks, which have been used in various fields, were frequently appeared topics. The results of topic network analysis showed that the most important topics of degree centrality were similar to those of appearance frequency. The main difference was that visualization and medical imaging topics were found at the top of the list, although they were not in the top of the list from 2009 to 2012. The results indicated that OSS was developed in the medical field in order to utilize the AI technology. Moreover, although the computer vision was in the top 10 of the appearance frequency list from 2013 to 2015, they were not in the top 10 of the degree centrality. The topics at the top of the degree centrality list were similar to those at the top of the appearance frequency list. It was found that the ranks of the composite neural network and reinforcement learning were changed slightly. The trend of technology development was examined using the appearance frequency of topics and degree centrality. The results showed that machine learning revealed the highest frequency and the highest degree centrality in all years. Moreover, it is noteworthy that, although the deep learning topic showed a low frequency and a low degree centrality between 2009 and 2012, their ranks abruptly increased between 2013 and 2015. It was confirmed that in recent years both technologies had high appearance frequency and degree centrality. TensorFlow first appeared during the phase of 2013-2015, and the appearance frequency and degree centrality of it soared between 2016 and 2018 to be at the top of the lists after deep learning, python. Computer vision and reinforcement learning did not show an abrupt increase or decrease, and they had relatively low appearance frequency and degree centrality compared with the above-mentioned topics. Based on these analysis results, it is possible to identify the fields in which AI technologies are actively developed. The results of this study can be used as a baseline dataset for more empirical analysis on future technology trends that can be converged.

Study on development of the remote control door lock system including speeker verification function in real time (화자 인증 기능이 포함된 실시간 원격 도어락 제어 시스템 개발에 관한 연구)

  • Kwon, Soon-Ryang
    • Journal of the Korean Institute of Intelligent Systems
    • /
    • v.15 no.6
    • /
    • pp.714-719
    • /
    • 2005
  • The paper attempts to design and implement the system which can remotely check visitors' speech or Image by a mobile phone. This system is designed to recognize who a visitor is through the automatic calling service, not through a short message, via the mobile phone, even when the home owner is outside. In general, door locks are controlled through the home Server, but it is more effective to control door locks by using DTMF signal from a real-time point of view. The technology suggested in this paper makes it possible to communicate between the visiter and the home owner by making a phone call to tile home owner's mobile phone automatically when the visiter visits the house even if the home owner is outside, and if necessary, it allows for the home owner to control the door lock remotely. Thanks to the system, the home owner is not restricted by time or space for checking the visitor's identification and controlling the door lock. In addition, the security system is improved by changing from the existing password form to the combination of password and speaker verification lot the verification procedure required for controlling the door lock and setting the environment under consideration of any disadvantages which may occur when the mobile Phone is lost. Also, any existing problems such as reconnection to tile network for controlling tile door lock are solved by controlling the door lock in real time by use of DTMF signal while on the phone.

ORAL REHABILITATION IN ECTODERMAL DYSPLASIA WITH OLIGODONTIA

  • Kim, Ryoung;Choi, Yeong-Chul;Lee, Keung-Ho
    • Journal of the korean academy of Pediatric Dentistry
    • /
    • v.26 no.4
    • /
    • pp.636-643
    • /
    • 1999
  • Ectodermal dysplasia is a genetic birth defect in which at least abnormally develop two structures derived from the ectoderm. It is usually inherited in autosomal dominant or autosomal recessive pattern. Oral manifestations are oligodontia, anodontia, dysmorphic teeth(conical shape), decreased occlusal vertical dimension and alveolar bone. Extraoral signs may include decreased or absent sweat glands, sparse and fine hair, saddle nose, hearing loss and decreased production of body fluids including saliva. Most affected children require extensive dental treatment to restore their appearance and help the development of a positive self image. The patient's overclosed profile was due to a decreased vertical dimension. The use of overdenture is to preserve erupted teeth, to accomodate the newly constructed occlusal plane, to improve retention and stability of denture and to maintain the remaining alveolar bone. The restoration of vertical dimension improved the child's speech, swallowing, and eating. Growth continue until the age of approximately 18. As child grows, replacement dentures will have to be fabricated primarily to accomodate increasing vertical dimension and changing dentition. Implants may be indicated later if the alveolar bone is adequate. Periodic recall visits are advised, to monitor the dentures during periods of growth and development, and eruption of the permanent teeth.

  • PDF

The Red Book : the East and West Issues - With Special Reference to Lao Zi, Dao De Jing - (『붉은 책』 -동서(東西)의 문제, 특히 노자(老子) 도덕경과 관련하여)

  • Bou-Yong Rhi
    • Sim-seong Yeon-gu
    • /
    • v.30 no.1
    • /
    • pp.1-30
    • /
    • 2015
  • The Red Book contains C.G. Jung's insightful comment on life suggesting the thoughts of the Eastern philosophers, particularly that of Lao Zi. The author reviewed Jung's commentaries in the Red Book in comparison with Lao Zi Dao De Jing. Jung's comments on the image of despised Surpreme Being, on the Simplicity, the attitudes of 'the Spirit of the Depth' toward intellectual knowledges and speech, toward the small and the mockered one resemble to what Lao Zi spoke on Dao in his Dao De Jing. The 'good and evil' are regarded by both C.G. Jung and Lao Zi as two poles in one total psyche. The favorite words of Lao Zi : 'emptiness' or 'empty' are frequently mentioned in the Red Book. The investigation in this concern revealed that C.G. Jung, contrary to Lao Zi has applied the word 'emptiness' mostly as the opposite to the fullness. C.G. Jung's way of encountering with the darkest side of soul in the Hell and his bold confrontation to the authoritative person such as Philemon, above all, the intensity of his experiences in the state of the utmost tension between the opposites are extraordinarily impressive and somehow strange when regarded from traditional eastern way of behavior such as I-You relationship and the patterns of emotional life based on Confucian tradition. Confucius never talked about the prodigies, feasts of strength and disorders or spirits. Lao Zi never mentioned infernal cruelty. Noteworthy is however, both have enough experienced the cruelty of life and conflicts in the reality and what they spoke was not a process in search for solution but the final proposals for the solution of human agony. C.G. Jung was, like great shaman in central and East-Asia forced to go through inferno in his unique way and from these experiences obtained the insight which resembles not only to Lao Zi but also to wisdoms from the western philosophies and also from the Christianity.