Search | Korea Science

Trends of Hardware Accelerator for the Embedded Speech Recognition (내장형 음성인식기를 위한 전용 하드웨어가속기 기술개발 동향)

Kim, J.Y.;Kim, T.J.;Lee, J.H.;Eum, N.W.
- Electronics and Telecommunications Trends
- /
- v.29 no.4
- /
- pp.91-100
- /
- 2014
사람의 말소리를 문자로 변환하여 기기의 제어명령으로 활용하는 것이 음성인식 기술이다. 음성인식에 대한 기술개발 요구는 수십 년 전부터 있어 왔고, 꾸준히 제품화되고 있는 분야라 하겠다. 제품으로의 상용화가 가능한 알고리즘 및 데이터 처리체계는 HMM(Hidden Markov Model)이라는 수학적 모델링으로 정형화되어 있으며, 대규모의 반복적 데이터 수집과 정교한 학습 데이터베이스의 구축이 음성인식기술의 핵심요소라는 것이 일반적인 시각이다. 이러한 이유로 인해, 대용량 음성인식 데이터베이스의 수집, 가공 등이 가능한 인프라를 갖춘 기관 및 업체들이 음성인식기술 시장을 점유할 수 있는 것이다. 그러나, 이러한 음성인식의 서비스 제공 체계는 사물인터넷 또는 웨어러블 디바이스 등으로 음성인식 사용자 인터페이스가 확대되고 통신 및 네트워크가 연결이 불가한 경우 그 한계를 보일 수 있다. 본고에서는 이러한 문제를 해결하기 위한 내장형 음성인식기의 하드웨어가속기 기술개발에 대한 내용과 국내외 현황을 살펴보기로 한다.
PDF

Articulatory robotics (조음 로보틱스)

Nam, Hosung
- Phonetics and Speech Sciences
- /
- v.13 no.2
- /
- pp.1-7
- /
- 2021
Speech is a spatiotemporally coordinated structure of constriction actions at discrete articulators such as lips, tongue tip, tongue body, velum, and glottis. Like other human movements (e.g., reaching), each action as a linguistic task is completed by a synergy of involved basic elements (e.g., bone, muscle, neural system). This paper discusses how speech tasks are dynamically related to joints as one of the basic elements in terms of robotics of speech production. Further this introduction of robotics to speech sciences will hopefully deepen our understanding of how speech is produced and provide a solid foundation to developing a physical talking machine.
https://doi.org/10.13064/KSSS.2021.13.2.001 인용 PDF KSCI

한국어 문자음성 변환시스템 : 가라사대

권철홍;정원국;구준모;김형순
- Information and Communications Magazine
- /
- v.11 no.9
- /
- pp.17-25
- /
- 1994
본 논문에서는 국내 최초의 상용 한국어 무제한 음성합성 시스템인 가라사대에 관하여 기술한다. 우선, 음성합성 과정의 각 단계에 이용된 알고리즘을 설명한다. 즉, 문장의 분석을 위해서는 문장 전처리, parsing 발음표기 변환 등의 규칙에 의하여 순차적으로 수행된다. 문장 분석후에는 강세, 억양과 지속시간 등의 운율을 제어하는 요소가 계산되고 음성신호는 확장된 diphone 단위의 음성신호를 연결하여 생성된다. 다음으로 가라사대 하드웨어 및 소프트웨어의 구성에 관하여 서술한다. 범용의 디지탈 신호처리 IC를 이용하여 구현한 하드웨어와 가라사대의 소프트웨어뿐만 아니라 PC내의 소프트웨어의 구성과 역할에 관하여 살펴본다.
PDF

The Effect of Helium Gas Intake on the Characteristics Change of the Acoustic Organs for Voice Signal Analysis Parameter Application (음성신호 분석 요소의 적용으로 헬륨가스 흡입이 음성 기관의 특성 변화에 미치는 영향)

Kim, Bong-Hyun;Cho, Dong-Uk
- The KIPS Transactions:PartB
- /
- v.18B no.6
- /
- pp.397-404
- /
- 2011
In this paper, we were carried out experiments to apply parameter of voice analysis to measure changing characteristic articulator according to inhale the helium gas. The helium gas was used to overcome air embolism nitrogen gas to deal a fatal blow in body nitrogen gas by diver. However, the helium gas has been much trouble interpretation about abnormal voice of diver to cause squeaky voice of low articulation. Therefor, we was carried out experiments about pitch and spectrogram measurement, analysis based on to influence in acoustic organs before and after of inhaled helium gas.
https://doi.org/10.3745/KIPSTB.2011.18B.6.397 인용 PDF KSCI

An Implementation of Speech DB Gathering System Using VoiceXML (VoiceXML을 이용한 음성 DB 수집 시스템 구현)

Kim Dong-Hyun;Roh Yong-Wan;Hong Kwang-Seok
- Journal of Internet Computing and Services
- /
- v.6 no.1
- /
- pp.39-50
- /
- 2005
Speech DB is basically required factor when we are study for phonetics, speech recognition and speech synthesis and so on. The quantity and quality of speech DB decide the efficiency of system that we develop. therefore. speech DB has an extremely important factor, Recently, development of the various telephone service technique such as voice portal. it is actual condition where the necessity of collection of telephone speech DB. The existing IVR application telephone speech DB collection system used C/C++ language or the exclusive development tool. Thus it is the actual condition where the recycle of each application service for resources is difficult and have a problem of many labors and time necessity. But. VoiceXML is a language having tag form ipredicated in XML. which has easy and simple grammar system. Therefore, if we make a few efforts we could draw up easily. it has a merit reducing labors and time, Also, VoiceXML has many advantages of various telephone speech DB gathering because of changing contents of DB. In this paper, we introduce telephone speech DB gathering system which is the mast important factor for development of speech information processing technique.
PDF

Change Analysis of Heart Related Voice Analysis Parameter Based on Auricular Acupuncture (이침요법(耳針療法)을 기반으로 한 심장 관련 음성 분석 요소의 변화 분석)

Kim, Bong-Hyun;Lim, Soon-Yong;Lim, Sung-Su;Yoo, Hwang-Jun;Yeon, Yong-Heum;Min, Ji-Seon;Han, Sang-Hyo;Ka, Min-Kyoung;Cho, Dong-Uk
- Proceedings of the Korea Information Processing Society Conference
- /
- 2011.11a
- /
- pp.1043-1046
- /
- 2011
건강에 대한 예방과 관리를 반영한 것이 대체의학이다. 대체의학 중에 이침(耳針)요법은 부작용이 적은 방법으로 널리 사용되고 있다. 이침요법은 간단한 교육과정을 거친 후 자가 진단을 통해 응급처치가 가능한 것으로 실생활에서 손쉽게 이용되고 있다. 따라서 본 논문에서는 심장에 해당하는 이(耳)혈 상응점을 자극하여 심장과 관련된 음성 요소의 변화를 측정하였다. 이를 위해 심장에 해당하는 이(耳)혈 상응점을 자극하기 전과 후의 음성을 수집하여 음성 분석 요소 중 Jitter와 2Formant Frequency Bandswidth을 적용하여 단위 시간안의 발음에서 성대 진동의 변화율과 공명강의 변화를 통해 심장과 음성의 상관성을 분석하는 연구를 수행하였다.
https://doi.org/10.3745/PKIPS.y2011m11a.1043 인용 PDF

Context sentiment analysis based on Speech Tone (발화 음성을 기반으로 한 감정분석 시스템)

Jung, Jun-Hyeok;Park, Soo-Duck;Kim, Min-Seung;Park, So-Hyun;Han, Sang-Gon;Cho, Woo-Hyun
- Proceedings of the Korea Information Processing Society Conference
- /
- 2017.11a
- /
- pp.1037-1040
- /
- 2017
현재 머신러닝과 딥러닝의 기술이 빠른 속도로 발전하면서 수많은 인공지능 음성 비서가 출시되고 있지만, 발화자의 문장 내 존재하는 단어만 분석하여 결과를 반환할 뿐, 비언어적 요소는 인식할 수 없기 때문에 결과의 구조적인 한계가 존재한다. 따라서 본 연구에서는 인간의 의사소통 내 존재하는 비언어적 요소인 말의 빠르기, 성조의 변화 등을 수치 데이터로 변환한 후, "플루칙의 감정 쳇바퀴"를 기초로 지도학습 시키고, 이후 입력되는 음성 데이터를 사전 기계학습 된 데이터를 기초로 kNN 알고리즘을 이용하여 분석한다.
https://doi.org/10.3745/PKIPS.y2017m11a.1037 인용 PDF

A Study on Change of Voice Analysis Parameter According to the Eucalyptus Fragrance (유칼립투스 발향에 따른 음성 분석 요소의 변화 분석 연구)

Kim, Bong-Hyun;Lim, Soon-Yong;Lim, Sung-Su;Ka, Min-Kyoung;Cho, Dong-Uk
- Proceedings of the Korea Information Processing Society Conference
- /
- 2011.11a
- /
- pp.1035-1038
- /
- 2011
아로마테라피로 알려진 향기요법은 19세기 과학적인 근거로 천연오일을 의학적으로 사용하면서 체계적인 근간을 이루게 된 대체요법이다. 본 논문에서는 향기요법의 이론적 배경을 기반으로 기관지에 효과적인 유칼립투스 천연오일의 발향을 통해 음성기관의 변화 정도를 측정하는 연구를 수행하였다. 특히, 기관지와 관련된 음성분석 요소인 성대 진동의 변화율과 진폭의 규칙성을 측정, 분석하여 유칼립투스 천연오일의 발향에 따른 기관지 기능의 효과성을 객관적으로 입증하는 실험을 수행하였다.
https://doi.org/10.3745/PKIPS.y2011m11a.1035 인용 PDF

문화콘텐츠의 보호.유통과 법적 문제

조용순
- Review of Korea Contents Association
- /
- v.2 no.1
- /
- pp.9-21
- /
- 2004
법적 의미로 "콘텐츠"란 "부호.문자.음성.음향 및 영상 등의 자료 또는 정보"(문화산업진흥기본법 제2조 제3호)이며, "문화적요소"란 "예술성.창의성 오락성.여가성.대중성"(문화산업진흥기본법 제2조 제1호 바목)을 말한다. 따라서 문화콘텐츠란 "문화적 요소가 체화된 부호.문자.음성.음향 및 영상 등의 자료 또는 정보"라고 할 수 있다. 이것이 '디지털'이라는 존재형식을 가지게 되면 "문화적 요소가 체화되어 경제적 부가가치를 창출하는 디지털콘텐츠"인 디지털문화콘텐츠가 된다(문화산업진홍기본법 제2조 제1호 제5호).(중략) 창출하는 디지털콘텐츠"인 디지털문화콘텐츠가 된다(문화산업진홍기본법 제2조 제1호 제5호).(중략).(중략)
https://doi.org/10.20924/CCTHBL.2004.2.1.009 인용 PDF

How to Use EVT Figures for Actor Voice Training I (배우 음성 훈련을 위한 EVT 구조연습 활용방안 I)

Lee, Young-Su
- The Journal of the Korea Contents Association
- /
- v.21 no.9
- /
- pp.136-148
- /
- 2021
In this study, the theoretical principle and structural practice of Estill Voice Training model that enables independent control of voice organs in the actor's acting art using voice as a medium of artistic expression. Its purpose is to explore the positive utility that can be applied to operation. The research on the speech science methodology that controls the differences in speech output due to the principle of the generation organ is a reality that has not been actively introduced in Korea compared to the existing actor's speech training that encompasses both the mind and the body. Voice can guarantee the accuracy and stability of operation when an understanding of our body is preceded based on anatomical physiology as well as contribute to the characterization of the character's phonetic character an element of character creation. Considering the training model through proprioception in actor voice training has practical value and alternative significance that the actor can be sought as a principle and practical methodology in the process of generating a series of target sounds.
https://doi.org/10.5392/JKCA.2021.21.09.136 인용 PDF KSCI HTML

Search Result 402, Processing Time 0.044 seconds

이메일무단수집거부

이용약관

제 1 장 총칙

제 2 장 이용계약의 체결

제 3 장 계약 당사자의 의무

제 4 장 서비스의 이용

제 5 장 계약 해지 및 이용 제한

제 6 장 손해배상 및 기타사항

Detail Search

Image Search (β)