• Title/Summary/Keyword: Articulation paper

Search Result 145, Processing Time 0.027 seconds

Performance Improvement of Continuous Digits Speech Recognition Using the Transformed Successive State Splitting and Demi-syllable Pair (반음절쌍과 변형된 연쇄 상태 분할을 이용한 연속 숫자 음 인식의 성능 향상)

  • Seo Eun-Kyoung;Choi Gab-Keun;Kim Soon-Hyob;Lee Soo-Jeong
    • Journal of Korea Multimedia Society
    • /
    • v.9 no.1
    • /
    • pp.23-32
    • /
    • 2006
  • This paper describes the optimization of a language model and an acoustic model to improve speech recognition using Korean unit digits. Since the model is composed of a finite state network (FSN) with a disyllable, recognition errors of the language model were reduced by analyzing the grammatical features of Korean unit digits. Acoustic models utilize a demisyllable pair to decrease recognition errors caused by inaccurate division of a phone or monosyllable due to short pronunciation time and articulation. We have used the K-means clustering algorithm with the transformed successive state splitting in the feature level for the efficient modelling of feature of the recognition unit. As a result of experiments, 10.5% recognition rate is raised in the case of the proposed language model. The demi-syllable fair with an acoustic model increased 12.5% recognition rate and 1.5% recognition rate is improved in transformed successive state splitting.

  • PDF

Analysis on the Formation of Dualistic Space and Networks of the Ceramic Industry in Icheon, Korea (이천 도자기 산업의 이원적 공간 형성 및 네트워크 분석)

  • Cheu, Giwan;Lee, Sung-Cheol
    • Journal of the Economic Geographical Society of Korea
    • /
    • v.18 no.4
    • /
    • pp.556-572
    • /
    • 2015
  • Since the late 1990s dualistic spatial structure has been configurated in Icheon ceramic industrial space due to the articulation of transmitted ceramics space rooted from imitating the Goryeo and Joseon ceramics and contemporary ceramics space based on academic ceramic arts. Therefore, the main purpose of this paper is to identify the formation of dualistic space in Icheon by investigating the development paths of ceramic industry in historical perspectives and analyzing inter- and extra-firm relations in Icheon. The main results of this research are as follows. Firstly, the development path of transmitted ceramics has declined gradually, while the development path of contemporary ceramics has been embedded in Icheon region. Secondly, the research pointed out that networks of transmitted ceramics and contemporary ceramics are different in the perspectives of inter-firm and extra-firm relations. Thirdly, the government has played a critical role as a financial and administrative supporter and as a network broker between university and Icheon ceramic firms(mainly with transmitted ceramics) for technological cooperation and collaborative R&D.

  • PDF

Performance Improvement of Connected Digit Recognition by Considering Phonemic Variations in Korean Digit and Speaking Styles (한국어 숫자음의 음운변화 및 화자 발성특성을 고려한 연결숫자 인식의 성능향상)

  • 송명규;김형순
    • The Journal of the Acoustical Society of Korea
    • /
    • v.21 no.4
    • /
    • pp.401-406
    • /
    • 2002
  • Each Korean digit is composed of only a syllable, so recognizers as well as Korean often have difficulty in recognizing it. When digit strings are pronounced, the original pronunciation of each digit is largely changed due to the co-articulation effect. In addition to these problems, the distortion caused by various channels and noises degrades the recognition performance of Korean connected digit string. This paper dealt with some techniques to improve recognition performance of it, which include defining a set of PLUs by considering phonemic variations in Korean digit and constructing a recognizer to handle speakers various speaking styles. In the speaker-independent connected digit recognition experiments using telephone speech, the proposed techniques with 1-Gaussian/state gave string accuracy of 83.2%, i. e., 7.2% error rate reduction relative to baseline system. With 11-Gaussians/state, we achieved the highest string accuracy of 91.8%, i. e., 4.7% error rate reduction.

Performance Improvement of Continuous Digits Speech Recognition using the Transformed Successive State Splitting and Demi-syllable pair (반음절쌍과 변형된 연쇄 상태 분할을 이용한 연속 숫자음 인식의 성능 향상)

  • Kim Dong-Ok;Park No-Jin
    • Journal of the Korea Institute of Information and Communication Engineering
    • /
    • v.9 no.8
    • /
    • pp.1625-1631
    • /
    • 2005
  • This paper describes an optimization of a language model and an acoustic model that improve the ability of speech recognition with Korean nit digit. Recognition errors of the language model are decreasing by analysis of the grammatical feature of korean unit digits, and then is made up of fsn-node with a disyllable. Acoustic model make use of demi-syllable pair to decrease recognition errors by inaccuracy division of a phone, a syllable because of a monosyllable, a short pronunciation and an articulation. we have used the k-means clustering algorithm with the transformed successive state splining in feature level for the efficient modelling of the feature of recognition unit . As a result of experimentations, $10.5\%$ recognition rate is raised in the case of the proposed language model. The demi-syllable pair with an acoustic model increased $12.5\%$ recognition rate and $1.5\%$ recognition rate is improved in transformed successive state splitting.

The injection petrol control system about CMAC neural networks (CMAC 신경회로망을 이용한 가솔린 분사 제어 시스템에 관한 연구)

  • Han, Ya-Jun;Tack, Han-Ho
    • Journal of the Korea Institute of Information and Communication Engineering
    • /
    • v.21 no.2
    • /
    • pp.395-400
    • /
    • 2017
  • The paper discussed the air-to-fuel ratio control of automotive fuel-injection systems using the cerebellar model articulation controller(CMAC) neural network. Because of the internal combustion engines and fuel-injection's dynamics is extremely nonlinear, it leads to the discontinuous of the fuel-injection and the traditional method of control based on table look up has the question of control accuracy low. The advantages about CMAC neural network are distributed storage information, parallel processing information, self-organizing and self-educated function. The unique structure of CMAC neural network and the processing method lets it have extensive application. In addition, by analyzing the output characteristics of oxygen sensor, calculating the rate of fuel-injection to maintain the air-to-fuel ratio. The CMAC may easily compensate for time delay. Experimental results proved that the way is more good than traditional for petrol control and the CMAC fuel-injection controller can keep ideal mixing ratio (A/F) for engine at any working conditions. The performance of power and economy is evidently improved.

스웨덴어 발음 교육상의 몇 가지 문제점 - 모음을 중심으로 -

  • Byeon Gwang-Su
    • MALSORI
    • /
    • no.4
    • /
    • pp.20-30
    • /
    • 1982
  • The aim of this paper is to analyse difficulties of the pronunciation in swedish vowels encountered by Koreans learners and to seek solutions in order to correct the possible errors. In the course of the analysis the swedish and Korean vowels in question are compared with the purpose of describing differences aha similarities between these two systems. This contrastive description is largely based on the students' articulatory speech level ana the writer's auditory , judgement . The following points are discussed : 1 ) Vowel length as a distinctive feature in Swedish compared with that of Korean. 2) A special attention is paid on the Swedish vowel [w:] that is characterized by its peculiar type of lip rounding. 3) The six pairs of Swedish vowels that are phonologically contrastive but difficult for Koreans to distinguish one from the other: [y:] ~ [w:], [i:] ~ [y:], [e:] ~ [${\phi}$:], [w;] ~ [u:] [w:] ~ [$\theta$], [$\theta$] ~ [u] 4) The r-colored vowel in the case of the postvocalic /r/ that is very common in American English is not allowed in English sound sequences. The r-colored vowel in the American English pattern has to be broken up and replaced hi-segmental vowel-consonant sequences . Korean accustomed to the American pronunciation are warned in this respect. For a more distinct articulation of the postvocalic /r/ trill [r] is preferred to fricative [z]. 5) The front vowels [e, $\varepsilon, {\;}{\phi}$) become opener variants (${\ae}, {\;}:{\ae}$] before / r / or supradentals. The results of the analysis show that difficulties of the pronunciation of the target language (Swedish) are mostly due to the interference from the Learner's source language (Korean). However, the Learner sometimes tends to get interference also from the other foreign language with which he or she is already familiar when he or she finds in that language more similarity to the target language than in his or her own mother tongue. Hence this foreign language (American English) in this case functions as a second language for Koreans in Learning Swedish.

  • PDF

Design of Sound Quality Index for Laser Printers and Its Application for Improvement Study (프린터의 음질 인덱스 제작과 음질개선에 대한 응용)

  • Kim, Eui-Youl;Lee, Young-Jun;Lee, Sang-Kwon
    • Transactions of the Korean Society for Noise and Vibration Engineering
    • /
    • v.22 no.6
    • /
    • pp.509-523
    • /
    • 2012
  • The sound quality based on design optimization, throughout the development process of various electronic office equipments, needs to be considered in order to respond the increased needs for the emotional satisfaction of customers in terms of psycho-acoustics. This paper focuses on how to describe the characteristics of operating sound radiated from laser printers by using various sound attributes, and to model the sound quality index that can properly evaluate the subjective preference on modification conditions in the improvement study quantitatively. Especially, the proposed verification process, in the form of combining the correlation based method and the decision error based method, was applied to improve the generality and reliability of a group of participants in the jury evaluation. The modified Aures tonality model was also proposed to improve the correlation coefficient with the mean response of participants by optimizing some parameters. As a result, the loudness, articulation index, roughness, tonality, fluctuation strength were used to model the sound quality index for laser printers by using the multiple-linear regression method. Through the improvement study, it was confirmed that replacing the absorbing materials is effective to reduce the tonalness radiated from the side of a reference printer model. Based on above results, it can be concluded that the proposed model has enough usefulness as quantitative evaluation index to evaluate the difference between modification conditions in the improvement study.

A Study on the Currents and Implications of the Japanese University Admissions Reform (일본 대학입시정책의 변화 동향과 시사점)

  • Kim, Yong;Eom, Areum
    • Korean Journal of Comparative Education
    • /
    • v.28 no.3
    • /
    • pp.185-216
    • /
    • 2018
  • This paper intends to comprehend the background and various responses in sites of the university admissions reform in Japan which attracts the huge attention of educators in Korea these days. First of all, in order to grasp the roots of the reform, the historic development of various tests and university admission system was analyzed. Tracing the discussion of the three government committees, the process that the reform proposal was formed was also examined. After reviewing the formation and sample tests of the new university entrance examination and the introduction of the International Baccalaureate to Japanese high schools, several points to be considered in the reform of the university entrance examination system in Korea were suggested.

Robustness Evaluation of Tactical Network based on SNA

  • Park, Ji-Hye;Yoon, Soung-woong;Lee, Sang-Hoon
    • Journal of the Korea Society of Computer and Information
    • /
    • v.24 no.10
    • /
    • pp.205-213
    • /
    • 2019
  • Network robustness is one of the most important characteristics needed as the network. Over the military tactical communication network, robustness is a key function for maintaining attack phase constantly. Tactical Information Communication Network, called TICN, has mixed characteristics of lattice- and tree-type network topology, which looks somewhat weak in the viewpoint of network robustness. In this paper, we search articulation points and bridges in a current Tactical Information Communication Network using graph theory. To improve the weak points empirically searched, we try to add links to create the concrete network and then observe the change of network-based verification values through diminishing nodes. With these themes, we evaluate the generated networks through SNA techniques. Experimental results show that the generated networks' robustness is improved compared with current network structure.

Sub-modality of Mental Images to Make lines Alive (대사를 생명력 있게 만드는 멘탈 이미지의 하위양식)

  • Choi, Jung-Sun
    • Journal of Korea Entertainment Industry Association
    • /
    • v.13 no.4
    • /
    • pp.119-129
    • /
    • 2019
  • Traditional speech training in acting education focused on the technical aspects of expressing the lines such as finding long/short syllables in the word, exercising articulation of consonants and vowels, and practicing diction etc. There was a limit on this education to transform written words to vivid verbal words. The lines become live when the actor sees the concrete mental images hidden in the words while speaking the lines. I will bring the knowledge of cognitive brain science and NLP(Neural Linguistic Programming) to investigate what mental images are and why mental images are fundamental elements of thought and emotion. In addition to that, I will examine how the muscles of the body react in the process of visualization of delicate mental images (subordinate form) and how to use the responsive muscles to express speaking materials such as intensity, pause, pitch, intonation etc. Conclusion, I will enumerate the obstacles encountered by actors in the course of practicing mental images, and suggest 'activation of breathing' as a thesis of the follow-up paper to eliminate those obstacles. This process, I intend to make mental images to be the concrete and practical information that can be applied to speak the dialogue in the play.