• Title/Summary/Keyword: Speech improvement

Search Result 610, Processing Time 0.027 seconds

Korean Phoneme Recognition Using Self-Organizing Feature Map (SOFM 신경회로망을 이용한 한국어 음소 인식)

  • Jeon, Yong-Koo;Yang, Jin-Woo;Kim, Soon-Hyob
    • The Journal of the Acoustical Society of Korea
    • /
    • v.14 no.2
    • /
    • pp.101-112
    • /
    • 1995
  • Constructing a feature map-based phoneme classification system for speech recognition usually requires two procedures: clustering and labeling. In this paper, we present a phoneme classification system that uses Kohonen's Self-Organizing Feature Map (SOFM) as both clusterer and labeler. The SOFM is known to perform a self-organizing process that produces an optimal local topographical mapping of the signal space and yields reasonably high accuracy in recognition tasks. Consequently, the SOFM can be applied effectively to phoneme recognition. In addition, to improve the performance of the phoneme classification system, we propose a learning algorithm that is combined with the classical K-means clustering algorithm in the fine-tuning stage. To evaluate the proposed phoneme classification algorithm, we first use a total of 43 phonemes to construct six intra-class feature maps for six different phoneme classes. In speaker-dependent phoneme classification tests using these six feature maps, we obtain a recognition rate of $87.2\%$ and confirm that the proposed algorithm is an efficient method for improving recognition performance and convergence speed.

  • PDF
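
The abstract above pairs SOFM training with a K-means fine-tuning stage but does not give the update rules. The following is a minimal Python sketch of that general idea, not the authors' algorithm: a 1-D map with a Gaussian neighbourhood, followed by batch K-means updates. The function names, the linear decay schedules, and all default parameter values are assumptions made for illustration.

```python
import math
import random


def nearest(codebook, x):
    """Index of the codebook vector closest to x (squared Euclidean distance)."""
    return min(range(len(codebook)),
               key=lambda i: sum((c - v) ** 2 for c, v in zip(codebook[i], x)))


def train_sofm(data, n_units, epochs=20, lr0=0.5, radius0=None, seed=0):
    """Train a 1-D self-organizing map; returns the list of unit weight vectors."""
    rng = random.Random(seed)
    radius0 = radius0 or n_units / 2
    # Initialise units from randomly chosen training samples (an assumption).
    codebook = [list(rng.choice(data)) for _ in range(n_units)]
    total_steps = epochs * len(data)
    step = 0
    for _ in range(epochs):
        for x in rng.sample(data, len(data)):
            frac = step / total_steps
            lr = lr0 * (1.0 - frac)                    # linearly decaying learning rate
            radius = max(radius0 * (1.0 - frac), 0.5)  # shrinking neighbourhood
            winner = nearest(codebook, x)
            for i, unit in enumerate(codebook):
                # Gaussian neighbourhood on the map grid, centred on the winner
                h = math.exp(-((i - winner) ** 2) / (2 * radius ** 2))
                for d in range(len(unit)):
                    unit[d] += lr * h * (x[d] - unit[d])
            step += 1
    return codebook


def kmeans_finetune(data, codebook, iterations=5):
    """Refine SOFM units with batch K-means updates (no neighbourhood term)."""
    codebook = [list(u) for u in codebook]
    for _ in range(iterations):
        members = [[] for _ in codebook]
        for x in data:
            members[nearest(codebook, x)].append(x)
        for i, group in enumerate(members):
            if group:  # units with no members are left unchanged
                codebook[i] = [sum(col) / len(group) for col in zip(*group)]
    return codebook


def quantization_error(data, codebook):
    """Mean squared distance from each sample to its nearest unit."""
    return sum(sum((c - v) ** 2 for c, v in zip(codebook[nearest(codebook, x)], x))
               for x in data) / len(data)
```

Calling `kmeans_finetune(data, train_sofm(data, n_units=4))` on a list of feature vectors returns the refined units. The labeling step mentioned in the abstract (assigning a phoneme label to each unit) is not shown here.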

Improvement of Recognition Speed for Real-time Address Speech Recognition (실시간 주소 음성인식을 위한 인식 시스템의 인식속도 개선)

  • Hwang Cheol-Jun;Oh Se-Jin;Kim Bum-Koog;Jung Ho-Youl;Chung Hyun-Yeol
    • Proceedings of the Acoustical Society of Korea Conference
    • /
    • spring
    • /
    • pp.74-77
    • /
    • 1999
  • In this paper, we propose a new variable pruning threshold to improve the recognition speed of the address speech recognition system developed in our laboratory, and confirm its effectiveness through experiments. The conventional variable pruning threshold repeatedly lowers the threshold by a fixed value once a fixed number of frames has elapsed, and therefore searches unnecessary parts of the search space. The newly proposed variable pruning threshold lowers the threshold by a fixed amount after an initial fixed interval, but in subsequent frames it changes the threshold according to the candidates to be searched, so the search space can be reduced effectively. To confirm the effectiveness of the proposed method, we applied it to the Korean address input system developed in our laboratory. This system uses 48 continuous-HMM phoneme-like units (PLUs) as the basic recognition units, uses maximum a posteriori probability estimation (MAP) to minimize the degradation of recognition performance caused by changes in the operating environment, and uses the OPDP (One Pass Dynamic Programming) method as the recognition algorithm. In recognition experiments using 75 connected address names spoken by three male speakers, the fixed pruning threshold gave an average recognition rate of $96.0\%$ and a recognition time of 5.26 seconds, and the conventional variable pruning threshold gave an average recognition rate of $96.0\%$ and a recognition time of 5.1 seconds. With the new variable pruning threshold, the recognition time fell to 4.34 seconds with no loss in recognition rate, a reduction of 0.92 seconds and 0.76 seconds respectively, which confirms the effectiveness of the proposed method.

  • PDF
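
The abstract above describes a pruning threshold that is first lowered by a fixed amount and then adjusted according to the candidates still being searched, but it does not state the exact rule. The sketch below shows one plausible reading in Python, under stated assumptions: the schedule in `next_beam`, its parameter names, and its default values are all hypothetical and are not taken from the paper.

```python
def prune(scores, beam):
    """Keep hypotheses whose log-score is within `beam` of the best one."""
    best = max(scores.values())
    return {h: s for h, s in scores.items() if s >= best - beam}


def next_beam(beam, frame, n_active, *, warmup=10, step=5.0,
              target_active=50, min_beam=20.0):
    """Hypothetical candidate-dependent beam schedule.

    During the first `warmup` frames the beam is left alone. At frame
    `warmup` it is narrowed once by `step`. Afterwards it is narrowed by
    `step` only while more than `target_active` hypotheses survive, and
    it never drops below `min_beam`.
    """
    if frame < warmup:
        return beam
    if frame == warmup or n_active > target_active:
        return max(beam - step, min_beam)
    return beam
```

In a one-pass decoder, `prune` would be applied to the active hypotheses after each frame, and `next_beam` would then be called with the number of survivors to set the beam for the following frame.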

A Speaker Pruning Method for Reducing Calculation Costs of Speaker Identification System (화자식별 시스템의 계산량 감소를 위한 화자 프루닝 방법)

  • 김민정;오세진;정호열;정현열
    • The Journal of the Acoustical Society of Korea
    • /
    • v.22 no.6
    • /
    • pp.457-462
    • /
    • 2003
  • In this paper, we propose a speaker pruning method for real-time processing and improved performance in a speaker identification system based on the GMM (Gaussian Mixture Model). Conventional speaker identification methods, such as ML (Maximum Likelihood), WMR (Weighting Model Rank), and MWMR (Modified WMR), calculate frame likelihoods using all frames of each input utterance and all of the speaker models, and then select the speaker with the largest accumulated likelihood. In these methods, however, calculation cost and processing time grow as the number of input frames and speakers increases. To solve this problem, the proposed method uses only part of the input frames to select the subset of speaker models with higher likelihoods, and decides the identified speaker by evaluating only the selected models. The method can also be applied to improve identification performance even when the number of speakers changes. In several experiments, the proposed method reduced calculation cost by 65% and increased the identification rate by 2% compared with conventional methods. These results mean that the proposed method can be applied effectively to real-time processing and to improving performance in speaker identification.
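
The pruning idea in the abstract above can be sketched in Python as follows. This is an illustrative sketch, not the authors' implementation: it uses plain ML accumulation (not WMR or MWMR), diagonal-covariance GMMs, and a single pruning point. The names `frame_loglik`, `identify_with_pruning`, `n_first`, and `n_keep` are assumptions introduced here.

```python
import math


def frame_loglik(frame, model):
    """Log-likelihood of one frame under a diagonal-covariance GMM.

    `model` is a list of (weight, means, variances) tuples.
    """
    comps = []
    for weight, means, variances in model:
        ll = math.log(weight)
        for x, m, v in zip(frame, means, variances):
            ll += -0.5 * (math.log(2 * math.pi * v) + (x - m) ** 2 / v)
        comps.append(ll)
    top = max(comps)
    return top + math.log(sum(math.exp(c - top) for c in comps))  # log-sum-exp


def identify_with_pruning(frames, models, n_first=10, n_keep=2):
    """Score all speakers on the first frames, then only the survivors.

    Returns the identified speaker and the number of frame-level
    likelihood evaluations that were performed.
    """
    totals = {spk: 0.0 for spk in models}
    evaluations = 0
    for t, frame in enumerate(frames):
        if t == n_first and len(totals) > n_keep:
            # Prune: keep only the n_keep speakers with the highest scores so far.
            keep = sorted(totals, key=totals.get, reverse=True)[:n_keep]
            totals = {spk: totals[spk] for spk in keep}
        for spk in totals:
            totals[spk] += frame_loglik(frame, models[spk])
            evaluations += 1
    return max(totals, key=totals.get), evaluations
```

With three speaker models, six frames, `n_first=2`, and `n_keep=1`, this performs 10 frame evaluations instead of the 18 needed to score every model on every frame.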

Improving QoS using Cellular-IP/PRC in Wireless Internet Environment (Cellular-IP/PRC에서 핸드오프 상태 머신에 의한 QoS 개선)

  • Kim Dong-Hyun
    • Journal of the Korea Institute of Information and Communication Engineering
    • /
    • v.9 no.6
    • /
    • pp.1302-1308
    • /
    • 2005
  • We propose a Cellular-IP/PRC network that combines paging with the characteristics of Cellular IP and uses a route information management cache, in order to secure a call admission method for the wireless Internet environment and QoS in small-cell environments. The proposed admission method assumes that the home base station of a mobile node has ample capacity; when mobile nodes from most adjacent cells are admitted at the home base station at once, speech quality is secured by accounting for the resulting increase in received interference, and new call admission at the home base station is based on an estimate of the mobile node's transmission power. The PC (Paging Cache) and RC (Routing Cache) used to manage paging and routing in the wireless Internet network are integrated and applied to all nodes as a single PRC (Paging Router Cache), and a handoff state machine is added to the mobile node so that its handoff and roaming states can be managed efficiently and the connection function can be carried out at the node. We analyze the factors that influence call traffic in the system environment and predict the call level and degree of imbalance of each link. Because the proposed algorithm judges from total transmit and receive power whether downlink call capacity is limited and then admits or blocks the call, we also predict the closely related call blocking probability and call dropping probability, the GoS (Grade of Service), and the efficiency of cell capacity, and show the resulting improvement in QoS performance.

Effective Feature Vector for Isolated-Word Recognizer using Vocal Cord Signal (성대신호 기반의 명령어인식기를 위한 특징벡터 연구)

  • Jung, Young-Giu;Han, Mun-Sung;Lee, Sang-Jo
    • Journal of KIISE:Software and Applications
    • /
    • v.34 no.3
    • /
    • pp.226-234
    • /
    • 2007
  • In this paper, we develop a speech recognition system using a throat microphone. The use of this kind of microphone minimizes the impact of environmental noise. However, because of the absence of high frequencies and the partial loss of formant frequencies, previous systems developed with such devices have shown a lower recognition rate than systems that use standard microphone signals. This problem has led researchers to use throat microphone signals as supplementary data sources supporting standard microphone signals. In this paper, we present a high-performance ASR system that we developed using only a throat microphone by taking advantage of Korean Phonological Feature Theory and a detailed throat signal analysis. Analyzing the spectrum and the FFT of the throat microphone signal, we find that the conventional MFCC feature vector, which uses a critical-band filter, does not characterize throat microphone signals well. We also describe the conditions that make a feature extraction algorithm best suited to throat microphone signal analysis. The conditions involve (1) a sensitive band-pass filter and (2) the use of a feature vector that is suitable for voice/non-voice classification. We show experimentally that the ZCPA algorithm, designed to meet these conditions, improves the recognizer's performance by approximately 16%. We also find that an additional noise-canceling algorithm such as RASTA yields a further 2% improvement in performance.

The Study on the Application of He-Ne Laser with Low Energy ILIB to the Superficial Venules (저용량(低容量) He-Ne 레이저침의 혈락적용(血絡適用) 연구(硏究))

  • Kim Sung-Chul;Cho Eun-Hee;Na Chang-Su
    • Korean Journal of Acupuncture
    • /
    • v.20 no.3
    • /
    • pp.35-47
    • /
    • 2003
  • Objective: The purpose of this study was to investigate the significance of Oriental medical treatment using a He-Ne laser with low-energy Intravascular Laser Irradiation of Blood (ILIB) through the superficial venules. Methods: We reviewed the literature on the superficial venules and on pricking blood techniques through the superficial venules, classified those techniques by blood-letting puncture method, collected domestic clinical treatises on the effectiveness of treatment using a He-Ne laser with low-energy ILIB through the superficial venules, and considered methods for improving its clinical effectiveness. Results and Conclusions: The superficial venules are small arteries, veins and capillaries in the superficial region of the human body. Pricking blood techniques comprise blood-letting puncture, in which an acupuncture implement is applied to the Jing points, Extra points and superficial blood vessels, and acupuncture using the Hirudo. The methods of blood-letting puncture are classified into venous blood-letting puncture, pricking, picking out white fiber-like substances from the subcutaneous tissue, cluster needling, scattered needling, blood-letting puncture of the tready collateral branch of the large channel, and blood-letting puncture of the skin. The He-Ne laser with low-energy ILIB through the superficial venules belongs to Oriental medical treatment as a method of blood-letting puncture in the vein of the cubital fossa. The He-Ne laser with low-energy ILIB has an effect on hyperfibrinogenemia, hyperlipidemia, speech and motor dysfunction in cases of cerebral infarction, headache, dizziness, pain and numbness. Fundamental research on biological changes in the human body, experimental animals and unicellular animals, research on effectiveness and safety, and the development of a He-Ne laser with low-energy ILIB in an effective wavelength range are considered necessary.

  • PDF

TREATMENT OF OPEN BITE BY TONGUE THRUSTING HABIT USING HABIT BREAKING APPLIANCE AND MYOFUNCTIONAL THERAPY (습관제거장치와 근기능요법을 이용한 혀내밀기 원인성 개방교합의 치료)

  • Choi, Ji-Won;Oh, You-Hyang;Lee, Chang-Seop;Lee, Sang-Ho;Lee, Nan-Young
    • Journal of the korean academy of Pediatric Dentistry
    • /
    • v.32 no.2
    • /
    • pp.229-235
    • /
    • 2005
  • Harmful oral habits affect children's dentition and are difficult to treat. Such habits in children include abnormal swallowing patterns, a low or forward tongue rest posture, a habitual open-lips resting posture, habitual mouth breathing, excessive digit sucking, and tongue thrusting. Tongue thrusting causes slight craniofacial skeletal changes and considerable dental malocclusion such as anterior open bite. Anterior open bite causes masticatory, speech, and esthetic problems in growing children, and presents difficulties in diagnosis, treatment, and prediction of prognosis. Treatment of these abnormal behaviors involves orofacial myofunctional therapy and the use of a habit-breaking appliance. The prognosis is determined not by the presence or severity of the oral habit but by the skeletal tendency of the patient. Use of a tongue crib resulted not only in discontinuance of the habit but also in improvement in overjet and overbite. This study showed that relatively successful results can be obtained with a removable tongue crib and myofunctional therapy in cases of open bite related to a tongue thrusting habit.

  • PDF

A Case Study on the Professional Education Using SAFMEDS Teaching Strategy (SAFMEDS 교수전략을 적용한 전문가 교육 사례연구)

  • Jeong, Gyeong-Hee;Choi, Jinhyeok;Ahn, Sung-Woo;Shin, Chang-Suk
    • Journal of rehabilitation welfare engineering & assistive technology
    • /
    • v.10 no.1
    • /
    • pp.9-18
    • /
    • 2016
  • This case study reports the educational usefulness of SAFMEDS (Say All Fast a Minute Every Day Shuffled) for improving fluency. The participants were three professionals in special education and speech pathology who were enrolled in a graduate-level course, Research in Children with Autism Spectrum Disorder. The SAFMEDS strategy was used as a study tool to help the participants acquire fluent verbal repertoires for the key terminology of Skinner's (1957) analysis of verbal behavior, covering a list of 60 pairs of terms. The participants developed 60 flash cards, each presenting a target term on the front and its definition on the back. During the intervention, the participants were required to look at the definition and say its term. The results indicate that SAFMEDS was effective in improving the participants' verbal repertoires in terms of both accuracy and fluency. These results may help education professionals improve the accuracy and fluency of specific target operants.

Improving QoS using Cellular-IP/PRC in Hospital Wireless Network (병원 무선망에서 Cellular-IP/PRC에 의한 QoS 개선)

  • Suk, Kyung Hyu;Kim, Sung-Hong
    • The Journal of the Korea institute of electronic communication sciences
    • /
    • v.3 no.3
    • /
    • pp.188-194
    • /
    • 2008
  • In this paper, we propose a method for improving QoS in a hospital wireless network using Cellular-IP/PRC (Paging Route Cache), which combines the Paging Cache and Route Cache of Cellular-IP, and for improving real-time and non-real-time handoff service using a handoff state machine with the Paging Route Cache. Although the Cellular-IP/PRC technology was devised for mobile Internet communication, it is vulnerable in environments with frequent handoffs. The handoff state machine, which uses differentiated handoff, improves quality of service in Cellular-IP/PRC. The suggested algorithm performs better than existing technology in a wireless mobile Internet communication environment. The proposed admission method assumes that the home base station of a mobile node has ample capacity; when mobile nodes from most adjacent cells are admitted at the home base station at once, speech quality is secured by accounting for the resulting increase in received interference, and new call admission at the home base station is based on an estimate of the mobile node's transmission power. The PC (Paging Cache) and RC (Routing Cache) used to manage paging and routing in the wireless Internet network are integrated and applied to all nodes as a single PRC (Paging Router Cache), and a handoff state machine is added to the mobile node so that its handoff and roaming states can be managed efficiently and the connection function can be carried out at the node. We analyze the factors that influence call traffic in the system environment and predict the call level and degree of imbalance of each link. Because the proposed algorithm judges from total transmit and receive power whether downlink call capacity is limited and then admits or blocks the call, we also predict the closely related call blocking probability and call dropping probability, the GoS (Grade of Service), and the efficiency of cell capacity, and show the resulting improvement in QoS performance.

  • PDF

A STUDY OF THE INFLUENCE ON PHONATION WHEN MAXILLARY ANTERIOR TEETH ARE MISSING (상악 전치부 결손이 발음에 미치는 영향에 관한 연구)

  • Roh Chang-Sup;Choi Dae-Gyun;Woo Yi-Hyung;Choi Boo-Byung
    • The Journal of Korean Academy of Prosthodontics
    • /
    • v.30 no.3
    • /
    • pp.338-360
    • /
    • 1992
  • This study was performed to investigate the phonetic alterations that occur when upper anterior teeth are missing. To compare phonation before and after insertion of a temporary prosthesis, six subjects who had lost their upper anterior teeth were selected (two male, four female). The tested sounds (/ga(가), na(나), da(다), ra(라), sa(사), ja(자), cha(차), ta(타), pa(파), ha(하), gi(기), ni(니), di(디), ri(리), si(시), ji(지), chi(치), ti(티), pi(피), hi(히), seu(스), se(세), so(소), su(수)/) were programmed into an IBM AT with and without the temporary prosthesis. The recordings were analyzed for formants, consonant durations, and energy level changes with an LSI speech workstation program. During pronunciation of the tested sounds (with and without the temporary prosthesis), mandibular movements were recorded on a Mandibular Kinesiogram and analyzed. The findings led to the following conclusions: 1. Objective differences could not be found. However, subjective improvement could be noticed in every informant. 2. There were no persistent correlations in the formant changes, and phonetic changes varied in every informant. 3. Consonant durations changed in various ways in every informant. By and large, those of /si(시), ji(지), chi(치), pi(피), hi(히)/ were longer than those of the other tested sounds. After insertion of the prosthesis, durations were shorter. Consonants with /i(ㅣ)/ were longer than those with /a(ㅏ)/, with or without the prosthesis. 4. With and without the temporary prosthesis, mandibular movements varied in the frontal view. Mandibular movements showed lateral deviations, and mandibular positions with /si(시), ji(지), ti(티), seu(스), hi(히)/ were nearer to the mandibular rest position. 5. The kind of temporary prosthesis and the condition of the missing teeth influenced each informant differently, so there was no correlation between informants. 6. Energy levels increased in all tested sounds with a fixed temporary prosthesis, and there were no differences before and after insertion of a removable temporary prosthesis. However, sibilant sounds and consonants with /i(ㅣ)/ showed a slightly increased energy level.

  • PDF