• Title/Summary/Keyword: 음성인식률

Search Result 549, Processing Time 0.029 seconds

HunMinJeomUm: Text Extraction and Braille Conversion System for the Learning of the Blind (시각장애인의 학습을 위한 텍스트 추출 및 점자 변환 시스템)

  • Kim, Chae-Ri;Kim, Ji-An;Kim, Yong-Min;Lee, Ye-Ji;Kong, Ki-Sok
    • The Journal of the Institute of Internet, Broadcasting and Communication
    • /
    • v.21 no.5
    • /
    • pp.53-60
    • /
    • 2021
  • The number of visually impaired and blind people is increasing, but braille translation textbooks for them are insufficient, which violates their rights to education despite their will. In order to guarantee their rights, this paper develops a learning system, HunMinJeomUm, that helps them access textbooks, documents, and photographs that are not available in braille, without the assistance of others. In our system, a smart phone app and web pages are designed to promote the accessibility of the blind, and a braille kit is produced using Arduino and braille modules. The system supports the following functions. First, users select documents or pictures that they want, and the system extracts the text using OCR. Second, the extracted text is converted into voice and braille. Third, a membership registration function is provided so that the user can view the extracted text. Experiments have confirmed that our system generates braille and audio outputs successfully, and provides high OCR recognition rates. The study has also found that even completely blind users can easily access the smart phone app.

Classification of nasal places of articulation based on the spectra of adjacent vowels (모음 스펙트럼에 기반한 전후 비자음 조음위치 판별)

  • Jihyeon Yun;Cheoljae Seong
    • Phonetics and Speech Sciences
    • /
    • v.15 no.1
    • /
    • pp.25-34
    • /
    • 2023
  • This study examined the utility of the acoustic features of vowels as cues for the place of articulation of Korean nasal consonants. In the acoustic analysis, spectral and temporal parameters were measured at the 25%, 50%, and 75% time points in the vowels neighboring nasal consonants in samples extracted from a spontaneous Korean speech corpus. Using these measurements, linear discriminant analyses were performed and classification accuracies for the nasal place of articulation were estimated. The analyses were applied separately for vowels following and preceding a nasal consonant to compare the effects of progressive and regressive coarticulation in terms of place of articulation. The classification accuracies ranged between approximately 50% and 60%, implying that acoustic measurements of vowel intervals alone are not sufficient to predict or classify the place of articulation of adjacent nasal consonants. However, given that these results were obtained for measurements at the temporal midpoint of vowels, where they are expected to be the least influenced by coarticulation, the present results also suggest the potential of utilizing acoustic measurements of vowels to improve the recognition accuracy of nasal place. Moreover, the classification accuracy for nasal place was higher for vowels preceding the nasal sounds, suggesting the possibility of higher anticipatory coarticulation reflecting the nasal place.

Development of Urban Flood Forecasting Model using Statistical Method (통계학적 기법을 이용한 도시홍수 예.경보모형의 개발)

  • Lee, Beum-Hee;Lim, Jong-Il
    • Proceedings of the Korea Water Resources Association Conference
    • /
    • 2007.05a
    • /
    • pp.805-809
    • /
    • 2007
  • 최근 도시의 발달은 하상공간에 대한 이용도를 높이는 방향으로 개발이 진행되어가는 추세이며, 하상도로 및 하상주차장의 이용은 이제 도시 내에서 이용 가능한 마지막 여유 공간으로 인식될 정도로 그 의존도가 높아져가고 있다. 그러나 하상공간의 활용도가 높아져갈수록 도시홍수의 발생으로 인한 대피문제가 발생하게 되고 돌발홍수로 인하여 하상도로의 차단 혹은 하상 주차장에 주차된 차량의 소거가 늦어지는 경우 고스란히 피해를 보게 되는 등 그 부작용도 계속 증가되고 있다. 도시홍수의 특성을 살펴보면 국지성 돌발 강우에 의한 유량의 급격한 증가와 짧은 유하시간, 작은 유역면적 등에 의하여 주요 예보지점까지의 도달시간이 매우 짧아 수문학적 홍수예측 모형을 이용하여 홍수예측 업무를 수행하는데 선행시간을 충분히 확보할 수 없다는 단점을 지니고 있다. 이에 따라 본 연구에서는 기존의 하천시스템에 대한 설계 등을 목적으로 하여 모형의 적용을 통한 시뮬레이션 기법을 적용하고 이를 통하여 홍수 예경보를 발령하기에는 선행시간의 확보(대피시간의 확보)라는 측면에서 상당한 어려움을 지닐 수 있으므로 시시각각으로 측정되는 실시간 수위측정 자료 및 실시간 강우자료를 이용하여 모형의 수행과정을 생략하고 하천의 수위변동을 직접 예측하고 대피할 수 있는 시나리오 기반의 수문모형을 개발하였다. SPSS를 사용한 통계학적 모형을 대전광역시 3대 하천에 대하여 적용한 결과 예측자료가 실측자료를 고수위 및 저수위 부근에서 정확히 모의하지 못하는 경향이 나타났으나 경계 및 위험수위를 설정하고 이를 넘어가는 시점에 대한 예측을 하는 홍수경보 시점 예측에는 효율적인 적용성을 나타내었다.씬 간편하면서도 정확도가 높아서, 환경방사성 스트론튬의 정량분석에 적절히 사용될 수 있다.e form of Jones matrix, which allows a new interpretation in the conversion efficiency of the thin-film optical waveguides.있다는 장점이 있었다. 따라서 소아에서 복막투석도관 수술 시 복강경적 방법을 이용하는 것이 효율적인 복막 투석을 위해 유용하다고 생각된다.상부 방광천자에 비해 민감도 59.5%(25/42), 특이도 86.6%(13/15)였고 위양성률 13.3%(2/15), 위음성률 40.5%(17/42) 로 정확도가 낮았다. 결론 : 소변을 가리지 못하는 영유아에서 요로 감염을 진단하기 위해서는 도뇨관 채뇨에 비해 초음파 감시하 치골상부 방광천자가 정확하고 안전한 채뇨법으로 권장되어야 한다고 생각한다.應裝置) 및 운용(運用)에 별다른 어려움이 없고, 내열성(耐熱性)이 강(强)하므로 쉬운 조건하(條件下)에서 경제적(經濟的)으로 공업적(工業的) 이용(利用)에 유리(有利)하다고 판단(判斷)되어진다.reatinine은 함량이 적었다. 관능검사결과(官能檢査結果) 자가소화(自家消化)시킨 크릴간장은 효소(酵素)처리한 것이나 재래식 콩간장에 비하여 품질 면에서 손색이 없고 저장성(貯藏性)이 좋은 크릴간장을 제조(製造)할 수 있다는 결론을 얻었다.이 있음을 확인할 수 있었다.에 착안하여 침전시 슬러지층과 상등액의 온도차를 측정하여 대사열량의 발생량을 측정하고 슬러지의 활성을 측정할 수 있는 방법을 개발하였다.enin과 Rhaponticin의 작용(作用)에 의(依)한 것이며,

  • PDF

Framework Switching of Speaker Overlap Detection System (화자 겹침 검출 시스템의 프레임워크 전환 연구)

  • Kim, Hoinam;Park, Jisu;Cha, Shin;Son, Kyung A;Yun, Young-Sun;Park, Jeon Gue
    • Journal of Software Assessment and Valuation
    • /
    • v.17 no.1
    • /
    • pp.101-113
    • /
    • 2021
  • In this paper, we introduce a speaker overlap system and look at the process of converting the existed system on the specific framework of artificial intelligence. Speaker overlap is when two or more speakers speak at the same time during a conversation, and can lead to performance degradation in the fields of speech recognition or speaker recognition, and a lot of research is being conducted because it can prevent performance degradation. Recently, as application of artificial intelligence is increasing, there is a demand for switching between artificial intelligence frameworks. However, when switching frameworks, performance degradation is observed due to the unique characteristics of each framework, making it difficult to switch frameworks. In this paper, the process of converting the speaker overlap detection system based on the Keras framework to the pytorch-based system is explained and considers components. As a result of the framework switching, the pytorch-based system showed better performance than the existing Keras-based speaker overlap detection system, so it can be said that it is valuable as a fundamental study on systematic framework conversion.

A Study on Improving of Access to School Library Collection through High School Students' DLS Search Behavior Analysis (고등학생의 DLS 검색행태 분석을 통한 학교도서관 자료 접근성 향상 방안 고찰)

  • Jung, Youngmi;Kang, Bong-Suk
    • Journal of Korean Library and Information Science Society
    • /
    • v.51 no.2
    • /
    • pp.355-379
    • /
    • 2020
  • Digital Library System(DLS) for the school library is a key access tool for school library materials. The purpose of this study was to find ways to improve the accessibility of materials through analysis of students' information search behavior in DLS. Data were collected through recording of 42 participants' DLS search process, and questionnaire. As a result, the search success rate and search satisfaction were found to be lower when the main purpose of DLS is simple leisure reading, information needs are relatively ambiguous, and when user experiences the complicated situations in the search process. The satisfaction level of search time sufficiency was the highest, and the search result satisfaction was the lowest. Besides, there was a need to improve DLS, such as integrated search of other library collection information, the recommendation of related materials, the print output of collection location, voice recognition through mobile apps, and automatic correction of search errors. Through this, the following can be suggested. First, DLS should complement the function of providing career information by reflecting the demand of education consumers. Second, improvements to DLS functionality to the general information retrieval system level must be made. Third, an infrastructure must be established for close cooperation between school library field personnel and DLS management authorities.

Multi channel far field speaker verification using teacher student deep neural networks (교사 학생 심층신경망을 활용한 다채널 원거리 화자 인증)

  • Jung, Jee-weon;Heo, Hee-Soo;Shim, Hye-jin;Yu, Ha-Jin
    • The Journal of the Acoustical Society of Korea
    • /
    • v.37 no.6
    • /
    • pp.483-488
    • /
    • 2018
  • Far field input utterance is one of the major causes of performance degradation of speaker verification systems. In this study, we used teacher student learning framework to compensate for the performance degradation caused by far field utterances. Teacher student learning refers to training the student deep neural network in possible performance degradation condition using the teacher deep neural network trained without such condition. In this study, we use the teacher network trained with near distance utterances to train the student network with far distance utterances. However, through experiments, it was found that performance of near distance utterances were deteriorated. To avoid such phenomenon, we proposed techniques that use trained teacher network as initialization of student network and training the student network using both near and far field utterances. Experiments were conducted using deep neural networks that input raw waveforms of 4-channel utterances recorded in both near and far distance. Results show the equal error rate of near and far-field utterances respectively, 2.55 % / 2.8 % without teacher student learning, 9.75 % / 1.8 % for conventional teacher student learning, and 2.5 % / 2.7 % with proposed techniques.

Prototype Design and Development of Online Recruitment System Based on Social Media and Video Interview Analysis (소셜미디어 및 면접 영상 분석 기반 온라인 채용지원시스템 프로토타입 설계 및 구현)

  • Cho, Jinhyung;Kang, Hwansoo;Yoo, Woochang;Park, Kyutae
    • Journal of Digital Convergence
    • /
    • v.19 no.3
    • /
    • pp.203-209
    • /
    • 2021
  • In this study, a prototype design model was proposed for developing an online recruitment system through multi-dimensional data crawling and social media analysis, and validates text information and video interview in job application process. This study includes a comparative analysis process through text mining to verify the authenticity of job application paperwork and to effectively hire and allocate workers based on the potential job capability. Based on the prototype system, we conducted performance tests and analyzed the result for key performance indicators such as text mining accuracy and interview STT(speech to text) function recognition rate. If commercialized based on design specifications and prototype development results derived from this study, it may be expected to be utilized as the intelligent online recruitment system technology required in the public and private recruitment markets in the future.

The Clinical Features of Endobronchial Tuberculosis - A Retrospective Study on 201 Patients for 6 years (기관지결핵의 임상상-201예에 대한 후향적 고찰)

  • Lee, Jae Young;Kim, Chung Mi;Moon, Doo Seop;Lee, Chang Wha;Lee, Kyung Sang;Yang, Suck Chul;Yoon, Ho Joo;Shin, Dong Ho;Park, Sung Soo;Lee, Jung Hee
    • Tuberculosis and Respiratory Diseases
    • /
    • v.43 no.5
    • /
    • pp.671-682
    • /
    • 1996
  • Background : Endobronchial tuberculosis is definded as tuberculous infection of the tracheobronchial tree with microbiological and histopathological evidence. Endobronchial tuberculosis has clinical significance due to its sequela of cicatrical stenosis which causes atelectasis, dyspnea and secondary pneumonia and may mimic bronchial asthma and pulmanary malignancy. Method : The authors carried out, retrospectively, a clinical study on 201 patients confirmed with endobronchial tuberculosis who visited the Department of Pulmonary Medicine at Hangyang University Hospital from January 1990 10 April 1996. The following results were obtained. Results: 1) Total 201 parients(l9.5%) were confirmed as endobronchial tuberculosis among 1031 patients who had been undergone flexible bronchofiberscopic examination. The number of male patients were 55 and that of female patients were 146. and the male to female ratio was 1 : 2.7. 2) The age distribution were as follows: there were 61(30.3%) cases in the third decade, 40 cases(19.9%) in the fourth decade, 27 cases(13.4%) in the sixth decade, 21 cases(10.4%) in the fifth decade, 19 cases(9.5%) in the age group between 15 and 19 years, 19 cases(9.5%) in the seventh decade, and 14 cases(7.0%) over 70 years, in decreasing order. 3) The most common symptom, in 192 cases, was cough 74.5%, followed by sputum 55.2%, dyspnea 28.6%, chest discomfort 19.8%, fever 17.2%, hemoptysis 11.5%, in decreasing order, and localized wheezing was heard in 15.6%. 4) In chest X-ray of 189 cases, consolidation was the most frequent finding in 67.7%, followed by collapse 43.9%. cavitary lesion 11.6%, pleural effusion 7.4%, in decreasing order, and there was no abnormal findings in 3.2%. 5) In the 76 pulmanary function tests, a normal pattern was found in 44.7%, restrictive pattern in 39.5 %, obstructive pattern in 11.8%, and combined pattern in 3.9%. 6) Among total 201 patients, bronchoscopy showed caseous pseudomembrane in 70 cases(34.8%), mucosal erythema and edema in 54 cases(26.9%), hyperplastic lesion in 52 cases(25.9%), fibrous s.enosis in 22 cases(10.9%), and erosion or ulcer in 3 cases(1.5%). 7) In total 201 cases, bronchial washing AFB stain was positive in 103 cases(51.2%), bronchial washing culture for tuberculous bacilli in 55 cases(27.4%). In the 99 bronchoscopic biopsies, AFB slain positive in 36.4%. granuloma without AFB stain positive in 13.1%, chronic inflammation only in 36.4%. and non diagnostic biopsy finding in 14.1%. Conclusions : Young female patients, whose cough resistant to genenal antitussive agents, should be evaluated for endobronchial tuberculosis, even with clear chest roentgenogram and negative sputum AFB stain. Furthermore, we would like to emphasize that the bronchoscopic approach is a substantially useful means of making a differential diagnosis of atelectasis in older patients of cancer age. At this time we have to make a standard endoscopic classification of endobronchial tuberculosis, and well designed prospective studies are required to elucidate the effect of combination therapy using antituberculous chemotherapy with steroids on bronchial stenosis in patients with endobronchial tuberculosis.

  • PDF

Therapeutic Use of Music for Stuttering Children (말더듬 아동을 위한 음악치료적 접근)

  • Cho, Jung Min
    • Journal of Music and Human Behavior
    • /
    • v.4 no.1
    • /
    • pp.21-30
    • /
    • 2007
  • Unlike other common forms of speech disorder, such as phonological disorder or dysphonia, stuttering has not been studied within the context of music therapy. Most cases of stuttering display no difficulty in singing, and fluency within the musical structure does not translate to fluency in speech. Hence, musical approach has been generally considered to be ineffective to the treatment of stuttering. However, the fundamentals of music therapy assume its extensive application in treating variety of speech disorders, including the case of stuttering. Presented in this paper are the case studies designed to validate the efficacy of music therapy as a remedy for stuttering. This study enrolled 6 children with stuttering and conducted 20 individual sessions over a period of 10 weeks. The sessions focused on the Melodic Intonation Therapy, Reinforcement of speech rhythm, song writing and singing. Musical elements were structured to enhance the verbal expression and rhythmic senses, as well as to facilitate the initiation of verbal communication. The result is as follows. First, it was noticed that the disfluency had been decreased in before and after of the music therapy in every child although the result was somewhat different depending the child. The overall result of the investigation shows the significant difference statistically. And categorically speaking, the significant difference was checked in the frequency of the stuttering. In the steps of the session, the increase and decrease was happened repeatedly, and then after it was decreased little by little. Secondly, the Communication Attitude was decreased in before and after of the music therapy, and also there was significant difference statistically. although the avoidance behavior was decreased in before and after of the music therapy, the increase and the decrease was repeated irregularly in the steps of session. All the results described above shows that music therapy gives positive effect to decrease in disfluency of stuttering child and also to develop the Communication Attitude. And new possibility and effectiveness can be proposed in the musical approach to the stuttering.

  • PDF