• 제목/요약/키워드: source text

검색결과 267건 처리시간 0.024초

교육용 한국어 TTS 플랫폼 개발 (A Korean TTS System for Educational Purpose)

  • 이정철;이상호
    • 대한음성학회지:말소리
    • /
    • 제50호
    • /
    • pp.41-50
    • /
    • 2004
  • Recently, there has been considerable progress in the natural language processing and digital signal processing components and this progress has led to the improved synthetic speech qualify of many commercial TTS systems. But there still remain many obstacles to overcome for the practical application of TTS. To resolve the problems, the cooperative research among the related areas is highly required and a common Korean TTS platform is essential to promote these activities. This platform offers a general framework for building Korean speech synthesis systems and a full C/C++ source for modules supports to implement and test his own algorithm. In this paper we described the aspect of a Korean TTS platform to be developed and a developing plan.

  • PDF

음성 데이터베이스로부터의 효율적인 색인데이터베이스 구축과 정보검색 (The Extraction of Effective Index Database from Voice Database and Information Retrieval)

  • 박미성
    • 한국도서관정보학회지
    • /
    • 제35권3호
    • /
    • pp.271-291
    • /
    • 2004
  • 전자도서관과 같은 정보제공원은 이미지, 음성, 동영상 등과 같은 비정형 멀티미디어 데이터 서비스에 대한 요구를 받고 있다. 그리하여 본 연구에서는 음성 처리를 위해 어절생성기, 음절복원기, 형태소분석기, 교정기를 제안하였다. 제안한 음성처리 기술로 음성데이터베이스를 텍스트데이터베이스로 변환 한후 텍스트데이터베이스로부터 색인데이터베이스를 추출하였다. 그리고 추출한 색인데이터베이스로 텍스트와 음성의 내용기반정보검색에 활용할 수 있음을 보이기 위해 정보검색모델을 제안하였다.

  • PDF

Vocabulary Expansion Technique for Advertisement Classification

  • Jung, Jin-Yong;Lee, Jung-Hyun;Ha, Jong-Woo;Lee, Sang-Keun
    • KSII Transactions on Internet and Information Systems (TIIS)
    • /
    • 제6권5호
    • /
    • pp.1373-1387
    • /
    • 2012
  • Contextual advertising is an important revenue source for major service providers on the Web. Ads classification is one of main tasks in contextual advertising, and it is used to retrieve semantically relevant ads with respect to the content of web pages. However, it is difficult for traditional text classification methods to achieve satisfactory performance in ads classification due to scarce term features in ads. In this paper, we propose a novel ads classification method that handles the lack of term features for classifying ads with short text. The proposed method utilizes a vocabulary expansion technique using semantic associations among terms learned from large-scale search query logs. The evaluation results show that our methodology achieves 4.0% ~ 9.7% improvements in terms of the hierarchical f-measure over the baseline classifiers without vocabulary expansion.

다중 뷰 편집환경을 위한 점진적 다중진입 지원 파서에 대한 연구 (A Study of Incremental and Multiple Entry Support Parser for Multi View Editing Environment)

  • 염세훈;방혜자
    • 디지털산업정보학회논문지
    • /
    • 제14권3호
    • /
    • pp.21-28
    • /
    • 2018
  • As computer performance and needs of user convenience increase, computer user interface are also changing. This changes had great effects on software development environment. In past, text editors like vi or emacs on UNIX OS were the main development environment. These editors are very strong to edit source code, but difficult and not intuitive compared to GUI(Graphical User Interface) based environment and were used by only some experts. Moreover, the trends of software development environment was changed from command line to GUI environment and GUI Editor provides usability and efficiency. As a result, the usage of text based editor had decreased. However, because GUI based editor use a lot of computer resources, computer performance and efficiency are decreasing. The more contents are, the more time to verify and display the contents it takes. In this paper, we provide a new parser that provide multi view editing, incremental parsing and multiple entry of abstract syntax tree.

택리지의 의미적 고찰 (A study on the second Intention of Taek-li-ji)

  • 정기호
    • 한국조경학회지
    • /
    • 제17권3호
    • /
    • pp.49-57
    • /
    • 1990
  • In general, Taek-li-ji is noted for its reliable source in studies on the old korean culture. But when we examine it in detail, we can find its contrary propositions with regard to the context ; for the thematic question of the good place to settle, that is the leitmotiv of Taek-li-ji, opinion, in consideration of the place of good settlement, has the author the conclusion ; nowhere, but by means of "I Ging" can be found a good one - place of no place. In the text, we find some other questionable points. In means that the content of this book must be to make understandable and we need a solution of the contradiction in context. 1 think it might be intended by the author. In this study I have tried, the intention of the author to find and the context to review, in order to interpretate the text and to resolve its contradiction. And I have earned a hidden opinion of the author, the second intention of the book.

  • PDF

Analyzing Customer Experience in Hotel Services Using Topic Modeling

  • Nguyen, Van-Ho;Ho, Thanh
    • Journal of Information Processing Systems
    • /
    • 제17권3호
    • /
    • pp.586-598
    • /
    • 2021
  • Nowadays, users' reviews and feedback on e-commerce sites stored in text create a huge source of information for analyzing customers' experience with goods and services provided by a business. In other words, collecting and analyzing this information is necessary to better understand customer needs. In this study, we first collected a corpus with 99,322 customers' comments and opinions in English. From this corpus we chose the best number of topics (K) using Perplexity and Coherence Score measurements as the input parameters for the model. Finally, we conducted an experiment using the latent Dirichlet allocation (LDA) topic model with K coefficients to explore the topic. The model results found hidden topics and keyword sets with high probability that are interesting to users. The application of empirical results from the model will support decision-making to help businesses improve products and services as well as business management and development in the field of hotel services.

태극침법(太極鍼法)의 확장형인 오장원혈침법(五臟原穴鍼法)의 적응증 연구 - "황제내경(黃帝內經).영추(靈樞)"를 중심으로 - (A study on the indications of Five Viscera Source Point Acupuncture extended from Taegeuk Acupuncture : Focused on Yeoungchu(靈樞))

  • 모한영;임교민;백진웅
    • 대한한의학원전학회지
    • /
    • 제25권4호
    • /
    • pp.123-147
    • /
    • 2012
  • Objective : By establishing the Five Viscera Source Point Acupuncture as the targeted acupuncture treatment for stadardization, as the first step, this study was conducted to sort the indications of each acupuncture remedies, which can be referred as one of the most important factors in acupuncture treatment, based on Yeoungchu. Method : This study selected only the contents related to indications of five viscera, by extracting the relevant sentences from Yeoungchu using the search words Liver(Liver Meridian, First Yin), Heart(Pericardium, Heart Meridian, Second Yin), Spleen(Spleen meridian, Third Yin), Lung(Lung Meridian, Third Yin), and Kidney(Kidney Meridian, Second Yin). Result & Conclusion : 1. We selected and extracted text related to liver disease from Chapter 16, heart (pericardium) disease from Chapter 16, spleen disease from Chapter 19, lung disease from Chapter 17, and finally kidney disease from Chapter 17 of Yeoungchu. 2. The basic theory of applying Five Viscera Source Point Acupuncture to five viscera diseases is first assorting the diseases according to its state (i.e. deficiency or excess), then draining the source point of the appropriate viscus in case of excess, or supplementing the source point of the appropriate viscus in case of deficiency. 3. For the correct application of Five Viscera Source Point Acupuncture, the classification of the disease, not only the judgement on its state, must be presented systematically and synthetically in combination with Four Examinations. Therefore the follow-up studies needs to be conducted.

오픈소스 소프트웨어를 활용한 자연어 처리 패키지 제작에 관한 연구 (Research on Natural Language Processing Package using Open Source Software)

  • 이종화;이현규
    • 한국정보시스템학회지:정보시스템연구
    • /
    • 제25권4호
    • /
    • pp.121-139
    • /
    • 2016
  • Purpose In this study, we propose the special purposed R package named ""new_Noun()" to process nonstandard texts appeared in various social networks. As the Big data is getting interested, R - analysis tool and open source software is also getting more attention in many fields. Design/methodology/approach With more than 9,000 R packages, R provides a user-friendly functions of a variety of data mining, social network analysis and simulation functions such as statistical analysis, classification, prediction, clustering and association analysis. Especially, "KoNLP" - natural language processing package for Korean language - has reduced the time and effort of many researchers. However, as the social data increases, the informal expressions of Hangeul (Korean character) such as emoticons, informal terms and symbols make the difficulties increase in natural language processing. Findings In this study, to solve the these difficulties, special algorithms that upgrade existing open source natural language processing package have been researched. By utilizing the "KoNLP" package and analyzing the main functions in noun extracting command, we developed a new integrated noun processing package "new_Noun()" function to extract nouns which improves more than 29.1% compared with existing package.

농정신편(農政新編)의 출전고(出典攷) (A Study on the Source of Manual for Agriculture)

  • 김명배
    • 한국식생활문화학회지
    • /
    • 제1권4호
    • /
    • pp.383-394
    • /
    • 1986
  • This study was conducted to search the origin of Nong Jung Shin Pyun ( 農政新編 ), a book of agricultural manual. This book was edited by An Jong-Soo who translated the agricultural manual of Japan and China, both were writted in Japanese. This book might be used not only as text book for agricultural workshop but as reference book for peasants.

  • PDF

Performance of H.26L Video Coding

  • Nga, N.H.;Fernando, W.A.C.
    • 대한전자공학회:학술대회논문집
    • /
    • 대한전자공학회 2002년도 ITC-CSCC -3
    • /
    • pp.1582-1585
    • /
    • 2002
  • In recent years the demand for digital video and image communications has been increased tremendously. On the other hand, video communications requires very much bandwidth in comparison with other information types such as text and data. Thus to adapt with the bandwidth-limited channels, especially wireless channels, video source must be compressed extremely. A new video coding standard namely H.26L is being developed by Joint Video Team (JVT). In this paper performance of H.26L is analyzed in an AWGN environment.

  • PDF