• Title/Summary/Keyword: Natural language process

Search Result 248, Processing Time 0.023 seconds

Train Booking Agent with Adaptive Sentence Generation Using Interactive Genetic Programming (대화형 유전 프로그래밍을 이용한 적응적 문장생성 열차예약 에이전트)

  • Lim, Sung-Soo;Cho, Sung-Bae
    • Journal of KIISE:Computing Practices and Letters
    • /
    • v.12 no.2
    • /
    • pp.119-128
    • /
    • 2006
  • As dialogue systems are widely required, the research on natural language generation in dialogue has raised attention. Contrary to conventional dialogue systems that reply to the user with a set of predefined answers, a newly developed dialogue system generates them dynamically and trains the answers to support more flexible and customized dialogues with humans. This paper proposes an evolutionary method for generating sentences using interactive genetic programming. Sentence plan trees, which stand for the sentence structures, are adopted as the representation of genetic programming. With interactive evolution process with the user, a set of customized sentence structures is obtained. The proposed method applies to a dialogue-based train booking agent and the usability test demonstrates the usefulness of the proposed method.

A Study on Creation of Hangeu-Romanization Conversion Table Using Petri-Nets (페트리넷을 이용한 한글-로마자 표기 변환표 생성에 관한 연구)

  • Kim, Kyung-Jing;Choi, Young-Kyoo;Rhee, Sang-Burm
    • The KIPS Transactions:PartB
    • /
    • v.9B no.6
    • /
    • pp.827-834
    • /
    • 2002
  • In this paper, we proposed the formation of Korean-Roman alphabet notation conversion table for the generation of Korean-Roman alphabet notation that also meets revised Roman alphabet notation. Introduced a mathematical analyzing method of the natural language which used a petrinet model so that a base of Roman alphabet notation analyzed standard pronunciation and Roman alphabet notation to work mathematically. It display the practical example through a petrinet modeling of a plan and Roman alphabet notation to create a Korean Roman alphabet notation conversion table with the method of the analysis that used a petrinet model, and present a mathematical modeling plan and application method of Korean. We developed application program based on window in order to verify a created Korean-Roman alphabet notation conversion table, and compared the result of an application program with Roman alphabet notation of an Roman alphabet notation example dictionary.

Light Weight Korean Morphological Analysis Using Left-longest-match-preference model and Hidden Markov Model (좌최장일치법과 HMM을 결합한 경량화된 한국어 형태소 분석)

  • Kang, Sangwoo;Yang, Jaechul;Seo, Jungyun
    • Korean Journal of Cognitive Science
    • /
    • v.24 no.2
    • /
    • pp.95-109
    • /
    • 2013
  • With the rapid evolution of the personal device environment, the demand for natural language applications is increasing. This paper proposes a morpheme segmentation and part-of-speech tagging model, which provides the first step module of natural language processing for many languages; the model is designed for mobile devices with limited hardware resources. To reduce the number of morpheme candidates in morphological analysis, the proposed model uses a method that adds highly possible morpheme candidates to the original outputs of a conventional left-longest-match-preference method. To reduce the computational cost and memory usage, the proposed model uses a method that simplifies the process of calculating the observation probability of a word consisting of one or more morphemes in a conventional hidden Markov model.

  • PDF

Inverse Document Frequency-Based Word Embedding of Unseen Words for Question Answering Systems (질의응답 시스템에서 처음 보는 단어의 역문헌빈도 기반 단어 임베딩 기법)

  • Lee, Wooin;Song, Gwangho;Shim, Kyuseok
    • Journal of KIISE
    • /
    • v.43 no.8
    • /
    • pp.902-909
    • /
    • 2016
  • Question answering system (QA system) is a system that finds an actual answer to the question posed by a user, whereas a typical search engine would only find the links to the relevant documents. Recent works related to the open domain QA systems are receiving much attention in the fields of natural language processing, artificial intelligence, and data mining. However, the prior works on QA systems simply replace all words that are not in the training data with a single token, even though such unseen words are likely to play crucial roles in differentiating the candidate answers from the actual answers. In this paper, we propose a method to compute vectors of such unseen words by taking into account the context in which the words have occurred. Next, we also propose a model which utilizes inverse document frequencies (IDF) to efficiently process unseen words by expanding the system's vocabulary. Finally, we validate that the proposed method and model improve the performance of a QA system through experiments.

A Study on Image Generation from Sentence Embedding Applying Self-Attention (Self-Attention을 적용한 문장 임베딩으로부터 이미지 생성 연구)

  • Yu, Kyungho;No, Juhyeon;Hong, Taekeun;Kim, Hyeong-Ju;Kim, Pankoo
    • Smart Media Journal
    • /
    • v.10 no.1
    • /
    • pp.63-69
    • /
    • 2021
  • When a person sees a sentence and understands the sentence, the person understands the sentence by reminiscent of the main word in the sentence as an image. Text-to-image is what allows computers to do this associative process. The previous deep learning-based text-to-image model extracts text features using Convolutional Neural Network (CNN)-Long Short Term Memory (LSTM) and bi-directional LSTM, and generates an image by inputting it to the GAN. The previous text-to-image model uses basic embedding in text feature extraction, and it takes a long time to train because images are generated using several modules. Therefore, in this research, we propose a method of extracting features by using the attention mechanism, which has improved performance in the natural language processing field, for sentence embedding, and generating an image by inputting the extracted features into the GAN. As a result of the experiment, the inception score was higher than that of the model used in the previous study, and when judged with the naked eye, an image that expresses the features well in the input sentence was created. In addition, even when a long sentence is input, an image that expresses the sentence well was created.

A Study on Interior Design Process by approaching Typological Method (유형학적 접근방식에 의한 실내디자인 과정에 관한 연구 (II))

  • 한경희;이선민
    • Korean Institute of Interior Design Journal
    • /
    • no.21
    • /
    • pp.165-172
    • /
    • 1999
  • For the useful method capable of modern expression on traditional residence architecture, a study was performed on the methodological establishment and possibility of typological method could be examinated to interior design process by typological method. First of all, through the establishment verbal of our Korean traditional architecture and further investigation of environmental and cultural idealogical facts, it could be extracted from natural instinct, duality, continuance, flexibility and transitiov. In second process, based on these results, it could be framed and described the individual typological language and, for the sake of drawing for visual and spatial typology, it was made by sketch in terms and view of possibile guidance of prototype, transforming and application method. from these results of investigated sketches, it cold be used for criteria of application method as the parts of visual and spatial typological elements to have an applicable expression of it/s traditionality. Based on above facts, for the subjects of spatial system, form & shape system, circulation system, order system, decoration system, color & material system in interior design fields, we cold propose the practical possibility through the consideration of application method for built-in meaning that could be adaptable for the interior design practices. These facts were extracted from the based on visual & spatial typology, as above mentiov. Also, through preparing and suggesting the criteria of evaluation and measurement of design quality , we could propose the applicable methodology for further & basically Korean traditional embodiment.

  • PDF

A Validity Verification of Human Error Probability using a Fuzzy Model (퍼지모델을 이용한 인적오류확률의 타당성 검증)

  • Jang, Tong-Il;Lee, Yong-Hee;Lim, Hyeon-Kyo
    • Journal of the Korean Society of Safety
    • /
    • v.21 no.3 s.75
    • /
    • pp.137-142
    • /
    • 2006
  • Quantification of error possibility, in an HRA process, should be performed so that the result of the qualitative analysis can be utilized in other areas in conjunction with overall safety estimation results. And also, the quantification is an essential process to analyze the error possibility in detail and to obtain countermeasures for the errors through screening procedures. In previous studies for the quantification of error possibility, nominal values were assigned by the experts' judgements and utilized as corresponding probabilities. The values assigned by experts' experiences and judgements, however, require verifications on their reliability. In this study, the validity of new error possibility values in new MCR design was verified by using the Onisawa's model which utilizes fuzzy linguistic values to estimate human error probabilities. With the model of error probabilities are represented as analyst's estimations and natural language expression instead of numerical values. As results, the experts' estimation values about error probabilities are well agreed to the existing error probability estimation model. Thus, it was concluded that the occurrence probabilities of errors derived from the human error analysis process can be assessed by nominal values suggested in the previous studies. It is also expected that our analysis method can supplement the conventional HRA method because the nominal values are based on the consideration of various influencing factors such as PSFs.

Untold story about why King Sejong invented the Korean alphabet

  • JUNG, Sanggyu
    • Journal of Koreanology Reviews
    • /
    • v.1 no.1
    • /
    • pp.1-23
    • /
    • 2022
  • HunMinJeongEum, meaning "the right sound to teach the people," was created in 1443 CE by King Sejong the Great, the fourth king of the Joseon Dynasty. In today's modern language, this letter, called Hangeul, is internationally recognized for its linguistic science. However, it is hard to find a comprehensive study on the fact that King Sejong himself created Hangeul, the Confucian perspective on natural disasters and democracy revealed in the process of writing, the independent efforts emphasized from a certain period, and the achievements of King Sejong, who shared the sorrow of the people and carried out national policies despite the extreme opposition of the nobility. Accordingly, I analyzed the consonants of HunMinJeongEum and looked at the essence of humanity and oriental philosophy (Yin-Yang Five Elements, Sangsu Philosophy, Hado). Surprisingly, different meanings from previous studies and interpretations were found, and King Sejong's "Da Vinci Code," which was left behind in the process of making the consonant, is reinterpreted and revealed. King Sejong's achievements were all connected as one. This is the root of democracy in the Republic of Korea today, and this is why King Sejong was selected as the most beloved and respected historical figure by the Korean people. This study will start with more people's understanding of the fundamental perception and philosophy of the world in Asia, including Korea, to reinterpret and reveal the hardships and great achievements experienced by a leader of a country in the process of creating korean alphabet, and to emphasize democracy, which is an important value for Asians and Westerners' mutual respect and co-prosperity.

Research on R&D Planning Through NLP Analysis of Patent Information: Focusing on Display Technology (특허정보의 NLP 분석을 통한 R&D 계획수립 방안 연구: 디스플레이 기술 분석을 중심으로)

  • Kim, Jung-Heui;Kim, Young-Min
    • Journal of the Korean Society of Industry Convergence
    • /
    • v.25 no.5
    • /
    • pp.817-826
    • /
    • 2022
  • Patent information describes the history of technological progress in the relevant field, so it can be usefully used to identify trends in technological development and change and to establish R&D development strategies. This study proposes a method to identify the needs and problems of technology development at the planning stage of the R&D process and to analyze core technologies through patent analysis using Natural Language Processing(NLP) technology. As a big data source, collected patent documents registered in Google Patents for foldable technology, the latest technology in the display industry, and then extracted keywords using NLP analyzer. By classifying the extracted keywords into needs and problems for technology development, developed technology and materials, identified the needs of the market and customers and analyzed the technologies being researched and developed. Unlike previous studies that performed patent analysis, this methodology is different in that it can quickly and conveniently analyze the latest technology trends from big data called patents even if you do not have specialized knowledge and skills in the text mining. This study contributes to the digitalization of the R&D process based on data analysis.

Rule Construction for Determination of Thematic Roles by Using Large Corpora and Computational Dictionaries (대규모 말뭉치와 전산 언어 사전을 이용한 의미역 결정 규칙의 구축)

  • Kang, Sin-Jae;Park, Jung-Hye
    • The KIPS Transactions:PartB
    • /
    • v.10B no.2
    • /
    • pp.219-228
    • /
    • 2003
  • This paper presents an efficient construction method of determination rules of thematic roles from syntactic relations in Korean language processing. This process is one of the main core of semantic analysis and an important issue to be solved in natural language processing. It is problematic to describe rules for determining thematic roles by only using general linguistic knowledge and experience, since the final result may be different according to the subjective views of researchers, and it is impossible to construct rules to cover all cases. However, our method is objective and efficient by considering large corpora, which contain practical osages of Korean language, and case frames in the Sejong Electronic Lexicon of Korean, which is being developed by dozens of Korean linguistic researchers. To determine thematic roles more correctly, our system uses syntactic relations, semantic classes, morpheme information, position of double subject. Especially by using semantic classes, we can increase the applicability of the rules.