• Title/Summary/Keyword: Keyword Extract

Search Result 116, Processing Time 0.023 seconds

Concept-based Question Answering System

  • Kang Yu-Hwan;Shin Seung-Eun;Ahn Young-Min;Seo Young-Hoon
    • International Journal of Contents
    • /
    • v.2 no.1
    • /
    • pp.17-21
    • /
    • 2006
  • In this paper, we describe a concept-based question-answering system in which concept rather than keyword itself makes an important role on both question analysis and answer extraction. Our idea is that concepts occurred in same type of questions are similar, and if a question is analyzed according to those concepts then we can extract more accurate answer because we know the semantic role of each word or phrase in question. Concept frame is defined for each type of question, and it is composed of important concepts in that question type. Currently the number of question type is 79 including 34 types for person, 14 types for location, and so on. We experiment this concept-based approach about questions which require person s name as their answer. Experimental results show that our system has high accuracy in answer extraction. Also, this concept-based approach can be used in combination with conventional approaches.

  • PDF

Building Domain Ontology Based on Linguistic Patterns

  • Kim, Kweon-Yang;Lim, Soo-Yeon
    • Journal of the Korean Institute of Intelligent Systems
    • /
    • v.16 no.6
    • /
    • pp.766-771
    • /
    • 2006
  • In this paper, we focus on the building domain ontology from corpus by extracting concepts and properties relationships based on linguistic patterns. The pharmacy field is selected as an experiment domain and we present an algorithm to extract hierarchical structure for terminology based on the noun/suffix patterns of terminology in domain texts. In order to show usefulness of our domain ontology, we compare a typical keyword based retrieval method with an ontology based retrieval mettled which uses related information in an ontology for a related feedback. As a result, our method shows the improvement of precision by 4.97% without losing recall.

Implementation of Extracting Specific Information by Sniffing Voice Packet in VoIP

  • Lee, Dong-Geon;Choi, WoongChul
    • International journal of advanced smart convergence
    • /
    • v.9 no.4
    • /
    • pp.209-214
    • /
    • 2020
  • VoIP technology has been widely used for exchanging voice or image data through IP networks. VoIP technology, often called Internet Telephony, sends and receives voice data over the RTP protocol during the session. However, there is an exposition risk in the voice data in VoIP using the RTP protocol, where the RTP protocol does not have a specification for encryption of the original data. We implement programs that can extract meaningful information from the user's dialogue. The meaningful information means the information that the program user wants to obtain. In order to do that, our implementation has two parts. One is the client part, which inputs the keyword of the information that the user wants to obtain, and the other is the server part, which sniffs and performs the speech recognition process. We use the Google Speech API from Google Cloud, which uses machine learning in the speech recognition process. Finally, we discuss the usability and the limitations of the implementation with the example.

An Analysis on Keywords in the Journal of Korean Safety Management Science from 2018 to 2021 (2018년부터 2021년까지 대한안전경영과학회지의 주제어에 관한 분석)

  • Byoung-Hak Yang
    • Journal of the Korea Safety Management & Science
    • /
    • v.25 no.1
    • /
    • pp.1-6
    • /
    • 2023
  • This study tried to analyze the keywords of the papers published in the Korea Safety Management Science by using the social network analysis. In order to extract the keywords, information on journal articles published from 2018 to 2021 was extracted from the SCIENCE ON. Among the keywords extracted from a total of 129 papers, the keywords with similar meanings were standardized. The keywords used in the same paper were visualized by connecting them through a network. Four centrality indicators of the social network analysis were used to analyze the effect of the keyword. Safety, Safety management, Apartment, Fire hose, SMEs, Virtual reality, Machine learning, Waterproof time, R&D capability, and Job crafting were selected as the keywords analyzed with high influence in the four centrality indicators.

An Attempt to Measure the Familiarity of Specialized Japanese in the Nursing Care Field

  • Haihong Huang;Hiroyuki Muto;Toshiyuki Kanamaru
    • Asia Pacific Journal of Corpus Research
    • /
    • v.4 no.2
    • /
    • pp.57-74
    • /
    • 2023
  • Having a firm grasp of technical terms is essential for learners of Japanese for Specific Purposes (JSP). This research aims to analyze Japanese nursing care vocabulary based on objective corpus-based frequency and subjectively rated word familiarity. For this purpose, we constructed a text corpus centered on the National Examination for Certified Care Workers to extract nursing care keywords. The Log-Likelihood Ratio (LLR) was used as the statistical criterion for keyword identification, giving a list of 300 keywords as target words for a further word recognition survey. The survey involved 115 participants of whom 51 were certified care workers (CW group) and 64 were individuals from the general public (GP group). These participants rated the familiarity of the target keywords through crowdsourcing. Given the limited sample size, Bayesian linear mixed models were utilized to determine word familiarity rates. Our study conducted a comparative analysis of word familiarity between the CW group and the GP group, revealing key terms that are crucial for professionals but potentially unfamiliar to the general public. By focusing on these terms, instructors can bridge the knowledge gap more efficiently.

Extracting Method of User's Interests by Using SNS Follower's Relationship and Sequential Pattern Evaluation Indices for Keyword (키워드를 위한 시퀀셜 패턴 평가 지표와 SNS 팔로워의 관계를 이용한 사용자 관심사항 추출방법)

  • Shin, Bong-Hi;Jeon, Hye-Kyoung
    • Journal of the Korea Convergence Society
    • /
    • v.8 no.8
    • /
    • pp.71-75
    • /
    • 2017
  • Due to the spread of SNS, web-based consumer-generated data is increasing exponentially. It is important in many fields to accurately extract what is appropriate for the user's interest in a large amount of data. It is especially important for business mangers to establish marketing policies to find the right customers for them in many users. In this paper, we try to obtain important information centering on customers who are interested in each account through Twitter follow - following relationship. Because Twitter's current follower relationships do not reflect the user's interests, we try to figure out the details of interest using keyword extraction methods for tweets of followers. To do this, we select two domestic commercial Twitter accounts and apply the sequential pattern evaluation index to the mining key phrase of the text data collected from the follower.

Development of Search Method using Semantic technologies about RESTful Web Services (시맨틱 기술을 활용한 RESTful 웹서비스의 검색 기법 개발)

  • Cha, Seung-Jun;Choi, Yun-Jeong;Lee, Kyu-Chul
    • Journal of Korea Spatial Information System Society
    • /
    • v.12 no.1
    • /
    • pp.100-104
    • /
    • 2010
  • Recently with advent of Web 2.0, RESTful Web Services are becoming increasing trend to emphasize Web as platform. There are already many services and the number of service increases in very fast pace. So it is difficult to find the service what we want by keyword based search. To solve this problem, we developed the search method using sem antic technologies about RESTful Web Services. For that, first we define the system structure and model the description format based on the integrated search system for OpenAPIs, and then we add Semantic Markup (tagging, semantic annotation) on the HTML description pages. Next we extract RDF document from them and store it in service repository. Based on the keywords that are extended by means of ontology, the developed system provides more purified and extended results than similarity-based keyword searching system.

Identification of sentiment keywords association-based hotel network of hotel review using mapper method in topological data analysis (Topological Data Analysis 기법을 활용한 호텔 리뷰데이터의 감성 키워드 기반 호텔 관계망 구축)

  • Jeon, Ye-Seul;Kim, Jeong-Jae
    • The Korean Journal of Applied Statistics
    • /
    • v.33 no.1
    • /
    • pp.75-86
    • /
    • 2020
  • Hotel review data can extract various information that includes purchasing factors that lead to consumption, advantages, and disadvantages for hotels. In particular, the sentiment keyword of the review data helps consumers understand the pros and cons of hotels. However, it is not efficient for consumers to read a large number of reviews. Therefore, it is necessary to offer a summary review to customers. In this study, we suggest providing summary information on sentiment keywords association as well as a network of hotels based on sentiment keywords. Based on a sentiment keyword dictionary, the extracted sentiment keywords associations construct the hotel network through topological data analysis based mapper. This hotel network allows a consumer to find some hotels associated with specific sentiment keywords as well as recommends the same related hotels. This summary information provides users with a summarized emotional assessment of hotels and helps hotel marketing teams understand consumers' perceptions of their hotel.

A Study on Ontology Instance Generation Using Keywords (키워드를 활용한 온톨로지 인스턴스 생성에 관한 연구)

  • Han, Kwang-Rok;Kang, Hyun-Min;Sohn, Surg-Won
    • Journal of the Korea Society of Computer and Information
    • /
    • v.15 no.5
    • /
    • pp.1-11
    • /
    • 2010
  • The success of semantic web depends largely on the semantic annotation which systematizes knowledge for the construction and production of ontology. Therefore, the efficiency of semantic annotation is very important in order to change many knowledge expressions and generate into ontology instances. In this paper, we presents a generation system of rule-based ontology instances which are produced accurately and efficiently via semantic annotation in conventional web sites. In conventional studies, the manual process is necessary for finding relevant information, comparing it with ontology, and entering information. We propose a new method that manages keyword data regarding extracted information and rule information separately. Thus, it is quite practical to extract information efficiently from various web documents by adding a small number of keywords and rules. The proposed method shows the possibility of ontology instance generation which reuses the rules and keywords from the various websites.

Comparison of term weighting schemes for document classification (문서 분류를 위한 용어 가중치 기법 비교)

  • Jeong, Ho Young;Shin, Sang Min;Choi, Yong-Seok
    • The Korean Journal of Applied Statistics
    • /
    • v.32 no.2
    • /
    • pp.265-276
    • /
    • 2019
  • The document-term frequency matrix is a general data of objects in text mining. In this study, we introduce a traditional term weighting scheme TF-IDF (term frequency-inverse document frequency) which is applied in the document-term frequency matrix and used for text classifications. In addition, we introduce and compare TF-IDF-ICSDF and TF-IGM schemes which are well known recently. This study also provides a method to extract keyword enhancing the quality of text classifications. Based on the keywords extracted, we applied support vector machine for the text classification. In this study, to compare the performance term weighting schemes, we used some performance metrics such as precision, recall, and F1-score. Therefore, we know that TF-IGM scheme provided high performance metrics and was optimal for text classification.