• Title/Summary/Keyword: 용어추출

Search Result 365, Processing Time 0.026 seconds

Investigating Major Topics Through the Analysis of Depression-related Facebook Group Posts (페이스북 그룹 게시물 분석을 통한 우울증 관련 주제에 대한 고찰)

  • Zhu, Yongjun;Kim, Donghun;Lee, Changho;Lee, Yongjeong
    • Journal of the Korean Society for Library and Information Science
    • /
    • v.53 no.4
    • /
    • pp.171-187
    • /
    • 2019
  • The study aims to analyze the posts of depression-related Facebook groups to understand major topics discussed by group users. Specifically, the purpose of the study is to identify the topics and keywords of the posts to understand what users discuss about depression. Depression is a mental disorder that is somewhat sensitive in the online community, which is characterized by accessibility, openness and anonymity. The researchers have implemented a natural language-based data analysis framework that includes components ranging from Facebook data collection to the automated extraction of topics. Using the framework, we collected and analyzed 885 posts created in the past one year from the largest Facebook depression group. To derive more complete and accurate topics, we combined both automated and manual (e.g., stop words removal, topic size determination) methods. Results indicate that users discuss a variety of topics including depression in general, human relations, mood and feeling, depression symptoms, suicide, medical references, family and etc.

Detection of Adverse Drug Reactions Using Drug Reviews with BERT+ Algorithm (BERT+ 알고리즘 기반 약물 리뷰를 활용한 약물 이상 반응 탐지)

  • Heo, Eun Yeong;Jeong, Hyeon-jeong;Kim, Hyon Hee
    • KIPS Transactions on Software and Data Engineering
    • /
    • v.10 no.11
    • /
    • pp.465-472
    • /
    • 2021
  • In this paper, we present an approach for detection of adverse drug reactions from drug reviews to compensate limitations of the spontaneous adverse drug reactions reporting system. Considering negative reviews usually contain adverse drug reactions, sentiment analysis on drug reviews was performed and extracted negative reviews. After then, MedDRA dictionary and named entity recognition were applied to the negative reviews to detect adverse drug reactions. For the experiment, drug reviews of Celecoxib, Naproxen, and Ibuprofen from 5 drug review sites, and analyzed. Our results showed that detection of adverse drug reactions is able to compensate to limitation of under-reporting in the spontaneous adverse drugs reactions reporting system.

Attention-based word correlation analysis system for big data analysis (빅데이터 분석을 위한 어텐션 기반의 단어 연관관계 분석 시스템)

  • Chi-Gon, Hwang;Chang-Pyo, Yoon;Soo-Wook, Lee
    • Journal of the Korea Institute of Information and Communication Engineering
    • /
    • v.27 no.1
    • /
    • pp.41-46
    • /
    • 2023
  • Recently, big data analysis can use various techniques according to the development of machine learning. Big data collected in reality lacks an automated refining technique for the same or similar terms based on semantic analysis of the relationship between words. Since most of the big data is described in general sentences, it is difficult to understand the meaning and terms of the sentences. To solve these problems, it is necessary to understand the morphological analysis and meaning of sentences. Accordingly, NLP, a technique for analyzing natural language, can understand the word's relationship and sentences. Among the NLP techniques, the transformer has been proposed as a way to solve the disadvantages of RNN by using self-attention composed of an encoder-decoder structure of seq2seq. In this paper, transformers are used as a way to form associations between words in order to understand the words and phrases of sentences extracted from big data.

Artificial Intelligence and Literary Sensibility (인공지능과 문학 감성의 상호 연결)

  • Seunghee Sone
    • Science of Emotion and Sensibility
    • /
    • v.26 no.4
    • /
    • pp.115-124
    • /
    • 2023
  • This study explores the intersection of literary studies and artificial intelligence (AI), focusing on the common theme of human emotions to foster complementary advancements in both fields. By adopting a comparative perspective, the paper investigates emotion as a shared focal point, analyzing various emotion-related concepts from both literary and AI perspectives. Despite the scarcity of research on the fusion of AI and literary studies, this study pioneers an interdisciplinary approach within the humanities, anticipating future developments in AI. It proposes that literary sensibility can contribute to AI by formalizing subjective literary emotions, thereby enhancing AI's understanding of complex human emotions. This paper's methodology involves the terminology-centered extraction of emotions, aiming to blend subjective imagination with objective technology. This fusion is expected to not only deepen AI's comprehension of human complexities but also broaden literary research by rapidly analyzing diverse human data. The study emphasizes the need for a collaborative dialogue between literature and engineering, recognizing each field's limitations while pursuing a convergent enhancement that transcends these boundaries.

  • PDF

A Theoretical Study on Indexing Methods using the Metadata for the Automatic Construction of a Thesaurus Browser (시소러스 브라우저 자동구현을 위한 Metadata를 이용한 색인어 처리방안에 대한 연구)

  • Seo , Whee
    • Journal of Korean Library and Information Science Society
    • /
    • v.35 no.4
    • /
    • pp.451-467
    • /
    • 2004
  • This paper is intended to present the theoretical analyses on automatic indexing, which is vital in the process of constructing a thesaurus browser, and clustering algorithms to construct hierarchical relations among terms as well as the methods for the automatic construction of a thesaurus browser. The methods to select the index term automatically in the web documents are studied by surveying the methods for analyzing and processing metadata which conforms to bibliographical roles of traditional paper documents in web documents. Also, the result of the study suggests to adding or involving the metadata in web documents, using the metadata automatic editor because metadata is not listed in most of the web documents.

  • PDF

Thesaurus Updating Using Collective Intelligence: Based on Wikipedia Encyclopedia (집단지성을 활용한 시소러스 갱신에 관한 연구: 위키피디아를 중심으로)

  • Han, Seung-Hee
    • Journal of the Korean Society for information Management
    • /
    • v.26 no.3
    • /
    • pp.25-43
    • /
    • 2009
  • The purpose of this study is to suggest how the classic thesaurus structure of terms and links can be mined and updated from Wikipedia encyclopedia, which is the best practice of collective intelligence. In a comparison with ASIS&T thesaurus, it was found that Wikipedia contains a substantial coverage of domain-specific concepts and semantic relations. Furthermore, it was resulted that the structural characteristics of Wikipedia, such as redirects, categories, and mutual links are suitable to extract semantic relationships of thesaurus. It is needed to apply to update various thesauri, including multilingual thesaurus, in order to generalize the results of this research.

A Research on Automatic Data Extract Method for Herbal Formula Combinations Using Herb and Dosage Terminology - Based on 『Euijongsonik』 - (본초 및 용량 용어를 이용한 방제구성 자동추출방법에 대한 연구 -『의종손익』을 중심으로-)

  • Keum, Yujeong;Lee, Byungwook;Eom, Dongmyung;Song, Jichung
    • Journal of Korean Medical classics
    • /
    • v.33 no.4
    • /
    • pp.67-81
    • /
    • 2020
  • Objectives : This research aims to suggest a automatic data extract method for herbal formula combinations from medical classics' texts. Methods : This research was carried out by using Access of Microsoft Office 365 in Windows 10 of Microsoft. The subject text for extraction was 『Euijongsonik』. Using data sets of herb and dosage terminology, herbal medicinals and their dosages were extracted. Afterwards, using the position value of the character string, the formula combinations were automatically extracted. Results :The PC environment of this research was Intel Core i7-1065G7 CPU 1.30GHz, with 8GB of RAM and a Windows 10 64bit operation system. Out of 6,115 verses, 19,277 herb-dosage combinations were extracted. Conclusions : In this research, it was demonstrated that in the case of classical texts that are available as data, knowledge on herbal medicine could be extracted without human or material resources. This suggests an applicability of classical text knowledge to clinical practice.

Boolean Formulation of Korean Natural Language Queries Using Syntactic Analysis (구문 분석에 기반한 자연어 질의로부터의 불리언 질의 생성)

  • Park, Mi-Hwa;Won, Hyung-Suk;Lee, Won-Il;Lee, Geun-Bae
    • Annual Conference on Human and Language Technology
    • /
    • 1998.10c
    • /
    • pp.73-80
    • /
    • 1998
  • 본 연구는 자연어 질의의 형태 및 구문 정보를 바탕으로 불리언 질의를 생성하는데 그 목적을 둔다. 일반적으로 대부분의 상용정보검색시스템은 입력형식을 검색성능이 종은 불리언 형태로 하고 있으나, 일반 사용자는 자신이 원하는 정보를 불리언 형태로 표현하는데 익숙하지 않다. 그러므로 본 정보검색시스템은 자연어 질의를 기본 입력형태로 하여 사용자의 편의성을 높이고, 이 질의를 범주문법에 기반한 구문분석 결과에 의해 복합명사를 고려한 불리언 형태로 변환하여 검색을 수행함으로써 시스템의 검색 성능의 향상을 도모하였다. 정보검색 실험용 데이터 모음인 KTSET2.0으로 실험한 결과 본 논문에서 제안한 자연어 질의로부터 자동 생성된 불리언 질의의 검객성능이 KTSET2.0에서 제공하는 수동으로 추출한 불리언 질의보다 8% 더 우수한 성능을 보였고, 기존 자연어질의 시스템이 수용해온 방법인 형태소 분석을 거쳐 불용어를 제거한 후 Vector 모델을 적용하여 검색을 수행한 경우보다는 23% 더 나은 성능을 보였다.

  • PDF

Development of Mathematics Learning Contents based on Storytelling for Concept Learning (초등학교 수학과 개념학습을 위한 스토리텔링 기반학습 콘텐츠 개발)

  • Oh, Young-Bum;Park, Sang-Seop
    • Journal of The Korean Association of Information Education
    • /
    • v.14 no.4
    • /
    • pp.537-545
    • /
    • 2010
  • The purpose of this paper is to develop mathematics learning contents for elementary school 3rd graders and to verify the educational effectiveness of contents developed. An ADDIE model was applied to develop mathematics learning contents based on storytelling for concept learning. After extracting 54 concepts from the mathematics curriculum, researchers designed strategies using concepts that were combined with context which is familiar to young students. Researchers implemented a survey and interview to students and teachers to verify the effectiveness of contents. As a result, the understanding, interest, concentration, and expectation of students toward the contents developed were very high, and teachers also mentioned that these contents could be very useful teaching materials for motivation.

  • PDF

Construction of Variable Pattern Net for Korean Sentence Understanding and Its Application (한국어 문장이해를 위한 가변패턴네트의 구성과 응용)

  • Han, Gwang-Rok
    • The Transactions of the Korea Information Processing Society
    • /
    • v.2 no.2
    • /
    • pp.229-236
    • /
    • 1995
  • The conceptual world of sentence is composed f substantives(nouns) and verbal. The verbal is a semantic center of sentence, the substantives are placed under control of verbal, and they are combined in a various way. In this paper, the structural relation of verbal and substantives are analyzed and the phrase unit sentence which is derived from the result of morphological analysis is interpreted by a variable pattern net. This variable pattern net analyzes the phrases syntactically and semantically and extracts conceptual units of clausal form. This paper expands the traditionally restricted Horn clause theory to the general sentence, separates a simple sentence from a complex sentence automatically, constructs knowledge base by clausal form of logical conceptual units, and applies it to a question-answering system.

  • PDF