• Title/Summary/Keyword: 토픽 추출 (topic extraction)

Entitymetrics Analysis of the Research Works of Dong-ju Yun using Textmining (텍스트마이닝을 이용한 윤동주 연구의 개체계량학적 분석)

  • Park, Jinkyeun;Kim, Taekyoun;Song, Min
    • Journal of the Korean BIBLIA Society for library and Information Science / v.28 no.1 / pp.191-207 / 2017
  • This paper applies entitymetrics analysis to research on Dong-ju Yun, a Korean poet whose works, religion, and life have been studied by many researchers. We collected 1,076 papers about Dong-ju Yun and applied several approaches, including co-author citation analysis and topic modeling, to identify topic trends in the study of Dong-ju Yun. We also extracted entities such as personal names and titles of literary works from the abstracts to examine the relationships among them. The results allow us to identify topic trends objectively and to infer implicit relationships between key concepts associated with Dong-ju Yun from text data. Moreover, we observed research sub-topics such as life, poem, aesthetic existence, comparative literature, literary translation, and religious beliefs. This paper shows how entitymetrics can be used to study intellectual structures in the humanities.

A Study on Graph-based Topic Extraction from Microblogs (마이크로블로그를 통한 그래프 기반의 토픽 추출에 관한 연구)

  • Choi, Don-Jung;Lee, Sung-Woo;Kim, Jae-Kwang;Lee, Jee-Hyong
    • Journal of the Korean Institute of Intelligent Systems / v.21 no.5 / pp.564-568 / 2011
  • Microblogs have become a popular means of information delivery with the spread of smartphones. They reflect users' interests more quickly than other media. In particular, for subjects that attract many users, microblogs can supply rich information originating from various sources. Nevertheless, obtaining useful information from microblogs has been considered a hard problem because they contain too much noise. Various methods have been proposed to extract and track subjects from particular documents, yet these methods do not work effectively on microblogs, which consist of short phrases. In this paper, we propose a graph-based topic extraction and partitioning method to understand users' interests regarding a given keyword. The proposed method generates a keyword graph from the co-occurrences of terms in the microblogs and then splits the graph using a network partitioning method. When applied to several keywords, the method performed well at finding a topic about each keyword and partitioning the topic into sub-topics.
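
The two steps named in this abstract (building a keyword graph from term co-occurrences, then splitting it) can be illustrated with a minimal sketch. This is not the authors' implementation: the whitespace tokenizer, the `min_weight` threshold, the removal of the query keyword itself, and the use of connected components as a simple stand-in for the unspecified network partitioning method are all assumptions made for illustration.

```python
from collections import Counter
from itertools import combinations


def build_cooccurrence_graph(posts, keyword, min_weight=2):
    """Return {term: set(neighbours)} for term pairs sharing >= min_weight posts.

    The query keyword is dropped because it appears in every post
    (an assumption of this sketch, not a detail given in the abstract).
    """
    pair_counts = Counter()
    for post in posts:
        terms = sorted(set(post.lower().split()) - {keyword})
        pair_counts.update(combinations(terms, 2))

    graph = {}
    for (a, b), weight in pair_counts.items():
        if weight >= min_weight:
            graph.setdefault(a, set()).add(b)
            graph.setdefault(b, set()).add(a)
    return graph


def connected_components(graph):
    """Split the graph into connected components (stand-in for partitioning)."""
    seen, components = set(), []
    for start in sorted(graph):
        if start in seen:
            continue
        stack, component = [start], set()
        while stack:
            node = stack.pop()
            if node in component:
                continue
            component.add(node)
            stack.extend(graph[node] - component)
        seen |= component
        components.append(component)
    return components


# Hypothetical microblog posts about the keyword "galaxy".
posts = [
    "galaxy camera zoom",
    "galaxy camera zoom night",
    "galaxy battery drain",
    "galaxy battery drain fast",
]
sub_topics = connected_components(build_cooccurrence_graph(posts, "galaxy"))
print(sub_topics)
```

Each component groups terms that repeatedly appear together, which gives a rough picture of sub-topics; a real system would likely use a proper community detection or graph partitioning algorithm instead.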

Improvement of recommendation system using attribute-based opinion mining of online customer reviews

  • Misun Lee;Hyunchul Ahn
    • Journal of the Korea Society of Computer and Information / v.28 no.12 / pp.259-266 / 2023
  • In this paper, we propose an algorithm that improves the accuracy of collaborative filtering (CF) using attribute-based opinion mining (ABOM). The experiment used 1,227 online consumer reviews of smartphone apps written by domestic smartphone users. After morpheme analysis with the KKMA (Kkokkoma) analyzer and emotional word analysis with KOSAC, attributes are extracted using LDA topic modeling, and the topic modeling results for each weighted review are used to add up the collaborative filtering ratings and the sentiment score. Performance was evaluated with MAE, MAPE, and RMSE, which measure average prediction error. In the experiments, we predicted online customers' app ratings (APP_Score) by combining traditional collaborative filtering with the ABOM technique, which combines LDA attribute extraction and sentiment analysis. The analysis showed that ratings predicted by CF with attribute-based opinion mining were more accurate than those predicted by traditional collaborative filtering alone.
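
For readers unfamiliar with the three error measures named in this abstract, the sketch below computes MAE, MAPE, and RMSE using their standard definitions. The `blend` helper is purely hypothetical: the abstract says only that the CF ratings and the sentiment score are added up, so the weighted sum, the `alpha` parameter, and the assumption that both scores share one rating scale are illustrative choices, not the paper's method.

```python
import math


def mae(actual, predicted):
    """Mean absolute error."""
    return sum(abs(a - p) for a, p in zip(actual, predicted)) / len(actual)


def mape(actual, predicted):
    """Mean absolute percentage error, in percent (actual values must be non-zero)."""
    return 100 * sum(abs((a - p) / a) for a, p in zip(actual, predicted)) / len(actual)


def rmse(actual, predicted):
    """Root mean squared error."""
    return math.sqrt(sum((a - p) ** 2 for a, p in zip(actual, predicted)) / len(actual))


def blend(cf_rating, sentiment_score, alpha=0.5):
    """Hypothetical weighted sum of a CF rating and a sentiment score on the same scale."""
    return alpha * cf_rating + (1 - alpha) * sentiment_score


# Made-up ratings, for illustration only.
actual = [4, 2, 5]
predicted = [3, 2, 4]
print(mae(actual, predicted), mape(actual, predicted), rmse(actual, predicted))
```

Lower values are better for all three measures. MAPE is undefined when an actual rating is zero, which is worth checking before applying it to real rating data.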

A Keyword Analysis of Collection Development Policies of University and Public Libraries Using Text Mining (텍스트 마이닝을 활용한 대학도서관과 공공도서관의 장서개발 정책 키워드 분석)

  • Da-Hyeon Lee;Dong-Hee Shin
    • Journal of the Korean Society for Library and Information Science / v.58 no.1 / pp.285-302 / 2024
  • For this article, we conducted frequency analysis, topic modeling, and network analysis on eleven texts related to collection development policy found in the National Library of Korea. We derived the main keywords related to collection development policies and analyzed the relationships between them. We then conducted a phi coefficient analysis to identify the characteristics of the collection development policies of university libraries and public libraries by category. The results showed that "material," "library," "collection development," "user," and "collection" were the main keywords in both frequency analysis and network centrality. Meanwhile, the phi coefficient analysis revealed that keywords such as "university," "construction," "student," "target," and "cost" were prevalent in university libraries, indicating that users' academic needs and the discussion of digital resources were primary issues, while keywords related to the information needs of various user groups, including "adults," "survey," "feature," and "religion," appeared in public libraries.
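
The coefficient analysis mentioned in this abstract is presumably the phi coefficient, a standard measure of association for a 2x2 contingency table. The sketch below shows how it could score a keyword's association with one library category. The table layout (keyword present or absent against university or public documents) and all counts are hypothetical, since the abstract does not give them.

```python
import math


def phi_coefficient(a, b, c, d):
    """Phi coefficient of a 2x2 table.

    a: university documents containing the keyword
    b: university documents without the keyword
    c: public-library documents containing the keyword
    d: public-library documents without the keyword
    Returns 0.0 when any marginal total is zero (coefficient undefined).
    """
    denominator = math.sqrt((a + b) * (c + d) * (a + c) * (b + d))
    if denominator == 0:
        return 0.0
    return (a * d - b * c) / denominator


# Hypothetical counts: the keyword appears in 4 of 5 university texts
# and in 1 of 6 public-library texts.
print(phi_coefficient(4, 1, 1, 5))
```

A value near +1 marks a keyword as characteristic of university library policies under this layout, a value near -1 as characteristic of public library policies, and a value near 0 as not distinguishing between the two.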

Topic Map automatic construction Study for research information resource (학술정보자원에 대한 Topic Map 자동구축 방안)

  • Jang, Hwa-Su;Ko, Il-Ju
    • Proceedings of the Korean Society of Computer Information Conference / 2009.01a / pp.13-18 / 2009
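  • A problem encountered in building a Topic Map is that domain experts are unfamiliar with the composition and structure of Topic Maps. To address this, rather than authoring every element of a Topic Map from scratch, it would be more efficient in time and cost, when existing information resources are available for the target domain, to reuse them as much as possible: extract all the elements, then convert them into a Topic Map ontology and use it. This study proposes that using the database schema and the metadata described in MARC as information sources for extracting reusable Topic Map elements from an existing academic database is an efficient development method for representing the complex conceptual structure of relationships in basic scholarly materials, the material types, and the semantic correlations among materials.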

Product reputation mining based on sentiment analysis (감성 분석 기반의 제품 평판 마이닝)

  • Song, In-Hwan;Han, Jinju;On, Byung-Won
    • Annual Conference on Human and Language Technology / 2019.10a / pp.429-433 / 2019
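  • With the spread of smartphones, a growing number of consumers consult product reviews on websites and social media when purchasing products. Product reviews on e-commerce sites are often useful information for prospective buyers. However, because prospective buyers must find the review data for a product themselves and read and analyze all of it one by one, the process is time-consuming, and the information that unprocessed data can provide is limited. Such reviews also make it difficult to grasp a product's characteristics. In this paper, we designed and implemented a system that extracts a product's main issues and provides product analysis and evaluation through sentiment analysis and sentiment summarization of those issues. By applying the system to mobile phone products, we showed that consumers can quickly receive a product's main issues and processed analysis results in visual form without having to analyze vast amounts of review data themselves.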

Fintech Trends and Mobile Payment Service Analysis in Korea: Application of Text Mining Techniques (국내 핀테크 동향 및 모바일 결제 서비스 분석: 텍스트 마이닝 기법 활용)

  • An, JungKook;Lee, So-Hyun;An, Eun-Hee;Kim, Hee-Woong
    • Informatization Policy / v.23 no.3 / pp.26-42 / 2016
  • Recently, with the rapid growth of the O2O market, Fintech, which combines finance and ICT, has been drawing attention as an innovation leading the "O2O of finance," along with Fintech-based payment, authentication, and security technologies and related services. For new technology industries such as Fintech, technical sources and related systems and regulations are important, but previous studies on Fintech lack in-depth research on the systems and technological trends of the domestic Fintech industry. Therefore, this study aims to analyze domestic Fintech trends and to draw insights into the future direction of technology and systems in the domestic Fintech industry by comparing Kakao Pay and Samsung Pay, two representative domestic mobile payment services. Through a complete enumeration survey of tweets mentioning Fintech up to June 2016, this study performed and visualized topic extraction, sentiment analysis, and keyword analysis. The results show that various topics emerged around technologies and systems between 2014 and 2016, and that different keywords and reactions were extracted for Samsung Pay, whose topics are based on "devices" such as Galaxy, and Kakao Pay, whose topics are based on "service" such as KakaoTalk. This study contributes by analyzing unstructured social media data by period using social media mining and by quantifying consumers' expectations of and reactions to services through sentiment analysis. It is expected to serve as a foundation for the development of the Fintech industry by presenting a strategic direction to Fintech practitioners.

Matching of Topic Words and Non-Sympathetic Types on YouTube Videos for Predicting Video Preference (영상 선호도 예측을 위한 유튜브 영상에 대한 토픽어와 비공감 유형 매칭)

  • Jung, Jimin;Kim, Seungjin;Lee, Dongyun;Kim, Gyotae
    • Proceedings of the Korean Institute of Information and Communication Sciences Conference / 2021.10a / pp.189-192 / 2021
  • YouTube, the world's largest video sharing platform, is popular with many users because it offers numerous videos and makes it easy to find helpful information. However, the like/dislike ratio of each video varies with its subject or upload time, even within the same channel; previous studies have therefore tried to understand the reason by inspecting numerical statistics such as this ratio and the view count. Such statistics help show how much each video is preferred, but they are clearly limited in identifying the cause of that preference. Therefore, this study aims to determine the reasons behind the preference by matching topic words extracted from the comments on each video with non-sympathetic types defined in advance. Among the top 10 channels in the 'pets' and 'cooking' categories, where outliers occur frequently, the top 10 videos with the highest ratio (threshold for pets: 4.000; threshold for cooking: 0.723) were selected. A total of 11,110 comments were collected, and topics were extracted and matched with non-sympathetic types. The experimental results confirmed that, by analyzing the comments, it is possible to predict whether the like/dislike ratio will be high and which non-sympathetic type will apply.

Classification of Public Perceptions toward Smog Risks on Twitter Using Topic Modeling (Topic Modeling을 이용한 Twitter상에서 스모그 리스크에 관한 대중 인식 분류 연구)

  • Kim, Yun-Ki
    • Journal of Cadastre & Land InformatiX / v.47 no.1 / pp.53-79 / 2017
  • The main purpose of this study was to detect and classify public perceptions of smog disasters on Twitter using topic modeling. To this end, and to identify gaps in the literature, the study reviewed the literature on public opinion toward smog disasters and on topic modeling. The review indicated that there are large gaps in the related literature. The author formulated five research questions to fill these gaps. To answer them, the study carried out data extraction, word cloud analysis on the cleaned data, construction of a term network, correlation analysis, hierarchical cluster analysis, topic modeling with LDA, and stream graphs. The results revealed large differences between New York and London in the most frequent terms, the shapes of the term networks, the types of correlation, and the patterns of change in smog-related topics. The author therefore found positive answers to four of the five research questions and a partially positive answer to Research Question 4. Finally, on the basis of these results, the author suggested policy implications and recommendations for future study.

Analysis on Trend of Study Related to Computational Thinking Using Topic Modeling (토픽 모델링을 이용한 컴퓨팅 사고력 관련 연구 동향 분석)

  • Moon, Seong-Yun;Song, Ki-Sang
    • Journal of The Korean Association of Information Education / v.23 no.6 / pp.607-619 / 2019
  • As software education was introduced through the 2015 revised curriculum, various research activities have been carried out to improve learners' computational thinking beyond the existing ICT literacy and software utilization education. In light of this change, the purpose of this study is to examine trends in research related to computational thinking, which is emphasized in software education. To this end, we extracted keywords from 190 papers on computational thinking published from January 2014 to September 2019 and conducted frequency analysis, word cloud analysis, connection centrality analysis, and topic modeling on them. The topic modeling analysis showed that the main studies so far cover 'computational thinking education program', 'computational thinking education for pre-service teacher education', 'robot utilization education for computational thinking', 'assessment of computational thinking', and 'computational thinking connected education'. This method made it possible to grasp the main research trends in computational thinking to date and to see which aspects of computational thinking education researchers consider most important.