• Title/Summary/Keyword: 의견 문서 검색

Search Result 12, Processing Time 0.027 seconds

An Opinionated Document Retrieval System based on Hybrid Method (혼합 방식에 기반한 의견 문서 검색 시스템)

  • Lee, Seung-Wook;Song, Young-In;Rim, Hae-Chang
    • Journal of the Korean Society for information Management
    • /
    • v.25 no.4
    • /
    • pp.115-129
    • /
    • 2008
  • Recently, as its growth and popularization, the Web is changed into the place where people express, share and debate their opinions rather than the space of information seeking. Accordingly, the needs for searching opinions expressed in the Web are also increasing. However, it is difficult to meet these needs by using a classical information retrieval system that only concerns the relevance between the user's query and documents. Instead, a more advanced system that captures subjective information through documents is required. The proposed system effectively retrieves opinionated documents by utilizing an existing information retrieval system. This paper proposes a kind of hybrid method which can utilize both a dictionary-based opinion analysis technique and a machine learning based opinion analysis technique. Experimental results show that the proposed method is effective in improving the performance.

A Study on the Characteristics of Opinion Retrieval Using Term Statistical Analysis in Opinion Documents (의견 문서의 단어 통계 분석을 통한 의견 검색 특성에 관한 연구)

  • Han, Kyoung-Soo
    • Journal of the Korea Society of Computer and Information
    • /
    • v.15 no.11
    • /
    • pp.21-29
    • /
    • 2010
  • Opinion retrieval which searches the opinions expressed in documents by users cannot outperform significantly yet traditional topical retrieval which searches the facts. Therefore, the focus of this paper is to identify the statistical characteristics which can be applied to opinion retrieval by comparing and analyzing the term statistics of opinion and non-opinion documents in the blog domain. The TREC Blogs06 collection and 150 TREC topics are used in the experiments. The difference between term probability distributions in opinion documents is measured by JS divergence, and the difference according to the topic types and topic domains is also investigated. Moreover, the term probabilities of opinion terms are analyzed comparatively. The main findings of this study include the following: it is necessary to consider the topic-specific characteristics for the opinion detection; it is effective to extract positive and negative opinion terms according to the topics; the topic types are complementary to the topic domains; and special attention has to be given to the usage of the positive opinion terms.

Distributed Document Classification System using Susceptibility Terms and Patterns (감성용어 및 패턴을 이용한 감성기반 분산 문서분류시스템)

  • Kim, Myung-Kyu;In, Joo-Ho;Chae, Soo-Hoan
    • Proceedings of the Korean Information Science Society Conference
    • /
    • 2007.06d
    • /
    • pp.356-360
    • /
    • 2007
  • 인터넷이 폭 넓게 보급되어 개인의 의견을 개진할 기회가 확대됨에 따라 정치, 경제 등의 사안이나 제품 기업의 이미지, 공인에 대한 긍정.부정의 글을 개진할 수 있게 되었다. 이러한 현상에 따라 기업, 제품, 혹은 공공의 분야에서 일반 개인들이 어떻게 생각하는가에 대한 분석 및 자료수집의 필요성이 높아지고 있다. 감성용어 문서분류시스템은 문서의 내용 중 감성기반의 용어들에 기반하여 이에 대한 패턴을 정의하고 이에 대응하는 범주에 문서를 자동으로 할당하는 작업으로써 효율적인 정보 관리 및 검색을 가능하게 한다. 하지만 자동문서 분류를 하기 위해서는 방대한 양의 데이터를 수집 보관하기 위한 분산 환경이 반드시 필요하다. 본 논문에서는 감성기반 문서분류 시스템을 위한 감성용어 추출 및 긍정, 부정의 패턴을 검색해 자동 문서분류를 위해 RTI(Run Time Infrastructure)를 통한 분산 시스템 환경으로 구성하였다.

  • PDF

Question and Answering System through Search Result Summarization of Q&A Documents (Q&A 문서의 검색 결과 요약을 활용한 질의응답 시스템)

  • Yoo, Dong Hyun;Lee, Hyun Ah
    • KIPS Transactions on Software and Data Engineering
    • /
    • v.3 no.4
    • /
    • pp.149-154
    • /
    • 2014
  • A user should pick up relevant answers by himself from various search results when using user participation question answering community like Knowledge-iN. If refined answers are automatically provided, usability of question answering community must be improved. This paper divides questions in Q&A documents into 4 types(word, list, graph and text), then proposes summarizing methods for each question type using document statistics. Summarized answers for word, list and text type are obtained by question clustering and calculating scores for words using frequency, proximity and confidence of answers. Answers for graph type is shown by extracting user opinion from answers.

A Study on Web Mining System for Real-Time Monitoring of Opinion Information Based on Web 2.0 (의견정보 모니터링을 위한 웹 마이닝 시스템에 관한 연구)

  • Joo, Hae-Jong;Hong, Bong-Hwa;Jeong, Bok-Cheol
    • Journal of the Korea Society of Computer and Information
    • /
    • v.15 no.1
    • /
    • pp.149-157
    • /
    • 2010
  • As the use of the Internet has recently increased, the demand for opinion information posted on the Internet has grown. However, such resources only exist on the website. People who want to search for information on the Internet find it inconvenient to visit each website. This paper focuses on the opinion information extraction and analysis system through Web mining that is based on statistics collected from Web contents. That is, users' opinion information which is scattered across several websites can be automatically analyzed and extracted. The system provides the opinion information search service that enables users to search for real-time positive and negative opinions and check their statistics. Also, users can do real-time search and monitoring about other opinion information by putting keywords in the system. Proposed technologies proved to have outstanding capabilities in comparison to existing ones through tests. The capabilities to extract positive and negative opinion information were assessed. Specifically, test movie review sentence testing data was tested and its results were analyzed.

Design and Implementation of Standard Document Management System (XML.을 적용한 표준 문서 관리 시스템의 설계 및 구현)

  • 이준섭;유정연;권석훈;나재열;이규철;구경철;박기식;박치항
    • Journal of the Korean Society for Library and Information Science
    • /
    • v.35 no.1
    • /
    • pp.77-99
    • /
    • 2001
  • The Request of the information exchange is increasing because of the advanced rapid science and technology. But a different system environment has occurred many problems on the information exchange. The information exchange on based XML is a solution to the problem. It takes effect in the standard document management application that is make standard document to cooperate with many researchers mutually. This paper is design and implementation of system model for efficient exchange, store, search and manage document on based XML document in established course of standard document.

  • PDF

An application of XML Technology to a Standard Document Management (표준 문서 관리에 XML 기술의 적용)

  • Yu, Jeon-Youn;Na, Jae-Youl;Kwon, Seok-Hun;Koo, Kyoung-Chul;Lee, Jun-Seob;Park, Ki-Shik;Lee, Kyu-Chul
    • Proceedings of the Korea Information Processing Society Conference
    • /
    • 2000.10a
    • /
    • pp.461-464
    • /
    • 2000
  • 급속한 과학 기술의 발달로 인해 상호간의 정보교환의 요구는 증가하게 되었으나, 서로 다른 시스템 환경으로 인해 정보 교환에 많은 문제점이 발생하였다. XML 기반의 정보 교환은 이를 위한 해결하기 위한 방안이며, 여러 연구자들이 공동으로 의견을 교환하여 작성해야 하는 표준 문서의 관리에 XML을 적용하면 매우 효과적이다. 본 논문에서는 보다 생산적이며 효율적인 표준 관련 문서의 공유 및 상호 교환을 위해 차세대 인터넷 문서의 표준인 XML을 기반으로 문서를 교환하며 이를 효과적으로 저장, 검색, 관리할 수 있는 시스템 모델을 제안하였다. 또한, 표준 제정 과정에서 이루어지는 문서의 편집, 수정, 삭제의 기능을 사용자의 권한에 따라 제한함으로써 효율적인 정보교환 기능을 제안하였다.

  • PDF

Construction of Evaluation-Annotated Datasets for EA-based Clothing Recommendation Chatbots (패션앱 후기글 평가분석에 기반한 의류 검색추천 챗봇 개발을 위한 학습데이터 EVAD 구축)

  • Choi, Su-Won;Hwang, Chang-Hoe;Yoo, Gwang-Hoon;Nam, Jee-Sun
    • Annual Conference on Human and Language Technology
    • /
    • 2021.10a
    • /
    • pp.467-472
    • /
    • 2021
  • 본 연구는 패션앱 후기글에 나타나는 구매자의 의견에 대한 '평가분석(Evaluation Analysis: EA)'을 수행하여, 이를 기반으로 상품의 검색 및 추천을 수행하는 의류 검색추천 챗봇을 개발하는 LICO 프로젝트의 언어데이터 구축의 일환으로 수행되었다. '평가분석 트리플(EAT)'과 '평가기반요청 쿼드러플(EARQ)'의 구성요소들에 대한 주석작업은, 도메인 특화된 단일형 핵심어휘와 다단어(MWE) 핵심패턴들을 FST 방식으로 구조화하는 DECO-LGG 언어자원에 기반하여 반자동 언어데이터 증강(SSP) 방식을 통해 진행되었다. 이 과정을 통해 20여만 건의 후기글 문서(230만 어절)로 구성된 EVAD 평가주석데이터셋이 생성되었다. 여성의류 도메인의 평가분석을 위한 '평가속성(ASPECT)' 성분으로 14가지 유형이 분류되었고, 각 '평가속성'에 연동된 '평가내용(VALUE)' 쌍으로 전체 35가지의 {ASPECT-VALUE} 카테고리가 분류되었다. 본 연구에서 구축된 EVAD 평가주석 데이터의 성능을 평가한 결과, F1-Score 0.91의 성능 평가를 획득하였으며, 이를 통해 향후 다른 도메인으로의 확장된 적용 가능성이 유효함을 확인하였다.

  • PDF

Design and Implementation of RFID Based Computer Equipment Management System (RFID 기반 전산 장비 관리 시스템 설계 및 구현)

  • Lim, Hyunjeong;Bok, Kyoungsoo;Yoo, Jaesoo
    • The Journal of the Korea Contents Association
    • /
    • v.19 no.8
    • /
    • pp.79-92
    • /
    • 2019
  • The existing computer equipment management system is designed to computerize and manage equipment information that was managed only by documents in the computer room. However, the existing system focuses on computerizing the equipment arranged in the document, so there is a limit to the equipment that can be registered and it is difficult to find the necessary equipment. In addition, it caused inconvenience to confirm whether the registered equipment is currently used. In this paper, we redesign and implement the computer equipment management system to solve the problem. For this purpose, the existing computer equipment management system was thoroughly analyzed throughout, and the system is designed and implemented to improve the system by reflecting the opinions of public officials and management companies using the system. In the performance evaluation, the efficiency of the improved system is proved by comparing the number of equipment registrations and equipment search accuracy of the existing system.

Design and Implementation of Potential Advertisement Keyword Extraction System Using SNS (SNS를 이용한 잠재적 광고 키워드 추출 시스템 설계 및 구현)

  • Seo, Hyun-Gon;Park, Hee-Wan
    • Journal of the Korea Convergence Society
    • /
    • v.9 no.7
    • /
    • pp.17-24
    • /
    • 2018
  • One of the major issues in big data processing is extracting keywords from internet and using them to process the necessary information. Most of the proposed keyword extraction algorithms extract keywords using search function of a large portal site. In addition, these methods extract keywords based on already posted or created documents or fixed contents. In this paper, we propose a KAES(Keyword Advertisement Extraction System) system that helps the potential shopping keyword marketing to extract issue keywords and related keywords based on dynamic instant messages such as various issues, interests, comments posted on SNS. The KAES system makes a list of specific accounts to extract keywords and related keywords that have most frequency in the SNS.