• Title/Summary/Keyword: 웹 사용 마이닝

Search Result 160, Processing Time 0.115 seconds

Classifying Korean Comparative Sentences Using Transformation-based Learning (변환 기반 학습을 이용한 한국어 비교 문장 유형 분류)

  • Yang, Seon;Ko, Youngjoong
    • Annual Conference on Human and Language Technology
    • /
    • 2009.10a
    • /
    • pp.31-34
    • /
    • 2009
  • 본 연구의 목표는 비교 문장들을 일곱 가지 유형으로 자동 분류하는 것으로서, 비교 문장 추출, 비교 문장 유형 분류, 유형별 비교 관계 분석으로 이어지는 비교마이닝 세 단계 중 두 번째 과제이다. 본 연구에서는 변환 기반 학습(Transformation-based Learning) 기법을 이용한다. 자연어 처리 분야 여러 부문에서 사용되고 있는 변환 기반 학습은 오류를 감소시키는 최적의 규칙을 자동으로 생성하여 정답을 찾는 규칙 기반 학습 방법이다. 웹상의 다양한 도메인에서 추출한 비교 문장들을 대상으로 실험한 결과, 일곱 가지 비교 문장 유형을 분류하는데 있어서 정확도 80.01%의 우수한 성능을 산출하였다.

  • PDF

Review Analysis by using the Opinion Mining Techniques (오피니언 마이닝을 이용한 상품평 분석)

  • Song, Jun Seok;Cho, Kyung Soo;Kim, Ung-mo
    • Proceedings of the Korea Information Processing Society Conference
    • /
    • 2010.11a
    • /
    • pp.35-38
    • /
    • 2010
  • 인터넷 시장이 빠르게 성장함에 따라 사용자들의 참여도가 매우 높아졌다. 인터넷 사용자들은 인터넷 쇼핑의 상품에 관한 의견을 웹 상에 표현하기 시작했고, 실제 소비자이 판단하는 데에 많은 영향을 미치고 있다. 하지만 현재에 들어 그 양이 엄청나게 방대해 졌기 때문에 사용자들이 원하는 정보만을 찾아내는 것은 어려운 일이다. 본 논문에서는 사용들이 작성한 인터넷 쇼핑에서 상품평에 관한 리뷰를 모아 방대한 양에서 오피니언 마이닝 기법을 이용해 유용한 정보를 효율적으로 도출해서 사용자가 원하는 정보를 요약하여 제공하는 방법을 제안한다. 이러한 방법을 통해서 사용자는 상품을 구매하기 전에 좀 더 객관적이고 효율적으로 판단을 내릴 수 있을 것이다.

Design of Personalized System using an Association Rule (연관규칙을 이용한 개인화 시스템 설계)

  • Yun, Jong-Chan;Youn, Sung-Dae
    • Journal of the Korea Institute of Information and Communication Engineering
    • /
    • v.11 no.6
    • /
    • pp.1089-1098
    • /
    • 2007
  • Currently, user require is diverse on the Web. Furthermore, each web user is wishing to retrieve data or goods that hey want to look for more conveniently and more quickly. Because different search criteria and dispositions of web users, they lead to unnecessary repeated operations in order to use implemented by web designer. In this paper, we suggest the system that analyzes user patterns on the Web using the technique of log file analysis and transfers more effectively the information of web sites to users. And we analyze the log file for customer data in the system the proposed method are implemented by means of EC-Miner that is one of the tool of datamining, and aims to offer appropriate Layout corresponding with personalization by giving weight to each transport path.

Sensitivity Identification Method for New Words of Social Media based on Naive Bayes Classification (나이브 베이즈 기반 소셜 미디어 상의 신조어 감성 판별 기법)

  • Kim, Jeong In;Park, Sang Jin;Kim, Hyoung Ju;Choi, Jun Ho;Kim, Han Il;Kim, Pan Koo
    • Smart Media Journal
    • /
    • v.9 no.1
    • /
    • pp.51-59
    • /
    • 2020
  • From PC communication to the development of the internet, a new term has been coined on the social media, and the social media culture has been formed due to the spread of smart phones, and the newly coined word is becoming a culture. With the advent of social networking sites and smart phones serving as a bridge, the number of data has increased in real time. The use of new words can have many advantages, including the use of short sentences to solve the problems of various letter-limited messengers and reduce data. However, new words do not have a dictionary meaning and there are limitations and degradation of algorithms such as data mining. Therefore, in this paper, the opinion of the document is confirmed by collecting data through web crawling and extracting new words contained within the text data and establishing an emotional classification. The progress of the experiment is divided into three categories. First, a word collected by collecting a new word on the social media is subjected to learned of affirmative and negative. Next, to derive and verify emotional values using standard documents, TF-IDF is used to score noun sensibilities to enter the emotional values of the data. As with the new words, the classified emotional values are applied to verify that the emotions are classified in standard language documents. Finally, a combination of the newly coined words and standard emotional values is used to perform a comparative analysis of the technology of the instrument.

Research on User's Query Processing in Search Engine for Ocean using the Association Rules (연관 규칙 탐사 기법을 이용한 해양 전문 검색 엔진에서의 질의어 처리에 관한 연구)

  • 하창승;윤병수;류길수
    • Journal of the Korea Society of Computer and Information
    • /
    • v.8 no.2
    • /
    • pp.8-15
    • /
    • 2003
  • Recently various of information suppliers provide information via WWW so the necessary of search engine grows larger. However the efficiency of most search engines is low comparatively because of using simple pattern match technique between user's query and web document. A specialized search engine returns the specialized information depend on each user's search goal. It is trend to develop specialized search engines in many countries. However, most such engines don't satisfy the user's needs. This paper proposes the specialized search engine for ocean information that uses user's query related with ocean and the association rules in web data mining can prove relation between web documents. So this search engine improved the recall of data and the precision in existent search method.

  • PDF

A Study on Behavior Rule Induction Method of Web User Group using 2-tier Clustering (2-계층 클러스터링을 사용한 웹 사용자 그룹의 행동규칙추출방법에 관한 연구)

  • Hwang, Jun-Won;Song, Doo-Heon;Lee, Chang-Hoon
    • The KIPS Transactions:PartD
    • /
    • v.15D no.1
    • /
    • pp.139-146
    • /
    • 2008
  • It is very important to identify useful web user group and induce their behavior pattern in eCRM domain. Inducing user group with a similar inclination, a reliability of user group decreases because there is an uncertainty in online user data. In this paper, we have applied the 2-tier clustering, which uses the outcome of interaction with data from other tiers. Also we propose a method which induces user behavior pattern from a cluster and compare C4.5 with our method.

Collaboration Framework based on Social Semantic Web for Cloud Systems (클라우드 시스템에서 소셜 시멘틱 웹 기반 협력 프레임 워크)

  • Mateo, Romeo Mark A.;Yang, Hyun-Ho;Lee, Jae-Wan
    • Journal of Internet Computing and Services
    • /
    • v.13 no.1
    • /
    • pp.65-74
    • /
    • 2012
  • Cloud services are used for improving business. Moreover, customer relationship management(CRM) approaches use social networking as tools to enhance services to customers. However, most cloud systems do not support the semantic structures, and because of this, vital information from social network sites is still hard to process and use for business strategy. This paper proposes a collaboration framework based on social semantic web for cloud system. The proposed framework consists of components to support social semantic web to provide an efficient collaboration system for cloud consumers and service providers. The knowledge acquisition module extracts rules from data gathered by social agents and these rules are used for collaboration and business strategy. This paper showed the implementations of processing of social network site data in the proposed semantic model and pattern extraction which was used for the virtual grouping of cloud service providers for efficient collaboration.

Personalized Advertisement Service Method Using Web Log Mining (웹로그 마이닝을 이용한 개인화 광고 서비스 기법)

  • Kim, Seok-Hun;Kim, Eun-Soo
    • The Journal of Korean Association of Computer Education
    • /
    • v.8 no.1
    • /
    • pp.117-127
    • /
    • 2005
  • Numerous internet pop advertisement are being provided according to the rapid development of e-commercial and a rise in users. However, it has not been based on analysis of users' inclination but just one-sided providing. With that reason, many web-site provider want to advertis e more efficient and distinguished Internet-advertisement as analyzing Server's Log accessed. In this thesis, we have studied and tested relatively simply adoption system to provide personalized advertisement service. In order to influence personal disposition to system as the most effective way, it first of all uses History files as source data and after refining it, it can search not only visitors' inclination but also the others' visit-list on the other server. As a result of it, it can make advertisement more reality and activity.

  • PDF

OLAP System and Performance Evaluation for Analyzing Web Log Data (웹 로그 분석을 위한 OLAP 시스템 및 성능 평가)

  • 김지현;용환승
    • Journal of Korea Multimedia Society
    • /
    • v.6 no.5
    • /
    • pp.909-920
    • /
    • 2003
  • Nowadays, IT for CRM has been growing and developed rapidly. Typical techniques are statistical analysis tools, on-line multidimensional analytical processing (OLAP) tools, and data mining algorithms (such neural networks, decision trees, and association rules). Among customer data, web log data is very important and to use these data efficiently, applying OLAP technology to analyze multi-dimensionally. To make OLAP cube, we have to precalculate multidimensional summary results in order to get fast response. But as the number of dimensions and sparse cells increases, data explosion occurs seriously and the performance of OLAP decreases. In this paper, we presented why the web log data sparsity occurs and then what kinds of sparsity patterns generate in the two and t.he three dimensions for OLAP. Based on this research, we set up the multidimensional data models and query models for benchmark with each sparsity patterns. Finally, we evaluated the performance of three OLAP systems (MS SQL 2000 Analysis Service, Oracle Express and C-MOLAP).

  • PDF

Design and Implementation of a Employment Information Service based on the Social Web Mining for Human-FTA (휴먼 FTA를 위한 소셜 웹 마이닝 기반 고용정보 서비스의 설계 및 구현)

  • Song, Jeo;Park, Yong-goo;Yoo, Jaesoo
    • Proceedings of the Korea Contents Association Conference
    • /
    • 2015.05a
    • /
    • pp.419-420
    • /
    • 2015
  • 경제혁신 3개년 계획을 토대로 정부는 2015년 국내 생산가능 인구 감소에 대한 대응을 위해 외국인 인력 유치를 위한 휴먼 FTA를 발효하였다. 기존의 외국인 생산 인력에 대한 단순한 양적 증가뿐만이 아니라 해외로 생산거점을 이동한 국내 기업의 리턴을 유도하기 위해 석박사급의 고급 인력과 투자자 유치 등에 대한 내용도 포함하고 있다. 본 논문에서는 상기와 같은 노동시장의 새로운 제도인 휴먼 FTA에 대한 활성화와 원활한 운영을 위해 세계적으로 많이 사용되고 있는 트위터, 페이스북, 구글 등의 소셜 웹 데이터를 활용하여 국내 기업의 외국인 인력에 대한 고용 매칭을 위한 서비스 플랫폼을 제안한다.

  • PDF