• Title/Summary/Keyword: Journal PageRank

Search Result 80, Processing Time 0.026 seconds

A PageRank based Data Indexing Method for Designing Natural Language Interface to CRM Databases (분석 CRM 실무자의 자연어 질의 처리를 위한 기업 데이터베이스 구성요소 인덱싱 방법론)

  • Park, Sung-Hyuk;Hwang, Kyeong-Seo;Lee, Dong-Won
    • CRM연구
    • /
    • v.2 no.2
    • /
    • pp.53-70
    • /
    • 2009
  • Understanding consumer behavior based on the analysis of the customer data is one essential part of analytic CRM. To do this, the analytic skills for data extraction and data processing are required to users. As a user has various kinds of questions for the consumer data analysis, the user should use database language such as SQL. However, for the firm's user, to generate SQL statements is not easy because the accuracy of the query result is hugely influenced by the knowledge of work-site operation and the firm's database. This paper proposes a natural language based database search framework finding relevant database elements. Specifically, we describe how our TableRank method can understand the user's natural query language and provide proper relations and attributes of data records to the user. Through several experiments, it is supported that the TableRank provides accurate database elements related to the user's natural query. We also show that the close distance among relations in the database represents the high data connectivity which guarantees matching with a search query from a user.

  • PDF

Efficient Internet Information Extraction Using Hyperlink Structure and Fitness of Hypertext Document (웹의 연결구조와 웹문서의 적합도를 이용한 효율적인 인터넷 정보추출)

  • Hwang Insoo
    • Journal of Information Technology Applications and Management
    • /
    • v.11 no.4
    • /
    • pp.49-60
    • /
    • 2004
  • While the World-Wide Web offers an incredibly rich base of information, organized as a hypertext it does not provide a uniform and efficient way to retrieve specific information. Therefore, it is needed to develop an efficient web crawler for gathering useful information in acceptable amount of time. In this paper, we studied the order in which the web crawler visit URLs to rapidly obtain more important web pages. We also developed an internet agent for efficient web crawling using hyperlink structure and fitness of hypertext documents. As a result of experiment on a website. it is shown that proposed agent outperforms other web crawlers using BackLink and PageRank algorithm.

  • PDF

SNA to assess the Influence of Organization Members (Focusing on core members of North Korea)

  • Lee, Young-Seok;Yoon, Soungwoong;Lee, Sang-Hoon
    • Journal of the Korea Society of Computer and Information
    • /
    • v.23 no.7
    • /
    • pp.73-80
    • /
    • 2018
  • There are various organizations in modern society, in which people have direct and indirect relationships. Internal structure of these organizations can be analyzed by the relationships which are officially pressed on the media. However, this task will be difficult when the media information is strictly limited, though the necessity of analyzing organization structure remains. In this study, we try to estimate the influence of North Korea's core members by using PageRank centrality to supplement the limitation of previous SNA analysis methods. Experimental results show that we can show and predict NK's power shifts more efficiently.

Studying Structural Evaluation of Web Link Structure and Performance in Destination Marketing Organizations (웹링크 구조와 웹사이트 성과간의 구조적 평가에 관한 연구: 컨벤션비지터뷰로(CVB)를 대상으로)

  • Joun, Hyo-Jae;Cho, Nam-Jae
    • Journal of Digital Convergence
    • /
    • v.5 no.2
    • /
    • pp.91-98
    • /
    • 2007
  • Destination marketing organizations (DMO) have been building up the cyber city in the WWW. Website for DMO is a core channel to promote regional attractions. This research suggests the issue of criteria for evaluating DMO's performance in the Internet. The method of evaluation focuses on the structure in perspective of linkage based on small world theory and direct network. Convention & Visitors & Bureau (CVB) in tourism and travel industry playa role to promote and held the international meeting and exhibitions. CVB's websites evaluated according to web link structure and performance.

  • PDF

An Intelligent Search Modeling using Avatar Agent

  • Kim, Dae Su
    • International Journal of Fuzzy Logic and Intelligent Systems
    • /
    • v.4 no.3
    • /
    • pp.288-291
    • /
    • 2004
  • This paper proposes an intelligent search modeling using avatar agent. This system consists of some modules such as agent interface, agent management, preprocessor, interface machine. Core-Symbol Database and Spell Checker are related to the preprocessor module and Interface Machine is connected with Best Aggregate Designer. Our avatar agent system does the indexing work that converts user's natural language type sentence to the proper words that is suitable for the specific branch information retrieval. Indexing is one of the preprocessing steps that make it possible to guarantee the specialty of user's input and increases the reliability of the result. It references a database that consists of synonym and specific branch dictionary. The resulting symbol after indexing is used for draft search by the internet search engine. The retrieval page position and link information are stored in the database. We experimented our system with the stock market keyword SAMSUNG_SDI, IBM, and SONY and compared the result with that of Altavista and Google search engine. It showed quite excellent results.

Outlier Detection Techniques for Biased Opinion Discovery (편향된 의견 문서 검출을 위한 이상치 탐지 기법)

  • Yeon, Jongheum;Shim, Junho;Lee, Sanggoo
    • The Journal of Society for e-Business Studies
    • /
    • v.18 no.4
    • /
    • pp.315-326
    • /
    • 2013
  • Users in social media post various types of opinions such as product reviews and movie reviews. It is a common trend that customers get assistance from the opinions in making their decisions. However, as opinion usage grows, distorted feedbacks also have increased. For example, exaggerated positive opinions are posted for promoting target products. So are negative opinions which are far from common evaluations. Finding these biased opinions becomes important to keep social media reliable. Techniques of opinion mining (or sentiment analysis) have been developed to determine sentiment polarity of opinionated documents. These techniques can be utilized for finding the biased opinions. However, the previous techniques have some drawback. They categorize the text into only positive and negative, and they also need a large amount of training data to build the classifier. In this paper, we propose methods for discovering the biased opinions which are skewed from the overall common opinions. The methods are based on angle based outlier detection and personalized PageRank, which can be applied without training data. We analyze the performance of the proposed techniques by presenting experimental results on a movie review dataset.

Industrial Technology Leak Detection System on the Dark Web (다크웹 환경에서 산업기술 유출 탐지 시스템)

  • Young Jae, Kong;Hang Bae, Chang
    • Smart Media Journal
    • /
    • v.11 no.10
    • /
    • pp.46-53
    • /
    • 2022
  • Today, due to the 4th industrial revolution and extensive R&D funding, domestic companies have begun to possess world-class industrial technologies and have grown into important assets. The national government has designated it as a "national core technology" in order to protect companies' critical industrial technologies. Particularly, technology leaks in the shipbuilding, display, and semiconductor industries can result in a significant loss of competitiveness not only at the company level but also at the national level. Every year, there are more insider leaks, ransomware attacks, and attempts to steal industrial technology through industrial spy. The stolen industrial technology is then traded covertly on the dark web. In this paper, we propose a system for detecting industrial technology leaks in the dark web environment. The proposed model first builds a database through dark web crawling using information collected from the OSINT environment. Afterwards, keywords for industrial technology leakage are extracted using the KeyBERT model, and signs of industrial technology leakage in the dark web environment are proposed as quantitative figures. Finally, based on the identified industrial technology leakage sites in the dark web environment, the possibility of secondary leakage is detected through the PageRank algorithm. The proposed method accepted for the collection of 27,317 unique dark web domains and the extraction of 15,028 nuclear energy-related keywords from 100 nuclear power patents. 12 dark web sites identified as a result of detecting secondary leaks based on the highest nuclear leak dark web sites.

Nonparametric method in randomized block design for umbrella alternatives based on aligned method and placement (랜덤화 블록 계획법에서 우산형 대립가설에 대한 정렬방법과 위치를 이용한 비모수 검정법)

  • Kim, Jeonghyun;Kim, Dongjae
    • The Korean Journal of Applied Statistics
    • /
    • v.29 no.7
    • /
    • pp.1399-1409
    • /
    • 2016
  • Nonparametric methods in randomized block design were suggested by Friedman (1937) for general alternatives and were also proposed by Page (1963) for ordered alternatives in one-way layout; in addition, K-sample rank tests for umbrella alternatives were suggested by Mack and Wolfe (1981). In this paper, we proposed a nonparametric method of umbrella alternatives for randomized block design using the aligned method proposed by Hodges and Lehmann (1962) to use block information and using placement suggested by Kim (1999). Monte Carlo simulation was also adapted to compare the power of the proposed procedure with previous methods.

Personalized Document Snippet Extraction Method using Fuzzy Association and Pseudo Relevance Feedback (의사연관 피드백과 퍼지 연관을 이용한 개인화 문서 스니핏 추출 방법)

  • Park, Seon;Jo, Gwang-Mun;Yang, Hu-Yeol;Lee, Seong-Ro
    • Journal of the Institute of Electronics Engineers of Korea SP
    • /
    • v.49 no.2
    • /
    • pp.137-142
    • /
    • 2012
  • Snippet is a summaries information of representing web pages which search engine provides user. Snippet and page rank in search engine abundantly influence user for visiting web pages. User sometime visits the wrong page with respect to user intention when uses snippet. The snippet extraction method is difficult to accurate comprehending user intention. In order to solve above problem, this paper proposes a new snippet extraction method using fuzzy association and pseudo relevance feedback. The proposed method uses pseudo relevance feedback to expand the use's query. It uses the fuzzy association between the expanded query and the web pages to extract snippet to be well reflected semantic user's intention. The experimental results demonstrate that the proposed method can achieve better snippet extraction performance than the other methods.

Comparison the Difference of User Experience for Mobile Facebook and Instagram Using Nonparametric Statistics Methods -Focused on Emotional Interface Model- (비모수적 통계방법을 이용한 모바일 페이스북과 인스타그램의 사용자 경험 차이 비교 -감성인터페이스 모형을 중심으로-)

  • Ahn, Ji-Hyun;Kim, Seung-In
    • Journal of Digital Convergence
    • /
    • v.14 no.11
    • /
    • pp.481-488
    • /
    • 2016
  • This study is about comparing the mobile user experience of Facebook and Instagram which are most often used among the recent SNSs by the people in their 30s and under. This study analyzed the user experience level after dividing the user experience factors through the Creating Pleasurable Interfaces model, and suggested the mean analysis as well as the result of Wilcoxon rank test which is a nonparametric statistics method. As a result of study, the Display information visually factor in functional factor and the configuration of the main page in convenient factor were a statistically significant difference in the mobile user experience of Facebook and Instagram. It is expected that this study may help seeking the user experience factors to be promoted preferentially in a competitive situation through the statistical comparative evaluation of the experience of two SNS users.