• Title/Summary/Keyword: Network Centrality Analysis

Search Result 690, Processing Time 0.021 seconds

Movie Popularity Classification Based on Support Vector Machine Combined with Social Network Analysis

  • Dorjmaa, Tserendulam;Shin, Taeksoo
    • Journal of Information Technology Services
    • /
    • v.16 no.3
    • /
    • pp.167-183
    • /
    • 2017
  • The rapid growth of information technology and mobile service platforms, i.e., internet, google, and facebook, etc. has led the abundance of data. Due to this environment, the world is now facing a revolution in the process that data is searched, collected, stored, and shared. Abundance of data gives us several opportunities to knowledge discovery and data mining techniques. In recent years, data mining methods as a solution to discovery and extraction of available knowledge in database has been more popular in e-commerce service fields such as, in particular, movie recommendation. However, most of the classification approaches for predicting the movie popularity have used only several types of information of the movie such as actor, director, rating score, language and countries etc. In this study, we propose a classification-based support vector machine (SVM) model for predicting the movie popularity based on movie's genre data and social network data. Social network analysis (SNA) is used for improving the classification accuracy. This study builds the movies' network (one mode network) based on initial data which is a two mode network as user-to-movie network. For the proposed method we computed degree centrality, betweenness centrality, closeness centrality, and eigenvector centrality as centrality measures in movie's network. Those four centrality values and movies' genre data were used to classify the movie popularity in this study. The logistic regression, neural network, $na{\ddot{i}}ve$ Bayes classifier, and decision tree as benchmarking models for movie popularity classification were also used for comparison with the performance of our proposed model. To assess the classifier's performance accuracy this study used MovieLens data as an open database. Our empirical results indicate that our proposed model with movie's genre and centrality data has by approximately 0% higher accuracy than other classification models with only movie's genre data. The implications of our results show that our proposed model can be used for improving movie popularity classification accuracy.

An Analysis of the Cruise Courses Network in Asian Regions Using Social Network Analysis (SNA를 이용한 아시아 지역 크루즈 항로의 네트워크 분석에 관한 연구)

  • Jeon, Jun-Woo;Cha, Young-Doo;Yeo, Gi-Tae
    • Journal of Korea Port Economic Association
    • /
    • v.32 no.1
    • /
    • pp.17-28
    • /
    • 2016
  • This study examines the cruise course network structure in the Asian regions and the centrality of ports using social network analysis (SNA). For network analysis of Asian cruise courses, a data network of cruise courses was constructed using data on courses of cruise ships operating in Asian ports collected from the reports of the Cruise Lines International Associations.There are 249 nodes or ports of ship companies that provide cruise courses to Asia between from October 2015 to June 2016, and these nodes connect 545 ports. Density analysis based on ports where cruise ship companies operated cruise ships showed that, from October 2015 to June 2016, the density was 0.009, which was lower than the average of global port network density (2006 to 2011) and railroad network density. In addition, was calculated to be, which means that connection with all ports was possible through 2,180 steps. In the analysis of the Asian cruise course network centrality, Singapore ranked first in both out-degree and in-degree in connection centrality, followed by Hong Kong, Shanghai, Ho Chi Minh, and Keelung. Singapore also ranked first in the result betweenness centrality analysis, followed by Penang, Dubai, and Hong Kong. From October 2015 to June 2016, the port with the highest Eigenvector centrality was Hong Kong, followed by Ho Chi Minh, Singapore, Shanghai, and Danang. In the case of the domestic ports Incheon, Busan, and Jeju, connection centrality, betweenness centrality, and Eigenvector centrality all ranked lower than their competitor Chinese ports.

Local Information-based Betweenness Centrality to Identify Important Nodes in Social Networks (사회관계망에서 중요 노드 식별을 위한 지역정보 기반 매개 중심도)

  • Shon, Jin Gon;Kim, Yong-Hwan;Han, Youn-Hee
    • KIPS Transactions on Computer and Communication Systems
    • /
    • v.2 no.5
    • /
    • pp.209-216
    • /
    • 2013
  • In traditional social network analysis, the betweenness centrality measure has been heavily used to identify the relative importance of nodes in terms of message delivery. Since the time complexity to calculate the betweenness centrality is very high, however, it is difficult to get it of each node in large-scale social network where there are so many nodes and edges. In this paper, we define a new type of network, called the expanded ego network, which is built only with each node's local information, i.e., neighbor information of the node's neighbor nodes, and also define a new measure, called the expended ego betweenness centrality. Through the intensive experiment with Barab$\acute{a}$si-Albert network model to generate the scale-free networks which most social networks have as their embedded feature, we also show that the nodes' importance rank based on the expanded ego betweenness centrality has high similarity with that based on the traditional betweenness centrality.

A study on women's welfare organization's network -Focusing on network centrality and organizational effectiveness- (여성복지조직의 네트워크에 관한 연구 -네트워크 중심성(centrality)과 조직효과성을 중심으로-)

  • Jang, Yeon Jin
    • Korean Journal of Social Welfare Studies
    • /
    • v.41 no.4
    • /
    • pp.313-343
    • /
    • 2010
  • The aim of this study is to examine the factors influencing network centrality on women's welfare organizations, and to investigate how the level of network centrality influence the effectiveness of the organization. To achieve this goal, this study conducted a survey on women's welfare organizations in Seoul from March to June, 2009. Network analysis method was used to get each organization's network centrality value. Also, through the Structural Equation Modelling, organizational characteristics predicting network centrality and effect of network centrality on organizational effectiveness. The main results are as follows. First, the significant affecting factors were different between three types of centralities with regards to the type of organization, recognition of resource dependency, attitude of top manager, and established year. Second, the common factors affecting three network centralities were the number of informal ties, accepting feminism as the main organizational philosophy, and the number of qualified staffs. Third, only closeness centrality positively predicted the level of organizational effectiveness among three types of centralities. The faster the organization reaches to other organizations in a network, the organizational effectiveness becomes higher, which means high closeness centrality is more important factor than high degree centrality or high betweenness centrality to increase organizational effectiveness. This result shows social welfare organization should consider changing inter-organizational network strategy from quantity-focused to quality-focused.

Trend Analysis of Data Mining Research Using Topic Network Analysis

  • Kim, Hyon Hee;Rhee, Hey Young
    • Journal of the Korea Society of Computer and Information
    • /
    • v.21 no.5
    • /
    • pp.141-148
    • /
    • 2016
  • In this paper, we propose a topic network analysis approach which integrates topic modeling and social network analysis. We collected 2,039 scientific papers from five top journals in the field of data mining published from 1996 to 2015, and analyzed them with the proposed approach. To identify topic trends, time-series analysis of topic network is performed based on 4 intervals. Our experimental results show centralization of the topic network has the highest score from 1996 to 2000, and decreases for next 5 years and increases again. For last 5 years, centralization of the degree centrality increases, while centralization of the betweenness centrality and closeness centrality decreases again. Also, clustering is identified as the most interrelated topic among other topics. Topics with the highest degree centrality evolves clustering, web applications, clustering and dimensionality reduction according to time. Our approach extracts the interrelationships of topics, which cannot be detected with conventional topic modeling approaches, and provides topical trends of data mining research fields.

A Preliminary Study on the Semantic Network Analysis of Book Report Text (독후감 텍스트의 언어 네트워크 분석에 관한 기초연구)

  • Lee, Soo-Sang
    • Journal of Korean Library and Information Science Society
    • /
    • v.47 no.3
    • /
    • pp.95-114
    • /
    • 2016
  • The purpose of this preliminary study is to collect specific examples of book reports and understand semantic characteristics of them through semantic network. The analysis was conducted with 23 book reports which classified by three groups. The keywords were selected from the of book reports. Five types of keyword network were composed based on co-occurrence relations with keywords. The result of this study is following these. First, each keyword network of book reports of groups and individuals is shown to have different structural characteristics. Second, each network has different high centrality keywords according to the result analysis of 3 types of centrality(degree centrality, closeness centrality, betweenness centrality). These characteristic means that keyword network analysis is useful in recognizing the characteristics of not only groups' and but also individual's book reports.

Analysis of the Changes of Liner Service Networks by Using SNA: Focused on Incheon Port (사회연결망 분석을 활용한 컨테이너 정기선 항로 변화 분석: 인천항을 중심으로)

  • Park, Ki-Hyun;Lin, Mei-Shun;Ahn, Seung-Bum
    • Journal of Korea Port Economic Association
    • /
    • v.32 no.1
    • /
    • pp.97-122
    • /
    • 2016
  • Incheon port attained two million TEU of container throughput between 2013 and 2014 as a third port in domestic container throughput. It opened a new port in Song-do, Incheon in June 2015 to prepare for the continuing increase in container throughput.Therefore, it has provided the platform for being the major container port domestically and internationally. As the role of the new port increases, the role and direction of the Incheon port liner service network attracts attention. This study analyzes the centrality of the Incheon port liner service network by using SNA (Social Network Analysis), which was introduced in the maritime economics area recently, focusing on the Incheon port liner service network. We recognize the degree centrality, closeness centrality, and betweenness centrality of each port and its effect on the Incheon port liner service network. The study showed that for Incheon port, the centrality of the Busan port in Korea, and the Hong Kong port, is high outside the country. This helps us determine that the hub of the Incheon port is neither Shanghai nor Singapore, which ranks first and second, respectively, on container throughput. It is also helps us to know that eastern China's ports have not played a role of the hub of the Incheon port until now because of the relatively low centrality of eastern China's ports.

Monte-Carlo Methods for Social Network Analysis (사회네트워크분석에서 몬테칼로 방법의 활용)

  • Huh, Myung-Hoe;Lee, Yong-Goo
    • The Korean Journal of Applied Statistics
    • /
    • v.24 no.2
    • /
    • pp.401-409
    • /
    • 2011
  • From a social network of n nodes connected by l lines, one may produce centrality measures such as closeness, betweenness and so on. In the past, the magnitude of n was around 1,000 or 10,000 at most. Nowadays, some networks have 10,000, 100,000 or even more than that. Thus, the scalability issue needs the attention of researchers. In this short paper, we explore random networks of the size around n = 100,000 by Monte-Carlo method and propose Monte-Carlo algorithms of computing closeness and betweenness centrality measures to study the small world properties of social networks.

An Efficient Algorithm for Betweenness Centrality Estimation in Social Networks (사회관계망에서 매개 중심도 추정을 위한 효율적인 알고리즘)

  • Shin, Soo-Jin;Kim, Yong-Hwan;Kim, Chan-Myung;Han, Youn-Hee
    • KIPS Transactions on Software and Data Engineering
    • /
    • v.4 no.1
    • /
    • pp.37-44
    • /
    • 2015
  • In traditional social network analysis, the betweenness centrality measure has been heavily used to identify the relative importance of nodes. Since the time complexity to calculate the betweenness centrality is very high, however, it is difficult to get it of each node in large-scale social network where there are so many nodes and edges. In our past study, we defined a new type of network, called the expanded ego network, which is built only with each node's local information, i.e., neighbor information of the node's neighbor nodes, and also defined a new measure, called the expanded ego betweenness centrality. In this paper, We propose algorithm that quickly computes expanded ego betweenness centrality by exploiting structural properties of expanded ego network. Through the experiment with virtual network used Barab$\acute{a}$si-Albert network model to represent the generic social network and facebook network to represent actual social network, We show that the node's importance rank based on the expanded ego betweenness centrality has high similarity with that the node's importance rank based on the existing betweenness centrality. We also show that the proposed algorithm computes the expanded ego betweenness centrality quickly than existing algorithm.

Co-occurrence Patterns of Bird Species in the World

  • Kim, Young Min;Hong, Sungwon;Lee, Yu Seong;Oh, Ki Cheol;Kim, Gu Yeon;Joo, Gea-Jae
    • Korean Journal of Ecology and Environment
    • /
    • v.50 no.4
    • /
    • pp.478-482
    • /
    • 2017
  • In order to identify key nations and bird species of conservation concern we described multinational collaborations as defined using network analysis linked by birds that are found in all nations in the network. We used network analysis to assess the patterns in bird occurrence for 10,422 bird inventories from 244 countries and territories. Nations that are important in multinational collaborations for bird conservation were assessed using the centrality measures, closeness and betweenness centrality. Countries important for the multinational collaboration of bird conservation were examined based on their centrality measures, which included closeness and betweenness centralities. Comparatively, the co-occurrence network was divided into four groups that reveal different biogeographical structures. A group with higher closeness centrality included countries in southern Africa and had the potential to affect species in many other countries. Birds in countries in Asia, Australia and the South Pacific that are important to the cohesiveness of the global network had a higher score of betweenness centrality. Countries that had higher numbers of bird species and more extensively distributed bird species had higher centrality scores; in these countries, birds may act as excellent indicators of trends in the co-occurrence bird network. For effective bird conservation in the world, much stronger coordination among countries is required. Bird co-occurrence patterns can provide a suitable and powerful framework for understanding the complexity of co-occurrence patterns and consequences for multinational collaborations on bird conservation.