• Title/Summary/Keyword: co-word network analysis

Search Result 92, Processing Time 0.022 seconds

A Study of 'Emotion Trigger' by Text Mining Techniques (텍스트 마이닝을 이용한 감정 유발 요인 'Emotion Trigger'에 관한 연구)

  • An, Juyoung;Bae, Junghwan;Han, Namgi;Song, Min
    • Journal of Intelligence and Information Systems
    • /
    • v.21 no.2
    • /
    • pp.69-92
    • /
    • 2015
  • The explosion of social media data has led to apply text-mining techniques to analyze big social media data in a more rigorous manner. Even if social media text analysis algorithms were improved, previous approaches to social media text analysis have some limitations. In the field of sentiment analysis of social media written in Korean, there are two typical approaches. One is the linguistic approach using machine learning, which is the most common approach. Some studies have been conducted by adding grammatical factors to feature sets for training classification model. The other approach adopts the semantic analysis method to sentiment analysis, but this approach is mainly applied to English texts. To overcome these limitations, this study applies the Word2Vec algorithm which is an extension of the neural network algorithms to deal with more extensive semantic features that were underestimated in existing sentiment analysis. The result from adopting the Word2Vec algorithm is compared to the result from co-occurrence analysis to identify the difference between two approaches. The results show that the distribution related word extracted by Word2Vec algorithm in that the words represent some emotion about the keyword used are three times more than extracted by co-occurrence analysis. The reason of the difference between two results comes from Word2Vec's semantic features vectorization. Therefore, it is possible to say that Word2Vec algorithm is able to catch the hidden related words which have not been found in traditional analysis. In addition, Part Of Speech (POS) tagging for Korean is used to detect adjective as "emotional word" in Korean. In addition, the emotion words extracted from the text are converted into word vector by the Word2Vec algorithm to find related words. Among these related words, noun words are selected because each word of them would have causal relationship with "emotional word" in the sentence. The process of extracting these trigger factor of emotional word is named "Emotion Trigger" in this study. As a case study, the datasets used in the study are collected by searching using three keywords: professor, prosecutor, and doctor in that these keywords contain rich public emotion and opinion. Advanced data collecting was conducted to select secondary keywords for data gathering. The secondary keywords for each keyword used to gather the data to be used in actual analysis are followed: Professor (sexual assault, misappropriation of research money, recruitment irregularities, polifessor), Doctor (Shin hae-chul sky hospital, drinking and plastic surgery, rebate) Prosecutor (lewd behavior, sponsor). The size of the text data is about to 100,000(Professor: 25720, Doctor: 35110, Prosecutor: 43225) and the data are gathered from news, blog, and twitter to reflect various level of public emotion into text data analysis. As a visualization method, Gephi (http://gephi.github.io) was used and every program used in text processing and analysis are java coding. The contributions of this study are as follows: First, different approaches for sentiment analysis are integrated to overcome the limitations of existing approaches. Secondly, finding Emotion Trigger can detect the hidden connections to public emotion which existing method cannot detect. Finally, the approach used in this study could be generalized regardless of types of text data. The limitation of this study is that it is hard to say the word extracted by Emotion Trigger processing has significantly causal relationship with emotional word in a sentence. The future study will be conducted to clarify the causal relationship between emotional words and the words extracted by Emotion Trigger by comparing with the relationships manually tagged. Furthermore, the text data used in Emotion Trigger are twitter, so the data have a number of distinct features which we did not deal with in this study. These features will be considered in further study.

Towards Next Generation Multimedia Information Retrieval by Analyzing User-centered Image Access and Use (이용자 중심의 이미지 접근과 이용 분석을 통한 차세대 멀티미디어 검색 패러다임 요소에 관한 연구)

  • Chung, EunKyung
    • Journal of the Korean Society for Library and Information Science
    • /
    • v.51 no.4
    • /
    • pp.121-138
    • /
    • 2017
  • As information users seek multimedia with a wide variety of information needs, information environments for multimedia have been developed drastically. More specifically, as seeking multimedia with emotional access points has been popular, the needs for indexing in terms of abstract concepts including emotions have grown. This study aims to analyze the index terms extracted from Getty Image Bank. Five basic emotion terms, which are sadness, love, horror, happiness, anger, were used when collected the indexing terms. A total 22,675 index terms were used for this study. The data are three sets; entire emotion, positive emotion, and negative emotion. For these three data sets, co-word occurrence matrices were created and visualized in weighted network with PNNC clusters. The entire emotion network demonstrates three clusters and 20 sub-clusters. On the other hand, positive emotion network and negative emotion network show 10 clusters, respectively. The results point out three elements for next generation of multimedia retrieval: (1) the analysis on index terms for emotions shown in people on image, (2) the relationship between connotative term and denotative term and possibility for inferring connotative terms from denotative terms using the relationship, and (3) the significance of thesaurus on connotative term in order to expand related terms or synonyms for better access points.

Knowledge Structure of Posttraumatic Growth Research: A Network Analysis (네트워크 분석을 통한 외상 후 성장 지식구조 연구)

  • Shin, JooYeon;Kwon, Sunyoung;Bae, Ka Ryeong
    • Journal of Industrial Convergence
    • /
    • v.20 no.10
    • /
    • pp.61-69
    • /
    • 2022
  • Posttraumatic growth literature has been rapidly expanding in multiple academic disciplines. Purpose of this study is to examine the knowledge structure of posttraumatic growth utilizing a network analysis. Papers published between 1996 and 2018 were searched on the Web of Science, focusing on terms related to posttraumatic growth. One thousand six-hundred and fifty-nine keywords were published 6,343 times in 1,780 papers; thus, a total of 322 keywords (5,195 appearances) were selected for the final analysis. The network analysis and network visualization tool used were NodeXL and PFnet, respectively. The keywords which appeared the most frequently were "Posttraumatic growth," followed by "Posttraumatic Stress Disease," "Cancer," and "Trauma." A total of 322 nodes have been reduced to 175 nodes and divided into a total of five groups. The five groups were "Posttraumatic Growth in Cancer, Chronic/Serious Illness, and Disability," "Posttraumatic Growth-related Psychological Variables and Psychotherapy," "Posttraumatic Growth in the Context of Death," "Cognitive Mechanisms of Posttraumatic Growth," and "Vicarious Posttraumatic Growth." This study provides a systematic overview on the knowledge structure of posttraumatic growth by quantitatively network analysis.

Network Analysis of the Intellectual Structure of Addiction Research in Social Sciences: Based on the KCI Articles Published in 2019 (사회과학 중독연구 분야의 지적구조에 관한 네트워크 분석 : 2019년도 KCI 등재 논문을 기반으로)

  • Lee, Serim;Chun, JongSerl
    • The Journal of the Korea Contents Association
    • /
    • v.21 no.10
    • /
    • pp.21-37
    • /
    • 2021
  • This study investigated the intellectual structure of the latest trends in Korean addiction research in the social sciences. A network analysis of keywords with co-word occurrence was performed on 172 papers from the KCI database based on the data from the year of 2019, and a total of 432 keywords were extracted. The network analysis was performed using several programs: Bibexcel, COOC, WNET, and NodeXL. As a result of the study, keywords related to addiction type, study subjects, research methods, and research variables were found, and a total of 20 clusters were identified. Furthermore, to identify and measure weighted networks, the relationships between each keyword were explored and discussed in detail through a network analysis of global centralities, local centralities, and betweenness centralities. The study indicated that the latest issues were focused on smartphone addiction and provided implications for the future research and practice that fields and topics of relationship addiction, food addiction, and work addiction should be more considered. Further, the study discussed the relationship between drug addiction-crime, alcohol addiction-family, and gambling addiction-motivation and the necessity of qualitative study.

Arab Spring Effects on Meanings for Islamist Web Terms and on Web Hyperlink Networks among Muslim-Majority Nations: A Naturalistic Field Experiment

  • Danowski, James A.;Park, Han Woo
    • Journal of Contemporary Eastern Asia
    • /
    • v.13 no.2
    • /
    • pp.15-39
    • /
    • 2014
  • This research conducted a before/after naturalistic field experiment, with the early Arab Spring as the treatment. Compared to before the early Arab Spring, after the observation period the associations became stronger among the Web terms: 'Jihad, Sharia, innovation, democracy and civil society.' The Western concept of civil society transformed into a central Islamist ideological component. At another level, the inter-nation network based on Jihad-weighted Web hyperlinks between pairs of 46 Muslim Majority (MM) nations found Iran in one of the top two positions of flow betweenness centrality, a measure of network power, both before and after early Arab Spring. In contrast, Somalia, UAE, Egypt, Libya, and Sudan increased most in network flow betweenness centrality. The MM 'Jihad'-centric word co-occurrence network more than tripled in size, and the semantic structure more became entropic. This media "cloud" perhaps billowed as Islamist groups changed their material-level relationships and the corresponding media representations of Jihad among them changed after early Arab Spring. Future research could investigate various rival explanations for this naturalistic field experiment's findings.

Exploring the Research Trends of Learning Strategies in Korean Language Education Using Co-word Analysis (동시출현단어 분석을 활용한 한국어교육에서의 학습전략 연구 동향 탐색)

  • Heo, Youngsoo;Park, Ji-Hong
    • Journal of the Korean Society for information Management
    • /
    • v.38 no.2
    • /
    • pp.65-86
    • /
    • 2021
  • In the foreign language education, learners are an important part of education, however in the Korean language education, the study of learners was insufficient compared to the contents of education, teaching methods and textbooks. Therefore, it is meaningful to analyze how learner research, especially learning strategy research, has been conducted and derive areas that need research for better education. In this study, co-word analysis was conducted on the titles of academic journals and dissertations in order to analyze the learning strategy research in Korean language education. I found it is about "reading" that the most studies related to Korean language learners' learning strategies were conducted and those studies' subjects mostly were 'Chinese international students' and 'marriage-immigrants'. In addition, the results of the subgroup analysis on the research topic show four major subgroups: a group related to 'reading for academic purposes', a group related to 'request, rejection, conversation, etc.', a group related to 'writing', and a group related to 'vocabulary, listening'. This shows that the researchers' major interests in studying Korean learner's strategies are "reading" and "speaking" and their studies have been concentrated in the specific areas. Therefore, it is necessary for researchers to study various functions and subjects in Korean language learner's learning strategies.

An Investigation on Scientific Data for Data Journal and Data Paper (Scientific Data 학술지 분석을 통한 데이터 논문 현황에 관한 연구)

  • Chung, EunKyung
    • Journal of the Korean Society for information Management
    • /
    • v.36 no.1
    • /
    • pp.117-135
    • /
    • 2019
  • Data journals and data papers have grown and considered an important scholarly practice in the paradigm of open science in the context of data sharing and data reuse. This study investigates a total of 713 data papers published in Scientific Data in terms of author, citation, and subject areas. The findings of the study show that the subject areas of core authors are found as the areas of Biotechnology and Physics. An average number of co-authors is 12 and the patterns of co-authorship are recognized as several closed sub-networks. In terms of citation status, the subject areas of cited publications are highly similar to the areas of data paper authors. However, the citation analysis indicates that there are considerable citations on the journals specialized on methodology. The network with authors' keywords identifies more detailed areas such as marine ecology, cancer, genome, database, and temperature. This result indicates that biology oriented-subjects are primary areas in the journal although Scientific Data is categorized in multidisciplinary science in Web of Science database.

Text Mining Driven Content Analysis of Ebola on News Media and Scientific Publications (텍스트 마이닝을 이용한 매체별 에볼라 주제 분석 - 바이오 분야 연구논문과 뉴스 텍스트 데이터를 이용하여 -)

  • An, Juyoung;Ahn, Kyubin;Song, Min
    • Journal of the Korean Society for Library and Information Science
    • /
    • v.50 no.2
    • /
    • pp.289-307
    • /
    • 2016
  • Infectious diseases such as Ebola virus disease become a social issue and draw public attention to be a major topic on news or research. As a result, there have been a lot of studies on infectious diseases using text-mining techniques. However, there is no research on content analysis of two media channels that have distinct characteristics. Accordingly, in this study, we conduct topic analysis between news (representing a social perspective) and academic research paper (representing perspectives of bio-professionals). As text-mining techniques, topic modeling is applied to extract various topics according to the materials, and the word co-occurrence map based on selected bio entities is used to compare the perspectives of the materials specifically. For network analysis, topic map is built by using Gephi. Aforementioned approaches uncovered the difference of topics between two materials and the characteristics of the two materials. In terms of the word co-occurrence map, however, most of entities are shared in both materials. These results indicate that there are differences and commonalties between social and academic materials.

Issues on Articles Covering Outstanding Management of Apartment Complexes - Content Analysis of Newspaper Reports with Lexical Statistics - (우수 아파트단지 취재기사에서의 관리상의 논점 - 탐방기사를 이용한 언어통계학적 내용분석 -)

  • Choi Jung-Min;Kang Soon-Joo
    • Journal of the Korean housing association
    • /
    • v.17 no.4
    • /
    • pp.131-143
    • /
    • 2006
  • Nowadays, diverse mass media discovers and introduces outstanding management cases of apartment complexes to induce vital competitions of constructors and active participation of residents to apartment management. This study statistically analyzed the management issues of outstanding apartment complexes that have been introduced by mass media with lexical criteria to examine the characteristics of their exemplary management. The key issues of outstanding apartment management are summarized as: efficient management of convenient facilities for residents, community activities based on residents' participation, and maintenance of pleasant living environments through transparent management. Also, the result of the relation arrangement of co-occurrence word from a Social Network Analysis included three key concepts of multi-family housing management - Maintenance Management, Operating Management, and Community Life Management - with emphasis on 'residents' and 'apartment complexes.' However, Operating Management was relatively deemphasized.

Examining China's Internet Policies through a Bibliometric Approach

  • Li, Jiang;Xu, Weiai Wayne;Wang, Fang;Chen, Si;Sun, Jianjun
    • Journal of Contemporary Eastern Asia
    • /
    • v.17 no.2
    • /
    • pp.237-253
    • /
    • 2018
  • In order to understand China's internet governance, this paper examined 1,931 Internet policies of China by bibliometric techniques. Specifically, the bibliometric techniques include simple document counting, co-word analysis, collaboration network analysis and citation analysis. The findings include: (1) China's Internet legislations mainly emphasized e-commerce and Internet governance, and, to some extent, neglected personal data protection; (2) China's Internet is under intensive multiple regulatory controls by central government. A large number of government agencies are involved in Internet policy-making. The Propaganda Department of the Central Committee of the Communist Party of China and the State Information Leading Group of the State Council, enforced fewer policy documents, but occupy higher positions in the Internet governance hierarchy; (3) China's Internet legislation system is primarily composed of industry-specific administrative rules, rather than laws or administrative regulations. Nevertheless, laws and administrative regulations received significantly more citations owing to their superior force. This paper also discussed current gaps in China's internet governance and how the country's internet policies are situated in the broader global context.