• Title/Summary/Keyword: Semantic Social Network

Search Result 169, Processing Time 0.024 seconds

A Study of 'Emotion Trigger' by Text Mining Techniques (텍스트 마이닝을 이용한 감정 유발 요인 'Emotion Trigger'에 관한 연구)

  • An, Juyoung;Bae, Junghwan;Han, Namgi;Song, Min
    • Journal of Intelligence and Information Systems
    • /
    • v.21 no.2
    • /
    • pp.69-92
    • /
    • 2015
  • The explosion of social media data has led to apply text-mining techniques to analyze big social media data in a more rigorous manner. Even if social media text analysis algorithms were improved, previous approaches to social media text analysis have some limitations. In the field of sentiment analysis of social media written in Korean, there are two typical approaches. One is the linguistic approach using machine learning, which is the most common approach. Some studies have been conducted by adding grammatical factors to feature sets for training classification model. The other approach adopts the semantic analysis method to sentiment analysis, but this approach is mainly applied to English texts. To overcome these limitations, this study applies the Word2Vec algorithm which is an extension of the neural network algorithms to deal with more extensive semantic features that were underestimated in existing sentiment analysis. The result from adopting the Word2Vec algorithm is compared to the result from co-occurrence analysis to identify the difference between two approaches. The results show that the distribution related word extracted by Word2Vec algorithm in that the words represent some emotion about the keyword used are three times more than extracted by co-occurrence analysis. The reason of the difference between two results comes from Word2Vec's semantic features vectorization. Therefore, it is possible to say that Word2Vec algorithm is able to catch the hidden related words which have not been found in traditional analysis. In addition, Part Of Speech (POS) tagging for Korean is used to detect adjective as "emotional word" in Korean. In addition, the emotion words extracted from the text are converted into word vector by the Word2Vec algorithm to find related words. Among these related words, noun words are selected because each word of them would have causal relationship with "emotional word" in the sentence. The process of extracting these trigger factor of emotional word is named "Emotion Trigger" in this study. As a case study, the datasets used in the study are collected by searching using three keywords: professor, prosecutor, and doctor in that these keywords contain rich public emotion and opinion. Advanced data collecting was conducted to select secondary keywords for data gathering. The secondary keywords for each keyword used to gather the data to be used in actual analysis are followed: Professor (sexual assault, misappropriation of research money, recruitment irregularities, polifessor), Doctor (Shin hae-chul sky hospital, drinking and plastic surgery, rebate) Prosecutor (lewd behavior, sponsor). The size of the text data is about to 100,000(Professor: 25720, Doctor: 35110, Prosecutor: 43225) and the data are gathered from news, blog, and twitter to reflect various level of public emotion into text data analysis. As a visualization method, Gephi (http://gephi.github.io) was used and every program used in text processing and analysis are java coding. The contributions of this study are as follows: First, different approaches for sentiment analysis are integrated to overcome the limitations of existing approaches. Secondly, finding Emotion Trigger can detect the hidden connections to public emotion which existing method cannot detect. Finally, the approach used in this study could be generalized regardless of types of text data. The limitation of this study is that it is hard to say the word extracted by Emotion Trigger processing has significantly causal relationship with emotional word in a sentence. The future study will be conducted to clarify the causal relationship between emotional words and the words extracted by Emotion Trigger by comparing with the relationships manually tagged. Furthermore, the text data used in Emotion Trigger are twitter, so the data have a number of distinct features which we did not deal with in this study. These features will be considered in further study.

Investigations on Techniques and Applications of Text Analytics (텍스트 분석 기술 및 활용 동향)

  • Kim, Namgyu;Lee, Donghoon;Choi, Hochang;Wong, William Xiu Shun
    • The Journal of Korean Institute of Communications and Information Sciences
    • /
    • v.42 no.2
    • /
    • pp.471-492
    • /
    • 2017
  • The demand and interest in big data analytics are increasing rapidly. The concepts around big data include not only existing structured data, but also various kinds of unstructured data such as text, images, videos, and logs. Among the various types of unstructured data, text data have gained particular attention because it is the most representative method to describe and deliver information. Text analysis is generally performed in the following order: document collection, parsing and filtering, structuring, frequency analysis, and similarity analysis. The results of the analysis can be displayed through word cloud, word network, topic modeling, document classification, and semantic analysis. Notably, there is an increasing demand to identify trending topics from the rapidly increasing text data generated through various social media. Thus, research on and applications of topic modeling have been actively carried out in various fields since topic modeling is able to extract the core topics from a huge amount of unstructured text documents and provide the document groups for each different topic. In this paper, we review the major techniques and research trends of text analysis. Further, we also introduce some cases of applications that solve the problems in various fields by using topic modeling.

A Study on Tourism Behavior in the New normal Era Using Big Data (빅데이터를 활용한 뉴노멀(New normal)시대의 관광행태 변화에 관한 연구)

  • Kyoung-mi Yoo;Jong-cheon Kang;Youn-hee Choi
    • The Journal of the Convergence on Culture Technology
    • /
    • v.9 no.3
    • /
    • pp.167-181
    • /
    • 2023
  • This study utilized TEXTOM, a social network analysis program to analyze changes in current tourism behavior after travel restrictions were eased after the outbreak of COVID-19. Data on the keywords 'domestic travel' and 'overseas travel' were collected from blogs, cafes, and news provided by Naver, Google, and Daum. The collection period was set from April to December 2022 when social distancing was lifted, and 2019 and 2020 were each set as one year and compared and analyzed with 2022. A total of 80 key words were extracted through text mining and centrality analysis was performed using NetDraw. Finally, through the CONCOR, the correlated keywords were clustered into 4. As a result of the study, tourism behavior in 2022 shows tourism recovery before the outbreak of COVID-19, segmentation of travel based on each person's preferred theme, prioritization of each country's corona mitigation policy, and then selecting a tourist destination. It is expected to provide basic data for the development of tourism marketing strategies and tourism products for the newly emerging tourism ecosystem after COVID-19.

A Study on Public Policy through Semantic Network Analysis of Public Data related News in Korea (국내 공공데이터 관련 뉴스 의미망 분석을 통한 공공정책 연구)

  • Moon, HyeJung;Lee, Kyungseo
    • Journal of Broadcast Engineering
    • /
    • v.23 no.4
    • /
    • pp.536-548
    • /
    • 2018
  • Public data has been transformed from provider-oriented information disclosure to a form of personalized information sharing centered on individual citizens since government 3.0. As a result, the government is implementing policies and projects to maximize the value of public data and increase reuse. This study analyzes the issues related to public data in the news and seeks the status of government agencies and government projects by issue. We conducted semantic analysis on domestic online news and public agency bidding information including public data and conducted the work of linking major key words derived with social and economic values inherent in public data. As a result, major issues related to public data were divided into broader access to public data, growth of new technology, cooperation and conflict among stakeholders, and utilization of the private sector, which were closely related to transparency, efficiency, participation, and innovation mechanisms. Also major agencies of four issues include the Ministry of Strategy and Finance and Seoul, Ministry of Culture, Sports and Tourism and Gyeonggi-do, Ministry of Trade, Industry and Energy and Incheon, and Ministry of Land, Infrastructure and Transport and Gyeongsangbuk-do. Most of the issues are being led by the government.

Global Citizenship Education in the Primary Geography Curriculum of the Republic of Korea: Content Analysis Focusing on the Semantic Structure of 2009 Revised School Curriculum (초등지리 교육과정에 반영된 세계시민교육 관련 요소의 구조적 특성에 관한 연구: 2009 개정 교육과정 성취기준에 대한 내용분석을 중심으로)

  • Lee, Dong-Min
    • Journal of the Korean Geographical Society
    • /
    • v.49 no.6
    • /
    • pp.949-969
    • /
    • 2014
  • The purpose of this study is to analyze the share of global citizenship education in the 2009 Revised Social Studies (geography area) School Curriculum of the Republic of Korea. I selected the achievement standards of the geography domain in the fifth and sixth grades as the subjects of analysis. The chosen subjects were examined using content analysis: I used KrKwic, a Korean language content analysis tool, to analyze the content and drew a semantic network of the analysis results using UciNet/NetDraw. I found that the geography domain of the 2009 Revised Primary School Curriculum included the concepts of and factors of global citizenship education. However, global citizenship education did not account for a major portion of the curriculum, and the curriculum achievement standards were noticeably nation-state centered. Global citizenship education factors were not closely associated with to other related factors in fact, they even revealed a isolated pattern. These findings suggest that the inclusion of global citizenship education in primary geography education is limited, because the connections between global citizenship education and related contents, such as the environment, sustainable development, conflict, and cooperation, are probably impeded. Globalization accompanies the transformation of territories, identities, and the relations between nation-states and the world, although nation-states continue to play a significant role in the globalized worlds. Therefore global citizenship education, a educational trend focusing on the global community, is particularly important and is required in the geography curriculum of the global era. I expect that the examination undertaken in this study to contribute to future curriculum revisions regarding globalizatin and global citizenship.

  • PDF

Component Grid: A Developer-centric Environment for Defense Software Reuse (컴포넌트 그리드: 개발자 친화적인 국방 소프트웨어 재사용 지원 환경)

  • Ko, In-Young;Koo, Hyung-Min
    • Journal of Software Engineering Society
    • /
    • v.23 no.4
    • /
    • pp.151-163
    • /
    • 2010
  • In the defense software domain where large-scale software products in various application areas need to be built, reusing software is regarded as one of the important practices to build software products efficiently and economically. There have been many efforts to apply various methods to support software reuse in the defense software domain. However, developers in the defense software domain still experience many difficulties and face obstacles in reusing software assets. In this paper, we analyze practical problems of software reuse in the defense software domain, and define core requirements to solve those problems. To meet these requirements, we are currently developing the Component Grid system, a reuse-support system that provides a developer-centric software reuse environment. We have designed an architecture of Component Grid, and defined essential elements of the architecture. We have also developed the core approaches for developing the Component Grid system: a semantic-tagging-based requirement tracing method, a reuse-knowledge representation model, a social-network-based asset search method, a web-based asset management environment, and a wiki-based collaborative and participative knowledge construction and refinement method. We expect that the Component Grid system will contribute to increase the reusability of software assets in the defense software domain by providing the environment that supports transparent and efficient sharing and reuse of software assets.

  • PDF

A Study on the Comparison and Semantic Analysis between SNS Big Data, Search Portal Trends and Drug Case Statistics (SNS 빅데이터 및 검색포털 트렌드와 마약류 사건 통계간의 비교 및 의미분석 연구)

  • Choi, Eunjung;Lee, SuRyeon;Kwon, Hyemin;Kim, Myuhngjoo;Lee, Insoo;Lee, Seunghoon
    • Journal of Digital Convergence
    • /
    • v.19 no.2
    • /
    • pp.231-238
    • /
    • 2021
  • SNS data can catch the user's thoughts and actions. And the trend of the search portal is a representative service that can observe the interests of users and their changes. In this paper, the relationship was analyzed by comparing statistics on narcotics incidents and the degree of exposure to narcotics related words in tweets of SNS and in the trends of search portal. It was confirmed that the trend of SNS and search portal trends was the same in the statistics of the prosecution office with a certain time difference.In addition, cluster analysis was performed to understand the meaning of tweets in which narcotics related words were mentioned. In the 50,000 tweets collected in January 2020, it was possible to find meaning related to the sale of actual drugs. Therefore, through SNS monitoring alone it is possible to monitor narcotics-related incidents and to find specific sales or purchase-related information, and this can be used in the investigation process. In the future, it is expected that crime monitoring and prediction systems can be proposed as related crime analysis may be possible not only with text but also images.

A MVC Framework for Visualizing Text Data (텍스트 데이터 시각화를 위한 MVC 프레임워크)

  • Choi, Kwang Sun;Jeong, Kyo Sung;Kim, Soo Dong
    • Journal of Intelligence and Information Systems
    • /
    • v.20 no.2
    • /
    • pp.39-58
    • /
    • 2014
  • As the importance of big data and related technologies continues to grow in the industry, it has become highlighted to visualize results of processing and analyzing big data. Visualization of data delivers people effectiveness and clarity for understanding the result of analyzing. By the way, visualization has a role as the GUI (Graphical User Interface) that supports communications between people and analysis systems. Usually to make development and maintenance easier, these GUI parts should be loosely coupled from the parts of processing and analyzing data. And also to implement a loosely coupled architecture, it is necessary to adopt design patterns such as MVC (Model-View-Controller) which is designed for minimizing coupling between UI part and data processing part. On the other hand, big data can be classified as structured data and unstructured data. The visualization of structured data is relatively easy to unstructured data. For all that, as it has been spread out that the people utilize and analyze unstructured data, they usually develop the visualization system only for each project to overcome the limitation traditional visualization system for structured data. Furthermore, for text data which covers a huge part of unstructured data, visualization of data is more difficult. It results from the complexity of technology for analyzing text data as like linguistic analysis, text mining, social network analysis, and so on. And also those technologies are not standardized. This situation makes it more difficult to reuse the visualization system of a project to other projects. We assume that the reason is lack of commonality design of visualization system considering to expanse it to other system. In our research, we suggest a common information model for visualizing text data and propose a comprehensive and reusable framework, TexVizu, for visualizing text data. At first, we survey representative researches in text visualization era. And also we identify common elements for text visualization and common patterns among various cases of its. And then we review and analyze elements and patterns with three different viewpoints as structural viewpoint, interactive viewpoint, and semantic viewpoint. And then we design an integrated model of text data which represent elements for visualization. The structural viewpoint is for identifying structural element from various text documents as like title, author, body, and so on. The interactive viewpoint is for identifying the types of relations and interactions between text documents as like post, comment, reply and so on. The semantic viewpoint is for identifying semantic elements which extracted from analyzing text data linguistically and are represented as tags for classifying types of entity as like people, place or location, time, event and so on. After then we extract and choose common requirements for visualizing text data. The requirements are categorized as four types which are structure information, content information, relation information, trend information. Each type of requirements comprised with required visualization techniques, data and goal (what to know). These requirements are common and key requirement for design a framework which keep that a visualization system are loosely coupled from data processing or analyzing system. Finally we designed a common text visualization framework, TexVizu which is reusable and expansible for various visualization projects by collaborating with various Text Data Loader and Analytical Text Data Visualizer via common interfaces as like ITextDataLoader and IATDProvider. And also TexVisu is comprised with Analytical Text Data Model, Analytical Text Data Storage and Analytical Text Data Controller. In this framework, external components are the specifications of required interfaces for collaborating with this framework. As an experiment, we also adopt this framework into two text visualization systems as like a social opinion mining system and an online news analysis system.

Big Data Analysis on Daegu-Gyeongbuk Administrative Integration (대구·경북 행정통합에 대한 빅데이터 분석)

  • Song, Hwa Young;Park, Han Woo
    • The Journal of the Korea Contents Association
    • /
    • v.21 no.5
    • /
    • pp.139-148
    • /
    • 2021
  • The study examines public attitude and reaction regarding administrative integration in Daegu and Gyeongbuk area. Specifically, it employs social big data including textual comments on online news articles and YouTube video clips. The collected data are analyzed in order to compare two periods, that is, before and after the inauguration of the Public Opinion Committee for One Daegu-Gyeongbuk. As a result, we have found that people's favorable response to administrative integration has gradually increased since the launch of the Committee. However, it still lacks specific administrative procedures and discussion topics among the frequently used words in the collected data. Thus, the Committee needs to provide a variety of information and materials related to administrative integration.

The Empathy and Justice Contemplated From the Neuroscientific Perspective in the Age of Social Divisions and Conflicts (분열과 반목의 시대에 신경과학적 관점에서 고찰해보는 공감과 정의)

  • Ji-Woong, Kim
    • Korean Journal of Psychosomatic Medicine
    • /
    • v.30 no.2
    • /
    • pp.55-65
    • /
    • 2022
  • Although humans exist as Homo Empathicus, human society is actually constantly divided and conflicted between groups. The human empathy response is very sensitive to the justice of others, and depending on the level of others' justice, they may feel empathy or schadenfreude to the suffering of them. However, our empathy to others' suffering are not always fair, and have inherent limitations of ingroup-biased empathy. Depending on whether the suffering other persons belongs to an ingroup or an outgroup, we may feel biased empathy or biased schadenfreude to them without even realizing it. Recent advances in information and communication technology facilitate biased access to ingroup-related SNS or ingroup media, thereby deepening the establishment of a more biased semantic information network related groups. These processes, through interacting with the inherent limitation of empathy, can form a vicious cycle of more biased ingroup empathy and ingroup-related activities, and accelerate divisions and conflicts. This research investigated the properties and limitations of empathy by reviewing studies on the neural mechanism of empathy. By examining the relationship between empathy and justice from a neuroscientific point of view, this research tried to illuminate the modern society of division and conflict in a different dimension from the classical perspective of social science.