• Title/Summary/Keyword: 트위터 데이터 (Twitter data)

Search Result 227, Processing Time 0.028 seconds

Hot Topic Prediction Scheme Considering User Influences in Social Networks (소셜 네트워크에서 사용자의 영향력을 고려한 핫 토픽 예측 기법)

  • Noh, Yeon-woo;Kim, Dae-yun;Han, Jieun;Yook, Misun;Lim, Jongtae;Bok, Kyoungsoo;Yoo, Jaesoo
    • The Journal of the Korea Contents Association / v.15 no.8 / pp.24-36 / 2015
  • Recently, interest in detecting hot topics has grown significantly as it becomes important to find and analyze meaningful information in the large volume of data flowing in from social network services. Because SNS content consists of many arbitrary posts that are not verified in advance, the reliability of the results declines when hot topics are predicted from those posts. To solve this problem, this paper proposes a highly reliable hot topic prediction scheme that considers user influence in social networks. The proposed scheme instantly extracts a set of keywords for hot issues through a modified TF-IDF algorithm based on Twitter. It improves the reliability of hot topic prediction by weighting tweets according to user influence. To show the superiority of the proposed scheme, we compare it with an existing scheme through performance evaluation. Our experimental results show that the proposed method improves precision and recall compared to the existing method.
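The abstract above does not spell out how TF-IDF is modified or how user influence is measured, so the following is only a minimal sketch of the general idea: each tweet's term counts are scaled by an influence score for its author before a smoothed IDF is applied. The function names, the smoothing formula, and the sample influence values are all assumptions made for illustration, not the authors' scheme.

```python
import math
from collections import defaultdict

def influence_weighted_tfidf(tweets):
    """Score terms from (tokens, influence) pairs.

    Hypothetical helper: term frequency is accumulated as the sum of the
    author's influence score, so tweets from influential users count more.
    """
    n_docs = len(tweets)
    doc_freq = defaultdict(int)       # number of tweets containing each term
    weighted_tf = defaultdict(float)  # occurrences scaled by author influence
    for tokens, influence in tweets:
        for term in set(tokens):
            doc_freq[term] += 1
        for term in tokens:
            weighted_tf[term] += influence
    # Smoothed IDF (an assumption): log((1 + N) / (1 + df)) + 1
    return {
        term: tf * (math.log((1 + n_docs) / (1 + doc_freq[term])) + 1)
        for term, tf in weighted_tf.items()
    }

def top_keywords(tweets, k=3):
    """Return the k highest-scoring terms, ties broken alphabetically."""
    scores = influence_weighted_tfidf(tweets)
    return sorted(scores, key=lambda term: (-scores[term], term))[:k]

# Made-up sample: token lists paired with an invented influence score.
sample = [
    (["earthquake", "seoul"], 5.0),
    (["lunch", "seoul"], 1.0),
    (["earthquake", "alert"], 3.0),
]
print(top_keywords(sample, 2))
```

In this sketch a term used by a few influential accounts can outrank a term used by many low-influence accounts, which is the intuition behind weighting tweets by user influence.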

A Method of Identifying Ownership of Personal Information exposed in Social Network Service (소셜 네트워크 서비스에 노출된 개인정보의 소유자 식별 방법)

  • Kim, Seok-Hyun;Cho, Jin-Man;Jin, Seung-Hun;Choi, Dae-Seon
    • Journal of the Korea Institute of Information Security & Cryptology / v.23 no.6 / pp.1103-1110 / 2013
  • This paper proposes a method for identifying the ownership of personal information in social network services. Specifically, the proposed method automatically decides whether location information mentioned on Twitter indicates the publisher's area of residence. Identifying the ownership of personal information is a necessary part of evaluating the risk of personal information exposed online. The proposed method uses a set of decision rules that consider 13 features, which are lexicographic and syntactic characteristics of tweet sentences. In an experiment using real Twitter data, the proposed method performs better (F1-score: 0.876) than conventional document classification models, such as naive Bayes, that use n-grams as a feature set.

Enhancing the corporate image through social media: An approach based on multi-dimensional scaling (다차원척도법에 의한 기업이미지 제고를 위한 소셜미디어 활용방안)

  • Kim, Suhyun;Lee, Hanjun;Suh, Yongmoo;Han, Jinyoung
    • Journal of the Korean Data and Information Science Society / v.24 no.3 / pp.427-436 / 2013
  • Social media is drawing attention among companies for its potential as a marketing tool. There are many types of social media with varied characteristics, so choosing the social media appropriate to a company's purpose is important. In this paper, we conduct a comparative analysis of popular social media such as Facebook, Twitter, Naver Blog, YouTube, Cyworld and Me2day using multidimensional scaling. The results show that social media differ in how effectively they enhance the various dimensions of corporate image. These results can be used in developing social media based marketing strategies.

Measuring Similarity Between Movies Based on Sentiment of Tweets (트위터를 활용한 감성 기반의 영화 유사도 측정)

  • Kim, Kyoungmin;Kim, Dong-Yun;Lee, Jee-Hyong
    • Journal of the Korean Institute of Intelligent Systems / v.24 no.3 / pp.292-297 / 2014
  • As social network services (SNS) have become an integral part of everyday life, millions of users can express their opinions and share information regardless of time and place. Hence, sentiment analysis using microblogs has been studied in various fields to learn people's opinions on particular topics. Most previous research on movie reviews considers only positive and negative sentiment and uses it to predict movie ratings. Because people feel a variety of emotions beyond positive and negative, the sentiment people feel while watching a movie needs to be classified in more detail to extract more information than personal preference. We measure the sentiment distribution of each movie from tweets according to Thayer's model. Then, we find similar movies by calculating the similarity between sentiment distributions. Through experiments, we verify that our method using microblogs performs better than using only the genre information of movies.
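The abstract above does not say which similarity measure is applied to the sentiment distributions. As one plausible illustration, the sketch below normalizes per-movie sentiment counts and compares them with cosine similarity. The sentiment labels, counts, and function names are hypothetical, and cosine similarity is a swapped-in choice rather than the authors' confirmed measure.

```python
import math

def normalize(counts):
    """Turn raw sentiment counts into a probability distribution."""
    total = sum(counts.values())
    if total == 0:
        return {}
    return {label: count / total for label, count in counts.items()}

def cosine_similarity(p, q):
    """Cosine similarity between two sparse distributions stored as dicts."""
    labels = set(p) | set(q)
    dot = sum(p.get(label, 0.0) * q.get(label, 0.0) for label in labels)
    norm_p = math.sqrt(sum(value * value for value in p.values()))
    norm_q = math.sqrt(sum(value * value for value in q.values()))
    if norm_p == 0 or norm_q == 0:
        return 0.0
    return dot / (norm_p * norm_q)

# Hypothetical sentiment counts per movie; the labels are placeholders,
# not the categories the paper derives from Thayer's model.
movie_a = normalize({"excited": 40, "calm": 10, "tense": 30, "sad": 20})
movie_b = normalize({"excited": 35, "calm": 15, "tense": 30, "sad": 20})
movie_c = normalize({"excited": 5, "calm": 60, "tense": 5, "sad": 30})

print(cosine_similarity(movie_a, movie_b) > cosine_similarity(movie_a, movie_c))
```

Ranking all other movies by this score against a query movie would give a "similar movies" list based on sentiment rather than genre.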

Web crawling process of each social network service for recognizing water quality accidents in the water supply networks (물공급네트워크 수질사고인지를 위한 소셜네트워크 서비스 별 웹크롤링 방법론 개발)

  • Yoo, Do Guen;Hong, Seunghyeok;Moon, Gihoon
    • Proceedings of the Korea Water Resources Association Conference / 2022.05a / pp.398-398 / 2022
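  • Recently, regional water quality problems in the tap water supply process, such as red water and the occurrence of larvae, have caused direct and indirect harm to the public. When a water quality problem occurs, damage-related opinions posted on social network services (SNS) spread rapidly in space and time, ultimately increasing negative perceptions of, and reducing trust in, the entire water supply process. Efforts to minimize damage by applying various methodologies that quickly recognize water quality accidents in water supply systems are therefore essential. In general, water quality accidents can be identified from changes in time series data obtained from real-time instruments measuring various parameters, but applying such methods efficiently first requires the introduction of advanced metering infrastructure. This study takes advantage of Korea's well-developed information and communication technology environment to propose a web crawling methodology for each SNS for recognizing water quality accidents in water supply networks, and analyzes the results of applying it. Before implementing the methodology, we checked, for each SNS (Twitter, Instagram, blogs, Naver Cafe, etc.), whether programmatic web crawling was possible and over what period information could be obtained, and we present web crawling procedures centered on Naver Cafe and Twitter, which showed large influence and many related posts during similar past water quality accidents. For Naver Cafe, we compiled a list of cafes in which many citizens of the target water supply area participate, performed web crawling using combinations of the local government name and core keywords (tap water [수돗물], larvae [유충], red water [적수]), and established a procedure for analyzing the number and meaning of related posts in real time. Following the developed methodology, we analyzed two or more local governments where water quality accidents had occurred in the past, and identified and presented the differences in results across SNS. The proposed method is expected to enable further analysis of how water quality accident information propagates and spreads in space and time.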
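The keyword-combination step described above can be sketched as follows. This is only an illustration of pairing local government names with core keywords and counting matching posts. It performs no actual crawling, and the function names, the placeholder municipalities ("CityA", "CityB"), and the sample posts are all hypothetical.

```python
from itertools import product

def build_queries(municipalities, keywords):
    """Combine each local government name with each core keyword."""
    return [f"{name} {keyword}" for name, keyword in product(municipalities, keywords)]

def count_matching_posts(posts, municipalities, keywords):
    """Count posts per municipality that also mention at least one keyword."""
    counts = {name: 0 for name in municipalities}
    for text in posts:
        for name in municipalities:
            if name in text and any(keyword in text for keyword in keywords):
                counts[name] += 1
    return counts

# English stand-ins for the keywords named in the abstract.
core_keywords = ["tap water", "larvae", "red water"]
municipalities = ["CityA", "CityB"]  # placeholder names

# Made-up posts standing in for text already collected from an SNS.
posts = [
    "CityA tap water looks red this morning",
    "CityB larvae found in the tap water filter",
    "CityA festival this weekend",
]

print(build_queries(municipalities, core_keywords))
print(count_matching_posts(posts, municipalities, core_keywords))
```

In a real pipeline the generated queries would be passed to whatever collection mechanism each SNS permits, and the per-municipality counts would be tracked over time to flag sudden increases.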


Using Big Data and Small Data to Understand Linear Parks - Focused on the 606 Trail, USA and Gyeongchun Line Forest, Korea - (빅데이터와 스몰데이터로 본 선형공원 - 시카고 606 트레일과 서울 경춘선 숲길을 중심으로 -)

  • Sim, Ji-Soo;Oh, Chang Song
    • Journal of the Korean Institute of Landscape Architecture / v.48 no.5 / pp.28-41 / 2020
  • This study selects two linear parks, each representing its culture, and reveals the differences between them using a visitor survey as small data and social media analytics as big data, based on the three components of the model of landscape perception. The 606 in Chicago, U.S., and the Gyeongchun Line Forest in Seoul, Korea, are representative parks built on railroads. A total of 505 survey responses were collected from these parks. The responses were analyzed using descriptive statistics, principal component analysis, and linear regression. In addition, more than 20,000 tweets mentioning the respective parks were collected. Using those tweets, the authors conducted a clustering analysis and drew a bigram network diagram to identify and compare the placeness of each park. The results suggest that a more diverse design concept is linked to less diversity in behavior; that half of the park users use the park as a shortcut; and that the same physical exercise provides different benefits depending on the park. Social media analysis showed that the 606 is more closely related to its neighborhoods than the Gyeongchun Line Forest is. The Gyeongchun Line Forest was a more event-related place than the 606.
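A bigram network diagram like the one mentioned above is typically built from counts of adjacent word pairs, with each pair becoming a weighted edge. The sketch below shows only that counting step under simple assumptions (lowercasing and whitespace tokenization). The function name and the sample tweets are hypothetical, and the authors' actual preprocessing is not described in the abstract.

```python
from collections import Counter

def bigram_edges(tweets, min_count=1):
    """Count adjacent word pairs across tweets.

    Each (word, next_word) pair can serve as a weighted edge in a
    bigram network; pairs seen fewer than min_count times are dropped.
    """
    counts = Counter()
    for text in tweets:
        tokens = text.lower().split()
        counts.update(zip(tokens, tokens[1:]))
    return {pair: count for pair, count in counts.items() if count >= min_count}

# Made-up tweets for illustration only.
sample_tweets = [
    "Morning run on the trail",
    "Bike ride on the trail today",
    "Festival at the forest",
]

for pair, count in sorted(bigram_edges(sample_tweets, min_count=2).items()):
    print(pair, count)
```

Raising `min_count` keeps only the frequently repeated pairs, which is a common way to make the resulting network diagram readable.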

Public Perception and Usage Pattern of Science Museum by Social Media Big Data Analysis (소셜 빅데이터 분석을 통해 알아본 대중의 과학관에 대한 인식 및 사용 행태)

  • Yun, Eunjeong;Park, Yunebae
    • Journal of The Korean Association For Science Education / v.37 no.6 / pp.1005-1014 / 2017
  • Focusing on the role of the science museum as an institution for improving the scientific literacy of the public, this study used social media big data analysis to investigate public perception of and behavior toward science museums, in order to learn how much science museums affect the public. For this purpose, we extracted texts containing 'science museum' from Naver blogs and Twitter, analyzed them using network, frequency, co-occurrence, and semantic analysis, and compared them with results from English-speaking countries. We found that blog posts about science museums came mainly from parents of young children, while on Twitter many posts came from students who had visited as a group. The Korean public therefore used science museums mainly as a space for children's experiences, and in this case the programs and exhibitions of science museums were perceived positively. On the other hand, students who visited as a group showed some negative emotions. In comparison with cases from other countries in terms of the functions of the third-generation science museum, such as communication between the science museum and the public and public participation in science, the Korean public hardly mentioned scientific content, words related to communication such as 'argue', or curators and staff after visiting the science museum. Whereas many verbs related to meaningful activities, such as 'learn', 'participate', 'listen', 'read', 'ask', and 'think', appeared in English, only a small number of such verbs, including 'ask' and 'think', appeared in Korean. Therefore, science museums need to improve the impression visitors retain after a visit, communicate with the public, and offer engaging activities with impact and variety.

Exploring Twitter Follower-Networks of Startup Companies Employing Social Network Analysis and Cluster Analysis (소셜네트워크 분석과 클러스터 분석 방법을 활용한 스타트업 회사의 트위터 팔로워 네트워크에 대한 탐색적 연구)

  • Yu, Seunghee
    • Asia-Pacific Journal of Business Venturing and Entrepreneurship / v.14 no.4 / pp.199-209 / 2019
  • The importance of business strategy for successful social media engagement has quickly increased as more businesses engage in social media. The importance is even greater for startup companies because they are genuinely new to business and need to increase their presence in the market and quickly reach future customers. The objective of this paper is to explore key indicators of social media engagement by selected startup companies. The key indicators cover two aspects of the companies' social media usage: i) overall social media activities, and ii) properties of the network structure of the information flow platform provided by the social media service. To better assess and evaluate these indicators, they are compared with those of selected large established companies. Twitter is selected as the social media service for the analysis, and data on the key indicators of overall Twitter activity and on the Twitter follower-network of each company in the sample are collected using the Twitter REST API. The data are then analyzed using social network analysis and hierarchical clustering analysis to examine the characteristics of the follower-network structures and to compare them between startup companies and established companies. The results show that most indicators differ significantly between startup companies and established companies. One interesting key finding is that startup companies have proportionally more influencers in their follower-networks than established companies have. Another interesting finding is that the follower-networks of startup companies in the sample have higher modularity and higher transitivity, suggesting that startup companies tend to have a proportionally larger number of user communities in their follower-networks, and that the users in those networks are more tightly connected and internally cohesive. The key business implication for future social media engagement efforts by startup companies in general is that they may need to focus on getting more attention from influencers and on promoting more cohesive communities in their follower-networks to realize the potential benefits of social media in the early stage of business.
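Transitivity, one of the network measures named above, is the fraction of connected triples of nodes that are closed into triangles. The sketch below computes it in plain Python for a small undirected graph. Treating a follower-network as undirected is a simplification made here for illustration, and the function name and sample edges are hypothetical rather than the paper's data or tooling.

```python
from itertools import combinations

def transitivity(edges):
    """Global transitivity of an undirected graph given as (u, v) pairs.

    Computed as closed triples / all connected triples, which equals
    3 * triangles / connected triples.
    """
    adjacency = {}
    for u, v in edges:
        if u == v:
            continue  # ignore self-loops
        adjacency.setdefault(u, set()).add(v)
        adjacency.setdefault(v, set()).add(u)

    closed = 0   # triples whose two end nodes are also linked
    triples = 0  # all paths of length two, counted at their center node
    for neighbors in adjacency.values():
        for a, b in combinations(sorted(neighbors), 2):
            triples += 1
            if b in adjacency[a]:
                closed += 1
    return closed / triples if triples else 0.0

# Made-up follower links: a triangle (a, b, c) plus one pendant node d.
sample_edges = [("a", "b"), ("b", "c"), ("a", "c"), ("c", "d")]
print(transitivity(sample_edges))
```

A higher value means followers of an account are more likely to be connected to each other as well, which matches the abstract's reading of transitivity as tighter internal cohesion.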

Real-Time Ransomware Infection Detection System Based on Social Big Data Mining (소셜 빅데이터 마이닝 기반 실시간 랜섬웨어 전파 감지 시스템)

  • Kim, Mihui;Yun, Junhyeok
    • KIPS Transactions on Computer and Communication Systems / v.7 no.10 / pp.251-258 / 2018
  • Ransomware, malicious software that demands a ransom after encrypting files, is becoming more threatening as it spreads rapidly and grows more sophisticated. Rapid detection and risk analysis are required, but real-time analysis and reporting are lacking. In this paper, we propose a ransomware infection detection system that uses social big data mining technology to enable real-time analysis. The system analyzes the Twitter stream in real time and crawls tweets containing keywords related to ransomware. It also extracts ransomware-related keywords by crawling news servers through a news feed parser, and extracts news or statistical data from the servers of security companies or search engines. The collected data is analyzed by data mining algorithms. By comparing the number of related tweets, Google Trends data (statistical information), and articles related to the spread of WannaCry and Locky ransomware infections in 2017, we show that our system can potentially detect ransomware infections using tweets. Moreover, the performance of the proposed system is shown through entropy and chi-square analysis.
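The keyword-based filtering and counting of tweets described above can be sketched as follows. This is a minimal illustration that works on tweets already held in memory. It does not connect to any Twitter API, and the function name, the keyword list beyond the names mentioned in the abstract, and the sample tweets and dates are hypothetical.

```python
from collections import Counter

# "wannacry" and "locky" are named in the abstract; the list as a whole
# is an illustrative assumption, not the system's actual keyword set.
RANSOMWARE_KEYWORDS = ("ransomware", "wannacry", "locky")

def daily_keyword_counts(tweets, keywords=RANSOMWARE_KEYWORDS):
    """Count, per date, the tweets that mention any keyword.

    tweets is an iterable of (date_string, text) pairs.
    """
    counts = Counter()
    for date, text in tweets:
        lowered = text.lower()
        if any(keyword in lowered for keyword in keywords):
            counts[date] += 1
    return counts

# Made-up tweets and dates for illustration only.
sample = [
    ("day-1", "WannaCry is locking hospital computers"),
    ("day-1", "Nice weather today"),
    ("day-2", "New ransomware variant reported"),
    ("day-2", "Is Locky back again?"),
]

for date, count in sorted(daily_keyword_counts(sample).items()):
    print(date, count)
```

A daily series like this is the kind of signal that could then be compared against search-trend statistics or news article counts.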

Trends of Line Card Technology Based on 40G/100G Ethernet Standard (40G/100G 이더넷 표준 기반의 라인카드 기술 동향)

  • Yang, C.R.;Ahn, K.H.;Kim, S.H.;Ko, J.S.;Kim, K.
    • Electronics and Telecommunications Trends / v.25 no.6 / pp.110-122 / 2010
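  • With the growth of multimedia content such as UCC and Twitter, the surge in diverse new services such as utility computing, the increase in applications that demand high bandwidth such as IPTV, and the emergence of virtualized data centers, 40G/100G Ethernet technology is being put forward as one long-term solution to the bandwidth demands of next-generation broadband services, and the worldwide evolution of networks toward 40G/100G Ethernet has begun. This article reviews release trends for devices and products based on the 40G/100G Ethernet standard, the next-generation infrastructure that has recently become a hot issue worldwide, and examines the structure of a 40G Ethernet line card built with currently available commercial chips, the structure of a 100G Ethernet line card that could be implemented in the future, and OTN network applications over 40G/100G Ethernet.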