• Title/Summary/Keyword: tweets

Predicting the Unemployment Rate Using Social Media Analysis

  • Ryu, Pum-Mo
    • Journal of Information Processing Systems / v.14 no.4 / pp.904-915 / 2018
  • We demonstrate how social media content can be used to predict the unemployment rate, a real-world indicator. We present a novel method for predicting the unemployment rate using social media analysis based on natural language processing and statistical modeling. The system collects social media contents including news articles, blogs, and tweets written in Korean, and then extracts data for modeling using part-of-speech tagging and sentiment analysis techniques. The autoregressive integrated moving average with exogenous variables (ARIMAX) and autoregressive with exogenous variables (ARX) models for unemployment rate prediction are fit using the analyzed data. The proposed method quantifies the social moods expressed in social media contents, whereas the existing methods simply present social tendencies. Our model derived a 27.9% improvement in error reduction compared to a Google Index-based model in the mean absolute percentage error metric.
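The abstract reports its result in terms of mean absolute percentage error (MAPE). As a minimal sketch of that metric, assuming the "improvement" is a relative reduction of a baseline MAPE (the abstract does not spell out the formula), the function names and sample numbers below are hypothetical:

```python
def mape(actual, predicted):
    """Mean absolute percentage error, returned in percent."""
    if len(actual) != len(predicted) or not actual:
        raise ValueError("inputs must be non-empty and of equal length")
    return 100.0 * sum(abs((a - p) / a) for a, p in zip(actual, predicted)) / len(actual)


def relative_error_reduction(baseline_mape, model_mape):
    """Percentage by which model_mape is lower than baseline_mape (assumed definition)."""
    return 100.0 * (baseline_mape - model_mape) / baseline_mape


# Hypothetical monthly unemployment rates (percent), not data from the paper.
actual = [3.0, 4.0]
baseline_forecast = [3.6, 3.2]
model_forecast = [3.3, 3.6]
reduction = relative_error_reduction(mape(actual, baseline_forecast),
                                     mape(actual, model_forecast))
```

Under this reading, a reduction of 27.9 would mean the proposed model's MAPE is 27.9% lower than that of the Google Index-based model.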

Content Modeling Based on Social Network Community Activity

  • Kim, Kyung-Rog;Moon, Nammee
    • Journal of Information Processing Systems / v.10 no.2 / pp.271-282 / 2014
  • The advancement of knowledge society has enabled the social network community (SNC) to be perceived as another space for learning where individuals produce, share, and apply content in self-directed ways. The content generated within social networks provides information of value for the participants in real time. Thus, this study proposes the social network community activity-based content model (SoACo Model), which takes SNC-based activities and embodies them within learning objects. The SoACo Model consists of content objects, aggregation levels, and information models. Content objects are composed of relationship-building elements, including real-time, changeable activities such as making friends, and participation-activity elements such as "Liking" specific content. Aggregation levels apply one of three granularity levels considering the reusability of elements: activity assets, real-time, changeable learning objects, and content. The SoACo Model is meaningful because it transforms SNC-based activities into learning objects for learning and teaching activities and applies to learning management systems since they organize activities -- such as tweets from Twitter -- depending on the teacher's intention.

The Management of Medical Information Quality Utilizing Big Data (빅 데이터를 활용한 의료정보 질 관리)

  • Cho, Young-bok;Woo, Sung-Hee;Lee, Sang-Ho
    • Proceedings of the Korean Institute of Information and Communication Sciences Conference / 2014.05a / pp.728-731 / 2014
  • Today, the quality of medical services has become a major concern because of the sustained development of IT and rising life expectancy. In this paper, tweet big data generated in individuals' daily lives is analyzed and used as a tool for medical information quality management. The results of the big data analysis support improving medical information on the basis of evidence-based medicine. They also make follow-up observation of chronic diseases possible and can reduce additional complications in patients. Effective treatment and prevention of disease therefore become possible.

Geo-spatial Analysis of the Seoul Subway Station Areas Using the Haversine Distance and the Azimuth Angle Formulas (다트판형 공간분할 기법을 이용한 서울지역 지하철 역세권 분석)

  • Cho, Jae Hee;Baik, Eui Young
    • Journal of Information Technology Services / v.17 no.4 / pp.139-150 / 2018
  • This paper investigated the distribution of people in subway station areas in Seoul, using geotweets and subway ridership data. Eight stations were selected from the districts of Gangnam and Gangbuk. Geotweets located within a 600-meter radius of the central coordinates of each station were extracted, and the distance between the station center and each tweet location was calculated. A donut-shaped dimension and a pie-shaped dimension were generated using the Haversine distance formula and the azimuth angle formula, respectively. Combining the two dimensions yields a dartboard-shaped space division. The popular places identified within the subway station areas are almost the same as the currently well-known popular places, which is an important case showing that people send tweets from the various places where they engage in daily activities. We expect this study to serve as a methodological guideline for social scientists who use spatio-temporal or GPS data in their research.
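The dartboard-shaped division described above can be sketched as follows. This is an illustrative reading, not the authors' implementation: the ring width (200 m), the number of sectors (8), the Earth radius constant, and all function names are assumptions; only the 600-meter radius comes from the abstract.

```python
import math

EARTH_RADIUS_M = 6_371_000  # mean Earth radius in metres (assumed constant)


def haversine_m(lat1, lon1, lat2, lon2):
    """Great-circle distance in metres between two points given in degrees."""
    phi1, phi2 = math.radians(lat1), math.radians(lat2)
    dphi = phi2 - phi1
    dlmb = math.radians(lon2 - lon1)
    a = math.sin(dphi / 2) ** 2 + math.cos(phi1) * math.cos(phi2) * math.sin(dlmb / 2) ** 2
    return 2 * EARTH_RADIUS_M * math.asin(math.sqrt(a))


def azimuth_deg(lat1, lon1, lat2, lon2):
    """Initial bearing from point 1 to point 2, in degrees clockwise from north."""
    phi1, phi2 = math.radians(lat1), math.radians(lat2)
    dlmb = math.radians(lon2 - lon1)
    y = math.sin(dlmb) * math.cos(phi2)
    x = math.cos(phi1) * math.sin(phi2) - math.sin(phi1) * math.cos(phi2) * math.cos(dlmb)
    return math.degrees(math.atan2(y, x)) % 360


def dartboard_cell(center, point, ring_width_m=200, n_sectors=8, max_radius_m=600):
    """Return (ring, sector) for a tweet location, or None if it lies outside the radius.

    ring is the donut-shaped dimension (from distance), sector the pie-shaped one
    (from azimuth). ring_width_m and n_sectors are hypothetical parameters.
    """
    distance = haversine_m(*center, *point)
    if distance > max_radius_m:
        return None
    ring = min(int(distance // ring_width_m), max_radius_m // ring_width_m - 1)
    sector = int(azimuth_deg(*center, *point) // (360 / n_sectors)) % n_sectors
    return ring, sector
```

Counting tweets per `(ring, sector)` cell then gives a simple density map around each station center.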

Discovery of Urban Area and Spatial Distribution of City Population using Geo-located Tweet Data (위치기반 트윗 데이터를 이용한 도심권 추정과 인구의 공간분포 분석)

  • Kim, Tae Kyu;Lee, Jin Kyu;Cho, Jae Hee
    • Journal of Information Technology Services / v.18 no.1 / pp.131-140 / 2019
  • This study compares and analyzes the spatial distribution of people in two cities using location information in Twitter data. The target cities are Paris, a traditional tourist city, and Dubai, a tourist city that has recently attracted attention. The data was collected over 123 days in 2016 and 125 days in 2018. We compared the spatial distributions of the two cities across the two periods and by residence status. We identified hot places using a spatial statistical model called dart-shaped space division and estimated the urban area from the distribution of the tweet population. We also visualized the results as a CDF (cumulative distribution function) curve so that the distances between the tweets' points of origin and the city center can be compared across cities.
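The CDF comparison mentioned above can be illustrated with an empirical CDF over tweet-to-center distances. This is a minimal sketch with hypothetical function names; how the authors actually built their curves is not stated in the abstract.

```python
def empirical_cdf(distances):
    """Return sorted (distance, cumulative fraction) pairs for plotting a CDF curve."""
    ordered = sorted(distances)
    n = len(ordered)
    return [(d, (i + 1) / n) for i, d in enumerate(ordered)]


def fraction_within(distances, radius):
    """Share of tweets whose distance from the city center is at most radius."""
    if not distances:
        raise ValueError("distances must be non-empty")
    return sum(d <= radius for d in distances) / len(distances)
```

Plotting the output of `empirical_cdf` for each city on the same axes shows, at any given distance, which city has a larger share of its tweets concentrated near the center.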

Twitter HashTag Recommendation Scheme based on Similar Tweet Analysis (유사 트윗 분석에 기반한 트위터 해시태그 추천기법)

  • Jeon, Mina;Jun, Sanghoon;Hwang, Eenjun
    • Proceedings of the Korea Information Processing Society Conference / 2013.11a / pp.962-963 / 2013
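  • A Twitter hashtag (#) is a user-defined tag used to classify specific keywords or content in tweets by topic and to make search more efficient. Because hashtags are written in many different forms depending on how users define them, they can actually reduce search efficiency, and users are sometimes unsure which hashtags to add to the tweets they write. To address this problem, this paper proposes a scheme that recommends hashtags suited to a user's tweet. Keywords are extracted from the collected tweets and hashtags, and TF-IDF and cosine similarity are applied to compute tweet similarity, so that hashtags attached to similar tweets are recommended. As an experiment to validate the proposed scheme, we evaluated the accuracy of the recommendations.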
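The recommendation idea in this abstract can be sketched with TF-IDF vectors and cosine similarity. This is an illustration under assumptions, not the paper's method: tweets are taken as already tokenized, the smoothed IDF variant and the rule of summing similarities per hashtag are choices made here, and all names and sample data are hypothetical.

```python
import math
from collections import Counter


def tfidf_vectors(docs):
    """docs: list of token lists. Returns one {term: weight} dict per document."""
    n = len(docs)
    df = Counter(term for doc in docs for term in set(doc))
    # Smoothed IDF (an assumed variant): log((1 + n) / (1 + df)) + 1
    idf = {t: math.log((1 + n) / (1 + c)) + 1 for t, c in df.items()}
    return [{t: (cnt / len(doc)) * idf[t] for t, cnt in Counter(doc).items()}
            for doc in docs]


def cosine(u, v):
    """Cosine similarity between two sparse vectors stored as dicts."""
    dot = sum(w * v.get(t, 0.0) for t, w in u.items())
    norm_u = math.sqrt(sum(w * w for w in u.values()))
    norm_v = math.sqrt(sum(w * w for w in v.values()))
    if norm_u == 0 or norm_v == 0:
        return 0.0
    return dot / (norm_u * norm_v)


def recommend_hashtags(new_tokens, corpus, top_k=2):
    """corpus: list of (tokens, hashtags) pairs from previously collected tweets."""
    vectors = tfidf_vectors([tokens for tokens, _ in corpus] + [new_tokens])
    query = vectors[-1]
    scores = Counter()
    for vec, (_, tags) in zip(vectors, corpus):
        similarity = cosine(query, vec)
        for tag in tags:
            scores[tag] += similarity  # hashtags of similar tweets accumulate score
    return [tag for tag, score in scores.most_common(top_k) if score > 0]


# Hypothetical collected tweets with their hashtags.
corpus = [
    (["new", "phone", "camera", "review"], ["#tech"]),
    (["football", "match", "goal"], ["#sports"]),
    (["phone", "battery", "review"], ["#tech", "#gadgets"]),
]
suggested = recommend_hashtags(["phone", "camera"], corpus, top_k=1)
```

Here the new tweet shares terms only with the two phone-related tweets, so `#tech` receives the highest accumulated similarity.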

An Analysis of Correlation between Movie Attendance and Related Tweets for Predicting Box Office (영화 흥행 예측을 위한 영화 관객 수와 관련 트윗간의 상관관계 분석)

  • Yim, Junyeob;Hwang, Byung-Yeon
    • Proceedings of the Korea Information Processing Society Conference / 2013.11a / pp.1245-1247 / 2013
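  • As demand for movies has increased in recent years, the domestic film market has continued to grow. Accordingly, various studies are being conducted on predicting box-office performance in order to remove risk factors and succeed in the market. However, it is very difficult to express the correlations among the factors relevant to such prediction as exact figures, and related research is still insufficient. In this paper, we treat tweets posted on Twitter as a survey sample and analyze the correlation between movie-related tweets and attendance, which represents a movie's box-office performance, to derive correlation coefficients. The experimental results show that the attendance data for all 10 movies used in the experiment were positively correlated with the rate of related tweets, suggesting that Twitter can be used to predict whether a movie will succeed at the box office.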
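A correlation coefficient of the kind described here can be computed as below. This sketch assumes the Pearson coefficient (the abstract does not name the coefficient used), and the daily figures are hypothetical, not data from the paper.

```python
import math


def pearson_r(xs, ys):
    """Pearson correlation coefficient between two equal-length sequences."""
    n = len(xs)
    if n != len(ys) or n < 2:
        raise ValueError("need two sequences of equal length >= 2")
    mean_x, mean_y = sum(xs) / n, sum(ys) / n
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    spread_x = math.sqrt(sum((x - mean_x) ** 2 for x in xs))
    spread_y = math.sqrt(sum((y - mean_y) ** 2 for y in ys))
    if spread_x == 0 or spread_y == 0:
        raise ValueError("correlation is undefined for a constant sequence")
    return cov / (spread_x * spread_y)


# Hypothetical daily values for one movie: share of related tweets vs. attendance.
tweet_rate = [0.8, 1.1, 1.5, 1.2, 0.9]
attendance = [52_000, 61_000, 90_000, 70_000, 55_000]
r = pearson_r(tweet_rate, attendance)  # a positive r means the two rise together
```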

Crossing the "Great Fire Wall": A Study with Grounded Theory Examining How China Uses Twitter as a New Battlefield for Public Diplomacy

  • Guo, Jing
    • Journal of Public Diplomacy / v.1 no.2 / pp.49-74 / 2021
  • In this paper, I applied grounded theory in exploring how Twitter became the battlefield for China's public diplomacy campaign. China's new move to global social media platforms, such as Twitter and Facebook, has been a controversial strategy in public diplomacy. This study analyzes Chinese Foreign Spokesperson Zhao Lijian's Twitter posts and comments. It models China's recent diplomatic move to Twitter as a "war of words" model, with features including "leadership," "polarization," and "aggression," while exerting possible effects as "resistance," "hatred," and "sarcasm" to the global community. Our findings show that by failing to gauge public opinion and promote the country's positive image, China's current digital diplomacy strategy reflected by Zhao Lijian's tweets has instead constructed a polarized political public sphere, contradictory to the country's promoted "shared human destiny." The "war of words" model extends our understanding of China's new digital diplomacy move as a hybrid of state propaganda and self-performance. Such a strategy could spread hate speech and accelerate political polarization in cyberspace, despite improvements to China's homogenous network building on Twitter.

A Study on the Sentiment Analysis of Contemporary Pop Musicians and Classical Music Composers

  • Park, Youngjoo
    • International Journal of Advanced Culture Technology / v.10 no.3 / pp.352-359 / 2022
  • The study performed a sentiment analysis of Twitter messages about contemporary pop musicians and classical music composers. Musicians of each genre were carefully selected for the analysis. Many opinion messages that users had posted in tweets were collected and evaluated using a Naïve Bayes classifier. The results demonstrated that users showed highly positive sentiments toward both genres. However, on average, the positive sentiment values for classical music composers are higher than those for contemporary pop musicians. In addition, the rankings of the highest positive sentiments among contemporary pop musicians and classical music composers did not coincide with the popularity of the musicians in the two genres. This study will contribute to future sentiment analysis research on music and musicians.
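To make the classification step concrete, here is a minimal multinomial Naïve Bayes sketch with Laplace smoothing. It assumes pre-tokenized tweets and two labels; the class name, the smoothing value, and the training examples are hypothetical and not taken from the study.

```python
import math
from collections import Counter, defaultdict


class NaiveBayesSentiment:
    """Multinomial Naive Bayes over token counts with add-alpha smoothing."""

    def __init__(self, alpha=1.0):
        self.alpha = alpha
        self.class_counts = Counter()            # number of documents per label
        self.word_counts = defaultdict(Counter)  # token counts per label
        self.vocab = set()

    def fit(self, docs, labels):
        for tokens, label in zip(docs, labels):
            self.class_counts[label] += 1
            self.word_counts[label].update(tokens)
            self.vocab.update(tokens)
        return self

    def predict(self, tokens):
        total_docs = sum(self.class_counts.values())
        vocab_size = len(self.vocab)
        best_label, best_score = None, -math.inf
        for label, n_docs in self.class_counts.items():
            score = math.log(n_docs / total_docs)  # log prior
            total_words = sum(self.word_counts[label].values())
            for tok in tokens:
                if tok not in self.vocab:
                    continue  # ignore tokens never seen in training
                count = self.word_counts[label][tok]
                score += math.log((count + self.alpha) / (total_words + self.alpha * vocab_size))
            if score > best_score:
                best_label, best_score = label, score
        return best_label


# Hypothetical labelled tweets.
train_docs = [["love", "this", "song"], ["beautiful", "symphony"],
              ["boring", "album"], ["hate", "this", "song"]]
train_labels = ["pos", "pos", "neg", "neg"]
model = NaiveBayesSentiment().fit(train_docs, train_labels)
```

Averaging the share of tweets predicted positive per musician would give a per-artist positive sentiment value comparable to those the abstract discusses.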

Term Frequency-Inverse Document Frequency (TF-IDF) Technique Using Principal Component Analysis (PCA) with Naive Bayes Classification

  • J.Uma;K.Prabha
    • International Journal of Computer Science & Network Security / v.24 no.4 / pp.113-118 / 2024
  • Performing sentiment analysis on Twitter is difficult, even though it is widely used for analyzing reviews. The difficulty arises because tweets are extremely short and mostly contain slang, emoticons, and hashtags alongside other words. Feature extraction covers the techniques for deriving structure and aspect features from particular tweets. Each element of a feature vector is a number that contributes to assigning a sentiment class to a tweet. The goal of feature extraction is to select the right features in order to improve the accuracy of the classification models. In this manuscript, we propose combining the Term Frequency-Inverse Document Frequency (TF-IDF) method with Principal Component Analysis (PCA) and Naïve Bayes classifiers. In the classification process, the proposed work can produce different aspects from highly valued features drawn from a Twitter dataset.