• Title/Summary/Keyword: Social big data analysis

Search Result 731, Processing Time 0.033 seconds

Design and Implementation of an Urban Safety Service System Using Realtime Weather and Atmosphere Data (실시간 기상 및 대기 데이터를 활용한 도시안전서비스 시스템 설계 및 구현)

  • Hwang, Hyunsuk;Seo, Youngwon;Jeon, Taegun;Kim, Changsoo
    • Journal of Korea Multimedia Society
    • /
    • v.21 no.5
    • /
    • pp.599-608
    • /
    • 2018
  • As natural disasters are increasing due to the unusual weather and the modern society is getting complicated, the rapid change of the urban environment has increased human disasters. Thus, citizens are becoming more anxious about social safety. The importance of preparation for safety has been suggested by providing the disaster safety services such as regional safety index, life safety map, and disaster safety portal application. In this paper, we propose an application framework to predict the urban safety index based on user's location with realtime weather/atmosphere data after creating a predication model based on the machine learning using number of occurrence cases and weather/atmosphere history data. Also, we implement an application to provide traffic safety index with executing preprocessing occurrence cases of traffic and weather/atmosphere data. The existing regional safety index, which is displayed on the Si-gun-gu area, has been mainly utilized to establish safety plans for districts vulnerable to national policies on safety. The proposed system has an advantage to service useful information to citizens by providing urban safety index based on location of interests and current position with realtime related data.

Machine Learning based Firm Value Prediction Model: using Online Firm Reviews (머신러닝 기반의 기업가치 예측 모형: 온라인 기업리뷰를 활용하여)

  • Lee, Hanjun;Shin, Dongwon;Kim, Hee-Eun
    • Journal of Internet Computing and Services
    • /
    • v.22 no.5
    • /
    • pp.79-86
    • /
    • 2021
  • As the usefulness of big data analysis has been drawing attention, many studies in the business research area begin to use big data to predict firm performance. Previous studies mainly rely on data outside of the firm through news articles and social media platforms. The voices within the firm in the form of employee satisfaction or evaluation of the strength and weakness of the firm can potentially affect firm value. However, there is insufficient evidence that online employee reviews are valid to predict firm value because the data is relatively difficult to obtain. To fill this gap, from 2014 to 2019, we employed 97,216 reviews collected by JobPlanet, an online firm review website in Korea, and developed a machine learning-based predictive model. Among the proposed models, the LSTM-based model showed the highest accuracy at 73.2%, and the MAE showed the lowest error at 0.359. We expect that this study can be a useful case in the field of firm value prediction on domestic companies.

A Study on the Smart Tourism Awareness through Bigdata Analysis

  • LEE, Song-Yi;LEE, Hwan-Soo
    • The Journal of Industrial Distribution & Business
    • /
    • v.11 no.5
    • /
    • pp.45-52
    • /
    • 2020
  • Purpose: In the 4th industrial revolution, services that incorporate various smart technologies in the tourism sector have begun to gain popularity. Accordingly, academic discussions on smart tourism have also started to become active in various fields. Despite recent research, the definition of smart tourism is still ambiguous, and it is not easy to differentiate its scope or characteristics from traditional tourism concepts. Thus, this study aims to analyze the perception of smart tourism exposed online to identify the current point of smart tourism in Korea and present the research direction for conceptualizing smart tourism suitable for the domestic situation. Research design, data, and methodology: This study analyzes the perception of smart tourism exposed online based on 20,198 news data from portal sites over the past six years. Data on words used with smart tourism were collected from the leading portal sites Naver, Daum, and Google. Text mining techniques were applied to identify the social awareness status of smart tourism. Network analysis was used to visualize the results between words related to smart tourism, and CONCOR analysis was conducted to derive clusters formed by words having similarity. Results: As a result of keyword analysis, the frequency of words related to the development and construction of smart tourism areas was high. The analysis of the centrality of the connection between words showed that the frequency of keywords was similar, and that the words "smartphones" and "China" had relatively high connection centrality. The results of network analysis and CONCOR indicated that words were formed into eight groups including related technologies, promotion, globalization, service introduction, innovation, regional society, activation, and utilization guide. The overall results of data analysis showed that the development of smart tourism cities was a noticeable issue. Conclusions: This study is meaningful in that it clearly reflects the differences in the perception of smart tourism between online and research trends despite various efforts to develop smart tourism in Korea. In addition, this study highlights the need to understand smart tourism concepts and enhance academic discussions. It is expected that such academic discussions will contribute to improving the competitiveness of smart tourism research in Korea.

A Study on Tourism Behavior in the New normal Era Using Big Data (빅데이터를 활용한 뉴노멀(New normal)시대의 관광행태 변화에 관한 연구)

  • Kyoung-mi Yoo;Jong-cheon Kang;Youn-hee Choi
    • The Journal of the Convergence on Culture Technology
    • /
    • v.9 no.3
    • /
    • pp.167-181
    • /
    • 2023
  • This study utilized TEXTOM, a social network analysis program to analyze changes in current tourism behavior after travel restrictions were eased after the outbreak of COVID-19. Data on the keywords 'domestic travel' and 'overseas travel' were collected from blogs, cafes, and news provided by Naver, Google, and Daum. The collection period was set from April to December 2022 when social distancing was lifted, and 2019 and 2020 were each set as one year and compared and analyzed with 2022. A total of 80 key words were extracted through text mining and centrality analysis was performed using NetDraw. Finally, through the CONCOR, the correlated keywords were clustered into 4. As a result of the study, tourism behavior in 2022 shows tourism recovery before the outbreak of COVID-19, segmentation of travel based on each person's preferred theme, prioritization of each country's corona mitigation policy, and then selecting a tourist destination. It is expected to provide basic data for the development of tourism marketing strategies and tourism products for the newly emerging tourism ecosystem after COVID-19.

A Study on Disaster Safety Management Policy Using the 4th Industrial Revolution and ICBMS (4차 산업혁명과 ICBMS를 활용한 재난안전관리에 관한 연구)

  • Kang, Heau-Jo
    • Journal of Digital Contents Society
    • /
    • v.18 no.6
    • /
    • pp.1213-1216
    • /
    • 2017
  • Recently due to the increasing uncertainty of the disaster environment caused by climate change the effects of disasters have become larger due to the confluence and solidification diversification into disaster type and secondary damage. In this paper, we apply ICBMS through intelligent information technology and big data analysis to all processes of disaster safety management to minimize human, social, economic and environment damage from accidents or disasters, and prevention by control technology preparation by education and training expansion to remember by body, response by advanced technology of disaster response unmanned technology restoration by creation of local community environment ecosystem, investigation and analysis by intelligent information technology learn about disaster safety management 4.0. In addition, technical limitation and problems in the $4^{th}$ industrial revolution and the application of big data were analyzed and suggested alternatives and strategies to overcome.

Analysis of preference convergence by analyzing search words for oralcare products : Using the Google trend (구강관리용품에 대한 검색어 분석을 통한 선호도 융합 분석 : 구글트렌드를 이용하여)

  • Moon, Kyung-Hui;Kim, Jang-Mi
    • Journal of the Korea Convergence Society
    • /
    • v.10 no.6
    • /
    • pp.59-64
    • /
    • 2019
  • This study used the Google Trends site to analyze selection information that users expect from prominent Toothbrushes and Toothpastes through related search keywords that users wanted to obtain. From 2006 to 2018(sep), searches for Toothbrushes and Toothpastes were arranged in the order of popularity of related searched words. The total number of searches words exposed was each 25, total 325 collected. The analysis was conducted using two methods, first, by search function. second, by a word network using a Big Data program. The study has shown that toothbrushes there are high expectations for brands, toothpaste there are high expectations in the function. In order to increase the motivation for oral health education, it is recommended to use and provide knowledge about the brand of toothbrushes and Toothpastes by the function.

A Study on the Changes of the Restaurant Industry Before and After COVID-19 Using BigData (빅데이터를 활용한 코로나 19 이전과 이후 외식산업의 변화에 관한 연구)

  • Ahn, Youn Ju
    • The Journal of the Convergence on Culture Technology
    • /
    • v.8 no.6
    • /
    • pp.787-793
    • /
    • 2022
  • After COVID-19, with the emergence of social distancing, non-face-to-face services, and home economics, visiting dining out is rapidly being replaced by non-face-to-face dining out. The purpose of this study is to find ways to create a safe dining culture centered on living quarantine in line with the changing trend of the restaurant industry after the outbreak of COVID-19, establish the direction of food culture improvement projects, and enhance the effectiveness of the project. This study used TEXTOM to collect and refine search frequency, perform TF-IDF analysis, and Ucinet6 programs to implement visualization using NetDraw from January 1, 2018 to October 31, 2019 and December 31, 2021, and identified the network between nodes of key keywords. Finally, clustering between them was performed through Concor analysis. As a result of the study, if you check the frequency of searches before and after COVID-19, it can be seen that the COVID-19 pandemic greatly affects the changes in the restaurant industry.

Big Data-based Monitoring System Design for Water Quality Analysis that Affects Human Life Quality (인간의 삶의 질에 영향을 끼치는 수질(물) 분석을 위한 빅데이터 기반 모니터링 시스템 설계)

  • Park, Sung-Hoon;Seo, Yong-Cheol;Kim, Yong-Hwan;Pang, Seung-Peom
    • Journal of Korea Entertainment Industry Association
    • /
    • v.15 no.3
    • /
    • pp.289-295
    • /
    • 2021
  • Today, the most important factor affecting the quality of human life is thought to be due to the environment. The importance of environmental monitoring systems to improve human life and improve welfare as the magnitude of the damage increases year by year due to the rapid increase in the frequency of hail, typhoons, collapse of incisions, landslides, etc. Is increasing day by day. Among environmental problems, problems caused by water quality have a very high proportion, and as there is a growing concern that the scale of damage will increase when water pollution accidents occur due to urbanization and industrialization, the demand for social water safety nets is increasing. have. In the last 5 years, 259 cases of water pollution (Han River 99, Nakdong River 31, Geum River 25, Seomjin River and Yeongsan River 19, and 85 others) have occurred in the four major river basins. Caused damage. Therefore, it is required to establish a water quality environment management strategy system based on big data that can minimize the uncertainty of the water quality environment by expanding the target of water quality management from the current water quality management system centered on the four major rivers to small and medium-sized rivers, tributaries/branches, and reservoirs. In this paper, we intend to construct and analyze a water quality monitoring system based on big data that can present useful water quality environment information by analyzing the water quality information accumulated for a long time.

Improving Performance of Recommendation Systems Using Topic Modeling (사용자 관심 이슈 분석을 통한 추천시스템 성능 향상 방안)

  • Choi, Seongi;Hyun, Yoonjin;Kim, Namgyu
    • Journal of Intelligence and Information Systems
    • /
    • v.21 no.3
    • /
    • pp.101-116
    • /
    • 2015
  • Recently, due to the development of smart devices and social media, vast amounts of information with the various forms were accumulated. Particularly, considerable research efforts are being directed towards analyzing unstructured big data to resolve various social problems. Accordingly, focus of data-driven decision-making is being moved from structured data analysis to unstructured one. Also, in the field of recommendation system, which is the typical area of data-driven decision-making, the need of using unstructured data has been steadily increased to improve system performance. Approaches to improve the performance of recommendation systems can be found in two aspects- improving algorithms and acquiring useful data with high quality. Traditionally, most efforts to improve the performance of recommendation system were made by the former approach, while the latter approach has not attracted much attention relatively. In this sense, efforts to utilize unstructured data from variable sources are very timely and necessary. Particularly, as the interests of users are directly connected with their needs, identifying the interests of the user through unstructured big data analysis can be a crew for improving performance of recommendation systems. In this sense, this study proposes the methodology of improving recommendation system by measuring interests of the user. Specially, this study proposes the method to quantify interests of the user by analyzing user's internet usage patterns, and to predict user's repurchase based upon the discovered preferences. There are two important modules in this study. The first module predicts repurchase probability of each category through analyzing users' purchase history. We include the first module to our research scope for comparing the accuracy of traditional purchase-based prediction model to our new model presented in the second module. This procedure extracts purchase history of users. The core part of our methodology is in the second module. This module extracts users' interests by analyzing news articles the users have read. The second module constructs a correspondence matrix between topics and news articles by performing topic modeling on real world news articles. And then, the module analyzes users' news access patterns and then constructs a correspondence matrix between articles and users. After that, by merging the results of the previous processes in the second module, we can obtain a correspondence matrix between users and topics. This matrix describes users' interests in a structured manner. Finally, by using the matrix, the second module builds a model for predicting repurchase probability of each category. In this paper, we also provide experimental results of our performance evaluation. The outline of data used our experiments is as follows. We acquired web transaction data of 5,000 panels from a company that is specialized to analyzing ranks of internet sites. At first we extracted 15,000 URLs of news articles published from July 2012 to June 2013 from the original data and we crawled main contents of the news articles. After that we selected 2,615 users who have read at least one of the extracted news articles. Among the 2,615 users, we discovered that the number of target users who purchase at least one items from our target shopping mall 'G' is 359. In the experiments, we analyzed purchase history and news access records of the 359 internet users. From the performance evaluation, we found that our prediction model using both users' interests and purchase history outperforms a prediction model using only users' purchase history from a view point of misclassification ratio. In detail, our model outperformed the traditional one in appliance, beauty, computer, culture, digital, fashion, and sports categories when artificial neural network based models were used. Similarly, our model outperformed the traditional one in beauty, computer, digital, fashion, food, and furniture categories when decision tree based models were used although the improvement is very small.

A Suggestion for Spatiotemporal Analysis Model of Complaints on Officially Assessed Land Price by Big Data Mining (빅데이터 마이닝에 의한 공시지가 민원의 시공간적 분석모델 제시)

  • Cho, Tae In;Choi, Byoung Gil;Na, Young Woo;Moon, Young Seob;Kim, Se Hun
    • Journal of Cadastre & Land InformatiX
    • /
    • v.48 no.2
    • /
    • pp.79-98
    • /
    • 2018
  • The purpose of this study is to suggest a model analysing spatio-temporal characteristics of the civil complaints for the officially assessed land price based on big data mining. Specifically, in this study, the underlying reasons for the civil complaints were found from the spatio-temporal perspectives, rather than the institutional factors, and a model was suggested monitoring a trend of the occurrence of such complaints. The official documents of 6,481 civil complaints for the officially assessed land price in the district of Jung-gu of Incheon Metropolitan City over the period from 2006 to 2015 along with their temporal and spatial poperties were collected and used for the analysis. Frequencies of major key words were examined by using a text mining method. Correlations among mafor key words were studied through the social network analysis. By calculating term frequency(TF) and term frequency-inverse document frequency(TF-IDF), which correspond to the weighted value of key words, I identified the major key words for the occurrence of the civil complaint for the officially assessed land price. Then the spatio-temporal characteristics of the civil complaints were examined by analysing hot spot based on the statistics of Getis-Ord $Gi^*$. It was found that the characteristic of civil complaints for the officially assessed land price were changing, forming a cluster that is linked spatio-temporally. Using text mining and social network analysis method, we could find out that the occurrence reason of civil complaints for the officially assessed land price could be identified quantitatively based on natural language. TF and TF-IDF, the weighted averages of key words, can be used as main explanatory variables to analyze spatio-temporal characteristics of civil complaints for the officially assessed land price since these statistics are different over time across different regions.