• Title/Summary/Keyword: news data

Occupational Therapy in Long-Term Care Insurance For the Elderly Using Text Mining (텍스트 마이닝을 활용한 노인장기요양보험에서의 작업치료: 2007-2018년)

  • Cho, Min Seok;Baek, Soon Hyung;Park, Eom-Ji;Park, Soo Hee
    • Journal of Society of Occupational Therapy for the Aged and Dementia / v.12 no.2 / pp.67-74 / 2018
  • Objective: This study quantitatively analyzes the role of occupational therapy in long-term care insurance for the elderly using text mining, one of the big data analysis techniques. Method: Newspaper articles matching the search "Long-Term Care Insurance for the Elderly + Occupational Therapy for the Elderly" and published from 2007 to 2018 were collected. Because Naver has a high share of the domestic search engine market, the Naver News database was used, and articles were gathered with Textom, a web crawling tool. After collecting the titles and full text of the 510 news articles returned by the search, we analyzed article frequency and keywords by year. Result: By year, 2015 and 2017 had the most articles, with 70 articles (13.7%) each. Among the top 10 terms in the keyword analysis, 'dementia' had the highest frequency (344). The related keywords were dementia, treatment, hospital, health, service, rehabilitation, facilities, institution, grade, elderly, professional, salary, industrial complex and people. Conclusion: This study is meaningful in that text mining was used to confirm more objectively the social needs and the role of the occupational therapist in dementia and rehabilitation, based on the key keywords in 11 years of media reporting on long-term care insurance for the elderly. Future research should expand the research field and period and supplement the methodology with various analysis methods by year.

Analyzing the Effect of Characteristics of Dictionary on the Accuracy of Document Classifiers (용어 사전의 특성이 문서 분류 정확도에 미치는 영향 연구)

  • Jung, Haegang;Kim, Namgyu
    • Management & Information Systems Review / v.37 no.4 / pp.41-62 / 2018
  • As the volume of unstructured data grows through social media, Internet news articles, and blogs, text analysis and related studies are becoming more important. Since text analysis is mostly performed on a specific domain or topic, constructing and applying a domain-specific dictionary has also become more important. The quality of the dictionary directly affects the results of unstructured data analysis, and it matters all the more because the dictionary defines the perspective of the analysis. In the literature, most studies on text analysis have emphasized the importance of dictionaries for obtaining clean, high-quality results. However, the effect of dictionaries has not been rigorously verified, even though the dictionary is known to be the most essential factor in text analysis. In this paper, we build three dictionaries in different ways from 39,800 news articles and, by defining the concept of Intrinsic Rate, analyze and verify the effect of each dictionary on the accuracy of document classification. The three construction methods are: 1) a batch method that builds a dictionary from term frequencies across the entire document set; 2) a method that extracts terms by category and integrates them; and 3) a method that extracts features for each category and integrates them. We compared the accuracy of three artificial neural network-based document classifiers to evaluate the quality of the dictionaries. In the experiment, accuracy tended to increase when the Intrinsic Rate was high, which suggests that document classification accuracy can be improved by increasing the intrinsic rate of the dictionary.
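
The first two construction strategies named in the abstract above can be illustrated with a minimal sketch. It is not the authors' implementation: the function names, the top-k cutoff by raw frequency, and the toy documents are all assumptions made for illustration. The third method and the Intrinsic Rate are left out because the abstract does not define them.

```python
from collections import Counter


def batch_dictionary(docs, size):
    """Method 1 (sketch): top `size` terms by frequency over all documents."""
    counts = Counter(term for tokens, _category in docs for term in tokens)
    return {term for term, _ in counts.most_common(size)}


def per_category_dictionary(docs, size_per_category):
    """Method 2 (sketch): top terms within each category, then merged."""
    by_category = {}
    for tokens, category in docs:
        by_category.setdefault(category, Counter()).update(tokens)
    merged = set()
    for counts in by_category.values():
        merged.update(t for t, _ in counts.most_common(size_per_category))
    return merged


# Hypothetical tokenized articles paired with a category label.
docs = [
    (["stock", "market", "rate", "market"], "economy"),
    (["market", "bank", "rate"], "economy"),
    (["match", "goal", "league"], "sports"),
    (["goal", "coach", "market"], "sports"),
]

global_terms = batch_dictionary(docs, size=1)
category_terms = per_category_dictionary(docs, size_per_category=1)
```

On this toy data the batch dictionary keeps only the globally dominant term, while the per-category dictionary also keeps the most frequent term of the smaller category. This is one plausible reason a category-aware dictionary could help a classifier.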

News Big Data Analysis of 'Media Literacy' Using Topic Modeling Analysis (미디어 리터러시 뉴스 빅데이터 분석: 토픽 모델링 분석을 중심으로)

  • Han, Songlee;Kim, Taejong
    • The Journal of the Korea Contents Association / v.21 no.4 / pp.26-37 / 2021
  • This study conducted a big data analysis of news to identify the socially discussed agenda of media literacy and to propose relevant policy directions. To this end, 1,336 articles published from January 1, 2019 to September 30, 2020 were collected, and a topic modeling analysis was conducted over four periods. Five topics for each period were derived through the analysis, and implications based on the results are as follows. First, the government should implement a nation-level systematic approach to media literacy education according to life cycle stages to generate economic and cultural value. Second, local communities and schools should provide systematic support and education guidance activities to ensure a sustainable ecosystem for media literacy and prevent an educational gap and loss in learning. Third, efforts should be made in various aspects to minimize the side effects resulting from constantly providing media literacy education; furthermore, a culture of desirable media application should be established. Finally, a research environment for scientific research on media literacy, active exchange of experience and value obtained in the field, and long-term accumulation of research results should be encouraged to develop a robust knowledge exchange culture.

Analysis of news bigdata on 'Gather Town' using the Bigkinds system

  • Choi, Sui
    • Journal of the Korea Society of Computer and Information / v.27 no.3 / pp.53-61 / 2022
  • In recent years, Generation MZ and the Metaverse have drawn great attention, owing to the Fourth Industrial Revolution and the development of a digital environment that blurs the boundary between reality and virtual reality. Generation MZ approaches information very differently from earlier generations and uses distinct communication methods. In learning, they have different motivations, types, and skills, and build relationships differently. Meanwhile, the Metaverse is drawing great attention as a teaching method that fits the traits of Generation MZ. Thus, this research aimed to investigate how to increase the use of the Metaverse in educational technology. Specifically, it examined the antecedents of the popularity of Gather Town, a Metaverse platform. Big data from news articles were collected and analyzed using the Bigkinds system provided by the Korea Press Foundation. The analysis revealed, first, a rapidly increasing trend in media exposure of Gather Town since July 2021. This suggests greater utilization of Gather Town in the field of education after the COVID-19 pandemic. Second, Word Association Analysis and Word Cloud Analysis showed high weights on education-related words such as 'remote', 'university', and 'freshman', while words like 'Metaverse', 'Metaverse platform', 'Covid19', and 'Avatar' were also emphasized. Third, Network Analysis extracted 'COVID19', 'Avatar', 'University student', 'career', and 'YouTube' as keywords. The findings also suggest the potential value of Gather Town as an educational tool during the COVID-19 pandemic. Therefore, this research will contribute to the application and utilization of Gather Town in the field of education.

A Study on the Factors of Well-aging through Big Data Analysis : Focusing on Newspaper Articles (빅데이터 분석을 활용한 웰에이징 요인에 관한 연구 : 신문기사를 중심으로)

  • Lee, Chong Hyung;Kang, Kyung Hee;Kim, Yong Ha;Lim, Hyo Nam;Ku, Jin Hee;Kim, Kwang Hwan
    • Journal of the Korea Academia-Industrial cooperation Society / v.22 no.5 / pp.354-360 / 2021
  • People hope to live a healthy and happy life, achieving satisfaction by striking a good work-life balance. Therefore, there is growing interest in well-aging, which means living happily to a healthy old age without worry. This study identified important factors related to well-aging by analyzing news articles published in Korea. Using Python-based web crawling, 1,199 articles were collected from the news service of the portal site Daum up to November 2020, and 374 articles matching the subject of the study were selected. The frequency analysis identified 'elderly', 'health', 'skin', 'well-aging', 'product', 'person', 'aging', 'female', 'domestic' and 'retirement' as important keywords. In addition, a social network analysis of 45 important keywords revealed the strongest connections, in order, for 'skin-wrinkle', 'skin-aging' and 'old-health'. The CONCOR analysis showed that the 45 main keywords formed eight clusters, including 'life and happiness', 'disease and death', 'nutrition and exercise', 'healing', 'health', and 'elderly services'.
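
The frequency analysis and keyword-pair connections described in the abstract above can be sketched roughly as follows. This is an illustrative assumption rather than the study's pipeline: the function names and the toy token lists are hypothetical, connection strength is taken to be the number of articles in which two keywords co-occur, and the CONCOR clustering step is not reproduced.

```python
from collections import Counter
from itertools import combinations


def keyword_frequencies(docs):
    """Count how often each term occurs across all tokenized articles."""
    return Counter(term for tokens in docs for term in tokens)


def cooccurrence_edges(docs, keywords):
    """Count, for each keyword pair, the articles in which both appear."""
    edges = Counter()
    for tokens in docs:
        # Sort so that each unordered pair always has the same key.
        present = sorted(set(tokens) & keywords)
        edges.update(combinations(present, 2))
    return edges


# Hypothetical tokenized articles (not data from the study).
docs = [
    ["skin", "wrinkle", "aging"],
    ["skin", "aging", "product"],
    ["elderly", "health", "skin", "wrinkle"],
]

freq = keyword_frequencies(docs)
edges = cooccurrence_edges(docs, {"skin", "wrinkle", "aging", "health"})
strongest = edges.most_common(2)
```

The resulting edge counts can be used as edge weights in a keyword network, where heavier edges correspond to pairs such as 'skin-wrinkle' that appear together in many articles.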

An Exploratory Study on the Learning Community: Focusing on the Covid19 Untact Era (배움공동체에 대한 탐색적 연구 : covid19 언택트시대를 중심으로)

  • Jeong, Su-Jeong;Im, Hong-Nam;Park, Hong-Jae
    • Journal of Convergence for Information Technology / v.12 no.5 / pp.237-245 / 2022
  • This study examines the social discourse on the characteristics of the learning community in the untact era, and discusses the directions that learning communities for children could explore and consider in the pandemic situation and beyond. For this purpose, big data for one year, from January 20, 2020 to January 20, 2021, were collected through internet portal sites (including Google News, Daum, Naver and other news surfaces), using two keywords, "untact" and "learning community", and analyzed with word frequency and network analysis methods. The analysis results show that several important terms, such as 'village education community', 'operation', 'activity', 'corona 19', 'support', and 'online', are closely related to the learning community in the untact era. The findings from this study also have implications for developing the learning community as an alternative model to fill the existing gaps in public care and education for children during the prolonged pandemic and afterwards. In conclusion, the study findings highlight that it is meaningful to identify key terms and concepts through word frequency analysis in order to examine social trends and issues related to the learning community.

A Case Study of Infographics for National Defense - Focusing on the Datajournalism of Afghanistan War in Guardian (국방분야에서 인포그래픽 적용사례 연구 - 영(英) 가디언지 아프가니스탄전 데이터저널리즘을 중심으로)

  • Kim, Dong Hwan
    • Spatial Information Research / v.22 no.5 / pp.43-52 / 2014
  • Recently, big data has become a buzzword in the era of the creative economy. Organizations in the spatial information field are focusing on building spatial big data systems. Because spatial big data combines spatial information and big data, data visualization is essential to use it efficiently. One of the most effective methods for data visualization is infographics. In Korea, Chosun.com launched infographics news in 2010. Korean administrative branches also recognized the importance of infographics and adopted them for their briefings from 2013. Internationally, Visual.ly is a leading company in the infographics market and produced notable interactive infographics of the Egyptian parliamentary election results. In the defense field, the Guardian's data journalism on the Afghanistan war logs was a good example of using infographics. From this research, five requirements are extracted. First, source data should be precise and accurate in terms of time and space. Second, infographic images should have compressibility. Third, infographics should be properly processed for military commanders. Fourth, sharing, openness, and communication are essential for high-quality infographics. Lastly, infographics should be an analytic tool for predicting future events based on past data. Infographics are not a direct representation of data but an analytic tool that supports users' choices and decisions at critical moments.

Accelerated Learning of Latent Topic Models by Incremental EM Algorithm (점진적 EM 알고리즘에 의한 잠재토픽모델의 학습 속도 향상)

  • Chang, Jeong-Ho;Lee, Jong-Woo;Eom, Jae-Hong
    • Journal of KIISE:Software and Applications / v.34 no.12 / pp.1045-1055 / 2007
  • Latent topic models are statistical models that automatically capture salient patterns or correlations among features underlying a data collection in a probabilistic way. They are gaining popularity as an effective tool for automatic semantic feature extraction from text corpora, multimedia data analysis including image data, and bioinformatics. One important issue in applying latent topic models to massive data sets is efficient learning of the model. The paper proposes an accelerated learning technique for the PLSA model, one of the popular latent topic models, using an incremental EM algorithm instead of the conventional EM algorithm. The incremental EM algorithm is characterized by a series of partial E-steps performed on corresponding subsets of the entire data collection, unlike the conventional EM algorithm, where one batch E-step is done for the whole data set. By replacing a single batch E-M step with a series of partial E-steps and M-steps, the inference result for the previous data subset is directly reflected in the next inference process, which can speed up learning for the entire data set. The algorithm is also advantageous in that it is guaranteed to converge to a local maximum solution and can be implemented with only slight modification of the existing algorithm based on conventional EM. We present the basic application of the incremental EM algorithm to the learning of PLSA and empirically evaluate the acceleration performance with several possible data partitioning methods for practical application. The experimental results on a real-world news data set show that the proposed approach can achieve a meaningful improvement in the convergence rate when learning latent topic models. Additionally, we present an interesting result that supports a possible synergistic effect of combining the incremental EM algorithm with parallel computing.
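
The partial E-step idea in the abstract above can be sketched in a few lines of NumPy. This is a minimal illustration under stated assumptions, not the paper's implementation: the function names, the contiguous equal-size partition of documents, the random initialization, and the small `EPS` guard are all choices made here. Each block keeps its own expected topic-word counts, so refreshing one block updates the global statistics by swapping that block's old contribution for the new one.

```python
import numpy as np

EPS = 1e-12  # assumed guard against division by zero


def _expected_counts(block, p_z_d_block, p_w_z):
    """Partial E-step: expected counts n(d,w) * P(z|d,w) for one block."""
    joint = p_z_d_block[:, :, None] * p_w_z[None, :, :]   # (B, K, W)
    resp = joint / (joint.sum(axis=1, keepdims=True) + EPS)
    return block[:, None, :] * resp                        # (B, K, W)


def incremental_em_plsa(counts, n_topics, n_blocks=4, n_sweeps=20, seed=0):
    """Fit P(w|z) and P(z|d) to a document-term count matrix."""
    rng = np.random.default_rng(seed)
    n_docs, n_words = counts.shape
    p_w_z = rng.random((n_topics, n_words))
    p_w_z /= p_w_z.sum(axis=1, keepdims=True)
    p_z_d = rng.random((n_docs, n_topics))
    p_z_d /= p_z_d.sum(axis=1, keepdims=True)

    blocks = np.array_split(np.arange(n_docs), n_blocks)
    # Initial pass so every block has statistics before the first update.
    block_stats = np.stack([
        _expected_counts(counts[idx], p_z_d[idx], p_w_z).sum(axis=0)
        for idx in blocks
    ])
    total = block_stats.sum(axis=0)                        # (K, W)

    for _ in range(n_sweeps):
        for b, idx in enumerate(blocks):
            expected = _expected_counts(counts[idx], p_z_d[idx], p_w_z)
            new_stats = expected.sum(axis=0)
            # Swap this block's old contribution for the new one.
            total = np.maximum(total + new_stats - block_stats[b], 0.0)
            block_stats[b] = new_stats
            # M-step right after the partial E-step.
            p_w_z = total / (total.sum(axis=1, keepdims=True) + EPS)
            doc_topic = expected.sum(axis=2)               # (B, K)
            p_z_d[idx] = doc_topic / (
                doc_topic.sum(axis=1, keepdims=True) + EPS)
    return p_w_z, p_z_d


def log_likelihood(counts, p_w_z, p_z_d):
    """Sum over d, w of n(d,w) * log sum_z P(z|d) P(w|z)."""
    return float((counts * np.log(p_z_d @ p_w_z + EPS)).sum())
```

Setting `n_blocks=1` reduces the loop to conventional batch EM, which gives a simple baseline for comparing convergence speed on the same data.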

The Consume Characteristic of Musicals through Korea Performing Arts Box Office Information System(KOPIS) (공연예술통합전산망(KOPIS)을 통한 뮤지컬 소비 특징)

  • Shin, Jong-Chul
    • The Journal of the Korea Contents Association / v.20 no.6 / pp.241-255 / 2020
  • The purpose of this study is to analyze musical performances using performance booking information from 2017 to 2019 obtained from the Korea Performing Arts Box Office Information System (KOPIS), and to make suggestions for Korean musical performances. Musical performances were analyzed based on KOPIS data, relevant studies, internet-based information, news articles, and magazines. In addition, earlier KOPIS data and data from the Broadway League were analyzed. The analysis results are as follows. Firstly, it is necessary to concentrate on Korean mid-sized theatre musical performances. Secondly, producers need to disclose transparently the production costs invested in performances. Thirdly, the Off-Broadway system needs to be introduced after being adapted to the Korean situation. Fourthly, long-run performances are necessary in order to achieve commercial success. Fifthly, it is necessary to make bold attempts at theatres for performances, just as on Broadway.

Visual Analytics using Topic Composition for Predicting Event Flow (토픽의 조합으로 이벤트 흐름을 예측하기 위한 시각적 분석 시스템)

  • Yeon, Hanbyul;Kim, Seokyeon;Jang, Yun
    • KIISE Transactions on Computing Practices / v.21 no.12 / pp.768-773 / 2015
  • Emergence events are the cause of much economic damage. In order to minimize the damage that these events cause, it must be possible to predict what will happen in the future. Accordingly, many researchers have focused on real-time monitoring, detecting events, and investigating events. In addition, there have also been many studies on predictive analysis for forecasting of future trends. However, most studies provide future tendency per event without contextual compositive analysis. In this paper, we present a predictive visual analytics system using topic composition to provide future trends per event. We first extract abnormal topics from social media data to find interesting and unexpected events. We then search for similar emergence patterns in the past. Relevant topics in the past are provided by news media data. Finally, the user combines the relevant topics and a new context is created for contextual prediction. In a case study, we demonstrate our visual analytics system with two different cases and validate our system with possible predictive story lines.