• Title/Summary/Keyword: Word Cloud Analysis

Search Result 143, Processing Time 0.031 seconds

A Preliminary Study on Change Management Factors through Analysing Development Phase of Construction IT System (건설 IT 시스템 발전단계분석을 통한 변화관리 요인 기초 연구)

  • Kim, Haneol;Lee, Dongheon;Lim, Hyoungchul
    • Proceedings of the Korean Institute of Building Construction Conference
    • /
    • 2022.04a
    • /
    • pp.214-215
    • /
    • 2022
  • This study analyzed the development stage and change management necessity of the construction IT system through existing research and literature review, and used WordCloud, one of the text mining techniques, to analyze current construction trends and major issues. The necessity of change management is derived by using existing research literature and construction-related social issues as analysis data.

  • PDF

Analysis of Domestic Research on Depression and Stress : Focused on the Treatment and Subjects (우울과 스트레스에 관한 국내 연구 분석 : 치료와 대상자를 중심으로)

  • Jo, Nam-Hee;Na, Eun-Young
    • Journal of Convergence for Information Technology
    • /
    • v.7 no.6
    • /
    • pp.53-59
    • /
    • 2017
  • This study was attempted to identify the domestic research related to depression and stress. The subjects of the analysis were 1,875 college degree theses thrown in the National Assembly Library searched by the depression and stress keyword as of November 30, 2016. The analysis method visualizes atypical data with Word Cloud, which is one of the text mining techniques. We also used the R'LDA package and LDA to classify treatment and subjects. As a result of the analysis, 233(12.4%) of the total papers with therapeutic keywords were found. Application of treatment methods was art therapy, music therapy, horticultural therapy, cognitive behavior therapy, clinical art therapy, cognitive therapy, psychological therapy, depression treatment, group therapy, laughter treatment sequence. The study subjects were adolescents, elderly, patient, mother, child, female, parents, and college students in order. The results of LDA topic analysis for adolescents were classified into four topics: self-support, treatment program, relationship effect, and variable study.

A Study on the Use of Stopword Corpus for Cleansing Unstructured Text Data (비정형 텍스트 데이터 정제를 위한 불용어 코퍼스의 활용에 관한 연구)

  • Lee, Won-Jo
    • The Journal of the Convergence on Culture Technology
    • /
    • v.8 no.6
    • /
    • pp.891-897
    • /
    • 2022
  • In big data analysis, raw text data mostly exists in various unstructured data forms, so it becomes a structured data form that can be analyzed only after undergoing heuristic pre-processing and computer post-processing cleansing. Therefore, in this study, unnecessary elements are purified through pre-processing of the collected raw data in order to apply the wordcloud of R program, which is one of the text data analysis techniques, and stopwords are removed in the post-processing process. Then, a case study of wordcloud analysis was conducted, which calculates the frequency of occurrence of words and expresses words with high frequency as key issues. In this study, to improve the problems of the "nested stopword source code" method, which is the existing stopword processing method, using the word cloud technique of R, we propose the use of "general stopword corpus" and "user-defined stopword corpus" and conduct case analysis. The advantages and disadvantages of the proposed "unstructured data cleansing process model" are comparatively verified and presented, and the practical application of word cloud visualization analysis using the "proposed external corpus cleansing technique" is presented.

Investigations on Techniques and Applications of Text Analytics (텍스트 분석 기술 및 활용 동향)

  • Kim, Namgyu;Lee, Donghoon;Choi, Hochang;Wong, William Xiu Shun
    • The Journal of Korean Institute of Communications and Information Sciences
    • /
    • v.42 no.2
    • /
    • pp.471-492
    • /
    • 2017
  • The demand and interest in big data analytics are increasing rapidly. The concepts around big data include not only existing structured data, but also various kinds of unstructured data such as text, images, videos, and logs. Among the various types of unstructured data, text data have gained particular attention because it is the most representative method to describe and deliver information. Text analysis is generally performed in the following order: document collection, parsing and filtering, structuring, frequency analysis, and similarity analysis. The results of the analysis can be displayed through word cloud, word network, topic modeling, document classification, and semantic analysis. Notably, there is an increasing demand to identify trending topics from the rapidly increasing text data generated through various social media. Thus, research on and applications of topic modeling have been actively carried out in various fields since topic modeling is able to extract the core topics from a huge amount of unstructured text documents and provide the document groups for each different topic. In this paper, we review the major techniques and research trends of text analysis. Further, we also introduce some cases of applications that solve the problems in various fields by using topic modeling.

Analysis of News Regarding New Southeastern Airport Using Text Mining Techniques (텍스트 마이닝 기법을 활용한 동남권 신공항 신문기사 분석)

  • Han, Mu Moung Cho;Kim, Yang Sok;Lee, Choong Kwon
    • Smart Media Journal
    • /
    • v.6 no.1
    • /
    • pp.47-53
    • /
    • 2017
  • Social issues are important factors that decide government policy and newspapers are critical channels that reflect them. Analysing news articles can contribute to understanding social issues, but it is very difficult to analyse the unstructured large volumes of news data manually. Therefore, this study aims to analyze the different views among stakeholders of a specific social issue by using text analysis, word cloud analysis and associative analysis methods, which systematically transform unstructured news data into structured one. We analyzed a total of 115 news articles and a total of 6,772 comments, collected from the selected newspapers (Chosun-Il-bo, Joongang-Il-bo, Donga-Il-bo, Maeil Newspaper, Busan-Il-bo) for two weeks. We found that there are significant differences in tone between newspapers. While nation-wide daily newspapers focus on political relations with local areas, local daily newspapers tend to write articles to represent local governments' interests.

Analysis of Social Media Utilization based on Big Data-Focusing on the Chinese Government Weibo

  • Li, Xiang;Guo, Xiaoqin;Kim, Soo Kyun;Lee, Hyukku
    • KSII Transactions on Internet and Information Systems (TIIS)
    • /
    • v.16 no.8
    • /
    • pp.2571-2586
    • /
    • 2022
  • The rapid popularity of government social media has generated huge amounts of text data, and the analysis of these data has gradually become the focus of digital government research. This study uses Python language to analyze the big data of the Chinese provincial government Weibo. First, this study uses a web crawler approach to collect and statistically describe over 360,000 data from 31 provincial government microblogs in China, covering the period from January 2018 to April 2022. Second, a word separation engine is constructed and these text data are analyzed using word cloud word frequencies as well as semantic relationships. Finally, the text data were analyzed for sentiment using natural language processing methods, and the text topics were studied using LDA algorithm. The results of this study show that, first, the number and scale of posts on the Chinese government Weibo have grown rapidly. Second, government Weibo has certain social attributes, and the epidemics, people's livelihood, and services have become the focus of government Weibo. Third, the contents of government Weibo account for more than 30% of negative sentiments. The classified topics show that the epidemics and epidemic prevention and control overshadowed the other topics, which inhibits the diversification of government Weibo.

Analysis on the English Translation of The First Chosen Educational Ordinance, Manual of Education of Koreans (1913), and Manual of Education in Chosen 1920 (1920) Using Text Mining Analytics (텍스트 마이닝(Text mining) 기법을 활용한 『제1차조선교육령』과 『조선교육요람』(1913, 1920)의영어번역본 분석)

  • Jinyoung Tak;Eunjoo Kwak;Silo Chin;Minjoo Shon;Dongmie Kim
    • The Journal of the Convergence on Culture Technology
    • /
    • v.9 no.6
    • /
    • pp.309-317
    • /
    • 2023
  • The purpose of this paper is to investigate how Japan tried to dominate Chosen through educational policies by analyzing three official English texts published by the Japanese Government-General of Korea: the First Chosen Educational Ordinance declared in 1911, the Manual of Education of Koreans(1913), and the Manual of Education in Chosen 1920(1920). In order to pursue this purpose, the present study carried a corpus-based diachronic analysis, rather then a qualitative analysis. Facilitating text analytics such as Word Cloud and CONCOR, this paper derived the following results: First, the first Chosen Educational Ordinance(1911) includes overall educational regulations, curriculum, and operations of schools. Second, the Manual of Education of Koreans(1913) contains the educational medium and contents on how to educate. Finally, it can be proposed that the Manual of Education in Chosen 1920(1920) contains specific implementation of education and the subject of education.

Parents' Perceptions of Cognitive Rehabilitation for Children With Developmental Disabilities: A Mixed-Method Approach of Phenomenological Methodology and Word Cloud Analysis (발달장애 아동 부모의 인지재활 경험에 대한 질적 연구: 워드 클라우드 분석과 현상학적 연구 방법 혼합설계)

  • Ju, Yu-Mi;Kim, Young-Geun;Lee, Hee-Ryoung;Hong, Seung-Pyo;Han, Dae-Sung
    • Therapeutic Science for Rehabilitation
    • /
    • v.13 no.1
    • /
    • pp.49-63
    • /
    • 2024
  • Objective : The purpose of this study was to investigate parental perspectives on cognitive rehabilitation using a combination of phenomenological research methodology and word cloud analysis. Methods : Interviews were conducted with five parents of children with developmental disabilities. Word cloud analysis was conducted using Python, and five researchers analyzed the meaning units and themes using phenomenological methods. Words with high frequency were considered as a heuristic tool. Results : A total of 43 meaning units and nine components related to the phenomenon of cognitive rehabilitation were derived, and three themes were finalized. The main themes encompassed the definition of cognitive rehabilitation, challenges associated with cognitive rehabilitation, and factors influencing the selection of a cognitive rehabilitation institute. Cognitive rehabilitation emerged as a treatment focused on improving learning, daily functioning, and cognitive abilities in children with developmental disabilities. The perceived issues with cognitive rehabilitation pertained to treatment methods, therapist expertise, and associated costs. In addition, parents highlighted the importance of therapist expertise, humane personality, and affordability of cost and schedule when choosing a cognitive rehabilitation institute. Conclusion : Parents expressed expectations for substantial improvements in their children's daily functioning through cognitive rehabilitation. However, challenges were identified in clinical practices. Going forward, we expect that cognitive rehabilitation will evolve into a better therapeutic support service addressing the concerns raised by parents.

Trend Analysis of Convergence Research based on Social Big Data (소셜 빅데이터 기반 융합연구 동향 분석)

  • Noh, Younghee;Kim, Taeyoun;Jeong, Dae-Keun;Lee, Kwang Hee
    • The Journal of the Korea Contents Association
    • /
    • v.19 no.2
    • /
    • pp.135-146
    • /
    • 2019
  • This study was designed to analyze trends in the entire convergence research beyond academic research through social media big data analysis at a time when interdisciplinary convergence research is emphasized along with the fourth industrial revolution. For this purpose, about 150,000 cases of texts and titles were acquired for about 10 years from January 2009 to September 2018 in connection with the convergence research in social media, and word cloud and network analysis were conducted. As a results, the research fields that were actively conducted for each period were eco-tech in 2009 and 2010, smart technology in 2011 and 2012, information and communication in 2013 and 2014, robots in 2015 and 2016, and artificial intelligence in 2017 and 2018. Also, the research areas that have been consistently conducted for about 10 years are culture, design, chemistry, nanotechnology, biotechnology, robot, IT, and information and communication. Since this study identifies trends in convergence research over time, it can be helpful to researchers who are planning convergence research direction by understanding the trends of convergence research.

Perceived Characteristics of Grains during the Choseon Dynasty - A Study Applying Text Frequency Analysis Using the Choseonwangjoshilrok Data - (조선왕조실록 텍스트 빈도 분석을 통한 조선시대 곡물에 관한 인식 특성 고찰)

  • Mi-Hye, Kim
    • Journal of the Korean Society of Food Culture
    • /
    • v.38 no.1
    • /
    • pp.26-37
    • /
    • 2023
  • This study applied the text frequency method to analyze the crops prevalent during the Chosunwangjoshilrok dynasty, and categorized the results by each king. Contemporary perception of grains was observed by examining the staple crop types. Staple species were examined using the word cloud and semantic network analysis. Totally, 101,842 types of crop consumption were recorded during the Chosunwangjoshilrok period. Of these, 51,337 (50.4%) were grains, 50,407 (49.5%) were beans, and 98 (0.1%) were seeds. Rice was the most frequently consumed grain (37.1%), followed by pii (11.9%), millet (11.3%), barley (4.5%), proso (0.8%), wheat (0.6%), buckwheat (0.1%), and adlay (0.05%). Grain chronological frequency in the Choseon dynasty was determined to be 15,520 cases in the 15th century (30.2%), 11,201 cases in the 18th century (21.8%), 9,421 cases in the 17th century (18.4%), 9,113 cases in the 16th century (17.8%), and 6,082 cases in the 19th century (11.8%). Interest in grain amongst the 27 kings of Choseon was evaluated based on the frequency of records. The 15th century King Sejong recorded the maximum interest with 13,363 cases (13.1%), followed by King Jungjo (8,501 cases in the 18th century; 8.4%), King Sungjong (7,776 cases in the 15th century; 7.6%).