• Title/Summary/Keyword: 텍스트 수집

Search Result 704, Processing Time 0.03 seconds

An Exploratory Study of Happiness and Unhappiness Among Koreans based on Text Mining Techniques (텍스트마이닝 기법을 활용한 한국인의 행복과 불행 탐색연구)

  • Park, Sanghyeon;Do, Kanghyuk;Kim, Hakyeong;Park, Gaeun;Yun, Jinhyeok;Kim, Kyungil
    • The Journal of the Korea Contents Association
    • /
    • v.18 no.7
    • /
    • pp.10-27
    • /
    • 2018
  • The purpose of this study is to explore the meaning of happiness and unhappiness in Korean society through text mining analysis. Similar words with keywords(happiness/unhappiness) from online news portal are extracted using Word2Vec and TF-IDF method. We also use the K-LIWC dictionary to perform the sentiment analysis of words associated with happiness and unhappiness. In TF-IDF analysis, happiness and unhappiness are highly related to social factors and social issues of the year. In Word2Vec analysis, 'Hope' has been similar with happiness for six years. In K-LIWC analysis, 'money/financial issues', 'school', 'communication' is highly related with happiness and unhappiness. In addition, 'physical condition and symptom' is highly related to unhappiness. Implications, limitations, and suggestions for future research are also discussed.

Research Trend Analysis in Fashion Design Studies in Korea using Topic Modeling (토픽모델링을 이용한 국내 패션디자인 연구동향 분석)

  • Jang, Namkyung;Kim, Min-Jeong
    • Journal of Digital Convergence
    • /
    • v.15 no.6
    • /
    • pp.415-423
    • /
    • 2017
  • This study explored research trends by investigating articles published in the Journal of Korean Society of Fashion Design from 2001 through 2015. English key words and abstracts were analyzed using text mining and topic modeling techniques. The findings are as followings. By the text mining technique, 183 core terms, appeared more than 30 times, were derived from 7137 words used in total 338 articles' key words and abstracts. 'Fashion' and 'design' showed the highest frequency rate. After that, the well-received topic modeling technique, LDA, was applied to the collected data sets. Several distinct sub-research domains strongly tied with the previous fashion design field, except for topics such as fashion brand marketing and digital technology, were extracted. It was observed that there are the growing and declining trends in the research topics. Based on findings, implication, limitation, and future research questions were presented.

A Study on the Promising Future Biotechnology (바이오 미래유망 연구분야 도출에 관한 연구)

  • Kam, Ju-Sik;Kim, Moo-Woong;Par, Sang-Dai;Hyun, Byung-Hwan
    • Journal of Korea Technology Innovation Society
    • /
    • v.15 no.2
    • /
    • pp.345-368
    • /
    • 2012
  • As science and technology are the core engines of economic and social affairs, it is becoming increasingly necessary to explore new promising technologies in order to secure competitiveness in science and technology with a view to helping upgrade the country's overall competitiveness and promoting industrial development. The governments of major advanced countries provide R&D support for promising future technologies. Even in South Korea, a study is being carried out to set up a model for forecasting future technologies and reinforcing the relevant survey system. This study intends to explore methods of identifying promising future technologies in the bio-science sector, which has emerged as a new growth engine. It will use a text-mining technique to collect and analyze theses in the bio science sector. It will identify key research sectors by analyzing thesis contour lines, and then review promising future key research subjects through in-depth study.

  • PDF

An Analysis of Keywords on 'School Space Innovation' Policies using Text Mining - Focused on News Articles - (텍스트 마이닝을 활용한 '학교 공간 혁신' 정책 키워드 분석 - 뉴스 기사를 중심으로 -)

  • Lee, Dongkuk
    • The Journal of Sustainable Design and Educational Environment Research
    • /
    • v.19 no.2
    • /
    • pp.11-20
    • /
    • 2020
  • The goal of this study was to investigate the implementation and related issues of the school space innovation issued by key Korean mass media using text mining. To accomplish this goal, this study collected 519 news articles associated with the school space innovation issued by 54 Korean mass media companies. Based on this data, this study performed the frequency analysis and network analysis regarding the keywords. Based on the findings, the characteristics of school space innovation are summarized as follows: First, school space innovation has progressed in response to future education. Second, users are actively participating in school space innovation. Third, experts are supporting the innovation of school space by establishing a cooperative system. Fourth, the community is actively considering the innovation of school space. Fifth, the main projects of the Ministry of Education and the Provincial Offices of Education are actively conducted in a mix of top-down and bottom-up approaches. The findings of this study will contribute to providing a clear direction for contemporary school space innovation and implications for future research agenda and implementation.

The Study on the patient safety culture convergence research topics through text mining and CONCOR analysis (텍스트마이닝 및 CONCOR 분석을 활용한 환자안전문화 융복합 연구주제 분석)

  • Baek, Su Mi;Moon, Inn Oh
    • Journal of Digital Convergence
    • /
    • v.19 no.12
    • /
    • pp.359-367
    • /
    • 2021
  • The purpose of this study is to analyze domestic patient safety culture research topics using text mining and CONCOR analysis. The research method was conducted in the stages of data collection, data preprocessing, text mining and social network analysis, and CONCOR analysis. A total of 136 articles were analyzed excluding papers that were not published. Data analysis was performed using Textom and UCINET programs. As a result of this study, TF (frequency) of patient safety culture-related studies showed that patient safety was the highest, and TF-IDF (importance in documents) was highest in nursing. As a result of the CONCOR analysis, a total of seven clusters were derived: knowledge and attitude, communication, medical service, team, work environment, structure, organization and management that constitute the patient safety culture. In the future, it is necessary to conduct research on the relationship between the establishment of a patient safety culture and patient outcomes.

Text Big Data Analysis and Summary for Free Semester Operational Plan Document (자유학기제 운영계획서에 대한 텍스트 빅데이터 분석 및 요약)

  • Lee, Suan;Park, Beomjun;Kim, Minkyu;Shin, Hye Sook;Kim, Jinho
    • The Journal of Korean Association of Computer Education
    • /
    • v.22 no.3
    • /
    • pp.135-146
    • /
    • 2019
  • Big data analysis is actively used for collecting and analyzing direct information on related topics in each field of society. Applying big data analysis technology in education field is increasingly interested in Korea, because applying this technology helps to identify the effectiveness of education methods and policies and applying them for policy formulation. In this paper, we propose our approach of utilizing big data analysis technology in education field. We focus on free semester program, one of the current core education policies, and we analyze the main points of interests and differences in the free semester through analysis and visualization of texts that are written on the operation reports prepared by each school. We compare regional differences in key characteristics and interests based on the free semester operation reports from middle schools particularly at Seoul and Gangwon-do regions. In conclusion, applying and utilizing big data analysis technology according to the needs and requirements of education field is a great significance.

Sentiment Analysis of Foot-and-Mouth Disease Using Tweet Text-Mining Technique (트윗 텍스트 마이닝 기법을 이용한 구제역의 감성분석)

  • Chae, Heechan;Lee, Jonguk;Choi, Yoona;Park, Daihee;Chung, Yongwha
    • KIPS Transactions on Software and Data Engineering
    • /
    • v.7 no.11
    • /
    • pp.419-426
    • /
    • 2018
  • Due to the FMD(foot-and-mouth disease), the domestic animal husbandry and related industries suffer enormous damage every year. Although various academic researches related to FMD are ongoing, engineering studies on the social effects of FMD are very limited. In this study, we propose a systematic methodology to analyze emotional responses of regular citizens on FMD using text mining techniques. The proposed system first collects data related to FMD from the tweets posted on Twitter, and then performs a polarity classification process using a deep-learning technique. Second, keywords are extracted from the tweet using LDA, which is one of the typical techniques of topic modeling, and a keyword network is constructed from the extracted keywords. Finally, we analyze the various social effects of regular citizens on FMD through keyword network. As a case study, we performed the emotional analysis experiment of regular citizens about FMD from July 2010 to December 2011 in Korea.

A study on Customized Foreign Language Learning Contents Construction (사용자 맞춤형 외국어학습 콘텐츠 구성을 위한 연구)

  • Kim, Gui-Jung;Yi, Jae-Il
    • Journal of Digital Convergence
    • /
    • v.17 no.1
    • /
    • pp.189-194
    • /
    • 2019
  • This paper is a study on the methodology of making customized contents according to user 's tendency through the development of learning contents utilizing IT. A variety of learners around the world use mobile devices and mobile learning contents to conduct their learning activities in various fields, and foreign language learning is one of the typical mobile learning areas. Foreign language learning contents suggested in this study is constructed based on the learner's verbal and text information in accordance with the user's vocal tendency. It is necessary to find out a suitable method to translate the user's native language text into the target language and make it into user friendly content.

A Named Entity Recognition Model in Criminal Investigation Domain using Pretrained Language Model (사전학습 언어모델을 활용한 범죄수사 도메인 개체명 인식)

  • Kim, Hee-Dou;Lim, Heuiseok
    • Journal of the Korea Convergence Society
    • /
    • v.13 no.2
    • /
    • pp.13-20
    • /
    • 2022
  • This study is to develop a named entity recognition model specialized in criminal investigation domains using deep learning techniques. Through this study, we propose a system that can contribute to analysis of crime for prevention and investigation using data analysis techniques in the future by automatically extracting and categorizing crime-related information from text-based data such as criminal judgments and investigation documents. For this study, the criminal investigation domain text was collected and the required entity name was newly defined from the perspective of criminal analysis. In addition, the proposed model applying KoELECTRA, a pre-trained language model that has recently shown high performance in natural language processing, shows performance of micro average(referred to as micro avg) F1-score 98% and macro average(referred to as macro avg) F1-score 95% in 9 main categories of crime domain NER experiment data, and micro avg F1-score 98% and macro avg F1-score 62% in 56 sub categories. The proposed model is analyzed from the perspective of future improvement and utilization.

Perception Survey about SMEs Employment of University Students in Chungbuk Area: Based on Text-mining (충북지역 대학생의 중소기업 취업에 대한 인식조사: 텍스트마이닝을 기반으로)

  • Choi, Dabin;Choi, Wooseok;Choi, Sanghyun;Lee, Junghwan
    • Korean small business review
    • /
    • v.42 no.4
    • /
    • pp.235-250
    • /
    • 2020
  • This study surveyed the perception of university students about employment in Small and Medium-sized Enterprises(SME) in the Chungbuk area to prepare improvement measures. In particular, the data were collected in descriptive questions along with the existing survey methods, and the perception of SME and decent work was identified using text-mining. As a result of the analysis, there are positive perceptions of jobs at SME such as various work experiences and low job competition rates, while there are generally many negative perceptions in pay, work and welfare. However, as a result of co-occurrence network analysis of responses to decent jobs, 'Information' was derived as a keyword. Currently, college students' negative perception of SME is affected by the lack of sufficient information, which needs to be improved first. To solve this problem, it was proposed to establish and operate a platform that can provide information on employment of SME and select necessary personnel.