• Title/Summary/Keyword: 뉴스기사

Search Result 505, Processing Time 0.021 seconds

Self-Supervised Document Representation Method

  • Yun, Yeoil;Kim, Namgyu
    • Journal of the Korea Society of Computer and Information
    • /
    • v.25 no.5
    • /
    • pp.187-197
    • /
    • 2020
  • Recently, various methods of text embedding using deep learning algorithms have been proposed. Especially, the way of using pre-trained language model which uses tremendous amount of text data in training is mainly applied for embedding new text data. However, traditional pre-trained language model has some limitations that it is hard to understand unique context of new text data when the text has too many tokens. In this paper, we propose self-supervised learning-based fine tuning method for pre-trained language model to infer vectors of long-text. Also, we applied our method to news articles and classified them into categories and compared classification accuracy with traditional models. As a result, it was confirmed that the vector generated by the proposed model more accurately expresses the inherent characteristics of the document than the vectors generated by the traditional models.

Text Mining Driven Content Analysis of Social Perception on Schizophrenia Before and After the Revision of the Terminology (조현병과 정신분열병에 대한 뉴스 프레임 분석을 통해 본 사회적 인식의 변화)

  • Kim, Hyunji;Park, Seojeong;Song, Chaemin;Song, Min
    • Journal of the Korean Society for Library and Information Science
    • /
    • v.53 no.4
    • /
    • pp.285-307
    • /
    • 2019
  • In 2011, the Korean Medical Association revised the name of schizophrenia to remove the social stigma for the sick. Although it has been about nine years since the revision of the terminology, no studies have quantitatively analyzed how much social awareness has changed. Thus, this study investigates the changes in social awareness of schizophrenia caused by the revision of the disease name by analyzing Naver news articles related to the disease. For text analysis, LDA topic modeling, TF-IDF, word co-occurrence, and sentiment analysis techniques were used. The results showed that social awareness of the disease was more negative after the revision of the terminology. In addition, social awareness of the former term among two terms used after the revision was more negative. In other words, the revision of the disease did not resolve the stigma.

Issue Analysis on Gas Safety Based on a Distributed Web Crawler Using Amazon Web Services (AWS를 활용한 분산 웹 크롤러 기반 가스 안전 이슈 분석)

  • Kim, Yong-Young;Kim, Yong-Ki;Kim, Dae-Sik;Kim, Mi-Hye
    • Journal of Digital Convergence
    • /
    • v.16 no.12
    • /
    • pp.317-325
    • /
    • 2018
  • With the aim of creating new economic values and strengthening national competitiveness, governments and major private companies around the world are continuing their interest in big data and making bold investments. In order to collect objective data, such as news, securing data integrity and quality should be a prerequisite. For researchers or practitioners who wish to make decisions or trend analyses based on objective and massive data, such as portal news, the problem of using the existing Crawler method is that data collection itself is blocked. In this study, we implemented a method of collecting web data by addressing existing crawler-style problems using the cloud service platform provided by Amazon Web Services (AWS). In addition, we collected 'gas safety' articles and analyzed issues related to gas safety. In order to ensure gas safety, the research confirmed that strategies for gas safety should be established and systematically operated based on five categories: accident/occurrence, prevention, maintenance/management, government/policy and target.

The Study for Social Repositioning of Multi-Cultural Family in Jecheon City : From the perspective of Social Construction (제천시 다문화가정의 사회적 리포지셔닝 연구 : 사회적 구성주의의 관점에서)

  • Kim, Su-Wan;Park, Sang-Hyuk
    • The Journal of the Convergence on Culture Technology
    • /
    • v.5 no.1
    • /
    • pp.45-50
    • /
    • 2019
  • This research analyzed that what factors affect to the change of social positioning of 'multi-cultural family(MCF)' centered on 'multi-cultural family in Jecheon City using Social Construction. The purpose of this research analyze the social positioning of MCF in Jecheon City, policy design depending on that social positioning and the effect of social perception. Therefore, this research carried out qualitative analysis method that analyzed news articles, legislations and interviews from 1990 to 2013 based on social construction theory. For the purpose, first, the time scope could be divided into four periods such as 'the quickening period in 1990s', 'quantitative growth period from 2000 to 2005', 'qualitative growth period from 2006 to 2011', 'the period of antagonism after 2012' of MCF.

Employee's Discontent Text Analysis on Anonymous Company Review Web and Suggestions for Discontent Resolve (기업 리뷰 웹 사이트 텍스트 분석을 통한 직원 불만 표현 추출과 불만 원인 도출 및 해소 방안)

  • Baek, HyeYeon;Park, Yongsuk
    • Journal of the Korea Institute of Information and Communication Engineering
    • /
    • v.23 no.4
    • /
    • pp.357-364
    • /
    • 2019
  • As industrial information disclosure by insider's rate is around 80%, most of relevant researches explain briefly its causes are discontent of salary or human resources system. This paper scrapes texts on Jobplanet, an anonymous company review website and analyzes discontent keyword by 7 related area and their contexts to find out more details on brief causes referred above. After drawing LGG (Local Grammar Graph) by each areas with related dictionary list, this paper shows an example of concordance as a proof and several ways for human resources leakage prevention. Finally, text analysis results are compared with previous researches based on survey with limited questions and answers. This study is meaningful to expand the scope of employee discontent analysis with company review text and provide more specific, granular and honest discontent vocabularies.

The Effect of Youth (18-19 years old) Voters' Use of Political Information and Political Efficacy on Voting Intentions (청소년(만 18-19세) 유권자의 정치 정보 이용행태와 정치효능감이 투표 의도에 미치는 영향)

  • Lee, Sung-Jin;Kim, Peter Eung-Pyo
    • The Journal of the Korea Contents Association
    • /
    • v.21 no.10
    • /
    • pp.344-355
    • /
    • 2021
  • As the voting age has been lowered to 18 from the 21st general election on April 15, 2020, the use of political news, motivation for political interest, and reliability of political media of first-time voters were examined. Accordingly, we analyzed how their political efficacy affects their voting intentions. As a result of the study through the survey of participants, the use of political news by voters was to acquire political information mainly through TV and portals. And the use of traditional media such as radio and newspaper was low. first voters became interested in politics through articles delivered by the media, and it was found that they trusted the political information provided through terrestrial TV reports and debates. This generation also confirmed that if they have higher political efficacy, they show higher willingness to participate in voting. Through this study, in order to increase the political participation of the younger generation who are evaluated as having relatively low interest in politics, the role of the media was reconfirmed as the most important factor.

Factor augmentation for cryptocurrency return forecasting (암호화폐 수익률 예측력 향상을 위한 요인 강화)

  • Yeom, Yebin;Han, Yoojin;Lee, Jaehyun;Park, Seryeong;Lee, Jungwoo;Baek, Changryong
    • The Korean Journal of Applied Statistics
    • /
    • v.35 no.2
    • /
    • pp.189-201
    • /
    • 2022
  • In this study, we propose factor augmentation to improve forecasting power of cryptocurrency return. We consider financial and economic variables as well as psychological aspect for possible factors. To be more specific, financial and economic factors are obtained by applying principal factor analysis. Psychological factor is summarized by news sentiment analysis. We also visualize such factors through impulse response analysis. In the modeling perspective, we consider ARIMAX as the classical model, and random forest and deep learning to accommodate nonlinear features. As a result, we show that factor augmentation reduces prediction error and the GRU performed the best amongst all models considered.

Topic Modeling of News Article Related to Franchise Regulation Using LDA (LDA 를 이용한 '프랜차이즈 규제' 관련 뉴스기사 토픽모델링)

  • YANG, Woo-Ryeong;YANG, Hoe Chang
    • The Korean Journal of Franchise Management
    • /
    • v.13 no.4
    • /
    • pp.1-12
    • /
    • 2022
  • Purpose: In 2020, the franchise industry accomplished a significant growth compared to the previous year, as the number of franchise companies increased by 9.0% while the number of franchise brands increased by 12.5%. Despite growth in size, the Korean franchise industry underwent many negative incidents, such as franchise ownership sales to private equity funds, that led to deterioration of businesses. From this point of view, this study aims to make various proposals to help policy makers develop franchise industry policies by analyzing trends of the current and previous presidential administrations' franchise policies and regulations using newspaper articles. Research design, data and methodology: A total of 7,439 articles registered in Naver API from February 25, 2013 to November 29, 2021 were extracted. Among them, 34 unrelated video articles were deleted, and a total of 7,405 articles from both administrations were used for analysis. The R package was used for word frequency analysis, word clouding, word correlation analysis, and LDA (Latent Dirichlet Allocation) topic modeling. Results: The keyword frequency analysis shows that the most frequently mentioned keywords during the previous administration include 'no-brand', 'major company', 'bill', 'business field', and 'SMEs', and those mentioned during the current administration include 'industry' and 'policy'. As a result of LDA topic modeling, 9 topics such as 'global startups' and 'job creation' from the previous administration, and 10 topics such as 'franchise business' and 'distribution industry' from the current administration were derived. The results of LDAvis showed that the previous administration operated a policy based on mutual growth of large and small businesses rather than hostile regulations in the franchise business, whereas the current administration extended the regulation related to franchise business to the employment sector. Conclusions: The analysis of past two administrations' franchise policy, it can be suggested that franchisors and franchisees may complement each other in developing the Fair Transactions in Franchise Business Act and achieving balanced growth. Moreover, political support is needed for sound development of franchisors. Limitations and future research suggestions are presented at the end of this study.

Development of Disaster Situation Specific Tailored Weather Emergency Information Alert System (재난 상황별 맞춤형 기상긴급정보 전달 시스템 개발)

  • Yong-Yook Kim;Ki-Bong Kwon;Byung-Yun Lee
    • Journal of the Society of Disaster Information
    • /
    • v.19 no.1
    • /
    • pp.69-75
    • /
    • 2023
  • Purpose: The risk of disaster from extreme weather events is increasing due to the increase in occurrence and the strength of heavy rains and storms from continued climate change. To reduce these risks, emergency weather information customized for the characteristics of the information users and related circumstances should be provided. Method: A first-stage emergency weather information delivery system has been developed to provide weather information to the disaster-risk area residents and the disaster response personnel. Novel methods to apply artificial intelligence to identify emergencies have been studied. The relationship between special weather reports from meteorological administration and disaster-related news articles has been analyzed to identify the significance of a pilot study using text analytic artificial intelligence. Result: The basis to identify the significance of the relations between disaster-related articles and special weather reports has been established and the possibility of the development of a real-world applicable system based on a broader analysis of data has been suggested. Conclusion: Through direct alert delivery of weather emergency alerts, a weather emergency alert system is expected to reduce the risk of damage from extreme weather situations.

Semantic Pre-training Methodology for Improving Text Summarization Quality (텍스트 요약 품질 향상을 위한 의미적 사전학습 방법론)

  • Mingyu Jeon;Namgyu Kim
    • Smart Media Journal
    • /
    • v.12 no.5
    • /
    • pp.17-27
    • /
    • 2023
  • Recently, automatic text summarization, which automatically summarizes only meaningful information for users, is being studied steadily. Especially, research on text summarization using Transformer, an artificial neural network model, has been mainly conducted. Among various studies, the GSG method, which trains a model through sentence-by-sentence masking, has received the most attention. However, the traditional GSG has limitations in selecting a sentence to be masked based on the degree of overlap of tokens, not the meaning of a sentence. Therefore, in this study, in order to improve the quality of text summarization, we propose SbGSG (Semantic-based GSG) methodology that selects sentences to be masked by GSG considering the meaning of sentences. As a result of conducting an experiment using 370,000 news articles and 21,600 summaries and reports, it was confirmed that the proposed methodology, SbGSG, showed superior performance compared to the traditional GSG in terms of ROUGE and BERT Score.