• Title/Summary/Keyword: news paper articles

Search Result 149, Processing Time 0.028 seconds

Automated Story Generation with Image Captions and Recursiva Calls (이미지 캡션 및 재귀호출을 통한 스토리 생성 방법)

  • Isle Jeon;Dongha Jo;Mikyeong Moon
    • Journal of the Institute of Convergence Signal Processing
    • /
    • v.24 no.1
    • /
    • pp.42-50
    • /
    • 2023
  • The development of technology has achieved digital innovation throughout the media industry, including production techniques and editing technologies, and has brought diversity in the form of consumer viewing through the OTT service and streaming era. The convergence of big data and deep learning networks automatically generated text in format such as news articles, novels, and scripts, but there were insufficient studies that reflected the author's intention and generated story with contextually smooth. In this paper, we describe the flow of pictures in the storyboard with image caption generation techniques, and the automatic generation of story-tailored scenarios through language models. Image caption using CNN and Attention Mechanism, we generate sentences describing pictures on the storyboard, and input the generated sentences into the artificial intelligence natural language processing model KoGPT-2 in order to automatically generate scenarios that meet the planning intention. Through this paper, the author's intention and story customized scenarios are created in large quantities to alleviate the pain of content creation, and artificial intelligence participates in the overall process of digital content production to activate media intelligence.

Financial Fraud Detection using Text Mining Analysis against Municipal Cybercriminality (지자체 사이버 공간 안전을 위한 금융사기 탐지 텍스트 마이닝 방법)

  • Choi, Sukjae;Lee, Jungwon;Kwon, Ohbyung
    • Journal of Intelligence and Information Systems
    • /
    • v.23 no.3
    • /
    • pp.119-138
    • /
    • 2017
  • Recently, SNS has become an important channel for marketing as well as personal communication. However, cybercrime has also evolved with the development of information and communication technology, and illegal advertising is distributed to SNS in large quantity. As a result, personal information is lost and even monetary damages occur more frequently. In this study, we propose a method to analyze which sentences and documents, which have been sent to the SNS, are related to financial fraud. First of all, as a conceptual framework, we developed a matrix of conceptual characteristics of cybercriminality on SNS and emergency management. We also suggested emergency management process which consists of Pre-Cybercriminality (e.g. risk identification) and Post-Cybercriminality steps. Among those we focused on risk identification in this paper. The main process consists of data collection, preprocessing and analysis. First, we selected two words 'daechul(loan)' and 'sachae(private loan)' as seed words and collected data with this word from SNS such as twitter. The collected data are given to the two researchers to decide whether they are related to the cybercriminality, particularly financial fraud, or not. Then we selected some of them as keywords if the vocabularies are related to the nominals and symbols. With the selected keywords, we searched and collected data from web materials such as twitter, news, blog, and more than 820,000 articles collected. The collected articles were refined through preprocessing and made into learning data. The preprocessing process is divided into performing morphological analysis step, removing stop words step, and selecting valid part-of-speech step. In the morphological analysis step, a complex sentence is transformed into some morpheme units to enable mechanical analysis. In the removing stop words step, non-lexical elements such as numbers, punctuation marks, and double spaces are removed from the text. In the step of selecting valid part-of-speech, only two kinds of nouns and symbols are considered. Since nouns could refer to things, the intent of message is expressed better than the other part-of-speech. Moreover, the more illegal the text is, the more frequently symbols are used. The selected data is given 'legal' or 'illegal'. To make the selected data as learning data through the preprocessing process, it is necessary to classify whether each data is legitimate or not. The processed data is then converted into Corpus type and Document-Term Matrix. Finally, the two types of 'legal' and 'illegal' files were mixed and randomly divided into learning data set and test data set. In this study, we set the learning data as 70% and the test data as 30%. SVM was used as the discrimination algorithm. Since SVM requires gamma and cost values as the main parameters, we set gamma as 0.5 and cost as 10, based on the optimal value function. The cost is set higher than general cases. To show the feasibility of the idea proposed in this paper, we compared the proposed method with MLE (Maximum Likelihood Estimation), Term Frequency, and Collective Intelligence method. Overall accuracy and was used as the metric. As a result, the overall accuracy of the proposed method was 92.41% of illegal loan advertisement and 77.75% of illegal visit sales, which is apparently superior to that of the Term Frequency, MLE, etc. Hence, the result suggests that the proposed method is valid and usable practically. In this paper, we propose a framework for crisis management caused by abnormalities of unstructured data sources such as SNS. We hope this study will contribute to the academia by identifying what to consider when applying the SVM-like discrimination algorithm to text analysis. Moreover, the study will also contribute to the practitioners in the field of brand management and opinion mining.

Transition of Cheoldo Park and Its Significance as Sports Park (운동공원으로서 철도공원의 변화와 의의)

  • Kim, Youngmin;Cho, Seho
    • Journal of the Korean Institute of Landscape Architecture
    • /
    • v.48 no.3
    • /
    • pp.54-65
    • /
    • 2020
  • This study examines history, usage, and spatial change of Cheoldo Park and its significance in the park planning in the Japanese colonial period focusing on the concept of a sports park. Cheoldo Park was verified as the first newly planned park built during the Japanese colonial period and continuously appeared in the official planning documents with different names and areas as a new planned park. This suggest that changes of Cheoldo Park reveal the important conceptual transition in the park planning. Activities in the park were understood by analyzing news paper articles and pictures, while spatial changes of the park were analyzed through maps and aerial photos. Changes in a park planning process were examined through analyzing the planning documents and maps. Cheoldo Park was opened in 1915 as a supporting facility of the Yongsan rail company residence complex. As Cheoldo Park became one of the urban parks of Gyeongseong in 1925, it had functioned as one of the main sports complexes of Gyeongseong. Although a sports park was suggested as a new type of urban park in the 1930's park plan, the programmatic aspect of a sports park was not distinctly defined yet. Cheoldo Park was not regarded as a sports park in the 1930's park plan. As a sports park was distinguished from other types of urban parks pro grammatically in the 1930s, the city tried to transform Cheoldo Park into a sports park. In the park plan of 1940, with major spatial expansion, Cheoldo Park became Ichon Park to be a main large park and sports park of Gyeongseong. Cheoldo Park contributed to the establishing a new direction of modern park planning, shifting from planning focus on quantitative improvement to qualitative improvement of urban parks. It also provided a realistic model to implement the park plans to overcome various limitations of the Japanese colonial period.

Improving the Retrieval Effectiveness by Incorporating Word Sense Disambiguation Process (정보검색 성능 향상을 위한 단어 중의성 해소 모형에 관한 연구)

  • Chung, Young-Mee;Lee, Yong-Gu
    • Journal of the Korean Society for information Management
    • /
    • v.22 no.2 s.56
    • /
    • pp.125-145
    • /
    • 2005
  • This paper presents a semantic vector space retrieval model incorporating a word sense disambiguation algorithm in an attempt to improve retrieval effectiveness. Nine Korean homonyms are selected for the sense disambiguation and retrieval experiments. The total of approximately 120,000 news articles comprise the raw test collection and 18 queries including homonyms as query words are used for the retrieval experiments. A Naive Bayes classifier and EM algorithm representing supervised and unsupervised learning algorithms respectively are used for the disambiguation process. The Naive Bayes classifier achieved $92\%$ disambiguation accuracy. while the clustering performance of the EM algorithm is $67\%$ on the average. The retrieval effectiveness of the semantic vector space model incorporating the Naive Bayes classifier showed $39.6\%$ precision achieving about $7.4\%$ improvement. However, the retrieval effectiveness of the EM algorithm-based semantic retrieval is $3\%$ lower than the baseline retrieval without disambiguation. It is worth noting that the performances of disambiguation and retrieval depend on the distribution patterns of homonyms to be disambiguated as well as the characteristics of queries.

Multilingual Story Link Detection based on Properties of Event Terms (사건 어휘의 특성을 반영한 다국어 사건 연결 탐색)

  • Lee Kyung-Soon
    • The KIPS Transactions:PartB
    • /
    • v.12B no.1 s.97
    • /
    • pp.81-90
    • /
    • 2005
  • In this paper, we propose a novel approach which models multilingual story link detection by adapting the features such as timelines and multilingual spaces as weighting components to give distinctive weights to terms related to events. On timelines term significance is calculated by comparing term distribution of the documents on that day with that on the total document collection reported, and used to represent the document vectors on that day. Since two languages can provide more information than one language, term significance is measured on each language space and used to refer the other language space as a bridge on multilingual spaces. Evaluating the method on Korean and Japanese news articles, our method achieved $14.3{\%}\;and\;16.7{\%}$ improvement for mono- and multi-lingual story pairs, and for multilingual story pairs, respectively. By measuring the space density, the proposed weighting components are verified with a high density of the intra-event stories and a low density of the inter-events stories. This result indicates that the proposed method is helpful for multilingual story link detection.

The Level of Importance of Well-being Foods and the Level of Satisfaction Depending on Married Women's Lifestyle (기혼여성의 라이프스타일 유형에 따른 웰빙지향 식품에 대한 중요도 및 구매만족도)

  • Han, Sung-Hee
    • Journal of Family Resource Management and Policy Review
    • /
    • v.14 no.4
    • /
    • pp.239-262
    • /
    • 2010
  • This study looks at the patterns of married women's lifestyles and verifies whether there are differences in their preferences, the will to continue shopping, and the importance of healthy foods. The paper analyzes the relative influence of each lifestyle pattern on the level of satisfaction with healthy foods. The results of the analysis of this study are as follows. To find patterns in the lifestyles of married women ages 20s to 50s, the factors were analyzed and five lifestyle patterns were extracted: health managing type, fashion pursuing type, self-expressing type, family-oriented type, and eco-friendly type. If we examine the purchasing of healthy foods for each lifestyle, women with a self-expressing lifestyle gain more information from news articles, books, and salespeople than from other information sources. Women of the health managing, family-oriented, and eco-friendly types had high purchasing frequencies and amounts. A cluster analysis was carried out to categorize the different groups being investigated into lifestyle types. They were categorized into the four clusters: active multiple-oriented type; fashion, self-expressing compromising type; passive well-being oriented type; and family and health managing type. It has been verified that there are differences among the clusters in terms of the level of importance of products, contributions to health, as well as distribution and management of healthy foods. To be more specific, the level of importance of the products as well as their distribution and management manifested as being higher among the active multiple-oriented type and the family-oriented and health managing types. The level of importance of contributions to health scored high among all groups, except the passive well-being oriented type. The active multiple-oriented type and the family-oriented and health managing types showed a high level of preference and will to continue purchasing healthy foods, while the fashion and self-expressing compromising types and passive well-being oriented type showed a low level of preference and will. In order to find patterns in the level of satisfaction with healthy foods, three factors were analyzed: credibility of labels, contributions to health, and satisfaction with the store. The factors that had the greatest influence on the total level of satisfaction was the credibility of labels for the family-oriented lifestyle; a product's contribution to health for the health managing lifestyle; and the store for the fashion pursuing lifestyle.

  • PDF

Interplays among Public Opinion, Public Policy and Discourse: Case Study about the Discursive Structure and Media Politics Surrounding the Fiscal Soundness Policy (재정건전성 담론 해체하기: 미디어담론에 내포된 프레임 구조와 변화를 중심으로)

  • Kang, Kuk-Jin;Kim, Sung-Hae
    • Korean journal of communication and information
    • /
    • v.63
    • /
    • pp.5-25
    • /
    • 2013
  • Korean society suffers from severe divisions represented by bi-polarization and collapse of the middle class. Intensive demanding on expanding social welfare budget has emerged in accordance with such a dramatic shift. Social consensus moving toward well-financed welfare policy, however, happens to meet political opposition supported by the discourse of fiscal soundness. This paper thus pays particular attention to deciphering the discursive structure in way of understanding how discourses bring public policy into play. For this purpose, news articles about fiscal soundness collected from 8 national newspapers have been analyzed in terms of frame, attitude, perspective and world view. Research results show, first of all, that there exist persistent competition between two frames identified as 'reduced tax with fiscal discipline' and 'increased tax with welfare money.' While the 'reduced tax' frame favors in maintaining tax cut at the expense of welfare budjet, the frame of 'increased tax' supports such arguments as the flexible employment of fiscal soundness and prosperity of national community helped by widening tax revenues. Also did these frames include a number of sub-frames like welfare populism, partisan politics, trickle down effect, tax bonanza for the rich, universal welfare and market over-reactions in order to bolster its logical authority. Media's active taking a part in penetrating supportive frames in line with political stance was found as well. Taking into account both the discursive structure upheld by frames and politics materialized by the media, the authors argue that public policies should be considered more as discourse than fixed reality. Shedding additional light on understanding the interplay among public opinion, policies and media discourse is of another importance for further study.

  • PDF

Social Big Data-based Co-occurrence Analysis of the Main Person's Characteristics and the Issues in the 2016 Rio Olympics Men's Soccer Games (소셜 빅데이터 기반 2016리우올림픽 축구 관련 이슈 및 인물에 대한 연관단어 분석)

  • Park, SungGeon;Lee, Soowon;Hwang, YoungChan
    • 한국체육학회지인문사회과학편
    • /
    • v.56 no.2
    • /
    • pp.303-320
    • /
    • 2017
  • This paper seeks to better understand the focal issues and persons related to Rio Olympic soccer games through social data science and analytics. This study collected its data from online news articles and comments specific to KOR during the Olympic football games. In order to investigate the public interests for each game and target persons, this study performed the co-occurrence words analysis. Then after, the study applied the NodeXL software to perform its visualization of the results. Through this application and process, the study found several major issues during the Rio Olympic men's football game including the following: the match between KOR and PIJ, KOR player Heungmin Son, commentator Young-Pyo Lee, sportscaster Woo-Jong Jo. The study also showed the general public opinion expressed positive words towards the South Korean national football team during the Rio Olympics, though there existed negative words as well. Furthermore the study revealed positive attitude towards the commentators and casters. In conclusion, the way to increase the public's interest in big sporting events can be achieved by providing the following: contents that include various professional sports analysis, a capable domain expert with thorough preparation, a commentator and/or caster with artistic sense as well as well-spoken, explanatory power and so on. Multidisciplinary research combined with sports science, social science, information technology and media can contribute to a wide range of theoretical studies and practical developments within the sports industry.

Discovering the Knowledge Structure of Graphene Technology by Text Mining National R&D Projects and Newspapers (국가R&D과제와 신문에서 텍스트마이닝을 통한 그래핀 기술의 지식구조 탐색)

  • Lee, Ji-Yeon;Na, Hye-In;Lee, Byeong-Hee;Kim, Tae-Hyun
    • The Journal of the Korea Contents Association
    • /
    • v.21 no.2
    • /
    • pp.85-99
    • /
    • 2021
  • Graphene, called the "dream material" is drawing attention as a groundbreaking new material that will lead the era of the 4th Industrial Revolution. Graphene has high strength, excellent electrical and thermal conductivity, excellent optical permeability, and excellent gas barrier properties. In this paper, as the South Korean government recently announced Green New Deal and Digital New Deal policy, we analyze graphene technology, which is also attracting attention for its application to Corona 19 biosensor, to understand its national R&D trend and knowledge structure, and to explore the possibility of its application. Firstly, 4,054 cases of national R&D project information for the last 10 years are collected from the National Science & Technology Information Service(NTIS) to analyze the trend of graphene-related R&D. Besides, projects classified as green technology are analyzed concerning the government's Green New Deal policy. Secondly, text mining analysis is conducted by collecting 500 recent graphene-related articles from e-newspapers. According to the analysis, the field with the largest number of projects was found to be high-efficiency secondary battery technology, and the proportion of total research funds was also the highest. It is expected that South Korea will lead the development of graphene technology in the future to become a world leader in diverse industries including electric vehicles, cellular phone batteries, next-generation semiconductors, 5G, and biosensors.

Detecting Weak Signals for Carbon Neutrality Technology using Text Mining of Web News (탄소중립 기술의 미래신호 탐색연구: 국내 뉴스 기사 텍스트데이터를 중심으로)

  • Jisong Jeong;Seungkook Roh
    • Journal of Industrial Convergence
    • /
    • v.21 no.5
    • /
    • pp.1-13
    • /
    • 2023
  • Carbon neutrality is the concept of reducing greenhouse gases emitted by human activities and making actual emissions zero through removal of remaining gases. It is also called "Net-Zero" and "carbon zero". Korea has declared a "2050 Carbon Neutrality policy" to cope with the climate change crisis. Various carbon reduction legislative processes are underway. Since carbon neutrality requires changes in industrial technology, it is important to prepare a system for carbon zero. This paper aims to understand the status and trends of global carbon neutrality technology. Therefore, ROK's web platform "www.naver.com." was selected as the data collection scope. Korean online articles related to carbon neutrality were collected. Carbon neutrality technology trends were analyzed by future signal methodology and Word2Vec algorithm which is a neural network deep learning technology. As a result, technology advancement in the steel and petrochemical sectors, which are carbon over-release industries, was required. Investment feasibility in the electric vehicle sector and technology advancement were on the rise. It seems that the government's support for carbon neutrality and the creation of global technology infrastructure should be supported. In addition, it is urgent to cultivate human resources, and possible to confirm the need to prepare support policies for carbon neutrality.