• Title/Summary/Keyword: text mining analysis

Search Result 1,221, Processing Time 0.026 seconds

Malware Analysis Mechanism using the Word Cloud based on API Statistics (API 통계 기반의 워드 클라우드를 이용한 악성코드 분석 기법)

  • Yu, Sung-Tae;Oh, Soo-Hyun
    • Journal of the Korea Academia-Industrial cooperation Society
    • /
    • v.16 no.10
    • /
    • pp.7211-7218
    • /
    • 2015
  • Tens of thousands of malicious codes are generated on average in a day. New types of malicious codes are surging each year. Diverse methods are used to detect such codes including those based on signature, API flow, strings, etc. But most of them are limited in detecting new malicious codes due to bypass techniques. Therefore, a lot of researches have been performed for more efficient detection of malicious codes. Of them, visualization technique is one of the most actively researched areas these days. Since the method enables more intuitive recognition of malicious codes, it is useful in detecting and examining a large number of malicious codes efficiently. In this paper, we analyze the relationships between malicious codes and Native API functions. Also, by applying the word cloud with text mining technique, major Native APIs of malicious codes are visualized to assess their maliciousness. The proposed malicious code analysis method would be helpful in intuitively probing behaviors of malware.

New economic policy uncertainty indexes for South Korea (새로운 우리나라 불확실성 지수의 작성)

  • Lee, Geung-Hee;Cho, Joo-Hee;Jo, Jin-Gyeong
    • The Korean Journal of Applied Statistics
    • /
    • v.33 no.5
    • /
    • pp.639-653
    • /
    • 2020
  • Baker et al. (Quarterly Journal of Economics, 134, 1593-1636, 2016) developed an Economic Policy Uncertainty (EPU) index for South Korea in the same way as the U.S. EPU Index. However, the South Korean EPU index of Baker et al. (2016) has limitations as it did not fully reflect South Korean situation in terms of keyword selection and the selection of newspapers. We develop monthly South Korean economic policy uncertainty indexes with different keywords and news media. Various analyses have been conducted in order to examine the usefulness of the newly compiled indexes.

AI speakers!, Speak with feelings - Focusing on Analysis of SNS Comments (AI 스피커!, 감정을 담아 말해봐 - SNS 댓글 분석을 중심으로)

  • Kim, Joon-Hwan;Lee, Namyeon
    • Journal of Digital Convergence
    • /
    • v.18 no.7
    • /
    • pp.101-110
    • /
    • 2020
  • Devices that add emotion-specific services or various functions are appearing in AI speakers and related devices. To this end, this study performed topic modeling analysis on the topics of post-purchase texts written by AI speaker users, and compared them with the data collected via survey questionnaires. Furthermore, data on the emotional intelligence of AI speakers and relationship quality were collected from 600 users and analyzed using structural equation modeling. The findings of the study are as follows: First, the analysis results of topic modeling showed that most of the articles mainly mention the functional aspects of AI speakers. Second, emotional intelligence of AI speaker perceived by consumer affected relationship quality, and relationship quality had a positive effect on customer satisfaction. Therefore, this study expands the area of AI research by integrating the concept of emotional intelligence and relationship quality to provide new theoretical and practical implications.

Knowledge Trend Analysis of Uncertainty in Biomedical Scientific Literature (생의학 학술 문헌의 불확실성 기반 지식 동향 분석에 관한 연구)

  • Heo, Go Eun;Song, Min
    • Journal of the Korean Society for information Management
    • /
    • v.36 no.2
    • /
    • pp.175-199
    • /
    • 2019
  • Uncertainty means incomplete stages of knowledge of propositions due to the lack of consensus of information and existing knowledge. As the amount of academic literature increases exponentially over time, new knowledge is discovered as research develops. Although the flow of time may be an important factor to identify patterns of uncertainty in scientific knowledge, existing studies have only identified the nature of uncertainty based on the frequency in a particular discipline, and they did not take into consideration of the flow of time. Therefore, in this study, we identify and analyze the uncertainty words that indicate uncertainty in the scientific literature and investigate the stream of knowledge. We examine the pattern of biomedical knowledge such as representative entity pairs, predicate types, and entities over time. We also perform the significance testing using linear regression analysis. Seven pairs out of 17 entity pairs show the significant decrease pattern statistically and all 10 representative predicates decrease significantly over time. We analyze the relative importance of representative entities by year and identify entities that display a significant rising and falling pattern.

Comparative analysis on design key-word of the four major international fashion collections - focus on 2018 fashion collection - (4대 해외 패션 컬렉션의 디자인 key-word 비교분석 - 2018년 패션 컬렉션을 중심으로 -)

  • Kim, Sae-Bom;Lee, Eun-Suk
    • Journal of the Korea Fashion and Costume Design Association
    • /
    • v.21 no.3
    • /
    • pp.109-119
    • /
    • 2019
  • The purpose of this study is to examine fashion trends and the direction of the four fashion collections by analyzing the design key-words of the four major international fashion collections in 2018. The data of this study was collected by extracting the key-words from Marie Claire Korea in 2018, with the total of the collected data numbering 2,144. The data was analyzed by text mining using the R program and word-cloud, and a co-occurrence network analysis was conducted. The results of this study are as follows: First, the key-words of fashion collection designs in 2018 were fringe and ruffle detail, silk and denim fabric, vivid color, stripe and check pattern, pants suit item, and oversized silhouette, focusing on romanticism and sport. Second, seasonal characteristics of the fashion collections were pastel colors in S/S, primary and vivid colors in F/W. Details were embroidery and cutouts in S/S, patchwork and fringe in F/W. Third, the design trends of the four major fashion collections were presented in the Paris collection: stripes, check patterns, embroidery, lace, tailoring, draping, romanticism, and glamor. In the Milan collection, checks, prints, denim, and minidresses reflected sport and romanticism. The London collection included fringe, ruffles, floral patterns, flower patterns, and romanticism. The New York collections included vivid colors, neon colors, pastel colors, oversize silhouettes, bodysuits, and long dresses.

Exploratory research based on big data for Improving the revisit rate of foreign tourists and invigorating consumption (외국인 관광객 재방문율 향상과 소비 활성화를 위한 빅데이터 기반의 탐색적 연구)

  • An, Sung-Hyun;Park, Seong-Taek
    • Journal of Industrial Convergence
    • /
    • v.18 no.6
    • /
    • pp.19-25
    • /
    • 2020
  • Big data analytics are indispensable today in various industries and public sectors. Therefore, in this study, we will utilize big data analysis to search for improvement plans for domestic tourism services using the LDA analysis method. In particular, we have tried an exploratory approach that can improve tourist satisfaction, which can improve revisit and service, especially in Seoul, which has the largest number of foreign tourists. In this study, we collected and analyzed statistical data of Seoul City and Korea Tourism Organization and Internet information such as SNS via R. And we utilized text mining methods including LDA. As a result of the analysis, one of the purposes of visiting South Korea by foreigners was gastronomic tourism. We will try to derive measures to improve the quality of services centered on gastronomic tourism.

A Study on Trend Analysis in Convergence Research Applying Word Cloud in Korea (워드 클라우드 기법을 이용한 국내 융복합 학술연구 트렌드 분석)

  • Kim, Joon-Hwan;Mun, Hyung-Jin;Lee, Hang
    • Journal of Digital Convergence
    • /
    • v.19 no.2
    • /
    • pp.33-38
    • /
    • 2021
  • The convergence trend is the core of the 4th industrial revolution, and due to such expectations and possibilities, various countermeasures are being sought in diverse fields. This study conducted a quantitative analysis to identify the trend of convergence research over the past 10 years. Specifically, major research keywords were extracted, word cloud techniques were applied, and visualized to identify trends in academic research on convergence. To this end, research papers from 2012 to 2020 published in journal of digital convergence were investigated. The analysis period was divided into two periods: the former 4 years(2012-2015) and the latter 4 years(2016-2019) to confirm the difference in research trends. In addition, the research papers of 2020 were analyzed in order to more clearly understand the changes in the research trend of the last year due to the COVID-19. The results of this study are significant in that they can be used as useful basic data for future research and to understand research trends as keywords in the field of convergence.

A Study on the Online Perception of Chabak Using Big Data Analysis (빅데이터 분석을 통한 차박의 온라인 인식에 대한 연구)

  • Kim, Sae-Hoon;Lee, Hwan-Soo
    • The Journal of Society for e-Business Studies
    • /
    • v.26 no.2
    • /
    • pp.61-81
    • /
    • 2021
  • In the era of untact, the "Chabak" using cars as accommodation spaces is attracting attention as a new form of travel. Due to the advantages, including low costs, convenience, and safety, as well as the characteristics of the vehicle enabling independent travel, the demand for Chabak is continuously increasing. Despite the rapid growth of the market and related industries, little academic has investigated this trend. To establish itself as a new type of travel culture and to sustain the growth of related industries, it is essential to understand the public perception of Chabak. Therefore, based on the marketing mix theory and big data analysis, this study analyzes the public perception of Chabak. The results showed that Chabak has established itself as a consumer-led travel culture, contributing to the aftermarket growth of the automobile industry. Additionally, consumers were found to be increasingly inclined to enjoy travel economically and wisely, and actively share information through social media. This initial study on the new travel trend of Chabak is significant in that it employs big data analysis on a theoretical basis.

A Study on Search Query Topics and Types using Topic Modeling and Principal Components Analysis (토픽모델링 및 주성분 분석 기반 검색 질의 유형 분류 연구)

  • Kang, Hyun-Ah;Lim, Heui-Seok
    • KIPS Transactions on Software and Data Engineering
    • /
    • v.10 no.6
    • /
    • pp.223-234
    • /
    • 2021
  • Recent advances in the 4th Industrial Revolution have accelerated the change of the shopping behavior from offline to online. Search queries show customers' information needs most intensively in online shopping. However, there are not many search query research in the field of search, and most of the prior research in the field of search query research has been studied on a limited topic and data-based basis based on researchers' qualitative judgment. To this end, this study defines the type of search query with data-based quantitative methodology by applying machine learning to search research query field to define the 15 topics of search query by conducting topic modeling based on search query and clicked document information. Furthermore, we present a new classification system of new search query types representing searching behavior characteristics by extracting key variables through principal component analysis and analyzing. The results of this study are expected to contribute to the establishment of effective search services and the development of search systems.

Changes and Applications of Rural Tourism in the Post-COVID-19 Era through Social Data Analysis (소셜데이터 분석을 통한 포스트 코로나 시대 농촌관광의 변화와 적용방안)

  • Kim, Young-Jin;Lee, Sung-hee;Son, Yong-hoon
    • Journal of Korean Society of Rural Planning
    • /
    • v.27 no.4
    • /
    • pp.43-54
    • /
    • 2021
  • This study analysed changes in rural tourism between before and after COVID-19 using LDA topic analysis. In order to understand the changes in rural tourism, blog data including the keyword 'Gochang-gun travel' was used. As a result of LDA topic analysis with blog data retrieved, the study found nine topics in 2019 and 2020. 2019 and 2020 are, generally, consistent in topics, but the three topics related to rural experiential tourism that appeared in 2019 did not appear in 2020. In 2020, three new topics emerged: Beach vacations and campings. New travel activities of noncontact with other people(Untact tourism in Korean context) in the COVID-19 era, and The negative impacts on travel businesses and behaviours from COVID-19. Especially, the adverse effects of COVID-19 have made an enormous decline in rural experience tourism destinations and cancellation of local festivals. On the other hand, new tourism activities have emerged due to COVID-19. Those activities have included camping, drive-thru destinations, and cycling. Ecological and natural tourist sites such as Ungok Wetland, Seonunsan Mountain, Seonunsa Temple, and Gusipo Beach appeared. These tourist destinations have a quiet atmosphere and less density place noncontacting with other people when visiting. Also, because overseas travel has become difficult, long-term stay travel in rural areas has appeared. This study indicates that COVID-19 has less impacted rural tourism than other tourism destinations with these positive and negative impacts.