• Title/Summary/Keyword: Online mining

Search Result 398, Processing Time 0.026 seconds

A Trend Analysis and Policy proposal for the Work Permit System through Text Mining: Focusing on Text Mining and Social Network analysis (텍스트마이닝을 통한 고용허가제 트렌드 분석과 정책 제안 : 텍스트마이닝과 소셜네트워크 분석을 중심으로)

  • Ha, Jae-Been;Lee, Do-Eun
    • Journal of Convergence for Information Technology
    • /
    • v.11 no.9
    • /
    • pp.17-27
    • /
    • 2021
  • The aim of this research was to identify the issue of the work permit system and consciousness of the people on the system, and to suggest some ideas on the government policies on it. To achieve the aim of research, this research used text mining based on social data. This research collected 1,453,272 texts from 6,217 units of online documents which contained 'work permit system' from January to December, 2020 using Textom, and did text-mining and social network analysis. This research extracted 100 key words frequently mentioned from the analyses of data top-level key word frequency, and degree centrality analysis, and constituted job problem, importance of policy process, competitiveness in the respect of industries, and improvement of living conditions of foreign workers as major key words. In addition, through semantic network analysis, this research figured out major awareness like 'employment policy', and various kinds of ambient awareness like 'international cooperation', 'workers' human rights', 'law', 'recruitment of foreigners', 'corporate competitiveness', 'immigrant culture' and 'foreign workforce management'. Finally, this research suggested some ideas worth considering in establishing government policies on the work permit system and doing related researches.

The Effect of Expert Reviews on Consumer Product Evaluations: A Text Mining Approach (전문가 제품 후기가 소비자 제품 평가에 미치는 영향: 텍스트마이닝 분석을 중심으로)

  • Kang, Taeyoung;Park, Do-Hyung
    • Journal of Intelligence and Information Systems
    • /
    • v.22 no.1
    • /
    • pp.63-82
    • /
    • 2016
  • Individuals gather information online to resolve problems in their daily lives and make various decisions about the purchase of products or services. With the revolutionary development of information technology, Web 2.0 has allowed more people to easily generate and use online reviews such that the volume of information is rapidly increasing, and the usefulness and significance of analyzing the unstructured data have also increased. This paper presents an analysis on the lexical features of expert product reviews to determine their influence on consumers' purchasing decisions. The focus was on how unstructured data can be organized and used in diverse contexts through text mining. In addition, diverse lexical features of expert reviews of contents provided by a third-party review site were extracted and defined. Expert reviews are defined as evaluations by people who have expert knowledge about specific products or services in newspapers or magazines; this type of review is also called a critic review. Consumers who purchased products before the widespread use of the Internet were able to access expert reviews through newspapers or magazines; thus, they were not able to access many of them. Recently, however, major media also now provide online services so that people can more easily and affordably access expert reviews compared to the past. The reason why diverse reviews from experts in several fields are important is that there is an information asymmetry where some information is not shared among consumers and sellers. The information asymmetry can be resolved with information provided by third parties with expertise to consumers. Then, consumers can read expert reviews and make purchasing decisions by considering the abundant information on products or services. Therefore, expert reviews play an important role in consumers' purchasing decisions and the performance of companies across diverse industries. If the influence of qualitative data such as reviews or assessment after the purchase of products can be separately identified from the quantitative data resources, such as the actual quality of products or price, it is possible to identify which aspects of product reviews hamper or promote product sales. Previous studies have focused on the characteristics of the experts themselves, such as the expertise and credibility of sources regarding expert reviews; however, these studies did not suggest the influence of the linguistic features of experts' product reviews on consumers' overall evaluation. However, this study focused on experts' recommendations and evaluations to reveal the lexical features of expert reviews and whether such features influence consumers' overall evaluations and purchasing decisions. Real expert product reviews were analyzed based on the suggested methodology, and five lexical features of expert reviews were ultimately determined. Specifically, the "review depth" (i.e., degree of detail of the expert's product analysis), and "lack of assurance" (i.e., degree of confidence that the expert has in the evaluation) have statistically significant effects on consumers' product evaluations. In contrast, the "positive polarity" (i.e., the degree of positivity of an expert's evaluations) has an insignificant effect, while the "negative polarity" (i.e., the degree of negativity of an expert's evaluations) has a significant negative effect on consumers' product evaluations. Finally, the "social orientation" (i.e., the degree of how many social expressions experts include in their reviews) does not have a significant effect on consumers' product evaluations. In summary, the lexical properties of the product reviews were defined according to each relevant factor. Then, the influence of each linguistic factor of expert reviews on the consumers' final evaluations was tested. In addition, a test was performed on whether each linguistic factor influencing consumers' product evaluations differs depending on the lexical features. The results of these analyses should provide guidelines on how individuals process massive volumes of unstructured data depending on lexical features in various contexts and how companies can use this mechanism from their perspective. This paper provides several theoretical and practical contributions, such as the proposal of a new methodology and its application to real data.

Changes in consumer perception of fashion products in a pandemic - Effects of COVID-19 spead - (팬데믹 상황에서의 패션제품에 대한 소비자의 인식 변화 분석 - 코로나19 확산의 영향 -)

  • Choi, Yeong-Hyeon;Lee, Kyu-Hye
    • The Research Journal of the Costume Culture
    • /
    • v.28 no.3
    • /
    • pp.285-298
    • /
    • 2020
  • This study aimed at examining fashion consumers' awareness during the COVID-19 pandemic. Big data analysis methods, such as text mining, social network analysis, and regression analysis, were applied to user posts about fashion on Korean portal websites and social media during COVID-19. R 3.4.4, UCINET 6, and SPSS 25.0 software were used to analyze the data. The results were as follows. In researching the popular fashion-related topics during COVID-19, the prevention of infection and prophylaxis were significant concerns in the early stage (Jan 1 to Jan 31, 2020), and changed to online channels and online fashion platforms. Then, various topics and fashion keywords appeared with COVID-19-related keywords afterwards. Fashion-related subjects concerned prophylaxis, home life, digital and beauty products, online channels, and fashion consumption. In comparing fashion consumers' awareness during COVID-19 with SARS and MERS, "face masks" was the common keyword for all three illnesses; yet, the prevention of infection was a major consumer concern in fashion-related subjects during COVD-19 only. As COVD-19 cases increased, the search volume for face masks, shoes, and home clothes also increased. Consumer awareness about face masks shifted from blocking yellow dust and micro-dust to the sociocultural significance and short supply. Keywords related to performance turned out to be the major awareness as to shoes, and home clothes were repurposed with an expanded range of use.

Analysis of Research Topics and Trends on COVID-19 in Korea Using Latent Dirichlet Allocation (LDA)

  • Heo, Seong-Min;Yang, Ji-Yeon
    • Journal of the Korea Society of Computer and Information
    • /
    • v.25 no.12
    • /
    • pp.83-91
    • /
    • 2020
  • This study aims to identify research topics and examine the trend of Covid19-related papers on DBpia. Applying latent Dirichlet allocation (LDA), we have extracted seven research topics, each of which concerns "International Dynamics", "Technology & Security", "Psychological Impact", "Biomedical-Related", "Economic Impact", "Online Education", and "Religion-Related". In addition, we used the multinomial logistic model to examine the trend of research topics. We found that the papers mainly cover topics related to "International Dynamics" and "Biomedical-Related" before June 2020, but the topics have become diverse since then. In particular, topics regarding "Economic Impact", "Online Education" and "Psychological Impact" has drawn increased attention of researchers. The findings would provide a guideline for collaboration in Covid19-related research, and could serve as a reference work for active research.

The Dynamics of Online word-of-mouth and Marketing Performance : Exploring Mobile Game Application Reviews (온라인 구전과 마케팅 성과의 다이나믹스 연구 : 모바일 게임 앱 리뷰를 중심으로)

  • Kim, In-kiw;Cha, Seong-Soo
    • The Journal of the Korea Contents Association
    • /
    • v.20 no.12
    • /
    • pp.36-48
    • /
    • 2020
  • App market has continuously been growth since its launch. The market revenues will reach about 1,000 billion US dollars in 2019. App is a core service for smartphone. Currently, there are more than 1.5 million mobile apps in App platform calling out for attention. So, if you are looking at developing a successful app, you need to have a solid marketing and distribution strategy. Online word of mouth(eWOM) is one of the most effective, powerful App marketing method. eWOM affect potential consumers' decision making, and this effect can spread rapidly through online social network. Despite the increasing research on word of mouth, only few studies have focused on content analysis. Most of studies focused on the causes and acceptance of eWOM and eWOM performance measurement. This study aims to content analysis of mobile apps review In 2013, Google researchers announced Word2Vec. This method has overcome the weakness of previous studies. This is faster and more accurate than traditional methods. This study found out the relationship between mobile app reviews and checked for reactions by Word2vec.

A topic modeling analysis for Korean online newspapers: Focusing on the social perceptions of nurses during the COVID-19 epidemic period (토픽모델링을 이용한 한국 인터넷 뉴스의 간호사 관련 기사 분석: COVID-19 유행시기를 중점으로)

  • Chang, Soo Jung;Park, Sunah;Son, Yedong
    • The Journal of Korean Academic Society of Nursing Education
    • /
    • v.28 no.4
    • /
    • pp.444-455
    • /
    • 2022
  • Purpose: This study explored the meaning of the social perceptions of nurses in online news articles during the coronavirus disease 2019 (COVID-19) pandemic. Methods: A total of 339 nurse-related articles published in Korean online newspapers from January 1 to December 31, 2020, were extracted by entering various combinations of OR and AND with the four words "Corona," "COVID," "Nursing," and "Nurse" as search keywords using BIGKinds, a news database provided by the Korea Press Foundation. The collected data were analyzed with a keyword network analysis and topic modeling using NetMiner 4. Results: The top keywords extracted from the nurse-related news articles were, in the following order, "metropolitan area," "protective clothing," "government," "task," and "admission." Four topics representing keywords were identified: "encouragement for dedicated nurses," "poor work environment," "front-line nurses working with obligation during the COVID-19 pandemic," and "nurses' efforts to prevent the spread of COVID-19." Conclusion: The media's attention to the dedication of nurses, the shortage of nursing resources, and the need for government support is encouraging in that it forms the public opinion necessary to lead to substantial improvements in treating nurses. The nursing community should actively promote policy proposals to improve treatment toward nurses by utilizing the net function of the media and proactively seek and apply strategies to improve the image of nurses working in various fields.

Detection of Phantom Transaction using Data Mining: The Case of Agricultural Product Wholesale Market (데이터마이닝을 이용한 허위거래 예측 모형: 농산물 도매시장 사례)

  • Lee, Seon Ah;Chang, Namsik
    • Journal of Intelligence and Information Systems
    • /
    • v.21 no.1
    • /
    • pp.161-177
    • /
    • 2015
  • With the rapid evolution of technology, the size, number, and the type of databases has increased concomitantly, so data mining approaches face many challenging applications from databases. One such application is discovery of fraud patterns from agricultural product wholesale transaction instances. The agricultural product wholesale market in Korea is huge, and vast numbers of transactions have been made every day. The demand for agricultural products continues to grow, and the use of electronic auction systems raises the efficiency of operations of wholesale market. Certainly, the number of unusual transactions is also assumed to be increased in proportion to the trading amount, where an unusual transaction is often the first sign of fraud. However, it is very difficult to identify and detect these transactions and the corresponding fraud occurred in agricultural product wholesale market because the types of fraud are more intelligent than ever before. The fraud can be detected by verifying the overall transaction records manually, but it requires significant amount of human resources, and ultimately is not a practical approach. Frauds also can be revealed by victim's report or complaint. But there are usually no victims in the agricultural product wholesale frauds because they are committed by collusion of an auction company and an intermediary wholesaler. Nevertheless, it is required to monitor transaction records continuously and to make an effort to prevent any fraud, because the fraud not only disturbs the fair trade order of the market but also reduces the credibility of the market rapidly. Applying data mining to such an environment is very useful since it can discover unknown fraud patterns or features from a large volume of transaction data properly. The objective of this research is to empirically investigate the factors necessary to detect fraud transactions in an agricultural product wholesale market by developing a data mining based fraud detection model. One of major frauds is the phantom transaction, which is a colluding transaction by the seller(auction company or forwarder) and buyer(intermediary wholesaler) to commit the fraud transaction. They pretend to fulfill the transaction by recording false data in the online transaction processing system without actually selling products, and the seller receives money from the buyer. This leads to the overstatement of sales performance and illegal money transfers, which reduces the credibility of market. This paper reviews the environment of wholesale market such as types of transactions, roles of participants of the market, and various types and characteristics of frauds, and introduces the whole process of developing the phantom transaction detection model. The process consists of the following 4 modules: (1) Data cleaning and standardization (2) Statistical data analysis such as distribution and correlation analysis, (3) Construction of classification model using decision-tree induction approach, (4) Verification of the model in terms of hit ratio. We collected real data from 6 associations of agricultural producers in metropolitan markets. Final model with a decision-tree induction approach revealed that monthly average trading price of item offered by forwarders is a key variable in detecting the phantom transaction. The verification procedure also confirmed the suitability of the results. However, even though the performance of the results of this research is satisfactory, sensitive issues are still remained for improving classification accuracy and conciseness of rules. One such issue is the robustness of data mining model. Data mining is very much data-oriented, so data mining models tend to be very sensitive to changes of data or situations. Thus, it is evident that this non-robustness of data mining model requires continuous remodeling as data or situation changes. We hope that this paper suggest valuable guideline to organizations and companies that consider introducing or constructing a fraud detection model in the future.

Detecting gold-farmers' group in MMORPG by analyzing connection pattern (연결패턴 정보 분석을 통한 온라인 게임 내 불량사용자 그룹 탐지에 관한 연구)

  • Seo, Dong-Nam;Woo, Ji-Young;Woo, Kyung-Moon;Kim, Chong-Kwon;Kim, Huy-Kang
    • Journal of the Korea Institute of Information Security & Cryptology
    • /
    • v.22 no.3
    • /
    • pp.585-600
    • /
    • 2012
  • Security issues in online games are increasing as the online game industry grows. Real money trading (RMT) by online game users has become a security issue in several countries including Korea because RMT is related to criminal activities such as money laundering or tax evasion. RMT-related activities are done by professional work forces, namely gold-farmers, and many of them employ the automated program, bot, to gain cyber asset in a quick and efficient way. Online game companies try to prevent the activities of gold-farmers using game bots detection algorithm and block their accounts or IP addresses. However, game bot detection algorithm can detect a part of gold-farmer's network and IP address blocking also can be detoured easily by using the virtual private server or IP spoofing. In this paper, we propose a method to detect gold-farmer groups by analyzing their connection patterns to the online game servers, particularly information on their routing and source locations. We verified that the proposed method can reveal gold-farmers' group effectively by analyzing real data from the famous MMORPG.

Developing a deep learning-based recommendation model using online reviews for predicting consumer preferences: Evidence from the restaurant industry (딥러닝 기반 온라인 리뷰를 활용한 추천 모델 개발: 레스토랑 산업을 중심으로)

  • Dongeon Kim;Dongsoo Jang;Jinzhe Yan;Jiaen Li
    • Journal of Intelligence and Information Systems
    • /
    • v.29 no.4
    • /
    • pp.31-49
    • /
    • 2023
  • With the growth of the food-catering industry, consumer preferences and the number of dine-in restaurants are gradually increasing. Thus, personalized recommendation services are required to select a restaurant suitable for consumer preferences. Previous studies have used questionnaires and star-rating approaches, which do not effectively depict consumer preferences. Online reviews are the most essential sources of information in this regard. However, previous studies have aggregated online reviews into long documents, and traditional machine-learning methods have been applied to these to extract semantic representations; however, such approaches fail to consider the surrounding word or context. Therefore, this study proposes a novel review textual-based restaurant recommendation model (RT-RRM) that uses deep learning to effectively extract consumer preferences from online reviews. The proposed model concatenates consumer-restaurant interactions with the extracted high-level semantic representations and predicts consumer preferences accurately and effectively. Experiments on real-world datasets show that the proposed model exhibits excellent recommendation performance compared with several baseline models.

A Study on the Future Direction of the Digital Signage Industry in Korea: A Big Data Network Analysis from 2008 to 2019

  • Yoo, Seung-Chul;Piscarac, Diana
    • International Journal of Advanced Culture Technology
    • /
    • v.8 no.1
    • /
    • pp.120-127
    • /
    • 2020
  • The use of digital signage in the public and commercial communication areas has been increasing in recent years. By integrating cutting-edge information technologies such as 5G, artificial intelligence, and the Internet of Things, digital signage continues to break apart from traditional outdoor advertising media. This study identified the problems facing the domestic digital signage industry by exploring and analyzing major issues related to digital signage and derived future development measures. Specifically, online documents were collected based on the digital signage-related keywords created over the past 12 years to conduct big data network analysis, and key topics were derived through visualization of the results. This study has great policy implications in that it excluded biased interpretations based on the viewpoints of companies or the government and, more objectively, suggested the direction of the digital signage industry's development in the domestic media market.