• Title/Summary/Keyword: Multiple administration

Search Result 1,609, Processing Time 0.036 seconds

Extension Method of Association Rules Using Social Network Analysis (사회연결망 분석을 활용한 연관규칙 확장기법)

  • Lee, Dongwon
    • Journal of Intelligence and Information Systems
    • /
    • v.23 no.4
    • /
    • pp.111-126
    • /
    • 2017
  • Recommender systems based on association rule mining significantly contribute to seller's sales by reducing consumers' time to search for products that they want. Recommendations based on the frequency of transactions such as orders can effectively screen out the products that are statistically marketable among multiple products. A product with a high possibility of sales, however, can be omitted from the recommendation if it records insufficient number of transactions at the beginning of the sale. Products missing from the associated recommendations may lose the chance of exposure to consumers, which leads to a decline in the number of transactions. In turn, diminished transactions may create a vicious circle of lost opportunity to be recommended. Thus, initial sales are likely to remain stagnant for a certain period of time. Products that are susceptible to fashion or seasonality, such as clothing, may be greatly affected. This study was aimed at expanding association rules to include into the list of recommendations those products whose initial trading frequency of transactions is low despite the possibility of high sales. The particular purpose is to predict the strength of the direct connection of two unconnected items through the properties of the paths located between them. An association between two items revealed in transactions can be interpreted as the interaction between them, which can be expressed as a link in a social network whose nodes are items. The first step calculates the centralities of the nodes in the middle of the paths that indirectly connect the two nodes without direct connection. The next step identifies the number of the paths and the shortest among them. These extracts are used as independent variables in the regression analysis to predict future connection strength between the nodes. The strength of the connection between the two nodes of the model, which is defined by the number of nodes between the two nodes, is measured after a certain period of time. The regression analysis results confirm that the number of paths between the two products, the distance of the shortest path, and the number of neighboring items connected to the products are significantly related to their potential strength. This study used actual order transaction data collected for three months from February to April in 2016 from an online commerce company. To reduce the complexity of analytics as the scale of the network grows, the analysis was performed only on miscellaneous goods. Two consecutively purchased items were chosen from each customer's transactions to obtain a pair of antecedent and consequent, which secures a link needed for constituting a social network. The direction of the link was determined in the order in which the goods were purchased. Except for the last ten days of the data collection period, the social network of associated items was built for the extraction of independent variables. The model predicts the number of links to be connected in the next ten days from the explanatory variables. Of the 5,711 previously unconnected links, 611 were newly connected for the last ten days. Through experiments, the proposed model demonstrated excellent predictions. Of the 571 links that the proposed model predicts, 269 were confirmed to have been connected. This is 4.4 times more than the average of 61, which can be found without any prediction model. This study is expected to be useful regarding industries whose new products launch quickly with short life cycles, since their exposure time is critical. Also, it can be used to detect diseases that are rarely found in the early stages of medical treatment because of the low incidence of outbreaks. Since the complexity of the social networking analysis is sensitive to the number of nodes and links that make up the network, this study was conducted in a particular category of miscellaneous goods. Future research should consider that this condition may limit the opportunity to detect unexpected associations between products belonging to different categories of classification.

Nutritional Status and Health Risks of Low Income Elderly Women in Gwangju Area (광주지역 저소득층 여자노인의 영양상태와 건강위험요인에 관한 연구)

  • Yang, Eun-Ju;Bang, Hee-Myung
    • Journal of Nutrition and Health
    • /
    • v.41 no.1
    • /
    • pp.65-76
    • /
    • 2008
  • This study was performed to identify association between nutritional status and health risks of the elderly. This was a cross-sectional study involving low income elderly women in Gwangju, Korea (${\geq}$65y, n = 92). Socio-demographics, life style characteristics, health conditions, dietary intakes based on 24h-recall method, anthropometric measures, and clinical biochemistry parameters were examined. Anthropometric and clinical parameters included wt, ht, waist, hip, body protein, body fat, abdominal fat, total cholesterol, HDL-cholesterol, triglyceride, total protein, albumin, hemoglobin, hematocrit, fasting blood glucose, ferritin, IL-2, IL-6, TNF-${\alpha}$, CRP, TAS, TBARS, systolic blood pressure, and diastolic blood pressure. The subjects were divided into three groups based on age (65-74y, 75-84y, 85y${\leq}$) and were divided into two groups according to the sum of the Nutrition Screening Initiative (NSI) checklist score (adequate nutritional status, NSI score ${\leq}$3; at risk of malnutrition, NSI score >3). Mean and frequency of variables were estimated. Analysis of Variance, Tukey test, Chi-square test, and Multiple linear regression analyses were performed. Mean BMI and body fat were 25.1 $kg/m^2$ and 40.0%, respectively. However, for over 80% of subjects, the intakes of energy, fiber, thiamin, riboflavin, niacin, folate, Ca, K, and Zn were less than the Korean DRI (EAR or AI). The subjects who had lower NSI score tended to have better health status, eat meals frequently, have less depression, and exercise regularly. The subjects who had higher NSI score tended to have tooth problems, to eat alone most of time, and to be physically unable to cook or feed. Serum IL-6 and TNF-${\alpha}$ were significantly related with nutritional status which suggested higher tendency of inflammatory response. Serum IL-2, TAS, and glucose were significantly correlated with body fat (%) or abdominal fat (%). These results suggest that improving the nutritional status, increasing regular exercise, maintaining normal weight are beneficial to health care of low income elderly women.

Intelligent Brand Positioning Visualization System Based on Web Search Traffic Information : Focusing on Tablet PC (웹검색 트래픽 정보를 활용한 지능형 브랜드 포지셔닝 시스템 : 태블릿 PC 사례를 중심으로)

  • Jun, Seung-Pyo;Park, Do-Hyung
    • Journal of Intelligence and Information Systems
    • /
    • v.19 no.3
    • /
    • pp.93-111
    • /
    • 2013
  • As Internet and information technology (IT) continues to develop and evolve, the issue of big data has emerged at the foreground of scholarly and industrial attention. Big data is generally defined as data that exceed the range that can be collected, stored, managed and analyzed by existing conventional information systems and it also refers to the new technologies designed to effectively extract values from such data. With the widespread dissemination of IT systems, continual efforts have been made in various fields of industry such as R&D, manufacturing, and finance to collect and analyze immense quantities of data in order to extract meaningful information and to use this information to solve various problems. Since IT has converged with various industries in many aspects, digital data are now being generated at a remarkably accelerating rate while developments in state-of-the-art technology have led to continual enhancements in system performance. The types of big data that are currently receiving the most attention include information available within companies, such as information on consumer characteristics, information on purchase records, logistics information and log information indicating the usage of products and services by consumers, as well as information accumulated outside companies, such as information on the web search traffic of online users, social network information, and patent information. Among these various types of big data, web searches performed by online users constitute one of the most effective and important sources of information for marketing purposes because consumers search for information on the internet in order to make efficient and rational choices. Recently, Google has provided public access to its information on the web search traffic of online users through a service named Google Trends. Research that uses this web search traffic information to analyze the information search behavior of online users is now receiving much attention in academia and in fields of industry. Studies using web search traffic information can be broadly classified into two fields. The first field consists of empirical demonstrations that show how web search information can be used to forecast social phenomena, the purchasing power of consumers, the outcomes of political elections, etc. The other field focuses on using web search traffic information to observe consumer behavior, identifying the attributes of a product that consumers regard as important or tracking changes on consumers' expectations, for example, but relatively less research has been completed in this field. In particular, to the extent of our knowledge, hardly any studies related to brands have yet attempted to use web search traffic information to analyze the factors that influence consumers' purchasing activities. This study aims to demonstrate that consumers' web search traffic information can be used to derive the relations among brands and the relations between an individual brand and product attributes. When consumers input their search words on the web, they may use a single keyword for the search, but they also often input multiple keywords to seek related information (this is referred to as simultaneous searching). A consumer performs a simultaneous search either to simultaneously compare two product brands to obtain information on their similarities and differences, or to acquire more in-depth information about a specific attribute in a specific brand. Web search traffic information shows that the quantity of simultaneous searches using certain keywords increases when the relation is closer in the consumer's mind and it will be possible to derive the relations between each of the keywords by collecting this relational data and subjecting it to network analysis. Accordingly, this study proposes a method of analyzing how brands are positioned by consumers and what relationships exist between product attributes and an individual brand, using simultaneous search traffic information. It also presents case studies demonstrating the actual application of this method, with a focus on tablets, belonging to innovative product groups.

Impact of Shortly Acquired IPO Firms on ICT Industry Concentration (ICT 산업분야 신생기업의 IPO 이후 인수합병과 산업 집중도에 관한 연구)

  • Chang, YoungBong;Kwon, YoungOk
    • Journal of Intelligence and Information Systems
    • /
    • v.26 no.3
    • /
    • pp.51-69
    • /
    • 2020
  • Now, it is a stylized fact that a small number of technology firms such as Apple, Alphabet, Microsoft, Amazon, Facebook and a few others have become larger and dominant players in an industry. Coupled with the rise of these leading firms, we have also observed that a large number of young firms have become an acquisition target in their early IPO stages. This indeed results in a sharp decline in the number of new entries in public exchanges although a series of policy reforms have been promulgated to foster competition through an increase in new entries. Given the observed industry trend in recent decades, a number of studies have reported increased concentration in most developed countries. However, it is less understood as to what caused an increase in industry concentration. In this paper, we uncover the mechanisms by which industries have become concentrated over the last decades by tracing the changes in industry concentration associated with a firm's status change in its early IPO stages. To this end, we put emphasis on the case in which firms are acquired shortly after they went public. Especially, with the transition to digital-based economies, it is imperative for incumbent firms to adapt and keep pace with new ICT and related intelligent systems. For instance, after the acquisition of a young firm equipped with AI-based solutions, an incumbent firm may better respond to a change in customer taste and preference by integrating acquired AI solutions and analytics skills into multiple business processes. Accordingly, it is not unusual for young ICT firms become an attractive acquisition target. To examine the role of M&As involved with young firms in reshaping the level of industry concentration, we identify a firm's status in early post-IPO stages over the sample periods spanning from 1990 to 2016 as follows: i) being delisted, ii) being standalone firms and iii) being acquired. According to our analysis, firms that have conducted IPO since 2000s have been acquired by incumbent firms at a relatively quicker time than those that did IPO in previous generations. We also show a greater acquisition rate for IPO firms in the ICT sector compared with their counterparts in other sectors. Our results based on multinomial logit models suggest that a large number of IPO firms have been acquired in their early post-IPO lives despite their financial soundness. Specifically, we show that IPO firms are likely to be acquired rather than be delisted due to financial distress in early IPO stages when they are more profitable, more mature or less leveraged. For those IPO firms with venture capital backup have also become an acquisition target more frequently. As a larger number of firms are acquired shortly after their IPO, our results show increased concentration. While providing limited evidence on the impact of large incumbent firms in explaining the change in industry concentration, our results show that the large firms' effect on industry concentration are pronounced in the ICT sector. This result possibly captures the current trend that a few tech giants such as Alphabet, Apple and Facebook continue to increase their market share. In addition, compared with the acquisitions of non-ICT firms, the concentration impact of IPO firms in early stages becomes larger when ICT firms are acquired as a target. Our study makes new contributions. To our best knowledge, this is one of a few studies that link a firm's post-IPO status to associated changes in industry concentration. Although some studies have addressed concentration issues, their primary focus was on market power or proprietary software. Contrast to earlier studies, we are able to uncover the mechanism by which industries have become concentrated by placing emphasis on M&As involving young IPO firms. Interestingly, the concentration impact of IPO firm acquisitions are magnified when a large incumbent firms are involved as an acquirer. This leads us to infer the underlying reasons as to why industries have become more concentrated with a favor of large firms in recent decades. Overall, our study sheds new light on the literature by providing a plausible explanation as to why industries have become concentrated.

Analysis on Factors Influencing Welfare Spending of Local Authority : Implementing the Detailed Data Extracted from the Social Security Information System (지방자치단체 자체 복지사업 지출 영향요인 분석 : 사회보장정보시스템을 통한 접근)

  • Kim, Kyoung-June;Ham, Young-Jin;Lee, Ki-Dong
    • Journal of Intelligence and Information Systems
    • /
    • v.19 no.2
    • /
    • pp.141-156
    • /
    • 2013
  • Researchers in welfare services of local government in Korea have rather been on isolated issues as disables, childcare, aging phenomenon, etc. (Kang, 2004; Jung et al., 2009). Lately, local officials, yet, realize that they need more comprehensive welfare services for all residents, not just for above-mentioned focused groups. Still cases dealt with focused group approach have been a main research stream due to various reason(Jung et al., 2009; Lee, 2009; Jang, 2011). Social Security Information System is an information system that comprehensively manages 292 welfare benefits provided by 17 ministries and 40 thousand welfare services provided by 230 local authorities in Korea. The purpose of the system is to improve efficiency of social welfare delivery process. The study of local government expenditure has been on the rise over the last few decades after the restarting the local autonomy, but these studies have limitations on data collection. Measurement of a local government's welfare efforts(spending) has been primarily on expenditures or budget for an individual, set aside for welfare. This practice of using monetary value for an individual as a "proxy value" for welfare effort(spending) is based on the assumption that expenditure is directly linked to welfare efforts(Lee et al., 2007). This expenditure/budget approach commonly uses total welfare amount or percentage figure as dependent variables (Wildavsky, 1985; Lee et al., 2007; Kang, 2000). However, current practice of using actual amount being used or percentage figure as a dependent variable may have some limitation; since budget or expenditure is greatly influenced by the total budget of a local government, relying on such monetary value may create inflate or deflate the true "welfare effort" (Jang, 2012). In addition, government budget usually contain a large amount of administrative cost, i.e., salary, for local officials, which is highly unrelated to the actual welfare expenditure (Jang, 2011). This paper used local government welfare service data from the detailed data sets linked to the Social Security Information System. The purpose of this paper is to analyze the factors that affect social welfare spending of 230 local authorities in 2012. The paper applied multiple regression based model to analyze the pooled financial data from the system. Based on the regression analysis, the following factors affecting self-funded welfare spending were identified. In our research model, we use the welfare budget/total budget(%) of a local government as a true measurement for a local government's welfare effort(spending). Doing so, we exclude central government subsidies or support being used for local welfare service. It is because central government welfare support does not truly reflect the welfare efforts(spending) of a local. The dependent variable of this paper is the volume of the welfare spending and the independent variables of the model are comprised of three categories, in terms of socio-demographic perspectives, the local economy and the financial capacity of local government. This paper categorized local authorities into 3 groups, districts, and cities and suburb areas. The model used a dummy variable as the control variable (local political factor). This paper demonstrated that the volume of the welfare spending for the welfare services is commonly influenced by the ratio of welfare budget to total local budget, the population of infants, self-reliance ratio and the level of unemployment factor. Interestingly, the influential factors are different by the size of local government. Analysis of determinants of local government self-welfare spending, we found a significant effect of local Gov. Finance characteristic in degree of the local government's financial independence, financial independence rate, rate of social welfare budget, and regional economic in opening-to-application ratio, and sociology of population in rate of infants. The result means that local authorities should have differentiated welfare strategies according to their conditions and circumstances. There is a meaning that this paper has successfully proven the significant factors influencing welfare spending of local government in Korea.

Development and Validation of the Analytical Method for Oxytetracycline in Agricultural Products using QuEChERS and LC-MS/MS (QuEChERS법 및 LC-MS/MS를 이용한 농산물 중 Oxytetracycline의 잔류시험법 개발 및 검증)

  • Cho, Sung Min;Do, Jung-Ah;Lee, Han Sol;Park, Ji-Su;Shin, Hye-Sun;Jang, Dong Eun;Cho, Myong-Shik;Jung, ong-hyun;Lee, Kangbong
    • Journal of Food Hygiene and Safety
    • /
    • v.34 no.3
    • /
    • pp.227-234
    • /
    • 2019
  • An analytical method was developed for the determination of oxytetracycline in agricultural products using the QuEChERS (Quick, Easy, Cheap, Effective, Rugged and Safe) method by liquid chromatography-tandem mass spectrometry (LC-MS/MS). After the samples were extracted with methanol, the extracts were adjusted to pH 4 by formic acid and sodium chloride was added to remove water. Dispersive solid phase extraction (d-SPE) cleanup was carried out using $MgSO_4$ (anhydrous magnesium sulfate), PSA (primary secondary amine), $C_{18}$ (octadecyl) and GCB (graphitized carbon black). The analytes were quantified and confirmed with LC-MS/MS using ESI (electrospray ionization) in positive ion MRM (multiple reaction monitoring) mode. The matrix-matched calibration curves were constructed using six levels ($0.001{\sim}0.25{\mu}g/mL$) and coefficient of determination ($r^2$) was above 0.99. Recovery results at three concentrations (LOQ, $10{\times}LOQ$, and $50{\times}LOQ$, n=5) were from 80.0 to 108.2% with relative standard deviations (RSDs) less than of 11.4%. For inter-laboratory validation, the average recovery was in the range of 83.5~103.2% and the coefficient of variation (CV) was below 14.1%. All results satisfied the criteria ranges requested in the Codex guidelines (CAC/GL 40-1993, 2003) and the Food Safety Evaluation Department guidelines (2016). The proposed analytical method was accurate, effective and sensitive for oxytetracycline determination in agricultural commodities. This study could be useful for safety management of oxytetracycline residues in agricultural products.

The Effect of AD Noises Caused by AD Model Selection on Brand Awareness and Brand Attitudes (광고 모델 관련 광고 노이즈가 브랜드 인지도와 브랜드 태도에 미치는 영향)

  • Chung, Jai-Hak;Lee, Sang-Mi
    • Journal of Global Scholars of Marketing Science
    • /
    • v.18 no.3
    • /
    • pp.89-114
    • /
    • 2008
  • Most of the extant studies on communication effects have been devoted to the typical issue, "what types of communication activities are more effective for brand awareness or brand attitudes?" However, little research has addressed another question on communication decisions, "what makes communication activities less effective?" Our study focuses on factors negatively influenced on the efficiency of communication activities, especially of Advertising. Some studies have introduced concepts closely related to our topic such as consumer confusion, brand confusion, or belief confusion. Studies on product belief confusion have found some factors misleading consumers to misunderstand the physical features of products. Studies on brand confusion have uncovered factors making consumers confused on brand names. Studies on advertising confusion have tested the effects of ad models' employed by many other firms for different products on communication efficiency. We address a new concept, Ad noises, which are any factors interfering with consumers exposed to a particular advertisement in understanding messages provided by advertisements. The objective of this study is to understand the effects of ad noises caused by ad models on brand awareness and brand attitude. There are many different types of AD noises. Particularly, we study the effects of AD noises generated from ad model selection decision. Many companies want to employ celebrities as AD models while the number of celebrities who command a high degree of public and media attention are limited. Inevitably, several firms have been adopting the same celebrities as their AD models for different products. If the same AD model is adopted for TV commercials for different products, consumers exposed to those TV commercials are likely to fail to be aware of the target brand due to interference of TV commercials, for other products, employing the same AD model. This is an ad noise caused by employing ad models who have been exposed to consumers in other advertisements, which is the first type of ad noises studied in this research. Another type of AD noises is related to the decision of AD model replacement for the same product advertising. Firms sometimes launch another TV commercial for the same products. Some firms employ the same AD model for the new TV commercial for the same product and other firms employ new AD models for the new TV commercials for the same product. The typical problem with the replacement of AD models is the possibility of interfering with consumers in understanding messages of the TV commercial due to the dissimilarity of the old and new AD models. We studied the effects of these two types of ad noises, which are the typical factors influencing on the effect of communication: (1) ad noises caused by employing ad models who have been exposed to consumers in other advertisements and (2) ad noises caused by changing ad models with different images for same products. First, we measure the negative influence of AD noises on brand awareness and attitudes, in order to provide the importance of studying AD noises. Furthermore, our study unveiled the mediating conditions(variables) which can increase or decrease the effects of ad noises on brand awareness and attitudes. We study the effects of three mediating variables for ad noises caused by employing ad models who have been exposed to consumers in other advertisements: (1) the fit between product image and AD model image, (2) similarity between AD model images in multiple TV commercials employing the same AD model, and (3) similarity between products of which TV commercial employed the same AD model. We analyze the effects of another three mediating variables for ad noises caused by changing ad models with different images for same products: (1) the fit of old and new AD models for the same product, (2) similarity between AD model images in old and new TV commercials for the same product, and (3) concept similarity between old and new TV commercials for the same product. We summarized the empirical results from a field survey as follows. The employment of ad models who have been used in advertisements for other products has negative effects on both brand awareness and attitudes. our empirical study shows that it is possible to reduce the negative effects of ad models used for other products by choosing ad models whose images are relevant to the images of target products for the advertisement, by requiring ad models of images which are different from those of ad models in other advertisements, or by choosing ad models who have been shown in advertisements for other products which are not similar to the target product. The change of ad models for the same product advertisement can positively influence on brand awareness but positively on brand attitudes. Furthermore, the effects of ad model change can be weakened or strengthened depending on the relevancy of new ad models, the similarity of previous and current ad models, and the consistency of the previous and current ad messages.

  • PDF

The study for the roles of intratracheally administered histamine in the neutrophil-mediated acute lung injury in rats: (호중구를 매개하는 백서의 급성 폐손상의 병리가전에 있어 기도내로 투여한 히스타민의 역활에 관하여)

  • Koh, Youn-Suck;Hybertson, Brooks M.;Jepson, Eric K.;Kim, Mi-Jung;Lee, In-Chul;Lim, Chae-Man;Lee, Sang-Do;Kim, Dong-Soon;Kim, Won-Dong;Repine, John E.
    • Tuberculosis and Respiratory Diseases
    • /
    • v.43 no.3
    • /
    • pp.308-322
    • /
    • 1996
  • Background : Neutrophils are considered to play critical roles in the development of acute respiratory distress syndrome. Histamine, which is distributed abundantly in lung tissue, increases the rolling of neutrophills via increase of P-selectin expression on the surface of endothelial cells and is known to have some interrelationships with IL-1, IL-8 and TNF-$\alpha$. We studied to investigate the effect of the histamine on the acute lung injury of the rats induced by intratracheal insufflation of TNF-$\alpha$ which has less potency to cause lung injury compared to IL-1 in rats. Methods : We intratracheally instilled saline or TNF(R&D, 500ng), IL-1(R&D, 50ng)or histamine of varius dose(1.1, 11 and $55\;{\mu}g/kg$) with and without TNF separately in Sprague-Dawley rats weighing 270-370 grams. We also intratracheally treated IL-1(50ng) along with histamine($55\;{\mu}g/kg$). In cases, there were synergistic effects induced by histamine on the parameters of TNF-induced acute lung injury, antihistamines(Sigma, mepyramine as a $H_1$ receptor blockade and ranitidine as a $H_2$ receptor blockade, 10 mg/kg in each)were co-administered intravenously to the rats treated TNF along with histamine($1.1\;{\mu}g/kg$) intratraeheally. Then after 5 h we measured lung lavage neutrophil numbers, lavage cytokine-induced neutrophil chemoattractants(CINC), lung myeloperoxidase activity(MPO) and lung leak. We also intratracheally insufflated TNF with/without histamine($11\;{\mu}g/kg$), then after 24 h measured lung leak in rats. Statistical analyses were done by Kruskal-Wallis nonparametric ANOVA test with Dunn's multiple comparison test or by Mann-Whitney U test. Results : We found that rats given TNF, histamine alone(11 and $55\;{\mu}g/kg$), and TNF with histamine(l.1, 11, and $55\;{\mu}g/kg$) intratracheally had increased (p<0.05) lung MPO activity compared with saline-treated control rats. TNF with histamine $11\;{\mu}g/kg$ had increased MPO activity (P=0.0251) compared with TNF-treated rats. TNF and TNF with histamine(1.1, 11, and $55\;{\mu}g/kg$) intratracheally had all increased (p<0.05) lung leak, lavage neutophil numbers and lavage CINC activities compared with saline. TNF with histamine $1.1\;{\mu}g/kg$ had increased (p=0.0367) lavage neutrophil numbers compared with TNF treated rats. But there were no additive effect of histamine with TNF compared with TNF alone in acute lung leak on 5 h and 24 h in rats. Treatment of rats with the $H_1$ and $H_2$ antagonists resulted in inhibitions of lavage neutrophil accumulations and lavage CINC activity elevations elicited by co-treated histamine in TNF-induced acute lung injury intratracheally in rats. We also found that rats given IL-1 along with histamine intratracheally did not have increase in lung leak compared with IL-1 treated rats. Conclusion : Histamine administered intratracheally did not have synergistic effects on TNF-induced acute lung leak inspite of additive effects on increase in MPO activity and lavage neutrophil numbers in rats. These observations suggest that instilling histamine intratracheally would not play synergistic roles in neutrophil-mediated acute lung injury in rats.

  • PDF

Development of Yóukè Mining System with Yóukè's Travel Demand and Insight Based on Web Search Traffic Information (웹검색 트래픽 정보를 활용한 유커 인바운드 여행 수요 예측 모형 및 유커마이닝 시스템 개발)

  • Choi, Youji;Park, Do-Hyung
    • Journal of Intelligence and Information Systems
    • /
    • v.23 no.3
    • /
    • pp.155-175
    • /
    • 2017
  • As social data become into the spotlight, mainstream web search engines provide data indicate how many people searched specific keyword: Web Search Traffic data. Web search traffic information is collection of each crowd that search for specific keyword. In a various area, web search traffic can be used as one of useful variables that represent the attention of common users on specific interests. A lot of studies uses web search traffic data to nowcast or forecast social phenomenon such as epidemic prediction, consumer pattern analysis, product life cycle, financial invest modeling and so on. Also web search traffic data have begun to be applied to predict tourist inbound. Proper demand prediction is needed because tourism is high value-added industry as increasing employment and foreign exchange. Among those tourists, especially Chinese tourists: Youke is continuously growing nowadays, Youke has been largest tourist inbound of Korea tourism for many years and tourism profits per one Youke as well. It is important that research into proper demand prediction approaches of Youke in both public and private sector. Accurate tourism demands prediction is important to efficient decision making in a limited resource. This study suggests improved model that reflects latest issue of society by presented the attention from group of individual. Trip abroad is generally high-involvement activity so that potential tourists likely deep into searching for information about their own trip. Web search traffic data presents tourists' attention in the process of preparation their journey instantaneous and dynamic way. So that this study attempted select key words that potential Chinese tourists likely searched out internet. Baidu-Chinese biggest web search engine that share over 80%- provides users with accessing to web search traffic data. Qualitative interview with potential tourists helps us to understand the information search behavior before a trip and identify the keywords for this study. Selected key words of web search traffic are categorized by how much directly related to "Korean Tourism" in a three levels. Classifying categories helps to find out which keyword can explain Youke inbound demands from close one to far one as distance of category. Web search traffic data of each key words gathered by web crawler developed to crawling web search data onto Baidu Index. Using automatically gathered variable data, linear model is designed by multiple regression analysis for suitable for operational application of decision and policy making because of easiness to explanation about variables' effective relationship. After regression linear models have composed, comparing with model composed traditional variables and model additional input web search traffic data variables to traditional model has conducted by significance and R squared. after comparing performance of models, final model is composed. Final regression model has improved explanation and advantage of real-time immediacy and convenience than traditional model. Furthermore, this study demonstrates system intuitively visualized to general use -Youke Mining solution has several functions of tourist decision making including embed final regression model. Youke Mining solution has algorithm based on data science and well-designed simple interface. In the end this research suggests three significant meanings on theoretical, practical and political aspects. Theoretically, Youke Mining system and the model in this research are the first step on the Youke inbound prediction using interactive and instant variable: web search traffic information represents tourists' attention while prepare their trip. Baidu web search traffic data has more than 80% of web search engine market. Practically, Baidu data could represent attention of the potential tourists who prepare their own tour as real-time. Finally, in political way, designed Chinese tourist demands prediction model based on web search traffic can be used to tourism decision making for efficient managing of resource and optimizing opportunity for successful policy.