Clickstream Big Data Mining for Demographics based Digital Marketing (인구통계특성 기반 디지털 마케팅을 위한 클릭스트림 빅데이터 마이닝)
Journal of Intelligence and Information Systems, v.22 no.3, pp.143-163, 2016
The demographics of Internet users are the most basic and important source for target marketing and personalized advertising on digital marketing channels such as email, mobile, and social media. However, it has gradually become difficult to collect the demographics of Internet users because their activities are anonymous in many cases. Although a marketing department can obtain demographics through online or offline surveys, these approaches are expensive, slow, and likely to include false statements. Clickstream data is the record an Internet user leaves behind while visiting websites. As the user clicks anywhere on a webpage, the activity is logged in semi-structured website log files. Such data shows which pages users visited, how long they stayed, how often and when they usually visited, which sites they prefer, what keywords they used to find a site, whether they purchased anything, and so forth. For this reason, some researchers have tried to infer the demographics of Internet users from their clickstream data. They derived various independent variables likely to be correlated with the demographics. The variables include search keywords, frequency and intensity by time, day, and month, the variety of websites visited, text information from the web pages visited, and so on. The demographic attributes to predict also vary from paper to paper and cover gender, age, job, location, income, education, marital status, and presence of children. A variety of data mining methods, such as LSA, SVM, decision tree, neural network, logistic regression, and k-nearest neighbors, were used to build prediction models. However, this research has not yet identified which data mining method is appropriate for predicting each demographic variable. Moreover, the independent variables studied so far need to be reviewed, combined as needed, and evaluated in order to build the best prediction model.
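The kinds of clickstream variables described above (visit frequency, time-of-day intensity, variety of sites) can be illustrated with a small sketch. The log layout, the feature names, and the sample records below are assumptions made for illustration; they are not the attributes or the data format used in the study.

```python
from collections import Counter, defaultdict
from datetime import datetime

# Hypothetical log records: (user_id, timestamp, site, dwell_seconds).
# This field layout is an assumption, not the format used in the study.
LOG = [
    ("u1", "2016-03-01 09:15:00", "news.example", 120),
    ("u1", "2016-03-01 21:40:00", "shop.example", 300),
    ("u1", "2016-03-05 22:05:00", "shop.example", 60),
    ("u2", "2016-03-02 13:00:00", "game.example", 900),
]


def build_profiles(log):
    """Aggregate raw click records into one feature dictionary per user."""
    by_user = defaultdict(list)
    for user, ts, site, dwell in log:
        when = datetime.strptime(ts, "%Y-%m-%d %H:%M:%S")
        by_user[user].append((when, site, dwell))

    profiles = {}
    for user, events in by_user.items():
        n = len(events)
        sites = Counter(site for _, site, _ in events)
        profiles[user] = {
            "visits": n,                                    # how often
            "total_dwell": sum(d for _, _, d in events),    # how long
            "site_variety": len(sites),                     # how many distinct sites
            # share of clicks at 18:00 or later
            "evening_share": sum(1 for t, _, _ in events if t.hour >= 18) / n,
            # share of clicks on Saturday or Sunday
            "weekend_share": sum(1 for t, _, _ in events if t.weekday() >= 5) / n,
            "top_site": sites.most_common(1)[0][0],         # preferred site
        }
    return profiles


profiles = build_profiles(LOG)
```

Each resulting dictionary plays the role of one user profile row; a real profile would add the demographic labels to be predicted.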
The objective of this study is to choose the clickstream attributes most likely to be correlated with the demographics from the results of previous research, and then to identify which data mining method is best suited to predicting each demographic attribute. Among the demographic attributes, this paper focuses on predicting gender, age, marital status, residence, and job. From the results of previous research, 64 clickstream attributes are applied to predict the demographic attributes. The overall process of predictive model building is composed of four steps. In the first step, we create user profiles that include 64 clickstream attributes and 5 demographic attributes. The second step performs dimension reduction of the clickstream variables to address the curse of dimensionality and the overfitting problem. We utilize three approaches, based on decision tree, PCA, and cluster analysis. In the third step, we build alternative predictive models for each demographic variable, using SVM, neural network, and logistic regression. The last step evaluates the alternative models in terms of model accuracy and selects the best model. For the experiments, we used clickstream data covering 5 demographics and 16,962,705 online activities for 5,000 Internet users. IBM SPSS Modeler 17.0 was used for our prediction process, and 5-fold cross validation was conducted to enhance the reliability of our experiments. The experimental results verify that there is a specific data mining method well suited to each demographic variable. For example, age prediction performs best with decision tree based dimension reduction and a neural network, whereas the prediction of gender and marital status is most accurate when SVM is applied without dimension reduction.
We conclude that the online behaviors of Internet users, captured through clickstream data analysis, can be used effectively to predict their demographics and can thereby be utilized in digital marketing.
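The evaluation step of the process (comparing alternative models by accuracy under 5-fold cross validation and keeping the best one) can be sketched as follows. The study used IBM SPSS Modeler with SVM, neural network, and logistic regression; this sketch swaps in two deliberately simple stand-in classifiers (a majority-class baseline and a nearest-centroid rule) and synthetic two-feature data, all of which are assumptions made so the example runs with the Python standard library alone.

```python
import random


def k_fold_indices(n, k, seed=0):
    """Shuffle 0..n-1 and deal the indices into k folds of near-equal size."""
    idx = list(range(n))
    random.Random(seed).shuffle(idx)
    return [idx[i::k] for i in range(k)]


def majority_fit(X, y):
    """Baseline: always predict the most frequent training label."""
    label = max(set(y), key=y.count)
    return lambda x: label


def centroid_fit(X, y):
    """Nearest-centroid classifier, a simple stand-in for SVM or a neural network."""
    sums, counts = {}, {}
    for x, label in zip(X, y):
        s = sums.setdefault(label, [0.0] * len(x))
        for j, v in enumerate(x):
            s[j] += v
        counts[label] = counts.get(label, 0) + 1
    cents = {c: [v / counts[c] for v in s] for c, s in sums.items()}

    def predict(x):
        return min(cents, key=lambda c: sum((a - b) ** 2 for a, b in zip(x, cents[c])))

    return predict


def cross_val_accuracy(fit, X, y, k=5, seed=0):
    """Mean held-out accuracy over k folds."""
    scores = []
    for test in k_fold_indices(len(X), k, seed):
        held = set(test)
        train = [i for i in range(len(X)) if i not in held]
        predict = fit([X[i] for i in train], [y[i] for i in train])
        scores.append(sum(predict(X[i]) == y[i] for i in test) / len(test))
    return sum(scores) / k


# Synthetic profiles: two numeric features per user, two hypothetical classes.
rng = random.Random(1)
X = [[rng.gauss(0, 1), rng.gauss(0, 1)] for _ in range(50)]
X += [[rng.gauss(6, 1), rng.gauss(6, 1)] for _ in range(50)]
y = ["F"] * 50 + ["M"] * 50

results = {
    "majority": cross_val_accuracy(majority_fit, X, y),
    "centroid": cross_val_accuracy(centroid_fit, X, y),
}
best = max(results, key=results.get)  # the model with the highest mean accuracy
```

Repeating the comparison once per demographic attribute, and once per dimension reduction variant, gives the kind of attribute-by-method table from which a best method per attribute can be read off.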
As the distribution environment changes rapidly and competition in distribution channels intensifies, retailer image and retailer equity are becoming increasingly important as sources of differentiated competitive advantage. Also, consumers are not functionally oriented; their behavior is significantly affected by symbols, such as the retailer image, that identify a retailer in the marketplace. That is, consumers do not choose products or retailers for their material utility but consume the symbolic meaning of those products or retailers as expressed in their self-images. The concept of self-image congruence has been used by marketers and researchers to better understand how consumers identify themselves with the brands they buy and the retailers they patronize. Although self-image congruity theory has been tested across many product categories, it has not been tested extensively in retailing. Therefore, this study investigates the impact of congruence between retailer image and consumer self-image on the dimensions of retailer equity: retailer awareness, retailer association, perceived retailer quality, and retailer loyalty. The purpose of this study is to find out whether retailer-self image congruence can be a new antecedent of retailer equity. In addition, this study examines how the four retailer equity constructs (retailer awareness, retailer association, perceived retailer quality, and retailer loyalty) affect customers' repatronage intention. Data were gathered by survey and analyzed by structural equation modeling. The sample size was 254. The reliability of all seven dimensions was estimated with Cronbach's alpha, composite reliability, and average variance extracted. We assessed whether the measurement model supports convergent and discriminant validity using exploratory factor analysis and confirmatory factor analysis.
For each pair of constructs, the square root of the average variance extracted exceeded their correlation, thus supporting the discriminant validity of the constructs. Hypotheses were tested using AMOS 18.0. As expected, the image congruence hypotheses were supported. The greater the degree of congruence between retailer image and self-image, the more favorable were consumers' retailer evaluations. Both types of retailer-self image congruence (actual self-image congruence and ideal self-image congruence) affected customer-based retailer equity. This result means that retailer-self image congruence is an important cue for customers in estimating retailer equity. In other words, consumers are often more likely to prefer products and retail stores whose images are similar to their own self-image. In particular, the effect of ideal self-image congruence on retailer equity was consistently larger than that of actual self-image congruence. This means that consumers prefer or search for stores whose images are compatible with their perception of the ideal self. In addition, this study revealed that customers' evaluations of customer-based retailer equity affected repatronage intention. The results showed that all four dimensions (retailer awareness, retailer association, perceived retailer quality, and retailer loyalty) had a positive effect on repatronage intention. That is, management and investment aimed at improving the image congruence between the retailer and consumers' self-image lead customers to evaluate retailer equity positively, and positive customer-based retailer equity in turn enhances repatronage intention. In conclusion, a retailer's image management is an important part of successful retailer performance management, and retailer-self image congruence is an important antecedent of retailer equity. Therefore, it is important to develop and improve a retailer image that is similar to consumers' self-image.
Given the pressure to provide increased image congruence, it is not surprising that retailers have made significant investments in enhancing the fit between retailer image and consumer self-image. Enhancing such self-image congruence may allow marketers to target customers who may be influenced by image appeals in advertising.
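The reliability and discriminant validity checks mentioned in this abstract can be sketched in a few lines. The functions below compute Cronbach's alpha from item scores and apply the square-root-of-AVE comparison (commonly called the Fornell-Larcker criterion). The item scores, factor loadings, construct names, and correlation value are hypothetical numbers chosen for illustration, not values from the study.

```python
from math import sqrt
from statistics import variance


def cronbach_alpha(items):
    """items: list of item-score columns, one score per respondent in each column."""
    k = len(items)
    totals = [sum(scores) for scores in zip(*items)]  # per-respondent scale total
    item_var = sum(variance(col) for col in items)    # sum of sample item variances
    return k / (k - 1) * (1 - item_var / variance(totals))


def ave(loadings):
    """Average variance extracted from standardized factor loadings."""
    return sum(l * l for l in loadings) / len(loadings)


def discriminant_ok(ave_by_construct, correlations):
    """True if sqrt(AVE) of both constructs exceeds their correlation, for every pair."""
    return all(
        sqrt(ave_by_construct[a]) > abs(r) and sqrt(ave_by_construct[b]) > abs(r)
        for (a, b), r in correlations.items()
    )


# Hypothetical 5-point responses from five respondents to three items of one scale.
scale_items = [
    [4, 5, 3, 4, 2],
    [4, 4, 3, 5, 2],
    [5, 5, 2, 4, 3],
]
alpha = cronbach_alpha(scale_items)

# Hypothetical standardized loadings for two constructs and their correlation.
aves = {
    "ideal_congruence": ave([0.8, 0.7, 0.9]),
    "retailer_loyalty": ave([0.75, 0.85, 0.8]),
}
valid = discriminant_ok(aves, {("ideal_congruence", "retailer_loyalty"): 0.55})
```

In practice these statistics come out of the SEM software; the sketch only makes the arithmetic behind the reported checks explicit.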
This study attempts to examine the influence that negative WOM (NWOM) has in an online context. It specifically focuses on the impact of the service failure description and the perceived intention of the communication provider on consumer evaluations of firm competence, attitude toward the firm, positive word of mouth, and behavioral intentions. Studies of communication persuasiveness focus on "who says what; to whom; in which channel; with what effect" (Chiu 2007). In this study, we examine electronic web postings, focusing particularly on two aspects of "what": the level of service failure communicated and the perceived intention of the individual posting. It stands to reason that electronic NWOM that appears to be trying to damage a product's or firm's reputation will be viewed as more biased and will thus be considered less credible. According to attribution theory, people search for the causes of events, especially those that are negative and unexpected (Weiner 2006). Hennig-Thurau and Walsh (2003) state that "since the reader has only limited knowledge and trust of the author of an online articulation the quality of the contribution could be expected to serve as a potent moderator of the articulation-behavior relationship." We therefore posit the following hypotheses: H1. Subjects exposed to electronic NWOM describing a high level of service failure will provide lower scores on measures of (a) firm competence, (b) attitude toward the firm, (c) positive word of mouth, and (d) behavioral intention than will subjects exposed to electronic NWOM describing a low level of service failure. H2. Subjects exposed to electronic NWOM with a warning intent will provide lower scores on measures of (a) firm competence, (b) attitude toward the firm, (c) positive word of mouth, and (d) behavioral intention than will subjects exposed to electronic NWOM with a vengeful intent. H3.
Level of service failure in electronic NWOM will interact with the perceived intention of the electronic NWOM, such that there will be a decrease in mean response on measures of (a) firm competence, (b) attitude toward the firm, (c) positive word of mouth, and (d) behavioral intention from electronic NWOM with a warning intent to a vengeful intent. The main study involved a 2 (service failure severity) x 2 (NWOM with warning versus vengeful intent) factorial experiment. Stimuli were presented to subjects online using a mock online web posting. The scenario described a service failure associated with non-acceptance of a gift card in a brick-and-mortar retail establishment. A national sample was recruited through an online research firm. A total of 113 subjects participated in the study, and a total of 104 surveys were analyzed. The scenario was perceived to be realistic, with 92.3% giving the scenario a greater than average response. Manipulations were satisfactory. Measures were pre-tested and validated. Items were analyzed and found reliable and valid. MANOVA results found that the multivariate interaction was not significant, allowing our interpretation to proceed to the main effects. Significant main effects were found for post intent and service failure severity. The post intent main effect was attributable to attitude toward the firm, positive word of mouth, and behavioral intention. The service failure severity main effect was attributable to all four dependent variables: firm competence, attitude toward the firm, positive word of mouth, and behavioral intention. Specifically, firm competence for electronic NWOM describing high severity of service failure was lower than for electronic NWOM describing low severity of service failure. Attitude toward the firm for electronic NWOM describing high severity of service failure was lower than for electronic NWOM describing low severity of service failure.
Positive word of mouth for electronic NWOM describing high severity of service failure was lower than for electronic NWOM describing low severity of service failure. Behavioral intention for electronic NWOM describing high severity of service failure was lower than for electronic NWOM describing low severity of service failure. Therefore, H1a, H1b, H1c, and H1d were all supported. In addition, attitude toward the firm for electronic NWOM with a warning intent was lower than for electronic NWOM with a vengeful intent. Positive word of mouth for electronic NWOM with a warning intent was lower than for electronic NWOM with a vengeful intent. Behavioral intention for electronic NWOM with a warning intent was lower than for electronic NWOM with a vengeful intent. Thus, H2b, H2c, and H2d were supported. However, H2a was not supported, though results were in the hypothesized direction. Furthermore, there was no significant multivariate service failure severity by post intent interaction, nor was there a significant univariate service failure severity by post intent interaction for any of the three hypothesized variables. Thus, H3 was not supported for any of the four hypothesized variables. This study has research and managerial implications. The findings of this study support prior research showing that service failure severity impacts consumer perceptions, attitude, positive word of mouth, and behavioral intentions (Weun et al. 2004). Of further relevance, this response is evidenced in the online context, suggesting the need for firms to engage in serious, focused service recovery efforts. With respect to the perceived intention of electronic NWOM, the findings support prior research suggesting that readers' attributions of the intentions of a source influence the strength of its impact on perceptions, attitude, positive word of mouth, and behavioral intentions.
The implication for managers is that, while consumers do find online communications credible and influential, not all communications are weighted the same. A benefit of electronic WOM, even when it may be potentially damaging, is that it can be monitored for potential problems and additionally offers the possibility of redress.
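The logic of reading a 2 x 2 factorial design (two main effects and one interaction) can be made concrete with cell means. The scores below are hypothetical and were chosen only to mimic the reported pattern of two main effects with no interaction; the sketch computes descriptive contrasts and is not a substitute for the MANOVA significance tests used in the study.

```python
from statistics import mean

# Hypothetical attitude-toward-the-firm scores (7-point scale) per cell.
# Keys are (service failure severity, perceived intent). Not data from the study.
cells = {
    ("low", "vengeful"): [5.5, 6.0, 5.0],
    ("low", "warning"): [4.5, 5.0, 4.0],
    ("high", "vengeful"): [4.0, 4.5, 3.5],
    ("high", "warning"): [3.0, 3.5, 2.5],
}
cell_mean = {key: mean(scores) for key, scores in cells.items()}


def marginal(level, position):
    """Mean of the cell means that share one factor level (position 0 or 1 in the key)."""
    return mean(m for key, m in cell_mean.items() if key[position] == level)


# Main effect of severity: high-severity mean minus low-severity mean.
severity_effect = marginal("high", 0) - marginal("low", 0)

# Main effect of intent: warning mean minus vengeful mean.
intent_effect = marginal("warning", 1) - marginal("vengeful", 1)

# Interaction: does the intent difference change between severity levels?
interaction = (
    (cell_mean[("high", "warning")] - cell_mean[("high", "vengeful")])
    - (cell_mean[("low", "warning")] - cell_mean[("low", "vengeful")])
)
```

With these numbers both main-effect contrasts are negative and the interaction contrast is zero, which is the shape of result that lets interpretation proceed directly to the main effects.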
Consumers differ in the way they make a purchase. An audiophile would willingly make a bold, yet serious, decision to buy a top-of-the-line home theater system, while having no interest in replacing his two-decade-old shabby car. Conversely, an automobile enthusiast wouldn't mind spending forty thousand dollars on a new Jaguar convertible, yet cares little about his junky component system. It is product involvement that helps explain such differences in purchase style among individuals. Product involvement refers to the extent to which a product is perceived to be important to a consumer (Zaichkowsky, 2001). Product involvement is an important factor that strongly influences the consumer's purchase decision-making process, and thus has been of prime interest to consumer behavior researchers. Furthermore, researchers have found that involvement is closely related to perceived risk (Dholakia, 2001). While abundant research addresses how product involvement relates to overall perceived risk, little attention has been paid to the relationship between involvement and different types of perceived risk in an electronic commerce setting. Given that perceived risk can be a substantial barrier to online purchase (Jarvenpaa, 2000), research addressing this issue will offer useful implications about which specific types of perceived risk an online firm should focus on mitigating if it is to increase sales to their fullest potential. Meanwhile, past research has focused on consumer responses such as information search and dissemination as consequences of involvement, neglecting other behavioral responses such as online merchant selection. For example, will a consumer seriously considering the purchase of a pricey Guzzi bag perceive a great degree of risk associated with online buying and therefore choose to buy it from a digital storefront rather than from an online marketplace to mitigate risk?
Will a consumer require greater trust on the part of the online merchant when the perceived risk of online buying is rather high? We intend to find answers to these research questions through an empirical study. This paper explores the impact of enduring product involvement and perceived risks on required trust level, and further on online merchant choice. For the purpose of the research, five types or components of perceived risk are taken into consideration, including financial, performance, delivery, psychological, and social risks. A research model has been built around the constructs under consideration, and 12 hypotheses have been developed based on the research model to examine the relationships between enduring involvement and five components of perceived risk, between five components of perceived risk and required trust level, between enduring involvement and required trust level, and finally between required trust level and preference toward an e-tailer. To attain our research objectives, we conducted an empirical analysis consisting of two phases of data collection: a pilot test and main survey. The pilot test was conducted using 25 college students to ensure that the questionnaire items are clear and straightforward. Then the main survey was conducted using 295 college students at a major university for nine days between December 13, 2010 and December 21, 2010. The measures employed to test the model included eight constructs: (1) enduring involvement, (2) financial risk, (3) performance risk, (4) delivery risk, (5) psychological risk, (6) social risk, (7) required trust level, (8) preference toward an e-tailer. The statistical package, SPSS 17.0, was used to test the internal consistency among the items within the individual measures. Based on the Cronbach's
The emergence of Internet technology and SNS has increased the flow of information and changed the way people communicate, from one-way to two-way communication. Users not only consume information; they can also create it and share it with their friends across social network services. This has also made social media, which includes Social TV, one of the most important communication tools. Social TV is a form in which people can watch a TV program and at the same time share information about it or its content with friends through social media. Social News, also known as participatory social media, is becoming popular. It influences user interest in social issues through the Internet and builds news credibility based on users' reputations. However, conventional news service platforms focus only on the news recommendation domain. Recent developments in SNS have changed this landscape by allowing users to share and disseminate news. Conventional platforms do not provide any special way for news to be shared. Currently, Social News Services only allow users to access a news item in its entirety; users cannot access just the part of the content that relates to their interest. For example, when a user is interested in only part of a news item and wants to share that part, it is still hard to do so. In the worst case, users might understand the news in a different context. To solve this, a Social News Service must provide a method for supplying additional information. For example, Yovisto, an academic video search service, provides time dependent metadata for videos. Users can search for and watch part of a video according to the time dependent metadata. They can also share content with friends on social media. Yovisto divides or synchronizes a video at each point where the presentation slide changes to another page.
However, we cannot employ this method on news video, since news video does not come with any PowerPoint slide presentation. A segmentation method is required to separate the news video and to create time dependent metadata. In this paper, a time dependent metadata-based framework is proposed to segment news content and to provide time dependent metadata so that users can use context information to communicate with their friends. The transcript of the news is divided using the proposed story segmentation method. We provide a tag to represent the entire content of the news, and sub tags to indicate each news segment, including the starting time of the segment. The time dependent metadata helps users track the news information. It also allows them to leave a comment on each segment of the news. Users may also share the news, based on the time metadata, either as segments or as a whole. This helps users understand the shared news. To demonstrate the performance, we evaluate the accuracy of both story segmentation and tag generation. For this purpose, we measured the accuracy of story segmentation through semantic similarity and compared it to a benchmark algorithm. Experimental results show that the proposed method outperforms benchmark algorithms in terms of story segmentation accuracy. It is important to note that sub tag accuracy matters most in the proposed framework, because sub tags are what allow the specific news context to be shared with others. To extract more accurate sub tags, we created a stop word list of terms unrelated to the content of the news, such as the names of the anchor or reporter, and applied it to the framework. We analyzed the accuracy of the tags and sub tags that represent the context of the news. The analysis suggests that the proposed framework helps users share their opinions with context information in social media and social news.
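The idea of splitting a transcript into stories and attaching time dependent sub tags can be sketched as follows. This is not the story segmentation method proposed in the paper, which is not specified in the abstract; the sketch swaps in a plain lexical cosine similarity between adjacent sentences and a fixed threshold. The stop word list, the threshold, the transcript, and the metadata field names are all assumptions made for illustration.

```python
import math
import re
from collections import Counter

# Hypothetical stop word list; the paper's list also removes anchor and reporter names.
STOP_WORDS = {"the", "a", "an", "of", "in", "to", "and", "is", "on", "for", "this"}


def bag(text):
    """Lower-cased bag of words with stop words removed."""
    words = re.findall(r"[a-z]+", text.lower())
    return Counter(w for w in words if w not in STOP_WORDS)


def cosine(a, b):
    """Cosine similarity between two word-count vectors."""
    dot = sum(a[w] * b[w] for w in a if w in b)
    norm = math.sqrt(sum(v * v for v in a.values())) * math.sqrt(sum(v * v for v in b.values()))
    return dot / norm if norm else 0.0


def segment(transcript, threshold=0.1):
    """transcript: non-empty list of (start_seconds, sentence).
    A new story starts wherever adjacent sentences fall below the similarity threshold."""
    segments = []
    current = [transcript[0]]
    for prev, cur in zip(transcript, transcript[1:]):
        if cosine(bag(prev[1]), bag(cur[1])) < threshold:
            segments.append(current)
            current = []
        current.append(cur)
    segments.append(current)
    return segments


def metadata(transcript, n_tags=2):
    """One record per story: its start time and its most frequent terms as sub tags."""
    records = []
    for seg in segment(transcript):
        counts = Counter()
        for _, sentence in seg:
            counts.update(bag(sentence))
        records.append({"start": seg[0][0], "sub_tags": [w for w, _ in counts.most_common(n_tags)]})
    return records


TRANSCRIPT = [
    (0, "Heavy rain flooded the city streets"),
    (8, "The rain closed city schools"),
    (15, "The central bank raised interest rates"),
    (24, "Higher interest rates worry the bank customers"),
]
meta = metadata(TRANSCRIPT)
```

Each record's start time is what would let a viewer jump to, comment on, or share a single story rather than the whole broadcast.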
The study examines how the environmental factors of a store influence service brand personality and repurchase intention in the service environment. The service industry has been experiencing intensified competition as a result of the industry's continuous growth and rapid technological advancement. Under these circumstances, it has become ever more important for a brand to be recognized as distinct from its competition. A brand needs to be distinguished and differentiated from competing companies because they all operate in a similar service environment. Brand differentiation increasingly depends on highlighting emotional, self-expressive, and symbolic brand functions, since the importance of such functions in promoting consumption has been further emphasized. This is the role of brand personality that has recently been emphasized in the service industry. In other words, customers now freely and actively express their personalities or egos in consumption activities, and thereby play an important role in building brand assets. Hence, the study suggests that it is necessary to spread the recognition that, particularly in the service industry, retaining existing customers contributes more to boosting repurchase intention than efforts to create new customers. Meanwhile, the store itself can offer a unique environment that may influence the consumer's purchase decision. Consumers interact with store environments in the process of virtually all household purchases they make (Sarel 1981). Thus, store environments may encourage customers to purchase. The roles of store environments are to provide informational cues to customers about the store and its goods and to communicate messages that stimulate consumers' emotions.
The store environments differentiate the store from competing stores and build a unique service brand personality. However, the existing brand-related studies in the service industry mostly concentrated on the relationship between the quality of service and customer satisfaction, and they are mostly generalized while the connective studies focused on brand personality. Such approaches show limitations and are insufficient for investigating the relationship between store environment and brand personality in the service industry. Accordingly, the study intends to identify how much the store's physical environments, which influence specific brand characteristics depending on the type of service, contribute to the establishment of brand personality. The study also intends to identify what kind of relationship with brand personality exists under the influence of store environments. In addition, the study intends to make meaningful suggestions to better direct marketing efforts by identifying whether a brand personality has a positive influence on repurchase intention. For this study, the service industry is classified into four categories based on the characteristics of service: experimental-emotional service, emotional-credible service, credible-functional service, and functional-experimental service. For each service type, the type of business with the most frequent customer contact is determined, and the enterprise with the highest brand value in each service sector is selected based on the report made by the Korea Management Association. These are designated as the representatives of each category. The selected representatives are a fast-food store (experimental-emotional service), a cinema house (emotional-credible service), a bank (credible-functional service), and a discount store (functional-experimental service).
The survey was conducted via written questionnaires for the four brands selected to represent each service category, among consumers who are experienced users of the designated stores in Seoul Metropolitan City and Gyeonggi province, in order to verify the hypotheses of the study. In particular, to assess the brand personality of each service brand, the survey adopted 15 scales, representing each characteristic factor, from the 42 traits developed by Jennifer Aaker (1997). SPSS for Windows Release 12.0 and LISREL were used for the data analysis. Structural equation modeling was used for the study, and the pivotal findings are as follows. 1) The environmental factors were classified as design factors, ambient factors, and social factors. Therefore, the validity of the measurement scale of Baker et al. (1994) was confirmed. 2) The service brand personalities were subdivided into sincerity, excitement, competence, sophistication, and ruggedness, which indicates that the brand personality scales of Jennifer Aaker (1997) are appropriate in the service industry as well. 3) One-way ANOVA on the scales of store environment and service brand personality showed statistically significant differences among the service categories. For example, the social factors were highest in discount stores, while the ambient factors and design factors were highest in fast-food stores. Discount stores scored highest in sincerity and excitement, banks in competence and ruggedness, and fast-food stores in sophistication. Consumers will respond differently to the physical environment of stores and to the service brand personality inherent to the corresponding service interface. Hence, customers will make decisions differently when dealing with different service categories.
In this respect, the relationships among the variables in the proposed hypotheses appear to work differently depending on the service category. 4) The store environment factors influenced service brand personalities differently by service category. The factors of a store's physical environment are transferred to the brand and were verified to strengthen service brand personalities. In particular, the level of influence of the physical environment on service brand personality differs by service category or dimension, which indicates a need to apply a different style of management to each service category or dimension. It signifies that a brand strategy should be established that positively influences the relationship with consumers by utilizing the brand personality factor appropriate to the characteristics of each service category or dimension. 5) The service brand personalities influenced repurchase intention. In particular, the largest influence came from the sophistication dimension of the service brand personality scale; a unique and characteristically appropriate arrangement of the physical environment will make customers stay in the service environment for a long time and will have a positive influence on repurchase intention. 6) The store environment factors influenced repurchase intention. In particular, the largest influence came from the social factors of the store environment. The most intriguing finding is that the service factor, among all environment factors, has the biggest influence on repurchase intention in almost all service types except fast-food stores. This result indicates that customers pay attention to how hard the employees try to provide quality service when they evaluate the service brand. At the same time, it also indicates that the personal factor is directly transmitted to the construction of brand personality.
The employees' attitude and behavior are determinants of service brand personality in the process of enhancing the service interface. Hence, more effort should go into finding ways to efficiently manage the service staff who have direct contact with customers, in order to improve customers' brand evaluation at the service interface. The findings suggest several managerial implications. 1) Results from the empirical study indicated that store environment factors have a strong positive impact on service brand personality. To increase customers' repurchase intention for a service brand, management is required to effectively manage store environment factors and create a friendly brand personality based on the corresponding service environment. 2) Managers and researchers must understand and recognize that store environment elements are important marketing tools, and that brand personality influences consumers' repurchase intention. Based on this result, a service brand can achieve differentiation efficiently by reinforcing the store environment elements that are most influential for each service category. Therefore, a brand personality established through various store environment elements will further reinforce the relationship with customers through elevated brand identification, and using this identification to induce repurchase decisions can serve as an entry barrier. 3) The study identified the store environment as a component of service brand personality for the store's effective communication with consumers. For this, all communication channels should be kept consistent, and integrated marketing communication should be executed to efficiently reach a larger number of customers.
Managers and researchers must find strategies for aligning decisions about store environment elements with the retailers' marketing and store personality objectives. All ambient, design, and social factors need to be orchestrated so that consumers perceive an appropriate store personality. In this study, the results of previous studies were extended to the service industry in order to identify the customers' decision-making process that leads to repurchase intention, and the results were similar to those of the previous studies. The findings suggested several theoretical and managerial implications. However, several limitations should be considered and addressed in future studies: only one service brand served as the subject of analysis for each service category, correlations among store environment elements were not identified, and the representativeness of the sample is limited. In addition, further research should examine various antecedents and consequences of brand personality in the service environment.
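The one-way ANOVA comparison reported in finding 3) above rests on a ratio of between-group to within-group variance. The sketch below computes that F statistic and its degrees of freedom by hand. The four groups stand for the four service categories, and the ratings are hypothetical numbers chosen for easy arithmetic, not data from the study; judging significance would additionally require comparing F against the F distribution.

```python
from statistics import mean


def one_way_anova_f(groups):
    """Return (F, df_between, df_within) for a list of score lists, one per group."""
    all_scores = [x for g in groups for x in g]
    grand = mean(all_scores)
    # Variation of the group means around the grand mean, weighted by group size.
    ss_between = sum(len(g) * (mean(g) - grand) ** 2 for g in groups)
    # Variation of individual scores around their own group mean.
    ss_within = sum((x - mean(g)) ** 2 for g in groups for x in g)
    df_between = len(groups) - 1
    df_within = len(all_scores) - len(groups)
    f_stat = (ss_between / df_between) / (ss_within / df_within)
    return f_stat, df_between, df_within


# Hypothetical social-factor ratings for the four service categories.
ratings = {
    "fast_food": [1, 2, 3],
    "cinema": [2, 3, 4],
    "discount_store": [3, 4, 5],
    "bank": [2, 3, 4],
}
f_stat, df_b, df_w = one_way_anova_f(list(ratings.values()))
```

A larger F means the category means differ more than the spread within categories would suggest, which is the sense in which the environment and personality scales were found to differ by service category.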