Clickstream Big Data Mining for Demographics based Digital Marketing (인구통계특성 기반 디지털 마케팅을 위한 클릭스트림 빅데이터 마이닝)
Journal of Intelligence and Information Systems, v.22 no.3, pp.143-163, 2016
The demographics of Internet users are the most basic and important source of information for target marketing and personalized advertising on digital marketing channels, which include email, mobile, and social media. However, it has gradually become difficult to collect the demographics of Internet users because their activities are anonymous in many cases. Although a marketing department can obtain demographics through online or offline surveys, these approaches are expensive, time-consuming, and likely to include false statements. Clickstream data is the record an Internet user leaves behind while visiting websites. As the user clicks anywhere on a webpage, the activity is logged in semi-structured website log files. Such data shows which pages users visited, how long they stayed, how often they visited, when they usually visited, which sites they prefer, what keywords they used to find a site, whether they purchased anything, and so forth. For this reason, some researchers have tried to infer the demographics of Internet users from their clickstream data. They derived various independent variables likely to be correlated with the demographics. The variables include search keywords, frequency and intensity by time, day, and month, the variety of websites visited, text information from the web pages visited, etc. The demographic attributes to be predicted also vary from paper to paper, and cover gender, age, job, location, income, education, marital status, and presence of children. A variety of data mining methods, such as LSA, SVM, decision tree, neural network, logistic regression, and k-nearest neighbors, were used to build prediction models. However, previous research has not yet identified which data mining method is appropriate for predicting each demographic variable. Moreover, the independent variables studied so far need to be reviewed, combined as needed, and evaluated in order to build the best prediction model.
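As a minimal sketch of how such independent variables might be derived from raw logs, the following Python snippet aggregates a handful of made-up clickstream records into per-user attributes (visit frequency, mean time on page, variety of sites, share of night and weekend visits). The record layout, the site names, the attribute names, and the definition of "night" are all assumptions chosen for illustration; they are not the 64 attributes used in the study.

```python
from collections import defaultdict
from datetime import datetime

# Hypothetical log records: (user_id, ISO timestamp, site, seconds on page).
LOG = [
    ("u1", "2016-03-01T09:15:00", "news.example", 120),
    ("u1", "2016-03-01T21:40:00", "shop.example", 300),
    ("u1", "2016-03-05T22:05:00", "shop.example", 60),
    ("u2", "2016-03-02T13:00:00", "sports.example", 45),
]


def build_profiles(records):
    """Aggregate raw click records into one attribute dict per user."""
    raw = defaultdict(
        lambda: {"visits": 0, "dwell": 0, "sites": set(), "night": 0, "weekend": 0}
    )
    for user, ts, site, seconds in records:
        t = datetime.fromisoformat(ts)
        p = raw[user]
        p["visits"] += 1
        p["dwell"] += seconds
        p["sites"].add(site)
        # Assumed definition: "night" is 20:00-05:59, "weekend" is Sat/Sun.
        p["night"] += int(t.hour >= 20 or t.hour < 6)
        p["weekend"] += int(t.weekday() >= 5)

    profiles = {}
    for user, p in raw.items():
        n = p["visits"]
        profiles[user] = {
            "visits": n,
            "mean_dwell_sec": p["dwell"] / n,
            "site_variety": len(p["sites"]),
            "night_share": p["night"] / n,
            "weekend_share": p["weekend"] / n,
        }
    return profiles


profiles = build_profiles(LOG)
for user, attrs in sorted(profiles.items()):
    print(user, attrs)
```

Each resulting dictionary is one row of a user profile table; in a supervised setting, known demographic labels would be joined to these rows by user ID.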
The objective of this study is to choose the clickstream attributes most likely to be correlated with the demographics, based on the results of previous research, and then to identify which data mining method is best suited to predicting each demographic attribute. Among the demographic attributes, this paper focuses on predicting gender, age, marital status, residence, and job. Based on the results of previous research, 64 clickstream attributes are used to predict these demographic attributes. The overall process of predictive model building is composed of four steps. In the first step, we create user profiles that include 64 clickstream attributes and 5 demographic attributes. The second step performs dimension reduction on the clickstream variables to address the curse of dimensionality and the overfitting problem. We use three approaches, based on decision tree, PCA, and cluster analysis. In the third step, we build alternative predictive models for each demographic variable. SVM, neural network, and logistic regression are used for modeling. The last step evaluates the alternative models in terms of model accuracy and selects the best model. For the experiments, we used clickstream data covering 5 demographic attributes and 16,962,705 online activities for 5,000 Internet users. IBM SPSS Modeler 17.0 was used for the prediction process, and 5-fold cross validation was conducted to enhance the reliability of the experiments. The experimental results show that there is a specific data mining method well suited to each demographic variable. For example, age prediction performs best with decision tree based dimension reduction and a neural network, whereas the prediction of gender and marital status is most accurate when SVM is applied without dimension reduction.
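The four-step process can be sketched roughly as follows. This is an illustration under stated assumptions, not the study's implementation: the study used IBM SPSS Modeler 17.0, whereas this sketch substitutes scikit-learn; the data is synthetic (300 rows, 64 numeric attributes, one binary target), and every hyperparameter, including the 10 components and clusters, is an arbitrary choice. Decision tree based reduction is approximated here by keeping features with above-average tree importance, and cluster based reduction by feature agglomeration.

```python
from itertools import product

from sklearn.cluster import FeatureAgglomeration
from sklearn.datasets import make_classification
from sklearn.decomposition import PCA
from sklearn.feature_selection import SelectFromModel
from sklearn.linear_model import LogisticRegression
from sklearn.model_selection import cross_val_score
from sklearn.neural_network import MLPClassifier
from sklearn.pipeline import Pipeline
from sklearn.preprocessing import StandardScaler
from sklearn.svm import SVC
from sklearn.tree import DecisionTreeClassifier

# Step 1 (stand-in): synthetic "user profiles" with 64 attributes and a
# binary demographic label. Real profiles would come from clickstream logs.
X, y = make_classification(
    n_samples=300, n_features=64, n_informative=10, random_state=0
)

# Step 2: candidate dimension-reduction approaches (plus no reduction).
reducers = {
    "none": "passthrough",
    "tree": SelectFromModel(DecisionTreeClassifier(random_state=0)),
    "pca": PCA(n_components=10),
    "cluster": FeatureAgglomeration(n_clusters=10),
}

# Step 3: candidate classifiers.
models = {
    "svm": SVC(),
    "neural_net": MLPClassifier(hidden_layer_sizes=(32,), max_iter=500, random_state=0),
    "logistic": LogisticRegression(max_iter=1000),
}

# Step 4: evaluate every combination with 5-fold cross validation.
results = {}
for (r_name, reducer), (m_name, model) in product(reducers.items(), models.items()):
    pipe = Pipeline(
        [("scale", StandardScaler()), ("reduce", reducer), ("model", model)]
    )
    scores = cross_val_score(pipe, X, y, cv=5, scoring="accuracy")
    results[(r_name, m_name)] = scores.mean()

best = max(results, key=results.get)
for combo, acc in sorted(results.items(), key=lambda kv: -kv[1]):
    print(f"{combo[0]:>8} + {combo[1]:<10} mean accuracy = {acc:.3f}")
print("best combination:", best)
```

Repeating the loop once per demographic target (gender, age, and so on) would give one best combination per attribute, which mirrors the comparison the study reports. Placing the reducer inside the pipeline means it is refit on each training fold, so the held-out fold does not leak into feature selection.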
We conclude that the online behaviors of Internet users, captured through clickstream data analysis, can be used effectively to predict their demographics and can thereby be applied to digital marketing.
The Album of Complete Views of Seas and Mountains comprises sixty real scenery landscape paintings depicting Geumgangsan Mountain, the Haegeumgang River, and the eight scenic views of the Gwandong region, as well as fifty-one pieces of writing. It is a rare example in terms of its size and painting style. The paintings in this album, which are densely packed with natural features, follow the painting style of the Southern School yet employ crude and unconventional elements. In them, stones on the mountains are depicted both geometrically and three-dimensionally. Since 1973, parts of this album have been published in some exhibition catalogues. The entire album was opened to the public at the special exhibition "Through the Eyes of Joseon Painters: Real Scenery Landscapes of Korea" held at the National Museum of Korea in 2019. The Album of Complete Views of Seas and Mountains was attributed to Kim Eung-hwan (1742-1789) due to the signature on the final leaf of the album and the seal reading "Bokheon" (the painter's pen name) on the currently missing album leaf of Chilbodae Peaks. However, there is a strong possibility that this signature and seal were added later. This paper intends to reexamine the creator of this album based on a variety of related factors. In order to understand the production background of the Album of Complete Views of Seas and Mountains, I investigated the eighteenth-century tradition of drawing scenic spots while travelling, in which scenery was depicted during private travels or official excursions. Jeong Seon (1676-1759), Sim Sa-jeong (1707-1769), Kim Yun-gyeom (1711-1775), Choe Buk (1712-after 1786), and Kang Se-hwang (1713-1791) all went on a journey to Geumgangsan Mountain, the most famous travel destination in the late Joseon period, and created paintings of the mountain, including Album of Pungak Mountain in the Sinmyo Year (1711) by Jeong Seon.
These painters presented their versions of the traditional scenic spots of Inner Geumgangsan and newly depicted vistas they discovered for themselves. To commemorate their private visits, they produced paintings for their fellow travelers or sponsors in an album format that could include several scenes. While the production of paintings of private travels to Geumgangsan Mountain increased, King Jeongjo (r. 1776-1800) ordered Kim Eung-hwan and Kim Hong-do, court painters at the Dohwaseo (Royal Bureau of Painting), to paint scenic spots in the nine counties of the Yeongdong region and around Geumgangsan Mountain. King Jeongjo selected these two as the painters for the official excursion, taking into account their relationship, their administrative experience as regional officials, and their distinct painting styles. Starting in the reign of King Yeongjo (r. 1724-1776), Kim Eung-hwan and Kim Hong-do served as court painters at the Dohwaseo, maintained a close relationship as senior and junior and as colleagues, and served as chalbang (chief in charge of post stations) in the Yeongnam region. While Kim Hong-do was proficient at applying soft and delicate brushstrokes, Kim Eung-hwan was skilled at depicting the beauty of robust and luxuriant landscapes. Both painters produced about 100 scenes of original drawings over the fifty days of the official excursion. Based on these original drawings, they created around seventy album leaves or handscrolls. Their paintings enriched the tradition of depicting scenic spots, particularly Inner Geumgang and the eight scenic views of Gwandong around Geumgangsan Mountain, that had developed during private journeys in the eighteenth century. Moreover, they newly discovered places of scenic beauty in the Outer Geumgang and Yeongdong regions, establishing them as new painting themes. The Album of Complete Views of Seas and Mountains consists of four volumes.
Volumes I and II include twenty-nine paintings of Inner Geumgangsan; Volume III, seventeen scenes of Outer Geumgangsan; and Volume IV, fourteen images of Maritime Geumgangsan and the eight scenic views of Gwandong. These paintings, produced on silk, show crowded compositions, geometrical depictions of the stones and the mountains, and a distinct presentation of the rocky peaks of Geumgangsan Mountain using white and grayish-blue pigments. This album reflects the Joseon painting style of the mid- and late eighteenth century, integrating influences from Jeong Seon, Kang Se-hwang, Sim Sa-jeong, Jeong Chung-yeop (1725-after 1800), and Kim Hong-do. In particular, some paintings in the album show similarities to Kim Hong-do's Album of Famous Mountains in Korea in terms of their compositions and painterly motifs. However, "Yeongrangho Lake," "Haesanjeong Pavilion," and "Wolsongjeong Pavilion" in Kim Eung-hwan's album differ from those in the version by Kim Hong-do. Thus, Kim Eung-hwan was influenced by Kim Hong-do, but produced his own distinctive album. The Album of Complete Views of Seas and Mountains includes scenes of "Jaundam Pool," "Baegundae Peak," "Viewing Birobong Peak at Anmunjeom groove," and "Baekjeongbong Peak," none of which are depicted in other albums. In his version, Kim Eung-hwan portrayed the characteristics of the natural features in each scenic spot in a detailed and refreshing manner. Moreover, he illustrated stones on the mountains using geometric shapes and added a sense of three-dimensionality using lines and planes. Based on the painting traditions of the Southern School, he established his own characteristics. He also turned natural features into triangular or rectangular chunks. All sixty paintings in this album appear rough and unconventional, but maintain their internal consistency. Each of the fifty-one writings included in the Album of Complete Views of Seas and Mountains follows a painting of a scenic spot.
It explains the depicted landscape, thus helping viewers to understand and appreciate the painting. Intimately linked to each painting, the related text notes information on traveling from one scenic spot to the next, the origins of the place names, geographic features, and other related information. Such encyclopedic documentation began in the early nineteenth century and was common in painting albums of Geumgangsan Mountain in the mid-nineteenth century. The text following the painting of Baekhwaam Hermitage in the Album of Complete Views of Seas and Mountains documents the reconstruction of the Baekhwaam Hermitage in 1845, which provides crucial evidence for dating the text. Therefore, the owner of the Album of Complete Views of Seas and Mountains might have written the texts or asked someone else to transcribe them in the mid- or late nineteenth century. In this paper, I have inferred the producer of the Album of Complete Views of Seas and Mountains to be Kim Eung-hwan based on the painting style and the tradition of drawing scenic spots during official trips. Moreover, its affinity with the Handscroll of Pungak Mountain created by Kim Ha-jong (1793-after 1878) after 1865 is another decisive factor in attributing the album to Kim Eung-hwan. In contrast to the Album of Famous Mountains in Korea by Kim Hong-do, the Album of Complete Views of Seas and Mountains exerted only a minor influence on other painters. The Handscroll of Pungak Mountain by Kim Ha-jong is the sole example that employs the subject matter of the Album of Complete Views of Seas and Mountains and follows its painting style. In the Handscroll of Pungak Mountain, Kim Ha-jong demonstrated a painting style completely different from that in the Album of Seas and Mountains that he had produced fifty years earlier, in 1816, for Yi Gwang-mun, the magistrate of Chuncheon.
He emphasized the idea of "scholar thoughts" by following the compositions, painterly elements, and depictions of figures in the painting manual style from Kim Eung-hwan's Album of Complete Views of Seas and Mountains. Kim Ha-jong, a member of the Gaeseong Kim clan and the eldest grandson of Kim Eung-hwan, is presumed to have appreciated the paintings of nature in the Album of Complete Views of Seas and Mountains, which had been passed down within the family, and newly transformed them. Furthermore, the contents and narrative styles of Yi Yu-won's writings attached to the paintings in the Handscroll of Pungak Mountain are similar to those of the fifty-one writings in Kim Eung-hwan's album. This suggests that the inscriptions in Kim Eung-hwan's album, or the original texts from which these inscriptions were quoted, may have influenced the writings in Kim Ha-jong's handscroll. However, a closer examination will be needed to determine the order in which the writings were transcribed. The Album of Complete Views of Seas and Mountains differs from Kim Hong-do's paintings of his official trips and from other painting albums he influenced. This album is a significant artwork in that it broadens the understanding of the art world of Kim Eung-hwan and illustrates another layer of real scenery landscape painting in the late eighteenth century.