Product Recommender Systems using Multi-Model Ensemble Techniques (다중모형조합기법을 이용한 상품추천시스템)
Journal of Intelligence and Information Systems, v.19 no.2, pp.39-54, 2013
The recent explosive growth of electronic commerce gives customers many advantageous purchase opportunities. In this situation, customers who do not have enough knowledge about their purchases may accept product recommendations. Product recommender systems automatically reflect a user's preferences and provide a recommendation list to the user. Thus, the product recommender system of an online shopping store has become known as one of the most popular tools for one-to-one marketing. However, recommender systems that do not properly reflect the user's preferences cause disappointment and waste the user's time. In this study, we propose a novel recommender system that uses data mining and multi-model ensemble techniques to enhance recommendation performance by reflecting the user's preferences more precisely. The research data were collected from a real-world online shopping store that sells products from famous art galleries and museums in Korea. The data initially contained 5,759 transaction records, of which 3,167 remained after records with null values were deleted. We transform the categorical variables into dummy variables and exclude outliers. The proposed model consists of two steps. The first step predicts customers who have a high likelihood of purchasing products in the online shopping store. In this step, we first use logistic regression, decision trees, and artificial neural networks to predict the customers who are likely to purchase products in each product group. We apply these data mining techniques using SAS E-Miner software. We partition the dataset into two sets, modeling and validation, for the logistic regression and decision trees, and into three sets, training, test, and validation, for the artificial neural network model. The validation dataset is the same for all experiments.
We then combine the results of the individual predictors using multi-model ensemble techniques such as bagging and bumping. Bagging is short for "Bootstrap Aggregation"; it combines the outputs of several machine learning techniques to raise the performance and stability of prediction or classification, and it is a special form of the averaging method. Bumping is short for "Bootstrap Umbrella of Model Parameter," and it considers only the model with the lowest error value. The results show that bumping outperforms bagging and the other predictors except for the "Poster" product group. For the "Poster" product group, the artificial neural network model performs better than the other models. In the second step, we use market basket analysis to extract association rules for co-purchased products. We extract thirty-one association rules according to the values of the Lift, Support, and Confidence measures. We set the minimum transaction frequency to support an association at 5%, the maximum number of items in an association at 4, and the minimum confidence for rule generation at 10%. We also exclude extracted association rules whose lift value is below 1. After excluding duplicate rules, we finally obtain fifteen association rules. Of these fifteen rules, eleven contain associations between products in the "Office Supplies" product group, one contains an association between the "Office Supplies" and "Fashion" product groups, and the other three contain associations between the "Office Supplies" and "Home Decoration" product groups. Finally, the proposed product recommender system provides a list of recommendations to the appropriate customers. We test the usability of the proposed system using a prototype and real-world transaction and profile data. To this end, we construct the prototype system using ASP, JavaScript, and Microsoft Access.
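The two steps described above can be illustrated with a minimal Python sketch. It is not the SAS E-Miner workflow used in the study. The function names, the 0.5 decision threshold, and the toy data are all assumptions made for illustration. The bootstrap resampling that bagging and bumping normally involve is omitted, so only the combination rule is shown: averaging for bagging, lowest validation error for bumping. The rule miner handles item pairs only, whereas the study allows up to 4 items per association; the 5% support, 10% confidence, and lift of at least 1 thresholds follow the text.

```python
from itertools import combinations


def bagging(prob_lists):
    """Average the purchase probabilities predicted by several models."""
    return [sum(ps) / len(ps) for ps in zip(*prob_lists)]


def error_rate(probs, labels, threshold=0.5):
    """Share of validation cases misclassified at the given threshold (assumed 0.5)."""
    wrong = sum((p >= threshold) != bool(y) for p, y in zip(probs, labels))
    return wrong / len(labels)


def bumping(prob_lists, labels):
    """Keep only the output of the model with the lowest validation error."""
    return min(prob_lists, key=lambda probs: error_rate(probs, labels))


def pair_rules(transactions, min_support=0.05, min_confidence=0.10, min_lift=1.0):
    """Return (antecedent, consequent, support, confidence, lift) for item pairs."""
    baskets = [set(t) for t in transactions]
    n = len(baskets)
    items = sorted(set().union(*baskets))
    item_support = {i: sum(i in b for b in baskets) / n for i in items}
    rules = []
    for a, b in combinations(items, 2):
        pair_support = sum(a in t and b in t for t in baskets) / n
        if pair_support < min_support:
            continue
        for x, y in ((a, b), (b, a)):
            confidence = pair_support / item_support[x]
            lift = confidence / item_support[y]
            # Rules with lift below 1 are excluded, as in the text.
            if confidence >= min_confidence and lift >= min_lift:
                rules.append((x, y, pair_support, confidence, lift))
    return rules


if __name__ == "__main__":
    # Hypothetical validation-set outputs of three predictors and the true labels.
    labels = [1, 0, 1, 0]
    logistic = [0.9, 0.6, 0.6, 0.4]
    tree = [0.8, 0.4, 0.3, 0.1]
    neural_net = [0.7, 0.3, 0.7, 0.2]
    models = [logistic, tree, neural_net]
    print("bagging:", bagging(models))
    print("bumping:", bumping(models, labels))

    # Hypothetical shopping baskets.
    baskets = [["pen", "notebook"], ["pen", "notebook", "scarf"],
               ["pen"], ["scarf"], ["notebook", "pen"]]
    for rule in pair_rules(baskets):
        print(rule)
```

In this sketch bumping returns one model's output unchanged, which matches the description that it "considers only the model with the lowest error value", while bagging always blends all three.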
In addition, we survey user satisfaction with the recommended product list from the proposed system and with randomly selected product lists. The participants for the survey are 173 persons who use MSN Messenger, Daum Caf
The development of artificial intelligence technologies has accelerated rapidly with the Fourth Industrial Revolution, and AI research has been actively conducted in a variety of fields such as autonomous vehicles, natural language processing, and robotics. Since the 1950s, this research has focused on solving cognitive problems related to human intelligence, such as learning and problem solving. Owing to recent interest in the technology and research on various algorithms, the field of artificial intelligence has achieved more technological advances than ever. The knowledge-based system is a sub-domain of artificial intelligence; it aims to enable artificial intelligence agents to make decisions by using machine-readable and processable knowledge constructed from complex and informal human knowledge and rules in various fields. A knowledge base is used to optimize information collection, organization, and retrieval, and recently it has been used together with statistical artificial intelligence such as machine learning. More recently, the purpose of a knowledge base has been to express, publish, and share knowledge on the web by describing and connecting web resources such as pages and data. Such knowledge bases are used for intelligent processing in various fields of artificial intelligence, such as the question answering systems of smart speakers. However, building a useful knowledge base is a time-consuming task that still requires a great deal of expert effort. In recent years, much research and technology in knowledge-based artificial intelligence has used DBpedia, one of the largest knowledge bases, which aims to extract structured content from the various information in Wikipedia. DBpedia contains various information extracted from Wikipedia, such as titles, categories, and links, but the most useful knowledge comes from Wikipedia infoboxes, which are created by users and present a summary of some unifying aspect of an article.
This knowledge is created by the mapping rules between infobox structures and the DBpedia ontology schema defined in the DBpedia Extraction Framework. In this way, DBpedia can expect high reliability in terms of accuracy of knowledge, because it generates knowledge from semi-structured infobox data created by users. However, since only about 50% of all wiki pages in Korean Wikipedia contain an infobox, DBpedia has limitations in terms of knowledge scalability. This paper proposes a method to extract knowledge from text documents according to the ontology schema using machine learning. To demonstrate the appropriateness of this method, we describe a knowledge extraction model that follows the DBpedia ontology schema and is trained on Wikipedia infoboxes. Our knowledge extraction model consists of three steps: classification of documents into ontology classes, classification of the sentences appropriate for extracting triples, and value selection and transformation into the RDF triple structure. The structures of Wikipedia infoboxes are defined as infobox templates that provide standardized information across related articles, and the DBpedia ontology schema can be mapped to these infobox templates. Based on these mapping relations, we classify the input document according to infobox categories, which correspond to ontology classes. After determining the class of the input document, we classify the appropriate sentences according to the attributes belonging to that class. Finally, we extract knowledge from the sentences classified as appropriate and convert it into triples. To train the models, we generated a training data set from the Wikipedia dump by adding BIO tags to sentences, and trained on about 200 classes and about 2,500 relations for knowledge extraction. Furthermore, we conducted comparative experiments with CRF and Bi-LSTM-CRF for the knowledge extraction process.
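The BIO-tagging step and the final conversion into triples can be sketched as follows. This is only an illustration of the general idea and not the authors' pipeline: the function names, the whitespace tokenization, the first-match alignment rule, and the example sentence and relation name are all assumptions, and the CRF or Bi-LSTM-CRF tagger that would predict the tags at inference time is not shown.

```python
def bio_tag(tokens, value_tokens, relation):
    """Label the first occurrence of an infobox value in a sentence with BIO tags."""
    tags = ["O"] * len(tokens)
    m = len(value_tokens)
    if m == 0:
        return tags
    for start in range(len(tokens) - m + 1):
        if tokens[start:start + m] == value_tokens:
            tags[start] = f"B-{relation}"
            for k in range(start + 1, start + m):
                tags[k] = f"I-{relation}"
            break  # only the first match is tagged in this sketch
    return tags


def decode_spans(tokens, tags):
    """Turn a BIO-tagged sentence back into (relation, value) pairs."""
    spans = []
    relation, words = None, []
    for token, tag in zip(tokens, tags):
        if tag.startswith("B-"):
            if relation is not None:
                spans.append((relation, " ".join(words)))
            relation, words = tag[2:], [token]
        elif tag.startswith("I-") and relation == tag[2:]:
            words.append(token)
        else:
            if relation is not None:
                spans.append((relation, " ".join(words)))
            relation, words = None, []
    if relation is not None:
        spans.append((relation, " ".join(words)))
    return spans


def to_triples(subject, tokens, tags):
    """Build (subject, relation, value) triples for one tagged sentence."""
    return [(subject, rel, value) for rel, value in decode_spans(tokens, tags)]


if __name__ == "__main__":
    # Hypothetical training example: the value comes from an infobox field.
    tokens = "The company was founded by Lee Byung-chul .".split()
    tags = bio_tag(tokens, ["Lee", "Byung-chul"], "founder")
    print(list(zip(tokens, tags)))
    print(to_triples("Samsung", tokens, tags))
```

Generating tags from existing infobox values in this way is what lets the infobox act as supervision; at prediction time the same decoding step would be applied to tags produced by the trained sequence model.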
Through this proposed process, structured knowledge can be utilized by extracting knowledge from text documents according to the ontology schema. In addition, this methodology can significantly reduce the effort required of experts to construct instances according to the ontology schema.
In the primitive society of Korea, land was originally held in common by the community, and in the age of the ancient society of SAM KUK (SILLA, KOKURYOE and PAEK JE) it was distributed under the principle of land-nationalization. However, private occupation had already begun to form through the occupation of lands that were permitted to be handed down from generation to generation, such as Royal Grant Lands and newly cleared lands. Thus the private ownership of land, which originated with the chiefs of the tribes, tended to spread gradually to the members of the community. After the SILLA Kingdom unified SAM KUK in 668 A.D., the JEONG JEON System and the KWAN RYO JEON System, distribution systems of farmlands that originated in the TANG Dynasty of China, were enforced to establish the basis of an absolute monarchy. Even in this age, the forest area was jointly controlled and commonly used by village communities because of its abundant area and stocked volume, and the private ownership of forest land was prohibited by law under the influence of the TANG Dynasty system. Toward the end of the SILLA Dynasty, however, as its centralism became weak, the private occupancy of farmland by influential persons expanded, and at the same time the occupancy of forest land by aristocrats and Buddhist temples began to appear. In the ensuing KORYO Dynasty (918 to 1392 A.D.), the JEON SI KWA System was strengthened under the principle of land-nationalization, and the privilege of tax collection was transferred to the bureaucrats and the aristocrats as a means of material compensation. Taking this opportunity, the influential persons began to expand on a large scale the lands from which they collected taxes. Therefore, by about the middle of the 11th century, farmlands and forest lands had been annexed by influential persons not only in the vicinity of the capital but also in the border area.
Toward the end of the KORYO Dynasty, the royal families, the bureaucrats and the local lords all possessed manors and occupied forest lands on a large scale as part of their farmlands. In the KORYO Dynasty, whose national economic foundation was based upon land, the disorder of the land system threatened the fall of the Dynasty, and so the land reform carried out by General YI SEONG-GYE led to the creation of the ensuing YI Dynasty. All systems of the YI Dynasty were substantially adopted from those of the KORYO Dynasty; thus the KWA JEON System was enforced under the principle of land-nationalization, while the occupancy of forest land, except for national or royal uses, was strictly prohibited by the forbidden item in KYEONG JE YUK JEON SOK JEON, one of the codes provided by the successive kings of the YI Dynasty. Thus the basis of the forest land system throughout the YI Dynasty was established, while the private forest areas possessed by influential persons since the previous KORYO Dynasty continued to be preserved under the influence of their authority. Therefore, this principle of prohibition was nothing but a legal fiction for the security of sovereign powers. Consequently, the private occupancy of forest area gradually enlarged, and finally, toward the end of the YI Dynasty, privately possessed forest lands came to be officially authorized. The forest administration systems in the YI Dynasty are summarized as follows: a) KEUM SAN and BONG SAN. Under the principle of land-nationalization by a powerful centralism, the KWA JEON System was established at the beginning of the YI Dynasty, and its government expropriated all the forests and strictly prohibited private occupation.
In order to maintain the dignity of the royal capital, the forests surrounding the capital area were instituted as KEUM SAN (the reserved forests), and well-stocked natural forest lands throughout the nation were chosen by the government as BONG SAN (national forests for timber production), where the government appointed SAN JIK (forest rangers) and gave them duties to protect and afforest the forests. This forest reservation system exacted statute labor from the people of the mountainous districts, and yet their common use of the forests was rigidly restricted. This aroused their strong aversion to the forest reservation, and as a result those forest lands were severely damaged by them. To settle this difficult problem, successive kings repeatedly emphasized the preservation of the forests, and a regulation for forest preservation was provided in KYEONG KUK DAI JEON, the written constitution of the YI Dynasty, but the desired results could not be obtained. Subsequently, the split of the bureaucrats, with incessant feuds among politicians and scholars, weakened the centralism, and moreover the foreign invasions from 1592 onward left the national land devastated and the rural communities impoverished. It happened that many wandering peasants from rural areas moved into the deep forest lands, where they recklessly cultivated burnt fields in the reserved forests, severely damaging the national forests. It was inevitable that the government would increase the number of BONG SAN in order to solve the problem of timber shortage. This increase inevitably accelerated illegal and reckless cutting by the people living in the mountainous districts, and so the government issued excessive laws and ordinances to reserve the forests. In the middle of the 18th century, as the severe feuds among the politicians were brought under control, the excessive laws and ordinances were put in good order and the political situation became temporarily stabilized.
But in spite of those endeavors, the evil habits of forest devastation, which had been inveterate since the KORYO Dynasty, continued to grow worse. After the conclusion of "the Treaty of KANG WHA with Japan" in 1876, a western administration system began to be adopted, and thereafter, through the promulgation of the Forest Law in 1908, the Imperial Forests were separated from the National Forests and the modern forest ownership system was fixed. b) KANG MU JANG. After the reorganization of the military system, which attached importance to the Royal Guard Corps, the founder of the YI Dynasty, TAI JO (1392 to 1398 A.D.), instituted the royal preserves, KANG MU JANG, for the purposes of military training and royal hunting, strictly prohibiting private hunting, felling and clearing by the rural inhabitants. Moreover, the tyrant YEON SAN (1495 to 1506 A.D.) arbitrarily expanded the preserves widely and strengthened the prohibition, so KANG MU JANG became the focus of public antipathy. After the Japanese invasion of 1592, however, military training methods had to be reformed because of changes in arms and tactics; the royal preserves were consequently laid aside, and from the 17th century they finally became the private forests of influential persons. c) Forests for official use. All the forests for official use occupied by government offices since the KORYO Dynasty were expropriated by the YI Dynasty in 1392, and afterwards forests of a fixed standard area were allotted to the government offices in need of firewood; as the forest resources became exhausted with the decline in forest yield, each office gradually enlarged its allotted area.
In the 17th century, the national land had been almost devastated by the Japanese invasion, and each office therefore suffered a severe deficit in revenue; thereafter, waste lands and forest lands were allotted to government offices in order to promote land clearing and increase tax collection. This gave rise to the abuse of wide occupation of the forests by the offices and became a cause of disorder in the forest land system. A provision prohibiting new allotments of forests for official use was therefore enacted in 1672; nevertheless, the government offices kept trying to enlarge their occupied areas by encroaching on the boundaries, and this abuse continued up to the end of the YI Dynasty. d) Private forests. At the beginning of the YI Dynasty, the government expropriated the forests all over the country under the principle of prohibiting private occupancy of forest lands except for national uses, but it could not completely expropriate all of the forest lands privately occupied and successively inherited by bureaucrats, and even local governors could not control them because of their strong influence. Accordingly, King TAI JONG (1401 to 1418 A.D.) legislated the prohibition of private forest occupancy in his code, KYEONG JE YUK JEON (1413), and furthermore he repeatedly emphasized observance of the law. But the private occupancy of forest lands had still not ceased by the age of King SE JO (1455 to 1468 A.D.), so he prescribed the following provision in KYEONG KUK DAI JEON (1474), an immutable law serving as the written constitution of the YI Dynasty: "Anyone who privately occupies forest land shall be punished with 80 floggings", and he prohibited the private possession of forest area even by princes and princesses.
But it seemed almost impossible for a single provision in a code to obstruct the historically growing tendency of private forest occupancy. For example, King SEONG JONG (1470 to 1494 A.D.) himself granted forests to his royal families in defiance of the prohibition, and thereafter such precedents successively expanded; besides, taking advantage of these facts, the influential persons openly acquired their private forest lands. After the tyrannical rule of King YEON SAN (1495 to 1506 A.D.), the political disorder caused by the splits among the bureaucrats, with successive feuds and usurpations of the throne, accelerated private forest occupancy in all parts of the country; thus the forbidden clause on private forest occupancy in the law had been merely a legal fiction since the establishment of the Dynasty. As mentioned above, after the Japanese invasion of 1592, the courts of princes (KUNG BANG) fell into financial difficulties, and successive kings transferred the right of tax collection from fisheries and salt farms to each KUNG BANG and at the same time allotted forest areas to them in an attempt to promote clearing. Availing themselves of this opportunity, royal families and bureaucrats sought to occupy the forests on a large scale. Besides, the privilege of free selection of grave yards, which had been customary since the era of the KORYO Dynasty, led to the abuse of occupying excessively wide areas for grave yards in any forest at will, so King TAI JONG restricted the area of the grave yard and homestead of each family. Under the policy of suppression of Buddhism in the YI Dynasty, Buddhist temples were deprived of their privilege of tax exemption, and temple forests had to follow the same course as private forests.
In the middle of the 18th century, King YEONG JO (1725 to 1776 A.D.) adopted a policy of impartiality toward the political parties and promoted the spirit of observing laws by putting in good order the royal orders and regulations that had been excessively issued before, and thus the confused political situation was saved. Meanwhile, the government officially permitted private forest ownership, which in substance had already been tacitly permitted, and at the same time the private afforestation areas around grave yards were authorized as private forests, at least within YONG HO (a boundary of grave yard). Consequently, through the enforcement of the above-mentioned policies, the forbidden clause on private forest ownership, which had been a basic principle of the forest system in the YI Dynasty, remained entirely as only a historical document. Under the rule of King SUN JO (1801 to 1834 A.D.), the political situation again fell into confusion, and as a result of the exploitation of farmers by bureaucrats, the extremely impoverished rural communities produced a succession of wandering peasants who cleared burnt fields and deforested recklessly. In this way the devastation of forests came to its peak, in private forests and national forests alike; moreover, the influential persons extorted private forests or reserved forests, and their expansion of grave yards also became excessive. In 1894 a regulation was issued that extorted private forests should be returned to their initial proprietors, and the taking of wide areas for grave yards was also prohibited. After a reform of the administrative structure following the western style, a modern forest possession system was prepared in 1908 by the Forest Law, which included a regulation on the return system of forest land ownership. At this point the forbidden clause on private occupancy of forest land, which had been kept, even if only in a fictitious state, since the foundation of the YI Dynasty, was abolished. e) Common forests.
As mentioned above, the forest system in the YI Dynasty rested on the principle of public ownership, but the farmers' benefits from the forests were highly restricted as the private possession of forest area progressed. The farmers therefore realized the necessity of possessing common forests. They organized village associations, SONGE or KEUM SONGE, to take the ownerless forests remaining around the village as common forests in opposition to influential persons, and they also prepared a self-punishment system for the common management of their forests. By preserving the common forests, they contributed to forest protection in the late YI Dynasty. It is generally known that an absolute monarchy expropriates the widespread common forests all over the country in the process of changing from a feudal society to a capitalistic one. At this turning point in Korea, Japanese colonialists made public that the ratio of national to private forest lands in the late YI Dynasty was 8 to 2, but this was merely a distorted statistic intended to rationalize their dispossession of forests from Korean owners, and they took advantage of the dead forbidden clause on the private occupancy of forests for their colonization. They pretended that all forests had been ownerless, but in truth almost all the forest lands in the late YI Dynasty, except national forests, were in a state of private ownership or private occupancy, regardless of their lawfulness.