• Title/Summary/Keyword: tree based learning

Development of Intelligent Internet Shopping Mall Supporting Tool Based on Software Agents and Knowledge Discovery Technology (소프트웨어 에이전트 및 지식탐사기술 기반 지능형 인터넷 쇼핑몰 지원도구의 개발)

  • 김재경;김우주;조윤호;김제란
    • Journal of Intelligence and Information Systems / v.7 no.2 / pp.153-177 / 2001
  • Nowadays, product recommendation is an important issue for both CRM and Internet shopping malls. Generally, a recommendation system tracks the past actions of a group of users to make recommendations to individual members of the group. Computer-mediated marketing and commerce have grown rapidly, and automatic recommendation methodologies have therefore received great attention. However, existing research and commercial tools for product recommendation still have many aspects that merit further consideration. To supplement those aspects, we devise a recommendation methodology that achieves greater recommendation effectiveness when applied to an Internet shopping mall. The suggested methodology is based on web log information, product taxonomy, association rule mining, and decision tree learning. To implement it, we also design an intelligent Internet shopping mall support system based on agent technology and develop it as a prototype system. We applied this methodology and the prototype system to a leading Korean Internet shopping mall and provide some experimental results. Through the experiment, we found that the suggested methodology can perform recommendation tasks both effectively and efficiently in real-world problems. Its systematic validity issues are also discussed.

A Methodology of Decision Making Condition-based Data Modeling for Constructing AI Staff (AI 참모 구축을 위한 의사결심조건의 데이터 모델링 방안)

  • Han, Changhee;Shin, Kyuyong;Choi, Sunghun;Moon, Sangwoo;Lee, Chihoon;Lee, Jong-kwan
    • Journal of Internet Computing and Services / v.21 no.1 / pp.237-246 / 2020
  • In this paper, a data modeling method based on decision-making conditions is proposed to make combat and battlefield management systems, which are also decision-making support systems, intelligent. The picture of a robot that sees and perceives like a human and arrives at its intended destination is easy to understand intuitively. However, we cannot find an example that implements decision-making, the most important element of human cognitive action. Although an agent can arrive at a designated office in place of a human, it does not support a decision on whether raising the market price is appropriate or whether a counter-attack is wise. After reviewing the current situation and problems in military control & command, and in order to collect the big data needed to make a machine staff's advice possible, we propose a data modeling prototype based on decision-making conditions as a way to change the current control & command system. In addition, a decision tree method is applied as an example of the decision-making that the reformed control & command system, equipped with the proposed data modeling, would perform. This paper can contribute insight into how a future AI decision-making staff will approach us.

A Study on the Meaning Landscape and Environmental Design Techniques of Yoohoedang Garden(Hageowon : 何去園) of Byulup(別業) Type Byulseo(別墅) (별업(別業) '유회당' 원림 하거원(何去園)의 의미경관 해석과 환경설계기법)

  • Shin, Sang-sup;Kim, Hyun-wuk
    • Korean Journal of Heritage: History & Science / v.46 no.2 / pp.46-69 / 2013
  • The results of this study on the meaning landscape and environmental design techniques of the Byulup Yoohoedang garden (Hageowon), based on the accounts in the collected works of Kwon Yi-jin (Yoohoedangjip, 有懷堂集), are as follows. First, Yoohoedang Kwon Yi-jin (有懷堂 權以鎭: 1668~1734) constructed a Byulup garden consisting of ancestor graves, the Byulup, a garden, and a school in three stages over 20 years in the back hill area of Moosoo-dong village, south of Mountain Bomun in Daejeon. That is, he built the Byulup (別業, Yoohoedang) after placing his father's grave on the back hill of the village, and then, in the middle stage, constructed Yoegeongam (餘慶菴) and Geoupjae (居業齋) for the protection of the pond (Napoji, 納汚池), the garden (Banhwanwon, 盤桓園), and the ancestor graves, and for his descendants' studies. He then built an extension to Yoohoedang and finally completed the large garden (Hageowon) by extending the eastern area. Second, in terms of geomancy, the Yoohoedang Byulup in the Moosoo-dong village area is a representative example that includes all the spatial elements a Byulup-type Byulseo should have: the main living house (the head family house of the Andong Kwon family), the Byulup (Yoohoedang), ancestor graves, Hageowon (garden), and Yoegeongam (cemetery management and school). Third, there are various meaning landscape elements combining the value systems of Confucianism, Buddhism, and Taoism, including: (1) remembering parents, (2) harmonious family, (3) integrity, (4) virtue, (5) noble personality, (6) good luck, (7) hermit life, (8) family prosperity and learning development, (9) grace from ancestors, (10) fairyland, (11) guarding ancestor graves, and (12) living ever-young. Fourth, after arranging the ancestral graveyard behind the village, he used the surrounding natural landscape to construct the Hageowon garden over 13 years, with a water garden consisting of 4 mountain streams and 3 ponds, and finally completed a beautiful fairyland with 5 platforms, 3 bamboo forests, and the Seokgasan (石假山, artificial hill). Fifth, he adopted landscape planting (28 kinds; pine, maple, royal azalea, azalea, persimmon tree, bamboo, willow, pomegranate tree, rose, chinensis, chaenomeles speciosa, Japanese azalea, peach tree, lotus, chrysanthemum, peony, and Paeonia suffruticosa, etc.) to express romance through poetic affection and symbols and ideals through personification, with a planting plan that considers seasonal landscapes. Landscape rocks were used through the intact use of natural rocks, connection with water elements, a garden ornament method using Seokyeonji and flower steps, and the Seokgasan method showing the essence of landscape meanings. In addition, the waterscape is characterized by the active use of water considering natural streams and physiographic conditions (eastern valley); an ecological corridor role that rhythmically connects each space of the garden with waterways following the routes; the introduction of landscape meaning connected to the value of 'gaining knowledge by the study of things', including Hwalsoodam (活水潭, pond), Mongjeong (蒙井, spring), Hosoo (濠水, stream), and Boksoo (?水, stream); and the construction of a sensory experience space with auditory and visual effects using the properties of landscape materials.

Development of High-Resolution Fog Detection Algorithm for Daytime by Fusing GK2A/AMI and GK2B/GOCI-II Data (GK2A/AMI와 GK2B/GOCI-II 자료를 융합 활용한 주간 고해상도 안개 탐지 알고리즘 개발)

  • Ha-Yeong Yu;Myoung-Seok Suh
    • Korean Journal of Remote Sensing / v.39 no.6_3 / pp.1779-1790 / 2023
  • Satellite-based fog detection algorithms are being developed to detect fog in real-time over a wide area, with a focus on the Korean Peninsula (KorPen). The GEO-KOMPSAT-2A/Advanced Meteorological Imager (GK2A/AMI, GK2A) satellite offers an excellent temporal resolution (10 min) and a spatial resolution (500 m), while GEO-KOMPSAT-2B/Geostationary Ocean Color Imager-II (GK2B/GOCI-II, GK2B) provides an excellent spatial resolution (250 m) but poor temporal resolution (1 h) with only visible channels. To enhance the fog detection level (10 min, 250 m), we developed a fused GK2AB fog detection algorithm (FDA) of GK2A and GK2B. The GK2AB FDA comprises three main steps. First, the Korea Meteorological Satellite Center's GK2A daytime fog detection algorithm is utilized to detect fog, considering various optical and physical characteristics. In the second step, GK2B data is extrapolated to 10-min intervals by matching GK2A pixels based on the closest time and location when GK2B observes the KorPen. For reflectance, GK2B normalized visible (NVIS) is corrected using GK2A NVIS of the same time, considering the difference in wavelength range and observation geometry. GK2B NVIS is extrapolated at 10-min intervals using the 10-min changes in GK2A NVIS. In the final step, the extrapolated GK2B NVIS, solar zenith angle, and outputs of GK2A FDA are utilized as input data for machine learning (decision tree) to develop the GK2AB FDA, which detects fog at a resolution of 250 m and a 10-min interval based on geographical locations. Six and four cases were used for the training and validation of GK2AB FDA, respectively. Quantitative verification of GK2AB FDA utilized ground observation data on visibility, wind speed, and relative humidity. Compared to GK2A FDA, GK2AB FDA exhibited a fourfold increase in spatial resolution, resulting in more detailed discrimination between fog and non-fog pixels. 
In general, irrespective of the validation method, the probability of detection (POD) and the Hanssen-Kuiper Skill score (KSS) of GK2AB FDA are higher or similar, indicating that it better detects previously undetected fog pixels. However, GK2AB FDA, compared to GK2A FDA, tends to over-detect fog, with a higher false alarm ratio and bias.
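
The verification scores named in this abstract (POD, false alarm ratio, bias, KSS) are conventionally derived from a 2x2 contingency table of detected versus observed fog. The following is a minimal sketch using the standard textbook definitions; the function name and the counts are hypothetical and are not taken from the paper.

```python
def fog_scores(hits, misses, false_alarms, correct_negatives):
    """Categorical verification scores from a 2x2 contingency table."""
    pod = hits / (hits + misses)                      # probability of detection
    far = false_alarms / (hits + false_alarms)        # false alarm ratio
    pofd = false_alarms / (false_alarms + correct_negatives)  # probability of false detection
    bias = (hits + false_alarms) / (hits + misses)    # frequency bias (>1 means over-detection)
    kss = pod - pofd                                  # Hanssen-Kuipers skill score
    return {"POD": pod, "FAR": far, "Bias": bias, "KSS": kss}


# Hypothetical pixel counts, for illustration only.
scores = fog_scores(hits=80, misses=20, false_alarms=40, correct_negatives=860)
for name, value in scores.items():
    print(f"{name}: {value:.3f}")
```

With these made-up counts the bias exceeds 1, which is the pattern the abstract describes as over-detection: more pixels are flagged as fog than were observed as fog.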

Suggestion of Urban Regeneration Type Recommendation System Based on Local Characteristics Using Text Mining (텍스트 마이닝을 활용한 지역 특성 기반 도시재생 유형 추천 시스템 제안)

  • Kim, Ikjun;Lee, Junho;Kim, Hyomin;Kang, Juyoung
    • Journal of Intelligence and Information Systems / v.26 no.3 / pp.149-169 / 2020
  • The "Urban Renewal New Deal project", one of the government's major national projects, aims to develop underdeveloped areas by investing 50 trillion won in 100 locations in the first year and 500 over the next four years. This project is drawing keen attention from the media and local governments. However, the project model fails to reflect the original characteristics of each area because it divides project areas into five categories: "Our Neighborhood Restoration, Housing Maintenance Support Type, General Neighborhood Type, Central Urban Type, and Economic Base Type." The keywords for successful urban regeneration in Korea, "resident participation," "regional specialization," "ministerial cooperation," and "public-private cooperation," show that when local governments propose urban regeneration projects to the government, it is most important to accurately understand the characteristics of the city and to push ahead with the projects in a way that suits those characteristics, with the help of local residents and private companies. In addition, considering the gentrification problem, which is one of the side effects of urban regeneration projects, it is important to select and implement urban regeneration types suitable for the characteristics of the area. In order to supplement the limitations of the 'Urban Regeneration New Deal Project' methodology, this study aims to propose a system that recommends urban regeneration types suitable for urban regeneration sites by utilizing various machine learning algorithms, referring to the urban regeneration types of the '2025 Seoul Metropolitan Government Urban Regeneration Strategy Plan', which was promoted based on regional characteristics. There are four types of urban regeneration in Seoul: "Low-use Low-Level Development, Abandonment, Deteriorated Housing, and Specialization of Historical and Cultural Resources" (Shon and Park, 2017). In order to identify regional characteristics, approximately 100,000 pieces of text data were collected for 22 regions where projects were carried out across the four types of urban regeneration. Using the collected data, we extracted key keywords for each region according to the type of urban regeneration and conducted topic modeling to explore whether there were differences between types. As a result, it was confirmed that a number of topics related to real estate and the economy appeared in old residential areas, and in declining and underdeveloped areas, topics reflecting the characteristics of areas where industrial activities were active in the past appeared. In the historical and cultural resource areas, which contain traces of the past, many keywords related to the government appeared, so political topics and cultural topics resulting from various events could be confirmed. Finally, in low-use and under-developed areas, many topics on real estate and accessibility emerged, indicating good accessibility; these areas mainly had the characteristics of regions where development is planned or is likely. Furthermore, a model was implemented that proposes urban regeneration types tailored to regional characteristics for regions other than Seoul. Machine learning was used to implement the model, and training data and test data were randomly extracted at an 8:2 ratio. In order to compare performance across models, the input variables were set in two ways, Count Vector and TF-IDF Vector, and five classifiers were applied: SVM (Support Vector Machine), Decision Tree, Random Forest, Logistic Regression, and Gradient Boosting. Performance was compared across the resulting 10 models. The model with the highest performance was the Gradient Boosting method using TF-IDF Vector input data, with an accuracy of 97%. Therefore, the recommendation system proposed in this study is expected to recommend urban regeneration types based on the regional characteristics of new business sites in the process of carrying out urban regeneration projects.
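
The experimental setup in this abstract (two text representations, five classifiers, a random 8:2 split) can be illustrated with a small standard-library sketch. The toy documents, the function names, and the unsmoothed idf = log(N / df) weighting are assumptions made for illustration; the abstract does not say which TF-IDF variant or library the authors used, and the classifiers appear here only as labels.

```python
import math
import random
from collections import Counter
from itertools import product


def count_vectors(docs, vocab):
    """Raw term counts per document, in vocabulary order."""
    rows = []
    for doc in docs:
        counts = Counter(doc.split())
        rows.append([counts[term] for term in vocab])
    return rows


def tfidf_vectors(docs, vocab):
    """Counts reweighted by idf = log(N / df), one common unsmoothed variant."""
    n_docs = len(docs)
    doc_freq = {t: sum(1 for doc in docs if t in doc.split()) for t in vocab}
    return [[count * math.log(n_docs / doc_freq[t]) for count, t in zip(row, vocab)]
            for row in count_vectors(docs, vocab)]


def split_8_to_2(items, seed=0):
    """Random 8:2 split into training and test items."""
    shuffled = list(items)
    random.Random(seed).shuffle(shuffled)
    cut = len(shuffled) * 8 // 10
    return shuffled[:cut], shuffled[cut:]


# Toy corpus, hypothetical and far smaller than the ~100,000 texts in the study.
docs = ["housing price housing", "factory industry decline",
        "palace heritage culture", "housing culture"]
vocab = sorted({term for doc in docs for term in doc.split()})

features = ["Count Vector", "TF-IDF Vector"]
classifiers = ["SVM", "Decision Tree", "Random Forest",
               "Logistic Regression", "Gradient Boosting"]
model_grid = list(product(features, classifiers))  # 2 x 5 = 10 combinations

print(len(model_grid), "feature/classifier combinations")
print(count_vectors(docs, vocab)[0])
print([round(w, 3) for w in tfidf_vectors(docs, vocab)[0]])
```

TF-IDF down-weights terms that occur in many documents, so a word such as "housing" that appears in half of the toy corpus contributes less per occurrence than a word unique to one document.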

Steel Plate Faults Diagnosis with S-MTS (S-MTS를 이용한 강판의 표면 결함 진단)

  • Kim, Joon-Young;Cha, Jae-Min;Shin, Junguk;Yeom, Choongsub
    • Journal of Intelligence and Information Systems / v.23 no.1 / pp.47-67 / 2017
  • Steel plate faults are among the important factors affecting the quality and price of steel plates. So far, many steelmakers have generally used a visual inspection method that relies on an inspector's intuition or experience. Specifically, the inspector checks for steel plate faults by looking at the surface of the steel plates. However, the accuracy of this method is so low that it can cause judgment errors above 30%. Therefore, an accurate steel plate faults diagnosis system has been continuously required in the industry. To meet this need, this study proposes a new steel plate faults diagnosis system using Simultaneous MTS (S-MTS), an advanced Mahalanobis Taguchi System (MTS) algorithm, to classify various surface defects of steel plates. MTS has generally been used to solve binary classification problems in various fields, but it has not been used for multi-class classification due to its low accuracy. The reason is that only one Mahalanobis space is established in MTS. In contrast, S-MTS is suitable for multi-class classification because it establishes an individual Mahalanobis space for each class. 'Simultaneous' implies comparing Mahalanobis distances at the same time. The proposed steel plate faults diagnosis system was developed in four main stages. In the first stage, after the reference groups and related variables are defined, steel plate fault data are collected and used to establish an individual Mahalanobis space for each reference group and to construct the full measurement scale. In the second stage, the Mahalanobis distances of the test groups are calculated based on the established Mahalanobis spaces of the reference groups. Then, the appropriateness of the spaces is verified by examining the separability of the Mahalanobis distances. In the third stage, orthogonal arrays and the dynamic-type Signal-to-Noise (SN) ratio are applied for variable optimization. The overall SN ratio gain is derived from the SN ratio and the SN ratio gain. If the derived overall SN ratio gain is negative, the variable should be removed. Conversely, a variable with a positive gain may be considered worth keeping. Finally, in the fourth stage, the measurement scale is reconstructed from the selected useful variables. Next, an experimental test is performed to verify the multi-class classification ability and obtain the classification accuracy. If the accuracy is acceptable, the diagnosis system can be used for future applications. This study also compared the accuracy of the proposed steel plate faults diagnosis system with that of other popular classification algorithms, including Decision Tree, Multi-Layer Perceptron Neural Network (MLPNN), Logistic Regression (LR), Support Vector Machine (SVM), Tree Bagger Random Forest, Grid Search (GS), Genetic Algorithm (GA), and Particle Swarm Optimization (PSO). The steel plate faults dataset used in the study is taken from the University of California at Irvine (UCI) machine learning repository. As a result, the proposed steel plate faults diagnosis system based on S-MTS shows a classification accuracy of 90.79%. The accuracy of the proposed diagnosis system is 6-27% higher than that of MLPNN, LR, GS, GA, and PSO. Given that the accuracy of commercial systems is only about 75-80%, the proposed system has sufficient classification performance to be applied in the industry. In addition, the proposed system can reduce the number of measurement sensors installed in the field because of the variable optimization process. These results show that the proposed system not only performs well on steel plate faults diagnosis but can also reduce operation and maintenance costs. In future work, the proposed system will be applied in the field to validate its actual effectiveness, and its accuracy will be improved based on the results.
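
The core idea described in this abstract, one Mahalanobis space per class with distances compared at the same time, can be sketched as follows. This is a simplified illustration under stated assumptions, not the authors' implementation: it uses only two variables (so the covariance matrix can be inverted in closed form), a plain sample covariance rather than the scaled distance used in MTS practice, hypothetical class labels and sample values, and it omits the orthogonal-array and SN-ratio variable selection stages.

```python
def fit_space(samples):
    """Mean vector and inverse sample covariance of one reference group (2 variables)."""
    n = len(samples)
    mx = sum(x for x, _ in samples) / n
    my = sum(y for _, y in samples) / n
    sxx = sum((x - mx) ** 2 for x, _ in samples) / (n - 1)
    syy = sum((y - my) ** 2 for _, y in samples) / (n - 1)
    sxy = sum((x - mx) * (y - my) for x, y in samples) / (n - 1)
    det = sxx * syy - sxy ** 2  # must be non-zero for the space to be usable
    inverse = ((syy / det, -sxy / det), (-sxy / det, sxx / det))
    return (mx, my), inverse


def mahalanobis_sq(point, space):
    """Squared Mahalanobis distance of a point from one class's space."""
    (mx, my), inv = space
    dx, dy = point[0] - mx, point[1] - my
    return (dx * (inv[0][0] * dx + inv[0][1] * dy)
            + dy * (inv[1][0] * dx + inv[1][1] * dy))


def classify(point, spaces):
    """Compare distances to every class space and pick the smallest."""
    return min(spaces, key=lambda label: mahalanobis_sq(point, spaces[label]))


# Hypothetical reference groups; labels and measurements are made up.
reference_groups = {
    "class_a": [(1.0, 2.0), (1.5, 1.8), (0.8, 2.4), (1.2, 2.1)],
    "class_b": [(5.0, 6.0), (5.5, 6.4), (4.8, 5.7), (5.2, 6.3)],
}
spaces = {label: fit_space(samples) for label, samples in reference_groups.items()}

print(classify((1.1, 2.0), spaces))
print(classify((5.1, 6.0), spaces))
```

Because each class has its own mean and covariance, a test sample is judged against the shape of every class rather than against a single "normal" group, which is what distinguishes this scheme from the single-space MTS described in the abstract.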

The big data method for flash flood warning (돌발홍수 예보를 위한 빅데이터 분석방법)

  • Park, Dain;Yoon, Sanghoo
    • Journal of Digital Convergence / v.15 no.11 / pp.245-250 / 2017
  • A flash flood is defined as flooding caused by intense rainfall over a relatively small area that flows rapidly through rivers and valleys in a short time with no advance warning, so it can cause property damage and casualties. This study aims to establish a flash-flood warning system using 38 accident records reported by the National Disaster Information Center and Land Surface Model (TOPLATS) data between 2009 and 2012. Three variables from the Land Surface Model were used: precipitation, soil moisture, and surface runoff. The values of the three variables over the 6 hours preceding each flash flood were reduced to 3 factors through factor analysis. Decision tree, random forest, Naive Bayes, Support Vector Machine, and logistic regression models were considered as big data methods. Prediction performance was evaluated by comparing Accuracy, Kappa, TP Rate, FP Rate, and F-Measure. The best method was suggested based on a reproducibility evaluation at each point of flash flood occurrence and a comparison of predicted versus actual counts using 4 years of data.
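
The five evaluation measures listed in this abstract can all be computed from a binary confusion matrix. The sketch below uses the standard definitions, with Cohen's kappa assumed to be the Kappa meant here; the function name and the counts are hypothetical and do not come from the study.

```python
def binary_metrics(tp, fp, fn, tn):
    """Accuracy, Kappa, TP rate, FP rate, and F-measure from a confusion matrix."""
    n = tp + fp + fn + tn
    accuracy = (tp + tn) / n
    tp_rate = tp / (tp + fn)        # recall / sensitivity
    fp_rate = fp / (fp + tn)
    precision = tp / (tp + fp)
    f_measure = 2 * precision * tp_rate / (precision + tp_rate)
    # Cohen's kappa: agreement beyond what the marginal totals would give by chance.
    expected = ((tp + fp) / n) * ((tp + fn) / n) + ((fn + tn) / n) * ((fp + tn) / n)
    kappa = (accuracy - expected) / (1 - expected)
    return {"Accuracy": accuracy, "Kappa": kappa, "TP Rate": tp_rate,
            "FP Rate": fp_rate, "F-Measure": f_measure}


# Hypothetical counts, for illustration only.
metrics = binary_metrics(tp=30, fp=12, fn=10, tn=48)
for name, value in metrics.items():
    print(f"{name}: {value:.3f}")
```

Kappa is useful alongside accuracy when events are rare, as flash floods are, because a classifier that always predicts "no flood" can score high accuracy while its kappa stays at zero.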

Prediction of Landslides and Determination of Its Variable Importance Using AutoML (AutoML을 이용한 산사태 예측 및 변수 중요도 산정)

  • Nam, KoungHoon;Kim, Man-Il;Kwon, Oil;Wang, Fawu;Jeong, Gyo-Cheol
    • The Journal of Engineering Geology / v.30 no.3 / pp.315-325 / 2020
  • This study was performed to develop a model that predicts landslides and to determine the variable importance of landslide susceptibility factors, based on the probabilistic prediction of landslides occurring on slopes along roads. Field survey data from 30,615 slopes in Korea, collected from 2007 to 2020, were analyzed to develop the landslide prediction model. A total of 131 variables, comprising 17 topographic factors and 114 geological factors (including 89 bedrock types), were used to predict landslides. Automated machine learning (AutoML) was used to classify landslides and non-landslides. The verification results revealed that the best model, an extremely randomized tree (XRT), yielded a prediction rate of 83.977% on test data. The analysis of variable importance identified 10 topographic factors and 9 geological factors as landslide susceptibility factors, with the importance of each presented as a percentage. The model evaluates the likelihood of landslide occurrence probabilistically and quantitatively by deriving the ranking of variable importance using only on-site survey data. It is considered that this model can provide decision-makers with a reliable basis for slope safety assessment through field surveys in the future.

Development of prediction model identifying high-risk older persons in need of long-term care (장기요양 필요 발생의 고위험 대상자 발굴을 위한 예측모형 개발)

  • Song, Mi Kyung;Park, Yeongwoo;Han, Eun-Jeong
    • The Korean Journal of Applied Statistics / v.35 no.4 / pp.457-468 / 2022
  • In an aged society, it is important to prevent older people from becoming disabled and needing long-term care. The purpose of this study is to develop a prediction model to identify high-risk groups who are likely to become beneficiaries of Long-Term Care Insurance. This is a retrospective study using past records of the study subjects from the National Health Insurance Service (NHIS) database. The study subjects are 7,724,101 people, the population over 65 years of age registered for medical insurance. To develop the prediction model, we used logistic regression, decision tree, random forest, and multi-layer perceptron neural network. Finally, random forest was selected as the prediction model based on the performance of the models obtained through internal and external validation. Random forest could predict about 90% of the older people in need of long-term care using the DB alone, without any information from the assessment of eligibility for long-term care. The findings might be useful in evidence-based health management for prevention services and can contribute to preemptively identifying older people who need preventive services.

The Newly changed Painting's Aesthetic of Seonbi painter Yoon DeokHee and Yun Yong Father and Son (선비화가 윤덕희(尹德熙)·윤용(尹愹) 부자(父子)의 변유적(變維的) 회화심미(繪畵審美) 고찰)

  • Kim, Doyoung
    • The Journal of the Convergence on Culture Technology / v.7 no.2 / pp.199-206 / 2021
  • The three generations of the Haenam Yoon family, running from Gongjae Yoon DuSeo (1668~1715) through Yoon DeokHee (1685~1766) to Yoon Yong (1708~1740), were based in Haenam. They cultivated their artistic spirit on the stage of Hanyang and carried on the family's art, building a reputation as a family of Seonbi painters representing the late Joseon Dynasty. Rakseo, born the eldest son of Gongjae, lived to the age of 82 and learned various fields of study, calligraphy, and painting from his father and Lee Seo. While learning the paintings of the early and mid-Joseon period and accepting the Namjong painting method, he pursued realism and a three-dimensional sense of the subject by adding a Western-style shading method. In particular, he showed outstanding talent in horse paintings and pottery figures, expressing his original 'Beauty that realistically portrays real scenery'. Cheonggo, who was born the second son of Rakseo and died at the age of 32, was good at Namjong landscape painting using various tree drawing methods. He painted the original Siuido by changing the topical poems, and used detailed observation and exploration to describe the facts of the object accurately. In addition, he expressed 'Beauty showing affection through realistic scenery' by newly changing and reinterpreting the tendency of the family's painting tradition to express the spirit as a form beyond the realistic landscape. Rakseo and Cheonggo, father and son, created the 'NogUdang' painting style, drastically changing the paintings of the late Joseon Dynasty, and had a great influence on the history of Korean painting.