• Title/Summary/Keyword: Classification trees

Search Result 317, Processing Time 0.029 seconds

Application study of random forest method based on Sentinel-2 imagery for surface cover classification in rivers - A case of Naeseong Stream - (하천 내 지표 피복 분류를 위한 Sentinel-2 영상 기반 랜덤 포레스트 기법의 적용성 연구 - 내성천을 사례로 -)

  • An, Seonggi;Lee, Chanjoo;Kim, Yongmin;Choi, Hun
    • Journal of Korea Water Resources Association
    • /
    • v.57 no.5
    • /
    • pp.321-332
    • /
    • 2024
  • Understanding the status of surface cover in riparian zones is essential for river management and flood disaster prevention. Traditional survey methods rely on expert interpretation of vegetation through vegetation mapping or indices. However, these methods are limited by their ability to accurately reflect dynamically changing river environments. Against this backdrop, this study utilized satellite imagery to apply the Random Forest method to assess the distribution of vegetation in rivers over multiple years, focusing on the Naeseong Stream as a case study. Remote sensing data from Sentinel-2 imagery were combined with ground truth data from the Naeseong Stream surface cover in 2016. The Random Forest machine learning algorithm was used to extract and train 1,000 samples per surface cover from ten predetermined sampling areas, followed by validation. A sensitivity analysis, annual surface cover analysis, and accuracy assessment were conducted to evaluate their applicability. The results showed an accuracy of 85.1% based on the validation data. Sensitivity analysis indicated the highest efficiency in 30 trees, 800 samples, and the downstream river section. Surface cover analysis accurately reflects the actual river environment. The accuracy analysis identified 14.9% boundary and internal errors, with high accuracy observed in six categories, excluding scattered and herbaceous vegetation. Although this study focused on a single river, applying the surface cover classification method to multiple rivers is necessary to obtain more accurate and comprehensive data.

An Integrated Model based on Genetic Algorithms for Implementing Cost-Effective Intelligent Intrusion Detection Systems (비용효율적 지능형 침입탐지시스템 구현을 위한 유전자 알고리즘 기반 통합 모형)

  • Lee, Hyeon-Uk;Kim, Ji-Hun;Ahn, Hyun-Chul
    • Journal of Intelligence and Information Systems
    • /
    • v.18 no.1
    • /
    • pp.125-141
    • /
    • 2012
  • These days, the malicious attacks and hacks on the networked systems are dramatically increasing, and the patterns of them are changing rapidly. Consequently, it becomes more important to appropriately handle these malicious attacks and hacks, and there exist sufficient interests and demand in effective network security systems just like intrusion detection systems. Intrusion detection systems are the network security systems for detecting, identifying and responding to unauthorized or abnormal activities appropriately. Conventional intrusion detection systems have generally been designed using the experts' implicit knowledge on the network intrusions or the hackers' abnormal behaviors. However, they cannot handle new or unknown patterns of the network attacks, although they perform very well under the normal situation. As a result, recent studies on intrusion detection systems use artificial intelligence techniques, which can proactively respond to the unknown threats. For a long time, researchers have adopted and tested various kinds of artificial intelligence techniques such as artificial neural networks, decision trees, and support vector machines to detect intrusions on the network. However, most of them have just applied these techniques singularly, even though combining the techniques may lead to better detection. With this reason, we propose a new integrated model for intrusion detection. Our model is designed to combine prediction results of four different binary classification models-logistic regression (LOGIT), decision trees (DT), artificial neural networks (ANN), and support vector machines (SVM), which may be complementary to each other. As a tool for finding optimal combining weights, genetic algorithms (GA) are used. Our proposed model is designed to be built in two steps. At the first step, the optimal integration model whose prediction error (i.e. erroneous classification rate) is the least is generated. After that, in the second step, it explores the optimal classification threshold for determining intrusions, which minimizes the total misclassification cost. To calculate the total misclassification cost of intrusion detection system, we need to understand its asymmetric error cost scheme. Generally, there are two common forms of errors in intrusion detection. The first error type is the False-Positive Error (FPE). In the case of FPE, the wrong judgment on it may result in the unnecessary fixation. The second error type is the False-Negative Error (FNE) that mainly misjudges the malware of the program as normal. Compared to FPE, FNE is more fatal. Thus, total misclassification cost is more affected by FNE rather than FPE. To validate the practical applicability of our model, we applied it to the real-world dataset for network intrusion detection. The experimental dataset was collected from the IDS sensor of an official institution in Korea from January to June 2010. We collected 15,000 log data in total, and selected 10,000 samples from them by using random sampling method. Also, we compared the results from our model with the results from single techniques to confirm the superiority of the proposed model. LOGIT and DT was experimented using PASW Statistics v18.0, and ANN was experimented using Neuroshell R4.0. For SVM, LIBSVM v2.90-a freeware for training SVM classifier-was used. Empirical results showed that our proposed model based on GA outperformed all the other comparative models in detecting network intrusions from the accuracy perspective. They also showed that the proposed model outperformed all the other comparative models in the total misclassification cost perspective. Consequently, it is expected that our study may contribute to build cost-effective intelligent intrusion detection systems.

An Intelligent Intrusion Detection Model Based on Support Vector Machines and the Classification Threshold Optimization for Considering the Asymmetric Error Cost (비대칭 오류비용을 고려한 분류기준값 최적화와 SVM에 기반한 지능형 침입탐지모형)

  • Lee, Hyeon-Uk;Ahn, Hyun-Chul
    • Journal of Intelligence and Information Systems
    • /
    • v.17 no.4
    • /
    • pp.157-173
    • /
    • 2011
  • As the Internet use explodes recently, the malicious attacks and hacking for a system connected to network occur frequently. This means the fatal damage can be caused by these intrusions in the government agency, public office, and company operating various systems. For such reasons, there are growing interests and demand about the intrusion detection systems (IDS)-the security systems for detecting, identifying and responding to unauthorized or abnormal activities appropriately. The intrusion detection models that have been applied in conventional IDS are generally designed by modeling the experts' implicit knowledge on the network intrusions or the hackers' abnormal behaviors. These kinds of intrusion detection models perform well under the normal situations. However, they show poor performance when they meet a new or unknown pattern of the network attacks. For this reason, several recent studies try to adopt various artificial intelligence techniques, which can proactively respond to the unknown threats. Especially, artificial neural networks (ANNs) have popularly been applied in the prior studies because of its superior prediction accuracy. However, ANNs have some intrinsic limitations such as the risk of overfitting, the requirement of the large sample size, and the lack of understanding the prediction process (i.e. black box theory). As a result, the most recent studies on IDS have started to adopt support vector machine (SVM), the classification technique that is more stable and powerful compared to ANNs. SVM is known as a relatively high predictive power and generalization capability. Under this background, this study proposes a novel intelligent intrusion detection model that uses SVM as the classification model in order to improve the predictive ability of IDS. Also, our model is designed to consider the asymmetric error cost by optimizing the classification threshold. Generally, there are two common forms of errors in intrusion detection. The first error type is the False-Positive Error (FPE). In the case of FPE, the wrong judgment on it may result in the unnecessary fixation. The second error type is the False-Negative Error (FNE) that mainly misjudges the malware of the program as normal. Compared to FPE, FNE is more fatal. Thus, when considering total cost of misclassification in IDS, it is more reasonable to assign heavier weights on FNE rather than FPE. Therefore, we designed our proposed intrusion detection model to optimize the classification threshold in order to minimize the total misclassification cost. In this case, conventional SVM cannot be applied because it is designed to generate discrete output (i.e. a class). To resolve this problem, we used the revised SVM technique proposed by Platt(2000), which is able to generate the probability estimate. To validate the practical applicability of our model, we applied it to the real-world dataset for network intrusion detection. The experimental dataset was collected from the IDS sensor of an official institution in Korea from January to June 2010. We collected 15,000 log data in total, and selected 1,000 samples from them by using random sampling method. In addition, the SVM model was compared with the logistic regression (LOGIT), decision trees (DT), and ANN to confirm the superiority of the proposed model. LOGIT and DT was experimented using PASW Statistics v18.0, and ANN was experimented using Neuroshell 4.0. For SVM, LIBSVM v2.90-a freeware for training SVM classifier-was used. Empirical results showed that our proposed model based on SVM outperformed all the other comparative models in detecting network intrusions from the accuracy perspective. They also showed that our model reduced the total misclassification cost compared to the ANN-based intrusion detection model. As a result, it is expected that the intrusion detection model proposed in this paper would not only enhance the performance of IDS, but also lead to better management of FNE.

Characteristics of Vegetation Structure in the Ridgeline Area of the Nakdong-Jeongmaek (낙동정맥 마루금 일대의 식생구조 특성)

  • Park, Seok-Gon;Kang, Hyun-Mi
    • Korean Journal of Environment and Ecology
    • /
    • v.30 no.3
    • /
    • pp.386-398
    • /
    • 2016
  • To understand the vegetation structure in the ridgeline area of Nakdong-jeongmaek, six primary areas in Nakdong-jeongmaek were selected and their vegetation distribution was surveyed considering the environmental conditions and artificial influences. According to the results of community classification based on TWINSPAN, the vegetation in the surveyed region was categorized into 9 groups: Betula costata-Quercus mongolica community, Q. mongolica community, Pinus densiflora-Q. mongolica community, P. densiflora community, Deciduous oaks-P. densiflora community, Deciduous oaks community, P. thunbergii community, P. koraiensis-P. rigida community, and Chamaecyparis obtusa-Alnus firma community. In Baekbyeongsan(Mt.) located in Taebaek-si of Gangwon-do, Betula costata-Quercus mongolica community was found, reflecting the environmental characteristics of northern temperate climate. P. thunbergii community appeared in Gudeoksan(Mt.) of Busan metropolitan city, which is near the coast. Since Gudeoksan(Mt.) is near to the downtown and its altitude above the sea is relatively low, people visit the area often. Therefore, C. obtusa and P. thunbergii have been planted for producing forest trees and implementing anti-erosion afforestation. In the other primary survey areas, Q. mongolica-dominant communities, P. densiflora-dominant communities, and deciduous oak-dominant communities, which are representative forest vegetation types of Jeongmaeks in South Korea, were mainly distributed, showing no significant difference compared to the forest vegetation types of other Jeongmaeks. Since the Nakdong-jeongmaek from south to north, it shows clear characteristics of vegetation changes between the northern temperate climate and the warm temperate climate of the south.

Development of BIM Templates for Vest-Pocket Park Landscape Design (소공원의 조경설계를 위한 BIM 템플릿 개발)

  • Seo, Young-hoon;Kim, Dong-pil;Moon, Ho-Gyeong
    • Journal of the Korean Institute of Landscape Architecture
    • /
    • v.44 no.1
    • /
    • pp.40-50
    • /
    • 2016
  • A BIM, which is being applied actively to the construction and civil construction industries, is a technology that can maximize efficiency of various sectors from initial planning and design, construction, and maintenance, to demolition; however, it is in the introductory phase in the field of domestic landscaping. In order to introduce and promote BIM in the field of landscape design, this study developed a prototype of a library and template and analyzed the performance of trial application. For the development of a prototype, annotations and types were analyzed from floor plans of existing small parks, and components of landscape template were deduced. Based on this, play facilities, pergola, and benches were madeintofamily and templates, making automatic design possible. In addition, annotations and tags that are often used in landscape design were made, and a 3D view was materialized through visibility/graphic reassignment. As for tables and quantities, boundary stone table, mounding table, summary sheet of quantities, table of contents, and summary sheet of packaging quantities were grouped and connected with floor plans; regarding landscaping trees, classification criteria and name of trees that are suitable for domestic situations were applied. A landscape template was created to enable the library file format(rfa) that can be mounted on a building with BIM programs. As for problems that arose after the trial application of the prepared template, some CAD files could not be imported; also, while writing tables, the basis of calculation could not be made automatically. Regarding this, it is thought that functions of a BIM program and template need improvement.

The Effects of the Biodiversity Increase after Creation of the Artificial Wetland -The Case of Ecological Pond at Seoul Technical High School- (인공습지 조성후 생물다양성 증진 효과에 관한 연구 -서울공고 생태연못을 중심으로-)

  • 김귀곤;조동길
    • Journal of the Korean Institute of Landscape Architecture
    • /
    • v.27 no.3
    • /
    • pp.1-17
    • /
    • 1999
  • The purpose of this study is to evaluate the creation techniques of artificial wetland, one of biotopes developed to promote biodiversity in urban areas, and to look for improvement steps. Specifically, artificial wetland creation techniques were categorized into living environment and living creature classification. Being living conditions for creations, habitat environment was reviewed with a focus on water and soil environments. Living creatures were classified into plants, insects, fish, and birds. The evaluation of creation techniques was done in post-construction evaluation while considering the creation of habitats for living creatures. Intervention by users, changes in living environment and living species, and relevance of creation techniques were reviewed. Key results of this study are as follows. (1) Water environment for the living environment of creatures provides a suitable environment conditions for the living of creatures through a process easing the use of piped water. Various water depths and embankment appear to have a positive impact on the living of aquatic life. In particular, embankment covered in soil naturally played an important role as a place for the activities of aquatic insects and young fish as well as the growth of aquatic plants. (2) Various aquatic and ground plants to promote insect-diversity, shallow water, and old-tree logs had contributed greatly in increasing the types and number of insects. Aquatic insects. Aquatic insects were seen much particularly in areas where aquatic plants are rich but water is shallow than any other areas. (3) A space piled with stone to provide habitats for fish was not much used. However, it was observed that fish used embankment built with natural stones and embankment using logs in areas where water is deep. In addition, it was confirmed that 1,500 fish that had been released propagated using various depths and places for birth. (4) It was analyzed that techniques (creation of island, log setting, and creation of man-made bird nests) to provide habitats and to attract birds are not serving their roles. In such a case, it is believed that species had not increased due to the smallness as well as isolated features of the area. Based on theoretical review, they are judged to be areas that are likely to be used when a greater variety of birds is introduced. It is judged that attracting and keeping more birds at the site, such spaces need to be linked systematically in the future in terms of building eco-network while ensuring an adequate living areas. (5) In the study areas, users intervened greatly. As a result, a blockage was created preventing the normal growth of plants and non-indigenous plants were introduced. In order to limit the intervention by users, setting enough buffer zones, and environment education programs were urgently required. D/H=1>Hyangkyo> houses on the river>temples>lecture halls. D/H ratio of the backside areas is as follows. D/H=1>Hyangkyo>houses on the river>lecture halls. 4. Inner garden were planted deciduous than evergreen trees with Lagerstroemia indica. Enclosed dominant trees were planted by Pinus densiflora, Querces seuata. construct GEM strain, and examined for the expression and functional stability in microcosms.

  • PDF

Multispectral Image Compression Using Classified Interband Bidirectional Prediction and Extended SPHT (영역별 대역간 양방향 예측과 확장된 SPIHT를 이용한 다분광 화상데이터의 압축)

  • Kim, Seung-Jin;Ban, Seong-Won;Kim, Byung-Ju;Park, Kyung-Nam;Kim, Young-Choon;Lee, Kuhn-Il
    • Journal of the Institute of Electronics Engineers of Korea SP
    • /
    • v.39 no.5
    • /
    • pp.486-493
    • /
    • 2002
  • In this paper, we proposed the effective multispectral image compression method using CIBP(classified interband bidrectional prediction) and extended SPIHT(set partition in hierarchical trees) in wavelet domain. We determine separately feature bands that have the highest correlation with other bands in the visible range and in the infrared range of wavelengths. Feature bands are coded to remove the spatial redundancy with SPIHT in the wavelet domain. Prediction bands that have high correlation with feature bands are wavelet transformed and they are classified into one of three classes considering reflection characteristics of the baseband. For Prediction bands, CIBP is performed to reduce the spectral redundancy. for the difference bands between prediction bands and the predicted bands, They are ordered to upgrade the compression efficiency of extended SPIHT with the largest error magnitude. The arranged bands are coded to compensate the prediction error with extended SPIHT. Experiments are carried out on the multispectral images. The results show that the proposed method reconstructs higher quality images than images reconstructed by the conventional methods at the same bit rate.

Characteristics of Vegetation Structure on the Ridge of the Naknam-Jeongmaek (낙남정맥 마루금 일대의 식생구조 특성)

  • Oh, Koo-Kyoon;Kang, Hyun-Mi;Park, Seok-Gon
    • Korean Journal of Environment and Ecology
    • /
    • v.28 no.6
    • /
    • pp.725-740
    • /
    • 2014
  • To figure out the vegetation situation in the ridge of the Naknam-Jeongmaek, six intensive survey sites were selected considering environmental factors and human interferences, 132 quadrats (area $100m^2$) were installed to survey appearing species of trees and major environmental factors, and the characteristics of vegetation structures were analyzed. The surveyed plots were divided into six groups according to the analysis of classification by TWINSPAN; Quercus mongolica-Q. variabilis community, Pinus densiflora-Q. dentata community, Chamaecyparis obtusa community, Q. mongolica-P. densiflora community, P. densiflora-P. thunbergii community, P. koraiensis community, Rhododendron spp.-Lespedeza cyrtobotrya community. P. densiflora-Q. dentata community, Q. mongolica-P. densiflora community, and P. densiflora-P. thunbergii community are expected to be succeeded by deciduous oaks because the power of deciduous oaks is strong in their lower layer. C. obtusa community, P. densiflora community, and Rhododendron spp.-L.cyrtobotrya community are artificial forests that were artificially formed and are expected to be maintained in the current state for some time because the dominance value of planted species of trees is high. Most vegetations in Naknam-Jeongmaek were secondary forests or artificial forests formed for forest tree production and forestation for erosion control. In particular the top regions and hilly sections of the mountain were mostly dominated by deciduous oaks such as Q. mongolica, Q. variabilis showed some P. densiflora community competing with deciduous oaks. On the other hand, low sections and regions adjacent to the city showed severe artificial interference since exotic species such as P. thunbergiil, C. obtusa, P. koraiensis, and Rhododendron spp. were planted.

Analysis of Survivability for Combatants during Offensive Operations at the Tactical Level (전술제대 공격작전간 전투원 생존성에 관한 연구)

  • Kim, Jaeoh;Cho, HyungJun;Kim, GakGyu
    • The Korean Journal of Applied Statistics
    • /
    • v.28 no.5
    • /
    • pp.921-932
    • /
    • 2015
  • This study analyzed military personnel survivability in regards to offensive operations according to the scientific military training data of a reinforced infantry battalion. Scientific battle training was conducted at the Korea Combat Training Center (KCTC) training facility and utilized scientific military training equipment that included MILES and the main exercise control system. The training audience freely engaged an OPFOR who is an expert at tactics and weapon systems. It provides a statistical analysis of data in regards to state-of-the-art military training because the scientific battle training system saves and utilizes all training zone data for analysis and after action review as well as offers training control during the training period. The methodologies used the Cox PH modeling (which does not require parametric distribution assumptions) and decision tree modeling for survival data such as CART, GUIDE, and CTREE for richer and easier interpretation. The variables that violate the PH assumption were stratified and analyzed. Since the Cox PH model result was not easy to interpret the period of service, additional interpretation was attempted through univariate local regression. CART, GUIDE, and CTREE formed different tree models which allow for various interpretations.

Vegetation Structure of Lower Stratum and Pinus densiflora Natural Regeneration Features from Micro-topography Classification in Pinus densiflora Forest of Anmyeon-do Island (안면도 소나무림 내 미세지형구분을 통한 하층식생구조와 소나무 천연갱신 양상)

  • Byeon, Seong Yeob;Kim, Hyun Seop;Yun, Chung Weon
    • Journal of Korean Society of Forest Science
    • /
    • v.108 no.2
    • /
    • pp.189-199
    • /
    • 2019
  • The forest management paradigm has recently shifted from focusing on commercial production to focusing on ecosystem management. Accordingly, a natural seedling regeneration method that has a naturally high affinity has attracted much attention in recent years. The aim of this study was to determine the relationship among various environmental factors, lower stratum vegetation, and seedling regeneration in Pinus densiflora forests. The survey site comprised 50 sectors divided using the line transect method, and the survey data were divided into those from wet habitat (19 sites) and dry habitat (31 sites), depending on the soil humidity, and were analyzed separately to show the close relationship between soil humidity and natural seedling regeneration. As a result, the dry habitat exhibited high seedling density (157,419 trees/ha), with the main species being Quercus serrata, Zanthoxylum piperitum, Smilax china, and Pueraria lobata, while wet habitat exhibited low seedling density (57,895 trees/ha), with the main species being Stephanandra incisa, Castanea crenata, Lespedeza maximowiczii, Lysimachia barystachys, Aralia elata, and Styrax japonicus. The height and root-collar diameter under wet conditions exhibited faster growth than those under dry conditions. Height growth by the root-collar diameter in dry habitat increased faster than that in wet habitat. It was also confirmed that seedling regeneration in wet habitat exhibited a rapid growth pattern 5 years after germination. These results suggest that the seedlings begin to grow more rapidly after a period of suppression by competition with surrounding plants. Considering an ecosystem or ecological management approach, specific practices, such as bush control and vine clearing in wet habitats, should be more intensively conducted, especially at the beginning of the management operations.