통합 검색 | Korea Science

Landslide susceptibility assessment using feature selection-based machine learning models

Liu, Lei-Lei;Yang, Can;Wang, Xiao-Mi
- Geomechanics and Engineering
- /
- 제25권1호
- /
- pp.1-16
- /
- 2021
Machine learning models have been widely used for landslide susceptibility assessment (LSA) in recent years. The large number of inputs or conditioning factors for these models, however, can reduce the computation efficiency and increase the difficulty in collecting data. Feature selection is a good tool to address this problem by selecting the most important features among all factors to reduce the size of the input variables. However, two important questions need to be solved: (1) how do feature selection methods affect the performance of machine learning models? and (2) which feature selection method is the most suitable for a given machine learning model? This paper aims to address these two questions by comparing the predictive performance of 13 feature selection-based machine learning (FS-ML) models and 5 ordinary machine learning models on LSA. First, five commonly used machine learning models (i.e., logistic regression, support vector machine, artificial neural network, Gaussian process and random forest) and six typical feature selection methods in the literature are adopted to constitute the proposed models. Then, fifteen conditioning factors are chosen as input variables and 1,017 landslides are used as recorded data. Next, feature selection methods are used to obtain the importance of the conditioning factors to create feature subsets, based on which 13 FS-ML models are constructed. For each of the machine learning models, a best optimized FS-ML model is selected according to the area under curve value. Finally, five optimal FS-ML models are obtained and applied to the LSA of the studied area. The predictive abilities of the FS-ML models on LSA are verified and compared through the receive operating characteristic curve and statistical indicators such as sensitivity, specificity and accuracy. The results showed that different feature selection methods have different effects on the performance of LSA machine learning models. FS-ML models generally outperform the ordinary machine learning models. The best FS-ML model is the recursive feature elimination (RFE) optimized RF, and RFE is an optimal method for feature selection.
https://doi.org/10.12989/gae.2021.25.1.001 인용 KSCI

가려진 얼굴의 인식 (Recognition of Occluded Face)

강현철
- 한국정보통신학회논문지
- /
- 제23권6호
- /
- pp.682-689
- /
- 2019
부분 기반 영상 표현(part-based image representation)에서는 영상의 부분적인 모습을 기저 벡터로 표현하고 기저 벡터의 선형 조합으로 영상을 분해하며, 이 때 기저 벡터의 계수가 곧 물체의 부분적인 특징을 의미하게 된다. 본 논문에는 부분 기반 영상 표현 기법인 비음수 행렬 분해(non-negative matrix factorization, NMF)를 이용하여 얼굴 영상을 표현하고 신경망 기법을 적용하여 가려진 얼굴을 인식하는 얼굴 인식을 제안한다. 표준 비음수 행렬 분해, 투영 경사 비음수 행렬 분해, 직교 비음수 행렬 분해를 이용하여 얼굴 영상을 표현하였고, 각 기법의 성능을 비교하였다. 인식기로는 학습벡터양자화 신경망을 사용하였으며, 인식기에서의 거리 척도로는 유클리디언 거리를 사용하였다. 실험 결과, 전통적인 얼굴 인식 방법에 비하여 제안한 기법이 가려진 얼굴 인식에 보다 강인함을 보인다.
https://doi.org/10.6109/jkiice.2019.23.6.682 인용 PDF KSCI HTML

심층 신경망을 활용한 전자문서 내 객체의 자동 추출 방법 연구 (Automatic Object Extraction from Electronic Documents Using Deep Neural Network)

장희진;채영훈;이상원;조진용
- 정보처리학회논문지:소프트웨어 및 데이터공학
- /
- 제7권11호
- /
- pp.411-418
- /
- 2018
인공지능 기술의 확산으로 인해 과학기술 분야에서도 연구 데이터의 확보, 저장 및 활용이 중요시 되고 있는 상황이다. 연구 데이터를 확보하기 위해 전자문서 형태의 연구논문으로부터 그래프, 표와 같은 유의미한 객체를 추출하는 다양한 방법들이 제안되고 있다. 경험적 방법론을 이용하는 기존의 연구들은 문서의 편집 특성을 일반화하여 객체들을 추출하기 때문에 다수의 이질적인 형태를 갖는 전자문서들을 대상으로 연구결과를 적용하는데는 한계가 있다. 본 논문은 경험적 방법론의 경직성을 극복하고 이질적인 전자문서들로부터 목표 객체들을 효과적으로 추출하기 위해 심층 학습 기반의 객체 추출 시스템을 제안한다. 텐서플로우 객체 탐지 API의 Faster R-CNN 알고리즘을 기반으로 새로운 학습 모델을 생성했으며 심층 학습과 평가를 위해 총 100여 편의 연구논문들을 대상으로 목표 객체들을 데이터화했다. 마지막으로 성능평가를 통해 제안한 시스템이 경험적 방법론을 적용한 비교 대상에 비해 약 5.2% 높은 성능을 보임을 확인하였다.
https://doi.org/10.3745/KTSDE.2018.7.11.411 인용 PDF KSCI HTML

데이터 마이닝 기법을 이용한 소규모 악성코드 탐지에 관한 연구 (A Study on Detection of Small Size Malicious Code using Data Mining Method)

이택현;국광호
- 융합보안논문지
- /
- 제19권1호
- /
- pp.11-17
- /
- 2019
최근 인터넷 기술을 악용하는 행위로 인하여 경제적, 정신적 피해가 증가하고 있다. 특히, 신규로 제작되거나 변형된 악성코드는 기존의 정보보호 체계를 우회하여 사이버 보안 위협의 기본 수단으로 활용되고 있다. 이를 억제하기 위한 다양한 연구가 진행되었지만, 실제 악성코드의 많은 비중을 차지하는 소규모 실행 파일에 대한 연구는 미진한 편이다. 본 연구에서는 기존에 알려진 소규모 실행 파일의 특징을 데이터마이닝 기법으로 분석하여 알려지지 않은 악성코드 탐지에 활용할 수 있는 모델을 제안한다. 데이터 마이닝 분석 기법에는 나이브베이지안, SVM, 의사결정나무, 랜덤포레스트, 인공신경망 등 다양하게 수행하였으며, 바이러스토탈의 악성코드 검출 수준에 따라서 개별적으로 정확도를 비교하였다. 결과적으로 분석 파일 34,646개에 대하여 80% 이상의 분류 정확도를 검증하였다.
PDF KSCI

Mathematical Model and Design Optimization of Reduction Gear for Electric Agricultural Vehicle

Pratama, Pandu Sandi;Byun, Jae-Young;Lee, Eun-Suk;Keefe, Dimas Harris Sean;Yang, Ji-Ung;Chung, Song-Won;Choi, Won-Sik
- 한국산업융합학회 논문집
- /
- 제22권1호
- /
- pp.1-9
- /
- 2019
In electric agricultural machine the gearbox is used to increase torque and lower the output speed of the motor shaft. The gearbox consists of several shafts, helical gears and spur gears works in series. Optimization plays an important role in gear design as reducing the weight or volume of a gear set will increase its service life and improve the bearing capacity. In this paper the basic design parameters for gear like shaft diameter and face width are considered as the input variables. The bending stress and material volume is considered as the objective function. ANSYS was used to investigate the bending stress when the variable was changed. Artificial Neural Network (ANN) was used to obtain the mathematical model of the system based on the bending stress behaviour. The ANN was used since the output system is nonlinear. The Genetic Algorithm (GA)　technique of optimization is used to obtain the optimized values of shaft diameter and face width on the pinion based on the ANN mathematical model and the results are compared as that obtained using the traditional method. The ANN and GA were performed using MATLAB. The simulation results were shown that the proposed algorithm was successfully calculated the value of shaft diameter and face width to obtain the minimal bending stress and material volume of the gearbox.
https://doi.org/10.21289/KSIC.2019.22.1.001 인용 PDF KSCI HTML

수직축 풍력터빈 성능향상을 위한 풍력타워 최적설계에 관한 연구 (Optimum Design of a Wind Power Tower to Augment Performance of Vertical Axis Wind Turbine)

조수용;임채환;조종현
- 한국항공우주학회지
- /
- 제47권3호
- /
- pp.177-186
- /
- 2019
풍력 타워는 수직형 풍력터빈의 성능을 향상시키기 위해 사용되어왔다. 하지만 올바르게 설계되지 않은 풍력 타워는 오히려 풍력터빈의 성능을 저하시킬 수 있다. 따라서 본 연구에서는 풍력 타워의 최적화 연구를 수행하였다. 이를 위하여 다음과 같이 6가지의 설계변수가 선택되었다. 즉, 가이드 벽의 외부 및 내부 반경, 스플리터의 적용 여부, 스플리터의 내부 반경, 가이드 벽의 개수 및 원주각도가 선정되었다. 최적화를 위한 목적함수는 풍력타워 내에 설치된 수직형 풍력터빈에서의 주기적인 평균 토크가 사용되었으며, 최적화 과정에서 지엽적인 최적화 결과를 피하기 위하여 실험계획법, 유전자알고리즘 및 인공신경망기법이 사용되었다. 인공신경망은 세대의 증가에 따라 지속적으로 향상하였으며, 수직 풍력터빈의 성능은 독립운전에 비하여 최적화된 풍력 타워 내에서 두 배 이상 향상되었다.
https://doi.org/10.5139/JKSAS.2019.47.3.177 인용 PDF KSCI

다중선형회귀 및 인공신경망 모형을 이용한 대설피해에 따른 피해액 예측에 관한 연구 (Prediction of damages induced by Snow using Multiple-linear regression and Artificial Neural Network model)

권순호;이의훈;정건희;김중훈
- 한국수자원학회:학술대회논문집
- /
- 한국수자원학회 2017년도 학술발표회
- /
- pp.20-20
- /
- 2017
최근 기후변화 영향에 따라 전 세계적으로 인명피해 및 재산피해를 유발하는 자연재난이 지속적으로 증가하고 있으며, 그로 인한 자연재해의 규모가 점점 더 커지고 있다. 실제로 우리나라에서도 지난 1994 년에서 2013 년까지 지난 20 년간 자연재해에 의한 피해액은 12조 3천억 원으로 집계되었으며, 이 중 강우와 태풍에 의한 피해가 85 % 이고, 대설에 의한 피해는 약 13 % 로 자연재해 중 대부분의 피해는 강우 및 태풍에서 발생하지만, 폭설에 의한 피해도 적지 않은 것으로 나타났다. 이에 따라, 정확한 예측을 위해 신뢰도 높은 자료 구축을 통한 대설피해 예측에 관한 연구가 필요한 시점이다. 본 연구에서는 대설피해액 예측을 위해 우리나라의 63개 기상 관측소에서 관측한 적설심 자료 및 기상관측 자료와 사회 경제 자료 총 11개를 대설피해 예측을 위한 입력변수로 선정하고, 이를 기상관측소가 속한 도시의 면적에 따라 3개의 지역으로 구분하였다. 주성분분석을 활용하여 선정된 입력변수들을 4개의 주성분으로 구분하고, 인공신경망 및 다중선형 회귀 모형을 구성하여 각 지역별 대설피해 예측의 오차를 분석하였다. 적용결과, 인공신경망 모형을 이용한 대설피해 예측의 수정결정계수는 22.8 %~48.2 %를 나타냈고, 다중선형회귀 모형의 수정결정 계수는 9.2 %~39.7% 로 나타났다. 그러므로 인공신경망 모형이 다중회귀 모형보다 선택된 입력자료를 활용하여 대설피해를 예측하는 목적으로 조금 더 우수한 결과를 나타내었다. 향후 자료를 보완 및 모형의 고도화를 통해 보다 정확한 대설피해 예측 함수 개발이 가능할 것으로 기대된다.
PDF

Study on Dimensionality Reduction for Sea-level Variations by Using Altimetry Data around the East Asia Coasts

Hwang, Do-Hyun;Bak, Suho;Jeong, Min-Ji;Kim, Na-Kyeong;Park, Mi-So;Kim, Bo-Ram;Yoon, Hong-Joo
- 대한원격탐사학회지
- /
- 제37권1호
- /
- pp.85-95
- /
- 2021
Recently, as data mining and artificial neural network techniques are developed, analyzing large amounts of data is proposed to reduce the dimension of the data. In general, empirical orthogonal function (EOF) used to reduce the dimension in the ocean data and recently, Self-organizing maps (SOM) algorithm have been investigated to apply to the ocean field. In this study, both algorithms used the monthly Sea level anomaly (SLA) data from 1993 to 2018 around the East Asia Coasts. There was dominated by the influence of the Kuroshio Extension and eddy kinetic energy. It was able to find the maximum amount of variance of EOF modes. SOM algorithm summarized the characteristic of spatial distributions and periods in EOF mode 1 and 2. It was useful to find the change of SLA variable through the movement of nodes. Node 1 and 5 appeared in the early 2000s and the early 2010s when the sea level was high. On the other hand, node 2 and 6 appeared in the late 1990s and the late 2000s, when the sea level was relatively low. Therefore, it is considered that the application of the SOM algorithm around the East Asia Coasts is well distinguished. In addition, SOM results processed by SLA data, it is able to apply the other climate data to explain more clearly SLA variation mechanisms.
https://doi.org/10.7780/kjrs.2021.37.1.7 인용 PDF KSCI HTML

PM10 예측 성능 향상을 위한 이진 분류 모델 비교 분석 (Comparative Analysis of the Binary Classification Model for Improving PM10 Prediction Performance)

정용진;이종성;오창헌
- 한국정보통신학회논문지
- /
- 제25권1호
- /
- pp.56-62
- /
- 2021
미세먼지 예보에 대한 높은 정확도가 요구됨에 따라 기계 학습의 알고리즘을 적용하여 예측 정확도를 높이려는 다양한 시도들이 이루어지고 있다. 그러나 미세먼지의 특성과 불균형적인 농도별 발생 비율에 대한 문제로 예측 모델의 학습 및 예측이 잘 이루어지지 않는다. 이러한 문제를 해결하기 위해 특정 농도를 기준으로 미세먼지를 저농도와 고농도로 구분하여 예측을 수행하는 등 다양한 연구가 진행되고 있다. 본 논문에서는 미세먼지 농도의 불균형 특성으로 인한 예측 성능 향상의 문제를 해결하기 위한 미세먼지 농도의 이진 분류 모델을 제안하였다. 분류 알고리즘 중 logistic regression, decision tree, SVM 및 MLP를 이용하여 PM10에 대한 이진분류 모델들을 설계하였다. 오차 행렬을 통해 성능을 비교한 결과, 4가지 모델 중 MLP 모델이 89.98%의 정확도로 가장 높은 이진 분류 성능을 보였다.
https://doi.org/10.6109/jkiice.2021.25.1.56 인용 PDF KSCI

자료기반 물환경 모델의 현황 및 발전 방향 (Data-Driven Modeling of Freshwater Aquatic Systems: Status and Prospects)

차윤경;신지훈;김영우
- 한국물환경학회지
- /
- 제36권6호
- /
- pp.611-620
- /
- 2020
Although process-based models have been a preferred approach for modeling freshwater aquatic systems over extended time intervals, the increasing utility of data-driven models in a big data environment has made the data-driven models increasingly popular in recent decades. In this study, international peer-reviewed journals for the relevant fields were searched in the Web of Science Core Collection, and an extensive literature review, which included total 2,984 articles published during the last two decades (2000-2020), was performed. The review results indicated that the rate of increase in the number of published studies using data-driven models exceeded those using process-based models since 2010. The increase in the use of data-driven models was partly attributable to the increasing availability of data from new data sources, e.g., remotely sensed hyperspectral or multispectral data. Consistently throughout the past two decades, South Korea has been one of the top ten countries in which the greatest number of studies using the data-driven models were published. Among the major data-driven approaches, i.e., artificial neural network, decision tree, and Bayesian model, were illustrated with case studies. Based on the review, this study aimed to inform the current state of knowledge regarding the biogeochemical water quality and ecological models using data-driven approaches, and provide the remaining challenges and future prospects.
https://doi.org/10.15681/KSWE.2020.36.6.611 인용 PDF KSCI

검색결과 3,103건 처리시간 0.032초

이메일무단수집거부

이용약관

제 1 장 총칙

제 2 장 이용계약의 체결

제 3 장 계약 당사자의 의무

제 4 장 서비스의 이용

제 5 장 계약 해지 및 이용 제한

제 6 장 손해배상 및 기타사항

자세히 찾기

이미지 검색 (β)