• Title/Summary/Keyword: 모델선택 (model selection)


Improving Generalization Performance of Neural Networks using Natural Pruning and Bayesian Selection (자연 프루닝과 베이시안 선택에 의한 신경회로망 일반화 성능 향상)

  • 이현진;박혜영;이일병
    • Journal of KIISE: Software and Applications / v.30 no.3_4 / pp.326-338 / 2003
  • The objective of neural network design and model selection is to construct an optimal network with good generalization performance. However, training data contain noise and are limited in number, which creates a difference between the true probability distribution and the empirical one. This difference causes the learned parameters to fit only the training data and to deviate from the true data distribution, a phenomenon called overfitting. An overfitted neural network approximates the training data well but predicts poorly on new, unseen data. As the complexity of the neural network increases, overfitting becomes more severe. In this paper, taking a statistical viewpoint, we propose an integrative process for neural network design and model selection in order to improve generalization performance. First, using natural gradient learning with adaptive regularization, we try to obtain, with fast convergence, optimal parameters that are not overfitted to the training data. Next, by applying natural pruning to the obtained parameters, we generate several candidate network models of different sizes. Finally, we select an optimal model among the candidates based on the Bayesian Information Criterion. Through computer simulations on benchmark problems, we confirm the generalization and structure optimization performance of the proposed integrative process of learning and model selection.
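
The final step of the abstract above, choosing among candidate models of different sizes by the Bayesian Information Criterion, can be illustrated with a minimal sketch. This is not the authors' procedure: polynomials of increasing degree stand in for pruned networks of different sizes, the data are synthetic, and the names `fit_polynomial`, `bic`, and `select_by_bic` are hypothetical. The score used is the Gaussian-noise form BIC = n ln(RSS/n) + k ln(n), valid up to an additive constant shared by all candidates.

```python
import math
import random

def fit_polynomial(xs, ys, degree):
    """Least-squares polynomial fit via the normal equations (pure Python)."""
    m = degree + 1
    # Normal equations A c = b with A[i][j] = sum(x^(i+j)), b[i] = sum(y * x^i).
    a = [[sum(x ** (i + j) for x in xs) for j in range(m)] for i in range(m)]
    b = [sum(y * x ** i for x, y in zip(xs, ys)) for i in range(m)]
    # Gaussian elimination with partial pivoting.
    for col in range(m):
        pivot = max(range(col, m), key=lambda r: abs(a[r][col]))
        a[col], a[pivot] = a[pivot], a[col]
        b[col], b[pivot] = b[pivot], b[col]
        for row in range(col + 1, m):
            factor = a[row][col] / a[col][col]
            for k in range(col, m):
                a[row][k] -= factor * a[col][k]
            b[row] -= factor * b[col]
    # Back substitution.
    coeffs = [0.0] * m
    for i in reversed(range(m)):
        s = sum(a[i][j] * coeffs[j] for j in range(i + 1, m))
        coeffs[i] = (b[i] - s) / a[i][i]
    return coeffs

def residual_sum_of_squares(xs, ys, coeffs):
    """Sum of squared errors of the fitted polynomial on the data."""
    return sum(
        (y - sum(c * x ** i for i, c in enumerate(coeffs))) ** 2
        for x, y in zip(xs, ys)
    )

def bic(rss_value, n, k):
    """BIC for a Gaussian-noise model with k parameters, up to a constant."""
    return n * math.log(rss_value / n) + k * math.log(n)

def select_by_bic(xs, ys, degrees):
    """Fit every candidate and return (best degree, all BIC scores)."""
    n = len(xs)
    scores = {}
    for d in degrees:
        coeffs = fit_polynomial(xs, ys, d)
        scores[d] = bic(residual_sum_of_squares(xs, ys, coeffs), n, d + 1)
    return min(scores, key=scores.get), scores

# Synthetic data (assumption): a quadratic signal plus Gaussian noise.
rng = random.Random(0)
xs = [i / 50 - 1 for i in range(101)]
ys = [1.0 - 2.0 * x + 3.0 * x * x + rng.gauss(0, 0.1) for x in xs]
best_degree, scores = select_by_bic(xs, ys, range(1, 6))
```

The ln(n) penalty per parameter is what discourages the larger candidates: a higher-degree fit always lowers the residual error, but it is selected only if the reduction outweighs the added penalty.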

A Hybrid Efficient Feature Selection Model for High Dimensional Data Set based on KNHNAES (2013~2015) (KNHNAES (2013~2015) 에 기반한 대형 특징 공간 데이터집 혼합형 효율적인 특징 선택 모델)

  • Kwon, Tae il;Li, Dingkun;Park, Hyun Woo;Ryu, Kwang Sun;Kim, Eui Tak;Piao, Minghao
    • Journal of Digital Contents Society / v.19 no.4 / pp.739-747 / 2018
  • For data with a large feature space, feature selection has become an extremely important step in the data mining process. However, traditional single-process feature selection methods may no longer be adequate for this step. In this paper, we propose a hybrid, efficient feature selection model for high-dimensional data. We applied our model to the KNHNAES data set, and the results show that it outperforms many existing methods by at least 5% in accuracy.

Crop Yield Estimation Utilizing Feature Selection Based on Graph Classification (그래프 분류 기반 특징 선택을 활용한 작물 수확량 예측)

  • Ohnmar Khin;Sung-Keun Lee
    • The Journal of the Korea institute of electronic communication sciences / v.18 no.6 / pp.1269-1276 / 2023
  • Crop yield estimation is essential for meeting global food demand, and it depends on many factors such as soil, rainfall, climate, atmosphere, and their interactions. Climate change also affects farm yields. We work with a dataset that includes temperature, rainfall, humidity, and related variables. The current research focuses on feature selection with a variety of classifiers to assist farmers and agriculturalists. Crop yield estimation using the feature selection approach achieves 96% accuracy, which shows that feature selection affects a machine learning model's performance. In addition, the current graph classifier reaches 81.5%. Without feature selection, the random forest regressor achieves 78% accuracy and the decision tree regressor 67% accuracy. The contribution of this research is to show the significance of feature selection by reporting experimental results with and without it for the ten proposed algorithms. These findings help learners and students choose appropriate models for crop classification studies.

A Study of Traffic Prediction Method Based on Hidden Markov Model (은닉 마르코프 모델 기반의 교통량 예측 기법 연구)

  • Kim, Min-Jae;You, Hee-Young
    • Proceedings of the Korean Society of Computer Information Conference / 2014.01a / pp.347-348 / 2014
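  • Rapidly increasing traffic congestion has recently caused large losses of time and material resources. On the view that investment in facilities alone cannot fundamentally resolve congestion, various time-series-based traffic prediction models have been developed over the past several years to predict traffic volume more accurately. However, time-series-based models analyze past traffic volume through regression analysis and assume that past traffic patterns will continue into the future, which reduces the reliability of predictions for irregular traffic patterns that change rapidly in real time. In addition, the performance of time-series-based prediction varies considerably depending on which regression model is used, so the choice of regression model is important. To overcome these limitations, this paper proposes a highly reliable traffic volume prediction system that uses a hidden Markov model (HMM) to select the regression model suited to the current situation according to dynamic traffic patterns.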

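
The idea in the abstract above, using a hidden Markov model to decide which regression model fits the current traffic situation, might be sketched as follows. Everything here is an assumption for illustration and is not taken from the paper: the two hidden states, the discretized observations (0 = low, 1 = medium, 2 = high volume), all probabilities, and the model names in `MODEL_FOR_STATE`. The sketch runs the normalized forward algorithm to get the current state belief and then picks the model mapped to the most probable state.

```python
def forward_filter(observations, start, trans, emit):
    """Return P(state at final step | all observations) via the forward algorithm."""
    n_states = len(start)
    # Initialize with the prior times the likelihood of the first observation.
    belief = [start[s] * emit[s][observations[0]] for s in range(n_states)]
    total = sum(belief)
    belief = [b / total for b in belief]
    for obs in observations[1:]:
        # Predict: propagate the belief through the transition matrix.
        predicted = [
            sum(belief[p] * trans[p][s] for p in range(n_states))
            for s in range(n_states)
        ]
        # Update: weight by the emission likelihood, then normalize.
        belief = [predicted[s] * emit[s][obs] for s in range(n_states)]
        total = sum(belief)
        belief = [b / total for b in belief]
    return belief

# Hypothetical HMM parameters (assumptions, not from the paper).
STATES = ["free_flow", "congested"]
START = [0.5, 0.5]
TRANS = [[0.9, 0.1], [0.2, 0.8]]
EMIT = [[0.7, 0.25, 0.05], [0.05, 0.25, 0.7]]  # columns: low, medium, high volume
# Hypothetical mapping from traffic state to the regression model to use.
MODEL_FOR_STATE = {
    "free_flow": "linear_trend_model",
    "congested": "short_window_model",
}

def select_model(observations):
    """Pick the regression model tied to the most probable current state."""
    belief = forward_filter(observations, START, TRANS, EMIT)
    best = max(range(len(STATES)), key=lambda s: belief[s])
    return MODEL_FOR_STATE[STATES[best]], belief

model_calm, belief_calm = select_model([0, 0, 0])
model_busy, belief_busy = select_model([0, 2, 2])
```

In a real system the transition and emission probabilities would be estimated from historical traffic data rather than set by hand, and each state would map to a fitted regression model instead of a name.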

Using an Intervention Model for Occupational Therapy Service Specialist Based on a Special Education Supporting Center (특수교육지원센터에 기반을 둔 작업치료서비스 전문가의 중재모델 사용)

  • Kim, Se-Yun;Kim, Su-Jung
    • The Journal of the Korea Contents Association / v.11 no.4 / pp.225-234 / 2011
  • The purpose of this study was to survey the intervention models, intervention settings, and intervention time used by occupational therapists based at special education support centers, along with the factors influencing the choice of intervention model and the relationship between intervention model and intervention area. Data were provided by e-mail by 46 therapy support service professionals and then analyzed. The findings indicated, first, that the therapy support service professionals most often employed direct (individual) therapy focusing on the child's performance components. Second, they believed that direct therapy is the most effective in improving children's performance and in raising awareness of the importance of occupational therapy. Third, when choosing an intervention model, they took into consideration the child's performance component deficits and the views of the team chief. Fourth, no correlation was found between application time and the perceived effectiveness of the intervention model, nor between the application time of the intervention model and the intervention area. Taken together, the results suggest that the model used by school-based occupational therapists was similar to the medical model. Therefore, a study is needed to develop an effective intervention model and apply it in the school environment.

Three-Dimensional Active Shape Models for Medical Image Segmentation (의료영상 분할을 위한 3차원 능동 모양 모델)

  • Lim, Seong-Jae;Jeong, Yong-Yeon;Ho, Yo-Sung
    • Journal of the Institute of Electronics Engineers of Korea SC / v.44 no.5 / pp.55-61 / 2007
  • In this paper, we propose three-dimensional (3D) active shape models for medical image segmentation. To build a 3D shape model, we need to generate a point distribution model (PDM) and select corresponding landmarks in all the training shapes. Manual determination, two-dimensional (2D) methods, and limited 3D methods of establishing landmark correspondences are time-consuming, tedious, and error-prone. We therefore generate a 3D statistical shape model using a 3D model generation method based on a distance transform and a tetrahedron method for landmarking. After generating the 3D model, we extend the shape model training and gray-level model training of 2D active shape models (ASMs), and we use an integrated modeling process with scale and gray-level models for the appearance profile to represent the local structure. Experimental results are comparable to those of region-based methods, contour-based methods, and 2D ASMs.

Selection of Optimal Model for Structural System Identification (SI기법 적용을 위한 최적 모델의 선택)

  • Kwak, Hyun-Seok;Kwon, Soon-Jung;Lee, Hae-Sung;Shin, Soobong
    • Journal of the Korea institute for structural maintenance and inspection / v.9 no.2 / pp.217-224 / 2005
  • A methodology for selecting an optimal model is proposed for applying a frequency-domain SI method effectively. Instead of using a reduced finite element model, a reasonably detailed finite element model is established first and then identified. To satisfy the identifiability criterion, a parameter grouping scheme is applied to control the number of unknowns. Among the simulated member grouping cases, the optimal model is selected as the one with the minimal statistical error. The proposed approach has been examined through simulation studies on a single-span box-girder bridge.

Data Classification of Visual Quality for Image Recognition (영상인식을 위한 화질의 데이터 분류성)

  • Cho, Jae-Hyun
    • Proceedings of the Korean Society of Computer Information Conference / 2021.01a / pp.279-280 / 2021
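  • To recognize patterns or images, a machine learning model is first selected; the selected model is a multi-stage process that can be broadly divided into training data construction, feature extraction, and classification. Among these stages, constructing the training data is the first important step. This paper analyzes the features of the training data and examines whether they can serve as a measure of data classifiability, with the aim of improving recognition by machine learning and deep learning in future work.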


Optimization Techniques for Client Selection in a Federated Learning Environment (연합학습 환경에서 클라이언트 선택의 최적화 기법)

  • 박민정;손영진;채상미
    • Proceedings of the Korea Information Processing Society Conference / 2023.11a / pp.722-723 / 2023
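  • Federated learning is a distributed learning technique in which training takes place on local devices or clients and only model updates are sent to a central server, rather than collecting data at the central server; it strengthens data security and privacy protection while enabling efficient distributed learning. However, in most federated learning scenarios, training is performed on non-IID data, where clients have differing data distributions, and performance is therefore lower than that of a centralized model. To improve the performance of federated learning models, this study analyzes optimization techniques for selecting suitable clients from among candidate participants in a non-IID environment.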
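
The abstract above does not say which client selection techniques are analyzed, so the following is only a minimal sketch of the general setting, under stated assumptions. The selection rule (prefer clients whose label mix is closest to the pooled label mix) is a hypothetical heuristic chosen for illustration, and the aggregation step is a FedAvg-style weighted mean. The names `select_clients` and `federated_average` and all numbers are assumptions.

```python
def label_distribution(counts):
    """Convert per-label sample counts into a probability distribution."""
    total = sum(counts)
    return [c / total for c in counts]

def select_clients(client_label_counts, k):
    """Return the ids of the k clients whose label mix is closest (L1) to the pooled mix.

    Hypothetical heuristic for non-IID data; not taken from the paper.
    """
    num_labels = len(next(iter(client_label_counts.values())))
    pooled = [
        sum(counts[i] for counts in client_label_counts.values())
        for i in range(num_labels)
    ]
    target = label_distribution(pooled)

    def skew(client_id):
        local = label_distribution(client_label_counts[client_id])
        return sum(abs(p - q) for p, q in zip(local, target))

    return sorted(client_label_counts, key=skew)[:k]

def federated_average(updates):
    """FedAvg-style aggregation: sample-count-weighted mean of client weights.

    `updates` is a list of (weight_vector, num_local_samples) pairs.
    """
    total = sum(n for _, n in updates)
    dim = len(updates[0][0])
    return [sum(w[i] * n for w, n in updates) / total for i in range(dim)]

# Hypothetical per-client label counts for a two-class problem.
counts = {"a": [50, 50], "b": [95, 5], "c": [40, 60], "d": [0, 100]}
chosen = select_clients(counts, 2)

# Only weight vectors and sample counts reach the server, never raw data.
global_weights = federated_average([([1.0, 2.0], 100), ([3.0, 4.0], 300)])
```

Note that this heuristic requires clients to share label counts, which is itself a privacy trade-off; it is one of many possible selection criteria.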

Compiler Analysis Framework Using SVM-Based Genetic Algorithm : Feature and Model Selection Sensitivity (SVM 기반 유전 알고리즘을 이용한 컴파일러 분석 프레임워크 : 특징 및 모델 선택 민감성)

  • Hwang, Cheol-Hun;Shin, Gun-Yoon;Kim, Dong-Wook;Han, Myung-Mook
    • Journal of the Korea Institute of Information Security & Cryptology / v.30 no.4 / pp.537-544 / 2020
  • As malware technology develops, detection-evasion techniques such as mutation and obfuscation are also advancing. Detecting unknown malware is an important part of malware detection, and Malware Authorship Attribution, which detects unknown malicious code by identifying the author of distributed malware, is being studied for this purpose. In this paper, we extract the compiler information that affects binary-based author identification and investigate how sensitive classification efficiency is to feature selection, probabilistic and non-probabilistic models, and optimization across studies. In the experiments, feature selection through information gain and the support vector machine, a non-probabilistic model, showed high efficiency. Among the optimization studies, feature selection and model optimization through the proposed framework yielded high classification accuracy, and resulted in 48% feature reduction and 53 faster execution speed. This study confirms the sensitivity of classification efficiency to feature selection, model, and optimization methods.
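
Feature selection by information gain, which the abstract above reports as effective, can be illustrated with a short sketch. This is a generic textbook computation for categorical features, not the paper's framework: information gain is the entropy of the class labels minus the weighted entropy remaining after splitting on a feature. The feature names and toy values are hypothetical.

```python
import math
from collections import Counter

def entropy(labels):
    """Shannon entropy (in bits) of a list of class labels."""
    n = len(labels)
    return -sum((c / n) * math.log2(c / n) for c in Counter(labels).values())

def information_gain(feature_values, labels):
    """Entropy of the labels minus the weighted entropy after splitting on the feature."""
    n = len(labels)
    groups = {}
    for value, label in zip(feature_values, labels):
        groups.setdefault(value, []).append(label)
    remainder = sum(len(g) / n * entropy(g) for g in groups.values())
    return entropy(labels) - remainder

def rank_features(columns, labels):
    """Return feature names sorted from most to least informative."""
    return sorted(
        columns,
        key=lambda name: information_gain(columns[name], labels),
        reverse=True,
    )

# Toy data (assumption): two authors and two hypothetical categorical features.
labels = ["author_1", "author_1", "author_2", "author_2"]
columns = {
    "uninformative_flag": ["x", "y", "x", "y"],
    "informative_flag": ["x", "x", "y", "y"],
}
ranking = rank_features(columns, labels)
gain_informative = information_gain(columns["informative_flag"], labels)
gain_uninformative = information_gain(columns["uninformative_flag"], labels)
```

Keeping only the top-ranked features is one simple way to reduce the feature set before training a classifier such as a support vector machine.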