• Title/Summary/Keyword: 기계 학습 알고리즘

Search Result 774, Processing Time 0.024 seconds

A New Statistical Sampling Method for Reducing Computing time of Machine Learning Algorithms (기계학습 알고리즘의 컴퓨팅시간 단축을 위한 새로운 통계적 샘플링 기법)

  • Jun, Sung-Hae
    • Journal of the Korean Institute of Intelligent Systems
    • /
    • v.21 no.2
    • /
    • pp.171-177
    • /
    • 2011
  • Accuracy and computing time are considerable issues in machine learning. In general, the computing time for data analysis is increased in proportion to the size of given data. So, we need a sampling approach to reduce the size of training data. But, the accuracy of constructed model is decreased by going down the data size simultaneously. To solve this problem, we propose a new statistical sampling method having similar performance to the total data. We suggest a rule to select optimal sampling techniques according to given data structure. This paper shows a sampling method for reducing computing time with keeping the most of accuracy using cluster sampling, stratified sampling, and systematic sampling. We verify improved performance of proposed method by accuracy and computing time between sample data and total data using objective machine learning data sets.

항로표지 배치 적합성 검증 방안에 관한 연구

  • 백인흠;박준모;이미라;하창승
    • Proceedings of the Korean Institute of Navigation and Port Research Conference
    • /
    • 2022.06a
    • /
    • pp.81-82
    • /
    • 2022
  • 항로표지는 선박이 항로에서 안전하게 항해하는데 중요한 역할을 하며, 이 때문에 국가에서는 항로표지 배치의 적합성 여부에 대한 검토를 주기적으로 실시하고 있다. 이 연구에서는 기계학습을 이용한 새로운 항로표지 배치 적합성 검증을 위한 알고리즘 및 시스템을 구현하고 실제 항로에 적용 가능한 시스템을 개발하였다.

  • PDF

Effect of Application of Ensemble Method on Machine Learning with Insufficient Training Set in Developing Automated English Essay Scoring System (영작문 자동채점 시스템 개발에서 학습데이터 부족 문제 해결을 위한 앙상블 기법 적용의 효과)

  • Lee, Gyoung Ho;Lee, Kong Joo
    • Journal of KIISE
    • /
    • v.42 no.9
    • /
    • pp.1124-1132
    • /
    • 2015
  • In order to train a supervised machine learning algorithm, it is necessary to have non-biased labels and a sufficient amount of training data. However, it is difficult to collect the required non-biased labels and a sufficient amount of training data to develop an automatic English Composition scoring system. In addition, an English writing assessment is carried out using a multi-faceted evaluation of the overall level of the answer. Therefore, it is difficult to choose an appropriate machine learning algorithm for such work. In this paper, we show that it is possible to alleviate these problems through ensemble learning. The results of the experiment indicate that the ensemble technique exhibited an overall performance that was better than that of other algorithms.

RFA: Recursive Feature Addition Algorithm for Machine Learning-Based Malware Classification

  • Byeon, Ji-Yun;Kim, Dae-Ho;Kim, Hee-Chul;Choi, Sang-Yong
    • Journal of the Korea Society of Computer and Information
    • /
    • v.26 no.2
    • /
    • pp.61-68
    • /
    • 2021
  • Recently, various technologies that use machine learning to classify malicious code have been studied. In order to enhance the effectiveness of machine learning, it is most important to extract properties to identify malicious codes and normal binaries. In this paper, we propose a feature extraction method for use in machine learning using recursive methods. The proposed method selects the final feature using recursive methods for individual features to maximize the performance of machine learning. In detail, we use the method of extracting the best performing features among individual feature at each stage, and then combining the extracted features. We extract features with the proposed method and apply them to machine learning algorithms such as Decision Tree, SVM, Random Forest, and KNN, to validate that machine learning performance improves as the steps continue.

Improving Performance for $Na{\ddot{i}}ve$ Bayes Classifier Using Virtual Examples (가상예제를 이용한 $Na{\ddot{i}}ve$ Bayes 분류기 성능 향상)

  • Lee Yujung;Kang Byoungho;Kang Jaeho;Ryu Kwang Ryel
    • Proceedings of the Korean Information Science Society Conference
    • /
    • 2005.11b
    • /
    • pp.655-657
    • /
    • 2005
  • 기계학습에서 분류는 훈련 예제들로 학습하여 생성한 분류기를 활용하여 새로운 예제에 어느 한 범주를 부여하는 것을 말한다. 일반적으로 분류의 성능 즉 정확도의 향상은 학습 알고리즘을 개선하거나 훈련예제 집합을 변형시킴으로써 가능하다. 본 논문에서 소개하는 가상예제를 이용한 분류기 성능 향상 방안은 후자에 속한다. 실세계 분류문제에서 많은 수의 훈련예제들을 수집하는 일은 대상문제에 따라 비용이 많이 드는 경우가 있다. 또한 적은 수의 훈련예제를 학습해 생성한 분류기는 분류성능이 좋지 않을 수 있다. 본 논문에서는 이런 문제를 해결하기 위해서 가상예제를 생성해 훈련예제 집합에 추가하는 방안을 제안하고자 한다. 가상예제를 이용한 분류성능 향상방안이 $Na{\ddot{i}}ve$ Bayes 학습 알고리즘 성능 개선에 효과가 있음을 실험을 통해 확인하였다.

  • PDF

Win/Lose Prediction System : Predicting Baseball Game Results using a Hybrid Machine Learning Model (혼합형 기계 학습 모델을 이용한 프로야구 승패 예측 시스템)

  • 홍석미;정경숙;정태충
    • Journal of KIISE:Computing Practices and Letters
    • /
    • v.9 no.6
    • /
    • pp.693-698
    • /
    • 2003
  • Every baseball game generates various records and on the basis of those records, win/lose prediction about the next game is carried out. Researches on win/lose predictions of professional baseball games have been carried out, but there are not so good results yet. Win/lose prediction is very difficult because the choice of features on win/lose predictions among many records is difficult and because the complexity of a learning model is increased due to overlapping factors among the data used in prediction. In this paper, learning features were chosen by opinions of baseball experts and a heuristic function was formed using the chosen features. We propose a hybrid model by creating a new value which can affect predictions by combining multiple features, and thus reducing a dimension of input value which will be used for backpropagation learning algorithm. As the experimental results show, the complexity of backpropagation was reduced and the accuracy of win/lose predictions on professional baseball games was improved.

A Study on Automatic Recommendation of Keywords for Sub-Classification of National Science and Technology Standard Classification System Using AttentionMesh (AttentionMesh를 활용한 국가과학기술표준분류체계 소분류 키워드 자동추천에 관한 연구)

  • Park, Jin Ho;Song, Min Sun
    • Journal of Korean Library and Information Science Society
    • /
    • v.53 no.2
    • /
    • pp.95-115
    • /
    • 2022
  • The purpose of this study is to transform the sub-categorization terms of the National Science and Technology Standards Classification System into technical keywords by applying a machine learning algorithm. For this purpose, AttentionMeSH was used as a learning algorithm suitable for topic word recommendation. For source data, four-year research status files from 2017 to 2020, refined by the Korea Institute of Science and Technology Planning and Evaluation, were used. For learning, four attributes that well express the research content were used: task name, research goal, research abstract, and expected effect. As a result, it was confirmed that the result of MiF 0.6377 was derived when the threshold was 0.5. In order to utilize machine learning in actual work in the future and to secure technical keywords, it is expected that it will be necessary to establish a term management system and secure data of various attributes.

An Analysis of Artificial Intelligence Algorithms Applied to Rock Engineering (암반공학분야에 적용된 인공지능 알고리즘 분석)

  • Kim, Yangkyun
    • Tunnel and Underground Space
    • /
    • v.31 no.1
    • /
    • pp.25-40
    • /
    • 2021
  • As the era of Industry 4.0 arrives, the researches using artificial intelligence in the field of rock engineering as well have increased. For a better understanding and availability of AI, this paper analyzed the types of algorithms and how to apply them to the research papers where AI is applied among domestic and international studies related to tunnels, blasting and mines that are major objects in which rock engineering techniques are applied. The analysis results show that the main specific fields in which AI is applied are rock mass classification and prediction of TBM advance rate as well as geological condition ahead of TBM in a tunnel field, prediction of fragmentation and flyrock in a blasting field, and the evaluation of subsidence risk in abandoned mines. Of various AI algorithms, an artificial neural network is overwhelmingly applied among investigated fields. To enhance the credibility and accuracy of a study result, an accurate and thorough understanding on AI algorithms that a researcher wants to use is essential, and it is expected that to solve various problems in the rock engineering fields which have difficulty in approaching or analyzing at present, research ideas using not only machine learning but also deep learning such as CNN or RNN will increase.

A Study of XML-based FSM Definition System (XML 기반의 FSM 시스템에 관한 연구)

  • 이정훈;신성운;오상권
    • Proceedings of the Korean Information Science Society Conference
    • /
    • 2004.04b
    • /
    • pp.550-552
    • /
    • 2004
  • 가상공간에는 PC(Playerable Character), NPC(Non-Playerable Character)등의 동적 객체와 건물, 지형 등의 정적 객체들이 존재하게 된다. 동적 객체들의 경우, 현실감을 위해 인공지능이 자주 이용된다 현재까지 인공지능에 대한 연구는 유한상태기계(Finite State Machine. FSM). 학습 알고리즘, 유전자 알고리즘, 신경망 알고리즘 등을 중심으로 진행되어 왔다. 이중 유한상태기계는 비교적 알고리즘이 간단하고, 시스템의 부담이 적어 간단한 객체의 인공지능으로 가장 널리 사용되고 있다. 본 논문은 유찬상태기계를 확장하여 모드변경(Mode Change)과 그룹행동을 보여줄 수 있는 XML을 활용한 FSM 시스템을 제안한다. 여기서 모드변경이란 하나의 행동 패턴에서 다른 행동 패턴으로 변경하는 것을, 그룹행동은 여러 객체가 함께 행동하는 Flocking기법을 지칭한파. 이러한 XML을 활용한 FSM 시스템은 다양한 패턴의 정의는 물론, 객체의 상태 정의 및 수정, 확장이 용이하여, 다양한 응용 분야에서 활용될 수 있다.

  • PDF

Model Transformation and Inference of Machine Learning using Open Neural Network Format (오픈신경망 포맷을 이용한 기계학습 모델 변환 및 추론)

  • Kim, Seon-Min;Han, Byunghyun;Heo, Junyeong
    • The Journal of the Institute of Internet, Broadcasting and Communication
    • /
    • v.21 no.3
    • /
    • pp.107-114
    • /
    • 2021
  • Recently artificial intelligence technology has been introduced in various fields and various machine learning models have been operated in various frameworks as academic interest has increased. However, these frameworks have different data formats, which lack interoperability, and to overcome this, the open neural network exchange format, ONNX, has been proposed. In this paper we describe how to transform multiple machine learning models to ONNX, and propose algorithms and inference systems that can determine machine learning techniques in an integrated ONNX format. Furthermore we compare the inference results of the models before and after the ONNX transformation, showing that there is no loss or performance degradation of the learning results between the ONNX transformation.