• 제목/요약/키워드: optimal learning

검색결과 1,187건 처리시간 0.024초

A Study on the Implementation of Crawling Robot using Q-Learning

  • Hyunki KIM;Kyung-A KIM;Myung-Ae CHUNG;Min-Soo KANG
    • 한국인공지능학회지
    • /
    • 제11권4호
    • /
    • pp.15-20
    • /
    • 2023
  • Machine learning is comprised of supervised learning, unsupervised learning and reinforcement learning as the type of data and processing mechanism. In this paper, as input and output are unclear and it is difficult to apply the concrete modeling mathematically, reinforcement learning method are applied for crawling robot in this paper. Especially, Q-Learning is the most effective learning technique in model free reinforcement learning. This paper presents a method to implement a crawling robot that is operated by finding the most optimal crawling method through trial and error in a dynamic environment using a Q-learning algorithm. The goal is to perform reinforcement learning to find the optimal two motor angle for the best performance, and finally to maintain the most mature and stable motion about EV3 Crawling robot. In this paper, for the production of the crawling robot, it was produced using Lego Mindstorms with two motors, an ultrasonic sensor, a brick and switches, and EV3 Classroom SW are used for this implementation. By repeating 3 times learning, total 60 data are acquired, and two motor angles vs. crawling distance graph are plotted for the more understanding. Applying the Q-learning reinforcement learning algorithm, it was confirmed that the crawling robot found the optimal motor angle and operated with trained learning, and learn to know the direction for the future research.

변분법을 이용한 재귀신경망의 온라인 학습 (A on-line learning algorithm for recurrent neural networks using variational method)

  • 오원근;서병설
    • 제어로봇시스템학회논문지
    • /
    • 제2권1호
    • /
    • pp.21-25
    • /
    • 1996
  • In this paper we suggest a general purpose RNN training algorithm which is derived on the optimal control concepts and variational methods. First, learning is regared as an optimal control problem, then using the variational methods we obtain optimal weights which are given by a two-point boundary-value problem. Finally, the modified gradient descent algorithm is applied to RNN for on-line training. This algorithm is intended to be used on learning complex dynamic mappings between time varing I/O data. It is useful for nonlinear control, identification, and signal processing application of RNN because its storage requirement is not high and on-line learning is possible. Simulation results for a nonlinear plant identification are illustrated.

  • PDF

Optimal iterative learning control with model uncertainty

  • Le, Dang Khanh;Nam, Taek-Kun
    • Journal of Advanced Marine Engineering and Technology
    • /
    • 제37권7호
    • /
    • pp.743-751
    • /
    • 2013
  • In this paper, an approach to deal with model uncertainty using norm-optimal iterative learning control (ILC) is mentioned. Model uncertainty generally degrades the convergence and performance of conventional learning algorithms. To deal with model uncertainty, a worst-case norm-optimal ILC is introduced. The problem is then reformulated as a convex minimization problem, which can be solved efficiently to generate the control signal. The paper also investigates the relationship between the proposed approach and conventional norm-optimal ILC; where it is found that the suggested design method is equivalent to conventional norm-optimal ILC with trial-varying parameters. Finally, simulation results of the presented technique are given.

다목적을 고려한 전력 시스템의 최적운용을 위한 S 모델 Automata의 적용 연구 (A Study on the Application of S Model Automata for Multiple Objective Optimal Operation of Power Systems)

  • 이병하;박종근
    • 대한전기학회논문지:전력기술부문A
    • /
    • 제49권4호
    • /
    • pp.185-194
    • /
    • 2000
  • The learning automaton is an automaton to update systematically the strategy for enhancing the performance in response to the output results, and several schemes of learning automata have been presented. In this paper, S-model learning automata are applied in order to achieve the best compromise solution between an optimal solution for economic operation and an optimal solution for stable operation of the power system under the circumstance that the loads vary randomly. It is shown that learning automata are applied satisfactorily to the multiobjective optimization problem for obtaining the best tradeoff among the conflicting economy and stability objectives of power systems.

  • PDF

A new control approach for seismic control of buildings equipped with active mass damper: Optimal fractional-order brain emotional learning-based intelligent controller

  • Abbas-Ali Zamani;Sadegh Etedali
    • Structural Engineering and Mechanics
    • /
    • 제87권4호
    • /
    • pp.305-315
    • /
    • 2023
  • The idea of the combination of the fractional-order operators with the brain emotional learning-based intelligent controller (BELBIC) is developed for implementation in seismic-excited structures equipped with active mass damper (AMD). For this purpose, a new design framework of the mentioned combination namely fractional-order BEBIC (FOBELBIC) is proposed based on a modified-teaching-learning-based optimization (MTLBO) algorithm. The seismic performance of the proposed controller is then evaluated for a 15-story building equipped with AMD subjected to two far-field and two near-field earthquakes. An optimal BELBIC based on the MTLBO algorithm is also introduced for comparison purposes. In comparison with the structure equipped with a passive tuned mass damper (TMD), an average reduction of 44.7% and 42.8% are obtained in terms of the maximum absolute and RMS top floor displacement for FOBELBIC, while these reductions are obtained as 30.4% and 30.1% for the optimal BELBIC, respectively. Similarly, the optimal FOBELBIC results in an average reduction of 42.6% and 39.4% in terms of the maximum absolute and RMS top floor acceleration, while these reductions are given as 37.9% and 30.5%, for the optimal BELBIC, respectively. Consequently, the superiority of the FOBELBIC over the BELBIC is concluded in the reduction of maximum and RMS seismic responses.

최적 반복 학습 제어기법을 이용한 RTP의 웨이퍼 온도균일제어 (Control of Wafer Temperature Uniformity in Rapid Thermal Processing using an Optimal Iterative teaming Control Technique)

  • 이진호;진인식;이광순;최진훈
    • 제어로봇시스템학회:학술대회논문집
    • /
    • 제어로봇시스템학회 2000년도 제15차 학술회의논문집
    • /
    • pp.358-358
    • /
    • 2000
  • An iterative learning control technique based on a linear quadratic optimal criterion is proposed for temperature uniformity control of a silicon wafer in rapid thermal processing.

  • PDF

PID Type Iterative Learning Control with Optimal Gains

  • Madady, Ali
    • International Journal of Control, Automation, and Systems
    • /
    • 제6권2호
    • /
    • pp.194-203
    • /
    • 2008
  • Iterative learning control (ILC) is a simple and effective method for the control of systems that perform the same task repetitively. ILC algorithm uses the repetitiveness of the task to track the desired trajectory. In this paper, we propose a PID (proportional plus integral and derivative) type ILC update law for control discrete-time single input single-output (SISO) linear time-invariant (LTI) systems, performing repetitive tasks. In this approach, the input of controlled system in current cycle is modified by applying the PID strategy on the error achieved between the system output and the desired trajectory in a last previous iteration. The convergence of the presented scheme is analyzed and its convergence condition is obtained in terms of the PID coefficients. An optimal design method is proposed to determine the PID coefficients. It is also shown that under some given conditions, this optimal iterative learning controller can guarantee the monotonic convergence. An illustrative example is given to demonstrate the effectiveness of the proposed technique.

Q-러닝 기반의 선박의 최적 경로 생성 (Generation of Ship's Optimal Route based on Q-Learning)

  • 이형탁;김민규;양현
    • 한국항해항만학회:학술대회논문집
    • /
    • 한국항해항만학회 2023년도 춘계학술대회
    • /
    • pp.160-161
    • /
    • 2023
  • 현재 선박의 항해계획은 항해사의 지식과 경험적인 방법에 의존하고 있다. 그러나 최근에는 선박 자율운항기술이 발전됨에 따라, 항해계획의 자동화 기술도 여러 가지 방법으로 연구되고 있다. 본 연구에서는 강화학습 기법 중 하나인 Q-러닝을 기반으로 선박 최적 항해 경로를 생성하고자 한다. 강화학습은 다양한 상황에 대한 경험을 학습하고, 이를 기반으로 최적의 결정을 내리는 방식으로 적용된다.

  • PDF

딥러닝을 활용한 실시간 주식거래에서의 매매 빈도 패턴과 예측 시점에 관한 연구: KOSDAQ 시장을 중심으로 (A Study on the Optimal Trading Frequency Pattern and Forecasting Timing in Real Time Stock Trading Using Deep Learning: Focused on KOSDAQ)

  • 송현정;이석준
    • 한국정보시스템학회지:정보시스템연구
    • /
    • 제27권3호
    • /
    • pp.123-140
    • /
    • 2018
  • Purpose The purpose of this study is to explore the optimal trading frequency which is useful for stock price prediction by using deep learning for charting image data. We also want to identify the appropriate time for accurate forecasting of stock price when performing pattern analysis. Design/methodology/approach In order to find the optimal trading frequency patterns and forecast timings, this study is performed as follows. First, stock price data is collected using OpenAPI provided by Daishin Securities, and candle chart images are created by data frequency and forecasting time. Second, the patterns are generated by the charting images and the learning is performed using the CNN. Finally, we find the optimal trading frequency patterns and forecasting timings. Findings According to the experiment results, this study confirmed that when the 10 minute frequency data is judged to be a decline pattern at previous 1 tick, the accuracy of predicting the market frequency pattern at which the market decreasing is 76%, which is determined by the optimal frequency pattern. In addition, we confirmed that forecasting of the sales frequency pattern at previous 1 tick shows higher accuracy than previous 2 tick and 3 tick.

머신러닝을 활용한 어린이 스마트 횡단보도 최적입지 선정 - 창원시 사례를 중심으로 - (Machine Learning based Optimal Location Modeling for Children's Smart Pedestrian Crosswalk: A Case Study of Changwon-si)

  • 이수현;서용원;김세인;이재경;윤원주
    • 한국BIM학회 논문집
    • /
    • 제12권2호
    • /
    • pp.1-11
    • /
    • 2022
  • Road traffic accidents (RTAs) are the leading cause of accidental death among children. RTA reduction is becoming an increasingly important social issue among children. Municipalities aim to resolve this issue by introducing "Smart Pedestrian Crosswalks" that help prevent traffic accidents near children's facilities. Nonetheless such facilities tend to be installed in relatively limited number of areas, such as the school zone. In order for budget allocation to be efficient and policy effects maximized, optimal location selection based on machine learning is needed. In this paper, we employ machine learning models to select the optimal locations for smart pedestrian crosswalks to reduce the RTAs of children. This study develops an optimal location index using variable importance measures. By using k-means clustering method, the authors classified the crosswalks into three types after the optimal location selection. This study has broadened the scope of research in relation to smart crosswalks and traffic safety. Also, the study serves as a unique contribution by integrating policy design decisions based on public and open data.