• Title/Summary/Keyword: Learning Parameter

Search Result 681, Processing Time 0.026 seconds

Improvements of pursuit performance using episodic parameter optimization in probabilistic games (에피소드 매개변수 최적화를 이용한 확률게임에서의 추적정책 성능 향상)

  • Kwak, Dong-Jun;Kim, H.-Jin
    • Journal of the Korean Society for Aeronautical & Space Sciences
    • /
    • v.40 no.3
    • /
    • pp.215-221
    • /
    • 2012
  • In this paper, we introduce an optimization method to improve pursuit performance of a pursuer in a pursuit-evasion game (PEG). Pursuers build a probability map and employ a hybrid pursuit policy which combines the merits of local-max and global-max pursuit policies to search and capture evaders as soon as possible in a 2-dimensional space. We propose an episodic parameter optimization (EPO) algorithm to learn good values for the weighting parameters of a hybrid pursuit policy. The EPO algorithm is performed while many episodes of the PEG are run repeatedly and the reward of each episode is accumulated using reinforcement learning, and the candidate weighting parameter is selected in a way that maximizes the total averaged reward by using the golden section search method. We found the best pursuit policy in various situations which are the different number of evaders and the different size of spaces and analyzed results.

Repetitive learning method for trajectory control of robot manipulators using disturbance observer

  • Kim, Bong-Keun;Chung, Wan-Kyun;Youm, Youngil
    • 제어로봇시스템학회:학술대회논문집
    • /
    • 1996.10a
    • /
    • pp.99-102
    • /
    • 1996
  • A novel iterative learning control scheme comprising a unique feedforward learning controller and a disturbance observer is proposed. Disturbance observer compensates disturbance due to parameter variations, mechanical nonlinearities, unmodeled dynamics and external disturbances. The convergence and robustness of the proposed controller is proved by the method based on Lyapunov stability theorem. The results of numerical simulation are shown to verify the effectiveness of the proposed control scheme.

  • PDF

Robust control for external input perturbation using second order derivative of universal learning network

  • Ohbayashi, Masanao;Hirasawa, Kotaro
    • 제어로봇시스템학회:학술대회논문집
    • /
    • 1996.10a
    • /
    • pp.111-114
    • /
    • 1996
  • This paper proposes a robust control method using Universal Learning Network(U.L.N.) and second order derivatives of U.L.N.. Robust control considered here is defined as follows. Even if external input (equal to reference input in this paper) to the system at control stage changes awfully from that at learning stage, the system can be controlled so as to maintain a good performance. In order to realize such a robust control, a new term concerning the perturbation is added to a usual criterion function. And parameter variables are adjusted so as to minimize the above mentioned criterion function using the second order derivative of the criterion function with respect to the parameters.

  • PDF

New Learning Hybrid Model for Room Impulse Response Functions (새로운 학습 하이브리드 실내 충격 응답 모델)

  • Shin, Min-Cheol;Wang, Se-Myung
    • Proceedings of the Korean Society for Noise and Vibration Engineering Conference
    • /
    • 2007.11a
    • /
    • pp.23-27
    • /
    • 2007
  • Many trials have been used to model room impulse responses, all attempting to provide efficient representations of room acoustics. The traditional model designs for room impulse response seem to fail in accuracy, controllability, or computational efficiency. In time domain, a room impulse response is generally considered as the combination of three parts having different acoustic characteristics, initial time delay, early reflection, and late reverberation. This paper introduces new learning hybrid model for the room impulse response. In this proposed model, those three parts are modeled using different models with learning algorithms that determine the length or boundary of each model in the hybrid model. By the simulation with measured room impulse responses, it was examined that the performance of proposed model shows the best efficiency in views of both the parameter numbers and modeling error.

  • PDF

D.C. Motor Speed Control by Learning Gain Regulator (학습이득 조절기에 의한 직류 모터 속도제어)

  • Park, Wal-Seo;Lee, Sung-Su;Kim, Yong-Wook
    • Journal of the Korean Institute of Illuminating and Electrical Installation Engineers
    • /
    • v.19 no.6
    • /
    • pp.82-86
    • /
    • 2005
  • PID controller is widely used as automatic equipment for industry. However when a system has various characters of intermittence or continuance, a new parameter decision for accurate control is a bud task. As a method of solving this problem, in this paper, a teaming gain regulator as PID controller functions is presented. A propriety teaming gain of system is decided by a rule of Delta learning. The function of proposed loaming gain regulator is verified by simulation results of DC motor.

A MODIFIED EXTENDED KALMAN FILTER METHOD FOR MULTI-LAYERED NEURAL NETWORK TRAINING

  • KIM, KYUNGSUP;WON, YOOJAE
    • Journal of the Korean Society for Industrial and Applied Mathematics
    • /
    • v.22 no.2
    • /
    • pp.115-123
    • /
    • 2018
  • This paper discusses extended Kalman filter method for solving learning problems of multilayered neural networks. A lot of learning algorithms for deep layered network are sincerely suffered from complex computation and slow convergence because of a very large number of free parameters. We consider an efficient learning algorithm for deep neural network. Extended Kalman filter method is applied to parameter estimation of neural network to improve convergence and computation complexity. We discuss how an efficient algorithm should be developed for neural network learning by using Extended Kalman filter.

Parameter Adaptationin in Neural Network Using Fuzzy (퍼지를 이용한 신경망에서의 파라미터의 수정)

  • Lee, Kwong-Won;Ko, Joe-Ho;Bae, Young-Chul;Yim, Wha-Young
    • Proceedings of the KIEE Conference
    • /
    • 1997.07b
    • /
    • pp.383-385
    • /
    • 1997
  • Back-propagation is one of the efficient algorithms used to nonlinear optimizations or controls. In spite of its structual simplicity and learning ability, learning time is very long or bad case converge local minimum on complicate input patterns. In order to improve these matters varing learning rate and momentums were proposed. In this paper, to improve its performance fuzzy is adjusted in parameters, learning rate and momentums. Parameters are adjusted by errors and change of errors adaptively. In order to evaluate proposed method simulated with MATLAB on inverted pendulum.

  • PDF

LMI-Based Synthesis of Robust Iterative Learning Controller with Current Feedback for Linear Uncertain Systems

  • Xu, Jianming;Sun, Mingxuan;Yu, Li
    • International Journal of Control, Automation, and Systems
    • /
    • v.6 no.2
    • /
    • pp.171-179
    • /
    • 2008
  • This paper addresses the synthesis of an iterative learning controller for a class of linear systems with norm-bounded parameter uncertainties. We take into account an iterative learning algorithm with current cycle feedback in order to achieve both robust convergence and robust stability. The synthesis problem of the developed iterative learning control (ILC) system is reformulated as the ${\gamma}$-suboptimal $H_{\infty}$ control problem via the linear fractional transformation (LFT). A sufficient convergence condition of the ILC system is presented in terms of linear matrix inequalities (LMIs). Furthermore, the ILC system with fast convergence rate is constructed using a convex optimization technique with LMI constraints. The simulation results demonstrate the effectiveness of the proposed method.

Dynamic control of mobile robots using a robust.adaptive learning control method (강인.적응학습제어 방식에 의한 이동로봇의 동력학 제어)

  • Nam, Jae-Ho;Baek, Seung-Min;Guk, Tae-Yong
    • Journal of Institute of Control, Robotics and Systems
    • /
    • v.4 no.2
    • /
    • pp.178-186
    • /
    • 1998
  • In this paper, a robust.adaptive learning control scheme is presented for precise trajectory tracking of rigid mobile robots. In the proposed controller, a set of desired trajectories is defined and used in constructing the control input and learning rules which constitute the main part of the proposed controller. Stable operating characteristics such as precise trajectory tracking, parameter estimation, disturbance suppression, etc., are shown thorugh experiments and computer simulations.

  • PDF

A Study on the Second-order Iterative Learning Control Algorithm with Feedback (궤환을 갖는 2차 반복 학습제어 알고리즘에 관한 연구)

  • Huh, Kyung-Moo
    • The Transactions of the Korean Institute of Electrical Engineers A
    • /
    • v.48 no.5
    • /
    • pp.629-635
    • /
    • 1999
  • A second-order iterative learning control algorithm with feedback is proposed in this paper, in which a feedback term is added in the learning control scheme for the enhancement of convergence speed and robustness to disturbances or system parameter variations. The convergence proof of the proposed algorithm is givenl, and the sufficient condition for the convergence of the algorithm is provided. And it also includes the discussions about the convergence performance of the algorithm when the initial condition at the beginning of each iteration differs from the previous value of the initial. Simulation results show the validity and efficiency of the proposed algorithm.

  • PDF