• Title/Summary/Keyword: autonomous learning neural network (자율 학습 신경망)

Search Result 73, Processing Time 0.024 seconds

A Study of Real Time Object Tracking using Reinforcement Learning (강화학습을 사용한 실시간 이동 물체 추적에 관한 연구)

  • 김상헌;이동명;정재영;운학수;박민욱;김관형
    • Proceedings of the Korean Institute of Intelligent Systems Conference
    • /
    • 2003.09b
    • /
    • pp.87-90
    • /
    • 2003
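  • In the past, the main goal of mobile robot systems was fully autonomous navigation, and image information served only as an auxiliary means of monitoring. Today, however, research that uses image information is active in diverse applications such as moving-object tracking, object recognition and classification, and feature extraction. On the control side, many nonlinear control problems that were difficult to solve with traditional control techniques have been addressed with intelligent control methods, which often use neural networks. Recently, reinforcement learning has become a widely used way of training such networks. Reinforcement learning learns, through trial and error on a dynamic control plane, which action to take in each situation to achieve a goal; it therefore acquires the state-action correspondence over a large number of trials. For the control parameters, the possible states and actions, the state transitions, and reward algorithms that can yield an optimal solution are being studied in various ways. The system studied in this paper uses a vision system and a Strong Arm board to identify the color and shape of the target object and then track it in real time. In addition, by using reinforcement learning to cope more flexibly with the nonlinear tendencies of object motion, the paper proposes, through experiments, a method for tracking objects more stably, quickly, and accurately.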
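
The trial-and-error learning of a state-to-action mapping described in this abstract can be illustrated with a toy example. The abstract does not say which reinforcement learning algorithm the authors used, so the sketch below assumes tabular Q-learning; the discretized offsets, the three pan actions, the reward, and all hyperparameters are hypothetical choices made only for illustration.

```python
import numpy as np

rng = np.random.default_rng(0)

OFFSETS = np.arange(-3, 4)        # discretized object offset from image center
ACTIONS = np.array([-1, 0, 1])    # pan camera left, hold, pan right
Q = np.zeros((len(OFFSETS), len(ACTIONS)))


def step(offset, action):
    """Panning toward the object reduces the offset; reward penalizes error."""
    new_offset = int(np.clip(offset - action, -3, 3))
    return new_offset, -abs(new_offset)


def train(episodes=500, steps=20, alpha=0.5, gamma=0.9, eps=0.2):
    for _ in range(episodes):
        offset = int(rng.integers(-3, 4))
        for _ in range(steps):
            s = offset + 3
            # epsilon-greedy: mostly exploit, sometimes try a random action
            a = int(rng.integers(3)) if rng.random() < eps else int(np.argmax(Q[s]))
            offset, r = step(offset, ACTIONS[a])
            # Q-learning update toward reward plus discounted best next value
            Q[s, a] += alpha * (r + gamma * Q[offset + 3].max() - Q[s, a])


train()
# Greedy action for each offset after training.
policy = {int(o): int(ACTIONS[np.argmax(Q[o + 3])]) for o in OFFSETS}
```

In this toy setting the learned policy pans toward the object and holds once it is centered; a real tracker would need a state that also captures object velocity.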

Development of Autonomous Algorithm Using an Online Feedback-Error Learning Based Neural Network for Nonholonomic Mobile Robots (온라인 피드백 에러 학습을 이용한 이동 로봇의 자율주행 알고리즘 개발)

  • Lee, Hyun-Dong;Myung, Byung-Soo
    • Journal of the Korean Institute of Intelligent Systems
    • /
    • v.21 no.5
    • /
    • pp.602-608
    • /
    • 2011
  • This study proposes a method of designing a neurointerface using a neural network (NN) for controlling nonholonomic mobile robots. Following the concept of virtual master-slave robots, a partially stable inverse dynamic model of the master robot is acquired online through the NN by applying feedback-error learning, in which the feedback controller is assumed to be a PD compensator for the nonholonomic robot. The NN for online feedback-error learning is composed of an input layer with six units for the inputs $x_i$, $i=1,\dots,6$, a hidden layer with two units for the hidden outputs $o_j$, $j=1,2$, and an output layer with two units for the outputs $\tau_k$, $k=1,2$. A tracking control problem is demonstrated in simulations of a nonholonomic mobile robot with two independent driving wheels. The initial value of q was set to $[0, 5, \pi]$.
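
As a rough illustration of feedback-error learning with the 6-2-2 layout given in the abstract, the sketch below uses the PD feedback torque as the error signal that trains the network online. It is not the authors' implementation: the tanh activation, the absence of biases, the weight initialization, the PD gains, and the learning rate are all assumptions.

```python
import numpy as np

rng = np.random.default_rng(0)

# Layer sizes taken from the abstract: 6 inputs, 2 hidden units, 2 outputs.
N_IN, N_HID, N_OUT = 6, 2, 2

# Hypothetical small random initial weights (no biases, for brevity).
W1 = rng.normal(scale=0.1, size=(N_HID, N_IN))
W2 = rng.normal(scale=0.1, size=(N_OUT, N_HID))


def forward(x):
    """Return hidden outputs o_j and network torques tau_k for input x."""
    o = np.tanh(W1 @ x)   # tanh is an assumed activation
    tau_nn = W2 @ o       # linear output layer
    return o, tau_nn


def pd_feedback(e, e_dot, kp=5.0, kd=1.0):
    """PD compensator; the gains kp and kd are illustrative values."""
    return kp * e + kd * e_dot


def fel_step(x, e, e_dot, lr=0.01):
    """One online update: the PD output acts as the NN's error signal."""
    global W1, W2
    o, tau_nn = forward(x)
    tau_fb = pd_feedback(e, e_dot)
    # Backpropagate tau_fb as if it were (target - output).
    delta_hid = (W2.T @ tau_fb) * (1.0 - o ** 2)
    W2 += lr * np.outer(tau_fb, o)
    W1 += lr * np.outer(delta_hid, x)
    return tau_nn + tau_fb   # total torque sent to the robot
```

As the network comes to approximate the inverse dynamics, the feedback term is expected to shrink, so the learning signal fades on its own.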

BLE-based Indoor Positioning System design using Neural Network (신경망을 이용한 BLE 기반 실내 측위 시스템 설계)

  • Shin, Kwang-Seong;Lee, Heekwon;Youm, Sungkwan
    • Journal of the Korea Institute of Information and Communication Engineering
    • /
    • v.25 no.1
    • /
    • pp.75-80
    • /
    • 2021
  • Positioning technology performs important functions in augmented reality, smart factories, and autonomous driving. Among positioning techniques, beacon-based positioning has been considered challenging because of the variation in RSSI values. In this study, the position of a moving object is predicted by training a neural network that takes the receiver's RSSI value as input and the distance as the target value. To do this, measured distance-versus-RSSI data were collected. A neural network was introduced to create synthetic data from the collected actual data, and this network was used to predict RSSI as a function of distance. The RSSI values produced by the synthetic-data network were then used to estimate the coordinates of the object by training a neural network that tracks the location of a terminal in a virtual environment.

Development of Autonomous Vehicle Learning Data Generation System (자율주행 차량의 학습 데이터 자동 생성 시스템 개발)

  • Yoon, Seungje;Jung, Jiwon;Hong, June;Lim, Kyungil;Kim, Jaehwan;Kim, Hyungjoo
    • The Journal of The Korea Institute of Intelligent Transport Systems
    • /
    • v.19 no.5
    • /
    • pp.162-177
    • /
    • 2020
  • In an autonomous driving system, perception of the traffic environment based on various sensors is directly related to driving safety. Recently, with advances in machine learning and deep neural network technology, perception models based on deep neural networks have come into use, so both training of the perception model and a high-quality training dataset are required. However, there are several practical difficulties in collecting data on every situation that may occur in autonomous driving. The performance of the perception model may deteriorate because of differences between overseas and domestic traffic environments, and the quality of data collected in bad weather, where the sensors cannot operate normally, cannot be guaranteed. Therefore, it is necessary to build a virtual road environment in a simulator, rather than use actual roads, to collect the training data. This paper suggests a training dataset collection process that diversifies the weather, illumination, sensor position, and type and number of vehicles in a simulator environment that reproduces domestic road conditions. To achieve better performance, the authors translated the image domain to be closer to real images and diversified the data. Performance was evaluated on test data collected in the actual road environment, and it was similar to that of a model trained only on real-environment data.

Local Path Generation Method for Unmanned Autonomous Vehicles Using Reinforcement Learning (강화학습을 이용한 무인 자율주행 차량의 지역경로 생성 기법)

  • Kim, Moon Jong;Choi, Ki Chang;Oh, Byong Hwa;Yang, Ji Hoon
    • KIPS Transactions on Software and Data Engineering
    • /
    • v.3 no.9
    • /
    • pp.369-374
    • /
    • 2014
  • Path generation methods are required for safe and efficient driving in unmanned autonomous vehicles. There are two kinds of paths: global and local. A global path consists of all the way points including the source and the destination. A local path is the trajectory that a vehicle needs to follow from a way point to the next in the global path. In this paper, we propose a novel method for local path generation through machine learning, with an effective curve function used for initializing the trajectory. First, reinforcement learning is applied to a set of candidate paths to produce the best trajectory with maximal reward. Then the optimal steering angle with respect to the trajectory is determined by training an artificial neural network. Our method outperformed existing approaches and successfully found quality paths in various experimental settings, including the cases with obstacles.

Moving Path Following of Autonomous Mobile Robot using Neural Network (신경망을 이용한 자율이동로봇의 이동 경로 추종)

  • 주기세
    • Journal of the Korea Institute of Information and Communication Engineering
    • /
    • v.4 no.3
    • /
    • pp.585-594
    • /
    • 2000
  • With a classical control algorithm, exact path following by an autonomous mobile robot in a factory or an unreliable environment has many disadvantages. In this paper, a neural network control approach based on the error back-propagation algorithm is proposed for controlling a mobile robot to follow a line installed on the road. Because the input patterns used for learning include not only the three pieces of information recognized by the three sensors attached to the mobile robot but also ten detailed pieces of information for the non-recognition area, the mobile robot moves smoothly along the installed line despite the non-perception space. The mobile robot also minimizes error and reaches its destination in a short time. To test the effectiveness of the proposed controller, the velocity changes of the two motors caused by changes in the moving angle of the mobile robot are simulated by computer.

Autonomous control of bicycle using Deep Deterministic Policy Gradient Algorithm (Deep Deterministic Policy Gradient 알고리즘을 응용한 자전거의 자율 주행 제어)

  • Choi, Seung Yoon;Le, Pham Tuyen;Chung, Tae Choong
    • Convergence Security Journal
    • /
    • v.18 no.3
    • /
    • pp.3-9
    • /
    • 2018
  • The Deep Deterministic Policy Gradient (DDPG) algorithm learns by combining artificial neural networks and reinforcement learning. Among recent reinforcement learning methods, DDPG has the advantage that, because it learns off-policy, it prevents wrong actions from accumulating and affecting learning. In this study, we experimented with controlling a bicycle autonomously by applying the DDPG algorithm. Simulations were carried out in various environments, and the method used in the experiment was shown to work stably in simulation.
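
Two standard ingredients of DDPG that support the off-policy learning mentioned above are a replay buffer and slowly updated target networks. The sketch below shows only those two pieces, under stated assumptions: the class and function names, the buffer capacity, and the value of `tau` are illustrative, parameters are represented as plain NumPy arrays, and the actor and critic networks themselves are omitted.

```python
import random
from collections import deque

import numpy as np


class ReplayBuffer:
    """Stores past transitions so updates can reuse off-policy experience."""

    def __init__(self, capacity=10_000):
        self.data = deque(maxlen=capacity)   # oldest transitions drop out first

    def add(self, state, action, reward, next_state, done):
        self.data.append((state, action, reward, next_state, done))

    def sample(self, batch_size):
        batch = random.sample(list(self.data), batch_size)
        # Transpose the list of transitions into one array per field.
        return [np.array(field) for field in zip(*batch)]


def soft_update(target_params, online_params, tau=0.005):
    """Move each target parameter a small step toward its online counterpart."""
    for t, o in zip(target_params, online_params):
        t *= (1.0 - tau)   # in-place, so the caller's arrays are modified
        t += tau * o
```

Sampling uniformly from stored transitions decorrelates consecutive updates, and the small `tau` keeps the bootstrapping targets from shifting too quickly.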

Trends in Monocular Depth Estimation Technology (단안 깊이 추정 기술 동향)

  • Kim, Won-Jun
    • Broadcasting and Media Magazine
    • /
    • v.27 no.2
    • /
    • pp.43-50
    • /
    • 2022
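  • Techniques for estimating scene depth from a single image are widely applied in fields such as autonomous driving and indoor and outdoor robot-based services. As deep learning algorithms are actively studied, the range of industrial applications of monocular depth estimation is expanding. However, precisely predicting depth boundaries remains difficult, and building 3D depth datasets captured in diverse real environments is costly. This article introduces recent trends in deep-neural-network-based monocular depth estimation research that address these problems, covering in detail methods from supervised learning to the recently active unsupervised learning approaches. It also briefly presents performance evaluation results for representative methods.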

Forecasting of Runoff Hydrograph Using Neural Network Algorithms (신경망 알고리즘을 적용한 유출수문곡선의 예측)

  • An, Sang-Jin;Jeon, Gye-Won;Kim, Gwang-Il
    • Journal of Korea Water Resources Association
    • /
    • v.33 no.4
    • /
    • pp.505-515
    • /
    • 2000
  • The purpose of this study is to forecast runoff hydrographs for rainfall events in a stream. Neural network theory is used as a hydrologic black-box model to solve hydrological problems. Two neural network (NN) models are used: the back-propagation (BP) algorithm with the Levenberg-Marquardt (LM) technique and the radial basis function (RBF) network. Runoff hydrographs are forecast for the Bocheong stream basin, an IHP representative basin. The possibility of simulating runoff hydrographs at unlearned stations is also considered. The results show that the NN models effectively learn the rainfall-runoff process of a hydrologic system that involves complexity and nonlinear relationships. The RBF network learns in two steps: unsupervised learning in the hidden layer, followed by supervised learning in the output layer. The RBF network can therefore save time in the learning step compared with the BP algorithm. In estimation for an unlearned area, the peak discharges of both the BP algorithm and the RBF network model tend toward the observed values.
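
The two-step RBF training described in this abstract can be sketched as follows. The abstract does not name the specific procedures, so this example assumes k-means for the unsupervised hidden-layer step and linear least squares for the supervised output-layer step; the Gaussian width `sigma`, the number of centers, and the sine-wave toy data are likewise hypothetical and not taken from the paper.

```python
import numpy as np


def kmeans_1d(x, k, iters=50, seed=0):
    """Unsupervised step: place k RBF centers with plain k-means."""
    rng = np.random.default_rng(seed)
    centers = rng.choice(x, size=k, replace=False)
    for _ in range(iters):
        labels = np.argmin(np.abs(x[:, None] - centers[None, :]), axis=1)
        for j in range(k):
            if np.any(labels == j):          # keep old center if cluster is empty
                centers[j] = x[labels == j].mean()
    return centers


def design_matrix(x, centers, sigma):
    """Gaussian hidden-layer activations plus a constant bias column."""
    phi = np.exp(-((x[:, None] - centers[None, :]) ** 2) / (2.0 * sigma ** 2))
    return np.hstack([phi, np.ones((len(x), 1))])


def fit_rbf(x, y, k=10, sigma=1.0):
    centers = kmeans_1d(x, k)
    # Supervised step: linear least squares for the output weights.
    weights, *_ = np.linalg.lstsq(design_matrix(x, centers, sigma), y, rcond=None)
    return centers, weights


def predict(x, centers, weights, sigma=1.0):
    return design_matrix(x, centers, sigma) @ weights


# Toy data standing in for a rainfall-runoff series (not from the paper).
x = np.linspace(0.0, 2.0 * np.pi, 200)
y = np.sin(x)
centers, weights = fit_rbf(x, y)
rmse = float(np.sqrt(np.mean((predict(x, centers, weights) - y) ** 2)))
```

Because the output weights are obtained from a single linear solve rather than iterative gradient descent, this kind of training is typically fast, which is consistent with the time saving the abstract reports relative to BP.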

Application of Neural Network Self Adaptative Control System for A.C. Servo Motor Speed Control (A.C. 서보모터 속도 제어를 위한 신경망 자율 적응제어 시스템의 적용)

  • Park, Wal-Seo;Lee, Seong-Soo;Kim, Yong-Wook;Yoo, Seok-Ju
    • Journal of the Korean Institute of Illuminating and Electrical Installation Engineers
    • /
    • v.21 no.7
    • /
    • pp.103-108
    • /
    • 2007
  • Neural networks are currently used in many fields of control systems. However, it is not easy to obtain input-output patterns when a neural network is used as a single feedback controller, and it is difficult to achieve satisfactory performance when the load changes rapidly or a disturbance is applied. To resolve these problems, this paper proposes a new way to implement a neural network controller by installing a real object in place of the activation function of the neural network's output node. Because the neural network self-adaptive control system has a simple structure, the input-output pattern problem is solved naturally, and real-time learning becomes possible through the general back-propagation algorithm. The effectiveness of the proposed neural network self-adaptive control algorithm was verified in a speed-control test of an A.C. servo motor equipped with a DSP (TMS320C32) capable of high-speed computation, on which the proposed algorithm was loaded.