• Title/Summary/Keyword: multi-agent learning

Search Result 121, Processing Time 0.043 seconds

Cooperative Multi-Agent Reinforcement Learning-Based Behavior Control of Grid Sortation Systems in Smart Factory (스마트 팩토리에서 그리드 분류 시스템의 협력적 다중 에이전트 강화 학습 기반 행동 제어)

  • Choi, HoBin;Kim, JuBong;Hwang, GyuYoung;Kim, KwiHoon;Hong, YongGeun;Han, YounHee
    • KIPS Transactions on Computer and Communication Systems
    • /
    • v.9 no.8
    • /
    • pp.171-180
    • /
    • 2020
  • Smart Factory consists of digital automation solutions throughout the production process, including design, development, manufacturing and distribution, and it is an intelligent factory that installs IoT in its internal facilities and machines to collect process data in real time and analyze them so that it can control itself. The smart factory's equipment works in a physical combination of numerous hardware, rather than a virtual character being driven by a single object, such as a game. In other words, for a specific common goal, multiple devices must perform individual actions simultaneously. By taking advantage of the smart factory, which can collect process data in real time, if reinforcement learning is used instead of general machine learning, behavior control can be performed without the required training data. However, in the real world, it is impossible to learn more than tens of millions of iterations due to physical wear and time. Thus, this paper uses simulators to develop grid sortation systems focusing on transport facilities, one of the complex environments in smart factory field, and design cooperative multi-agent-based reinforcement learning to demonstrate efficient behavior control.

Design of Multi-agent System for Course Scheduling of Learner-oriented using Weakness Analysis Algorithm (취약성 분석 알고리즘을 이용한 학습자 중심의 코스 스케쥴링 멀티 에이전트 시스템의 설계)

  • Kim, Tae-Seog;Lee, Jong-Hee;Lee, Keun-Wang;Oh, Hae-Seok
    • The KIPS Transactions:PartA
    • /
    • v.8A no.4
    • /
    • pp.517-522
    • /
    • 2001
  • The appearance of web technology has accelerated a role of the development of the multimedia technology, the computer communication technology and the multimedia application contents. And serveral researches of WBI (Web-based Instruction) system have combined the technology of the digital library and LOD. Recently WBI (Web-based Instruction) model which is based on web has been proposed in the part of the new activity model of teaching-learning. And the demand of the customized coursewares which is required from the learners is increased, the needs of the efficient and automated education agents in the web-based instruction are recognized. But many education systems that had been studied recently did not service fluently the courses which learners had been wanting and could not provide the way for the learners to study the learning weakness which is observed in the continuous feedback of the course. In this paper we propose "Design of Multi-agent System for Course Scheduling of Learner-oriented using Weakness Analysis Algorithm". First proposed system monitors learner's behaviors constantly, evaluates them, and calculates his accomplishment. From this accomplishment the multi-agent schedules the suitable course for the learner. And the learner achieves a active and complete learning from the repeated and suitable course.le course.

  • PDF

Application of reinforcement learning to hyper-redundant system Acquisition of locomotion pattern of snake like robot

  • Ito, K.;Matsuno, F.
    • Proceedings of the Korea Inteligent Information System Society Conference
    • /
    • 2001.01a
    • /
    • pp.65-70
    • /
    • 2001
  • We consider a hyper-redundant system that consists of many uniform units. The hyper-redundant system has many degrees of freedom and it can accomplish various tasks. Applysing the reinforcement learning to the hyper-redundant system is very attractive because it is possible to acquire various behaviors for various tasks automatically. In this paper we present a new reinforcement learning algorithm "Q-learning with propagation of motion". The algorithm is designed for the multi-agent systems that have strong connections. The proposed algorithm needs only one small Q-table even for a large scale system. So using the proposed algorithm, it is possible for the hyper-redundant system to learn the effective behavior. In this algorithm, only one leader agent learns the own behavior using its local information and the motion of the leader is propagated to another agents with time delay. The reward of the leader agent is given by using the whole system information. And the effective behavior of the leader is learned and the effective behavior of the system is acquired. We apply the proposed algorithm to a snake-like hyper-redundant robot. The necessary condition of the system to be Markov decision process is discussed. And the computer simulation of learning the locomotion is demonstrated. From the simulation results we find that the task of the locomotion of the robot to the desired point is learned and the winding motion is acquired. We can conclude that our proposed system and our analysis of the condition, that the system is Markov decision process, is valid.

  • PDF

Earthwork Planning via Reinforcement Learning with Heterogeneous Construction Equipment (강화학습을 이용한 이종 장비 토목 공정 계획)

  • Ji, Min-Gi;Park, Jun-Keon;Kim, Do-Hyeong;Jung, Yo-Han;Park, Jin-Kyoo;Moon, Il-Chul
    • Journal of the Korea Society for Simulation
    • /
    • v.27 no.1
    • /
    • pp.1-13
    • /
    • 2018
  • Earthwork planning is one of the critical issues in a construction process management. For the construction process management, there are some different approaches such as optimizing construction with either mathematical methodologies or heuristics with simulations. This paper propose a simulated earthwork scenario and an optimal path for the simulation using a reinforcement learning. For reinforcement learning, we use two different Markov decision process, or MDP, formulations with interacting excavator agent and truck agent, sequenced learning, and independent learning. The simulation result shows that two different formulations can reach the optimal planning for a simulated earthwork scenario. This planning could be a basis for an automatic construction management.

Policy Modeling for Efficient Reinforcement Learning in Adversarial Multi-Agent Environments (적대적 멀티 에이전트 환경에서 효율적인 강화 학습을 위한 정책 모델링)

  • Kwon, Ki-Duk;Kim, In-Cheol
    • Journal of KIISE:Software and Applications
    • /
    • v.35 no.3
    • /
    • pp.179-188
    • /
    • 2008
  • An important issue in multiagent reinforcement learning is how an agent should team its optimal policy through trial-and-error interactions in a dynamic environment where there exist other agents able to influence its own performance. Most previous works for multiagent reinforcement teaming tend to apply single-agent reinforcement learning techniques without any extensions or are based upon some unrealistic assumptions even though they build and use explicit models of other agents. In this paper, basic concepts that constitute the common foundation of multiagent reinforcement learning techniques are first formulated, and then, based on these concepts, previous works are compared in terms of characteristics and limitations. After that, a policy model of the opponent agent and a new multiagent reinforcement learning method using this model are introduced. Unlike previous works, the proposed multiagent reinforcement learning method utilize a policy model instead of the Q function model of the opponent agent. Moreover, this learning method can improve learning efficiency by using a simpler one than other richer but time-consuming policy models such as Finite State Machines(FSM) and Markov chains. In this paper. the Cat and Mouse game is introduced as an adversarial multiagent environment. And effectiveness of the proposed multiagent reinforcement learning method is analyzed through experiments using this game as testbed.

INFLUENCE OF LEADER ON ORGANIZATIONAL LEARNING IN CONSTRUCTION TEAMS

  • Chieh-Chi Cheng;Jiin-Song Tsai
    • International conference on construction engineering and project management
    • /
    • 2009.05a
    • /
    • pp.338-344
    • /
    • 2009
  • Organizational learning of construction team has been long addressed in the literatures, but the mechanism of learning and the influence of leader in the team still remain vague. This paper presents a computational model (OLT) depicting the mechanism and the influence of leader in a systemic way. The OLT model is a multi-agent system based on some eloquent propositions proposed in previous researches. The proposed model is preliminarily validated by some toy-problem simulations. In the OLT model, the leader is assigned as a project manager. The results show that a proper leader can effectively improve the learning process and the result-in performance, in which the team learning is mainly affected by both the leader and the majority in a team. Based on our findings, two propositions are concluded accordingly: (1) Learning of a team would be enhanced if a proper leader is assigned; (2) The effectiveness of learning would increase in a team, in which the members retain explorative attitudes.

  • PDF

Pedestrian Multi-Agent Model in College Town Streets (대학촌 가로의 보행환경 개선을 위한 보행자 멀티에이전트(Pedestrian Multi-Agent) 모델링)

  • Moon, Tae-Heon;Han, Soo-Chel;Sung, Han-Uk;Jeong, Kyeong-Seok
    • Journal of the Korean Association of Geographic Information Studies
    • /
    • v.9 no.2
    • /
    • pp.194-205
    • /
    • 2006
  • The purpose of this study is to develop a pedestrian multi-agent model and simulation system using multi-agent theory, which may be utilized as a planning support system for building a comfort and safe environment of pedestrian street. Differing from existing pedestrian models, however, every single pedestrian was regarded as an individual agent in the model. Multiple agents like multiple pedestrians in the street then maintain their own characteristics and respond to surrounding environment. In addition their moving behavior are made by their own decision rules that they have or had acquired through the interactive communications or learning between agents like real world. After verifying the model validation, as the $R^2$ between the predicted value and observed value was up to 0.781, the developed model was applied to Gazwa district within Gyeongsang university village. The simulation system was developed by Flash MX action scripts and the physical environment of the streets was configured with the digital map and ArcGis within computer virtual space. The attribute data of buildings such as type and size of commercial business were collected through the field survey and combined with physical features. Then the effect of the variation of building attractiveness and the occurrence of street events to pedestrian environment were simulated. Through the experiments this study could make suggestions to improve pedestrian environment.

  • PDF

Research of soccer robot system strategies

  • Sugisaka, Masanori;Kiyomatsu, Toshiro;Hara, Masayoshi
    • 제어로봇시스템학회:학술대회논문집
    • /
    • 2002.10a
    • /
    • pp.92.4-92
    • /
    • 2002
  • In this paper, as an ideal test bed for studies on multi-agent system, the multiple micro robot soccer playing system is introduced at first. The construction of such experimental system has involved lots of kinds of challenges such as sensors fusing, robot designing, vision processing, motion controlling, and especially the cooperation planning of those robots. So in this paper we want to stress emphasis on how to evolve the system automatically based on the model of behavior-based learning in multi-agent domain. At first we present such model in common sense and then apply it to the realistic experimental system . At last we will give some results showing that the proposed approach is feasi...

  • PDF

The Analysis of Flatland Challenge Winners' Multi-agent Methodologies

  • Choi, BumKyu;Kim, Jong-Kook
    • Annual Conference of KIPS
    • /
    • 2021.05a
    • /
    • pp.369-372
    • /
    • 2021
  • Scheduling the movements of trains in the modern railway system is becoming essential and important. Swiss Federal Railway Company (SBB) and machine learning researchers began collaborating to make a simulation environment and held a Flatland challenge. In this paper, the methodologies of the winners of this competition are analyzed to achieve insight and research trends. This problem is similar to the Multi-Agent Path Finding (MAPF) and Vehicle Rescheduling Problem (VRSP). The potential of the attempted methods from the Flatland challenge to be applied to various transportation systems as well as railways is discussed.