통합 검색 | Korea Science

Avoidance Behavior of Small Mobile Robots based on the Successive Q-Learning

Kim, Min-Soo
- 제어로봇시스템학회:학술대회논문집
- /
- 제어로봇시스템학회 2001년도 ICCAS
- /
- pp.164.1-164
- /
- 2001
Q-learning is a recent reinforcement learning algorithm that does not need a modeling of environment and it is a suitable approach to learn behaviors for autonomous agents. But when it is applied to multi-agent learning with many I/O states, it is usually too complex and slow. To overcome this problem in the multi-agent learning system, we propose the successive Q-learning algorithm. Successive Q-learning algorithm divides state-action pairs, which agents can have, into several Q-functions, so it can reduce complexity and calculation amounts. This algorithm is suitable for multi-agent learning in a dynamically changing environment. The proposed successive Q-learning algorithm is applied to the prey-predator problem with the one-prey and two-predators, and its effectiveness is verified from the efficient avoidance ability of the prey agent.
PDF

다 개체 시스템의 상태 일치를 위한 이산 시간 출력 궤환 협조 제어 알고리즘 (Discrete-Time Output Feedback Algorithm for State Consensus of Multi-Agent Systems)

김재용;이진영;김정수
- 전기학회논문지
- /
- 제60권3호
- /
- pp.625-631
- /
- 2011
This paper presents a discrete-time output feedback consensus algorithm for Multi-Agent Systems (MAS). Under the assumption that an agent is aware of the relative state information about its neighbors, a state feedback consensus algorithm is designed based on Linear Matrix Inequality (LMI) method. In general, however, it is possible to obtain its relative output information rather than the relative state information. To reconcile this problem, an Unknown Input Observer (UIO) is employed in this paper. To this end, first it is shown that the relative state information can be estimated using the UIO and the measured relative output information. Then a certainty-equivalence type output feedback consensus algorithm is proposed by combining the LMI-based state feedback consensus algorithm with the UIO. Finally, simulation results are given to illustrate that the proposed method successfully achieves the state consensus.
https://doi.org/10.5370/KIEE.2011.60.3.625 인용 PDF KSCI

Synthesis of New Biocompatible Multi-Functional Textile Finishing Agent

Ko, Young-Il;Jung, Chul-Won;Kim, Sung-Hoon;Bae, Jin-Seok
- 한국염색가공학회:학술대회논문집
- /
- 한국염색가공학회 2009년도 학술발표대회
- /
- pp.160-161
- /
- 2009
A methacrylate monomer having phospholipid polar group and cell membrane structure is known as highly biocompatible. Based on these properties, new biocompatible multi-functional textile finishing agent was developed using phospolipid copolymer. 2-Methacryloyloxyethyl phosphorylcholine (MPC) was synthesized using 2-hydroxyethyl methacrylate (HEMA), 2-chloro-2-oxo-1,3,2-dioxaphospholane (COP), trimethylamine (TMA) and triethylamine (TEA), and then polymerized to prepare MPC copolymer by radical polymerization using AIBN. The structures of MPC and MPCE were characterized by FTIR and 1H NMR and will be evaluated as textile finishing agent in further study.
PDF

Multi-agent Q-learning based Admission Control Mechanism in Heterogeneous Wireless Networks for Multiple Services

Chen, Jiamei;Xu, Yubin;Ma, Lin;Wang, Yao
- KSII Transactions on Internet and Information Systems (TIIS)
- /
- 제7권10호
- /
- pp.2376-2394
- /
- 2013
In order to ensure both of the whole system capacity and users QoS requirements in heterogeneous wireless networks, admission control mechanism should be well designed. In this paper, Multi-agent Q-learning based Admission Control Mechanism (MQACM) is proposed to handle new and handoff call access problems appropriately. MQACM obtains the optimal decision policy by using an improved form of single-agent Q-learning method, Multi-agent Q-learning (MQ) method. MQ method is creatively introduced to solve the admission control problem in heterogeneous wireless networks in this paper. In addition, different priorities are allocated to multiple services aiming to make MQACM perform even well in congested network scenarios. It can be observed from both analysis and simulation results that our proposed method not only outperforms existing schemes with enhanced call blocking probability and handoff dropping probability performance, but also has better network universality and stability than other schemes.
https://doi.org/10.3837/tiis.2013.10.003 인용 PDF KSCI

분산환경에서 멀티에이전트 상호협력을 통한 신뢰성 있는 정보검색기법 (Reliable Information Search mechanism through the cooperation of MultiAgent in Distributed Environment)

박민기;김귀태;이재완
- 인터넷정보학회논문지
- /
- 제5권5호
- /
- pp.69-77
- /
- 2004
인터넷이 널리 보급되면서 지능형 검색 에이전트들이 사용자의 요구를 만족시키기 위해 일반화되어 사용되고 있다. 그러나 이러한 지능형 멀티에이전트들은 서로 독립적으로 사용되어 멀티에이전트들 간의 분산된 정보를 원활하고 효율적으로 처리하기 위한 상호 협력 작용이 부족해 정보의 신뢰성이 낮고 동적으로 변화하는 분산 환경에 대처하기가 어렵다. 이런 문제를 해결하기 위해 본 논문에서는 멀티에이전트간의 효율적인 상호 협력과 빠른 정보처리를 위해 브로커 에이전트에 에이전시를 생성하고 신경망을 이용해 멀티에이전트들의 에이전시들을 분류하여 더욱 신속·정확한 정보를 사용자에게 제공하도록 한다. 또한 정보의 신뢰성을 위해서 에이전트 관리기법을 제안하여 기존의 검색 시스템이 가지고 있는 정보갱신문제를 향상시키고, 시뮬레이션을 통해 본 연구의 성능을 평가한다.
PDF

배전계통 리클로우저 기반의 자율적 고장복구 방법론 (The Self-Fault Restoration Methodology based on the Recloser in the Distribution Systems)

고윤석
- 전기학회논문지
- /
- 제58권9호
- /
- pp.1681-1688
- /
- 2009
This paper proposes a new fault restoration method which adopts the recloser as top agent to release the problems of the data concentration and fault processing delay of the existing DAS(distribution Automation System) under the ubiquitous distribution system. In proposed method, top agent collects the data based on the multi-casting communication with the tie switches of the interconnection point, and then selects a closed switch(tie switch) to transfer the sound outage load to other feeders based on the heuristic search strategy step by step until the load transfer work is finished. Here, a new heuristic rule is developed which can guarantee the relational load balancing and line loss from the collected voltage data. Finally, the several faults are simulated for typical multi-section and multi-interconnection distribution system to prove the effectiveness of the proposed strategy, in particular, for each simulation cases, the load balancing index and line loss index of the obtained solution from the proposed method is compared with those of all of feasible solutions.
PDF KSCI

유사 정보 추출에 기반한 조성 에이전트 모델 (A Coordination Agent Model based on Extracting Similar Information)

양소진;이현수;오경환
- 인지과학
- /
- 제12권1_2호
- /
- pp.55-63
- /
- 2001
90년대 들어 각광받고 있는 에이전트 기술은, 동현 또는 이형의 에이전트가 각자 맡은 일을 해결하는 일종의 멀티분산 시스템으로, 조정 에이전트가 여러 응용 에이전트의 통신과 제어를 담당하는 멀티 에이전트 시스템이 주류를 이룬다. 에이전트 기반 응용 기술로는 인터넷 정보검색, 온라인 쇼핑, 메시징, 네트워크 관리등이 있는데, 포괄적으로 본다면 이러한 기술은 인터넷의 보편화에 따라 넘쳐나는 정보를 효율적으로 다루기 위한 기술이라고 볼 수 있다. 본 논문의 목적은 유사도와 강화 학습을 사용하여, 정보를 제공하는 에이전트와 정보를 요청하는 에이전트간의 연결을 매개하는 조정 에이전트(Coordination Agent, Middle Agent) 구현 방식을 제안하는데 있다. 정보 에이전트를 사용하는 멀티 에이전트 시스템에서, 정보 에이전트는 조정 에이전트에게 자신의 정보를 등록하고, 질의 에이전트는 조정 에이전트에게 원하는 질의를 요청함으로써, 서로에 대한 지식 없이도 정보의 교류가 가능하다. 그러나. 정확한 정보의 교류가 가능하기 위해서는 정보 에이전트가 자신이 제공하는 정보를 조정 에이전트에 등록했을 때, 등록된 정보와 정확히 일치하지 않는 정보를 지의 에이전트가 요청했을 경우에 대한 적절한 대처 방안이 마련되어야 한다. 이를 위하여 본 논문에서는 질의 에이전트의 질의와 가장 밀접한 정보를 제공하는 것으로 판단되는 정보 에이전트를 찾는 방안을 제안하고 가상의 실험공간에서 얻어진 결과에 대하여 타당성을 검증한다.
PDF

지능형 멀티 에이전트 시스템 기반의 진학 상담 시스템 (Admission Consultation System based on Intelligent Multi-Agent System)

최동운;김수용
- 지능정보연구
- /
- 제12권1호
- /
- pp.43-55
- /
- 2006
사이버 교육 분야는 인터넷의 발전에 의해서 많은 변화를 하였다. 대학 진학을 위한 분야가 하나이다. 대학 입시 업무 흐름에 있어서 수험생들은 학교에 직접 방문하여 접수하는 방식과 각 지역 접수창구와 팩스를 이용하여 접수를 한다. 최근에는 이를 유기적으로 통합 관리할 수 있는 인터넷 접수 방식이 각광을 받고 있다. 본 논문의 다중 진학 상담 에이전트 시스템은 대학과 학과를 선택하는 과정에서 적합한 학과를 추천하여 주는 인터넷 원서 접수 시스템을 설계하고 구현하였다.
PDF

마이크로그리드 환경에서 부하차단을 위한 다중 에이전트 시스템의 설계 및 구현 (Design and Implementation of Multi-Agent System for Load Shedding in Microgrid)

임유진;김학만
- 정보처리학회논문지:컴퓨터 및 통신 시스템
- /
- 제1권3호
- /
- pp.125-132
- /
- 2012
마이크로그리드(microgrid)의 독립적 운용방식(islanded operation mode)에서는 전력 공급량과 전력 수요량 사이의 균형을 맞추기 위하여 부하차단(load shedding) 기법이 사용된다. 기존의 부하차단 기법들은 부하의 요구량이 연속적인 값의 범위를 갖는다고 가정하였다. 그러나 일부 부하는 이산적인 정수 값을 사용하여 자신의 요구량을 나타낸다. 따라서 본 논문에서는 독립형 운용방식 환경에서 부하 요구량의 이산적인 특성을 고려한 부하차단 기법을 다중 에이전트 시스템을 이용하여 설계한다. 또한 시스템 구조, 에이전트 기능, 그리고 에이전트 간 통신 프로토콜을 추가로 정의 및 확장함으로써 시스템을 구현하였으며, 구현된 시스템을 기반으로 다양한 시나리오 환경에서의 실험을 통하여 그 성능을 증명하였다.
https://doi.org/10.3745/KTCCS.2012.1.3.125 인용 PDF

멀티에이전트 기반 Deep Q-Network 모델을 이용한 동적 미사일 방어효과 개선 (Improving Dynamic Missile Defense Effectiveness Using Multi-Agent Deep Q-Network Model)

김민국;홍동욱;최봉완;경지훈
- 산업경영시스템학회지
- /
- 제47권2호
- /
- pp.74-83
- /
- 2024
The threat of North Korea's long-range firepower is recognized as a typical asymmetric threat, and South Korea is prioritizing the development of a Korean-style missile defense system to defend against it. To address this, previous research modeled North Korean long-range artillery attacks as a Markov Decision Process (MDP) and used Approximate Dynamic Programming as an algorithm for missile defense, but due to its limitations, there is an intention to apply deep reinforcement learning techniques that incorporate deep learning. In this paper, we aim to develop a missile defense system algorithm by applying a modified DQN with multi-agent-based deep reinforcement learning techniques. Through this, we have researched to ensure an efficient missile defense system can be implemented considering the style of attacks in recent wars, such as how effectively it can respond to enemy missile attacks, and have proven that the results learned through deep reinforcement learning show superior outcomes.
https://doi.org/10.11627/jksie.2024.47.2.074 인용 PDF

검색결과 589건 처리시간 0.027초

이메일무단수집거부

이용약관

제 1 장 총칙

제 2 장 이용계약의 체결

제 3 장 계약 당사자의 의무

제 4 장 서비스의 이용

제 5 장 계약 해지 및 이용 제한

제 6 장 손해배상 및 기타사항

자세히 찾기

이미지 검색 (β)