• Title/Summary/Keyword: Q learning


Developing artificial football agents based upon multi-agent techniques in the AI world cup (AI World Cup 환경을 이용한 멀티 에이전트 기반 지능형 가상 축구 에이전트 구현)

  • Lee, Eunhoo;Seong, Hyeon-ah;Jung, Minji;Lee, Hye-in;Joung, Jinoo;Lee, Eui Chul;Lee, Jee Hang
    • Proceedings of the Korea Information Processing Society Conference / 2021.11a / pp.819-822 / 2021
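  • The AI World Cup environment is a virtual football environment in which multiple virtual agents form teams, interact with one another, and play matches. This paper presents the implementation and simulation results of virtual football agents that employ a variety of strategies and tactics using multi-agent learning and reasoning techniques in the AI World Cup environment. First, we implemented nine sets of Dynamic planning football agents that apply logic-based reasoning multi-agent techniques, enabling them to cooperate according to their roles and compete against an opponent. Next, we implemented learning-based football agents using Independent Q-Learning, which combines single reinforcement learning agents, and then extended this to multi-agent reinforcement learning to implement and simulate virtual football agents capable of role-based strategy learning. We measured win rates through matches among the implemented agents and analyzed the strength of their strategies. Simulation examples are available at https://github.com/I-hate-Soccer/Simulation.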
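The Independent Q-Learning approach named in the abstract above can be illustrated with a minimal sketch. It is not the authors' implementation: the class `IndependentQLearner`, the function `train_team`, the hyperparameters, and the one-step coordination game are all hypothetical stand-ins for the AI World Cup environment. It shows only the defining property of the technique: each agent keeps its own Q-table and updates it from the shared reward, treating the other agents as part of the environment.

```python
import random
from collections import defaultdict


class IndependentQLearner:
    """One tabular Q-learner; every agent owns a separate instance (hypothetical name)."""

    def __init__(self, n_actions, alpha=0.1, gamma=0.9, epsilon=0.2, seed=0):
        self.n_actions = n_actions
        self.alpha, self.gamma, self.epsilon = alpha, gamma, epsilon
        self.q = defaultdict(lambda: [0.0] * n_actions)  # state -> action values
        self.rng = random.Random(seed)

    def act(self, state):
        # Epsilon-greedy action selection over this agent's own Q-table.
        if self.rng.random() < self.epsilon:
            return self.rng.randrange(self.n_actions)
        values = self.q[state]
        return values.index(max(values))

    def update(self, state, action, reward, next_state):
        # Standard Q-learning update; other agents are ignored entirely.
        target = reward + self.gamma * max(self.q[next_state])
        self.q[state][action] += self.alpha * (target - self.q[state][action])


def train_team(n_agents=2, n_actions=2, episodes=2000, seed=0):
    """Toy game (an assumption): shared reward 1 only if every agent picks action 1."""
    agents = [IndependentQLearner(n_actions, seed=seed + i) for i in range(n_agents)]
    for _ in range(episodes):
        state = 0
        actions = [agent.act(state) for agent in agents]
        reward = 1.0 if all(a == 1 for a in actions) else 0.0
        for agent, action in zip(agents, actions):
            # State 1 is terminal and never updated, so its values stay 0.
            agent.update(state, action, reward, next_state=1)
    return agents


if __name__ == "__main__":
    for index, agent in enumerate(train_team()):
        print(index, agent.q[0])
```

Because each learner sees only its own action and the team reward, the environment looks non-stationary from its point of view; that limitation is one common motivation for moving from independent learners to explicitly multi-agent methods.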

Neuromorphic Sensory Cognition-Focused on Touch and Smell (뉴로모픽 감각 인지 기술 동향 - 촉각, 후각을 중심으로)

  • K.-H. Park;H.-K. Lee;Y. Kang;D. Kim;J.W. Lim;C.H. Je;J. Yun;J.-Y. Kim;S.Q. Lee
    • Electronics and Telecommunications Trends / v.38 no.6 / pp.62-74 / 2023
  • In response to diverse external stimuli, sensory receptors generate spiking nerve signals. These generated signals are transmitted to the brain along the neural pathway to advance to the stage of recognition or perception, and then they reach the area of discrimination or judgment for remembering, assessing, and processing incoming information. We review research trends in neuromorphic sensory perception technology inspired by biological sensory perception functions. Among the various senses, we consider sensory nerve decoding technology based on sensory nerve pathways focusing on touch and smell, neuromorphic synapse elements that mimic biological neurons and synapses, and neuromorphic processors. Neuromorphic sensory devices, neuromorphic synapses, and artificial sensory memory devices that integrate storage components are being actively studied. However, various problems remain to be solved, such as learning methods to implement cognitive functions beyond simple detection. Considering applications such as virtual reality, medical welfare, neuroscience, and cranial nerve interfaces, neuromorphic sensory recognition technology is expected to be actively developed based on new technologies, including combinatorial neurocognitive cell technology.

A DQN-based Two-Stage Scheduling Method for Real-Time Large-Scale EVs Charging Service

  • Tianyang Li;Yingnan Han;Xiaolong Li
    • KSII Transactions on Internet and Information Systems (TIIS) / v.18 no.3 / pp.551-569 / 2024
  • With the rapid development of the electric vehicle (EV) industry, EV charging service is becoming increasingly important, especially when a sudden drop in air temperature or a holiday sends large numbers of EVs in search of charging devices (CDs) within a short time. In such scenarios, an inefficient EV charging scheduling algorithm can lead to poor service quality, for example, long queueing times for EVs and unreasonable idle time for charging devices. To deal with this issue, this paper proposes a Deep-Q-Network (DQN) based two-stage scheduling method for large-scale EV charging service. Fine-grained states with two delicate neural networks are proposed to optimize the sequencing of EVs and the charging station (CS) arrangement. Two efficient algorithms are presented to obtain the optimal charging scheduling scheme for large-scale EV charging demand. Three case studies show the superiority of our proposal in terms of high service quality (minimized average queuing time of EVs and maximized charging performance on both the EV and CS sides) and greater scheduling efficiency. The code and data are available at THE CODE AND DATA.

Spatial effect on the diffusion of discount stores (대형할인점 확산에 대한 공간적 영향)

  • Joo, Young-Jin;Kim, Mi-Ae
    • Journal of Distribution Research / v.15 no.4 / pp.61-85 / 2010
  • Introduction: Diffusion is the process by which an innovation is communicated through certain channels over time among the members of a social system (Rogers 1983). Bass (1969) proposed the Bass model to describe the diffusion process. The Bass model assumes that potential adopters of an innovation are influenced by mass media and by word-of-mouth communication with previous adopters. Various extensions of the Bass model have been proposed. Some introduce a third factor affecting diffusion; others propose a multinational diffusion model that stresses the interactive effect on diffusion among several countries. We add a spatial factor to the Bass model as a third communication factor. Because the interaction between markets cannot be controlled, we need to consider that diffusion within a given market can be influenced by diffusion in a contiguous market. The expansion of a particular retail format within a market can be described by the retail life cycle. The diffusion of retail follows the three phases of spatial diffusion: adoption of the innovation happens near the diffusion center first, spreads to the vicinity of the diffusion center, and is then completed in peripheral areas in the saturation stage. We therefore expect the spatial effect to be important in describing the diffusion of domestic discount stores. We define a spatial diffusion model based on the multinational diffusion model and apply it to the diffusion of discount stores. Modeling: To define the spatial diffusion model, we extend the learning model (Kumar and Krishnan 2002) and separate the diffusion process in the diffusion center (market A) from the diffusion process in the vicinity of the diffusion center (market B). The proposed spatial diffusion model is shown in equations (1a) and (1b). 
Equation (1a) describes the diffusion process in the diffusion center, and equation (1b) describes it in the vicinity of the diffusion center.
$$\begin{aligned} S_{i,t} &= \left(p_i + q_i\frac{Y_{i,t-1}}{m_i}\right)(m_i - Y_{i,t-1}), \quad i \in \{1,\cdots,I\} && \text{(1a)} \\ S_{j,t} &= \left(p_j + q_j\frac{Y_{j,t-1}}{m_j} + \sum_{i=1}^{I}\gamma_{ij}\frac{Y_{i,t-1}}{m_i}\right)(m_j - Y_{j,t-1}), \quad j \in \{I+1,\cdots,I+J\} && \text{(1b)} \end{aligned}$$
We raise two research questions. (1) The proposed spatial diffusion model describes the diffusion of discount stores more effectively than the Bass model. (2) The more similar the retail environment of the diffusion center is to that of a neighboring market, the larger the spatial effect of the diffusion center on diffusion in that neighboring market. To examine these two questions, we first apply the Bass model to estimate the diffusion of discount stores. Next, we estimate it with the spatial diffusion model, in which a spatial factor is added to the Bass model. Finally, by comparing the Bass model with the spatial diffusion model, we determine which model describes the diffusion of discount stores better. In addition, we use correlation analysis to investigate the relationship between the similarity of retail environments (conceptual distance) and the impact of the spatial factor. Result and Implication: We suggest a spatial diffusion model to describe the diffusion of discount stores. To examine the proposed model, we use data on 347 domestic discount stores and divide the nation into 5 districts: Seoul-Gyeongin (SG), Busan-Gyeongnam (BG), Daegu-Gyeongbuk (DG), Gwangju-Jeonla (GJ), and Daejeon-Chungcheong (DC). The results are as follows. In the Bass model (I), the estimates of the innovation coefficient (p) and the imitation coefficient (q) are 0.017 and 0.323, respectively, and the estimate of market potential is 384. The Bass model (II), estimated for each district, shows that the estimate of the innovation coefficient (p) in SG is 0.019, the lowest among the 5 areas. This is because SG is the diffusion center. The estimate of the imitation coefficient (q) in BG is 0.353, the highest. The imitation coefficient in the vicinity of the diffusion center, such as BG, is higher than that in the diffusion center because more information flows through various paths as diffusion progresses. In the spatial diffusion model (IV), the coefficients change relative to those of the Bass model. Except for GJ, the estimates of the innovation and imitation coefficients in Model IV are lower than those in Model II. These changes in the innovation and imitation coefficients are reflected in the spatial coefficient (${\gamma}$). From the spatial coefficient (${\gamma}$) we can infer that diffusion in the vicinity of the diffusion center is influenced by diffusion in the diffusion center. The difference between the Bass model (II) and the spatial diffusion model (IV) is statistically significant: the ${\chi}^2$-distributed likelihood ratio statistic is 16.598 (p=0.0023), which implies that the spatial diffusion model describes the diffusion of discount stores more effectively than the Bass model. 
Research question (1) is therefore supported. In addition, correlation analysis shows a statistically significant relationship between the similarity of retail environments and the spatial effect, so research question (2) is also supported.
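The diffusion equations and the likelihood ratio test in the abstract above can be made concrete with a short sketch. The function names are hypothetical, and two assumptions are made that the abstract does not state: the own-market imitation term in (1b) is normalized by that market's own potential m_j, as in the standard Bass form, and the likelihood ratio statistic has 4 degrees of freedom (one spatial coefficient per non-center district).

```python
import math


def center_sales(p, q, m, y_prev):
    """Equation (1a): new adoptions in a diffusion-center market in period t."""
    return (p + q * y_prev / m) * (m - y_prev)


def vicinity_sales(p, q, m, y_prev, gammas, center_y_prev, center_m):
    """Equation (1b): adds a term gamma_ij * Y_i / m_i for each center market i."""
    spatial = sum(g * y / mi for g, y, mi in zip(gammas, center_y_prev, center_m))
    return (p + q * y_prev / m + spatial) * (m - y_prev)


def chi2_sf_even_df(x, df):
    """Chi-square survival function, closed form valid for even df only."""
    if df <= 0 or df % 2 != 0:
        raise ValueError("closed form requires a positive even df")
    term, total = 1.0, 1.0
    for i in range(1, df // 2):
        term *= (x / 2) / i
        total += term
    return math.exp(-x / 2) * total


if __name__ == "__main__":
    # Pooled Bass estimates reported in the abstract: p=0.017, q=0.323, m=384.
    print(center_sales(0.017, 0.323, 384, 0))
    # Reported likelihood ratio statistic, with df=4 as an assumption.
    print(chi2_sf_even_df(16.598, 4))
```

Under the df = 4 assumption, the computed tail probability rounds to the reported p = 0.0023, which suggests the assumption matches the test the authors ran.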

  • A Vector-Controlled PMSM Drive with a Continually On-Line Learning Hybrid Neural-Network Model-Following Speed Controller

    • El-Sousy, Fayez F. M.
      • Journal of Power Electronics / v.5 no.2 / pp.129-141 / 2005
    • A high-performance robust hybrid speed controller for a permanent-magnet synchronous motor (PMSM) drive with an on-line trained neural-network model-following controller (NNMFC) is proposed. The robust hybrid controller is a two-degrees-of-freedom (2DOF) integral plus proportional & rate feedback (I-PD) with neural-network model-following (NNMF) speed controller (2DOF I-PD NNMFC). The robust controller combines the merits of the 2DOF I-PD controller and the NNMF controller to regulate the speed of a PMSM drive. First, a systematic mathematical procedure is derived to calculate the parameters of the synchronous d-q axes PI current controllers and the 2DOF I-PD speed controller according to the required specifications for the PMSM drive system. Then, the resulting closed loop transfer function of the PMSM drive system including the current control loop is used as the reference model. In addition to the 2DOF I-PD controller, a neural-network model-following controller whose weights are trained on-line is designed to realize high dynamic performance in disturbance rejection and tracking characteristics. According to the model-following error between the outputs of the reference model and the PMSM drive system, the NNMFC generates an adaptive control signal which is added to the 2DOF I-PD speed controller output to attain robust model-following characteristics under different operating conditions regardless of parameter variations and load disturbances. A computer simulation is developed to demonstrate the effectiveness of the proposed 2DOF I-PD NNMF controller. The results confirm that the proposed 2DOF I-PD NNMF speed controller produces rapid, robust performance and accurate response to the reference model regardless of load disturbances or PMSM parameter variations.

    A Study on Quality Improvement of Medical Equipments (의료기기 QI 활동 개선방안에 대한 연구)

    • Kang, Hun-Hee;Juh, Ra-Hyeong;Kim, Jong-Soon;Kim, Seo-Hwak;Huh, Soo-Jin
      • Quality Improvement in Health Care / v.5 no.2 / pp.190-201 / 1998
    • Background: Medical equipment plays a very important role in the diagnosis and treatment of disease in modern medicine, and effective maintenance of the equipment is necessary to provide good health care to the public. After developing a new QC program for effective maintenance of medical equipment and practicing it for a year, we report the results of the new program. Methods: The maintenance data of 9 pieces of equipment in 8 categories, including a CT scanner, were analyzed with regard to the parts responsible for the most frequent failures and the causes of those failures. After identifying the most frequently failing parts and the causes of failure, we developed a new QC program that emphasizes preventive maintenance of those parts. We compared the number of failures per year and the active rate of each piece of equipment before and after the adoption of the new QC program. Results: The average number of failures per year per piece of equipment was 20.7 before and decreased by 43% to 11.9 after adoption of the new QC program. The average active rate of the equipment was 92.6% before and increased by 3.2 percentage points to 95.8% after adoption of the new program. Conclusions: The new QC program appears very useful, as it decreased the failure rate and increased the active rate of the equipment.


    A Study of Path-Finding Method of Small Unmanned Aerial Vehicles for Collision Avoidance (소형 무인비행체에서의 충돌회피를 위한 비행경로 생성에 관한 연구)

    • Shin, Saebyuk;Kim, Jinbae;Kim, Shin-Dug;Kim, Cheong Ghil
      • Journal of Satellite, Information and Communications / v.12 no.1 / pp.76-80 / 2017
    • With the fast-growing popularity of small UAVs (Unmanned Aerial Vehicles), recent UAV systems have been designed and used in various fields for their own specific purposes. UAVs are opening up many new opportunities in the fields of electronics, sensors, cameras, and software for pilots. Increasing awareness and mission capabilities of UAVs are driving innovations and new applications, helped by their low cost and their ability to undertake high-threat tasks. In particular, small unmanned aerial vehicles must fly at low altitude in environments with a high probability of unexpected sudden changes or obstacle appearances. In this paper, we introduce current research on autonomous flight techniques for small UAV systems and propose a draft idea for planning paths that allow small unmanned aerial vehicles with low-cost sensors to arrive safely at a given target in adversarial environments.

    A Knowledge-Based Intelligent Information Agent for Animal Domain (동물 영역 지식 기반의 지능형 정보 에이전트)

    • 이용현;오정욱;변영태
      • Korean Journal of Cognitive Science / v.10 no.1 / pp.67-78 / 1999
    • Information providers on the WWW have been increasing rapidly, and they provide a vast amount of information in various fields. As a result, it has become hard for users to get the information they want. Although several search engines help users with keyword matching methods, it is not easy to find suitable keywords. To solve these problems within a specific domain, we propose an intelligent information agent (HHA-la : HongIk Information Agent) that converts users' queries into forms that include related domain words, in order to represent the user's intention as fully as possible, and provides the necessary domain information to users. HHA-la has an ontological knowledge base of the animal domain, supplies the necessary information for queries from users and other agents, and provides relevant web page information. One of the system components is a WebDB, which indexes web pages relevant to the animal domain. The system also supplies new operators with which users can express their thoughts more clearly, and has a learning mechanism that uses accumulated results and user feedback to behave more intelligently. We implement the system and show the effectiveness of the information agent by presenting experimental results in this paper.


    A Case of Smith-Lemli-Opitz Syndrome in DHCR7 Mutation (DHCR 7 유전자 돌연변이로 확진된 스미스-렘리-오피츠 증후군 1례)

    • Jeong, Yu Ju;Huh, Rimm;Kwun, Younghee;Lee, Jieun;Cho, Sung Yoon;Ki, Chang-Seok;Jin, Dong-Kyu
      • Journal of The Korean Society of Inherited Metabolic disease / v.14 no.1 / pp.60-65 / 2014
    • Smith-Lemli-Opitz syndrome (SLOS) is an autosomal recessive disease caused by a defect in cholesterol biosynthesis. It results from mutations in the gene encoding 7-dehydrocholesterol reductase (DHCR7), which is located on chromosome 11q13. It is characterized by a typical facial appearance, microcephaly, a small upturned nose, cleft palate, and syndactyly, and is associated with cardiac, gastrointestinal, and genital malformations. There may also be mental retardation, behavioral problems, and growth retardation. It causes a broad spectrum of effects, ranging from a mild disorder of learning and behavior to lethal malformation. There are four reports of Smith-Lemli-Opitz syndrome in Korean children. Here, we describe a two-month-old female with microcephaly, toe syndactyly, and a cleft soft palate who was diagnosed with SLOS with c.1054C>T (p.R352W) and c.907G>A (p.G303R) mutations.

    The Effects of the Situation-Based Mathematical Problem Posing Activity on Problem Solving Ability and Mathematical Attitudes (상황제시형 수학 문제 만들기(WQA) 활동이 문제해결력 및 수학적 태도에 미치는 영향)

    • Kim, Kyeong-Ock;Ryu, Sung-Rim
      • School Mathematics / v.11 no.4 / pp.665-683 / 2009
    • The purpose of this study is to improve future mathematics instruction by analyzing the effects of a teaching and learning process that applies the situation-based mathematical problem posing activity on problem solving ability and mathematical attitudes. For this purpose, the research questions were established as follows: 1. How does the situation-based mathematical problem posing activity (WQA activity) change the problem solving ability of students? 2. How does the situation-based mathematical problem posing activity (WQA activity) change the mathematical attitudes of students? The results of the study were as follows: (1) There was a significant difference between the experimental group and the comparison group in problem solving ability. This means that the situation-based mathematical problem posing activity was generally more effective in improving problem solving ability than general classroom-based instruction. (2) There was no significant difference between the experimental group and the comparison group in overall mathematical attitudes. However, the experimental group's average scores on mathematical attitudes, except for mathematical confidence, were higher than those of the comparison group, and there was a significant difference in mathematical adaptability. The results obtained in this study suggest that the situation-based mathematical problem posing activity can be used to improve students' problem solving ability and mathematical attitudes.

