• Title/Summary/Keyword: action constraints

Search Result 58, Processing Time 0.028 seconds

A Study of Collaborative and Distributed Multi-agent Path-planning using Reinforcement Learning

  • Kim, Min-Suk
    • Journal of the Korea Society of Computer and Information
    • /
    • v.26 no.3
    • /
    • pp.9-17
    • /
    • 2021
  • In this paper, an autonomous multi-agent path planning using reinforcement learning for monitoring of infrastructures and resources in a computationally distributed system was proposed. Reinforcement-learning-based multi-agent exploratory system in a distributed node enable to evaluate a cumulative reward every action and to provide the optimized knowledge for next available action repeatedly by learning process according to a learning policy. Here, the proposed methods were presented by (a) approach of dynamics-based motion constraints multi-agent path-planning to reduce smaller agent steps toward the given destination(goal), where these agents are able to geographically explore on the environment with initial random-trials versus optimal-trials, (b) approach using agent sub-goal selection to provide more efficient agent exploration(path-planning) to reach the final destination(goal), and (c) approach of reinforcement learning schemes by using the proposed autonomous and asynchronous triggering of agent exploratory phases.

A study on the determination of the number of mobility cluster (적정 이동군집수 결정에 관한 연구)

  • ;Ham, Sung Hun
    • Journal of the Korean Geographical Society
    • /
    • v.30 no.2
    • /
    • pp.120-131
    • /
    • 1995
  • To analyze mobility patterns, this study used three Constraint (Capability Constraint, Coupling Constraint, Authority Constraint) models which were proposed in Dr. Hagerstrand's Time-space theory. This paper shows that three constraint models have some effects upon mobility by age. In this study, Capability Constraint means a certain special constraint that is what we can't do during proceeding basic natural urges like sleep, fare, etc. Coupling constraint is a physical one. Each person limits the action range for staying on a special place in special time. For instance, students have to stay in school so that they have mobility constraints. Authority Constraint is a social one. When we use urban facilities or traffic, we may be controlled by mobility sphere by an agreement or a social position. It is social agreement that the opening hour of a store, the time table of mass-transportation and a social positional control that the personal income, the standard of education. In this study it has been in a process of determination of the cluster number that degree of influences a social constraint to mobility. Considering the mobility constraint of characteristics of space divides urban and rural, people in urban area have higher mobility rate than in rural area. Resuets of determination of the cluster, show similar mobility pattern. People in urban area are connected verity of mobility which related to urban space structures with determination of cluste-number. That is to say, mobility patterns can be changed by space charactcristics. Constraints by sex and age are also social constraints and they are influenced by mobility patterns. For instance, females at the age of twenties have similar mobility pattern to the same age male but they have sudden changes after thirty's age. Male entertains a similar pattern without restriction of age. That is to say, management by sex as a social constraint affects mobility. To establish more realistic traffie policy, mobility formation should be reflected to the space in a view of social-behavioral science. To embody this, some problems should be investigated as follows. 1. As a problem of methodology, if sufficient samples ensured, we could subdivide clusters and could open up a new method of analyzing the mobility clusters by using the neuro-network. 2. Extracting actions connected with mobility and finding life cycle which is classified by daily cluste-characteristics, suitable counterproposal could be presented to the traific policy.

  • PDF

Fine Grain Real-Time Code Scheduling Using an Adaptive Genetic Algorithm (적합 유전자 알고리즘을 이용한 실시간 코드 스케쥴링)

  • Chung, Tai-Myoung
    • The Transactions of the Korea Information Processing Society
    • /
    • v.4 no.6
    • /
    • pp.1481-1494
    • /
    • 1997
  • In hard real-time systems, a timing fault may yield catastrophic results. Dynamic scheduling provides the flexibility to compensate for unexpected events at runtime; however, scheduling overhead at runtime is relatively large, constraining both the accuracy of the timing and the complexity of the scheduling analysis. In contrast, static scheduling need not have any runtime overhead. Thus, it has the potential to guarantee the precise time at which each instruction implementing a control action will execute. This paper presents a new approach to the problem of analyzing high-level language code, augmented by arbitrary before and after timing constraints, to provide a valid static schedule. Our technique is based on instruction-level complier code scheduling and timing analysis, and can ensure the timing of control operations to within a single instruction clock cycle. Because the search space for a valid static schedule is very large, a novel adaptive genetic search algorithm was developed.

  • PDF

System identification and admittance model-based nanodynamic control of ultra-precision cutting process (다이아몬드 터닝 머시인의 극초정밀 절삭공정에서의 시스템 규명 및 제어)

  • 정상화;김상석;오용훈
    • 제어로봇시스템학회:학술대회논문집
    • /
    • 1996.10b
    • /
    • pp.1352-1355
    • /
    • 1996
  • The control of diamond turning is usually achieved through a laser-interferometer feedback of slide position. If the tool post is rigid and the material removal process is relatively static, then such a non-collocated position feedback control scheme may surface. However, as the accuracy requirement gets tighter and desired surface contours become more complex, the need for a direct tool-tip sensing becomes inevitable. The physical constraints of the machining process prohibit any reasonable implementation of a tool-tip motion measurement. It is proposed that the measured force normal to the face of the workpiece can be filtered through an appropriate admittance transfer function to result in the estimated depth of cut. This can be compared to the desired depth of cut to generate the adjustment control action in addition to position feedback control. In this work, the design methodology on the admittance model-based control with a conventional controller is presented. The recursive least-squares algorithm with forgetting factor is proposed to identify the parameters and update the cutting process in real time. The normal cutting forces are measured to identify the cutting dynamics in the real diamond turning process using the precision dynamometer. Based on the parameter estimation of cutting dynamics and the admittance model-based nanodynamic control scheme, simulation results are shown.

  • PDF

Admittance Model-Based Nanodynamic Control of Diamond Turnning Machine (어드미턴스 모델을 이용한 다이아몬드 터닝머시인의 극초정밀 제어)

  • 정상화;김상석
    • Proceedings of the Korean Society of Precision Engineering Conference
    • /
    • 1996.04a
    • /
    • pp.49-52
    • /
    • 1996
  • The control of diamond turning is usually achieved through a laser-interferometer feedback of slide position. The limitation of this control scheme is that the feedback signal does not account for additional dynamics of the tool post and the material removal process. If the tool post is rigid and the material removal process is relatively static, then such a non-collocated position feedback control scheme may surfice. However, as the accuracy requirement gets tighter and desired surface contours become more complex, the need for a direct tool-tip sensing becomes inevitable. The physical constraints of the machining processprohibit any reasonable implementation of a tool-tip motion measurement. It is proposed that the measured force normalto the face of the workpice can be filterd through an appropriate admittance transfer function to result in the estimated depth of cut. This can be compared to the desired depth of cut to generate the adjustment cotnrol action in addition to position feedback control. In this work, the design methodology on the admittance model-based control with a conventional controller is presented. Based on the empirical data of the cutting dynamics, simulation results are shown.

  • PDF

Nutrition Behaviour of Families with Low-Income

  • Jacqueline Koehler;Stephanie Lehmkuehler;Ingrid-Ute Leonhaeuser
    • International Journal of Human Ecology
    • /
    • v.5 no.1
    • /
    • pp.117-130
    • /
    • 2004
  • Poverty is an important issue, not only in developing countries but also in industrialised societies. In 1999 15% of the European population have been in risk of poverty and the number of people living in poverty in Germany continues to increase. As poverty concerns all aspects of life, it influences health, well-being and the nutrition of the people living on low-income. Although this problem is obvious, only few surveys have been conducted to analyse it and therefore there is only limited information on the nutritional situation and nutrition behaviour of the poor. A qualitative study, which looked closely at the nutrition behaviour of 15 low-income families, was carried out in Giessen, Germany. The results showed that the nutritional situation of poor families differs from that families with a higher income have, the reasons being that their scope for action is restricted by a shortage of money and that there is a lack of skills and knowledge to provide family members with adequate nutrition. Strategies to improve the nutrition situation of poor families should aim at encouraging them to acquire relevant information and appropriate skills to adopt a healthier diet within their financial, social and cultural constraints. Also there have to be socio-political arrangements, which improve existing financial and social provisions as well as preventive educational measures.

Optimal design and real application of nonlinear PID controllers (비선형 PID 제어기의 최적 설계및 실제 적용)

  • Lee, Moon-Yong;Koo, Doe-Gyoon;Lee, Jong-Min
    • Journal of Institute of Control, Robotics and Systems
    • /
    • v.3 no.6
    • /
    • pp.639-643
    • /
    • 1997
  • This paper presents how nonlinear PID control algorithms can be applied on chemical processes for a more stable operation and perfect automation. A pass balance controller is designed to balance the exiting temperatures of a heater and a heat exchange network. The proposed controller has gain-varying integral action and deals with the operational constraints in an efficient manner. Also, the use of a PID gap controller is proposed to maximize energy saving and operation stability and to minimize operator intervention in operation of air fan coolers. The proposed controller adjusts the opening of a louver automatically in such a way that it keeps the air fan pitch position within the desired range. All these nonlinear PID controllers have been implemented on the distributed control system (DCS) for good reliability and operability. Operator acceptance was very high and the implemented controllers have shown good performance and high service factor still now on. The proposed methodology can be directly applied to similar processes without any modification.

  • PDF

Low-Complexity Energy Efficient Base Station Cooperation Mechanism in LTE Networks

  • Yu, Peng;Feng, Lei;Li, Zifan;Li, Wenjing;Qiu, Xuesong
    • KSII Transactions on Internet and Information Systems (TIIS)
    • /
    • v.9 no.10
    • /
    • pp.3921-3944
    • /
    • 2015
  • Currently Energy-Saving (ES) methods in cellular networks could be improved, as compensation method for irregular Base Station (BS) deployment is not effective, most regional ES algorithm is complex, and performance decline caused by ES action is not evaluated well. To resolve above issues, a low-complexity energy efficient BS cooperation mechanism for Long Time Evolution (LTE) networks is proposed. The mechanism firstly models the ES optimization problem with coverage, resource, power and Quality of Service (QoS) constraints. To resolve the problem with low complexity, it is decomposed into two sub-problems: BS Mode Determination (BMD) problem and User Association Optimization (UAO) problem. To resolve BMD, regional dynamic multi-stage algorithms with BS cooperation pair taking account of load and geographic topology is analyzed. And then a distributed heuristic algorithm guaranteeing user QoS is adopted to resolve UAO. The mechanism is simulated under four LTE scenarios. Comparing to other algorithms, results show that the mechanism can obtain better energy efficiency with acceptable coverage, throughput, and QoS performance.

Optimum design of steel floor system: effect of floor division number, deck thickness and castellated beams

  • Kaveh, A.;Ghafari, M.H.
    • Structural Engineering and Mechanics
    • /
    • v.59 no.5
    • /
    • pp.933-950
    • /
    • 2016
  • Decks, interior beams, edge beams and girders are the parts of a steel floor system. If the deck is optimized without considering beam optimization, finding best result is simple. However, a deck with higher cost may increase the composite action of the beams and decrease the beam cost reducing the total cost. Also different number of floor divisions can improve the total floor cost. Increasing beam capacity by using castellated beams is other efficient method to save the costs. In this study, floor optimization is performed and these three issues are discussed. Floor division number and deck sections are some of the variables. Also for each beam, profile section of the beam, beam cutting depth, cutting angle, spacing between holes and number of filled holes at the ends of castellated beams are other variables. Constraints include the application of stress, stability, deflection and vibration limitations according to the load and resistance factor (LRFD) design. Objective function is the total cost of the floor consisting of the steel profile cost, cutting and welding cost, concrete cost, steel deck cost, shear stud cost and construction costs. Optimization is performed by enhanced colliding body optimization (ECBO), Results show that using castellated beams, selecting a deck with higher price and considering different number of floor divisions can decrease the total cost of the floor.

Measuring gameplay similarity between human and reinforcement learning artificial intelligence (사람과 강화학습 인공지능의 게임플레이 유사도 측정)

  • Heo, Min-Gu;Park, Chang-Hoon
    • Journal of Korea Game Society
    • /
    • v.20 no.6
    • /
    • pp.63-74
    • /
    • 2020
  • Recently, research on automating game tests using artificial intelligence agents instead of humans is attracting attention. This paper aims to collect play data from human and artificial intelligence and analyze their similarity as a preliminary study for game balancing automation. At this time, constraints were added at the learning stage in order to create artificial intelligence that can play similar to humans. Play datas obtained 14 people and 60 artificial intelligence by playing Flippy bird games 10 times each. The collected datas compared and analyzed for movement trajectory, action position, and dead position using the cosine similarity method. As a result of the analysis, an artificial intelligence agent with a similarity of 0.9 or more with humans was found.