• 제목/요약/키워드: CRITIC

검색결과 147건 처리시간 0.024초

Walter Benjamin′s Unacknowledged Romanticism

  • Halmi, Nicholas
    • 인문언어
    • /
    • 제2권2호
    • /
    • pp.163-182
    • /
    • 2002
  • In Origin of the German Mourning Play(1928), the critic Waltre Benjamin strongly criticized the German Romantic concept of the symbol, according to which the universal and ideal can be represented wholly in the particular and empirical by virtue of an ontological connection between them. Yet this criticism did not prevent Benjamin, in his epistemological preface to the book, from availing himself of the same monadological model (derived from Leibniz and Goethe) on which the Romantics had relied. Although he specifically rejected their insistence on the fusion of the phenomenal and the ideal in the symbol, his own theory of Ideas and their presentation in criticism nonetheless requires just such a fusion. This is not immediately apparent for two reasons: first, Benjamin proposes, in contrast to Platonic and Romantic theory, that Ideas themselves are subject to historical change, and therefore not capable of manifesting themselves fully in any given historical phenomenon; and second, he proposes that Ideas rather than phenomena are monads, individually representing the whole of the world in which they participate. The task of the critic, which Benjamin calls Darstellung("presentation"), consists in revealing Ideas by reducing historical phenomena to their constituent elements and reassembling those elements in what amounts to a mosaic of quotations. But this task is possible only if the critic has a preconception of the Idea he is trying to reveal-a possibility that Benjamin′s theory of knowledge does not allow for at all- or if he can discern the Ideas in the individual phenomenal fragments from which he creates his mosaic, in which case phenomena and Ideas must be related monadologically after all. Benjamin seems to admit the latter possibility in a cryptic sentence in the manuscript draft of his preface to the Origin, but he does not do so in the final printed version. Thus he effectively deprived the critic of an epistemological basis for the presentation of Ideas.

  • PDF

RLS 기반의 Natural Actor-Critic 알고리즘을 이용한 터널 환기제어기 설계 (Tunnel Ventilation Controller Design Employing RLS-Based Natural Actor-Critic Algorithm)

  • 주백석;김동남;홍대희;박주영;정진택;권태형
    • 한국정밀공학회:학술대회논문집
    • /
    • 한국정밀공학회 2006년도 춘계학술대회 논문집
    • /
    • pp.53-54
    • /
    • 2006
  • The main purpose of tunnel ventilation system is to maintain CO pollutant and VI (visibility index) under an adequate level to provide drivers with safe driving condition. Moreover, it is necessary to minimize power consumption used to operate ventilation system. To achieve the objectives, the control algorithm used in this research is reinforcement teaming (RL) method. RL is a goal-directed teaming of a mapping from situations to actions. The goal of RL is to maximize a reward which is an evaluative feedback from the environment. Constructing the reward of the tunnel ventilation system, two objectives listed above are included. RL algorithm based on actor-critic architecture and natural gradient method is adopted to the system. Also, the recursive least-squares (RLS) is employed to the learning process to improve the efficiency of the use of data. The simulation results performed with real data collected from existing tunnel are provided in this paper. It is confirmed that with the suggested controller, the pollutant level inside the tunnel was well maintained under allowable limit and the performance of energy consumption was improved compared to conventional control scheme.

  • PDF

기지국 상태 조정을 위한 강화 학습 기법 분석 (Analysis of Reinforcement Learning Methods for BS Switching Operation)

  • 박혜빈;임유진
    • 예술인문사회 융합 멀티미디어 논문지
    • /
    • 제8권2호
    • /
    • pp.351-358
    • /
    • 2018
  • 강화 학습은 변화하는 환경에서의 최적의 보상을 얻을 수 있는 행동을 결정하기 위한 정책을 얻는 기계 학습 기법이다. 하지만 기존에 연구되어 온 강화 학습은 불확실하고 연속적인 실제 환경에서 최적의 행동을 얻기 위해 발생되는 높은 계산 복잡도 문제와 학습된 결과를 얻기 위해서는 많은 시간이 소요 된다는 문제점을 가지고 있다. 앞에서 언급한 문제를 해결하기 위해, 높은 계산 복잡도 문제를 해결을 위해서는 강화 학습을 구성하는 가치 함수와 정책을 독립적으로 구성하는 AC(actor-critic) 기법이 제안되었다. 그리고 빠른 학습 결과를 얻기 위해 기 학습된 지식을 새로운 환경에서 이용하여 기존 학습보다 빠르게 학습 결과를 얻을 수 있는 전이 학습(transfer learning) 기법이 제안되었다. 본 논문에서는 기존에 연구되어 왔던 기계 학습 기법의 향상 기법인 AC 기법과 전이 학습 기법에 대해 소개하고, 이를 무선 액세스 네트워크 환경에서 기지국 상태 조정을 위해 적용되고 있는 사례를 소개한다.

현실 세계에서의 로봇 파지 작업을 위한 정책/가치 심층 강화학습 플랫폼 개발 (Development of an Actor-Critic Deep Reinforcement Learning Platform for Robotic Grasping in Real World)

  • 김태원;박예성;김종복;박영빈;서일홍
    • 로봇학회논문지
    • /
    • 제15권2호
    • /
    • pp.197-204
    • /
    • 2020
  • In this paper, we present a learning platform for robotic grasping in real world, in which actor-critic deep reinforcement learning is employed to directly learn the grasping skill from raw image pixels and rarely observed rewards. This is a challenging task because existing algorithms based on deep reinforcement learning require an extensive number of training data or massive computational cost so that they cannot be affordable in real world settings. To address this problems, the proposed learning platform basically consists of two training phases; a learning phase in simulator and subsequent learning in real world. Here, main processing blocks in the platform are extraction of latent vector based on state representation learning and disentanglement of a raw image, generation of adapted synthetic image using generative adversarial networks, and object detection and arm segmentation for the disentanglement. We demonstrate the effectiveness of this approach in a real environment.

폭풍해일 침수예상도 검증을 위한 형상유사도 분석 : 형상기준 (Shape Similarity Analysis for Verification of Hazard Map for Storm Surge : Shape Criterion)

  • 김영인;김동현;이승오
    • 한국방재안전학회논문집
    • /
    • 제12권3호
    • /
    • pp.13-24
    • /
    • 2019
  • 실시간 재난위험도 예측 모델인 SIND 모델의 정확도 확인 및 검증을 위해 다양한 형상유사도 개념을 적용하였다. 기하학적 방법론 중에서 가장 널리 이용되는 CRITIC 기법은 침수예상도와 같은 복잡한 지형 형상에 적용하기에는 분명한 한계점을 보여서 본 연구에서는 복잡한 전파특성의 형상을 평가할 수 있는 RCCI와 TF 등과 같은 형상인자를 추가하여 수정된 CRITIC 기법을 제시하였다. 본 연구에서 제안된 형상유사도 평가 방법을 폭풍해일의 침수예상도에 적용하여 검토한 결과, 면 객체 쌍들을 수동으로 정 매칭쌍과 오 매칭쌍으로 구분하였으며, 각 형상 인자들, 위치기준, 면적기준, 형상 기준의 가중치들을 변화시켜가며 각 매칭쌍의 형상유사도를 산정하였다. 본 연구에서 제안된 방법론과 산정된 가중치를 참고자료인 침수예상도의 지도 객체와 목표자료인 SIND 모델결과의 객체에 적용한 결과, 정 매칭쌍은 약 90%가 형상유사도 0.5 이상의 값을 가졌고, 오 매칭쌍은 약 70%가 0.5 미만으로 나타났다. 향후 다수의 객체가 하나의 객체와 대응되는 점을 보완 조정한다면 정 매칭쌍의 형상유사도는 전체적으로 증가하고 오 매칭쌍의 형상유사도는 감소할 것이라 판단된다.

제조업과 서비스업에서의 6시그마 적용에 관한 비교연구 (The Research in the Application of Six Sigma in Manufacturing and Service Industry)

  • 박주석;김동수
    • 한국경영과학회:학술대회논문집
    • /
    • 대한산업공학회/한국경영과학회 2004년도 춘계공동학술대회 논문집
    • /
    • pp.298-301
    • /
    • 2004
  • In the last ten years, manufacturing companies like GE, IBM, Motorola and service companies like GE Capital and CITIBANK have realized that Six Sigma has standed for quality reform movemant. Six Sigma management which makes the critic effects on the quality management have enlarged and applied office indirect work process and service department as well beyond question manufacturing process. Then Can Six Sigma application methodology from manufacturing industry be applied to service industry without transformation? Can Six Sigma be applied on improvement of Information system quality which is the critic key figure of current successful companies? These questions would be resolved by conducting researches about application and evaluation of Six Sigma on manufacturing industry and service industry. This paper will furnish information of Six Sigma's new function for future business situation through comparing those researches

  • PDF

IRPO 기반 Actor-Critic 학습 기법을 이용한 로봇이동 (Robot locomotion via IRPO based Actor-Critic Learning Method)

  • 김종호;강대성;박주영
    • 대한전기학회:학술대회논문집
    • /
    • 대한전기학회 2005년도 제36회 하계학술대회 논문집 D
    • /
    • pp.2933-2935
    • /
    • 2005
  • The IRPO(Intensive Randomized Policy Optimizer) algorithm is a recently developed tool in the area of reinforcement leaming. And it has been shown to be very successful in several application problems. To compare with a general RL method, IRPO has some difference in that policy utilizes the entire history of agent -environment interaction. The policy is derived from the history directly, not through any kind of a model of the environment. In this paper, we consider a robot-control problem utilizing a IRPO algorithm. We also developed a MATLAH-based animation program, by which the effectiveness of the training algorithms were observed.

  • PDF

시스템 특성함수 기반 평균보상 TD(${\lambda}$) 학습을 통한 유한용량 Fab 스케줄링 근사화 (Capacitated Fab Scheduling Approximation using Average Reward TD(${\lambda}$) Learning based on System Feature Functions)

  • 최진영
    • 산업경영시스템학회지
    • /
    • 제34권4호
    • /
    • pp.189-196
    • /
    • 2011
  • In this paper, we propose a logical control-based actor-critic algorithm as an efficient approach for the approximation of the capacitated fab scheduling problem. We apply the average reward temporal-difference learning method for estimating the relative value functions of system states, while avoiding deadlock situation by Banker's algorithm. We consider the Intel mini-fab re-entrant line for the evaluation of the suggested algorithm and perform a numerical experiment by generating some sample system configurations randomly. We show that the suggested method has a prominent performance compared to other well-known heuristics.

분산 A3C를 활용한 회전식 도립 진자 시스템 설계 (Design of Rotary Inverted Pendulum System Using Distributed A3C Algorithm)

  • 권도형;임현교;김주봉;한연희
    • 한국정보처리학회:학술대회논문집
    • /
    • 한국정보처리학회 2019년도 춘계학술발표대회
    • /
    • pp.493-495
    • /
    • 2019
  • 제어 분야의 가장 기초적인 시스템인 Rotary Inverted Pendulum 을 제어하기 위하여, 본 논문에서는 강화학습에서 Deep Q-Network 과 함께 대표적인 알고리즘으로 알려진 Asynchronous Advantage Actor-Critic 을 활용하여 다중 디바이스 제어를 설계한다. Deep Q-Network 알고리즘을 활용한 기존 연구와 동일한 방식으로 실 세계의 물리 에이전트와 가상 환경을 맵핑시키며, 스위치를 통하여 로컬 에이전트와 글로벌 네트워크 간 통신을 구성한다. 본 논문에서는 분산 Asynchronous Advantage Actor-Critic 을 이용함으로써 실 세계의 다중 에이전트 제어를 위한 강화 학습의 활용 가능성을 조명한다.