Search | Korea Science

Improved Deep Q-Network Algorithm Using Self-Imitation Learning (Self-Imitation Learning을 이용한 개선된 Deep Q-Network 알고리즘)

Sunwoo, Yung-Min;Lee, Won-Chang
- Journal of IKEEE
- /
- v.25 no.4
- /
- pp.644-649
- /
- 2021
Self-Imitation Learning is a simple off-policy actor-critic algorithm that makes an agent find an optimal policy by using past good experiences. In case that Self-Imitation Learning is combined with reinforcement learning algorithms that have actor-critic architecture, it shows performance improvement in various game environments. However, its applications are limited to reinforcement learning algorithms that have actor-critic architecture. In this paper, we propose a method of applying Self-Imitation Learning to Deep Q-Network which is a value-based deep reinforcement learning algorithm and train it in various game environments. We also show that Self-Imitation Learning can be applied to Deep Q-Network to improve the performance of Deep Q-Network by comparing the proposed algorithm and ordinary Deep Q-Network training results.
https://doi.org/10.7471/ikeee.2021.25.4.644 인용 PDF KSCI

Multi-Agent Deep Reinforcement Learning for Fighting Game: A Comparative Study of PPO and A2C

Yoshua Kaleb Purwanto;Dae-Ki Kang
- International Journal of Internet, Broadcasting and Communication
- /
- v.16 no.3
- /
- pp.192-198
- /
- 2024
This paper investigates the application of multi-agent deep reinforcement learning in the fighting game Samurai Shodown using Proximal Policy Optimization (PPO) and Advantage Actor-Critic (A2C) algorithms. Initially, agents are trained separately for 200,000 timesteps using Convolutional Neural Network (CNN) and Multi-Layer Perceptron (MLP) with LSTM networks. PPO demonstrates superior performance early on with stable policy updates, while A2C shows better adaptation and higher rewards over extended training periods, culminating in A2C outperforming PPO after 1,000,000 timesteps. These findings highlight PPO's effectiveness for short-term training and A2C's advantages in long-term learning scenarios, emphasizing the importance of algorithm selection based on training duration and task complexity. The code can be found in this link https://github.com/Lexer04/Samurai-Shodown-with-Reinforcement-Learning-PPO.
https://doi.org/10.7236/IJIBC.2024.16.3.192 인용 PDF

A Study on Directionality in Modern Utilization of 『Natya sastra』 -Focused on Facial Expression Acting Techniques- (『나띠야 샤스뜨라』의 현대적 활용에 대한 방향성 고찰 -표정연기술을 중심으로-)

Ahn, Jae-Beom
- The Journal of the Korea Contents Association
- /
- v.17 no.12
- /
- pp.408-416
- /
- 2017
This researcher looked at the characteristics of "$N\bar{a}tya$ $\acute{s}\bar{a}stra$" acting techniques focused on body expressions and sought to utilize it as training method for modern actors. 'Body postponement as a symbol' is a key characteristic of "$N\bar{a}tya$ $\acute{s}\bar{a}stra$" acting techniques, and each facial expression, gesture, and movement performed by an actor has a specific meaning. Therefore, the study on "$N\bar{a}tya$ $\acute{s}\bar{a}stra$" has been treated as a major study mainly in the non-realistic expression style or the body-centered acting theory. Therefore, a study on "$N\bar{a}tya$ $\acute{s}\bar{a}stra$" acting techniques was mainly dealt with as a major research subject in the nonrealistic expression style or body-centered acting theory. This study, on the other hand, emphasized the importance of research on the utilization plan of actor training, which can be usually used in realistic plays as well as non-realistic plays. According to psychologists such as Damasio and Eckman, internal impulses can also be triggered through expressions and actions associated with emotions. In addition, not only acting through the actor's inner impulse, but also expression of the actor's faithful external emotion can trigger the emotion of the audience. Such a case can be a psychological basis that "$N\bar{a}tya$ $\acute{s}\bar{a}stra$" acting techniques, which defines rasa and expresses it physically, can be used as a methodology to enhance inner truth. Therefore, the study on the actor training utilizing ing "$N\bar{a}tya$ $\acute{s}\bar{a}stra$" can be applied as an effective approach in the study of contemporary acting theory which intends to integrate inner impulse and external expression.
https://doi.org/10.5392/JKCA.2017.17.12.408 인용 PDF KSCI

A Study on the Theory of Action by Vakhangov and Michael Chekhov (박탄고프와 미카엘 체홉의 연기론 고찰)

Do, Jung-Nim;Park, Yi-Seul
- Journal of Korea Entertainment Industry Association
- /
- v.14 no.4
- /
- pp.133-144
- /
- 2020
This study is a new proposal for the methods of actor training and role creation in the contemporary theater and an approach to the practical utilization of the performer, regarding the actor's 'presence' as the essence of living arts, a peculiarity of theater. As the method for this, this study sorts out Vakhangov and Mikhail Chekhov's elements of acting techniques and at the same time, allows an easier approach to the theoretical concept based on the performance records found in the developmental process. The magic realism and the technique of acting discussed in priority in this study emphasize the importance of the exploration and realization of artistic inspiration in everyday life, the actor's imagination and image, and unconsciousness as a method for creating new actors and diversifying their roles. When their common views are summed up, the goals to achieve include a study of a creative method in which outer form and inner truth are combined and the implementation of a new system for creating the individual actor's originality. This study would classify the similarities and differences found through this, reveal the limit of practical efficacy and propose it as a universal method for creating the roles, asking for the actor's voluntary training and active attitudes.
https://doi.org/10.21184/jkeia.2020.6.14.4.133 인용

Robot Locomotion via RLS-based Actor-Critic Learning (RLS 기반 Actor-Critic 학습을 이용한 로봇이동)

Kim, Jong-Ho;Kang, Dae-Sung;Park, Joo-Young
- Journal of the Korean Institute of Intelligent Systems
- /
- v.15 no.7
- /
- pp.893-898
- /
- 2005
Due to the merits that only a small amount of computation is needed for solutions and stochastic policies can be handled explicitly, the actor-critic algorithm, which is a class of reinforcement learning methods, has recently attracted a lot of interests in the area of artificial intelligence. The actor-critic network composes of tile actor network for selecting control inputs and the critic network for estimating value functions, and in its training stage, the actor and critic networks take the strategy, of changing their parameters adaptively in order to select excellent control inputs and yield accurate approximation for value functions as fast as possible. In this paper, we consider a new actor-critic algorithm employing an RLS(Recursive Least Square) method for critic learning, and policy gradients for actor learning. The applicability of the considered algorithm is illustrated with experiments on the two linked robot arm.
https://doi.org/10.5391/JKIIS.2005.15.7.893 인용 PDF KSCI

On The Voice Training of Stage Speech in Acting Education - Yuri Vasiliev's Stage Speech Training Method - (연기 교육에서 무대 언어의 발성 훈련에 관하여 - 유리 바실리예프의 무대 언어 훈련방법 -)

Xu, Cheng-Kang
- Journal of Korea Entertainment Industry Association
- /
- v.15 no.3
- /
- pp.203-210
- /
- 2021
Yuri Vasilyev - actor, director and drama teacher. Russian meritorious artist, winner of the stage "Medal of Friendship" awarded by Russian President Vladimir Putin; academician of the Petrovsky Academy of Sciences and Arts in Russia, professor of the Russian National Academy of Performing Arts, and professor of the Bavarian Academy of Drama in Munich, Germany. The physiological sense stimulation method based on the improvement of voice, language and motor function of drama actors. On the basis of a systematic understanding of performing arts, Yuri Vasiliev created a unique training method of speech expression and skills. From the complicated art training, we find out the most critical skills for focused training, which we call basic skills training. Throughout the whole training process, Professor Yuri made a clear request for the actor's lines: "action! This is the basis of actors' creation. So action is the key! Action and voice are closely linked. Actor's voice is human voice, human life, human feeling, human experience and disaster. It is also the foundation of creation that actors acquire their own voice. What we are engaged in is pronunciation, breathing, tone and intonation, speed and rhythm, expressiveness, sincerity, stage voice and movement, gesture, all of which are used to train the voice of actors according to the standard of drama. In short, Professor Yuri's training course is not only the training of stage performance and skills, but also contains a rich view of drama and performance. I think, in addition to learning from the means and methods of training, it is more important for us to understand the starting point and training objectives of Professor Yuri's use of these exercises.
https://doi.org/10.21184/jkeia.2021.4.15.3.203 인용

Development of an Actor-Critic Deep Reinforcement Learning Platform for Robotic Grasping in Real World (현실 세계에서의 로봇 파지 작업을 위한 정책/가치 심층 강화학습 플랫폼 개발)

Kim, Taewon;Park, Yeseong;Kim, Jong Bok;Park, Youngbin;Suh, Il Hong
- The Journal of Korea Robotics Society
- /
- v.15 no.2
- /
- pp.197-204
- /
- 2020
In this paper, we present a learning platform for robotic grasping in real world, in which actor-critic deep reinforcement learning is employed to directly learn the grasping skill from raw image pixels and rarely observed rewards. This is a challenging task because existing algorithms based on deep reinforcement learning require an extensive number of training data or massive computational cost so that they cannot be affordable in real world settings. To address this problems, the proposed learning platform basically consists of two training phases; a learning phase in simulator and subsequent learning in real world. Here, main processing blocks in the platform are extraction of latent vector based on state representation learning and disentanglement of a raw image, generation of adapted synthetic image using generative adversarial networks, and object detection and arm segmentation for the disentanglement. We demonstrate the effectiveness of this approach in a real environment.
https://doi.org/10.7746/jkros.2020.15.2.197 인용 PDF KSCI

Discussion about the Priority for the Improvement of Performer Training in Korea

Son, BongHee
- International journal of advanced smart convergence
- /
- v.11 no.2
- /
- pp.135-141
- /
- 2022
This thesis examines a significant way to enhancing and improving the term/phenomenon of performer training system in contemporary Korean theatre. To articulate the matters, this research engages in discussing and criticizing those problematic issues that we, as an instructor/trainer, have faced with through the last decades in the field of performer training and education. Specifically, we concern with the necessity of an applicable and appropriate educational/training system where each student-actor would discover his/her own adaptability by evaluating what a specific method and approach is. This atmosphere accurately provided by an instructor/trainer can also facilitate and enhance the young students' potential possibilities and/or talent, that is, as we argue a way to accomplish each performer's true nature. To achieve the goals, we underlie the necessity of establishing and/or settling performer training program/course by means of an alternative path. The research finding shows that within the atmosphere each student could share then interrogate what a possible or ideal way is according to his/her comprehensive understandings with clearer purpose: what kind of performers would you produce, train, and/or educate.
https://doi.org/10.7236/IJASC.2022.11.2.135 인용 PDF KSCI

A Case Study on the Role Creation of Actors Using Etude - Centering on the Play - (에쮸드(Etude)를 활용한 배우의 역할창조 사례연구 - 연극 <춤추며 간다.>를 중심으로 -)

Lee, Jeong-Ha
- Journal of Korea Entertainment Industry Association
- /
- v.13 no.3
- /
- pp.101-110
- /
- 2019
The actor's art work is to build and create a role on stage based on the writer's drama. The actor's role creation is possible by analyzing the role of the writer in the drama logically and acting it actively. This is how an actor who practices practical acts goes beyond a stereotypical role-building and performs live acting skills. A case study in the field work for the application of Etude is absolutely necessary at present, where Etude of Stanislavsky is operated in Korean university education and field. This study will be a case in which Etude, which is a scientific and systematic acting methodology of Stanislavsky, is recognized and applied in the field as a methodology for more extended actor training methodology rather than making a judgment about the value of Etude as applied to theater education and the field as an acting training method. The researcher will introduce the methodology of using Etude as an acting method of Stanislavsky through the use of Etude in the creative play , and would like to give an example of an acting creation process model about 'how to apply Etude'. Through these studies and applications, actors can avoid falling into stereotypes and mannerism, and prepare the foundations for a living actor's art, the acting guide for creating a practical role.
https://doi.org/10.21184/jkeia.2019.4.13.3.101 인용

Robot locomotion via IRPO based Actor-Critic Learning Method (IRPO 기반 Actor-Critic 학습 기법을 이용한 로봇이동)

Kim, Jong-Ho;Kang, Dae-Sung;Park, Joo-Young
- Proceedings of the KIEE Conference
- /
- 2005.07d
- /
- pp.2933-2935
- /
- 2005
The IRPO(Intensive Randomized Policy Optimizer) algorithm is a recently developed tool in the area of reinforcement leaming. And it has been shown to be very successful in several application problems. To compare with a general RL method, IRPO has some difference in that policy utilizes the entire history of agent -environment interaction. The policy is derived from the history directly, not through any kind of a model of the environment. In this paper, we consider a robot-control problem utilizing a IRPO algorithm. We also developed a MATLAH-based animation program, by which the effectiveness of the training algorithms were observed.
PDF

Search Result 47, Processing Time 0.133 seconds

이메일무단수집거부

이용약관

제 1 장 총칙

제 2 장 이용계약의 체결

제 3 장 계약 당사자의 의무

제 4 장 서비스의 이용

제 5 장 계약 해지 및 이용 제한

제 6 장 손해배상 및 기타사항

Detail Search

Image Search (β)