Search | Korea Science

Research Trends on Deep Reinforcement Learning (심층 강화학습 기술 동향)

Jang, S.Y.;Yoon, H.J.;Park, N.S.;Yun, J.K.;Son, Y.S.
- Electronics and Telecommunications Trends / v.34 no.4 / pp.1-14 / 2019
Recent trends in deep reinforcement learning (DRL) show considerable improvements to DRL algorithms in terms of performance, learning stability, and computational efficiency. The scenarios that DRL covers (e.g., partial observability; cooperation, competition, coexistence, and communication among multiple agents; multi-task learning; decentralized intelligence) have also expanded vastly. These features have cultivated multi-agent reinforcement learning research. DRL is also expanding its applications beyond robotics, natural language processing, and computer vision into a wide array of fields such as finance, healthcare, chemistry, and even art. In this report, we briefly summarize various DRL techniques and research directions.
https://doi.org/10.22648/ETRI.2019.J.340401

Continual Multiagent Reinforcement Learning in Dynamic Environments (동적 환경에서의 지속적인 다중 에이전트 강화 학습)

Jung, Kyuyeol;Kim, Incheol
- Proceedings of the Korea Information Processing Society Conference / 2020.11a / pp.988-991 / 2020
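In many real-world applications, it is important for multiple agents to learn behavior policies that let them cooperate closely toward a common goal. In this multi-agent reinforcement learning (MARL) setting, most existing studies have adopted centralized training with decentralized execution (CTDE) as the de facto standard framework. However, this approach has difficulty coping with dynamic environments, where new changes that were never experienced during training can keep arising at execution time. To respond effectively to such dynamic environments, this paper proposes C-COMA, a new multi-agent reinforcement learning scheme. C-COMA is a continual learning model that does not separate the agents' training time from their execution time; instead, it assumes deployment conditions from the start and continually learns the agents' cooperative behavior policies. We implement dynamic mini-games based on StarCraft II, a representative real-time strategy game, and conduct various experiments in this environment to demonstrate the effectiveness and superiority of the proposed C-COMA model.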
https://doi.org/10.3745/PKIPS.y2020m11a.988

Relationships of the Self-regulated Learning Strategies used in Both Science and English Classes and Motivation to Academic Performance by Science-gifted High School Students (과학영재고등학생의 과학과 영어과목에서의 학습전략 사용 및 동기의 차이와 학업수행과의 관계)

Sung, Hyun-Sook;Kim, Eel;Kim, Young-Sang
- Journal of Gifted/Talented Education / v.19 no.1 / pp.95-117 / 2009
This study investigated how the self-regulated learning strategies used in science and English classes, together with motivation, relate to the academic performance of science-gifted high school students. The participants were 144 freshmen of the Korea Science Academy. The use of self-regulated learning strategies and motivation was found to exert a differential influence on the academic performance of science-gifted students, depending on the subject they study. Results showed that students used self-regulated strategies, which consist of cognition, metacognition, and resource management strategies, more actively in science class than in English class. In addition, their motivation level in science class was significantly higher than in English class. Self-regulated strategies did not explain any variance in physics GPA. Task value, among the motivation variables, accounted for 2 percent of variance in physics GPA. Metacognition and the time and study environment variables explained 8 percent and 15 percent of variance in English GPA, respectively. Self-efficacy, among the motivation variables, accounted for 30 percent of variance in English GPA. These results are discussed in light of instruction for science-gifted high school students.

Performance Evaluation of Multilinear Regression Empirical Formula and Machine Learning Model for Prediction of Two-dimensional Transverse Dispersion Coefficient (다중선형회귀경험식과 머신러닝모델의 2차원 횡 분산계수 예측성능 평가)

Lee, Sun Mi;Park, Inhwan
- Proceedings of the Korea Water Resources Association Conference / 2022.05a / pp.172-172 / 2022
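The dispersion coefficient is a representative parameter for assessing the mixing capacity of pollutants in rivers. In particular, when predicting transverse mixing is important, as in predicting the mixing of effluent from wastewater treatment plants, the two-dimensional transverse dispersion coefficient must be determined with the river's topographic and hydraulic characteristics taken into account. To determine it, previous studies derived empirical formulas from tracer test results and used them to estimate the transverse dispersion coefficient. Deriving an empirical formula by regression analysis requires sufficient data, but the number of two-dimensional tracer tests is too small to derive a reliable formula. This study therefore uses the SMOTE technique to augment the experimental transverse dispersion coefficient data and derives an empirical formula from the augmented data. In addition, to compensate for the limitations of the empirical formula derived by multilinear regression, various machine learning techniques are applied and a technique suited to estimating the transverse dispersion coefficient is proposed. Data sets of width-to-depth ratio, velocity-to-shear-velocity ratio, and transverse dispersion coefficient were collected from existing tracer test data, and the SMOTE algorithm was applied to generate the data groups needed for regression analysis and machine learning. An empirical formula for the transverse dispersion coefficient was determined by multilinear regression on data that included the newly generated data set, and the accuracy of the newly proposed formula was compared with that of existing formulas. Because the formula determined by multilinear regression was limited in the range of transverse dispersion coefficients it could predict, machine learning techniques were applied and their prediction performance was evaluated against multilinear regression. Support vector regression (SVR), k-nearest neighbors regression (KNN-R), and random forest regression (RFR) were used. Prediction performance was compared between the transverse dispersion coefficients obtained from the three machine learning techniques and those determined from the empirical formula. Through this, the study derived a data preprocessing technique for estimating the two-dimensional transverse dispersion coefficient from a limited experimental data set, along with a suitable machine learning procedure and the best-performing learning technique.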
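The augmentation step described above can be illustrated with a minimal sketch of SMOTE-style interpolation. This is not the authors' procedure: it assumes a single nearest neighbor (standard SMOTE samples among k neighbors), interpolates the target value together with the two predictors, and skips feature scaling. The function names and the sample rows are hypothetical.

```python
import random


def nearest_neighbor(i, rows):
    """Index of the row closest to rows[i] by squared Euclidean distance."""
    def dist2(a, b):
        return sum((x - y) ** 2 for x, y in zip(a, b))
    return min((j for j in range(len(rows)) if j != i),
               key=lambda j: dist2(rows[i], rows[j]))


def smote_like(rows, n_new, seed=0):
    """Create n_new synthetic rows on the segment between a random row
    and its nearest neighbor (simplified, k = 1)."""
    rng = random.Random(seed)
    synthetic = []
    for _ in range(n_new):
        i = rng.randrange(len(rows))
        j = nearest_neighbor(i, rows)
        lam = rng.random()  # interpolation weight in [0, 1)
        synthetic.append(tuple(a + lam * (b - a)
                               for a, b in zip(rows[i], rows[j])))
    return synthetic


# Hypothetical rows: (width/depth, velocity/shear velocity, dispersion coefficient)
measured = [(10.0, 5.0, 0.20), (12.0, 6.0, 0.25), (30.0, 15.0, 0.60)]
augmented = measured + smote_like(measured, n_new=5)
print(len(augmented))
```

Because every synthetic row lies between two measured rows, this kind of augmentation densifies the sampled range but cannot extend it.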

Design of Spark SQL Based Framework for Advanced Analytics (Spark SQL 기반 고도 분석 지원 프레임워크 설계)

Chung, Jaehwa
- KIPS Transactions on Software and Data Engineering / v.5 no.10 / pp.477-482 / 2016
As advanced analytics on big data becomes indispensable for agile decision-making and tactical planning in enterprises, distributed processing platforms such as Hadoop and Spark, which distribute and process large volumes of data across multiple nodes, are receiving great attention in the field. In the Spark platform stack, Spark SQL was recently unveiled to let Spark support an SQL-based distributed processing framework. However, Spark SQL cannot effectively handle advanced analytics that involves machine learning and graph processing, in terms of iterative tasks and task allocation. Motivated by these issues, this paper proposes the design of an SQL-based optimal big data processing engine and a processing framework to support advanced analytics in Spark environments. The optimal big data processing engine handles complex SQL queries that involve multiple parameters and join, aggregation, and sorting operations in a distributed/parallel manner, and the proposed framework optimizes the machine learning process in terms of relational operations.
https://doi.org/10.3745/KTSDE.2016.5.10.477

Web-Based Teaching-Learning System of Mobile Agent (이동 에이전트를 활용한 웹기반 교수-학습시스템)

Ko, Ju-Yeon;Park, Sun-Ju
- Journal of The Korean Association of Information Education / v.5 no.2 / pp.216-229 / 2001
A more interactive teaching-learning system is increasingly necessary in the consumer-oriented environment of distance education. This article suggests a more spontaneous system for learners at various levels. The suggested system emphasizes efficiency by introducing a "mobile agent" concept through which learners are able to network and complete their assignments despite their dispersed environments. This article also suggests some managerial techniques for the systematic management of agent-based learners with diverse characteristics. Through this study, we expect greater effectiveness from offering material adapted to each learner's goals and ability, moving away from uniform web-based teaching-learning.

Skin and non-skin color separability enhancement based on Average Neighborhood Margin Maximization (ANMM(Average Neighborhood Margin Maximization)에 기반한 피부색과 비피부색 분리력 향상 기법)

Ban, Yuseok;Lee, Sangyoun
- Proceedings of the Korean Society of Broadcast Engineers Conference / 2011.07a / pp.6-7 / 2011
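This paper proposes a statistical approach to the binary classification problem of separating skin-color and non-skin-color regions, based on Average Neighborhood Margin Maximization (ANMM), which uses a local learning method. The change in separability between the two classes is examined by comparing, for Fisher Linear Discriminant (FLD) and ANMM, the ratio of between-class variance to within-class variance of the skin and non-skin classes. For the supervised binary classification problem, the approach addresses the small sample size (SSS) problem, the Gaussian distribution assumption, and the limit on the maximum number of extractable features, while improving the separability between skin and non-skin colors by introducing a local feature learning method.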

Development of a Metadata Tool for LIO Learning Object Model on the Distributed Environments (분산 환경에서의 LIO 학습 객체 모델을 위한 메타데이터 도구 개발)

Shin, Haeng-Ja;Park, Keuyng-Hwan
- Proceedings of the Korea Information Processing Society Conference / 2003.11b / pp.697-700 / 2003
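Metadata is data about data: it provides information about content by describing the attributes of each element that makes up a content model. Such metadata is written as indexed labels so that content can be used or searched more easily, and the metadata elements must be precise for the description to be accurate. In this paper, we design and develop a tool that creates, updates, and stores the metadata essential to virtual education systems for the LIO learning object model, which is reusable across different systems, based on LOM, the metadata standardization technology for e-learning systems. The metadata is bound to XML documents so that it can be used effectively in distributed computing environments.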

KAISER: Named Entity Recognizer using Word Embedding-based Self-learning of Gazettes (KAISER: 워드 임베딩 기반 개체명 어휘 자가 학습 방법을 적용한 개체명 인식기)

Hahm, Younggyun;Choi, Dongho;Choi, Key-Sun
- Annual Conference on Human and Language Technology / 2016.10a / pp.337-339 / 2016
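This paper describes a method that uses word embeddings to improve the performance of Korean named entity recognition. A word embedding is a distributed representation that expresses the meaning of a word as a vector, based on the co-occurrence information of words in sentences. Such distributed representations are useful for computing the degree of semantic relatedness between words. We apply a method for self-learning and matching of a named entity dictionary (gazetteer) that uses the cosine similarity of word vectors obtained from these embeddings, and report the experimental results.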
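As a rough illustration of dictionary self-learning by cosine similarity, the sketch below adds a word to a gazetteer when its vector is close enough to any seed entry. It is not the KAISER implementation: the function names, the 0.9 threshold, and the two-dimensional toy vectors are all assumptions made for the example.

```python
import math


def cosine(u, v):
    """Cosine similarity of two equal-length vectors (0.0 if either is zero)."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    return dot / (norm_u * norm_v) if norm_u and norm_v else 0.0


def expand_gazetteer(seeds, embeddings, threshold=0.9):
    """Return the seed entries plus every word whose embedding has cosine
    similarity >= threshold with at least one seed (single pass)."""
    learned = set(seeds)
    for word, vec in embeddings.items():
        if word in learned:
            continue
        if any(cosine(vec, embeddings[s]) >= threshold
               for s in seeds if s in embeddings):
            learned.add(word)
    return learned


# Toy vectors, invented for illustration only.
toy_embeddings = {
    "Seoul": [0.90, 0.10],
    "Busan": [0.85, 0.20],
    "apple": [0.10, 0.95],
}
print(sorted(expand_gazetteer({"Seoul"}, toy_embeddings)))
```

A single pass keeps the example short; repeating the pass with the enlarged set would let newly learned entries act as seeds, at the cost of possible drift away from the original entity type.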

The multi agent control heuristic using direction vector (방향 벡터를 이용한 다중에이전트 휴리스틱)

Kim Hyun;Lee SeungGwan;Chung TaeChoong
- Proceedings of the Korea Information Processing Society Conference / 2004.11a / pp.525-528 / 2004
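The prey pursuit problem is the task of capturing a prey with multiple agents in a space made up of a virtual grid. In earlier work, agents captured the prey within a $30{\times}30$ grid space by applying previously proposed strategies: local control, distributed control, and distributed control using reinforcement learning. Because a bounded grid space is far too limited to represent the real world, this paper aims to represent an unbounded space that resembles the real world rather than a bounded grid. The resulting environment model is a new experimental space, a circular-structure grid, and the strategy for this space is implemented as a model that takes into account the direction vector of the pursuit relationship between agent and prey. In this environment, which differs from those of earlier experiments, we obtained results supporting the assumption that agents can learn through heuristics, along with efficient capture of the prey and resolution of the collision problem.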
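One way to picture a direction vector on a circular grid is sketched below. It assumes that "circular" means both axes wrap around (a torus) and that an agent moves one cell per step along the axis with the larger offset; neither detail is stated in the abstract, and the function names are hypothetical.

```python
GRID = 30  # grid side length, taken from the 30 x 30 grid in the abstract


def wrapped_delta(src, dst, size=GRID):
    """Shortest signed offset from src to dst along one wrap-around axis."""
    d = (dst - src) % size
    return d - size if d > size // 2 else d


def direction_vector(agent, prey, size=GRID):
    """Shortest (dx, dy) from agent to prey on the wrap-around grid."""
    return (wrapped_delta(agent[0], prey[0], size),
            wrapped_delta(agent[1], prey[1], size))


def step_toward(agent, prey, size=GRID):
    """Move the agent one cell along the axis with the larger offset."""
    dx, dy = direction_vector(agent, prey, size)
    if abs(dx) >= abs(dy) and dx != 0:
        move = (1 if dx > 0 else -1, 0)
    elif dy != 0:
        move = (0, 1 if dy > 0 else -1)
    else:
        move = (0, 0)  # already on the prey's cell
    return ((agent[0] + move[0]) % size, (agent[1] + move[1]) % size)


# An agent at x = 1 reaches a prey at x = 29 faster by crossing the edge.
print(direction_vector((1, 1), (29, 1)))
print(step_toward((1, 1), (29, 1)))
```

The modulo arithmetic is what removes the walls: an agent near one edge treats a prey near the opposite edge as close by, so a chase never stalls against a boundary.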

