통합 검색 | Korea Science

Solving Continuous Action/State Problem in Q-Learning Using Extended Rule Based Fuzzy Inference System

Kim, Min-Soeng;Lee, Ju-Jang
- Transactions on Control, Automation and Systems Engineering
- /
- 제3권3호
- /
- pp.170-175
- /
- 2001
Q-learning is a kind of reinforcement learning where the agent solves the given task based on rewards received from the environment. Most research done in the field of Q-learning has focused on discrete domains, although the environment with which the agent must interact is generally continuous. Thus we need to devise some methods that enable Q-learning to be applicable to the continuous problem domain. In this paper, an extended fuzzy rule is proposed so that it can incorporate Q-learning. The interpolation technique, which is widely used in memory-based learning, is adopted to represent the appropriate Q value for current state and action pair in each extended fuzzy rule. The resulting structure based on the fuzzy inference system has the capability of solving the continuous state about the environment. The effectiveness of the proposed structure is shown through simulation on the cart-pole system.
PDF

상호작용 증진을 위한 동적인 Q&A 게시판의 설계 및 구현 (Design and Implementation of Dynamic Q&A Bulletin Board System for Enhancement of Interaction)

윤소영;이지영
- 정보학연구
- /
- 제4권2호
- /
- pp.37-49
- /
- 2001
본 연구는 웹 기반 수업에서 상호작용을 위한 수단으로 사용되고 있는 Q&A(Question and Answer) 게시판에 동적인 기능을 추가하여 학습자에게는 즉각적인 응답을 주고, 교수자에게는 답변에 대한 부담감을 해소하고자 하였다. 또한 이를 통하여 웹 기반 수업에서 상호작용 증진 효과를 얻고자 하였다. 구현한 동적인 Q&A 게시판은 기존 Q&A게시판의 단점인 질문을 게시하고 교수자가 확인하여 답변할 때까지 기다려야 했던 점을 개선하여 교수자가 미리 구축해놓은 답변 데이터베이스와 인터넷 검색엔진에서 검색한 결과를 즉각적으로 응답할 수 있게 하였다.
PDF

전자책 유통을 위한 리더 시스템 개발 (Development of E-Book Reader System for Q＋Platform)

이은정
- 인터넷정보학회논문지
- /
- 제2권4호
- /
- pp.83-90
- /
- 2001
XML 기반의 전자책 리더 시스템인 Q＋-리더의 개발을 소개한다. 이 시스템은 정보가전 용 내장형 플랫폼 Q+를 목표로 개발되었다. 본 리더 시스템은 OEB 표준에서 규정한 XML 기반의 컨텐트 형식과 CSS에 의한 스타일을 지원한다. 본 시스템은 전자책 컨텐츠를 사용자에게 랜더링해 주는 역할을 하는데, 이러한 랜더링 기능을 내재함으로서 전자책 리더 시스템은 컨텐츠의 사용에 대한 제어가 가능하게 된다. 본 시스템은 자바 언어로 개발되어 여타 플랫폼에서도 사용 가능할 뿐 아니라 개방형 구조로 설계되어 OEB 이외의 다른 표준에 대해서도 쉽게 확장 가능할 것으로 기대된다.
PDF

A 4-step Inference Method for Natural Language Propositions Involving Fuzzy Quantifiers and Truth Qualifiers

Okamoto, Wataru
- 한국지능시스템학회:학술대회논문집
- /
- 한국퍼지및지능시스템학회 2003년도 ISIS 2003
- /
- pp.579-582
- /
- 2003
In this paper, we propose a 4-step inference method needed for constructing a natural language communication system. The method is used to obtain fuzzy quantifier Q′when QA is Fisr τ⇔ Q′(m′A) is mF is m"is τ is inferred (Q, Q′: quantifiers, A: fuzzy subject, m′, m": modifiers, y: fuzzy predicate, τ: truth qualifier). We show that Q′is resolved step by step for two types of Q, including a non-increasing type (few,...) and a non-decreasing type(most,...).
PDF

코히어런트 PON시스템의 I/Q 진폭불균형 분석 및 보상 (Analysis and Compensation of I/Q Amplitude Imbalance In Coherent PON Systems)

김나영;이승우;박영일
- 한국통신학회논문지
- /
- 제40권10호
- /
- pp.1940-1946
- /
- 2015
차세대 광가입자망시스템에서는 전송속도 및 전송거리 향상을 위해 코히어런트 광전송 시스템이 검토되고 있다. 그런데 이 전송방식의 경우 I/Q 불균형 요인에 의해 전송 성능 저하를 일으킬 수 있으며, 가입자 수신부 내부 구조의 비대칭성은 I/Q 진폭불균형의 주 요인이 될 수 있다. 따라서 안정적인 전송 성능 보장을 위해서는 이런 불균형 성분을 제거하거나 보상해주어야 한다. 본 논문에서는 I/Q 진폭불균형의 원인 및 전송 성능에 미치는 영향을 분석하고, 수신부에서 발생하는 I/Q 진폭불균형 요인을 보상하는 방식을 제시하였다. 또한 시뮬레이션을 통해 제안한 방식의 성능을 보인다.
https://doi.org/10.7840/kics.2015.40.10.1940 인용 PDF KSCI

Visual Analysis of Deep Q-network

Seng, Dewen;Zhang, Jiaming;Shi, Xiaoying
- KSII Transactions on Internet and Information Systems (TIIS)
- /
- 제15권3호
- /
- pp.853-873
- /
- 2021
In recent years, deep reinforcement learning (DRL) models are enjoying great interest as their success in a variety of challenging tasks. Deep Q-Network (DQN) is a widely used deep reinforcement learning model, which trains an intelligent agent that executes optimal actions while interacting with an environment. This model is well known for its ability to surpass skilled human players across many Atari 2600 games. Although DQN has achieved excellent performance in practice, there lacks a clear understanding of why the model works. In this paper, we present a visual analytics system for understanding deep Q-network in a non-blind matter. Based on the stored data generated from the training and testing process, four coordinated views are designed to expose the internal execution mechanism of DQN from different perspectives. We report the system performance and demonstrate its effectiveness through two case studies. By using our system, users can learn the relationship between states and Q-values, the function of convolutional layers, the strategies learned by DQN and the rationality of decisions made by the agent.
https://doi.org/10.3837/tiis.2021.03.003 인용 PDF KSCI HTML

Q+ 실시간 운영체제에서 동작하는 미디어 재생기의 구현 (The Implementation of a Media Player on Q+ Real-time Operating System)

조창식;마평수
- 한국정보처리학회논문지
- /
- 제7권11호
- /
- pp.3509-3518
- /
- 2000
ADSL, ISDN 등과 같은 초고속 인터넷 접속 서비스가 발전함에 따라 일반 가정에서 인터넷을 이용하여 영화나 음악을 감상하는 것이 가능하게 되었다. 또한 정보가전의 활용 범위가 확대됨에 따라 다양한 서비스를 제공하는 정보가전의 개발이 가속화되고 있으며 정보가전을 위한 운영체제 개발 및 실시간 운영체제를 탑재한 단말장치에서의 스트리밍 서비스가 중요한 개발 목표가 되고 있다. 본 논문에서는 실시간 운영체제인 Q+에서 동작하는 미디어 재생기의 구현 기술과 경험에 대하여 설명한다. 미디어 재생기는 서버에서 전송된 MP3, MPEG-1, MPEG-4 데이터를 소프트웨어로 디코딩하여 사용자에게 보여준다. 미디어 재생기는 저가의 CPU가 장착된 디지털 TV 셋탑박스에서 동작하며, Q+ 운영체제의 커널 및 라이브러리를 이용하여 구현되었다. 따라서 하드웨어와 실시간 운영체제의 특성을 고려한 프로그래밍 기법 및 성능 향상 기법이 요구된다. 본 논문에서는 Q+ 운영체제에서 동작하는 미디어 재생기 구현과 관련하여 프로그래밍 상의 기법 및 미디어 재생기의 성능 향상 방법에 대하여 설명한다.
PDF

THE NAVIER-STOKES EQUATIONS WITH INITIAL VALUES IN BESOV SPACES OF TYPE B^-1+3/q_q,_∞

Farwig, Reinhard;Giga, Yoshikazu;Hsu, Pen-Yuan
- 대한수학회지
- /
- 제54권5호
- /
- pp.1483-1504
- /
- 2017
We consider weak solutions of the instationary Navier-Stokes system in a smooth bounded domain ${\Omega}{\subset}{\mathbb{R}}^3$ with initial value $u_0{\in}L^2_{\sigma}({\Omega})$. It is known that a weak solution is a local strong solution in the sense of Serrin if $u_0$ satisfies the optimal initial value condition $u_0{\in}B^{-1+3/q}_{q,s_q}$ with Serrin exponents $s_q$ > 2, q > 3 such that ${\frac{2}{s_q}}+{\frac{3}{q}}=1$. This result has recently been generalized by the authors to weighted Serrin conditions such that u is contained in the weighted Serrin class ${{\int}_0^T}({\tau}^{\alpha}{\parallel}u({\tau}){\parallel}_q)^s$ $d{\tau}$ < ${\infty}$ with ${\frac{2}{s}}+{\frac{3}{q}}=1-2{\alpha}$, 0 < ${\alpha}$ < ${\frac{1}{2}}$. This regularity is guaranteed if and only if $u_0$ is contained in the Besov space $B^{-1+3/q}_{q,s}$. In this article we consider the limit case of initial values in the Besov space $B^{-1+3/q}_{q,{\infty}}$ and in its subspace ${{\circ}\atop{B}}^{-1+3/q}_{q,{\infty}}$ based on the continuous interpolation functor. Special emphasis is put on questions of uniqueness within the class of weak solutions.
https://doi.org/10.4134/JKMS.j160529 인용 PDF KSCI

암반분류법을 이용한 석회석 광산 내 대규격 갱도의 안정성 평가 (Evaluating the Stability of Large-scale Gangways Mined in a Limestone Mine Using Rock Classification Schemes)

윤용균;이홍우
- 터널과지하공간
- /
- 제17권6호
- /
- pp.503-510
- /
- 2007
석회석 광산에 굴착된 대규격 갱도의 안정성을 평가하기 위하여 22곳의 측정지점을 선택한 후 RMR과 Q 분류법을 실시하였다. 측정 대상 갱도가 조사시점까지 안정성에 심각한 문제가 없었다는 점을 고려하면 갱도 폭에 대한 안정성을 평가함에 있어 RMR보다는 Q 분류법이 측정 결과와 부합하는 것으로 나타났다. 갱도의 전체적인 안정성을 평가하기 위하여 수정 Q 분류법의 일종인 확장 안정성 도해법을 적용한 결과 한 곳을 제외하고는 모든 측정 갱도들이 안정한 것으로 평가되었다. RMR과 Q 분류법의 적용 결과를 토대로 하여 대규격 갱도의 최대 무지보 폭과 한계높이를 평가할 수 있는 회귀식을 제안하였다.
PDF KSCI

Initial Slot-Count Selection Scheme with Tag Number Estimation in Gen-2 RFID System

Lim, In-Taek;Ryu, Young-Tae
- Journal of information and communication convergence engineering
- /
- 제8권5호
- /
- pp.519-523
- /
- 2010
In Gen-2 RFID system, the initial value of $Q_{fp}$, which is the slot-count parameter of Q-algorithm, is not defined in the standard. In this case, if the number of tags within the reader's identification range is small and we let the initial $Q_{fp}$ be large, the number of empty slot will be large. On the other hand, if we let the initial $Q_{fp}$ be small in spite of many tags, almost all the slots will be collided. As a result, the performance will be declined because the frame size does not converge to the optimal point quickly during the query round. In this paper, we propose a scheme to allocate the optimal initial $Q_{fp}$ through the tag number estimation before the query round begins. Through computer simulations, it is demonstrated that the proposed scheme achieves more stable performance than Gen-2 Q-algorithm.
https://doi.org/10.6109/jicce.2010.8.5.519 인용 PDF KSCI

검색결과 1,963건 처리시간 0.025초

이메일무단수집거부

이용약관

제 1 장 총칙

제 2 장 이용계약의 체결

제 3 장 계약 당사자의 의무

제 4 장 서비스의 이용

제 5 장 계약 해지 및 이용 제한

제 6 장 손해배상 및 기타사항

자세히 찾기

이미지 검색 (β)