• Title/Abstract/Keywords: I/Q

Search results: 1,719 items (processing time: 0.058 s)

Reinforcement Learning Speedup Method Using Q-value Initialization

  • 최정환
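    • The Institute of Electronics Engineers of Korea: Conference Proceedings / Proceedings of the Institute of Electronics Engineers of Korea 2001 Summer Conference (3) / pp.13-16 / 2001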
  • In reinforcement learning, Q-learning converges quite slowly to a good policy, because searching for the goal state takes a very long time in a large stochastic domain. I therefore propose a speedup method that uses Q-value initialization for model-free reinforcement learning. The method learns a naive model of the domain and builds boundaries around the goal state. Using these boundaries, it assigns initial Q-values to the state-action pairs and then runs Q-learning from those initial Q-values. The initial Q-values guide the agent toward the goal state in the early stages of learning, so that Q-learning updates Q-values efficiently. The method therefore saves the exploration time spent searching for the goal state and performs better than Q-learning. I present the Speedup Q-learning algorithm, which implements the speedup method. The algorithm is evaluated in a grid-world domain and compared to Q-learning.

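The idea of starting Q-learning from informed initial Q-values, as described in the abstract above, can be illustrated with a minimal Python sketch. It is not the paper's Speedup Q-learning algorithm: the abstract does not specify how the naive model or the boundaries are built, so the sketch assumes a deterministic, obstacle-free grid world and replaces the boundary construction with a hypothetical Manhattan-distance heuristic (`distance_based_q`). All function names, the reward of 1 at the goal, and the parameter values are assumptions made for illustration.

```python
import random

# Actions as (row, column) offsets: up, down, left, right.
ACTIONS = [(-1, 0), (1, 0), (0, -1), (0, 1)]


def step(state, action, size):
    """Move in a size x size grid; moves into a wall leave the coordinate unchanged."""
    row = min(max(state[0] + action[0], 0), size - 1)
    col = min(max(state[1] + action[1], 0), size - 1)
    return (row, col)


def zero_q(size):
    """Plain Q-learning baseline: every Q-value starts at zero."""
    return {((r, c), a): 0.0
            for r in range(size) for c in range(size)
            for a in range(len(ACTIONS))}


def distance_based_q(size, goal, gamma):
    """ASSUMPTION: stand-in for the paper's boundary-based initialization.

    Each state-action pair gets gamma ** d, where d is the Manhattan
    distance from the resulting state to the goal.
    """
    q = {}
    for r in range(size):
        for c in range(size):
            for a, action in enumerate(ACTIONS):
                nr, nc = step((r, c), action, size)
                q[((r, c), a)] = gamma ** (abs(nr - goal[0]) + abs(nc - goal[1]))
    return q


def q_learning(size, start, goal, q, episodes, alpha=0.5, gamma=0.9,
               epsilon=0.1, max_steps=200, seed=0):
    """Run epsilon-greedy Q-learning in place on q; return steps taken per episode."""
    rng = random.Random(seed)
    steps_per_episode = []
    for _ in range(episodes):
        state, steps = start, 0
        while state != goal and steps < max_steps:
            if rng.random() < epsilon:
                a = rng.randrange(len(ACTIONS))
            else:
                a = max(range(len(ACTIONS)), key=lambda i: q[(state, i)])
            nxt = step(state, ACTIONS[a], size)
            reward = 1.0 if nxt == goal else 0.0
            # The goal is terminal, so nothing is bootstrapped from it.
            best_next = 0.0 if nxt == goal else max(
                q[(nxt, i)] for i in range(len(ACTIONS)))
            q[(state, a)] += alpha * (reward + gamma * best_next - q[(state, a)])
            state, steps = nxt, steps + 1
        steps_per_episode.append(steps)
    return steps_per_episode


# Compare the two initializations on a 5 x 5 grid.
plain = q_learning(5, (0, 0), (4, 4), zero_q(5), episodes=50)
informed = q_learning(5, (0, 0), (4, 4), distance_based_q(5, (4, 4), 0.9), episodes=50)
print("first-episode steps, zero init:", plain[0])
print("first-episode steps, informed init:", informed[0])
```

In this toy setting the distance heuristic happens to coincide with the optimal Q-values, so the comparison only shows the direction of the effect. The paper's stochastic domain and learned boundaries would behave less ideally.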

Polynomials satisfying f(x+a)=f(x)+c over finite fields

  • Park, Hong-Goo
    • Bulletin of the Korean Mathematical Society / Vol. 29, No. 2 / pp.277-283 / 1992
  • Let GF(q) be a finite field with q elements, where $q=p^{n}$ for a prime number p and a positive integer n. Consider an arbitrary function $\phi$ from GF(q) into GF(q). By Lagrange's interpolation formula, $\phi$ can be represented by a polynomial that is congruent (mod $x^{q}-x$) to a unique polynomial over GF(q) of degree < q. In [3], Wells characterized all polynomials over a finite field that commute with translations. Mullen [2] generalized the characterization to linear polynomials over finite fields, i.e., he characterized all polynomials f(x) over GF(q) for which deg(f) < q and $f(bx+a)=b\cdot f(x)+a$ for fixed elements a and b of GF(q) with $a \neq 0$. From those papers, a natural question to ask (though a difficult one to answer) is: what is the explicit form of f(x) with zero terms? In this paper we obtain the exact form (together with zero terms) of a polynomial f(x) over GF(q) that satisfies deg(f) < $p^{2}$ and (1) f(x+a)=f(x)+c for fixed nonzero elements a and c in GF(q).

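Condition (1) in the abstract above can be checked by brute force in the simplest setting. The Python sketch below covers only the prime field GF(p) (n = 1) and polynomials of degree < p, which is far narrower than the paper's result for GF(q) and deg(f) < $p^{2}$. The function names and the choice p = 3, a = 1, c = 2 are illustrative assumptions, not taken from the paper.

```python
from itertools import product


def evaluate(coeffs, x, p):
    """Evaluate a polynomial (coefficients from low to high degree) at x mod p by Horner's rule."""
    result = 0
    for coeff in reversed(coeffs):
        result = (result * x + coeff) % p
    return result


def translation_solutions(p, a, c):
    """List all polynomials of degree < p over GF(p) with f(x+a) = f(x) + c for every x.

    ASSUMPTION: p is prime, so the integers mod p form the field GF(p).
    Because deg(f) < p, agreement at all p points implies the polynomial identity.
    """
    solutions = []
    for coeffs in product(range(p), repeat=p):
        if all(evaluate(coeffs, (x + a) % p, p) == (evaluate(coeffs, x, p) + c) % p
               for x in range(p)):
            solutions.append(coeffs)
    return solutions


# Coefficient tuples (constant term first) for p = 3, a = 1, c = 2.
print(translation_solutions(3, 1, 2))
```

For p = 3, a = 1, c = 2 the search returns exactly the three polynomials f(x) = 2x + b with b in GF(3), that is, f(x) = (c/a)x + b. In a prime field the translation by a nonzero a reaches every element, so f is fixed by its value at 0. The richer structure the paper describes only appears in proper extension fields.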

A NOTE ON CYCLOTOMIC UNITS IN FUNCTION FIELDS

  • Jung, Hwanyup
    • Journal of the Chungcheong Mathematical Society / Vol. 20, No. 4 / pp.433-438 / 2007
  • Let $\mathbb{A}=\mathbb{F}_q[T]$ and $k=\mathbb{F}_q(T)$. Assume $q$ is odd, and fix a prime divisor $\ell$ of $q-1$. Let $P$ be a monic irreducible polynomial in $\mathbb{A}$ whose degree $d$ is divisible by $\ell$. In this paper we define a subgroup $\tilde{C}_F$ of $\mathcal{O}^*_F$ which is generated by $\mathbb{F}^*_q$ and $\{\eta^{\tau^i}:0\leq i\leq \ell-1\}$ in $F=k(\sqrt[\ell]{P})$ and calculate the unit-index $[\mathcal{O}^*_F:\tilde{C}_F]=\ell^{\ell-2}h(\mathcal{O}_F)$. This is a generalization of [3, Theorem 16.15].


Plant Community Structure Analysis of Lindera erythrocarpa Native Forests in Central Korea (I)

  • 이동철;심경구;최송현;이경재
    • Journal of the Korean Institute of Landscape Architecture / Vol. 22, No. 2 / pp.133-157 / 1994
  • This study was carried out to determine the successional stage and ecological niche of Lindera erythrocarpa Makino. Four sites were selected by field investigation: Jeondungsa and Jeongsusa on Kanghwa Island, Mt. Suri in Anyang, and Mt. Gaya in Chungcheongnamdo. They are located in a region whose temperature is similar to that of the Seoul region or whose average winter temperature is lower than that of adjacent Seoul. Across the four sites, L. erythrocarpa appeared in the canopy layer in the L. erythrocarpa community at Jeondungsa, the L. erythrocarpa-Q. serrata and Z. serrata-L. erythrocarpa communities at Jeongsusa, and the Castanea crenata-L. erythrocarpa and L. erythrocarpa-Q. serrata communities at Mt. Gaya; in the rest of the sites it occurs in the subtree and shrub layers. At all four sites except the Jeongsusa area, the results agree with Chang (1991), who found that L. erythrocarpa is a dominant species at sites affected by human activity. L. erythrocarpa grows together with Quercus spp. such as Q. serrata, Q. variabilis, and Q. mongolica, and with Carpinus laxiflora, but this is presumably a transient phenomenon.


Differential Quadrature Analysis for Vibration of Wide-Flange Curved Beams

  • Ji-Won Han;Ki-Jun Kang
    • Journal of the Korean Society of Safety / Vol. 13, No. 3 / pp.163-170 / 1998
  • The differential quadrature method (D.Q.M.) is applied to the out-of-plane free vibration analysis of I-section curved beams, including warping, and frequencies are calculated for various boundary conditions and opening angles. The D.Q.M. results are compared with exact solutions or with other numerical results (Rayleigh-Ritz or FEM), and the D.Q.M. gives accurate results using only a small number of grid points.

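The core of a differential quadrature method is a weighting matrix that approximates a derivative at each grid point as a weighted sum of function values at all grid points. The Python sketch below builds first-derivative weights from Lagrange interpolation polynomials and checks them on a cubic. It does not reproduce the curved-beam equations of the paper above. The Chebyshev-Gauss-Lobatto grid, the function names, and the test function are assumptions chosen for illustration, since the abstract does not state which grid or weight formula the authors used.

```python
import math


def chebyshev_gauss_lobatto(n, lo=0.0, hi=1.0):
    """Return n grid points on [lo, hi], clustered toward both ends (ASSUMED grid choice)."""
    return [lo + (hi - lo) * 0.5 * (1.0 - math.cos(math.pi * i / (n - 1)))
            for i in range(n)]


def first_derivative_weights(x):
    """Build the matrix a with f'(x_i) ~ sum_j a[i][j] * f(x_j).

    Off-diagonal entries come from differentiating the Lagrange basis
    polynomials; each diagonal entry makes its row sum to zero, so the
    derivative of a constant is exactly zero.
    """
    n = len(x)
    # m[i] is the product of (x_i - x_k) over all k != i.
    m = [math.prod(x[i] - x[k] for k in range(n) if k != i) for i in range(n)]
    a = [[0.0] * n for _ in range(n)]
    for i in range(n):
        for j in range(n):
            if i != j:
                a[i][j] = m[i] / ((x[i] - x[j]) * m[j])
        a[i][i] = -sum(a[i][j] for j in range(n) if j != i)
    return a


def differentiate(a, values):
    """Apply the weighting matrix to function values sampled on the grid."""
    return [sum(w * v for w, v in zip(row, values)) for row in a]


# Seven grid points differentiate x**3 exactly up to rounding error.
grid = chebyshev_gauss_lobatto(7)
weights = first_derivative_weights(grid)
approx = differentiate(weights, [xi ** 3 for xi in grid])
max_error = max(abs(d - 3 * xi ** 2) for d, xi in zip(approx, grid))
print("max error for d/dx x^3:", max_error)
```

With n grid points the weights are exact for polynomials up to degree n - 1, which is one reason a small grid can give accurate results. Higher-order derivatives, such as those in beam equations, can be obtained by multiplying the first-derivative matrix by itself.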

A Study on the Ultimate Point Resistance of Rock Socketed Drilled Shafts Using FLAC3D and UDEC

  • 이재환;조후연;유광호;정상섬
    • Journal of the Korean Geotechnical Society / Vol. 28, No. 1 / pp.29-39 / 2012
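  • This study uses numerical analysis to examine the main factors that influence the end bearing capacity (ultimate point resistance) of rock-socketed drilled shafts and how the end bearing capacity varies with these factors. To improve accuracy, the numerical analysis combined the finite difference method (FDM), a widely used continuum approach, with the distinct element method (DEM), a discontinuum approach that can account for the characteristics of discontinuities in the rock mass (joints, faults, etc.). The results show that the end bearing capacity ($q_{max}$) of drilled shafts socketed in rock increases in proportion to the elastic modulus of the rock mass ($E_m$) and the spacing of discontinuities ($S_j$), and is inversely proportional to the pile diameter ($D$). For discontinuity inclinations ($i_j$) in the range $0^{\circ} < i_j < 60^{\circ}$, the end bearing capacity was reduced by up to about 50% compared with that at other inclinations; this is judged to be because the load transmitted from the pile caused shear failure along the discontinuities before failure of the rock mass itself beneath the pile. The end bearing capacity was close to its minimum when the discontinuity inclination ($i_j$) approached the internal friction angle of the discontinuity ($\phi_j$). Therefore, when the discontinuity inclination lies within $20^{\circ}$ to $40^{\circ}$, the typical range of internal friction angles for rock masses and rock discontinuities, the effect of discontinuity inclination must be considered when estimating the end bearing capacity.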