DOI QR코드

DOI QR Code

Reinforcement Learning-based Approach for Lego Puzzle Generation

강화학습을 이용한 레고 퍼즐 생성 기술 개발

  • 박철성 (상명대학교 컴퓨터과학과) ;
  • 양희경 (상명대학교 SW융합학부) ;
  • 민경하 (상명대학교 컴퓨터과학과)
  • Received : 2020.03.10
  • Accepted : 2020.06.08
  • Published : 2020.06.20

Abstract

We present a reinforcement learning-based framework for generating 2D Lego puzzle from input pixel art images. We devise heuristics for a proper Lego puzzle as stability and efficiency. We also design a DQN structure and train it to maximize the heuristics of 2D Lego puzzle. In legorization stage, we complete the layout of Lego puzzle by adding a Lego brick to the input image using the trained DQN. During this process, we devise a region of interest to reduce the computational loads of the legorization. Using this approach, our framework can present a very high resolutional Lego puzzle.

2D 레고 퍼즐은 레고 브릭을 이용해서 다양한 영상을 완성하는 퍼즐로 많은 사람들의 사랑을 받고 있다. 본 연구에서는 입력된 픽셀 아트 영상으로부터 강화학습에 기반한 2D 레고 퍼즐을 구성하고 완성하는 방법을 제안한다. 먼저, 학습 단계에서는 바람직한 레고 퍼즐에 대한 휴리스틱을 안정성과 효율성으로 설정하고 이를 최대한 만족시키는 방향으로 DQN을 학습한다. 그리고, 레고화 단계에서는 이 DQN을 이용해서 실제 레고 브릭을 추가해가면서 퍼즐의 구성도를 완성하는 과정을 수행한다. 이 과정을 통해서 지금까지 기술로는 수행하기 힘들었던 매우 높은 해상도의 레고 퍼즐을 효율적으로 완성하는 기술을 제공한다.

Keywords

References

  1. R. Gower, A. Heydtmann, and H. Petersen, "Lego: Automated model construction," in Proc. of 32nd European Study Group with Industry 1998, 1998, pp. 81-94.
  2. P. Petrovic, "Solving lego brick layout problem using evolutionary algorithms," in Proc. of Norwegian Conference on Computer Science, 2001, pp. 87-97.
  3. L. van Zijl and E. Smal, "Cellular automata with cell clustering," in Proc. of Automata 2008, 2008, pp. 425-440.
  4. R. Testuz, Y. Schwartzburg, and M. Pauly, "Automatic generation of constructable brick sculptures," in Proc. of Eurographics (Short Papers) 2013, 2013, pp. 81-84.
  5. S. Ono, A. Andre, and Y. Chang, "Automatic generation of lego from the polygonal data," in Proc. of International workshop on advanced image technology 2013, 2013, pp. 262-267.
  6. S. Lee, J. Kim, J. W. Kim, and B. R. Moon, "Finding an optimal lego brick layout of voxelized 3d object using a genetic algorithm," in Proc. of the 2015 Annual Conference on Genetic and Evolutionary Computation, 2015, pp. 1215-1222.
  7. M. Zhang, Y. Igarashi, Y. Kanamori, and J. Mitani, "Designing miniblock artwork from colored mesh," in Proc. of Smart Graphics 2015, 2015, p. 2.
  8. G. Yun, C. Park, H. Yang, and K. Min, "Legorization with multiheight bricks from silhouette-fitted voxelization," in Proc. of Computer Graphics International 1986 (Short paper), 2017, p. 40.
  9. S. J. Luo, Y. Yue, C. K. Huang, Y. H. Chung, S. Imai, T. Nishita, and B. Y. Chen, "Legolization: Optimizing lego designs," ACM Transactions on Graphics, vol. 34, no. 6, p. 222, 2015.
  10. T. Kosaki, H. Tedenuma, and T. Maekawa, "Automatic generation of lego building instructions from multiple photographic images of real objects," Computer-Aided Design, vol. 70, no. C, pp. 13-22, 2016. https://doi.org/10.1016/j.cad.2015.06.020
  11. M. Zhang, J. Mitani, Y. Kanamori, and Y. Fukui, "Blocklizer: Interactive design of stable mini block artwork," in Proc. of ACM SIGGRAPH 2014 Posters, 2014, p. 18.
  12. M. Kuo, Y. Lin, H. Chu, R. Lee, and Y. Yang, "Pixel2brick: Constructing brick sculptures from pixel art." Computer Graphics Forum, vol. 34, no. 7, pp. 339-348, 2015. https://doi.org/10.1111/cgf.12772
  13. Reinforcement Learning, https://en.wikipedia.org/wiki/Reinforcement_learning.
  14. V. Mnih, K. Kavukcuoglu, D. Silver, A. Rusu, J. Veness, M. Bellemare, A. Graves, M. Riedmiller, A. Fidjeland, G. Ostrovski, S. Petersen, C. Beattie, A. Sadik, I. Antonoglou, H. King, D. Kumaran, D. Wierstra, S. Legg, and D. Hassabis, "Human-level control through deep reinforcement learning,"Nature, Vol. 518, No. 7540, pp. 529-533, 2015. https://doi.org/10.1038/nature14236
  15. H. Hasselt, A. Guez and D. Silver, "Deep reinforcement learning with double Q-learning," Proc. of AAAI conference on Artificial Intelligence 2016, pp. 2094-2100, 2016.
  16. T. Schaul, J. Quan, I. Antonoglou, and D. Silver, "Prioritized experience replay," Proc. of International Conference on Learning Representation (ICLR) 2016.
  17. Bellemare, M., Dabney, W., and Munos, R., "A distributional perspective on reinforcement learning," Proc. of International Conference on Machine Learning 2017, pp. 449-458, 2017.