Acknowledgement
이 논문은 2021년도 정부(과학기술정보통신부)의 재원으로 정보통신기획평가원의 지원을 받아 수행된 연구임 (No. 2017-0-00072, 초실감 테라미디어를 위한 AV 부호화 및 LF 미디어 원천기술 개발).
References
- Thomas Wiegand, Gary J Sullivan, Gisle Bjontegaard, and Ajay Luthra, "Overview of the h. 264/avc video coding standard," IEEE Transactions on circuits and systems for video technology, 13(7):560-576, 2003. https://doi.org/10.1109/TCSVT.2003.815165
- Gary J Sullivan, Jens-Rainer Ohm, Woo-Jin Han, and Thomas Wiegand, "Overview of the high efficiency video coding (hevc) standard," IEEE Transactions on circuits and systems for video technology, 22(12):1649-1668, 2012. https://doi.org/10.1109/TCSVT.2012.2221191
- Jens-Rainer Ohm and Gary J Sullivan, "Versatile video coding-towards the next generation of video compression," In Picture Coding Symposium, volume 2018, 2018.
- Johannes Balle, David Minnen, Saurabh Singh, Sung Jin Hwang, and Nick Johnston, "Variational image compression with a scale hyperprior," In International Conference on Learning Representations, 2018.
- Zhenhong Sun, Zhiyu Tan, Xiuyu Sun, Fangyi Zhang, Dongyang Li, Yichen Qian, Hao Li, "Spatiotemporal Entropy Model is All You Need f or Learned Video Compression," arXiv, 2021, https://arxiv.org/abs/2104.06083 (accessed Oct. 24, 2021).
- F. Bellard, BPG image format, http://bellard.org/bpg/ (accessed: Jan. 30, 2017).
- Z. Wang, E. P. Simoncelli, and A. C. Bovik, "Multiscale structural similarity for imagequality assessment," in Signals, Systems and Computers, 2004. Conference Record of the Thirty-Seventh Asilomar Conference on, IEEE, vol. 2, 2003, pp. 1398-1402
- Johannes Balle, Valero Laparra, and Eero P. Simoncelli, "End-to-end optimized image compression," In International Conference on Learning Representations, 2017.
- David Minnen, Johannes Balle, and George D Toderici, "Joint autoregressive and hierarchical priors for learned imagecompression," In Advances in Neural Information Processing Systems, pages 10771-10780, 2018.
- Aaron Van den Oord, Nal Kalchbrenner, Lasse Espeholt, Oriol Vinyals, Alex Graves, et al., "Conditional image generation with PixelCNN decoders," In Advances in neural information processing systems, pages 4790-4798, 2016.
- Zhengxue Cheng, Heming Sun, Masaru Takeuchi, and Jiro Katto, "Learned image compression with discretized gaussian mixture likelihoods and attention modules," In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pages 7939-7948, 2020.
- Z. Cheng, H. Sun, M. Takeuchi, J. Katto, "Deep ResidualLearning for Image Compression," CVPR Workshop, pp. 1-4, June 16-20, 2019.
- Y.Zhang, K. Li, K. Li, B. Zhong, Y. Fu, "Residual Nonlocal Attention Networks for Image Restoration," International Conference on Learning Representations, pp. 1-18, 2019
- Reynolds, Douglas. (2008), "Gaussian Mixture Models," Encyclopedia of Biometrics, 10.1007/978-0-387-73003-5_196.
- Guo Lu, Wanli Ouyang, Dong Xu, Xiaoyun Zhang, Chunlei Cai, and Zhiyong Gao, "DVC: An end-to-end deep video compression framework," In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pages, 11006-11015, 2019.
- Anurag Ranjan and Michael J Black, "Optical flow estimation using a spatial pyramid network," In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pages 4161-4170, 2017.
- Abdelaziz Djelouah, Joaquim Campos, Simone Schaub-Meyer, and Christopher Schroers, "Neural inter-frame compression for video coding," In Proceedings of the IEEE International Conference on Computer Vision, pages 6421-6429, 2019.
- Ren Yang, Fabian Mentzer, Luc Van Gool, and Radu Timofte, "Learning for video compression with hierarchical quality and recurrent enhancement," In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pages 6628-6637, 2020.
- Eirikur Agustsson, David Minnen, Nick Johnston, Johannes Balle, Sung Jin Hwang, and George Toderici, "Scale-space flow for end-to-end optimized video compression," In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pages 8503-8512, 2020.
- Fabian Mentzer, Eirikur Agustsson, Johannes Balle, David Minnen, Nick Johnston and George Toderici, "Towards Generative Video Compression," arXiv, 2021, https://arxiv.org/abs/2107.12038 (accessed Aug. 26, 2021).
- Goodfellow, Ian J., Jean Pouget-Abadie, Mehdi Mirza, Bing Xu, David Warde-Farley, Sherjil Ozair, Aaron Courville and Yoshua Bengio, "Generative Adversarial Nets," NeurIPS, 2014.
- Fabian Mentzer, George D Toderici, Michael Tschannen, and Eirikur Agustsson, "High-fidelity generative image compression," Advances in Neural Information Processing Systems, 33, 2020
- David Minnen, Johannes Balle, and George Toderici, "Joint autoregressive and hierarchical priors for learned image compression," In Advances in Neural Information Processing Systems, pages 10771-10780, 2018.
- Yoojin Choi, Mostafa El-Khamy, and Jungwon Lee, "Variable rate deep image compression with a conditional autoencoder," In Proceedings of the IEEE International Conference on Computer Vision, pages 3146-3154, 2019.
- Compressai, https://interdigitalinc.github.io/CompressAI/index.html# (accessed Nov. 24, 2021).
- Tianfan Xue, Baian Chen, Jiajun Wu, Donglai Wei, and William T Free man, "Video enhancement with task-oriented flow," International Journal of Computer Vision, 127(8):1106-1125, 2019. https://doi.org/10.1007/s11263-018-01144-2
- Diederik P Kingma and Jimmy Ba, "Adam: A method for stochastic optimization," arXiv preprint 2014, https://arxiv.org/abs/2107.12038 (accessed Nov. 24, 2021).
- Alexandre Mercat, Marko Viitanen, and Jarno Vanne, "Uvg dataset: 50/120fps 4k sequences for video codec analysis anddevelopment," In Proceedings of the 11th ACM Multimedia Systems Conference, pages 297-302, 2020.
- Oren Rippel, Alexander G. Anderson, Kedar Tatwawadi, Sanjay Nair, Craig Lytle and Lubomir Bourdev, "ELF-VC: Efficient Learned Flexible-Rate Video Coding," arXiv preprint, 2021, https://arxiv.org/abs/2104.14335 (accessed Nov. 24, 2021).