Search | Korea Science

Block-based Learned Image Compression for Phase Holograms (신경망 기반 블록 단위 위상 홀로그램 이미지 압축)

Seung Mi Choi;Su yong Bahk;Hyun Min Ban;Jun Yeong Cha;Hui Yong Kim
- Journal of Broadcast Engineering / v.28 no.1 / pp.42-54 / 2023
Compressing the large volume of holographic data in digital form is an important problem, and the compression of phase-only holograms is particularly relevant for commercialization. Conventional video coding standards are optimized for natural images and are poorly suited to phase signals. Neural network-based compression models can be optimized for phase signals and achieve high performance, but training them on high-resolution holographic data runs into memory limits. In this paper, we apply a block-based learned image compression model, which avoids the memory problem, to phase-only holograms and show that it significantly outperforms standard codecs even when they operate under the same block-based conditions. A block-based learned compression model thus offers compatibility with conventional standard codecs, avoids the memory problem, and compresses phase-only holograms significantly better.
https://doi.org/10.5909/JBE.2023.28.1.42
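The block-based idea in this abstract can be illustrated with a minimal sketch. It assumes the hologram is a 2D list of phase values whose dimensions are multiples of the block size; the function names `split_into_blocks` and `merge_blocks` are hypothetical and are not taken from the paper. The learned codec itself is not shown.

```python
def split_into_blocks(image, block_size):
    """Split a 2D list into non-overlapping block_size x block_size tiles.

    Assumes both image dimensions are multiples of block_size.
    Returns a list of ((top, left), block) pairs in raster order.
    """
    height, width = len(image), len(image[0])
    blocks = []
    for top in range(0, height, block_size):
        for left in range(0, width, block_size):
            block = [row[left:left + block_size]
                     for row in image[top:top + block_size]]
            blocks.append(((top, left), block))
    return blocks


def merge_blocks(blocks, height, width):
    """Reassemble tiles produced by split_into_blocks into a full image."""
    image = [[0] * width for _ in range(height)]
    for (top, left), block in blocks:
        for dy, row in enumerate(block):
            # Write each tile row back at its original position.
            image[top + dy][left:left + len(row)] = row
    return image
```

Because each tile would be encoded and decoded on its own, peak memory depends on the block size rather than on the full hologram resolution.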

In-Loop Filtering with a Deep Network in HEVC (깊은 신경망을 사용한 HEVC의 루프 내 필터링)

Kim, Dongsin;Lee, So Yoon;Yang, Yoonmo;Oh, Byung Tae
- Proceedings of the Korean Society of Broadcast Engineers Conference / 2020.11a / pp.145-147 / 2020
As deep learning technology advances, there have been many attempts to improve video codecs such as High Efficiency Video Coding (HEVC) using deep learning. One of the most researched approaches is to improve the filters inside codecs using results from image restoration research. In this paper, we propose a method of replacing the sample adaptive offset (SAO) filtering with a deep neural network. The proposed method uses the deep neural network to find the optimal offset value. The proposed network consists of two subnetworks that find the offset value and the type of the signal, which allows it to restore nonlinear and complex types of error. Experimental results show that the performance is better than conventional HEVC in the low-delay P and random-access modes.
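For context, the conventional SAO step that the network replaces can be sketched as follows. This is a simplified band-offset illustration, not the paper's network: it assumes the sample range is divided into 32 equal bands and that offsets are applied to a run of consecutive bands starting at `start_band`. The function name and parameters are hypothetical.

```python
def sao_band_offset(samples, start_band, offsets, bit_depth=8):
    """Add a per-band offset to samples whose band falls in the signaled run.

    The sample range is split into 32 equal bands; offsets[i] applies to
    band start_band + i. Samples in other bands pass through unchanged.
    """
    shift = bit_depth - 5              # 2**5 = 32 bands
    max_val = (1 << bit_depth) - 1
    out = []
    for s in samples:
        idx = (s >> shift) - start_band
        if 0 <= idx < len(offsets):
            # Clip so the corrected sample stays in the valid range.
            s = min(max(s + offsets[idx], 0), max_val)
        out.append(s)
    return out
```

In the proposed method, a network would supply the offset values instead of the encoder's rate-distortion search; that part is not modeled here.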

A New Fast Variable Length Decoding Method Based on the Probabilistic Distribution of Symbols in a VLC Table (확률분포기반 고속 가변장 복호화 방법)

김은석;채병조;오승준
- Proceedings of the IEEK Conference / 2001.06d / pp.41-44 / 2001
Variable length coding (VLC) is used in many well-known standard video coding algorithms such as MPEG and H.26x. However, VLC cannot be processed in parallel because it is inherently sequential. This sequentiality is a major barrier to implementing a real-time software video codec, since parallel schemes cannot be applied. In this paper, we propose a new fast variable length decoding (VLD) method based on the probabilistic distribution of symbols in the VLC tables used in MPEG and H.263 standard codecs. Although MPEG suggests the table partitioning method, it does not show theoretically why the number of partitioned tables should be two or three. We suggest a method for deciding the number of partitioned tables. Applying our scheme to several well-known MPEG-2 test sequences reduces the computation time by up to about 10% without sacrificing video quality.
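Table partitioning can be sketched with a two-level lookup. The prefix code below (A=0, B=10, C=110, D=1110, E=1111) is a hypothetical example, not an MPEG table, and the split into a 2-bit primary table and a 2-bit secondary table is an assumption chosen for illustration. The input is assumed to be a well-formed bit string.

```python
PRIMARY_BITS = 2
SECONDARY_BITS = 2

# Each entry is (symbol, code length in bits). None marks an escape to the
# secondary table, which is indexed by the bits after the "11" prefix.
PRIMARY = {"00": ("A", 1), "01": ("A", 1), "10": ("B", 2), "11": None}
SECONDARY = {"00": ("C", 3), "01": ("C", 3), "10": ("D", 4), "11": ("E", 4)}


def decode(bits):
    """Decode a string of '0'/'1' characters with two partitioned tables."""
    symbols = []
    pos = 0
    # Pad so that peeking past the end of the stream never fails.
    padded = bits + "0" * (PRIMARY_BITS + SECONDARY_BITS)
    while pos < len(bits):
        entry = PRIMARY[padded[pos:pos + PRIMARY_BITS]]
        if entry is None:
            start = pos + PRIMARY_BITS
            entry = SECONDARY[padded[start:start + SECONDARY_BITS]]
        symbol, length = entry
        symbols.append(symbol)
        pos += length                  # consume only the bits of this code
    return symbols
```

Short codes are the most probable ones, so most symbols resolve in the small primary table; how many tables to use is the question the paper addresses.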

Objective measurement of spatial auditory quality for multi channel audio codecs (멀티채널 오디오 압축 코덱 음질의 객관적인 측정방법)

Choi, In-Yong;Chon, Sang-Bae;Sung, Koeng-Mo
- Proceedings of the IEEK Conference / 2005.11a / pp.431-434 / 2005
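This paper concerns a system and parameters for objectively evaluating the sound quality of multichannel audio compression codecs. It describes a preprocessing stage that generates ear input signals from a multichannel audio signal, a procedure that derives the inter-aural level difference (ILD) distortion from those ear input signals, and evidence that the ILD distortion correlates consistently with listening test results. According to this study, the sound quality of multichannel audio compression codecs can be evaluated by objective measurement alone, without subjective evaluation by selected listeners or statistical processing. With this method, codec developers can easily evaluate the sound quality of their own codecs without the burden of time and cost.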
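A minimal sketch of an inter-aural level difference (ILD) distortion measure follows. It assumes the ear input signals are already available as lists of samples and uses a single broadband energy ratio; the paper's actual preprocessing, and any per-band or per-frame analysis, is not specified here, so the function names and the `eps` guard are assumptions.

```python
import math


def ild_db(left, right, eps=1e-12):
    """Inter-aural level difference in dB from left/right ear signals."""
    energy_left = sum(x * x for x in left)
    energy_right = sum(x * x for x in right)
    # eps avoids division by zero and log of zero for silent inputs.
    return 10.0 * math.log10((energy_left + eps) / (energy_right + eps))


def ild_distortion(ref_left, ref_right, test_left, test_right):
    """Absolute ILD change between the reference and the coded signal."""
    return abs(ild_db(ref_left, ref_right) - ild_db(test_left, test_right))
```

A coded signal that preserves the left/right level balance yields a distortion near 0 dB, while a codec that collapses the spatial image yields a larger value.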

Design of Subband Codecs Using Optimized Vector Quantizer

Jee, Innho
- The Journal of the Acoustical Society of Korea / v.15 no.2E / pp.33-38 / 1996
This paper provides an approach for representing an optimum vector quantizer by a scalar nonlinear gain-plus-additive-noise model. The validity and accuracy of this analytic model are confirmed by comparing the calculated model quantization errors with actual simulation of the optimum Linde-Buzo-Gray (LBG) vector quantizer. Using this model, we form the MSE measure of an M-band filter bank codec in terms of the equivalent scalar quantization model and find the optimum FIR filter coefficients for each channel in the M-band structure for a given bit rate, filter length, and input signal correlation model. Specific design examples are worked out for 4-tap filters in the two-band paraunitary case. These theoretical results are confirmed by extensive Monte Carlo simulation.

Stereoscopic Sequence Coding Using MPEG-2 MVP (MPEG-2 MVP를 이용한 스테레오 동영상부호화)

배태면;권동현;한규필;하영호
- Proceedings of the IEEK Conference / 1998.10a / pp.143-146 / 1998
A new stereoscopic codec structure using the MPEG-2 multiview profile is presented in this paper. In the suggested codec, the left image is coded with motion estimation in the base layer and the right image is coded with disparity estimation in the enhancement layer. Since the rough motion of the right image sequence can be calculated from the disparity and the motion of the left image sequence, motion compensation in the enhancement layer is performed without motion estimation. Because the proposed codec does not perform motion estimation in enhancement layer encoding, it is simple and reduces the encoding time. We compared the PSNR of the encoded images with that of three differently structured codecs, and the experimental results show that the suggested codec is comparable to the other codecs.
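The PSNR comparison mentioned in this abstract relies on the standard formula 10 log10(MAX^2 / MSE). A minimal sketch, assuming 8-bit images stored as 2D lists of equal size (the function name is illustrative):

```python
import math


def psnr(reference, decoded, max_value=255):
    """Peak signal-to-noise ratio in dB between two equally sized 2D images."""
    squared_errors = [(a - b) ** 2
                      for ref_row, dec_row in zip(reference, decoded)
                      for a, b in zip(ref_row, dec_row)]
    mse = sum(squared_errors) / len(squared_errors)
    if mse == 0:
        return math.inf                # identical images
    return 10.0 * math.log10(max_value ** 2 / mse)
```

For a stereoscopic codec, this would typically be computed separately for the left and right views.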

Optimized Wiener Filter for Noise Reduction in VoIP Environments (VoIP 환경에서의 잡음제거를 위한 최적화된 위너 필터)

Jeong, Sang-Bae;Lee, Sung-Doke;Hahn, Min-Soo
- MALSORI / no.64 / pp.105-119 / 2007
Noise reduction technologies are indispensable for achieving acceptable speech quality in VoIP systems. This paper proposes a Wiener filter optimized to the estimated SNR of noisy speech for noise reduction in VoIP environments. The proposed noise canceller is applied as a pre-processor before speech encoding. The performance of the proposed method is evaluated by PESQ in various noisy conditions. The proposed algorithm is applied to G.711, G.723.1, and G.729A, which are all VoIP speech codecs. The PESQ results show that the proposed noise reduction scheme outperforms the noise suppression in the IS-127 EVRC and in the ETSI standard for the advanced distributed speech recognition front-end.
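The basic Wiener gain that such a pre-processor builds on can be sketched as follows. This uses the textbook form G = SNR / (1 + SNR) per frequency bin; the SNR estimator and the optimization described in the paper are not reproduced, and the function names are hypothetical.

```python
def wiener_gain(snr_db):
    """Textbook Wiener gain for one frequency bin, given its SNR in dB."""
    snr = 10 ** (snr_db / 10)          # dB to linear power ratio
    return snr / (1 + snr)


def apply_wiener(noisy_spectrum, snr_db_per_bin):
    """Scale each spectral bin by the Wiener gain for its estimated SNR."""
    return [wiener_gain(snr_db) * value
            for value, snr_db in zip(noisy_spectrum, snr_db_per_bin)]
```

Bins with high estimated SNR pass almost unchanged, while bins dominated by noise are attenuated toward zero.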

Coding Efficiency Comparison between Next Generation Video Codecs: HEVC vs VP9 (차세대 동영상 코덱 압축 효율 비교: HEVC vs VP9)

Kim, Il-Koo
- Proceedings of the Korean Society of Broadcast Engineers Conference / 2013.06a / pp.176-179 / 2013
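This paper compares the compression efficiency of High Efficiency Video Coding (HEVC), whose standardization was completed by JCT-VC in January 2013, with that of VP9, which Google plans to finish developing in June 2013. HEVC was designed for high-quality broadcasting such as UHD, whereas VP9 was designed for Internet video streaming services such as YouTube. Unlike HEVC, VP9 aims to be royalty-free and is being developed as open source. This paper introduces the design differences between HEVC and VP9 and compares their compression efficiency in the Random Access (RA) and Low Delay (LD) configurations. According to the experimental results, in the random access configuration, which will be widely used in broadcasting and packaged media, VP9 is 32.7% worse than HEVC. In the low delay configuration, used for applications such as video conferencing, VP9 is 26.7% worse than HEVC. Because VP9 development is not yet complete, its compression efficiency is expected to improve.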

Weighting Method based on Motion Information for Objective Video Quality Assessment (객관적 영상 화질 평가 기준를 위한 움직임 정보에 따른 중요도 결정 기법)

Park, Su-Young;Kim, Tae-Wan;Lee, Sang-Hoon
- Proceedings of the IEEK Conference / 2008.06a / pp.909-910 / 2008
To evaluate codec performance, many researchers have studied and developed new objective video quality assessment methods. However, these methods do not yet adequately evaluate the temporal characteristics of video, which distinguish it from other multimedia. This paper proposes a method that applies weights to SSIM (Structural SIMilarity) according to cognitive-psychological features. We show that objective video quality assessment using SSIM weighted by the proposed method outperforms the original SSIM.
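One way to read the weighting idea is as a weighted average of per-frame SSIM scores. The sketch below assumes the per-frame scores and motion-derived weights are already computed; how the paper derives its weights from motion information is not specified here, so `frame_weights` is a hypothetical input.

```python
def weighted_quality(frame_scores, frame_weights):
    """Pool per-frame quality scores using per-frame importance weights."""
    total_weight = sum(frame_weights)
    if total_weight == 0:
        raise ValueError("frame_weights must not sum to zero")
    weighted_sum = sum(score * weight
                       for score, weight in zip(frame_scores, frame_weights))
    return weighted_sum / total_weight
```

With equal weights this reduces to the plain mean of the per-frame scores, which corresponds to unweighted SSIM pooling.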

Channel Expansion Technology in MPEG Audio (MPEG 오디오의 채널 확장 기술)

Pang, Hee-Suk
- Journal of Broadcast Engineering / v.16 no.5 / pp.714-721 / 2011
MPEG audio uses the masking effect, high-frequency component synthesis based on spectral band replication, and channel expansion based on parametric stereo for efficient compression of audio signals. In this paper, we present an overview of the state-of-the-art channel expansion technology in MPEG audio. We also present technical overviews and examples of application to broadcasting services for HE-AAC v.2, MPEG Surround, spatial audio object coding (SAOC), and unified speech and audio coding (USAC), which are MPEG audio codecs based on the channel expansion technology.
https://doi.org/10.5909/JEB.2011.16.5.714
