• Title/Abstract/Keywords: hierarchical B layer

26 search results (processing time: 0.024 s)

Multi-resolution hierarchical motion estimation in the wavelet transform domain

  • 김진태;장준필;김동욱;최종수
    • 전자공학회논문지B / Vol. 33B, No. 8 / pp. 50-59 / 1996
  • In this paper, a new hierarchical motion estimation scheme using wavelet-transformed multi-resolution image layers is proposed. Compared with full-search motion estimation, existing hierarchical methods greatly reduce the amount of computation, but their efficiency is degraded by the local minima problem. To solve this problem, the multi-resolution image layers are constructed using the wavelet transform, and the number of layers that participate in motion estimation for a block is determined from its low-band and higher-band energies in the first wavelet-transformed layer. The ratio of higher-band energy to low-band energy is evaluated for each block; for blocks with relatively large higher-band energy, motion estimation is carried out in the high-resolution layer only. Otherwise, all layers are used. The final motion vectors are obtained in the first wavelet-transformed layer, so fewer bits are transmitted for motion vectors, and reconstructing the received image with the inverse wavelet transform reduces the blocking effect.
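
The layer-selection rule in this abstract can be illustrated with a minimal sketch. It assumes a one-level orthonormal Haar transform in place of the paper's (unspecified) wavelet filter, and the names `haar_band_energies`, `layers_for_block`, `num_layers`, and `ratio_threshold` are hypothetical; the threshold value is a placeholder, not a figure from the paper.

```python
def haar_band_energies(block):
    """Return (low_band_energy, higher_band_energy) of a 2-D block.

    Uses a one-level orthonormal 2x2 Haar transform (an assumption; the
    abstract does not name the wavelet filter). Block sides must be even.
    """
    rows, cols = len(block), len(block[0])
    if rows % 2 or cols % 2:
        raise ValueError("block dimensions must be even")
    low = high = 0.0
    for r in range(0, rows, 2):
        for c in range(0, cols, 2):
            a, b = block[r][c], block[r][c + 1]
            p, q = block[r + 1][c], block[r + 1][c + 1]
            ll = (a + b + p + q) / 2  # low band
            lh = (a - b + p - q) / 2  # horizontal detail
            hl = (a + b - p - q) / 2  # vertical detail
            hh = (a - b - p + q) / 2  # diagonal detail
            low += ll * ll
            high += lh * lh + hl * hl + hh * hh
    return low, high


def layers_for_block(block, num_layers=3, ratio_threshold=0.1):
    """Pick the layers used for motion estimation of one block.

    Layer 1 is taken to be the first wavelet-transformed (highest-resolution)
    layer. ratio_threshold is a hypothetical tuning value.
    """
    low, high = haar_band_energies(block)
    if low > 0:
        ratio = high / low
    else:
        ratio = float("inf") if high > 0 else 0.0
    if ratio > ratio_threshold:
        return [1]  # detail-rich block: search the high-resolution layer only
    return list(range(num_layers, 0, -1))  # coarse-to-fine over all layers


flat = [[10] * 4 for _ in range(4)]
checker = [[10 * ((r + c) % 2) for c in range(4)] for r in range(4)]
print(layers_for_block(flat))     # all layers, coarse to fine
print(layers_for_block(checker))  # high-resolution layer only
```

A flat block has no higher-band energy, so every layer takes part; a checkerboard block concentrates its energy in the detail bands, so only layer 1 is searched.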

Hangul Recognition Using a Hierarchical Neural Network

  • 최동혁;류성원;강현철;박규태
    • 전자공학회논문지B / Vol. 28B, No. 11 / pp. 852-858 / 1991
  • An adaptive hierarchical classifier (AHCL) for Korean character recognition using a neural net is designed. This classifier has two neural nets: USACL (Unsupervised Adaptive Classifier) and SACL (Supervised Adaptive Classifier). USACL has an input layer and an output layer, which are fully connected. The nodes in the output layer are generated by the unsupervised, nearest-neighbor learning rule during learning. SACL has an input layer, a hidden layer, and an output layer. The input layer and the hidden layer are fully connected, and the hidden layer and the output layer are partially connected. The nodes in SACL are generated by the supervised, nearest-neighbor learning rule during learning. USACL has a pre-attentive effect: it performs a partial search instead of a full search during SACL classification to improve processing speed. The input to USACL and SACL is a directional edge feature with a directional receptive field. To test the performance of the AHCL, various multi-font printed Hangul characters are used in learning and testing, and its processing speed and classification rate are compared with those of the conventional LVQ (Learning Vector Quantizer), which uses the nearest-neighbor learning rule.

Temporal Prediction Structure for Multi-view Video Coding

  • 윤효순;김미영
    • 한국멀티미디어학회논문지 / Vol. 15, No. 9 / pp. 1093-1101 / 2012
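  • Multi-view video represents 3D information: it is footage of a single 3D scene captured from several viewpoints by multiple cameras. Multi-view video coding exploits the temporal and inter-view correlations among pictures, but because the amount of data grows in proportion to the number of cameras, coding techniques that reduce the computational load are needed. This paper proposes an efficient prediction structure to improve multi-view video coding performance. To raise coding efficiency, the proposed prediction structure takes into account the average distance between the current picture being coded and the reference pictures it refers to, the maximum B-layer index, and the number of pictures in each Bi layer. Compared with the reference prediction structure, Fraunhofer-HHI's hierarchical B-picture structure, the proposed structure improves image quality by about 0.07 to 0.13 dB and lowers the average bit rate by up to 6.5 kbps.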
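
The three quantities named in this abstract (maximum B-layer index, pictures per Bi layer, average reference distance) can be made concrete for the ordinary dyadic hierarchical B-picture GOP. The sketch below models that baseline layout only, not the structure proposed in the paper; the function names and the power-of-two GOP assumption are illustrative.

```python
def b_layer(poc, gop_size):
    """Temporal layer of a picture in a dyadic hierarchical-B GOP.

    Layer 0 is the key picture; layer i >= 1 is the B_i layer.
    gop_size is assumed to be a power of two.
    """
    if gop_size < 1 or gop_size & (gop_size - 1):
        raise ValueError("gop_size must be a power of two")
    pos = poc % gop_size
    if pos == 0:
        return 0
    layer, step = 1, gop_size // 2
    while pos % step != 0:
        step //= 2
        layer += 1
    return layer


def layer_counts(gop_size):
    """Number of pictures per layer within one GOP (pictures 1..gop_size)."""
    counts = {}
    for poc in range(1, gop_size + 1):
        layer = b_layer(poc, gop_size)
        counts[layer] = counts.get(layer, 0) + 1
    return counts


def average_reference_distance(gop_size):
    """Mean temporal distance from each B picture to its nearest references.

    In the dyadic layout a B_i picture refers to pictures gop_size / 2**i away.
    """
    distances = [gop_size >> b_layer(poc, gop_size) for poc in range(1, gop_size)]
    return sum(distances) / len(distances)


print([b_layer(poc, 8) for poc in range(9)])
print(layer_counts(8))
print(average_reference_distance(8))
```

For a GOP of 8 the maximum B-layer index is 3, with 1, 2, and 4 pictures in layers B1, B2, and B3.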

Improved Prediction Structure and Motion Estimation Method for Multi-view Video Coding

  • 윤효순;김미영
    • 정보과학회 논문지 / Vol. 41, No. 11 / pp. 900-910 / 2014
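  • Multi-view video is footage of a single 3D scene captured from several viewpoints by multiple cameras, and the computational load of multi-view video coding grows in proportion to the number of cameras. This paper proposes a prediction structure and a motion estimation method that reduce the computation of multi-view video coding while maintaining image quality. The proposed improved prediction structure takes into account the maximum B-layer index and the number of pictures in each Bi layer. The proposed motion estimation method is a hierarchical search consisting of a modified diamond search pattern, a progressive diamond search pattern, and a modified raster search pattern. Compared with the JMVC reference model, which uses Fraunhofer-HHI's hierarchical B-picture structure and the TZ motion estimation method, the proposed prediction structure and motion estimation method give similar image quality and bit rate while reducing the computation of multi-view video coding by 40 to 70%.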

Temporal Prediction Structure and Motion Estimation Method based on the Characteristic of the Motion Vectors

  • 윤효순;김미영
    • 한국멀티미디어학회논문지 / Vol. 18, No. 10 / pp. 1205-1215 / 2015
  • Efficient multi-view coding techniques are needed to reduce the complexity of multi-view video, which increases in proportion to the number of cameras. To reduce the complexity while maintaining image quality and bit rate, a motion estimation method and a temporal prediction structure are proposed in this paper. The proposed motion estimation method exploits the characteristics of the motion vector distribution and the motion direction and motion size of the block to place search points and choose the search pattern adaptively. The proposed prediction structure divides every GOP to decide the maximum index of the hierarchical B layer and the number of pictures in each B layer. Experimental results show that, compared with the hierarchical B-picture prediction structure and the TZ search method used in the JMVC (Joint Multi-view Video Coding) reference model, the proposed temporal prediction structure and motion estimation method reduce complexity by up to 45 to 70% while maintaining similar video quality and bit rates.

A Cross-Layer Unequal Error Protection Scheme for Prioritized H.264 Video using RCPC Codes and Hierarchical QAM

  • Chung, Wei-Ho;Kumar, Sunil;Paluri, Seethal;Nagaraj, Santosh;Annamalai, Annamalai Jr.;Matyjas, John D.
    • Journal of Information Processing Systems / Vol. 9, No. 1 / pp. 53-68 / 2013
  • We investigate the rate-compatible punctured convolutional (RCPC) codes concatenated with hierarchical QAM for designing a cross-layer unequal error protection scheme for H.264 coded sequences. We first divide the H.264 encoded video slices into three priority classes based on their relative importance. We investigate the system constraints and propose an optimization formulation to compute the optimal parameters of the proposed system for the given source significance information. An upper bound to the significance-weighted bit error rate in the proposed system is derived as a function of system parameters, including the code rate and geometry of the constellation. An example is given with design rules for H.264 video communications and 3.5-4 dB PSNR improvement over existing RCPC based techniques for AWGN wireless channels is shown through simulations.

Parallel, self-organizing, hierarchical neural networks for handwritten digit recognition

  • 방극준;조남신;강창언;홍대식
    • 전자공학회논문지B / Vol. 33B, No. 7 / pp. 173-182 / 1996
  • In this paper, we propose parallel, self-organizing, hierarchical neural networks (PSHNN) as a handwritten digit recognition system. This system can absorb the various shape variations of handwritten digits by using a different feature extraction method in each stage neural network (SNN) of the PSHNN, can reduce training time by using a single-layer neural network as the SNN, and can obtain a high correct-recognition rate by using the certainty area in each output node individually. Experiments have been performed with the NIST database, using 21,315 digits (10,625 digits for training and 10,663 digits for testing). The results show that the correct rate is 97.48%, the error rate is 1.72%, and the reject rate is 0.78%.

Improving the Base-Layer BER performance at AT-DMB using a Channel Estimation

  • 방극준
    • 전자공학회논문지 IE / Vol. 49, No. 2 / pp. 46-51 / 2012
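  • In AT-DMB signal transmission, the enhancement layer uses coherent detection after channel equalization, whereas the base layer uses differential modulation and demodulation, as in T-DMB. This paper shows that, in this structure, the channel estimate that is already computed for enhancement-layer reception can also be applied to the base layer to improve its reception performance. The proposed method applies channel equalization to the received signal ahead of the differential demodulator in the AT-DMB receiver, maps each received constellation point to the nearest $\pi/4$-shift DQPSK constellation point, and then applies differential demodulation. Without channel coding, this yields a gain of about 2 dB at a BER of $10^{-4}$ in AWGN.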
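
The receive chain described in this abstract (equalize, snap to the nearest constellation point, then demodulate differentially) can be sketched as follows. This is a simplified illustration, not the AT-DMB receiver: the dibit-to-phase mapping in `INCREMENTS` is an assumed one, snapping uses all eight phases at multiples of π/4 without tracking the even/odd symbol alternation, the channel estimate is taken as perfectly known, and every function name is hypothetical.

```python
import cmath
import math

PHASE_STEP = math.pi / 4
# Phase increment per dibit value 0..3 (assumed mapping, for illustration only).
INCREMENTS = [PHASE_STEP, 3 * PHASE_STEP, -3 * PHASE_STEP, -PHASE_STEP]


def modulate(dibits):
    """pi/4-shift DQPSK: each dibit advances the carrier phase by one increment."""
    phase = 0.0
    symbols = [cmath.exp(1j * phase)]  # reference symbol
    for d in dibits:
        phase += INCREMENTS[d]
        symbols.append(cmath.exp(1j * phase))
    return symbols


def equalize(received, channel):
    """One-tap equalization with a known per-symbol channel estimate."""
    return [r / h for r, h in zip(received, channel)]


def snap_to_constellation(symbol):
    """Move a symbol to the nearest unit-circle point at a multiple of pi/4."""
    k = round(cmath.phase(symbol) / PHASE_STEP)
    return cmath.exp(1j * k * PHASE_STEP)


def differential_demodulate(symbols):
    """Decide each dibit from the phase change between consecutive symbols."""
    dibits = []
    for prev, cur in zip(symbols, symbols[1:]):
        diff = cur * prev.conjugate()
        # Pick the increment whose unit phasor correlates best with the change.
        best = max(range(4), key=lambda i: (diff * cmath.exp(-1j * INCREMENTS[i])).real)
        dibits.append(best)
    return dibits


sent = [0, 1, 2, 3, 0, 2]
tx = modulate(sent)
channel = [0.8 * cmath.exp(0.3j)] * len(tx)           # flat channel, assumed known
rx = [s * h + 0.05 for s, h in zip(tx, channel)]      # small fixed disturbance
snapped = [snap_to_constellation(s) for s in equalize(rx, channel)]
print(differential_demodulate(snapped) == sent)
```

Snapping before the differential step removes the noise on the previous symbol from each phase difference, which is the effect the abstract attributes the gain to.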

A Study on the Subband Coding System Using Motion Compensation Techniques

  • 이기승;박용철;서정태;윤대희
    • 전자공학회논문지B / Vol. 31B, No. 10 / pp. 99-111 / 1994
  • A motion picture compression scheme using subband coding with motion compensation is presented in this paper. A hierarchical subband decomposition is used to split the image signal into 10 subbands with a 3-layer pyramid structure, and motion compensation is used in each band. In this case, however, the amount of motion vector information increases drastically; therefore, initial motion vectors are estimated at the highest pyramid layer and are refined using the reconstructed subband signal in each layer. Simulation results show that the proposed method compares favorably in terms of prediction error energy and side information with methods requiring additional information. Images reconstructed by the proposed method show good quality compared to those reconstructed using blockwise DCT.
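
The count of 10 subbands for a 3-layer pyramid follows from a dyadic decomposition: each level contributes three detail bands, and one low-pass band remains at the top, giving 3 × 3 + 1 = 10. The sketch below enumerates them and shows how a vector estimated at the top layer might seed a finer layer. The band labels and the factor-of-two scaling per level are assumptions typical of dyadic pyramids, not details taken from the paper.

```python
def pyramid_subbands(levels):
    """List the subbands of a dyadic decomposition, coarsest level first."""
    if levels < 1:
        raise ValueError("levels must be at least 1")
    bands = [f"LL{levels}"]  # the single remaining low-pass band
    for level in range(levels, 0, -1):
        bands += [f"{name}{level}" for name in ("LH", "HL", "HH")]
    return bands


def project_vector(vector, from_level, to_level):
    """Scale a motion vector from a coarser level to a finer one.

    Assumes resolution doubles at each step toward level 1, so the vector
    doubles as well; the result is only a starting point for refinement.
    """
    if to_level > from_level:
        raise ValueError("to_level must not be coarser than from_level")
    scale = 2 ** (from_level - to_level)
    return (vector[0] * scale, vector[1] * scale)


print(len(pyramid_subbands(3)))
print(pyramid_subbands(3))
print(project_vector((1, -2), from_level=3, to_level=1))
```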

On the Hybrid Prediction Pyramid Compatible Coding Technique

  • 이준서;이상욱
    • 한국통신학회논문지 / Vol. 21, No. 1 / pp. 33-46 / 1996
  • In this paper, we investigate the compatible coding technique, which has received much interest since the introduction of HDTV. First, we analyze the theoretical transform coding gains of various hierarchical decomposition techniques, namely subband, pyramid, and DCT-based decomposition. It is shown that the spatial-domain techniques provide higher transform coding gains than the DCT-based coding technique. Second, we compare the performance of these spatial-domain techniques in terms of PSNR versus various rate allocations to each layer. Based on these analyses, it is believed that pyramid decomposition is more appropriate for compatible coding. We also propose a hybrid prediction pyramid coding technique that combines the spatio-temporal prediction in MPEG-2 [3] and the adaptive MC (Motion Compensation) [1]. In the proposed coding technique, we also employ an adaptive DCT coefficient scanning technique to exploit the direction information of the 2nd-layer signal. In computer simulations, the proposed hybrid prediction with adaptive scanning improves the PSNR by about 0.46-1.78 dB at a low 1st-layer rate (about 0.1 bpp) over the adaptive MC [1], and by about 0.33-0.63 dB at a high 1st-layer rate (about 0.32-0.43 bpp) over the spatio-temporal prediction [3].
