• Title/Summary/Keyword: Image Coding

Search Result 1,161, Processing Time 0.028 seconds

Uni-directional 8X8 Intra Prediction for H.264 Coding Efficiency (H.264에서 성능향상을 위한 Uni-directional 8X8 인트라 예측)

  • Kook, Seung-Ryong;Park, Gwang-Hoon;Lee, Yoon-Jin;Sim, Dong-Gyu;Jung, Kwang-Soo;Choi, Hae-Chul;Choi, Jin-Soo;Lim, Sung-Chang
    • Journal of Broadcast Engineering
    • /
    • v.14 no.5
    • /
    • pp.589-600
    • /
    • 2009
  • This paper is ready to change a trend of a ultra high definition (UHD) video image, and it will contribute to improve the performance of the latest H.264 through the Uni-directional $8{\times}8$ intra-prediction idea which is based on developing a intra prediction compression. The Uni-directional $8{\times}8$ intra prediction is focused on a $8{\times}8$ block intra prediction using $4{\times}4$ block based prediction which is using the same direction of intra prediction. This paper describes that the uni-directional $8{\times}8$ intra-prediction gets a improvement around 7.3% BDBR only in the $8{\times}8$ block size, and it gets a improvement around 1.3% BDBR in the H.264 applied to the multi block size structures. In the case of a larger image size, it can be changed to a good algorithm. Because the video codec which is optimized for UHD resolution can be used a different block size which is bigger than before(currently a minimum of $4{\times}4$ blocks of units).

Image Analysis Using Digital Radiographic Lumbar Spine of Patients with Osteoporosis (골다공증 환자의 Digital 방사선 요추 Image를 이용한 영상분석)

  • Park, Hyong-Hu;Lee, Jin-Soo
    • The Journal of the Korea Contents Association
    • /
    • v.14 no.11
    • /
    • pp.362-369
    • /
    • 2014
  • This study aimed to propose an accurate diagnostic method for osteoporosis by realizing a computer-aided diagnosis system with the application of the statistical analysis of texture features using digital images of lateral lumbar spine of patients with osteoporosis and providing reliable supplementary diagnostic information by model experimental research for early diagnosis of diseases. For these purposes, digital images of lateral lumbar spine of normal individuals and patients with osteoporosis were used in the experiments, and the values of statistical texture features on the set ROI were expressed in six parameters. Among the texture feature values of the six parameters of osteoporosis, the highest and lowest recognition rates of 95 and 80% were shown in average gray level and uniformity, respectively. Moreover, all the six parameters showed recognition rates of over 80% for osteoporosis: 82.5% in average contrast, 90% in smoothness, 87.5% in skewness, and 87.5% in entropy. Therefore, if a program developing into a computer-aided diagnosis system for medical images is coded based on the results of this study, it is considered possible to be applied to preliminary diagnostic data for automatic detection of lesions and disease diagnosis using medical images, to provide information for definite diagnosis of diseases, to diagnose by limited device, and to be used to shorten the time to analyze medical images.

Two Flow Control Techniques for Teleconferencing over the Internet (인터넷상에서 원격회의를 위한 두 가지 흐름 제어 기법)

  • Na, Seung-Gu;Go, Min-Su;An, Jong-Seok
    • Journal of KIISE:Computer Systems and Theory
    • /
    • v.26 no.8
    • /
    • pp.975-983
    • /
    • 1999
  • 최근 네트워크의 속도가 빨라지고 멀티미디어 데이터를 다루기 위한 기술들이 개발됨에 따라 많은 멀티미디어 응용 프로그램들이 인터넷에 등장하고 있다. 그러나 이들 응용프로그램들은 수신자에게 전송되는 영상.음성의 품질이 낮기 때문에 기대만큼 빠르게 확산되지 못하고 있다. 영상.음성의 품질이 낮은 이유는 현재 인터넷이 실시간 응용프로그램이 요구하는 만큼 빠르고 신뢰성 있게 데이터를 전송할 수 없기 때문이다. 현재 인터넷의 내부구조를 바꾸지 않고 품질을 높이기 위해 많은 연구들이 진행되고 있는데 그 중 하나는 동적으로 변화하는 인터넷의 상태에 맞게 멀티캐스트 트래픽의 전송율을 조절하는 종단간의 흐름제어이다. 본 논문은 기존의 흐름제어 기법인 IVS와 RLM의 성능을 개선시키기 위한 두 가지 흐름제어 기법을 소개한다. IVS는 송신자가 주기적으로 측정된 네트워크 상태에 따라 전송율을 일정하게 조절한다. 송신자가 하나의 데이타 스트림을 생성하는 IVS와는 달리 RLM에서는 송신자가 계층적 코딩에 의하여 생성된 여러개의 데이타 스트림을 전송하고 각 수신자는 자신의 네트워크 상태에 맞게 데이타 스트림을 선택하는 기법이다. 그러나 IVS는 송신자가 전송율을 일정하게 증가시키고, RLM은 각자의 네트워크 상태를 고려하지 않고 임의의 시간에 하나 이상의 데이타 스트림을 받기 때문에 성능을 저하시킬 수 있다. 본 논문에서는 TCP-like IVS와 Adaptive RLM이라는 두 가지 새로운 기법을 소개한다. TCP-like IVS는 송신자가 전송율을 동적으로 결정하고, Adaptive RLM은 하나 이상의 데이타 스트림을 받기 위해 적당한 시간을 선택할 수 있다. 본 논문에서는 시뮬레이션을 통해 여러 가지 네트워크 구조에서 두 가지 방식이 기존의 방식에 비하여 더욱 높은 대역폭 이용율과 10~20% 정도 적은 패킷손실율을 이룬다는 것을 보여준다.Abstract Nowadays, many multimedia applications for the Internet are introduced as the network gets faster and many techniques manipulating multimedia data are developed. These multimedia applications, however, do not spread widely and are not fast as expected at their introduction time due to the poor quality of image and voice delivered at receivers. The poor quality is mainly attributed to that the current Internet can not carry data as fast and reliably as the real-time applications require. To improve the quality without modifying the internal structure of the current Internet, many researches are conducted. One of them is an end-to-end flow control of multicast traffic adapting the sending rate to the dynamically varying Internet state. This paper proposes two flow-control techniques which can improve the performance of the two conventional techniques; IVS and RLM. IVS statically adjusts the sending rate based on the network state periodically estimated. Differently from IVS in which a sender produces one single data stream, in RLM a sender transmits several data streams generated by the layered coding scheme and each receiver selects some data streams based on its own network state. The more data streams a receiver receives, the better quality of image or voice the receiver can produce. The two techniques, however, can degrade the performance since IVS increases its sending rate statically and RLM accepts one more data stream at arbitrary time regardless of the network state respectively. We introduce two new techniques called TCP-like IVS and Adaptive RLM; TCP-like IVS can determine the sending rate dynamically and Adaptive RLM can select the right time to add one more data stream. Our simulation experiments show that two techniques can achieve better utilization and less packet loss by 10-20% over various network topologies.

Automatic Measurement Method of Traffic Signs Using Image Recognition and Photogrammetry Technology (영상인식과 사진측량 기술을 이용한 교통표지 자동측정 방법)

  • Chang, Sang Kyu;Kim, Jin Soo
    • Journal of Korean Society for Geospatial Information Science
    • /
    • v.21 no.3
    • /
    • pp.19-25
    • /
    • 2013
  • Recently, more accurate database information of facilities is being required, with the increase in importance of urban road facility management. Therefore, this study proposed how to automatically detect particular traffic signs necessary for efficient construction of road facility DB. For this study, central locations of facilities were searched, after recognition and automatic detection of particular traffic signs through an image. Then, coordinate values of traffic signs calculated in the study were compared with real coordinate values, in order to evaluate the accuracy of traffic sign locations which were finally detected. Computer vision technology was used in recognizing and detecting traffic signs through OPEN CV-based coding, and photogrammetry was used in calculating accurate locations of detected traffic signs. For the experiment, circular road signal(No Parking) and triangular road signal(Crosswalk) were chosen out of various kinds of road signals. The research result showed that the circular road signal had a nearly 50cm error value, and the triangular road signal had a nearly 60cm error value, when comparing the calculated coordinates with the real coordinates. Though this result is not satisfactory, it is considered that there would be no problem to find locations of traffic signs.

Group-based Adaptive Rendering for 6DoF Immersive Video Streaming (6DoF 몰입형 비디오 스트리밍을 위한 그룹 분할 기반 적응적 렌더링 기법)

  • Lee, Soonbin;Jeong, Jong-Beom;Ryu, Eun-Seok
    • Journal of Broadcast Engineering
    • /
    • v.27 no.2
    • /
    • pp.216-227
    • /
    • 2022
  • The MPEG-I (Immersive) group is working on a standardization project for immersive video that provides 6 degrees of freedom (6DoF). The MPEG Immersion Video (MIV) standard technology is intended to provide limited 6DoF based on depth map-based image rendering (DIBR) technique. Many efficient coding methods have been suggested for MIV, but efficient transmission strategies have received little attention in MPEG-I. This paper proposes group-based adaptive rendering method for immersive video streaming. Each group can be transmitted independently using group-based encoding, enabling adaptive transmission depending on the user's viewport. In the rendering process, the proposed method derives weights of group for view synthesis and allocate high quality bitstream according to a given viewport. The proposed method is implemented through the Test Model for Immersive Video (TMIV) test model. The proposed method demonstrates 17.0% Bjontegaard-delta rate (BD-rate) savings on the peak signalto-noise ratio (PSNR) and 14.6% on the Immersive Video PSNR(IV-PSNR) in terms of various end-to-end evaluation metrics in the experiment.

R-lambda Model based Rate Control for GOP Parallel Coding in A Real-Time HEVC Software Encoder (HEVC 실시간 소프트웨어 인코더에서 GOP 병렬 부호화를 지원하는 R-lambda 모델 기반의 율 제어 방법)

  • Kim, Dae-Eun;Chang, Yongjun;Kim, Munchurl;Lim, Woong;Kim, Hui Yong;Seok, Jin Wook
    • Journal of Broadcast Engineering
    • /
    • v.22 no.2
    • /
    • pp.193-206
    • /
    • 2017
  • In this paper, we propose a rate control method based on the $R-{\lambda}$ model that supports a parallel encoding structure in GOP levels or IDR period levels for 4K UHD input video in real-time. For this, a slice-level bit allocation method is proposed for parallel encoding instead of sequential encoding. When a rate control algorithm is applied in the GOP level or IDR period level parallelism, the information of how many bits are consumed cannot be shared among the frames belonging to a same frame level except the lowest frame level of the hierarchical B structure. Therefore, it is impossible to manage the bit budget with the existing bit allocation method. In order to solve this problem, we improve the bit allocation procedure of the conventional ones that allocate target bits sequentially according to the encoding order. That is, the proposed bit allocation strategy is to assign the target bits in GOPs first, then to distribute the assigned target bits from the lowest depth level to the highest depth level of the HEVC hierarchical B structure within each GOP. In addition, we proposed a processing method that is used to improve subjective image qualities by allocating the bits according to the coding complexities of the frames. Experimental results show that the proposed bit allocation method works well for frame-level parallel HEVC software encoders and it is confirmed that the performance of our rate controller can be improved with a more elaborate bit allocation strategy by using the preprocessing results.

Block-based Adaptive Bit Allocation for Reference Memory Reduction (효율적인 참조 메모리 사용을 위한 블록기반 적응적 비트할당 알고리즘)

  • Park, Sea-Nae;Nam, Jung-Hak;Sim, Dong-Gy;Joo, Young-Hun;Kim, Yong-Serk;Kim, Hyun-Mun
    • Journal of the Institute of Electronics Engineers of Korea SP
    • /
    • v.46 no.3
    • /
    • pp.68-74
    • /
    • 2009
  • In this paper, we propose an effective memory reduction algorithm to reduce the amount of reference frame buffer and memory bandwidth in video encoder and decoder. In general video codecs, decoded previous frames should be stored and referred to reduce temporal redundancy. Recently, reference frames are recompressed for memory efficiency and bandwidth reduction between a main processor and external memory. However, these algorithms could hurt coding efficiency. Several algorithms have been proposed to reduce the amount of reference memory with minimum quality degradation. They still suffer from quality degradation with fixed-bit allocation. In this paper, we propose an adaptive block-based min-max quantization that considers local characteristics of image. In the proposed algorithm, basic process unit is $8{\times}8$ for memory alignment and apply an adaptive quantization to each $4{\times}4$ block for minimizing quality degradation. We found that the proposed algorithm can obtain around 1.7% BD-bitrate gain and 0.03dB BD-PSNR gain, compared with the conventional fixed-bit min-max algorithm with 37.5% memory saving.

Study for applying the augmented reality onto postage stamps (우표의 증강현실 적용에 관한 연구)

  • Lee, Ki Ho
    • Cartoon and Animation Studies
    • /
    • s.33
    • /
    • pp.503-529
    • /
    • 2013
  • The commemorative AR postage stamps which are the world first presented at The YEOSU EXPO 2012 has had meaning of communicating with future in this present from a convergence that the most analog medium is using now and that the AR is cutting edge of digital technology. The AR stamps printed 10 kind out of 33 commemorative stamps. These have great significance that is artistic value than that is world first. The applied AR images are not only expressed 3D real images but also artic represented and signifying each stamp images from visualized creativity process, and build 'new art space' that is new concept between on real(analog) and virtual(digital). This study analyzes meaning of images and then makes concept of AR contents design. The processing is designed and considered the meaning of architectures and environments, and the regional specific feature of the Yeosu with surrealistic graphic concept. The 10 of deducted images were expressed after AR coding such as visual arts. This study realized markerless 3D image tracking AR stamps and deducted research result are; the first, it was able to figure out how to realize AR in the process of registering the reference images, coordinating transformation, and hybriding AR on the stamps for the mobile devices. The second, it was able to be seeked a possibility of new virtual exhibition space. The third, it was able to know possibility of satisfaction of immersing with visual formativeness and usability with informativity.

The First Quantization Parameter Decision Algorithm for the H.264/AVC Encoder (H.264/AVC를 위한 초기 Quantization Parameter 결정 알고리즘)

  • Kwon, Soon-Young;Lee, Sang-Heon;Lee, Dong-Ha
    • Journal of KIISE:Information Networking
    • /
    • v.35 no.3
    • /
    • pp.235-242
    • /
    • 2008
  • To improve video quality and coding efficiency, H.264/AVC adopted an adaptive rate control. But this method has a problem as it cannot predict an accurate quantization parameter(QP) for the first frame. The first QP is decided among four constant values by using encoder input parameters. It does not consider encoding bits, results in significant fluctuation of the image quality and decreases the average quality of the whole coded sequence. In this paper, we propose a new algorithm for the first frame QP decision in the H.264/AVC encoder. The QP is decided by the existing algorithm and the first frame is encoded. According to the encoded bits, the new initial QP is decided. We can predict optimal value because there is a linear relationship between encoded bits and the new initial QP. Next, we re-encode the first frame using the new initial QP. Experimental results show that the proposed algorithm not only achieves better quality than the state of the art algorithm, but also adopts a rate control forthe sequence that was impossible with the existing algorithm. By reducing fluctuation, subjective quality also improved.

3D Visual Attention Model and its Application to No-reference Stereoscopic Video Quality Assessment (3차원 시각 주의 모델과 이를 이용한 무참조 스테레오스코픽 비디오 화질 측정 방법)

  • Kim, Donghyun;Sohn, Kwanghoon
    • Journal of the Institute of Electronics and Information Engineers
    • /
    • v.51 no.4
    • /
    • pp.110-122
    • /
    • 2014
  • As multimedia technologies develop, three-dimensional (3D) technologies are attracting increasing attention from researchers. In particular, video quality assessment (VQA) has become a critical issue in stereoscopic image/video processing applications. Furthermore, a human visual system (HVS) could play an important role in the measurement of stereoscopic video quality, yet existing VQA methods have done little to develop a HVS for stereoscopic video. We seek to amend this by proposing a 3D visual attention (3DVA) model which simulates the HVS for stereoscopic video by combining multiple perceptual stimuli such as depth, motion, color, intensity, and orientation contrast. We utilize this 3DVA model for pooling on significant regions of very poor video quality, and we propose no-reference (NR) stereoscopic VQA (SVQA) method. We validated the proposed SVQA method using subjective test scores from our results and those reported by others. Our approach yields high correlation with the measured mean opinion score (MOS) as well as consistent performance in asymmetric coding conditions. Additionally, the 3DVA model is used to extract information for the region-of-interest (ROI). Subjective evaluations of the extracted ROI indicate that the 3DVA-based ROI extraction outperforms the other compared extraction methods using spatial or/and temporal terms.