• Title/Summary/Keyword: performance video

Search Result 2,476, Processing Time 0.032 seconds

3차원 비디오 부호화 기술

  • Ho, Yo-Seong;O, Gwan-Jeong
    • Information and Communications Magazine
    • /
    • v.27 no.3
    • /
    • pp.29-35
    • /
    • 2010
  • 디지털 영상 기술의 발전과 함께 최근 3차원 영상 기술에 대한 관심이 높아지고 있다. 3차원 비디오는 고성능 비디오(high-performance video)와 함께 차세대 영상 기술로 각광받고 있다. 3차원 비디오는 사용자에게 자유로운 임의의 시점에서 입체감 있는 영상을 제공할 수 있다. 이 논문은 멀티미디어 전송에 관한 국제 표준화 기구인 MPEG의 다시점 비디오 부호화(multiview video coding, MVC)그룹과 3차원 비디오 부호화(3D video coding, 3DVC)그룹에서 다뤄진 3차원 비디오 부호화 기술을 소개한다.

A Design of Discrete Wavelet Transform Encoder for Multimedia Image Signal Processing (멀티미디어 영상신호 처리를 위한 DWT 부호화기 설계)

  • 이강현
    • Proceedings of the IEEK Conference
    • /
    • 2003.07d
    • /
    • pp.1685-1688
    • /
    • 2003
  • The modem multimedia applications which are video Processor, video conference or video phone and so forth require real time processing. Because of a large amount of image data, those require high compression performance. In this paper, the proposed image processing encoder was designed by using wavelet transform encoding. The proposed filter block can process image data on tile high speed because of composing individual function blocks by parallel and compute both highpass and lowpass coefficient in the same clock cycle. When image data is decomposed into multiresolution, the proposed scheme needs external memory and controller to save intermediate results and it can operate within 33㎒.

  • PDF

Variable-bit-rate compressed video storage and placement scheme for arbitrary-speed retrievals (임의 속도 탐색을 위한 가변 비트율 압축 비디오 데이타의 저장 및 배치기법)

  • 권택근;이석호;최양희
    • Journal of the Korean Institute of Telematics and Electronics B
    • /
    • v.33B no.8
    • /
    • pp.15-21
    • /
    • 1996
  • This paper describes data placement schemes that provide uniform and balanced to multiple disks load for retrievals of VBR (variable bit rate) video at varying retrieval speeds. To support maximum concurent users at arbitrary-speed playbacks in a disk-arry based system, the hot spot disks should be carefully avoided. In this paper, we extend the proposed scheme, prime round-robin(PRR), for VBR video. In addition, we have compared the performance of PRR and PRR (PRR extension).

  • PDF

A Method of Intra Mode Coding for Joint Exploration Model (JEM) (차세대 비디오 부호화 실험모델(JEM)의 화면내 예측 모드 부호화 기법)

  • Park, Dohyeon;Lee, Jinho;Kang, Jung Won;Kim, Jae-Gon
    • Journal of Broadcast Engineering
    • /
    • v.23 no.4
    • /
    • pp.495-502
    • /
    • 2018
  • JVET (Joint Video Exploration Team) which explored evolving technologies of video coding with capabilities beyond HEVC (High Efficiency Video Coding), released a references software codec named the Joint Exploration Model (JEM) for performance verification of coding technologies. JEM has 67 intra prediction modes that extend the 35 modes of HEVC for intra prediction. Therefore, the enhancement of the coding performance is limited due to the overhead of prediction mode coding. In this paper, we analyze the probabilities of prediction modes selections, and then we propose a more efficient intra prediction mode coding based on the results of analyzed mode occurrence. In addition, we propose a context modeling for CABAC (Context-Adaptive Binary Arithmetic Coding) of the proposed mode coding. Experimental results show that the BD-rate gain is 0.02% on the AI (All Intra) coding structure compared to JEM 7.0. We need to optimize context modeling for additional coding performance enhancement.

Structure of a Storage System Considering Disk Performance and Placement Policy Considering Video Data Characteristics in VOD Storage Servers (주문형 비디오 저장 서버에서 디스크 성능을 고려한 저장 시스템의 구조와 비디오 데이터의 특성에 따른 배치정책)

  • An, Yu-Jeong;Won, Yu-Heon
    • Journal of KIISE:Computer Systems and Theory
    • /
    • v.26 no.11
    • /
    • pp.1296-1304
    • /
    • 1999
  • 본 논문은 다수의 고객들에게 실시간 재생 서비스를 제공하는 주문형 비디오 서버에서 데이타 검색의 효율을 높이기 위한 저장 시스템의 구조와 그에 따른 배치 정책을 제안한다. 주문형 비디오 저장 서버에서 동시에 보다 많은 고객들에게 그들이 원하는 서비스를 제공하기 위해서는 여러 분야에서의 다양한 정책들이 고려될 수 있으나, 특히 저장 매체들을 어떤 구조로 구성하고 여기에 비디오 객체들을 어떻게 배치할 것인가는 검색 효율과 직접 관계되는 중요한 문제이다. 본 논문에서는 디스크 배열 형태로 구성된 저장 시스템을 디스크 성능을 고려하여 재구성하고, 비디오 객체들을 저장할 때 저장하고자 하는 데이타의 특성들과 저장 구조를 함께 고려하여 검색 효율을 극대화할 수 있는 배치 방법을 제안한다. 마지막으로 제안된 정책의 검색 효율을 검증하기 위해 다양한 실험을 통하여 기존의 배치 정책들과 비교하고 성능을 평가한다.Abstract In this paper, we propose the structure of storage system and a placement policy to provide many clients with real-time playback services efficiently in VOD(video-on-demand) server. Though policies in various areas being considered to provide more clients with services of degree requested by them simultaneously in VOD storage server, it is important how to construct storage media and to place video objects on it for retrieval efficiency. In this paper, we reorganize a large disk array with disks performance and place video objects using the placement policy considering both characteristics of video data and the structure of storage system for maximizing retrieval efficiency. Lastly, we simulate the proposed policy and conventional policies through various environments, compare our policy with others n evaluate the performance of our policy.

Modified Skyscraper Broadcasting Schemes for Periodic Broadcasting with VBR Video (VBR 리디오의 주기적 브로드캐스팅을 위한 수정 Skyscraper 브로드캐스팅 기법)

  • 이재동
    • Journal of Korea Multimedia Society
    • /
    • v.5 no.5
    • /
    • pp.571-581
    • /
    • 2002
  • Many periodic broadcasting schemes for near VoD systems are proposed. Recently non-uniform segmentation schemes have been used to develop periodic broadcasting techniques for near VoD. These techniques give significant reductions in start-up latency as compared with more conventional uniform segmentation. However, all of these schemes assume that the videos are CBR-encoded. Since a CBR-encoded video has a target average tate than an VBR encoding, there is potential to obtain further Performance Improvements by using VBR videos. Unfortunately, however, the studies concerning broadcasting with VBR video ate rare and the existing techniques have the problem of virtual loss. In this paper, we modify Skyscraper Broadcasting Scheme for broadcasting with VBR videos which is a representative non-uniform segmentation scheme lot CBR videos. A VBR video can be transmitted at constant bit rate (CBR) by using prefetching. With this idea we propose Modified Skyscraper Broadcasting Schemes for VBR videos and make performance evaluation by simulation. We show that our schemes have a better performance than Skyscraper Broadcasting Scheme for CBR videos.

  • PDF

A Study on Fingerprinting Robustness Indicators for Immersive 360-degree Video (실감형 360도 영상 특징점 기술 강인성 지표에 관한 연구)

  • Kim, Youngmo;Park, Byeongchan;Jang, Seyoung;Yoo, Injae;Lee, Jaechung;Kim, Seok-Yoon
    • Journal of IKEEE
    • /
    • v.24 no.3
    • /
    • pp.743-753
    • /
    • 2020
  • In this paper, we propose a set of robustness indicators for immersive 360-degree video. With the full-fledged service of mobile carriers' 5G networks, it is possible to use large-capacity, immersive 360-degree videos at high speed anytime, anywhere. Since it can be illegally distributed in web-hard and torrents through DRM dismantling and various video modifications, however, evaluation indicators that can objectively evaluate the filtering performance for copyright protection are required. In this paper, a robustness indicators is proposed that applies the existing 2D Video robustness indicators and considers the projection method and reproduction method, which are the characteristics of Immersive 360-degree Video. The performance evaluation experiment has been carried out for a sample filtering system and it is verified that an excellent recognition rate of 95% or more has been achieved in about 3 second execution time.

Fine-tuning Neural Network for Improving Video Classification Performance Using Vision Transformer (Vision Transformer를 활용한 비디오 분류 성능 향상을 위한 Fine-tuning 신경망)

  • Kwang-Yeob Lee;Ji-Won Lee;Tae-Ryong Park
    • Journal of IKEEE
    • /
    • v.27 no.3
    • /
    • pp.313-318
    • /
    • 2023
  • This paper proposes a neural network applying fine-tuning as a way to improve the performance of Video Classification based on Vision Transformer. Recently, the need for real-time video image analysis based on deep learning has emerged. Due to the characteristics of the existing CNN model used in Image Classification, it is difficult to analyze the association of consecutive frames. We want to find and solve the optimal model by comparing and analyzing the Vision Transformer and Non-local neural network models with the Attention mechanism. In addition, we propose an optimal fine-tuning neural network model by applying various methods of fine-tuning as a transfer learning method. The experiment trained the model with the UCF101 dataset and then verified the performance of the model by applying a transfer learning method to the UTA-RLDD dataset.

Video Analysis System for Action and Emotion Detection by Object with Hierarchical Clustering based Re-ID (계층적 군집화 기반 Re-ID를 활용한 객체별 행동 및 표정 검출용 영상 분석 시스템)

  • Lee, Sang-Hyun;Yang, Seong-Hun;Oh, Seung-Jin;Kang, Jinbeom
    • Journal of Intelligence and Information Systems
    • /
    • v.28 no.1
    • /
    • pp.89-106
    • /
    • 2022
  • Recently, the amount of video data collected from smartphones, CCTVs, black boxes, and high-definition cameras has increased rapidly. According to the increasing video data, the requirements for analysis and utilization are increasing. Due to the lack of skilled manpower to analyze videos in many industries, machine learning and artificial intelligence are actively used to assist manpower. In this situation, the demand for various computer vision technologies such as object detection and tracking, action detection, emotion detection, and Re-ID also increased rapidly. However, the object detection and tracking technology has many difficulties that degrade performance, such as re-appearance after the object's departure from the video recording location, and occlusion. Accordingly, action and emotion detection models based on object detection and tracking models also have difficulties in extracting data for each object. In addition, deep learning architectures consist of various models suffer from performance degradation due to bottlenects and lack of optimization. In this study, we propose an video analysis system consists of YOLOv5 based DeepSORT object tracking model, SlowFast based action recognition model, Torchreid based Re-ID model, and AWS Rekognition which is emotion recognition service. Proposed model uses single-linkage hierarchical clustering based Re-ID and some processing method which maximize hardware throughput. It has higher accuracy than the performance of the re-identification model using simple metrics, near real-time processing performance, and prevents tracking failure due to object departure and re-emergence, occlusion, etc. By continuously linking the action and facial emotion detection results of each object to the same object, it is possible to efficiently analyze videos. The re-identification model extracts a feature vector from the bounding box of object image detected by the object tracking model for each frame, and applies the single-linkage hierarchical clustering from the past frame using the extracted feature vectors to identify the same object that failed to track. Through the above process, it is possible to re-track the same object that has failed to tracking in the case of re-appearance or occlusion after leaving the video location. As a result, action and facial emotion detection results of the newly recognized object due to the tracking fails can be linked to those of the object that appeared in the past. On the other hand, as a way to improve processing performance, we introduce Bounding Box Queue by Object and Feature Queue method that can reduce RAM memory requirements while maximizing GPU memory throughput. Also we introduce the IoF(Intersection over Face) algorithm that allows facial emotion recognized through AWS Rekognition to be linked with object tracking information. The academic significance of this study is that the two-stage re-identification model can have real-time performance even in a high-cost environment that performs action and facial emotion detection according to processing techniques without reducing the accuracy by using simple metrics to achieve real-time performance. The practical implication of this study is that in various industrial fields that require action and facial emotion detection but have many difficulties due to the fails in object tracking can analyze videos effectively through proposed model. Proposed model which has high accuracy of retrace and processing performance can be used in various fields such as intelligent monitoring, observation services and behavioral or psychological analysis services where the integration of tracking information and extracted metadata creates greate industrial and business value. In the future, in order to measure the object tracking performance more precisely, there is a need to conduct an experiment using the MOT Challenge dataset, which is data used by many international conferences. We will investigate the problem that the IoF algorithm cannot solve to develop an additional complementary algorithm. In addition, we plan to conduct additional research to apply this model to various fields' dataset related to intelligent video analysis.

The Effects of Web-based Video Program Selection Attributes of Confidence in Nursing Performance-Mediating Effects of Learning Flow (웹 기반 동영상 프로그램 선택속성이 수술실 신규간호사의 간호 수행능력에 미치는 영향 -학습몰입의 매개효과 중심으로-)

  • Park, Jung-Hae
    • Journal of the Korea Academia-Industrial cooperation Society
    • /
    • v.22 no.1
    • /
    • pp.485-494
    • /
    • 2021
  • This study aimed to examine the effect of selected factors on confidence in nursing performance among new operating room nurses, with a focus on the mediating effect of learning flow. Data was collected from July to August 2019 using structured questionnaires. The participants were 250 new operating room nurses from university hospitals located in Incheon and Gyeonggi. The collected data was analyzed using the IBM SPSS Statistics 25.0 and AMOS 24.0. Structural equation modeling was performed to examine the effect of the selected factors namely web-based video programs on confidence in nursing performance, and the mediating effect of learning flow. Results: The factors of 'hygiene & safety' and 'patient care' from the web-based video programs had a positive effect on the learning flow. The higher the learning flow of the nurses, the greater their confidence in nursing performance. Just the 'hygiene & safety' factor significantly increased nursing performance. Conclusion: The study results suggest that new nurses perceive that 'hygiene & safety' is an important factor in building their confidence in nursing performance, even without the learning flow. Therefore, it is necessary to develop web-based video programs based on nurses' needs and to emphasize the importance of postoperative care.