• Title/Summary/Keyword: Videos

Search Result 1,523, Processing Time 0.026 seconds

A Novel Approach for Key Caption Detection in Golf Videos Using Color Patterns

  • Jung, Cheol-Kon;Kim, Joong-Kyu
    • ETRI Journal
    • /
    • v.30 no.5
    • /
    • pp.750-752
    • /
    • 2008
  • This paper provides a novel method of detecting key captions containing player information in golf videos. We use the color pattern of captions and its repetition property to determine the key captions. The experimental results show that the proposed method achieves a much higher accuracy than existing methods.

  • PDF

Technology Trends on Image/Video Perceptual Quality Assessment (정지영상 및 동영상 인지화질 측정 기술 동향)

  • Lee, D.Y.;Kim, J.H.;Jeong, S.Y.;Cho, S.H.;Kim, H.Y.;Choi, J.S.
    • Electronics and Telecommunications Trends
    • /
    • v.33 no.3
    • /
    • pp.11-21
    • /
    • 2018
  • Assessment technologies regarding the perceptual quality of images and videos have been receiving significant attention, as they serve as essential tools for monitoring and improving the quality of various media services. In this paper, we review the technology trends of recent studies on the perceptual quality assessment of images and videos, and discuss the future direction of this research field.

Robust HDR Video Synthesis Using Illumination Invariant Descriptor (밝기 변화에 강인한 특징 기술자를 이용한 고품질 HDR 동영상 합성)

  • Vo Van, Tu;Lee, Chul
    • Proceedings of the Korean Society of Broadcast Engineers Conference
    • /
    • 2017.06a
    • /
    • pp.83-84
    • /
    • 2017
  • We propose a novel high dynamic range (HDR) video synthesis algorithm from alternatively exposed low dynamic range (LDR) videos. We first estimate correspondences between input fames using an illumination invariant descriptor. Then, we synthesize an HDR frame with the weights computed to maximize detail preservation in the output HDR frame. Experimental results demonstrate that the proposed algorithm provides high-quality HDR videos without noticeable artifacts.

  • PDF

Smart Fire Image Recognition System using Charge-Coupled Device Camera Image (CCD 카메라 영상을 이용한 스마트 화재 영상 인식 시스템)

  • Kim, Jang-Won
    • Fire Science and Engineering
    • /
    • v.27 no.6
    • /
    • pp.77-82
    • /
    • 2013
  • This research suggested smart fire recognition system which trances firing location with CCD camera with wired/wire-less TCP/IP function and Pan/Tilt function, delivers information in real time to android system installed by smart mobile communication system and controls fire and disaster remotely. To embody suggested method, firstly, algorithm which applies hue saturation intensity (HSI) Transform for input video, eliminates surrounding lightness and unnecessary videos and segmentalized only firing videos was suggested. Secondly, Pan/Tilt function traces accurate location of firing for proper control of firing. Thirdly, android communication system installed by mobile function confirms firing state and controls it. To confirm the suggested method, 10 firing videos were input and experiment was conducted. As the result, all of 10 videos segmentalized firing sector and traced all of firing locations.

ViStoryNet: Neural Networks with Successive Event Order Embedding and BiLSTMs for Video Story Regeneration (ViStoryNet: 비디오 스토리 재현을 위한 연속 이벤트 임베딩 및 BiLSTM 기반 신경망)

  • Heo, Min-Oh;Kim, Kyung-Min;Zhang, Byoung-Tak
    • KIISE Transactions on Computing Practices
    • /
    • v.24 no.3
    • /
    • pp.138-144
    • /
    • 2018
  • A video is a vivid medium similar to human's visual-linguistic experiences, since it can inculcate a sequence of situations, actions or dialogues that can be told as a story. In this study, we propose story learning/regeneration frameworks from videos with successive event order supervision for contextual coherence. The supervision induces each episode to have a form of trajectory in the latent space, which constructs a composite representation of ordering and semantics. In this study, we incorporated the use of kids videos as a training data. Some of the advantages associated with the kids videos include omnibus style, simple/explicit storyline in short, chronological narrative order, and relatively limited number of characters and spatial environments. We build the encoder-decoder structure with successive event order embedding, and train bi-directional LSTMs as sequence models considering multi-step sequence prediction. Using a series of approximately 200 episodes of kids videos named 'Pororo the Little Penguin', we give empirical results for story regeneration tasks and SEOE. In addition, each episode shows a trajectory-like shape on the latent space of the model, which gives the geometric information for the sequence models.

Automatic Detection of Highlights in Soccer videos based on analysis of scene structure (축구 동영상에서의 장면 구조 분석에 기반한 자동적인 하이라이트 장면 검출)

  • Park, Ki-Tae;Moon, Young-Shik
    • The KIPS Transactions:PartB
    • /
    • v.14B no.1 s.111
    • /
    • pp.1-4
    • /
    • 2007
  • In this paper, we propose an efficient scheme for automatically detecting highlight scenes in soccer videos. Highlights are defined as shooting scenes and goal scenes. Through the analysis of soccer videos, we notice that most of highlight scenes are shown around the goal post area. It is also noticed that the TV camera zooms in a setter player or spectators after the highlight stones. Detection of highlight scenes for soccer videos consists of three steps. The first step is the extraction of the playing field using a statistical threshold. The second step is the detection of goal posts. In the final step, we detect a zooming of a soccer player or spectators by using connected component labeling of non-playing field. In order to evaluate the performance of our method, the precision and the recall are computed. Experimental results have shown the effectiveness of the proposed method, with 95.2% precision and 85.4% recall.

Layered Coding Method for Scalable Coding of HDR and SDR videos (HDR와 SDR 비디오의 스케일러블 부호화를 위한 계층 압축 기법)

  • Lim, Jeongyun;Ahn, Yong-Jo;Lim, Woong;Park, Seanae;Sim, Donggyu;Kang, Jung-Won
    • Journal of Broadcast Engineering
    • /
    • v.20 no.5
    • /
    • pp.756-769
    • /
    • 2015
  • In this paper, we propose a scalable coding method for high dynamic range (HDR) and standard dynamic range (SDR) videos based on Scalable High Efficiency Video Coding (SHVC). The proposed method has multi-layer coding architecture that consists of base layer for SDR videos and enhancement layer for HDR videos to support the backward compatibility with legacy codec and display devices. Also, to improve coding efficiency of enhancement layers, a global inverse tone mapping is applied to the reconstructed SDR video and the compensated frames are referred for coding of the enhancement layer. The proposed method is found to achieve BD-Rate gain of 43.0% on average (maximum 76.3%) for the enhancement layer and 15.7% on average (maximum 31%) for dual-layer against the SHM 7.0 reference software.

Optical Properties Correction of a Heterogeneous Stereoscopic Camera (이종 입체 영상 카메라의 광학 특성 일치화)

  • Jung, Eun Kyung;Baek, Seung-Hae;Park, Soon-Yong;Jang, Ho-Wook
    • Journal of the Institute of Electronics and Information Engineers
    • /
    • v.49 no.11
    • /
    • pp.74-85
    • /
    • 2012
  • In this paper, we propose a optical property correction technique for a low-cost heterogeneous stereoscopic camera. Three main optical properties of a stereoscopic camera are zoom, focus, and DOF(depth of field). The difference or mis-match of these properties between two stereoscopic videos are the main causes of the visual fatigue to human eyes. The proposed correction technique reduces the difference of the optical properties between the stereoscopic videos and produces high-quality stereoscopic videos. To correct the zoom difference, a LUT(look-up table) is established to match the zoom ratio between the stereoscopic videos. To correct the DOF difference, the magnitude of image edge is measured and the lens iris is changed to control the DOF of the camera. A vertical-type stereoscopic rig is developed for the experiments of the optical property correction. Based on the experimental results, we find that a low-cost heterogeneous stereoscopic camera can be implemented, which can yield low visual fatigue to human eyes.

Effects of Self-directed Feedback Practice using Smartphone Videos on Basic Nursing Skills, Confidence in Performance and Learning Satisfaction (스마트 폰 동영상을 활용한 피드백 자율실습이 기본간호수기 수행능력, 수행자신감 및 학습만족도에 미치는 효과)

  • Lee, Seul Gi;Shin, Yun Hee
    • Journal of Korean Academy of Nursing
    • /
    • v.46 no.2
    • /
    • pp.283-292
    • /
    • 2016
  • Purpose: This study was done to verify effects of a self-directed feedback practice using smartphone videos on nursing students' basic nursing skills, confidence in performance and learning satisfaction. Methods: In this study an experimental study with a post-test only control group design was used. Twenty-nine students were assigned to the experimental group and 29 to the control group. Experimental treatment was exchanging feedback on deficiencies through smartphone recorded videos of nursing practice process taken by peers during self-directed practice. Results: Basic nursing skills scores were higher for all items in the experimental group compared to the control group, and differences were statistically significant ["Measuring vital signs" (t=-2.10, p=.039); "Wearing protective equipment when entering and exiting the quarantine room and the management of waste materials" (t=-4.74, p<.001) "Gavage tube feeding" (t=-2.70, p=.009)]. Confidence in performance was higher in the experimental group compared to the control group, but the differences were not statistically significant. However, after the complete practice, there was a statistically significant difference in overall performance confidence (t=-3.07. p=.003). Learning satisfaction was higher in the experimental group compared to the control group, but the difference was not statistically significant (t=-1.67, p=.100). Conclusion: Results of this study indicate that self-directed feedback practice using smartphone videos can improve basic nursing skills. The significance is that it can help nursing students gain confidence in their nursing skills for the future through improvement of basic nursing skills and performance of quality care, thus providing patients with safer care.

Adaptive Counting Line Detection for Traffic Analysis in CCTV Videos (CCTV영상 내 교통량 분석을 위한 적응적 계수선 검출 방법)

  • Jung, Hyeonseok;Lim, Seokjae;Lee, Ryong;Park, Minwoo;Lee, Sang-Hwan;Kim, Wonjun
    • Journal of Broadcast Engineering
    • /
    • v.25 no.1
    • /
    • pp.48-57
    • /
    • 2020
  • Recently, with the rapid development of image recognition technology, the demand for object analysis in road CCTV videos is increasing. In this paper, we propose a method that can adaptively find the counting line for traffic analysis in road CCTV videos. First, vehicles on the road are detected, and the corresponding positions of the detected vehicles are modeled as the two-dimensional pointwise Gaussian map. The paths of vehicles are estimated by accumulating pointwise Gaussian maps on successive video frames. Then, we apply clustering and linear regression to the accumulated Gaussian map to find the principal direction of the road, which is highly relevant to the counting line. Experimental results show that the proposed method for detecting the counting line is effective in various situations.