• 제목/요약/키워드: Video Object Segmentation

Search Result 141, Processing Time 0.022 seconds

Enhancement Techniques of Color Segmentation for Detecting Missing Persons in Smart Lighting System using Radar and Camera Sensors (레이다 및 카메라 내장형 스마트 조명에서 실종자 탐지용 색상 검출 향상 기법)

  • Song, Seungeon;Kim, Sangdong;Jin, Young-Seok;Lee, Jonghun
    • Journal of Korea Society of Industrial Information Systems
    • /
    • v.25 no.3
    • /
    • pp.53-59
    • /
    • 2020
  • This paper proposes color segmentation for detecting missing persons in a smart lighting system using radar and camera sensors. Recently, smart lighting systems built-in radar and cameras have been efficient in saving energy and searching for missing persons, simultaneously. In smart lighting systems, radar detects moving objects and then the lights turn on and camera records. The video recorded is useful to find out missing persons. The color of their clothes worn in missing persons is one of critical hints to look for missing persons. Therefore, color segmentation is an effective means for detecting the color of their clothes. In this paper, during the color segmentation step, the ROI(Region of interest) setting based on the size of an object is applied and the background is reduced. According to experimental results, the color segmentation has good accuracy of more than 97%.

Confidence-based Background Subtraction Algorithm for Moving Cameras (움직이는 카메라를 위한 신뢰도 기반의 배경 제거 알고리즘)

  • Mun, Hyeok;Lee, Bok Ju;Choi, Young Kyu
    • Journal of the Semiconductor & Display Technology
    • /
    • v.16 no.4
    • /
    • pp.30-35
    • /
    • 2017
  • Moving object segmentation from a nonstationary camera is a difficult problem due to the motion of both camera and the object. In this paper, we propose a new confidence-based background subtraction technique from moving camera. The method is based on clustering of motion vectors and generating adaptive multi-homography from a pair of adjacent video frames. The main innovation concerns the use of confidence images for each foreground and background motion groups. Experimental results revealed that our confidence-based approach robustly detect moving targets in sequences taken by a freely moving camera.

  • PDF

Optimizing artificial neural network architectures for enhanced soil type classification

  • Yaren Aydin;Gebrail Bekdas;Umit Isikdag;Sinan Melih Nigdeli;Zong Woo Geem
    • Geomechanics and Engineering
    • /
    • v.37 no.3
    • /
    • pp.263-277
    • /
    • 2024
  • Artificial Neural Networks (ANNs) are artificial learning algorithms that provide successful results in solving many machine learning problems such as classification, prediction, object detection, object segmentation, image and video classification. There is an increasing number of studies that use ANNs as a prediction tool in soil classification. The aim of this research was to understand the role of hyperparameter optimization in enhancing the accuracy of ANNs for soil type classification. The research results has shown that the hyperparameter optimization and hyperparamter optimized ANNs can be utilized as an efficient mechanism for increasing the estimation accuracy for this problem. It is observed that the developed hyperparameter tool (HyperNetExplorer) that is utilizing the Covariance Matrix Adaptation Evolution Strategy (CMAES), Genetic Algorithm (GA) and Jaya Algorithm (JA) optimization techniques can be successfully used for the discovery of hyperparameter optimized ANNs, which can accomplish soil classification with 100% accuracy.

Content based Video Segmentation Algorithm using Comparison of Pattern Similarity (장면의 유사도 패턴 비교를 이용한 내용기반 동영상 분할 알고리즘)

  • Won, In-Su;Cho, Ju-Hee;Na, Sang-Il;Jin, Ju-Kyong;Jeong, Jae-Hyup;Jeong, Dong-Seok
    • Journal of Korea Multimedia Society
    • /
    • v.14 no.10
    • /
    • pp.1252-1261
    • /
    • 2011
  • In this paper, we propose the comparison method of pattern similarity for video segmentation algorithm. The shot boundary type is categorized as 2 types, abrupt change and gradual change. The representative examples of gradual change are dissolve, fade-in, fade-out or wipe transition. The proposed method consider the problem to detect shot boundary as 2-class problem. We concentrated if the shot boundary event happens or not. It is essential to define similarity between frames for shot boundary detection. We proposed 2 similarity measures, within similarity and between similarity. The within similarity is defined by feature comparison between frames belong to same shot. The between similarity is defined by feature comparison between frames belong to different scene. Finally we calculated the statistical patterns comparison between the within similarity and between similarity. Because this measure is robust to flash light or object movement, our proposed algorithm make contribution towards reducing false positive rate. We employed color histogram and mean of sub-block on frame image as frame feature. We performed the experimental evaluation with video dataset including set of TREC-2001 and TREC-2002. The proposed algorithm shows the performance, 91.84% recall and 86.43% precision in experimental circumstance.

Design of Moving Picture Retrieval System using Scene Change Technique (장면 전환 기법을 이용한 동영상 검색 시스템 설계)

  • Kim, Jang-Hui;Kang, Dae-Seong
    • Journal of the Institute of Electronics Engineers of Korea SP
    • /
    • v.44 no.3
    • /
    • pp.8-15
    • /
    • 2007
  • Recently, it is important to process multimedia data efficiently. Especially, in case of retrieval of multimedia information, technique of user interface and retrieval technique are necessary. This paper proposes a new technique which detects cuts effectively in compressed image information by MPEG. A cut is a turning point of scenes. The cut-detection is the basic work and the first-step for video indexing and retrieval. Existing methods have a weak point that they detect wrong cuts according to change of a screen such as fast motion of an object, movement of a camera and a flash. Because they compare between previous frame and present frame. The proposed technique detects shots at first using DC(Direct Current) coefficient of DCT(Discrete Cosine Transform). The database is composed of these detected shots. Features are extracted by HMMD color model and edge histogram descriptor(EHD) among the MPEG-7 visual descriptors. And detections are performed in sequence by the proposed matching technique. Through this experiments, an improved video segmentation system is implemented that it performs more quickly and precisely than existing techniques have.

Depth-adaptive Sharpness Adjustments for Stereoscopic Perception Improvement and Hardware Implementation

  • Kim, Hak Gu;Kang, Jin Ku;Song, Byung Cheol
    • IEIE Transactions on Smart Processing and Computing
    • /
    • v.3 no.3
    • /
    • pp.110-117
    • /
    • 2014
  • This paper reports a depth-adaptive sharpness adjustment algorithm for stereoscopic perception improvement, and presents its field-programmable gate array (FPGA) implementation results. The first step of the proposed algorithm was to estimate the depth information of an input stereo video on a block basis. Second, the objects in the input video were segmented according to their depths. Third, the sharpness of the foreground objects was enhanced and that of the background was maintained or weakened. This paper proposes a new sharpness enhancement algorithm to suppress visually annoying artifacts, such as jagging and halos. The simulation results show that the proposed algorithm can improve stereoscopic perception without intentional depth adjustments. In addition, the hardware architecture of the proposed algorithm was designed and implemented on a general-purpose FPGA board. Real-time processing for full high-definition stereo videos was accomplished using 30,278 look-up tables, 24,553 registers, and 1,794,297 bits of memory at an operating frequency of 200MHz.

Design of a Recognizing System for Vehicle's License Plates with English Characters

  • Xing, Xiong;Choi, Byung-Jae;Chae, Seog;Lee, Mun-Hee
    • International Journal of Fuzzy Logic and Intelligent Systems
    • /
    • v.9 no.3
    • /
    • pp.166-171
    • /
    • 2009
  • In recent years, video detection systems have been implemented in various infrastructures such as airport, public transportation, power generation system, water dam and so on. Recognizing moving objects in video sequence is an important problem in computer vision, with applications in several fields, such as video surveillance and target tracking. Segmentation and tracking of multiple vehicles in crowded situations is made difficult by inter-object occlusion. In the system described in this paper, the mean shift algorithm is firstly used to filter and segment a color vehicle image in order to get candidate regions. These candidate regions are then analyzed and classified in order to decide whether a candidate region contains a license plate or not. And then some characters in the license plate is recognized by using the fuzzy ARTMAP neural network, which is a relatively new architecture of the neural network family and has the capability to learn incrementally unlike the conventional BP network. We finally design a license plate recognition system using the mean shift algorithm and fuzzy ARTMAP neural network and show its performance via some computer simulations.

Evaluation of Video Codec AI-based Multiple tasks (인공지능 기반 멀티태스크를 위한 비디오 코덱의 성능평가 방법)

  • Kim, Shin;Lee, Yegi;Yoon, Kyoungro;Choo, Hyon-Gon;Lim, Hanshin;Seo, Jeongil
    • Journal of Broadcast Engineering
    • /
    • v.27 no.3
    • /
    • pp.273-282
    • /
    • 2022
  • MPEG-VCM(Video Coding for Machine) aims to standardize video codec for machines. VCM provides data sets and anchors, which provide reference data for comparison, for several machine vision tasks including object detection, object segmentation, and object tracking. The evaluation template can be used to compare compression and machine vision task performance between anchor data and various proposed video codecs. However, performance comparison is carried out separately for each machine vision task, and information related to performance evaluation of multiple machine vision tasks on a single bitstream is not provided currently. In this paper, we propose a performance evaluation method of a video codec for AI-based multi-tasks. Based on bits per pixel (BPP), which is the measure of a single bitstream size, and mean average precision(mAP), which is the accuracy measure of each task, we define three criteria for multi-task performance evaluation such as arithmetic average, weighted average, and harmonic average, and to calculate the multi-tasks performance results based on the mAP values. In addition, as the dynamic range of mAP may very different from task to task, performance results for multi-tasks are calculated and evaluated based on the normalized mAP in order to prevent a problem that would be happened because of the dynamic range.

Spatiotemporal Saliency-Based Video Summarization on a Smartphone (스마트폰에서의 시공간적 중요도 기반의 비디오 요약)

  • Lee, Won Beom;Williem, Williem;Park, In Kyu
    • Journal of Broadcast Engineering
    • /
    • v.18 no.2
    • /
    • pp.185-195
    • /
    • 2013
  • In this paper, we propose a video summarization technique on a smartphone, based on spatiotemporal saliency. The proposed technique detects scene changes by computing the difference of the color histogram, which is robust to camera and object motion. Then the similarity between adjacent frames, face region, and frame saliency are computed to analyze the spatiotemporal saliency in a video clip. Over-segmented hierarchical tree is created using scene changes and is updated iteratively using mergence and maintenance energies computed during the analysis procedure. In the updated hierarchical tree, segmented frames are extracted by applying a greedy algorithm on the node with high saliency when it satisfies the reduction ratio and the minimum interval requested by the user. Experimental result shows that the proposed method summaries a 2 minute-length video in about 10 seconds on a commercial smartphone. The summarization quality is superior to the commercial video editing software, Muvee.

Ship Detection Using Visual Saliency Map and Mean Shift Algorithm (시각집중과 평균이동 알고리즘을 이용한 선박 검출)

  • Park, Jang-Sik
    • The Journal of the Korea institute of electronic communication sciences
    • /
    • v.8 no.2
    • /
    • pp.213-218
    • /
    • 2013
  • In this paper, a video based ship detection method is proposed to monitor port efficiently. Visual saliency map algorithm and mean shift algorithm is applied to detect moving ships don't include background information which is difficult to track moving ships. It is easy to detect ships at the port using saliency map algorithm, because it is very effective to extract saliency object from background. To remove background information in the saliency region, image segmentation and clustering using mean shift algorithm is used. As results of detecting simulation with images of a camera installed at the harbor, it is shown that the proposed method is effective to detect ships.