• Title/Summary/Keyword: visual descriptor

Image Retrieval Method Using Color Descriptor (색상 정보를 이용한 영상 검색 기법)

  • Cho, Jae-Hoon;Lee, Sang-Ho;Kim, Young-Seop
    • Journal of the Semiconductor & Display Technology / v.7 no.2 / pp.69-76 / 2008
  • As multimedia data continues to grow and multimedia processing applications multiply, efficient image retrieval is required in many application fields and has become a major concern. Furthermore, rapid improvements in hardware technology over the last few years have made it possible to process, store, and retrieve huge amounts of multimedia data. As a result, Content-Based Image Retrieval (CBIR) has received widespread interest during the last decade. This paper proposes a content-based retrieval system that performs image retrieval through effective feature analysis of semantically significant objects, using YCbCr channel merging based on the characteristics of the human visual system.

Implementation of Object Feature Extraction within Image for Object Tracking (객체 추적을 위한 영상 내의 객체 특징점 추출 알고리즘 구현)

  • Lee, Yong-Hwan;Kim, Youngseop
    • Journal of the Semiconductor & Display Technology / v.17 no.3 / pp.113-116 / 2018
  • This paper proposes a mobile image search system that uses smartphone sensor information, runs in a variety of environments, and is implemented on the Android platform. The implemented system uses a new image descriptor that combines a visual feature (CEDD) with the EXIF attributes of JPEG images, together with an image matching scheme optimized for the mobile platform. Experimental results show that the proposed method achieves significantly improved search results, with a precision of around 80% on a large image database. Considering its processing time and precision, we think the proposed method can be used in other application fields.

Hierarchical Graph Based Segmentation and Consensus based Human Tracking Technique

  • Ramachandra, Sunitha Madasi;Jayanna, Haradagere Siddaramaiah;Ramegowda, Ramegowda
    • Journal of Information Processing Systems / v.15 no.1 / pp.67-90 / 2019
  • Accurate detection, tracking, and analysis of human movement using robots and other visual surveillance systems is still a challenge. Efforts are under way to make such systems robust against variations in shape, size, and pose, and against occlusion. Traditional detection methods used the sliding window approach, which scans windows of various sizes across an image. This paper concentrates on employing a state-of-the-art hierarchical graph based method for segmentation. It has two stages: part level segmentation for color-consistent segments and object level segmentation for category-consistent regions. The tracking phase employs a SIFT keypoint descriptor based technique in a combined matching and tracking scheme with a validation phase. The human region in each frame is localized by keypoints that cast votes for the center of the detected human region. As it is difficult to avoid incorrect keypoints, a consensus-based framework is used to assess voting behavior. The designed methodology is tested on video sequences containing 3 to 4 persons.
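
The center-voting step in the abstract above can be illustrated with a minimal Python sketch. The abstract does not specify the consensus framework, so this sketch substitutes a simple median-based consensus; the function name `vote_for_center`, the stored `offsets` from each keypoint to the object center, and the `inlier_radius` parameter are all hypothetical and are not taken from the paper.

```python
import math
from statistics import median


def vote_for_center(keypoints, offsets, inlier_radius=10.0):
    """Estimate an object center from keypoint votes.

    keypoints: list of (x, y) keypoint positions in the current frame.
    offsets:   list of (dx, dy) offsets from each keypoint to the object
               center, assumed to be known from an earlier frame.
    Returns the refined center and the indices of consistent keypoints.
    """
    # Each keypoint casts one vote for where the center should be.
    votes = [(x + dx, y + dy) for (x, y), (dx, dy) in zip(keypoints, offsets)]

    # Assumed consensus rule: the coordinate-wise median is robust
    # to a minority of incorrect keypoints.
    cx = median(v[0] for v in votes)
    cy = median(v[1] for v in votes)

    # Keypoints whose vote lies close to the consensus are kept as inliers.
    inliers = [
        i for i, (vx, vy) in enumerate(votes)
        if math.hypot(vx - cx, vy - cy) <= inlier_radius
    ]
    if not inliers:
        return (cx, cy), inliers

    # Refine the center using inlier votes only.
    rx = sum(votes[i][0] for i in inliers) / len(inliers)
    ry = sum(votes[i][1] for i in inliers) / len(inliers)
    return (rx, ry), inliers


# Three consistent keypoints and one incorrect match.
keypoints = [(10, 10), (20, 10), (10, 30), (90, 90)]
offsets = [(5, 10), (-5, 10), (5, -10), (0, 0)]
center, inliers = vote_for_center(keypoints, offsets)
```

In this toy input the fourth keypoint votes far from the others, so it is excluded from the inlier set and does not affect the refined center.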

MPEG-7 based Video/Image Retrieval System (VIRS) (MPEG-7 기반 비디오/이미지 검색 시스템(VIRS))

  • Lee, Jae-Ho;Kim, Hyoung-Joon;Kim, Whoi-Yul
    • The KIPS Transactions:PartB / v.10B no.5 / pp.543-552 / 2003
  • The growing quantity of multimedia data has created a new problem: the desired data must be retrieved quickly and accurately. Adequate representation is a key element of efficient retrieval. For this reason, the MPEG-7 standard was established in 2001 for the description of multimedia data. However, the standard is massive, and because it must cover a large number of potential cases, how to apply it in a real application system is not yet clear. In this paper, we propose an implementation scheme for a retrieval system that uses only visual descriptors and present the performance results of the developed system. Using the developed system, MPEG-7 VIRS (Video/Image Retrieval System), we compare the retrieval results obtained with individual descriptors against those obtained with multiple descriptors, and outline a layout for a real application system.

The Analysis of Visual Descriptors for Content-based Video Retrieval (내용기반 비디오 검색을 위한 MPEG-7 비주얼 디스크립터 분석)

  • Kim, Seong-Hee
    • Journal of the Korean BIBLIA Society for library and Information Science / v.16 no.2 / pp.157-175 / 2005
  • The main purpose of this paper is to explain and analyze the MPEG-7 visual descriptors for representing multimedia content. This study describes in detail the MPEG-7 visual descriptors for color, shape, texture, and motion, using examples and application areas. The analysis shows that these visual descriptors can represent rich and deep features in the multimedia content domain. The use of these descriptors will also increase retrieval effectiveness as well as interoperability among systems through consistent content representation.

Neural Network Based Image Genre Classification (Neural Network을 이용한 이미지 장르 분류 시스템)

  • Ahn, Jae-Hoon;Lee, Han-Ku;Ju, Hyun-Ho
    • Proceedings of the Korean Information Science Society Conference / 2006.10b / pp.330-335 / 2006
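  • This paper introduces an image genre (type) classification system using a neural network. The proposed system classifies an image into one of three genres (types): art, photo, or cartoon. Image features are extracted using the standard MPEG-7 visual descriptors and then learned by neural networks. Simulation results show that the proposed system classifies more than 80% of the images into the correct genre (type).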

A Practical Solution toward SLAM in Indoor environment Based on Visual Objects and Robust Sonar Features (가정환경을 위한 실용적인 SLAM 기법 개발 : 비전 센서와 초음파 센서의 통합)

  • Ahn, Sung-Hwan;Choi, Jin-Woo;Choi, Min-Yong;Chung, Wan-Kyun
    • The Journal of Korea Robotics Society / v.1 no.1 / pp.25-35 / 2006
  • Making SLAM practical requires fusing various sensors effectively to cope with uncertainty arising from both the environment and the sensors. Combining sonar and vision sensors offers advantages in cost and complementarity. In particular, it can remedy the false data association and divergence problems of sonar sensors, and overcome the low-frequency SLAM updates of vision sensors, which are caused by their computational burden and sensitivity to illumination changes. In this paper, we propose a SLAM method that combines sonar sensors with a stereo camera. It consists of two schemes: extracting robust point and line features from sonar data, and recognizing planar visual objects from a pre-constructed object database using a multi-scale Harris corner detector and its SIFT descriptor. Fusing sonar features and visual objects through EKF-SLAM provides correct data association via object recognition and high-frequency updates via sonar features. As a result, it increases the robustness and accuracy of SLAM in indoor environments. The performance of the proposed algorithm was verified by experiments in a home-like environment.

A new approach for content-based video retrieval

  • Kim, Nac-Woo;Lee, Byung-Tak;Koh, Jai-Sang;Song, Ho-Young
    • International Journal of Contents / v.4 no.2 / pp.24-28 / 2008
  • In this paper, we propose a new approach for content-based video retrieval using non-parametric motion classification in a shot-based video indexing structure. The proposed system supports real-time video retrieval through spatio-temporal feature comparison: it measures the similarity between visual features and between motion features, respectively, after extracting a representative frame and non-parametric motion information from shot-based video clips segmented by a scene change detection method. After normalized motion vectors are created from an MPEG-compressed stream, the non-parametric motion features are extracted by discretizing each normalized motion vector into angle bins and considering the mean, variance, and direction of the motion vectors in these bins. To obtain the visual feature of the representative frame, we use an edge-based spatial descriptor. Experimental results show that our approach is superior to conventional methods in video indexing and retrieval performance.
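
The angle-bin discretization described in the abstract above can be sketched in Python as follows. This is only an illustration under stated assumptions: the input vectors are taken to be already normalized, the number of bins, the function name `angle_bin_features`, and the per-bin tuple (count, mean magnitude, magnitude variance) are hypothetical choices, and the bin index stands in for the direction. None of these details come from the paper.

```python
import math


def angle_bin_features(vectors, num_bins=8):
    """Summarize motion vectors by direction.

    vectors: iterable of (dx, dy) motion vectors (assumed normalized).
    Returns one (count, mean_magnitude, variance) tuple per angle bin;
    the bin index encodes the quantized direction.
    """
    bins = [[] for _ in range(num_bins)]
    bin_width = 2 * math.pi / num_bins

    for dx, dy in vectors:
        magnitude = math.hypot(dx, dy)
        if magnitude == 0:
            continue  # a zero vector has no direction
        # Map the angle from (-pi, pi] to [0, 2*pi) before binning.
        angle = math.atan2(dy, dx) % (2 * math.pi)
        index = int(angle / bin_width) % num_bins
        bins[index].append(magnitude)

    features = []
    for magnitudes in bins:
        if not magnitudes:
            features.append((0, 0.0, 0.0))
            continue
        mean = sum(magnitudes) / len(magnitudes)
        variance = sum((m - mean) ** 2 for m in magnitudes) / len(magnitudes)
        features.append((len(magnitudes), mean, variance))
    return features


# Two vectors pointing into the first quadrant, one into the second,
# and one zero vector that is ignored.
features = angle_bin_features([(3, 4), (6, 8), (-3, 4), (0, 0)], num_bins=4)
```

Two shots could then be compared by a distance between their per-bin feature lists; the choice of distance is left open here because the abstract does not state it.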

Reduced-Reference Quality Assessment for Compressed Videos Based on the Similarity Measure of Edge Projections (에지 투영의 유사도를 이용한 압축된 영상에 대한 Reduced-Reference 화질 평가)

  • Kim, Dong-O;Park, Rae-Hong;Sim, Dong-Gyu
    • Journal of the Institute of Electronics Engineers of Korea SP / v.45 no.3 / pp.37-45 / 2008
  • Quality assessment aims to evaluate whether a distorted image or video has good quality by measuring the difference between the original and distorted images or videos. In this paper, to assess the visual quality of a distorted image or video, visual features of the distorted image are compared with those of the original image, instead of comparing the distorted image directly with the original image. We use edge projections from the two images as features, where an edge projection is easily obtained by projecting the edge pixels of an edge map along the vertical or horizontal direction. In this paper, edge projections are obtained by using the vertical/horizontal directions of gradients as well as the magnitude of each gradient. Experimental results show the effectiveness of the proposed quality assessment through comparison with conventional quality assessment algorithms such as the structural similarity (SSIM), edge peak signal-to-noise ratio (EPSNR), and edge histogram descriptor (EHD) methods.
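
As a rough illustration of the edge projection idea in the abstract above, the following Python sketch builds a binary edge map and projects it along rows and columns. It is a simplified sketch, not the paper's method: the central-difference gradient, the fixed `threshold`, and the function names `edge_map` and `edge_projections` are assumptions, and the paper's use of gradient directions and its similarity measure are not reproduced.

```python
def edge_map(image, threshold):
    """Binary edge map from the central-difference gradient magnitude.

    image: 2D list of gray levels. Border pixels are left as non-edges.
    """
    height, width = len(image), len(image[0])
    edges = [[0] * width for _ in range(height)]
    for y in range(1, height - 1):
        for x in range(1, width - 1):
            gx = image[y][x + 1] - image[y][x - 1]
            gy = image[y + 1][x] - image[y - 1][x]
            if (gx * gx + gy * gy) ** 0.5 >= threshold:
                edges[y][x] = 1
    return edges


def edge_projections(edges):
    """Count edge pixels per row and per column of an edge map."""
    row_profile = [sum(row) for row in edges]
    column_profile = [sum(column) for column in zip(*edges)]
    return row_profile, column_profile


# A 6x6 test image with a vertical step edge between columns 2 and 3.
image = [[0, 0, 0, 100, 100, 100] for _ in range(6)]
edges = edge_map(image, threshold=50)
row_profile, column_profile = edge_projections(edges)
```

Because the two profiles are short one-dimensional signals, they are much cheaper to transmit and compare than the full images, which is what makes this kind of feature suitable for reduced-reference assessment.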

Compression Method for MPEG CDVA Global Feature Descriptors (MPEG CDVA 전역 특징 서술자 압축 방법)

  • Kim, Joonsoo;Jo, Won;Lim, Guentaek;Yun, Joungil;Kwak, Sangwoon;Jung, Soon-heung;Cheong, Won-Sik;Choo, Hyon-Gon;Seo, Jeongil;Choi, Yukyung
    • Journal of Broadcast Engineering / v.27 no.3 / pp.295-307 / 2022
  • In this paper, we propose a novel compression method for scalable Fisher vectors (SCFV), which are used as the global visual feature description of individual video frames in the MPEG CDVA standard. The CDVA standard has adopted a temporal descriptor redundancy removal technique that takes advantage of the correlation between the global feature descriptors of adjacent keyframes. However, due to the variable-length property of SCFV, the temporal redundancy removal scheme often results in inferior compression efficiency, in some cases worse than when the SCFVs are not compressed at all. To enhance the compression efficiency, we propose an asymmetric SCFV difference computation method and an SCFV reconstruction method. Experiments on the FIVR dataset show that the proposed method significantly improves the compression efficiency compared to the original CDVA Experimental Model implementation.