• 제목/요약/키워드: visual/object search

검색결과 42건 처리시간 0.027초

객체의 움직임을 고려한 탐색영역 설정에 따른 가중치를 공유하는 CNN구조 기반의 객체 추적 (Object Tracking based on Weight Sharing CNN Structure according to Search Area Setting Method Considering Object Movement)

  • 김정욱;노용만
    • 한국멀티미디어학회논문지
    • /
    • 제20권7호
    • /
    • pp.986-993
    • /
    • 2017
  • Object Tracking is a technique for tracking moving objects over time in a video image. Using object tracking technique, many research are conducted such a detecting dangerous situation and recognizing the movement of nearby objects in a smart car. However, it still remains a challenging task such as occlusion, deformation, background clutter, illumination variation, etc. In this paper, we propose a novel deep visual object tracking method that can be operated in robust to many challenging task. For the robust visual object tracking, we proposed a Convolutional Neural Network(CNN) which shares weight of the convolutional layers. Input of the CNN is a three; first frame object image, object image in a previous frame, and current search frame containing the object movement. Also we propose a method to consider the motion of the object when determining the current search area to search for the location of the object. Extensive experimental results on a authorized resource database showed that the proposed method outperformed than the conventional methods.

Small Object Segmentation Based on Visual Saliency in Natural Images

  • Manh, Huynh Trung;Lee, Gueesang
    • Journal of Information Processing Systems
    • /
    • 제9권4호
    • /
    • pp.592-601
    • /
    • 2013
  • Object segmentation is a challenging task in image processing and computer vision. In this paper, we present a visual attention based segmentation method to segment small sized interesting objects in natural images. Different from the traditional methods, we first search the region of interest by using our novel saliency-based method, which is mainly based on band-pass filtering, to obtain the appropriate frequency. Secondly, we applied the Gaussian Mixture Model (GMM) to locate the object region. By incorporating the visual attention analysis into object segmentation, our proposed approach is able to narrow the search region for object segmentation, so that the accuracy is increased and the computational complexity is reduced. The experimental results indicate that our proposed approach is efficient for object segmentation in natural images, especially for small objects. Our proposed method significantly outperforms traditional GMM based segmentation.

다중 도메인 데이터 기반 구별적 모델 예측 트레커를 위한 동적 탐색 영역 특징 강화 기법 (Reinforced Feature of Dynamic Search Area for the Discriminative Model Prediction Tracker based on Multi-domain Dataset)

  • 이준하;원홍인;김병학
    • 대한임베디드공학회논문지
    • /
    • 제16권6호
    • /
    • pp.323-330
    • /
    • 2021
  • Visual object tracking is a challenging area of study in the field of computer vision due to many difficult problems, including a fast variation of target shape, occlusion, and arbitrary ground truth object designation. In this paper, we focus on the reinforced feature of the dynamic search area to get better performance than conventional discriminative model prediction trackers on the condition when the accuracy deteriorates since low feature discrimination. We propose a reinforced input feature method shown like the spotlight effect on the dynamic search area of the target tracking. This method can be used to improve performances for deep learning based discriminative model prediction tracker, also various types of trackers which are used to infer the center of the target based on the visual object tracking. The proposed method shows the improved tracking performance than the baseline trackers, achieving a relative gain of 38% quantitative improvement from 0.433 to 0.601 F-score at the visual object tracking evaluation.

검색 포털들의 모바일 검색 기능 분석 (Analysis of Mobile Search Functions of Korean Search Portals)

  • 박소연
    • 정보관리학회지
    • /
    • 제29권1호
    • /
    • pp.175-190
    • /
    • 2012
  • 본 연구에서는 국내 주요 검색 포털들인 구글 코리아, 네이버, 네이트, 다음, 야후 코리아의 모바일 검색 기능을 분석, 평가하고자 한다. 좀 더 구체적으로 이 연구에서는 유선 검색과 차별화되는 모바일 검색 기능인 음성 검색, 음악 검색, 코드 검색, 비주얼 검색(사물 검색) 등에 초점을 맞추고, 이러한 검색 기법의 특징을 포털별로 조사하고, 검색 성능을 인식의 정확도와 인식 속도에 근거하여 비교, 평가하고자 한다. 조사 결과, 네이버와 다음이 가장 다양한 모바일 검색 기능을 제공하고 있었으며, 구글은 음성 검색만을 제공하고 있었고, 네이트와 야후는 어떠한 특화된 기능도 제공하지 않고 있었다. 본 연구의 결과는 향후 포털의 효과적인 모바일 검색 기능의 개발에 활용될 수 있을 것으로 기대된다.

Modeling the Visual Target Search in Natural Scenes

  • Park, Daecheol;Myung, Rohae;Kim, Sang-Hyeob;Jang, Eun-Hye;Park, Byoung-Jun
    • 대한인간공학회지
    • /
    • 제31권6호
    • /
    • pp.705-713
    • /
    • 2012
  • Objective: The aim of this study is to predict human visual target search using ACT-R cognitive architecture in real scene images. Background: Human uses both the method of bottom-up and top-down process at the same time using characteristics of image itself and knowledge about images. Modeling of human visual search also needs to include both processes. Method: In this study, visual target object search performance in real scene images was analyzed comparing experimental data and result of ACT-R model. 10 students participated in this experiment and the model was simulated ten times. This experiment was conducted in two conditions, indoor images and outdoor images. The ACT-R model considering the first saccade region through calculating the saliency map and spatial layout was established. Proposed model in this study used the guide of visual search and adopted visual search strategies according to the guide. Results: In the analysis results, no significant difference on performance time between model prediction and empirical data was found. Conclusion: The proposed ACT-R model is able to predict the human visual search process in real scene images using salience map and spatial layout. Application: This study is useful in conducting model-based evaluation in visual search, particularly in real images. Also, this study is able to adopt in diverse image processing program such as helper of the visually impaired.

Image Processing-based Object Recognition Approach for Automatic Operation of Cranes

  • Zhou, Ying;Guo, Hongling;Ma, Ling;Zhang, Zhitian
    • 국제학술발표논문집
    • /
    • The 8th International Conference on Construction Engineering and Project Management
    • /
    • pp.399-408
    • /
    • 2020
  • The construction industry is suffering from aging workers, frequent accidents, as well as low productivity. With the rapid development of information technologies in recent years, automatic construction, especially automatic cranes, is regarded as a promising solution for the above problems and attracting more and more attention. However, in practice, limited by the complexity and dynamics of construction environment, manual inspection which is time-consuming and error-prone is still the only way to recognize the search object for the operation of crane. To solve this problem, an image-processing-based automated object recognition approach is proposed in this paper, which is a fusion of Convolutional-Neutral-Network (CNN)-based and traditional object detections. The search object is firstly extracted from the background by the trained Faster R-CNN. And then through a series of image processing including Canny, Hough and Endpoints clustering analysis, the vertices of the search object can be determined to locate it in 3D space uniquely. Finally, the features (e.g., centroid coordinate, size, and color) of the search object are extracted for further recognition. The approach presented in this paper was implemented in OpenCV, and the prototype was written in Microsoft Visual C++. This proposed approach shows great potential for the automatic operation of crane. Further researches and more extensive field experiments will follow in the future.

  • PDF

이중 능동보 모델을 이용한 영상 추적 알고리즘 (Visual tracking algorithm using the double active bar models)

  • 고국원;김재선;조형석
    • 제어로봇시스템학회:학술대회논문집
    • /
    • 제어로봇시스템학회 1996년도 한국자동제어학술회의논문집(국내학술편); 포항공과대학교, 포항; 24-26 Oct. 1996
    • /
    • pp.89-92
    • /
    • 1996
  • In this paper, we developed visual tracking algorithm using double active bar. The active bar model to represent the object can reduce the search space of energy surface and better performance than those of snake model. However, the contour will not find global equilibrium when driving force caused by image may be weak. To overcome this problem. Double active bar is proposed for finding the global minimum point without any dependence on initialization. To achieve the goal, an deformable model with two initial contours in attempted to search for a global minimum within two specific initial contours. This approach improve the performance of finding the contour of target. To evaluate the performance, some experiments are executed. We can achieved the good result for tracking a object on noisy image.

  • PDF

저차원화된 리커런트 뉴럴 네트워크를 이용한 비주얼 서보잉 (Visual Servoing of Robot Manipulators using Pruned Recurrent Neural Networks)

  • 김대준;이동욱;심귀보
    • 한국지능시스템학회:학술대회논문집
    • /
    • 한국퍼지및지능시스템학회 1997년도 춘계학술대회 학술발표 논문집
    • /
    • pp.259-262
    • /
    • 1997
  • This paper presents a visual servoing of RV-M2 robot manipulators to track and grasp moving object, using pruned dynamic recurrent neural networks(DRNN). The object is stationary in the robot work space and the robot is tracking and grasping the object by using CCD camera mounted on the end-effector. In order to optimize the structure of DRNN, we decide the node whether delete or add, by mutation probability, first in case of delete node, the node which have minimum sum of input weight is actually deleted, and then in case of add node, the weight is connected according to the number of case which added node can reach the other nodes. Using evolutionary programming(EP) that search the struture and weight of the DRNN, and evolution strategies(ES) which train the weight of neuron, we pruned the net structure of DRNN. We applied the DRNN to the Visual Servoing of a robot manipulators to control position and orientation of end-effector, and the validity and effectiveness of the pro osed control scheme will be verified by computer simulations.

  • PDF

최적화된 신경회로망을 이용한 동적물체의 비주얼 서보잉 (Visual servoing of robot manipulators using the neural network with optimal structure)

  • 김대준;전효병;심귀보
    • 제어로봇시스템학회:학술대회논문집
    • /
    • 제어로봇시스템학회 1996년도 한국자동제어학술회의논문집(국내학술편); 포항공과대학교, 포항; 24-26 Oct. 1996
    • /
    • pp.302-305
    • /
    • 1996
  • This paper presents a visual servoing combined by Neural Network with optimal structure and predictive control for robotic manipulators to tracking or grasping of the moving object. Using the four feature image information from CCD camera attached to end-effector of RV-M2 robot manipulator having 5 dof, we want to predict the updated position of the object. The Kalman filter is used to estimate the motion parameters, namely the state vector of the moving object in successive image frames, and using the multi layer feedforward neural network that permits the connection of other layers, evolutionary programming(EP) that search the structure and weight of the neural network, and evolution strategies(ES) which training the weight of neuron, we optimized the net structure of control scheme. The validity and effectiveness of the proposed control scheme and predictive control of moving object will be verified by computer simulation.

  • PDF

변형된 절두체 컬링을 이용한 3차원 FPS 게임에서의 오브젝트 탐색 연구 (A Study on the Object Search in 3D FPS Games Using Modified Frustum Culling)

  • 최원태;박창민
    • 한국정보통신학회:학술대회논문집
    • /
    • 한국해양정보통신학회 2007년도 춘계종합학술대회
    • /
    • pp.105-108
    • /
    • 2007
  • 최근, 3차원 슈팅 게임들은 빠른 화면 전환과 카메라 시야에 있는 오브젝트들을 상대로 게임을 한다. 특히 온라인 게임에서는 플레이어의 시야에 있지 않은 상대 오브젝트들의 위협성을 인식하는 것이 매우 중요하다. 본 논문에서는 변형된 절두체 컬링을 이용하여 3차원 FPS 게임에서의 오브젝트를 효율적으로 탐색하는 방법을 제시하였다. 플레이어가 감지하지 못하는 오브젝트를 위해 플레이어와 카메라 위치를 일치 시켰으며, 위협적인 오브젝트들의 개수를 파악하기 위해 플레이어와 오브젝트들의 거리를 이용하였다. 제안한 방법은 향후 3차원 FPS 게임의 발전에 주요한 역할을 할 것이다.

  • PDF