• Title/Summary/Keyword: Object Segment

Search Result 205, Processing Time 0.028 seconds

Stereo Image Composition Using Poisson Object Editing (포아송 객체 편집을 이용한 스테레오 영상 합성)

  • Baek, Eu-Tteum;Ho, Yo-Sung
    • The Journal of Korean Institute of Communications and Information Sciences
    • /
    • v.39A no.8
    • /
    • pp.453-458
    • /
    • 2014
  • In this paper, we propose a stereo image composition method based on Poisson image editing. If we synthesize images without considering their depth values, it may lead to unwanted consequences. When we segment an image into its background and foreground regions using Grabcut, we take into account their geometric positions to mix color tones; thus, the image is composited more naturally. After synthesizing images, we apply a blurring operation around object boundaries; then, the foreground object and background are composited more seamlessly. In addition, we can adjust the distance of the object by setting arbitrary depth values and generating right color and depth images automatically. Experimental results show that the proposed stereo image composition method provides naturally synthesized stereo images. Improved portions were subjectively confirmed as well.

Object Detection Method in Sea Environment Using Fast Region Merge Algorithm (해양환경에서 고속 영역 병합 알고리즘을 이용한 물표 탐지 기법)

  • Jeong, Jong-Myeon;Park, Gyei-Kark
    • Journal of the Korean Institute of Intelligent Systems
    • /
    • v.22 no.5
    • /
    • pp.610-616
    • /
    • 2012
  • In this paper, we present a method to detect an object such as ship, rock and buoy from sea IR image for the safety navigation. To this end, we do the image smoothing first and the apply watershed algorithm to segment image into subregions. Since watershed algorithm almost always produces over-segmented regions, it requires posterior merging process to get meaningful segmented regions. We propose an efficient merger algorithm that requires only two times of direct access to the pixels regardless of the number of regions. Also by analyzing IR image obtained from sea environments, we could find out that most horizontal edge come out from object regions. For the given input IR image we extract horizontal edge and eliminate isolated edges produced from background and noises by adopting morphological operator. Among the segmented regions, the regions that have horizontal edges are extracted as final results. Experimental results show the adequacy of the proposed method.

Dynamic Gesture Recognition for the Remote Camera Robot Control (원격 카메라 로봇 제어를 위한 동적 제스처 인식)

  • Lee Ju-Won;Lee Byung-Ro
    • Journal of the Korea Institute of Information and Communication Engineering
    • /
    • v.8 no.7
    • /
    • pp.1480-1487
    • /
    • 2004
  • This study is proposed the novel gesture recognition method for the remote camera robot control. To recognize the dynamics gesture, the preprocessing step is the image segmentation. The conventional methods for the effectively object segmentation has need a lot of the cole. information about the object(hand) image. And these methods in the recognition step have need a lot of the features with the each object. To improve the problems of the conventional methods, this study proposed the novel method to recognize the dynamic hand gesture such as the MMS(Max-Min Search) method to segment the object image, MSM(Mean Space Mapping) method and COG(Conte. Of Gravity) method to extract the features of image, and the structure of recognition MLPNN(Multi Layer Perceptron Neural Network) to recognize the dynamic gestures. In the results of experiment, the recognition rate of the proposed method appeared more than 90[%], and this result is shown that is available by HCI(Human Computer Interface) device for .emote robot control.

Design and Implementation of a Speech Synthesis Engine and a Plug-in for Internet Web Page (인터넷 웹페이지의 음성합성을 위한 엔진 및 플러그-인 설계 및 구현)

  • Lee, Hee-Man;Kim, Ji-Yeong
    • The Transactions of the Korea Information Processing Society
    • /
    • v.7 no.2
    • /
    • pp.461-469
    • /
    • 2000
  • In the paper, the design and the implementation of the netscape plug-in and the speech synthesis enginegenerating the speech sounds from the text information of the web pages are described. The steps of the generating speech sound from an web pages are the speech synthesis plug-in is activated when the netscape finds the audio/xesp MIME data type embedded in the browsed web page; the HTML file referenced in the EMBED MTML tag is down loaded from the referenced URL to send to the commander object located in the said plug-in; The speech synthesis engine control tags and the text characters are extracted from the down loaded HTML document by the commander object the synthesized speech sounds are generated by the speech synthesis engine. The speech synthesis engine interprets the command streams from the commander objects to call the member functions for the processing of the speech segment data in the data banks. The commander object and the speech synthesis engine are designed as an independent object to enhancethe flexitility and the portability.

  • PDF

A Study on Video Object Segmentation using Nonlinear Multiscale Filtering (비선형 다중스케일 필터링을 사용한 비디오 객체 분할에 관한 연구)

  • 이웅희;김태희;이규동;정동석
    • The Journal of Korean Institute of Communications and Information Sciences
    • /
    • v.28 no.10C
    • /
    • pp.1023-1032
    • /
    • 2003
  • Object-based coding, such as MPEG-4, enables various content-based functionalities for multimedia applications. In order to support such functionalities, as well as to improve coding efficiency, each frame of video sequences should be segmented into video objects. In this paper. we propose an effective video object segmentation method using nonlinear multiscale filtering and spatio-temporal information. Proposed method performs a spatial segmentation using a nonlinear multiscale filtering based on the stabilized inverse diffusion equation(SIDE). And, the segmented regions are merged using region adjacency graph(RAG). In this paper, we use a statistical significance test and a time-variant memory as temporal segmentation methods. By combining of extracted spatial and temporal segmentations, we can segment the video objects effectively. Proposed method is more robust to noise than the existing watershed algorithm. Experimental result shows that the proposed method improves a boundary accuracy ratio by 43% on "Akiyo" and by 29% on "Claire" than A. Neri's Method does.

Segmentation Foundation Model-based Automated Yard Management Algorithm (의미론적 분할 기반 모델을 이용한 조선소 사외 적치장 객체 자동 관리 기술)

  • Mingyu Jeong;Jeonghyun Noh;Janghyun Kim;Seongheon Ha;Taeseon Kang;Byounghak Lee;Kiryong Kang;Junhyeon Kim;Jinsun Park
    • Smart Media Journal
    • /
    • v.13 no.2
    • /
    • pp.52-61
    • /
    • 2024
  • In the shipyard, aerial images are acquired at regular intervals using Unmanned Aerial Vehicles (UAVs) for the management of external storage yards. These images are then investigated by humans to manage the status of the storage yards. This method requires a significant amount of time and manpower especially for large areas. In this paper, we propose an automated management technology based on a semantic segmentation foundation model to address these challenges and accurately assess the status of external storage yards. In addition, as there is insufficient publicly available dataset for external storage yards, we collected a small-scale dataset for external storage yards objects and equipment. Using this dataset, we fine-tune an object detector and extract initial object candidates. They are utilized as prompts for the Segment Anything Model(SAM) to obtain precise semantic segmentation results. Furthermore, to facilitate continuous storage yards dataset collection, we propose a training data generation pipeline using SAM. Our proposed method has achieved 4.00%p higher performance compared to those of previous semantic segmentation methods on average. Specifically, our method has achieved 5.08% higher performance than that of SegFormer.

Experimental Analysis of Algorithms of Splitting and Connecting Snake for Extracting of the Boundary of Multiple Objects (복수객체의 윤곽추출을 위한 스네이크 분리 및 연결 알고리즘의 실험적 분석)

  • Cui, Guo;Hwang, Jae-Yong;Jang, Jong-Whan
    • The KIPS Transactions:PartB
    • /
    • v.19B no.4
    • /
    • pp.221-224
    • /
    • 2012
  • The most famous algorithm of splitting and connecting Snake for extracting the boundary of multiple objects is the nearest method using the distance between snake points. It often can't split and connect Snake due to object topology. In this paper, its problem was discussed experimentally. The new algorithm using vector between Snake segment is proposed in order to split and connect Snake with complicated topology of objects. It is shown by experiment of two test images with 3 and 5 objects that the proposed one works better than the nearest one.

Language (Meaning) and Cognitive Science (언어(특히 의미)와 인지과학)

  • Lee, Chung-Min
    • Proceedings of the Korean Society for Cognitive Science Conference
    • /
    • 2005.05a
    • /
    • pp.23-27
    • /
    • 2005
  • Humans perceptually segment events, but models that predict where events will be segmented are limited. Developing a detailed model may be hard because of the overlapping quality of events (i.e., one can smile and walk at the same time, but the endpoint of each event can be different). However, some aspects of events appear to be universally represented in the world's languages. For example, path, the trajectory of an object's movement, is one of the most universally encoded event features. Although it is generally encoded in the prepositions of English (e.g., up), in other languagesit is encoded in the verbs (e.g., descendere). Linguistic universals may represent basic levels of event perception. Here we consider how one of these, path, might be parsed. Because the spatiotemporal projection of paths to an observation point is similar to the spatial projection of objects, we tested the hypothesis that path segmentation and object segmentation would be based on similar image properties, such as discontinuities in orientation.

  • PDF

A study of Postscript Converter using XSL-FO (XSL-FO를 이용한 PostScript Converter에 관한 연구)

  • 유동석;최호찬;이진영;김차종
    • Proceedings of the Korean Institute of Information and Commucation Sciences Conference
    • /
    • 2003.10a
    • /
    • pp.109-112
    • /
    • 2003
  • Web documents specified with HTML and CSS is displayed with high quality on the web browser. However, basically, any printed pages don't have the same quality of softcopy. The reason is HTML and CSS is not suit for printing. The XSL-FO(XSL-Formatting Object) is Formatting Language for imaging of web document and The PostScript is one of the most famous PDL(Page Description Language). To get high-quality pages, we propose the design of converter which translate XML-FO into PostScript format. Using the designed converter, we can get hardcopies with high quality.

  • PDF

Segmentation of Computed Tomography using The Geometric Active Contour Model (기하학적 동적 외곽선 모델을 이용한 X-ray 단층촬영영상의 영상추출)

  • Jang, D.P.;Kim, S.I.
    • Proceedings of the KOSOMBE Conference
    • /
    • v.1997 no.11
    • /
    • pp.541-545
    • /
    • 1997
  • This paper presents a modified geometric active contour model or edge detection and segmentation of computed tomography(CT) scan images. The method is based on the level setup approach developed by Osher and Sethian and the modeling of propagation fronts with curvature dependent speeds by Malladi. Based on above algorithms, the geometric active contour is obtained through a particular level set of hypersurface lowing along its gradient force and curvature force. This technique retains the attractive feature which is topological and geometric flexibility of the contour in recovering objects with complex shapes and unknown topologies. But there are limitations in this algorithm which are being not able to separate the object with weak difference from neighbor object. So we use speed limitation filter to overcome those problems. We apply a 2D model to various synthetic cases and the three cases of real CT scan images in order to segment objects with complicated shapes and topologies. From the results, the presented model confirms that it attracts very naturally and efficiently to the desired feature of CT scan images.

  • PDF