• Title/Summary/Keyword: image understanding

Search Result 1,068, Processing Time 0.026 seconds

Ontology-based Image Understanding Systems (온톨로지 기반 영상이해 시스템)

  • Lee, In-K.;Seo, Suk-T.;Jeong, Hye-C.;Son, Seo-H.;Kwon, Soon-H.
    • Journal of the Korean Institute of Intelligent Systems
    • /
    • v.17 no.3
    • /
    • pp.328-335
    • /
    • 2007
  • Ontology is represented by the shared concepts and relations among those. Many studies have been actively working on sharing human's knowledge with that of systems by using it. For a typical example, there is the design and implementation of ontology system for image understanding. However conventional studies on ontology-based image understanding have proposed not concrete methods but conceptual idea. In this paper, we propose an ontology-based image understanding system with following four processes: i)knowledge representation of a specific domain by the ontology, ii)feature extraction of objects through image processing and image analysis, iii)image interpretation by object features, and iv)reduction of ambiguity existing in image interpretation by ontology reasoning. We implement an image understanding system based on the proposed processed, and show the effectiveness of the proposed system from experimental results in a specific domain.

Improving visual relationship detection using linguistic and spatial cues

  • Jung, Jaewon;Park, Jongyoul
    • ETRI Journal
    • /
    • v.42 no.3
    • /
    • pp.399-410
    • /
    • 2020
  • Detecting visual relationships in an image is important in an image understanding task. It enables higher image understanding tasks, that is, predicting the next scene and understanding what occurs in an image. A visual relationship comprises of a subject, a predicate, and an object, and is related to visual, language, and spatial cues. The predicate explains the relationship between the subject and object and can be categorized into different categories such as prepositions and verbs. A large visual gap exists although the visual relationship is included in the same predicate. This study improves upon a previous study (that uses language cues using two losses) and a spatial cue (that only includes individual information) by adding relative information on the subject and object of the extant study. The architectural limitation is demonstrated and is overcome to detect all zero-shot visual relationships. A new problem is discovered, and an explanation of how it decreases performance is provided. The experiment is conducted on the VRD and VG datasets and a significant improvement over previous results is obtained.

Framework for Ontological Knowledge-based Image Understanding Systems (Ontological 지식 기반 영상이해시스템의 구조)

  • 손세호;이인근;권순학
    • Proceedings of the Korean Institute of Intelligent Systems Conference
    • /
    • 2004.04a
    • /
    • pp.235-240
    • /
    • 2004
  • In this paper, we propose a framework for ontological knowledge-based image understanding systems. Ontology composed of concepts can be used as a guide for describing objects from a specific domain of interest and describing relations between objects from different domains The proposed framework consists of four main subparts ⅰ) ontological knowledge bases, ⅱ) primitive feature detectors, ⅲ) concept inference engine, and ⅳ) semantic inference engine. Using ontological knowledge bases on various domains and features extracted from the detectors, concept inference engine infers concepts on regions of interest in an image and semantic inference engine reasons semantic situations between concepts from different domains. We present a outline for ontological knowledge-based image understanding systems and application examples within specific domains such as text recognition and human recognition in order to show the validity of the proposed system.

  • PDF

A Study on the Verbal Image of Interior Decoration Trend from the Year 2000 (2000년 이후 인테리어 데코레이션 트랜드의 언어심상에 관한 연구)

  • Kim, Joo-Yun;Han, Hyo-Jung;Lee, Hye-Kyung
    • Korean Institute of Interior Design Journal
    • /
    • v.15 no.6 s.59
    • /
    • pp.238-246
    • /
    • 2006
  • Recent trends of interior design have a focus on creation of more various meanings rather than past ideology which sought after the compatibility to the function of modem design. These trends requires integral understanding of social and cultural ideologies with a sens of values for a certain periods. In addition, they also require creativity which able to read, find and solve consumer's diverse demand and desire. Considering the effort of trend forecasting in Korea is still heavily rely on the foreign trend shows, it is natural to attempt to study the analytical forecasting methodology based upon more systematic principles which lead to more objective outcome, when the understanding, forcasting and analysis of interior decoration trend are required. In this thesis, the analysis and forecasting of interior decoration trend are studied by means of verbal image code process which involves the induction of design concept through data extraction, classification and analysis, in order to understanding and satisfying the diversified consumer's demand and trend. The coding process of verbal image is understanding as general concept. by extracting common elements from abstract and individual image, and/or specific concept. Therefore, it is proposed that the database building and data mining process of verbal Image, and subsequent development of programming skill can be applied as more efficient tool for various verbal image process.

An Intelligence Image Compression System through Image Understanding (영상 이해를 통한 지능형 영상압축 시스템)

  • Kim, Jin-Hyung
    • Journal of the Korean Institute of Telematics and Electronics
    • /
    • v.24 no.6
    • /
    • pp.961-968
    • /
    • 1987
  • This paper describes an intelligent image compression system called AIIC which is capable of adjusting image compression ratios ranging from 1:1 to 12,000:1 depending on available bandwidth. This system utilizes not only conventional image compression algorithms but also intelligent techniques through understanding image contents to achieve ultra-high compression ratios. This system was simulated on a micro-computer network.

  • PDF

A Practical Digital Video Database based on Language and Image Analysis

  • Liang, Yiqing
    • Proceedings of the Korea Database Society Conference
    • /
    • 1997.10a
    • /
    • pp.24-48
    • /
    • 1997
  • . Supported byㆍDARPA′s image Understanding (IU) program under "Video Retrieval Based on Language and image Analysis" project.DARPA′s Computer Assisted Education and Training Initiative program (CAETI)ㆍObjective: Develop practical systems for automatic understanding and indexing of video sequences using both audio and video tracks(omitted)

  • PDF

Image Understanding for Visual Dialog

  • Cho, Yeongsu;Kim, Incheol
    • Journal of Information Processing Systems
    • /
    • v.15 no.5
    • /
    • pp.1171-1178
    • /
    • 2019
  • This study proposes a deep neural network model based on an encoder-decoder structure for visual dialogs. Ongoing linguistic understanding of the dialog history and context is important to generate correct answers to questions in visual dialogs followed by questions and answers regarding images. Nevertheless, in many cases, a visual understanding that can identify scenes or object attributes contained in images is beneficial. Hence, in the proposed model, by employing a separate person detector and an attribute recognizer in addition to visual features extracted from the entire input image at the encoding stage using a convolutional neural network, we emphasize attributes, such as gender, age, and dress concept of the people in the corresponding image and use them to generate answers. The results of the experiments conducted using VisDial v0.9, a large benchmark dataset, confirmed that the proposed model performed well.

Construction Site Scene Understanding: A 2D Image Segmentation and Classification

  • Kim, Hongjo;Park, Sungjae;Ha, Sooji;Kim, Hyoungkwan
    • International conference on construction engineering and project management
    • /
    • 2015.10a
    • /
    • pp.333-335
    • /
    • 2015
  • A computer vision-based scene recognition algorithm is proposed for monitoring construction sites. The system analyzes images acquired from a surveillance camera to separate regions and classify them as building, ground, and hole. Mean shift image segmentation algorithm is tested for separating meaningful regions of construction site images. The system would benefit current monitoring practices in that information extracted from images could embrace an environmental context.

  • PDF

Research about a game image 3D versification (3D 게임영상 작성법에 관한 연구)

  • Lee Dong-Lyeor
    • Journal of Game and Entertainment
    • /
    • v.1 no.1
    • /
    • pp.31-38
    • /
    • 2005
  • Correct flow of various game manufacture among the justice which is used at the game development. and The understanding about the manufacture regards we making rather correct game. We justice understanding which we are correct in the image manufacture to become the reason air control of the game and We put the center in a 3B game image manufacture understanding. we are marked in maneuvered the game in actual game good. The image of the back of Cut Scene which is inserted at an opeuning incomparableness event time, we have been produced in this method. The thing which a 3D game image is utilized in a special effectiveness image though it is different from the game in the theater movie, we are the graphic which a game manufacture o'clock must be considered. The reason air control which the game player Is rather correct, we are regarded we offer the reason to immerse with his game.

  • PDF

Using Context Information to Improve Retrieval Accuracy in Content-Based Image Retrieval Systems

  • Hejazi, Mahmoud R.;Woo, Woon-Tack;Ho, Yo-Sung
    • 한국HCI학회:학술대회논문집
    • /
    • 2006.02a
    • /
    • pp.926-930
    • /
    • 2006
  • Current image retrieval techniques have shortcomings that make it difficult to search for images based on a semantic understanding of what the image is about. Since an image is normally associated with multiple contexts (e.g. when and where a picture was taken,) the knowledge of these contexts can enhance the quantity of semantic understanding of an image. In this paper, we present a context-aware image retrieval system, which uses the context information to infer a kind of metadata for the captured images as well as images in different collections and databases. Experimental results show that using these kinds of information can not only significantly increase the retrieval accuracy in conventional content-based image retrieval systems but decrease the problems arise by manual annotation in text-based image retrieval systems as well.

  • PDF