• Title/Summary/Keyword: Sketch Recognition

Search Result 26, Processing Time 0.023 seconds

Recent advances in sketch based image retrieval: a survey (스케치 기반 이미지 검색의 최신 연구 동향)

  • Sehong Oh;Ho-Sik Seok
    • Journal of IKEEE
    • /
    • v.28 no.2
    • /
    • pp.209-220
    • /
    • 2024
  • A sketch is an intuitive means to express information, but compared to actual images, it has the problem of being highly abstract, diverse, and sparse. Recent advances in deep learning models have made it possible to discover features that are common to images and sketches. In this paper, we summarize recent trends in sketch-based image retrieval (SBIR) but it is not limited to SBIR. Besides SBIR, we also introduce sketch-based image recognition and generation studies. Zero-shot learning enables models to recognize categories not encountered during training. Zero-shot SBIR methods are also discussed. Commonly used free-hand sketch datasets are summarized and retrieval performance based on these datasets is reported.

Autism Spectrum Disorder Recognition with Deep Learning

  • Shin, Jongmin;Choi, Jinwoo
    • Proceedings of the Korean Society of Broadcast Engineers Conference
    • /
    • 2022.06a
    • /
    • pp.1268-1271
    • /
    • 2022
  • Since it is common to have touch-screen devices, it is less challenging to draw sketches anywhere and save them in vector form. Current research on sketches considers coordinate sequence data and adopts sequential models for learning sketch representation in sketch understanding. In the sketch dataset, it has become customary that the dataset is in vector coordinate format. Moreover, the popular dataset does not consider real-life sketches, sketches from pencil, pen, and paper. Art psychology uses real-life sketches to analyze patients. ETRI presents a unique sketch dataset for sketch recognition of autism spectrum disorder in pixel format. We present a method to formulate the dataset for better generalization of sketch data. Through experiments, we show that pixel-based models can produce a good performance.

  • PDF

Sketch Recognition Using LSTM with Attention Mechanism and Minimum Cost Flow Algorithm

  • Nguyen-Xuan, Bac;Lee, Guee-Sang
    • International Journal of Contents
    • /
    • v.15 no.4
    • /
    • pp.8-15
    • /
    • 2019
  • This paper presents a solution of the 'Quick, Draw! Doodle Recognition Challenge' hosted by Google. Doodles are drawings comprised of concrete representational meaning or abstract lines creatively expressed by individuals. In this challenge, a doodle is presented as a sequence of sketches. From the view of at the sketch level, to learn the pattern of strokes representing a doodle, we propose a sequential model stacked with multiple convolution layers and Long Short-Term Memory (LSTM) cells following the attention mechanism [15]. From the view at the image level, we use multiple models pre-trained on ImageNet to recognize the doodle. Finally, an ensemble and a post-processing method using the minimum cost flow algorithm are introduced to combine multiple models in achieving better results. In this challenge, our solutions garnered 11th place among 1,316 teams. Our performance was 0.95037 MAP@3, only 0.4% lower than the winner. It demonstrates that our method is very competitive. The source code for this competition is published at: https://github.com/ngxbac/Kaggle-QuickDraw.

A Study on the Spatial Cognition Characteristics at Minority Traditional Village of Chengzi in Yunnan Province of China (중국 윈난성(云南省) 소수민족 전통마을 청쯔고촌(城子古村)의 공간 인지 특성 연구)

  • Son, Young-Rim;Lee, In-Hee;Yoo, Jae-Woo
    • Journal of the Architectural Institute of Korea Planning & Design
    • /
    • v.35 no.9
    • /
    • pp.101-108
    • /
    • 2019
  • Chinese ethnic minorities are inheriting their own traditions based on thousands of years of community life. Yunnan province in china is a castle in which many ethnic minorities have been living on the basis of various natural environments. Their traditional village can be regarded as a place reflecting minorities' thousands year of history and culture, and elements of positive social spaces are seen from the old village. Streets and places of the village are accumulated as images for residents. Based on their imagination-concept, sketch maps, reflecting residents' cognitive perception were collected. Analysis of 21 sketch maps shows that architectural elements, forming a unique landscape and community life contribute to establish a unity of one nation. the oldest tree in the village has a strong specificity as a place with the belief that the tree protects all residents in the village. Space in the head of the residents and Social spaces, embedded in the memories of the residents living in the community continued organically and the roads of the village showed clear recognition. Following this, the analysis methodology of social spaces and sketch will be examined in depth.

Handwriting and Voice Input using Transparent Input Overlay (투명한 입력오버레이를 이용한 필기 및 음성 입력)

  • Kim, Dae-Hyun;Kim, Myoung-Jun;Lee, Zin-O
    • Journal of KIISE:Software and Applications
    • /
    • v.35 no.4
    • /
    • pp.245-254
    • /
    • 2008
  • This paper proposes a unified multi-modal input framework to interface the recognition engines such as IBM ViaVoice and Microsoft handwriting-recognition system with general window applications, particularly, for pen-input displays. As soon as user pushes a hardware button attached to the pin-input display with one hand, the current window of focus such as a internet search window and a word processor is overlaid with a transparent window covering the whole desktop; upon which user inputs handwriting with the other hand, without losing the focus of attention on working context. As well as freeform handwriting on this transparent input overlay as a sketch pad, the user can dictate some words and draw diagrams to communicate with the system.

Web-based 3D Object Retrieval from User-drawn Sketch Query (스케치를 이용한 웹 환경에서의 3차원 모델 검색)

  • Song, Jonghun;Ju, Jae Ho;Yoon, Sang Min
    • Journal of KIISE
    • /
    • v.41 no.10
    • /
    • pp.838-846
    • /
    • 2014
  • Three-dimensional (3D) object retrieval from user-drawn sketch queries is one of the important research issues in the areas of pattern recognition and computer graphics for simulation, visualization, and Computer Aided Design. The performance of content-based 3D object retrieval system depends on the availability of effective descriptors and similarity measures for this kind of data. In this paper, we present a sketch-based 3D object retrieval system by extracting a hybrid edge descriptor which is robust against rotation and translation. The experimental results which are based on HTML5 and WebGL show that proposed sketch-based 3D object retrieval method is very efficient to search and order the 3D objects according to user's intention.

An Efficient Feature Point Detection for Interactive Pen-Input Display Applications (인터액티브 펜-입력 디스플레이 애플리케이션을 위한 효과적인 특징점 추출법)

  • Kim Dae-Hyun;Kim Myoung-Jun
    • Journal of KIISE:Computer Systems and Theory
    • /
    • v.32 no.11_12
    • /
    • pp.705-716
    • /
    • 2005
  • There exist many feature point detection algorithms that developed in pattern recognition research . However, interactive applications for the pen-input displays such as Tablet PCs and LCD tablets have set different goals; reliable segmentation for different drawing styles and real-time on-the-fly fieature point defection. This paper presents a curvature estimation method crucial for segmenting freeHand pen input. It considers only local shape descriptors, thus, peforming a novel curvature estimation on-the-fly while drawing on a pen-input display This has been used for pen marking recognition to build a 3D sketch-based modeling application.

Voice Driven Sound Sketch for Animation Authoring Tools (애니메이션 저작도구를 위한 음성 기반 음향 스케치)

  • Kwon, Soon-Il
    • The Journal of the Korea Contents Association
    • /
    • v.10 no.4
    • /
    • pp.1-9
    • /
    • 2010
  • Authoring tools for sketching the motion of characters to be animated have been studied. However the natural interface for sound editing has not been sufficiently studied. In this paper, I present a novel method that sound sample is selected by speaking sound-imitation words(onomatopoeia). Experiment with the method based on statistical models, which is generally used for pattern recognition, showed up to 97% in the accuracy of recognition. In addition, to address the difficulty of data collection for newly enrolled sound samples, the GLR Test based on only one sample of each sound-imitation word showed almost the same accuracy as the previous method.

A study on the recognition of the dashboard in forklift (지게차 계기판의 인지성 평가에 관한 연구)

  • Choi Jin-Bong;Yun Yong-Gu;Jeong Myeong-Cheol;Park Beom
    • Proceedings of the Korean Operations and Management Science Society Conference
    • /
    • 2006.05a
    • /
    • pp.219-225
    • /
    • 2006
  • This paper studies on the visibility of dashboard in forklift. As part of the real setting devised for this study, 1. Important evaluation by males experience in forklift driving, 2. Icon cognition experiment, 3. Gage cognition experiment, subjects were asked to estimate the important evaluation, sketched to icon and gage position on the screen. Subjective evaluations were carried out by semantic differential method, sketch method, sketch method, then analyzed by consistency test, frequency rate and T-test. I gather the results concerning the relationship between consistent answers and cognition rates of dashboard understand the conditions which create a desired instrument panel.

  • PDF

Method of Generating Digital Drawing through Sketch Recognition (스케치 인식을 통한 디지털 도면 생성 기법)

  • Oh, Soohyun;Lee, Seongjin
    • Proceedings of the Korean Society of Computer Information Conference
    • /
    • 2019.07a
    • /
    • pp.91-94
    • /
    • 2019
  • 스케치를 거쳐 생성되는 디지털 자료로 건축도면이나 제품 디자인시안 등은 수요가 많음에도 불구하고 디지털 도면 자동생성에 대한 영상처리는 아직 연구되지 않고 있다. 현행 필기인식에 대한 영상처리 연구는 주로 글자나 숫자에 국한되어 있어 본 연구에서는 선으로 이루어진 필기를 인식하여 도면이라는 이진영상의 특징을 이용해 특징점을 도출하고 디지털 도면을 생성하는 영상처리를 제안한다. 먼저 입력받은 아날로그 스캔이미지를 메디안블러링과 OSTU임계처리로 노이즈가 없는 이진영상으로 변환한 후 해리스코너검출기를 이용하여 특징점을 검출하고 좌표를 추출하고, 좌표값을 활용해 외곽선과 내부윤곽선까지 구현하여 디지털도면을 양산한다.

  • PDF