• Title/Summary/Keyword: Multi-Scene

Search Result 187, Processing Time 0.026 seconds

Real Scene Text Image Super-Resolution Based on Multi-Scale and Attention Fusion

  • Xinhua Lu;Haihai Wei;Li Ma;Qingji Xue;Yonghui Fu
    • Journal of Information Processing Systems
    • /
    • v.19 no.4
    • /
    • pp.427-438
    • /
    • 2023
  • Plenty of works have indicated that single image super-resolution (SISR) models relying on synthetic datasets are difficult to be applied to real scene text image super-resolution (STISR) for its more complex degradation. The up-to-date dataset for realistic STISR is called TextZoom, while the current methods trained on this dataset have not considered the effect of multi-scale features of text images. In this paper, a multi-scale and attention fusion model for realistic STISR is proposed. The multi-scale learning mechanism is introduced to acquire sophisticated feature representations of text images; The spatial and channel attentions are introduced to capture the local information and inter-channel interaction information of text images; At last, this paper designs a multi-scale residual attention module by skillfully fusing multi-scale learning and attention mechanisms. The experiments on TextZoom demonstrate that the model proposed increases scene text recognition's (ASTER) average recognition accuracy by 1.2% compared to text super-resolution network.

A Study on the Multi-space Method in Fashion Illustration (현대 패션일러스트레이션의 다중공간 표현에 관한 연구)

  • Lee, Jee-Hyun
    • Journal of the Korean Society of Clothing and Textiles
    • /
    • v.33 no.4
    • /
    • pp.644-654
    • /
    • 2009
  • The purpose of this study is to analyze the characteristics of current fashion illustrations within the framework of Multi-space method. Multi-space means being piled up one moment & space on others, and being amassed in a scene. This method is related with Dadaism, Surrealism and Postmodernism, and also influences on the current fashion illustration. In this study, the types of Multi-space method could be classified into 4 types; Repetitive Time Mixture in Multi-space, Juxtaposed Time Mixture in Multi-space, Reiterated Space Mixture in Multi-space, Projected Space Mixture in Multi-space. The characteristics of Multi-space were analyzed and the results are as followed. The distinctive methods for Time Mixture in Multi-space are repetition and juxtaposition in a scene. Time Mixture in Multi-space can make the nonlinear narration and unreal illusory space in fashion illustrations more effectively. Reiterated Space Mixture in Multi-space can be related with the heterogeneous, surrealistic illusions in current fashion illustrations. Projected Space Mixture in Multi-space can be characterized into inter-penetration. It can derive spectators to mix the projected & transparent images in a scene for their own imaginary stories. The final imagination can be made differently according to the personal experiences of spectators.

PROPAGATION OF MULTI-LEVEL CUES WITH ADAPTIVE CONFIDENCE FOR BILAYER SEGMENTATION OF CONSISTENT SCENE IMAGES

  • Lee, Soo-Chahn;Yun, Il-Dong;Lee, Sang-Uk
    • Proceedings of the Korean Society of Broadcast Engineers Conference
    • /
    • 2009.01a
    • /
    • pp.148-153
    • /
    • 2009
  • Few methods have dealt with segmenting multiple images with analogous content. Concurrent images of a scene and gathered images of a similar foreground are examples of these images, which we term consistent scene images. In this paper, we present a method to segment these images based on manual segmentation of one image, by iteratively propagating information via multi-level cues with adaptive confidence. The cues are classified as low-, mid-, and high- levels based on whether they pertain to pixels, patches, and shapes. Propagated cues are used to compute potentials in an MRF framework, and segmentation is done by energy minimization. Through this process, the proposed method attempts to maximize the amount of extracted information and maximize the consistency of segmentation. We demonstrate the effectiveness of the proposed method on several sets of consistent scene images and provide a comparison with results based only on mid-level cues [1].

  • PDF

Co-saliency Detection Based on Superpixel Matching and Cellular Automata

  • Zhang, Zhaofeng;Wu, Zemin;Jiang, Qingzhu;Du, Lin;Hu, Lei
    • KSII Transactions on Internet and Information Systems (TIIS)
    • /
    • v.11 no.5
    • /
    • pp.2576-2589
    • /
    • 2017
  • Co-saliency detection is a task of detecting same or similar objects in multi-scene, and has been an important preprocessing step for multi-scene image processing. However existing methods lack efficiency to match similar areas from different images. In addition, they are confined to single image detection without a unified framework to calculate co-saliency. In this paper, we propose a novel model called Superpixel Matching-Cellular Automata (SMCA). We use Hausdorff distance adjacent superpixel sets instead of single superpixel since the feature matching accuracy of single superpixel is poor. We further introduce Cellular Automata to exploit the intrinsic relevance of similar regions through interactions with neighbors in multi-scene. Extensive evaluations show that the SMCA model achieves leading performance compared to state-of-the-art methods on both efficiency and accuracy.

The Method of Multi-screen Service using Scene Composition Technology based on HTML5 (HTML5 기반 장면구성 기술을 통한 멀티스크린 서비스 제공 방법)

  • Jo, Minwoo;Kim, Kyuheon
    • Journal of Broadcast Engineering
    • /
    • v.18 no.6
    • /
    • pp.895-910
    • /
    • 2013
  • Multi-screen service is a service that consumes more than one media in a number of terminals simultaneously or discriminately. This multi-screen service has become useful due to distribute of smart TV and terminals. Also, in case of hybrid broadcasting environment that is convergence of broadcasting and communication environment, it is able to provide various user experience through contents consumed by multiple screens. In hybrid broadcasting environment, scene composition technology can be used as an element technology for multi-screen service. Using scene composition technology, multiple media can be consumed complexly through the specified presentation time and space. Thus, multi-screen service based on the scene composition technology can provide spatial and temporal control and consumption of multiple media by linkage between the terminals. However, existing scene composition technologies are not able to use easily in hybrid broadcasting because of applicable environmental constraints, the difficulty in applying the various terminal and complexity. For this problems, HTML5 can be considered. HTML5 is expected to be applied in various smart terminals commonly, and provides consumption of diverse media. So, in this paper, it proposes the scene composition and multi-screen service technology based on HTML5 that is expected be used in various smart terminals providing hybrid broadcasting environment. For this, it includes the introduction in terms of HTML5 and multi-screen service, the method of providing information related with scene composition and multi-screen service through the extention of elements and attributes in HTML5, media signaling between terminals and the method of synchronization. In addition, the proposed scene composition and multi-screen service technology based on HTML5 was verified through the implementation and experiment.

Image Scene Classification of Multiclass (다중 클래스의 이미지 장면 분류)

  • Shin, Seong-Yoon;Lee, Hyun-Chang;Shin, Kwang-Seong;Kim, Hyung-Jin;Lee, Jae-Wan
    • Proceedings of the Korean Institute of Information and Commucation Sciences Conference
    • /
    • 2021.10a
    • /
    • pp.551-552
    • /
    • 2021
  • In this paper, we present a multi-class image scene classification method based on transformation learning. ImageNet classifies multiple classes of natural scene images by relying on pre-trained network models on large image datasets. In the experiment, we obtained excellent results by classifying the optimized ResNet model on Kaggle's Intel Image Classification data set.

  • PDF

Land Cover Classification of a Wide Area through Multi-Scene Landsat Processing (다량의 Landsat 위성영상 처리를 통한 광역 토지피복분류)

  • 박성미;임정호;사공호상
    • Korean Journal of Remote Sensing
    • /
    • v.17 no.3
    • /
    • pp.189-197
    • /
    • 2001
  • Generally, remote sensing is useful to obtain the quantitative and qualitative information of a wide area. For monitoring earth resources and environment, land cover classification of remotely sensed data are needed over increasingly larger area. The objective this study is to propose the process for land cover classification method over a wide area using multi-scene satellite data. Land cover of Korean peninsula was extracted from a Landsat TM and ETM+ mosaic created from 23 scenes at 100-meter resolution. Well-known techniques that used to general image processing and classification are applied to this wide area classification. It is expected that these process is very useful to promptly and efficiently grasp of small scale spatial information such as national territorial information.

A Study on the Rhetorical Expression of Scene Design in Theatre - Focused on the Case Study of the Scene Design of King Lear - (연극 무대 공간디자인에 대한 수사학적 연구 - 세익스피어 작 "리어왕"의 무대 공간디자인 사례연구를 중심으로 -)

  • Ahn, Ju-Young
    • Korean Institute of Interior Design Journal
    • /
    • v.16 no.3 s.62
    • /
    • pp.21-29
    • /
    • 2007
  • The communication in a play has a dual structure, which is composed of the communications within a play and in a theatre space. This research focuses on the scene design that creates the background or the theme of a play and communicates theme of a play to the audience. A scene design of a theatre has meta-linguistic aspects, which is the image or mood as the theme of a play. The scene design is composed of various design elements of space and objects as the properties of a play. Design elements and the objects are the design languages in various form; plane, three-dimensional, multi-dimensional form. These design languages have the significant meanings as signs like human language. The play works chooses the rhetorical expression to arouse the audience's sympathy. The scene design is completed with rhetorical expression for communication in theatre too. This research defines the category of meanings that design elements of scene design can create and the rhetorical expression of the scene design language. King Lear directed by Robert G. Anderson was analyzed with the category of design elements as a sign and the pattern of the rhetorical expression. The scene design for a play is completed effectively by the rhetorical expression of design elements as the design language for the communication with the audience in theatre.

Multiple Color and ToF Camera System for 3D Contents Generation

  • Ho, Yo-Sung
    • IEIE Transactions on Smart Processing and Computing
    • /
    • v.6 no.3
    • /
    • pp.175-182
    • /
    • 2017
  • In this paper, we present a multi-depth generation method using a time-of-flight (ToF) fusion camera system. Multi-view color cameras in the parallel type and ToF depth sensors are used for 3D scene capturing. Although each ToF depth sensor can measure the depth information of the scene in real-time, it has several problems to overcome. Therefore, after we capture low-resolution depth images by ToF depth sensors, we perform a post-processing to solve the problems. Then, the depth information of the depth sensor is warped to color image positions and used as initial disparity values. In addition, the warped depth data is used to generate a depth-discontinuity map for efficient stereo matching. By applying the stereo matching using belief propagation with the depth-discontinuity map and the initial disparity information, we have obtained more accurate and stable multi-view disparity maps in reduced time.

A Study on the Rhetorical Expression of Scene Design in Theatre - Focused on the Analysis of Scene Design of "King Lear" - (연극 무대 공간 디자인의 수사학적 연구 - 세익스피어 작 "리어왕"의 무대 공간 디자인 사례분석을 중심으로 -)

  • Ahn, Ju-Young
    • Proceedings of the Korean Institute of Interior Design Conference
    • /
    • 2007.05a
    • /
    • pp.176-180
    • /
    • 2007
  • The stage space in theatre for the performance of a play requires two aspects of the physical space for a play and the image as the background or the theme of a play. This research focuses on the scene design that creates the background or the theme of a play and communicates it to the audience. The scene design is composed of various design elements of space and objects as the properties of a play. Design elements and the objects are the design languages in various form; plane, three-dimensional, multi-dimensional form. These design language have the significant meaning as a sign like human language. The scene design is completed with rhetorical expression for communication in theatre. This research defines the category of meaning that design elements of scene design can create and the rhetorical expression of the scene design language. King Lear directed by Robert G. Anderson was analyzed with the category of design elements as a sign and the pattern of the rhetorical expression. The scene design for a play is completed effectively by the rhetorical expression of design elements as the design language for communication with the audience in theatre.

  • PDF