• Title/Summary/Keyword: Attention Region

Search Result 596, Processing Time 0.023 seconds

Window Attention Module Based Transformer for Image Classification (윈도우 주의 모듈 기반 트랜스포머를 활용한 이미지 분류 방법)

  • Kim, Sanghoon;Kim, Wonjun
    • Journal of Broadcast Engineering
    • /
    • v.27 no.4
    • /
    • pp.538-547
    • /
    • 2022
  • Recently introduced image classification methods using Transformers show remarkable performance improvements over conventional neural network-based methods. In order to effectively consider regional features, research has been actively conducted on how to apply transformers by dividing image areas into multiple window areas, but learning of inter-window relationships is still insufficient. In this paper, to overcome this problem, we propose a transformer structure that can reflect the relationship between windows in learning. The proposed method computes the importance of each window region through compression and a fully connected layer based on self-attention operations for each window region. The calculated importance is scaled to each window area as a learned weight of the relationship between the window areas to re-calibrate the feature value. Experimental results show that the proposed method can effectively improve the performance of existing transformer-based methods.

Perception based video anticipation generation (선택적 주의 기법 기반의 영상의 기대효과 자동생성)

  • Yoon, Jong-Chul;Lee, In-Kwon
    • Journal of the Korea Computer Graphics Society
    • /
    • v.13 no.3
    • /
    • pp.1-6
    • /
    • 2007
  • Anticipation effect has been used as a traditional skill to enhance the dynamic motion of the traditional 2D animation. Basically, anticipation means the action of opposite direction which performs before the real action step. In this paper, we propose the perception-based video anticipation method to guide a user's visual attention to the important region. Using the image based attention map, we calculate the visual attention region and then combine this map with temporal saliency of video. We apply the anticipation effect in these saliency regions using the blur kernel. Using our method, we can generate the dynamic video motion which has attentive guidance.

  • PDF

A Saliency-Based Focusing Region Selection Method for Robust Auto-Focusing

  • Jeon, Jaehwan;Cho, Changhun;Paik, Joonki
    • IEIE Transactions on Smart Processing and Computing
    • /
    • v.1 no.3
    • /
    • pp.133-142
    • /
    • 2012
  • This paper presents a salient region detection algorithm for auto-focusing based on the characteristics of a human's visual attention. To describe the saliency at the local, regional, and global levels, this paper proposes a set of novel features including multi-scale local contrast, variance, center-surround entropy, and closeness to the center. Those features are then prioritized to produce a saliency map. The major advantage of the proposed approach is twofold; i) robustness to changes in focus and ii) low computational complexity. The experimental results showed that the proposed method outperforms the existing low-level feature-based methods in the sense of both robustness and accuracy for auto-focusing.

  • PDF

Implementation of Image Adaptive Map (적응적인 Saliency Map 모델 구현)

  • Park, Sang-Bum;Kim, Ki-Joong;Han, Young-Joon;Hahn, Hern-Soo
    • Journal of the Korean Society for Precision Engineering
    • /
    • v.25 no.2
    • /
    • pp.131-139
    • /
    • 2008
  • This paper presents a new saliency map which is constructed by providing dynamic weights on individual features in an input image to search ROI(Region Of Interest) or FOA(Focus Of Attention). To construct a saliency map on there is no a priori information, three feature-maps are constructed first which emphasize orientation, color, and intensity of individual pixels, respectively. From feature-maps, conspicuity maps are generated by using the It's algorithm and their information quantities are measured in terms of entropy. Final saliency map is constructed by summing the conspicuity maps weighted with their individual entropies. The prominency of the proposed algorithm has been proved by showing that the ROIs detected by the proposed algorithm in ten different images are similar with those selected by one-hundred person's naked eyes.

A New Covert Visual Attention System by Object-based Spatiotemporal Cues and Their Dynamic Fusioned Saliency Map (객체기반의 시공간 단서와 이들의 동적결합 된돌출맵에 의한 상향식 인공시각주의 시스템)

  • Cheoi, Kyungjoo
    • Journal of Korea Multimedia Society
    • /
    • v.18 no.4
    • /
    • pp.460-472
    • /
    • 2015
  • Most of previous visual attention system finds attention regions based on saliency map which is combined by multiple extracted features. The differences of these systems are in the methods of feature extraction and combination. This paper presents a new system which has an improvement in feature extraction method of color and motion, and in weight decision method of spatial and temporal features. Our system dynamically extracts one color which has the strongest response among two opponent colors, and detects the moving objects not moving pixels. As a combination method of spatial and temporal feature, the proposed system sets the weight dynamically by each features' relative activities. Comparative results show that our suggested feature extraction and integration method improved the detection rate of attention region.

Glioblastoma Multiforme in the Pineal Region with Leptomeningeal Dissemination and Lumbar Metastasis

  • Matsuda, Ryosuke;Hironaka, Yasuo;Suigimoto, Tadashi;Nakase, Hiroyuki
    • Journal of Korean Neurosurgical Society
    • /
    • v.58 no.5
    • /
    • pp.479-482
    • /
    • 2015
  • We report a case of a 31-year-old woman with glioblastoma multiforme (GBM) in the pineal region with associated leptomeningeal dissemination and lumbar metastasis. The patient presented with severe headache and vomiting. Magnetic resonance imaging (MRI) of the brain showed a heterogeneously enhanced tumor in the pineal region with obstructive hydrocephalus. After an urgent ventricular-peritoneal shunt, she was treated by subtotal resection and chemotherapy concomitant with radiotherapy. Two months after surgery, MRI showed no changes in the residual tumor but leptomeningeal dissemination surrounding the brainstem. One month later, she exhibited severe lumbago and bilateral leg pain. Thoracico-lumbar MRI showed drop like metastasis in the lumbar region. Finally she died five months after the initial diagnosis. Neurosurgeons should pay attention to GBM in the pineal region, not only as an important differential diagnosis among the pineal tumors, but due to the aggressive features of leptomeningeal dissemination and spinal metastasis.

Visual-Attention-Aware Progressive RoI Trick Mode Streaming in Interactive Panoramic Video Service

  • Seok, Joo Myoung;Lee, Yonghun
    • ETRI Journal
    • /
    • v.36 no.2
    • /
    • pp.253-263
    • /
    • 2014
  • In the near future, traditional narrow and fixed viewpoint video services will be replaced by high-quality panorama video services. This paper proposes a visual-attention-aware progressive region of interest (RoI) trick mode streaming service (VA-PRTS) that prioritizes video data to transmit according to the visual attention and transmits prioritized video data progressively. VA-PRTS enables the receiver to speed up the time to display without degrading the perceptual quality. For the proposed VA-PRTS, this paper defines a cutoff visual attention metric algorithm to determine the quality of the encoded video slice based on the capability of visual attention and the progressive streaming method based on the priority of RoI video data. Compared to conventional methods, VA-PRTS increases the bitrate saving by over 57% and decreases the interactive delay by over 66%, while maintaining a level of perceptual video quality. The experiment results show that the proposed VA-PRTS improves the quality of the viewer experience for interactive panoramic video streaming services. The development results show that the VA-PRTS has highly practical real-field feasibility.

2D-to-3D Conversion System using Depth Map Enhancement

  • Chen, Ju-Chin;Huang, Meng-yuan
    • KSII Transactions on Internet and Information Systems (TIIS)
    • /
    • v.10 no.3
    • /
    • pp.1159-1181
    • /
    • 2016
  • This study introduces an image-based 2D-to-3D conversion system that provides significant stereoscopic visual effects for humans. The linear and atmospheric perspective cues that compensate each other are employed to estimate depth information. Rather than retrieving a precise depth value for pixels from the depth cues, a direction angle of the image is estimated and then the depth gradient, in accordance with the direction angle, is integrated with superpixels to obtain the depth map. However, stereoscopic effects of synthesized views obtained from this depth map are limited and dissatisfy viewers. To obtain impressive visual effects, the viewer's main focus is considered, and thus salient object detection is performed to explore the significance region for visual attention. Then, the depth map is refined by locally modifying the depth values within the significance region. The refinement process not only maintains global depth consistency by correcting non-uniform depth values but also enhances the visual stereoscopic effect. Experimental results show that in subjective evaluation, the subjectively evaluated degree of satisfaction with the proposed method is approximately 7% greater than both existing commercial conversion software and state-of-the-art approach.

A Study on Visual Attention on Color Perception by Visitors of Children's Hospital (어린이병원 방문자의 색채지각에 나타난 시각적 주의에 관한 연구)

  • Cho, Eun-Kil;Son, Kwang-Ho
    • Korean Institute of Interior Design Journal
    • /
    • v.25 no.2
    • /
    • pp.50-58
    • /
    • 2016
  • The design of children's hospitals is highly dependent on color schemes. As a space shared together by both adults and children, the design of children's hospitals require color coordination that takes account of the users' characteristics. Visual perception tracking experiment was conducted on the 2 chosen experimental images with a target group made up of adults and children, the following results were found. First, visual attention characteristics of spatial elements' colors were found. The contrast of colors were discovered to effect attention, especially the information desk region showed highest attention. Pillars are subjected to a higher attention relative to other spatial elements, it is suggested when using accent colors to use it only when it is absolutely necessary in partial areas. In contrast, floor patterns were found to be subjected to very low attention relative to other elements. Second, effects of color contrast on visual attention were uncovered. Although color contrast effects attention for both adults and children, children were found to be more effected by color contrast than adults. Especially, children's tendency to rely on color contrast for visual recognition was higher than adults. Since when using only one type on a wide surface children show higher attention on the < vivid > colors than adults, when planning a color coordination for children using < pale > colors instead of < vivid > ones in background for a large surface is seen as a more desired method to increase attention by putting emphasis on the [sharply contrasting] colors.

An Adaptive ROI Detection System for Spatiotemporal Features (시.공간특징에 대해 적응할 수 있는 ROI 탐지 시스템)

  • Park Min-Chul;Cheoi Kyung-Joo
    • The Journal of the Korea Contents Association
    • /
    • v.6 no.1
    • /
    • pp.41-53
    • /
    • 2006
  • In this paper, an adaptive ROI(region of interest) detection system for spatialtemporal features is proposed. It utilizes spatiotemporal features for the purpose of detecting ROI. It is assumed that motion representing temporal visual conspicuity between adjacent frames takes higher priority over spatial visual conspicuity. Because objects or regions in motion usually draw stronger attention than others in motion pictures. In case of still images visual features that constitute topographic feature maps are used as spatial features. Comparative experiments with a human subjective evaluation show that correct detection rate of visual attention region is improved by exploiting both spatial and temporal features compared to the case of exploiting either feature.

  • PDF