• Title/Summary/Keyword: multi-scale representation

Search Result 43, Processing Time 0.029 seconds

Enhanced SIFT Descriptor Based on Modified Discrete Gaussian-Hermite Moment

  • Kang, Tae-Koo;Zhang, Huazhen;Kim, Dong W.;Park, Gwi-Tae
    • ETRI Journal
    • /
    • v.34 no.4
    • /
    • pp.572-582
    • /
    • 2012
  • The discrete Gaussian-Hermite moment (DGHM) is a global feature representation method that can be applied to square images. We propose a modified DGHM (MDGHM) method and an MDGHM-based scale-invariant feature transform (MDGHM-SIFT) descriptor. In the MDGHM, we devise a movable mask to represent the local features of a non-square image. The complete set of non-square image features are then represented by the summation of all MDGHMs. We also propose to apply an accumulated MDGHM using multi-order derivatives to obtain distinguishable feature information in the third stage of the SIFT. Finally, we calculate an MDGHM-based magnitude and an MDGHM-based orientation using the accumulated MDGHM. We carry out experiments using the proposed method with six kinds of deformations. The results show that the proposed method can be applied to non-square images without any image truncation and that it significantly outperforms the matching accuracy of other SIFT algorithms.

Experimental and numerical assessment of EBF structures with shear links

  • Caprili, Silvia;Mussini, Nicola;Salvatore, Walter
    • Steel and Composite Structures
    • /
    • v.28 no.2
    • /
    • pp.123-138
    • /
    • 2018
  • Eccentrically braced frames (EBF) represent an optimal structural solution for seismic prone areas, being able to provide high dissipative capacity and good elastic stiffness, to withstand strong seismic events without significant loss of bearing capacity and to avoid damage to non-structural elements in case of low and moderate earthquakes. The accurate knowledge of the cyclic behaviour of the dissipative links, characterizing the whole performance of EBFs, is required to optimize the structural properties and to refine the design techniques adopted for multi-storey buildings' analysis. Reliable numerical models for the links, at the same time requiring a limited computational effort, are then needed. The present work shows the results of a wide experimental test campaign executed on real-scale one storey/one bay frames with horizontal and vertical links, together with the elaboration of a simple semi-analytical model for the quick representation of the cyclic behaviour of shear links.

Vehicle Image Recognition Using Deep Convolution Neural Network and Compressed Dictionary Learning

  • Zhou, Yanyan
    • Journal of Information Processing Systems
    • /
    • v.17 no.2
    • /
    • pp.411-425
    • /
    • 2021
  • In this paper, a vehicle recognition algorithm based on deep convolutional neural network and compression dictionary is proposed. Firstly, the network structure of fine vehicle recognition based on convolutional neural network is introduced. Then, a vehicle recognition system based on multi-scale pyramid convolutional neural network is constructed. The contribution of different networks to the recognition results is adjusted by the adaptive fusion method that adjusts the network according to the recognition accuracy of a single network. The proportion of output in the network output of the entire multiscale network. Then, the compressed dictionary learning and the data dimension reduction are carried out using the effective block structure method combined with very sparse random projection matrix, which solves the computational complexity caused by high-dimensional features and shortens the dictionary learning time. Finally, the sparse representation classification method is used to realize vehicle type recognition. The experimental results show that the detection effect of the proposed algorithm is stable in sunny, cloudy and rainy weather, and it has strong adaptability to typical application scenarios such as occlusion and blurring, with an average recognition rate of more than 95%.

Adaptive Enhancement Method for Robot Sequence Motion Images

  • Yu Zhang;Guan Yang
    • Journal of Information Processing Systems
    • /
    • v.19 no.3
    • /
    • pp.370-376
    • /
    • 2023
  • Aiming at the problems of low image enhancement accuracy, long enhancement time and poor image quality in the traditional robot sequence motion image enhancement methods, an adaptive enhancement method for robot sequence motion image is proposed. The feature representation of the image was obtained by Karhunen-Loeve (K-L) transformation, and the nonlinear relationship between the robot joint angle and the image feature was established. The trajectory planning was carried out in the robot joint space to generate the robot sequence motion image, and an adaptive homomorphic filter was constructed to process the noise of the robot sequence motion image. According to the noise processing results, the brightness of robot sequence motion image was enhanced by using the multi-scale Retinex algorithm. The simulation results showed that the proposed method had higher accuracy and consumed shorter time for enhancement of robot sequence motion images. The simulation results showed that the image enhancement accuracy of the proposed method could reach 100%. The proposed method has important research significance and economic value in intelligent monitoring, automatic driving, and military fields.

Deep Reference-based Dynamic Scene Deblurring

  • Cunzhe Liu;Zhen Hua;Jinjiang Li
    • KSII Transactions on Internet and Information Systems (TIIS)
    • /
    • v.18 no.3
    • /
    • pp.653-669
    • /
    • 2024
  • Dynamic scene deblurring is a complex computer vision problem owing to its difficulty to model mathematically. In this paper, we present a novel approach for image deblurring with the help of the sharp reference image, which utilizes the reference image for high-quality and high-frequency detail results. To better utilize the clear reference image, we develop an encoder-decoder network and two novel modules are designed to guide the network for better image restoration. The proposed Reference Extraction and Aggregation Module can effectively establish the correspondence between blurry image and reference image and explore the most relevant features for better blur removal and the proposed Spatial Feature Fusion Module enables the encoder to perceive blur information at different spatial scales. In the final, the multi-scale feature maps from the encoder and cascaded Reference Extraction and Aggregation Modules are integrated into the decoder for a global fusion and representation. Extensive quantitative and qualitative experimental results from the different benchmarks show the effectiveness of our proposed method.

Human Action Recognition Via Multi-modality Information

  • Gao, Zan;Song, Jian-Ming;Zhang, Hua;Liu, An-An;Xue, Yan-Bing;Xu, Guang-Ping
    • Journal of Electrical Engineering and Technology
    • /
    • v.9 no.2
    • /
    • pp.739-748
    • /
    • 2014
  • In this paper, we propose pyramid appearance and global structure action descriptors on both RGB and depth motion history images and a model-free method for human action recognition. In proposed algorithm, we firstly construct motion history image for both RGB and depth channels, at the same time, depth information is employed to filter RGB information, after that, different action descriptors are extracted from depth and RGB MHIs to represent these actions, and then multimodality information collaborative representation and recognition model, in which multi-modality information are put into object function naturally, and information fusion and action recognition also be done together, is proposed to classify human actions. To demonstrate the superiority of the proposed method, we evaluate it on MSR Action3D and DHA datasets, the well-known dataset for human action recognition. Large scale experiment shows our descriptors are robust, stable and efficient, when comparing with the-state-of-the-art algorithms, the performances of our descriptors are better than that of them, further, the performance of combined descriptors is much better than just using sole descriptor. What is more, our proposed model outperforms the state-of-the-art methods on both MSR Action3D and DHA datasets.

The Impacts of the Service Quality of Coffee Shop Adapting the CoffeeSERV on Customer's Perceived Value, Customer Satisfaction, Behavioral Intention: Focusing on Regulatory Focus Theory (CoffeeSERV측정모형을 활용한 커피전문점 서비스품질의 가치지각, 고객만족, 행동의도의 영향관계 연구: 조절초점동기의 조절효과를 중심으로)

  • KANG, Hwa-Seok
    • The Korean Journal of Franchise Management
    • /
    • v.10 no.3
    • /
    • pp.37-52
    • /
    • 2019
  • Purpose - This study examined the relationship between service quality, perceived value, customer satisfaction and behavioral intention of coffee shop using CoffeeSERV scale. In this model, CoffeeSERV scale consists of fundamental characteristics, physical environment, confidence, beverage characteristics, and representation factors. In particular, this study tried to demonstrate the moderating effect of customer's regulatory focus orientation among in the relationships between service quality, perceived value, customer satisfaction and behavioral intention. Research design, data, and methodology - This study intends to expand the existing service quality research by using the coffee shop service quality measurement tool developed by domestic researchers. I wanted to find some implications for the trend. In particular, this study applied the regulatory focus theory to identify individual differences of customers regulatory focusing motivation. In order to verify several hypotheses, the data were 227 college students and analyzed with SPSS/PC 21.0 and SmartPLS 3 program. The moderating role of customer's regulatory focusing motivation was tested using multi-group analysis with SmartPLS 3 program. Results - The resutls are as follows. First, the fundamental characteristic factors only had a significant influence on the utilitarian value perception, but in the hedonic value perception, all other service factors except for the beverage characteristic had a statistically significant effect. Second, utilitarian and hedonic value had significant effects on customer satisfaction. Third, customer satisfaction had a significant effect on behavioral intention. Finally, the regulatory focus orientation played a moderating role in the relationship between beverage characteristic - utilitarian value, representation - utilitarian value, fundamental characteristic - hedonic value, physical environment - hedonic value, confidence - hedonic value, and utilitarian value - behavioral intention. Conclusions - The results of this study show that the various service quality factors that make up the CoffeeSERV scale have different effects on utilitarian and hedonic value. This means that perceived benefits from product and service experience have different impacts on the customer's experience. Therefore, marketers should identify the impacts of service quality dimension that customers who use coffee shops consider important, understand the impact process of these quality factors on experience value, customer satisfaction, and behavioral intention, and allocate limited marketing budget. The results also show that it is possible to establish differentiatied response strategies using customer's regulatory focus orientation to find ways to enhance utlitarian and hedonic value, customer satisfaction, and behavioral intention using various Coffeeshop service quality factors. At the end of this paper, some limitations and future research directions were suggested.

Moving Object Detection Using Sparse Approximation and Sparse Coding Migration

  • Li, Shufang;Hu, Zhengping;Zhao, Mengyao
    • KSII Transactions on Internet and Information Systems (TIIS)
    • /
    • v.14 no.5
    • /
    • pp.2141-2155
    • /
    • 2020
  • In order to meet the requirements of background change, illumination variation, moving shadow interference and high accuracy in object detection of moving camera, and strive for real-time and high efficiency, this paper presents an object detection algorithm based on sparse approximation recursion and sparse coding migration in subspace. First, low-rank sparse decomposition is used to reduce the dimension of the data. Combining with dictionary sparse representation, the computational model is established by the recursive formula of sparse approximation with the video sequences taken as subspace sets. And the moving object is calculated by the background difference method, which effectively reduces the computational complexity and running time. According to the idea of sparse coding migration, the above operations are carried out in the down-sampling space to further reduce the requirements of computational complexity and memory storage, and this will be adapt to multi-scale target objects and overcome the impact of large anomaly areas. Finally, experiments are carried out on VDAO datasets containing 59 sets of videos. The experimental results show that the algorithm can detect moving object effectively in the moving camera with uniform speed, not only in terms of low computational complexity but also in terms of low storage requirements, so that our proposed algorithm is suitable for detection systems with high real-time requirements.

Survey on Deep Learning-based Panoptic Segmentation Methods (딥 러닝 기반의 팬옵틱 분할 기법 분석)

  • Kwon, Jung Eun;Cho, Sung In
    • IEMEK Journal of Embedded Systems and Applications
    • /
    • v.16 no.5
    • /
    • pp.209-214
    • /
    • 2021
  • Panoptic segmentation, which is now widely used in computer vision such as medical image analysis, and autonomous driving, helps understanding an image with holistic view. It identifies each pixel by assigning a unique class ID, and an instance ID. Specifically, it can classify 'thing' from 'stuff', and provide pixel-wise results of semantic prediction and object detection. As a result, it can solve both semantic segmentation and instance segmentation tasks through a unified single model, producing two different contexts for two segmentation tasks. Semantic segmentation task focuses on how to obtain multi-scale features from large receptive field, without losing low-level features. On the other hand, instance segmentation task focuses on how to separate 'thing' from 'stuff' and how to produce the representation of detected objects. With the advances of both segmentation techniques, several panoptic segmentation models have been proposed. Many researchers try to solve discrepancy problems between results of two segmentation branches that can be caused on the boundary of the object. In this survey paper, we will introduce the concept of panoptic segmentation, categorize the existing method into two representative methods and explain how it is operated on two methods: top-down method and bottom-up method. Then, we will analyze the performance of various methods with experimental results.

Color Image Rendering using A Modified Image Formation Model (변형된 영상 생성 모델을 이용한 칼라 영상 보정)

  • Choi, Ho-Hyoung;Yun, Byoung-Ju
    • Journal of the Institute of Electronics Engineers of Korea SP
    • /
    • v.48 no.1
    • /
    • pp.71-79
    • /
    • 2011
  • The objective of the imaging pipeline is to transform the original scene into a display image that appear similar, Generally, gamma adjustment or histogram-based method is modified to improve the contrast and detail. However, this is insufficient as the intensity and the chromaticity of illumination vary with geometric position. Thus, MSR (Multi-Scale Retinex) has been proposed. the MSR is based on a channel-independent logarithm, and it is dependent on the scale of the Gaussian filter, which varies according to input image. Therefore, after correcting the color, image quality degradations, such as halo, graying-out, and dominated color, may occur. Accordingly, this paper presents a novel color correction method using a modified image formation model in which the image is divided into three components such as global illumination, local illumination, and reflectance. The global illumination is obtained through Gaussian filtering of the original image, and the local illumination is estimated by using JND-based adaptive filter. Thereafter, the reflectance is estimated by dividing the original image by the estimated global and the local illumination to remove the influence of the illumination effects. The output image is obtained based on sRGB color representation. The experiment results show that the proposed method yields better performance of color correction over the conventional methods.