• Title/Summary/Keyword: attention mechanism

Search Result 780, Processing Time 0.027 seconds

EDMFEN: Edge detection-based multi-scale feature enhancement Network for low-light image enhancement

  • Canlin Li;Shun Song;Pengcheng Gao;Wei Huang;Lihua Bi
    • KSII Transactions on Internet and Information Systems (TIIS)
    • /
    • v.18 no.4
    • /
    • pp.980-997
    • /
    • 2024
  • To improve the brightness of images and reveal hidden information in dark areas is the main objective of low-light image enhancement (LLIE). LLIE methods based on deep learning show good performance. However, there are some limitations to these methods, such as the complex network model requires highly configurable environments, and deficient enhancement of edge details leads to blurring of the target content. Single-scale feature extraction results in the insufficient recovery of the hidden content of the enhanced images. This paper proposed an edge detection-based multi-scale feature enhancement network for LLIE (EDMFEN). To reduce the loss of edge details in the enhanced images, an edge extraction module consisting of a Sobel operator is introduced to obtain edge information by computing gradients of images. In addition, a multi-scale feature enhancement module (MSFEM) consisting of multi-scale feature extraction block (MSFEB) and a spatial attention mechanism is proposed to thoroughly recover the hidden content of the enhanced images and obtain richer features. Since the fused features may contain some useless information, the MSFEB is introduced so as to obtain the image features with different perceptual fields. To use the multi-scale features more effectively, a spatial attention mechanism module is used to retain the key features and improve the model performance after fusing multi-scale features. Experimental results on two datasets and five baseline datasets show that EDMFEN has good performance when compared with the stateof-the-art LLIE methods.

Predicting the core thermal hydraulic parameters with a gated recurrent unit model based on the soft attention mechanism

  • Anni Zhang;Siqi Chun;Zhoukai Cheng;Pengcheng Zhao
    • Nuclear Engineering and Technology
    • /
    • v.56 no.6
    • /
    • pp.2343-2351
    • /
    • 2024
  • Accurately predicting the thermal hydraulic parameters of a transient reactor core under different working conditions is the first step toward reactor safety. Mass flow rate and temperature are important parameters of core thermal hydraulics, which have often been modeled as time series prediction problems. This study aims to achieve accurate and continuous prediction of core thermal hydraulic parameters under instantaneous conditions, as well as test the feasibility of a newly constructed gated recurrent unit (GRU) model based on the soft attention mechanism for core parameter predictions. Herein, the China Experimental Fast Reactor (CEFR) is used as the research object, and CEFR 1/2 core was taken as subject to carry out continuous predictive analysis of thermal parameters under transient conditions., while the subchannel analysis code named SUBCHANFLOW is used to generate the time series of core thermal-hydraulic parameters. The GRU model is used to predict the mass flow and temperature time series of the core. The results show that compared to the adaptive radial basis function neural network, the GRU network model produces better prediction results. The average relative error for temperature is less than 0.5 % when the step size is 3, and the prediction effect is better within 15 s. The average relative error of mass flow rate is less than 5 % when the step size is 10, and the prediction effect is better in the subsequent 12 s. The GRU model not only shows a higher prediction accuracy, but also captures the trends of the dynamic time series, which is useful for maintaining reactor safety and preventing nuclear power plant accidents. Furthermore, it can provide long-term continuous predictions under transient reactor conditions, which is useful for engineering applications and improving reactor safety.

Prediction of dam inflow based on LSTM-s2s model using luong attention (Attention 기법을 적용한 LSTM-s2s 모델 기반 댐유입량 예측 연구)

  • Lee, Jonghyeok;Choi, Suyeon;Kim, Yeonjoo
    • Journal of Korea Water Resources Association
    • /
    • v.55 no.7
    • /
    • pp.495-504
    • /
    • 2022
  • With the recent development of artificial intelligence, a Long Short-Term Memory (LSTM) model that is efficient with time-series analysis is being used to increase the accuracy of predicting the inflow of dams. In this study, we predict the inflow of the Soyang River dam, using the LSTM model with the Sequence-to-Sequence (LSTM-s2s) and attention mechanism (LSTM-s2s with attention) that can further improve the LSTM performance. Hourly inflow, temperature, and precipitation data from 2013 to 2020 were used to train the model, and validate and test for evaluating the performance of the models. As a result, the LSTM-s2s with attention showed better performance than the LSTM-s2s in general as well as in predicting a peak value. Both models captured the inflow pattern during the peaks but detailed hourly variability is limitedly simulated. We conclude that the proposed LSTM-s2s with attention can improve inflow forecasting despite its limits in hourly prediction.

Modeling and Analysis of a Biomimetic Foot Mechanism (생체모방 발 메커니즘의 모델링 및 해석)

  • Seo, Jong-Tae;Yi, Byung-Ju
    • Journal of the Korean Society of Manufacturing Technology Engineers
    • /
    • v.18 no.5
    • /
    • pp.521-528
    • /
    • 2009
  • In these days, biomimetic apprioach in the design and control of robotic system has drawn much attention. The human-being and mammals possess their own feet. Using the mobility of their feet, they are able to walk in various environments such as plain land, desert, swamp, and so on. Previously developed biped robots and four-legged robots did not employ such adaptable foot. In this work, a biomimetic foot mechanism is investigated through analysis of the foot structure of the human-being. This foot mechanism consists of a toe, an ankle, a heel, and some springs replacing the foot muscles and tendons. Using five toes and springs, this foot can adapt to various environments. A mathematical modeling for this foot mechanism was performed and its characteristics were observed through numerical simulation.

  • PDF

E2GSM: Energy Effective Gear-Shifting Mechanism in Cloud Storage System

  • You, Xindong;Han, GuangJie;Zhu, Chuan;Dong, Chi;Shen, Jian
    • KSII Transactions on Internet and Information Systems (TIIS)
    • /
    • v.10 no.10
    • /
    • pp.4681-4702
    • /
    • 2016
  • Recently, Massive energy consumption in Cloud Storage System has attracted great attention both in industry and research community. However, most of the solutions utilize single method to reduce the energy consumption only in one aspect. This paper proposed an energy effective gear-shifting mechanism (E2GSM) in Cloud Storage System to save energy consumption from multi-aspects. E2GSM is established on data classification mechanism and data replication management strategy. Data is classified according to its properties and then be placed into the corresponding zones through the data classification mechanism. Data replication management strategies determine the minimum replica number through a mathematical model and make decision on replica placement. Based on the above data classification mechanism and replica management strategies, the energy effective gear-shifting mechanism (E2GSM) can automatically gear-shifting among the nodes. Mathematical analytical model certificates our proposed E2GSM is energy effective. Simulation experiments based on Gridsim show that the proposed gear-shifting mechanism is cost effective. Compared to the other energy-saved mechanism, our E2GSM can save energy consumption substantially at the slight expense of performance loss while meeting the QoS of user.

A Study on Attention Mechanism in DeepLabv3+ for Deep Learning-based Semantic Segmentation (딥러닝 기반의 Semantic Segmentation을 위한 DeepLabv3+에서 강조 기법에 관한 연구)

  • Shin, SeokYong;Lee, SangHun;Han, HyunHo
    • Journal of the Korea Convergence Society
    • /
    • v.12 no.10
    • /
    • pp.55-61
    • /
    • 2021
  • In this paper, we proposed a DeepLabv3+ based encoder-decoder model utilizing an attention mechanism for precise semantic segmentation. The DeepLabv3+ is a semantic segmentation method based on deep learning and is mainly used in applications such as autonomous vehicles, and infrared image analysis. In the conventional DeepLabv3+, there is little use of the encoder's intermediate feature map in the decoder part, resulting in loss in restoration process. Such restoration loss causes a problem of reducing segmentation accuracy. Therefore, the proposed method firstly minimized the restoration loss by additionally using one intermediate feature map. Furthermore, we fused hierarchically from small feature map in order to effectively utilize this. Finally, we applied an attention mechanism to the decoder to maximize the decoder's ability to converge intermediate feature maps. We evaluated the proposed method on the Cityscapes dataset, which is commonly used for street scene image segmentation research. Experiment results showed that our proposed method improved segmentation results compared to the conventional DeepLabv3+. The proposed method can be used in applications that require high accuracy.

Design and Analysis of a New Shift Automation Mechanism for Automated Manual Transmission (AMT용 새로운 변속자동 메커니즘 설계 및 해석)

  • Kim, Jung-Yun;Kim, Gi-Dae
    • Journal of Korean Society of Industrial and Systems Engineering
    • /
    • v.34 no.4
    • /
    • pp.66-71
    • /
    • 2011
  • This article proposes a novel shift automation mechanism for an automated manual transmission (AMT). The development of an automated manual transmission is currently being paid considerable attention by vehicle manufacturers, with the prospects of combining the comfort of an automatic transmission and the high efficiency of a manual transmission. In order to automate the shift mechanism of a manual transmission, the proposed shift automation mechanism consists of two electric motors, cross shaped pinion gears, rack type shift rails, and a ball splined hollow shaft. First we describe the shift mechanism and operating principles of a manual transmission to investigate important design criteria for the shift automation device. And a new shift automation mechanism is described with its structure, elements, and operating principles in detail. Using a conventional manual transmission, we develop a full three-dimensional CAD model of an AMT which includes main components of the manual transmission and the designed shift automation mechanism. Finally we investigate the operating performances and feasibility of the designed AMT by a dynamic analysis.

Pointer Networks based on Skip Pointing Model (스킵 포인팅 모델 기반 포인터 네트워크)

  • Park, Cheoneum;Lee, Changki
    • KIISE Transactions on Computing Practices
    • /
    • v.22 no.12
    • /
    • pp.625-631
    • /
    • 2016
  • Pointer Networks is a model which generates an output sequence with elements that correspond to an input sequence, based on the attention mechanism. A time complexity of the pointer networks is $O(N^2)$ resulting in longer decoding time of the model. This is because the model calculates attention for each input, if size of the input sequence is N. In this paper, we propose the pointer networks based on skip pointing model, which confirms the necessary input vector at decoding for reducing the decoding time of the pointer networks. Furthermore, experiments were conducted for the pronouns coreference resolution, which uses the method proposed in this paper. Our results show that the processing time per sentence was approximately 1.15 times faster, and the MUC F1 was 83.60%; this was approximately 2.17% improvement and a better performance than the original pointer networks.

A Study on Image Generation from Sentence Embedding Applying Self-Attention (Self-Attention을 적용한 문장 임베딩으로부터 이미지 생성 연구)

  • Yu, Kyungho;No, Juhyeon;Hong, Taekeun;Kim, Hyeong-Ju;Kim, Pankoo
    • Smart Media Journal
    • /
    • v.10 no.1
    • /
    • pp.63-69
    • /
    • 2021
  • When a person sees a sentence and understands the sentence, the person understands the sentence by reminiscent of the main word in the sentence as an image. Text-to-image is what allows computers to do this associative process. The previous deep learning-based text-to-image model extracts text features using Convolutional Neural Network (CNN)-Long Short Term Memory (LSTM) and bi-directional LSTM, and generates an image by inputting it to the GAN. The previous text-to-image model uses basic embedding in text feature extraction, and it takes a long time to train because images are generated using several modules. Therefore, in this research, we propose a method of extracting features by using the attention mechanism, which has improved performance in the natural language processing field, for sentence embedding, and generating an image by inputting the extracted features into the GAN. As a result of the experiment, the inception score was higher than that of the model used in the previous study, and when judged with the naked eye, an image that expresses the features well in the input sentence was created. In addition, even when a long sentence is input, an image that expresses the sentence well was created.

Deep learning-based post-disaster building inspection with channel-wise attention and semi-supervised learning

  • Wen Tang;Tarutal Ghosh Mondal;Rih-Teng Wu;Abhishek Subedi;Mohammad R. Jahanshahi
    • Smart Structures and Systems
    • /
    • v.31 no.4
    • /
    • pp.365-381
    • /
    • 2023
  • The existing vision-based techniques for inspection and condition assessment of civil infrastructure are mostly manual and consequently time-consuming, expensive, subjective, and risky. As a viable alternative, researchers in the past resorted to deep learning-based autonomous damage detection algorithms for expedited post-disaster reconnaissance of structures. Although a number of automatic damage detection algorithms have been proposed, the scarcity of labeled training data remains a major concern. To address this issue, this study proposed a semi-supervised learning (SSL) framework based on consistency regularization and cross-supervision. Image data from post-earthquake reconnaissance, that contains cracks, spalling, and exposed rebars are used to evaluate the proposed solution. Experiments are carried out under different data partition protocols, and it is shown that the proposed SSL method can make use of unlabeled images to enhance the segmentation performance when limited amount of ground truth labels are provided. This study also proposes DeepLab-AASPP and modified versions of U-Net++ based on channel-wise attention mechanism to better segment the components and damage areas from images of reinforced concrete buildings. The channel-wise attention mechanism can effectively improve the performance of the network by dynamically scaling the feature maps so that the networks can focus on more informative feature maps in the concatenation layer. The proposed DeepLab-AASPP achieves the best performance on component segmentation and damage state segmentation tasks with mIoU scores of 0.9850 and 0.7032, respectively. For crack, spalling, and rebar segmentation tasks, modified U-Net++ obtains the best performance with Igou scores (excluding the background pixels) of 0.5449, 0.9375, and 0.5018, respectively. The proposed architectures win the second place in IC-SHM2021 competition in all five tasks of Project 2.