Search | Korea Science

Content-Adaptive Model Update of Convolutional Neural Networks for Super-Resolution

Ki, Sehwan;Kim, Munchurl
- Proceedings of the Korean Society of Broadcast Engineers Conference / 2020.11a / pp.234-236 / 2020
Content-adaptive training and transmission of neural network model parameters can boost super-resolution (SR) performance with higher restoration fidelity. This makes efficient transmission of the network parameters essential. We therefore propose a novel method of compressing the network model parameters, in which the network is trained so that the residues of the filter parameters and the content loss are jointly minimized. Only the residues of the filter parameters are then transmitted to the receiver side for different temporal portions of the video under consideration. This is advantageous for image restoration applications with low-complexity receivers (user terminals), which are assumed to have limited computation and storage resources.
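As a rough illustration of transmitting only filter-parameter residues, the sketch below quantizes the difference between a base filter and a content-adapted filter and rebuilds the adapted filter at the receiver. The function names, the uniform quantization step, and the sample weights are assumptions made for illustration; the abstract does not specify the paper's actual training loss or coding scheme.

```python
def encode_residual(base, adapted, step=0.01):
    """Sender side: quantize (adapted - base) to integer steps (assumed uniform quantizer)."""
    return [round((a - b) / step) for a, b in zip(adapted, base)]


def decode_residual(base, q_residual, step=0.01):
    """Receiver side: rebuild the adapted weights from the shared base model plus residues."""
    return [b + q * step for b, q in zip(base, q_residual)]


# Hypothetical filter weights: the base model is shared in advance,
# the adapted model is trained for one temporal portion of the video.
base = [0.50, -0.20, 0.10, 0.00]
adapted = [0.52, -0.20, 0.07, 0.01]

q = encode_residual(base, adapted)   # small integers, cheap to transmit
restored = decode_residual(base, q)  # approximately equal to `adapted`
```

Training that also penalizes the residues keeps them small, so most quantized entries are at or near zero and compress well.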

Analyzing DNN Model Performance Depending on Backbone Network (백본 네트워크에 따른 사람 속성 검출 모델의 성능 변화 분석)

Chun-Su Park
- Journal of the Semiconductor & Display Technology / v.22 no.2 / pp.128-132 / 2023
Recently, with the development of deep learning technology, research on pedestrian attribute recognition using deep neural networks has been actively conducted. Existing pedestrian attribute recognition techniques can be categorized as global-based, regional-area-based, visual attention-based, sequential prediction-based, and newly designed loss function-based, depending on how pedestrian attributes are detected. The performance of these techniques is known to vary greatly depending on the type of backbone network that constitutes the deep neural network model. Therefore, in this paper, several backbone networks are applied to a baseline pedestrian attribute recognition model and the resulting changes in model performance are analyzed. The analysis uses Resnet34, Resnet50, Resnet101, Swin-tiny, and Swinv2-tiny, which are representative backbone networks used in fields such as image classification and object detection. Furthermore, this paper analyzes the change in time complexity when running inference with each backbone network on a CPU and a GPU.

Traffic Flow Prediction with Spatio-Temporal Information Fusion using Graph Neural Networks

Huijuan Ding;Giseop Noh
- International journal of advanced smart convergence / v.12 no.4 / pp.88-97 / 2023
Traffic flow prediction is of great significance in urban planning and traffic management. As the complexity of urban traffic increases, existing prediction methods still face challenges, especially in fusing spatiotemporal information and capturing long-term dependencies. This study uses a fusion model based on graph neural networks to address the spatiotemporal information fusion problem in traffic flow prediction. We propose a new deep learning model, Spatio-Temporal Information Fusion using Graph Neural Networks (STFGNN). GCN, TCN, and LSTM modules are used alternately to carry out spatiotemporal information fusion. The GCN and the multi-core TCN capture the spatial and temporal dependencies of traffic flow, respectively, and the LSTM connects multiple fusion modules. In an experimental evaluation on real traffic flow data, STFGNN showed better performance than other models.
https://doi.org/10.7236/IJASC.2023.12.4.88

Oriented object detection in satellite images using convolutional neural network based on ResNeXt

Asep Haryono;Grafika Jati;Wisnu Jatmiko
- ETRI Journal / v.46 no.2 / pp.307-322 / 2024
Most object detection methods use a horizontal bounding box, which causes problems between adjacent objects with arbitrary directions and results in misaligned detection. Hence, the horizontal anchor should be replaced by a rotating anchor to determine oriented bounding boxes. A two-stage process of delineating a horizontal bounding box and then converting it into an oriented bounding box is inefficient. To improve detection, a box-boundary-aware vector can be estimated based on a convolutional neural network. Specifically, we propose a ResNeXt101 encoder to overcome the weaknesses of the conventional ResNet, which is less effective as the network depth and complexity increase. Owing to its cardinality, with a homogeneous design and a multi-branch architecture with few hyperparameters, ResNeXt captures better information than ResNet. Experimental results demonstrate that our proposal achieves more accurate and faster oriented object detection than a baseline, with a mean average precision of 89.41% and an inference rate of 23.67 fps.
https://doi.org/10.4218/etrij.2022-0446

Complexity Control Method of Chaos Dynamics in Recurrent Neural Networks

Sakai, Masao;Honma, Noriyasu;Abe, Kenichi
- 제어로봇시스템학회:학술대회논문집 (Institute of Control, Robotics and Systems: Conference Proceedings) / 2000.10a / pp.494-494 / 2000
This paper demonstrates that the largest Lyapunov exponent $\lambda$ of recurrent neural networks can be controlled by a gradient method. The method minimizes a squared error $e_{\lambda}=(\lambda-\lambda^{obj})^2$, where $\lambda^{obj}$ is the desired exponent. $\lambda$ can be expressed as a function of the network parameters $P$, such as the connection weights and the thresholds of the neurons' activation. The parameter changes that minimize the error are then obtained from the gradients $\partial\lambda/\partial P$. In a previous paper, we derived a control method for $\lambda$ via a direct calculation of $\partial\lambda/\partial P$ with a gradient collection through time. That method, however, is computationally expensive for large-scale recurrent networks, and the control is unstable for recurrent networks with chaotic dynamics. The new method proposed in this paper is based on a stochastic relation between the complexity $\lambda$ and the parameters $P$ of the network configuration under a restriction. The new method allows us to approximate the gradient collection without time evolution. This approximation requires only $O(N^2)$ run time, while our previous method needs $O(N^{5}T)$ run time for networks with $N$ neurons and $T$ evolution steps. Simulation results show that the new method can realize "stable" control for large-scale networks with chaotic dynamics.
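For reference, the gradient step implied by the squared error in the abstract above follows from the chain rule. The step size $\eta > 0$ is an assumed symbol that does not appear in the abstract, and the plain gradient-descent update is a generic sketch rather than the authors' exact rule:

$$\frac{\partial e_{\lambda}}{\partial P} = 2\,(\lambda-\lambda^{obj})\,\frac{\partial\lambda}{\partial P}, \qquad P \leftarrow P - \eta\,\frac{\partial e_{\lambda}}{\partial P} = P - 2\eta\,(\lambda-\lambda^{obj})\,\frac{\partial\lambda}{\partial P}$$

The cost of the method is therefore dominated by how $\partial\lambda/\partial P$ is obtained, which is the quantity the two run-time bounds refer to.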

Support vector machine for prediction of the compressive strength of no-slump concrete

Sobhani, J.;Khanzadi, M.;Movahedian, A.H.
- Computers and Concrete / v.11 no.4 / pp.337-350 / 2013
The sensitivity of the compressive strength of no-slump concrete to its ingredient materials and proportions necessitates the use of robust models that guarantee both estimation and generalization. The problem of compressive strength prediction is known to involve a high degree of complexity and uncertainty due to the variable nature of materials, workmanship quality, etc. Moreover, the use of chemical and mineral additives adds to the problem's complexity. Traditionally, this property of concrete is predicted by conventional linear or nonlinear regression models. In general, these models offer lower accuracy, and in most cases they fail to meet the extrapolation accuracy and generalization requirements. Recently, robust artificial intelligence-based systems have been successfully implemented in this area. In this regard, this paper investigates the use of an optimized support vector machine (SVM) to predict the compressive strength of no-slump concrete and compares it with an optimized artificial neural network (ANN). The results showed that, after the optimization process, both models are applicable for prediction purposes with similarly high estimation and generalization quality; however, optimization and modeling with SVM is much faster than with ANN models.
https://doi.org/10.12989/cac.2013.11.4.337

A Study on Compression of Connections in Deep Artificial Neural Networks (인공신경망의 연결압축에 대한 연구)

Ahn, Heejune
- Journal of Korea Society of Industrial Information Systems / v.22 no.5 / pp.17-24 / 2017
Recently, deep learning, i.e., technologies using large or deep artificial neural networks, has shown remarkable performance, and the increasing size of the networks contributes to this performance improvement. However, increasing the size of a neural network increases the amount of computation, which causes problems such as circuit complexity, cost, heat generation, and real-time constraints. In this paper, we propose and test a method that reduces the number of network connections by effectively pruning redundant connections while keeping the performance difference from the original neural network within a desired range. In particular, we propose a simple method that improves performance through re-learning and guarantees the desired performance by allocating an error rate to each layer in order to account for the differences between layers. Experiments on typical neural network structures, namely FCN (fully connected network) and CNN (convolutional neural network), confirmed that performance similar to that of the original neural network can be obtained with only about 1/10 of the connections.
https://doi.org/10.9723/jksiis.2017.22.5.017
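The connection-pruning step in the abstract above can be illustrated with a minimal sketch that keeps roughly 1/10 of a layer's weights. Ranking connections by absolute magnitude is an assumption here, since the abstract does not state the paper's redundancy criterion; the re-learning and per-layer error allocation steps are not shown, and `prune_by_magnitude` and the sample weights are hypothetical.

```python
def prune_by_magnitude(weights, keep_ratio=0.1):
    """Zero every weight except the largest-magnitude fraction (assumed criterion)."""
    keep = max(1, int(len(weights) * keep_ratio))
    # Magnitude of the keep-th largest weight; ties at the threshold are all kept.
    threshold = sorted((abs(w) for w in weights), reverse=True)[keep - 1]
    return [w if abs(w) >= threshold else 0.0 for w in weights]


# Hypothetical weights of one small layer with 10 connections.
layer = [0.05, -0.9, 0.3, 0.01, -0.02, 0.4, 0.0, -0.1, 0.2, 0.03]
pruned = prune_by_magnitude(layer, keep_ratio=0.1)
remaining = sum(1 for w in pruned if w != 0.0)
```

In a full pipeline, `keep_ratio` would presumably be chosen per layer so that each layer stays within its allocated error rate, followed by retraining of the surviving connections.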

Research and Optimization of Face Detection Algorithm Based on MTCNN Model in Complex Environment (복잡한 환경에서 MTCNN 모델 기반 얼굴 검출 알고리즘 개선 연구)

Fu, Yumei;Kim, Minyoung;Jang, Jong-wook
- Journal of the Korea Institute of Information and Communication Engineering / v.24 no.1 / pp.50-56 / 2020
With the rapid development of deep neural network theory and applications, face detection performance has improved. However, because of the computational complexity of deep neural networks and the high complexity of the detection environment, detecting faces quickly and accurately remains the main problem. This paper builds on the relatively simple MTCNN (Multi-Task Cascaded Convolutional Neural Network) model, using the FDDB (Face Detection Data Set and Benchmark), LFW (Labeled Faces in the Wild), and FaceScrub public datasets as training samples. While reviewing and introducing the MTCNN model, it explores how to improve training speed and performance at the same time. A dynamic image pyramid is used in place of the traditional image pyramid to segment samples, and the OHEM (online hard example mining) function of the MTCNN model is removed during training in order to improve training speed.
https://doi.org/10.6109/jkiice.2020.24.1.50

A Position Sensorless Control System of SRM using Neural Network (신경회로망을 이용한 위치센서 없는 스위치드 릴럭턴스 전동기의 제어시스템)

김민회;백원식;이상석;박찬규
- The Transactions of the Korean Institute of Power Electronics / v.9 no.3 / pp.246-252 / 2004
This paper presents a position sensorless control system for a Switched Reluctance Motor (SRM) using a neural network. The control of an SRM depends on the commutation of the stator phases in synchronism with the rotor position. The position sensing requirement increases the overall cost and complexity. In this paper, position sensorless operation of the SRM based on a current-flux-rotor position lookup table is presented. A neural network is used to construct the current-flux-rotor position lookup table and is trained with sufficient experimental data. Experimental results for a 1-hp SRM are presented to verify the proposed sensorless algorithm.

CCTV Object Detection with Background Subtraction and Convolutional Neural Network (배경 차분과 CNN 기반의 CCTV 객체 검출)

Kim, Young-Min;Lee, Jiyoung;Yoon, Illo;Han, Taekjin;Kim, Chulyeon
- KIISE Transactions on Computing Practices / v.24 no.3 / pp.151-156 / 2018
In this paper, a method to classify objects in outdoor CCTV images using a Convolutional Neural Network (CNN) and background subtraction is proposed. Object candidates are extracted using background subtraction and are then classified with the CNN to detect objects in the image. As a result, computational complexity is greatly reduced in comparison to other object detection algorithms. A database is constructed by filming alleys and playgrounds, places where crime mainly occurs. In the experiments, different image sizes and experimental settings are tested to construct the best classifier for detecting people. The final classification accuracy reached 80% for data from the same camera and 67.5% for a different camera.
https://doi.org/10.5626/KTCP.2018.24.3.151
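The candidate-extraction stage described in the abstract above can be sketched with simple frame differencing. The fixed threshold, the single-box output, and the tiny grayscale frames are illustrative assumptions; the abstract does not say which background subtraction algorithm the paper uses, and the CNN classifier that would receive the cropped candidate is omitted.

```python
def foreground_mask(frame, background, threshold=30):
    """Mark pixels whose absolute difference from the background exceeds the threshold."""
    return [[abs(p - b) > threshold for p, b in zip(frame_row, bg_row)]
            for frame_row, bg_row in zip(frame, background)]


def bounding_box(mask):
    """Return (top, left, bottom, right) enclosing all foreground pixels, or None."""
    coords = [(r, c) for r, row in enumerate(mask) for c, on in enumerate(row) if on]
    if not coords:
        return None
    rows = [r for r, _ in coords]
    cols = [c for _, c in coords]
    return min(rows), min(cols), max(rows), max(cols)


# Hypothetical 4x5 grayscale frames: a static background and a frame with a bright object.
background = [[10] * 5 for _ in range(4)]
frame = [row[:] for row in background]
frame[1][2] = 200
frame[2][3] = 180

box = bounding_box(foreground_mask(frame, background))  # candidate region to crop and classify
```

Because only the cropped candidate regions are passed to the classifier, the CNN runs on a small part of each frame, which is where the complexity reduction mentioned in the abstract would come from.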

Search Result 283, Processing Time 0.034 seconds
