• Title/Summary/Keyword: Hidden layers

Search Result 253, Processing Time 0.026 seconds

Evaluation of Artificial Intelligence Accuracy by Increasing the CNN Hidden Layers: Using Cerebral Hemorrhage CT Data (CNN 은닉층 증가에 따른 인공지능 정확도 평가: 뇌출혈 CT 데이터)

  • Kim, Han-Jun;Kang, Min-Ji;Kim, Eun-Ji;Na, Yong-Hyeon;Park, Jae-Hee;Baek, Su-Eun;Sim, Su-Man;Hong, Joo-Wan
    • Journal of the Korean Society of Radiology
    • /
    • v.16 no.1
    • /
    • pp.1-6
    • /
    • 2022
  • Deep learning is a collection of algorithms that enable learning by summarizing the key contents of large amounts of data; it is being developed to diagnose lesions in the medical imaging field. To evaluate the accuracy of the cerebral hemorrhage diagnosis, we used a convolutional neural network (CNN) to derive the diagnostic accuracy of cerebral parenchyma computed tomography (CT) images and the cerebral parenchyma CT images of areas where cerebral hemorrhages are suspected of having occurred. We compared the accuracy of CNN with different numbers of hidden layers and discovered that CNN with more hidden layers resulted in higher accuracy. The analysis results of the derived CT images used in this study to determine the presence of cerebral hemorrhages are expected to be used as foundation data in studies related to the application of artificial intelligence in the medical imaging industry.

Proposal of DNN-based predictive model for calculating concrete mixing proportions accroding to admixture (혼화재 혼입에 따른 콘크리트 배합요소 산정을 위한 DNN 기반의 예측모델 제안)

  • Choi, Ju-Hee;Lee, Kwang-Soo;Lee, Han-Seung
    • Proceedings of the Korean Institute of Building Construction Conference
    • /
    • 2022.11a
    • /
    • pp.57-58
    • /
    • 2022
  • Concrete mix design is used as essential data for the quality of concrete, analysis of structures, and stable use of sustainable structures. However, since most of the formulation design is established based on the experience of experts, there is a lack of data to base it on. are suffering Accordingly, in this study, the purpose of this study is to build a predictive model to use the concrete mixing factor as basic data for calculation using the DNN technique. As for the data set for DNN model learning, OPC and ternary concrete data were collected according to the presence or absence of admixture, respectively, and the model was separated for OPC and ternary concrete, and training was carried out. In addition, by varying the number of hidden layers of the DNN model, the prediction performance was evaluated according to the model structure. The higher the number of hidden layers in the model, the higher the predictive performance for the prediction of the mixing elements except for the compressive strength factor set as the output value, and the ternary concrete model showed higher performance than the OPC. This is expected because the data set used when training the model also affected the training.

  • PDF

A Study on Rotating Object Classification using Deep Neural Networks (깊은신경망을 이용한 회전객체 분류 연구)

  • Lee, Yong-Kyu;Lee, Yill-Byung
    • Journal of the Korean Institute of Intelligent Systems
    • /
    • v.25 no.5
    • /
    • pp.425-430
    • /
    • 2015
  • This paper is a study to improve the classification efficiency of rotating objects by using deep neural networks to which a deep learning algorithm was applied. For the classification experiment of rotating objects, COIL-20 is used as data and total 3 types of classifiers are compared and analyzed. 3 types of classifiers used in the study include PCA classifier to derive a feature value while reducing the dimension of data by using Principal Component Analysis and classify by using euclidean distance, MLP classifier of the way of reducing the error energy by using error back-propagation algorithm and finally, deep learning applied DBN classifier of the way of increasing the probability of observing learning data through pre-training and reducing the error energy through fine-tuning. In order to identify the structure-specific error rate of the deep neural networks, the experiment is carried out while changing the number of hidden layers and number of hidden neurons. The classifier using DBN showed the lowest error rate. Its structure of deep neural networks with 2 hidden layers showed a high recognition rate by moving parameters to a location helpful for recognition.

Alleviation of Vanishing Gradient Problem Using Parametric Activation Functions (파라메트릭 활성함수를 이용한 기울기 소실 문제의 완화)

  • Ko, Young Min;Ko, Sun Woo
    • KIPS Transactions on Software and Data Engineering
    • /
    • v.10 no.10
    • /
    • pp.407-420
    • /
    • 2021
  • Deep neural networks are widely used to solve various problems. However, the deep neural network with a deep hidden layer frequently has a vanishing gradient or exploding gradient problem, which is a major obstacle to learning the deep neural network. In this paper, we propose a parametric activation function to alleviate the vanishing gradient problem that can be caused by nonlinear activation function. The proposed parametric activation function can be obtained by applying a parameter that can convert the scale and location of the activation function according to the characteristics of the input data, and the loss function can be minimized without limiting the derivative of the activation function through the backpropagation process. Through the XOR problem with 10 hidden layers and the MNIST classification problem with 8 hidden layers, the performance of the original nonlinear and parametric activation functions was compared, and it was confirmed that the proposed parametric activation function has superior performance in alleviating the vanishing gradient.

Kriging Regressive Deep Belief WSN-Assisted IoT for Stable Routing and Energy Conserved Data Transmission

  • Muthulakshmi, L.;Banumathi, A.
    • International Journal of Computer Science & Network Security
    • /
    • v.22 no.7
    • /
    • pp.91-102
    • /
    • 2022
  • With the evolution of wireless sensor network (WSN) technology, the routing policy has foremost importance in the Internet of Things (IoT). A systematic routing policy is one of the primary mechanics to make certain the precise and robust transmission of wireless sensor networks in an energy-efficient manner. In an IoT environment, WSN is utilized for controlling services concerning data like, data gathering, sensing and transmission. With the advantages of IoT potentialities, the traditional routing in a WSN are augmented with decision-making in an energy efficient manner to concur finer optimization. In this paper, we study how to combine IoT-based deep learning classifier with routing called, Kriging Regressive Deep Belief Neural Learning (KR-DBNL) to propose an efficient data packet routing to cope with scalability issues and therefore ensure robust data packet transmission. The KR-DBNL method includes four layers, namely input layer, two hidden layers and one output layer for performing data transmission between source and destination sensor node. Initially, the KR-DBNL method acquires the patient data from different location. Followed by which, the input layer transmits sensor nodes to first hidden layer where analysis of energy consumption, bandwidth consumption and light intensity are made using kriging regression function to perform classification. According to classified results, sensor nodes are classified into higher performance and lower performance sensor nodes. The higher performance sensor nodes are then transmitted to second hidden layer. Here high performance sensor nodes neighbouring sensor with higher signal strength and frequency are selected and sent to the output layer where the actual data packet transmission is performed. Experimental evaluation is carried out on factors such as energy consumption, packet delivery ratio, packet loss rate and end-to-end delay with respect to number of patient data packets and sensor nodes.

Prediction of Software Fault Severity using Deep Learning Methods (딥러닝을 이용한 소프트웨어 결함 심각도 예측)

  • Hong, Euyseok
    • The Journal of the Institute of Internet, Broadcasting and Communication
    • /
    • v.22 no.6
    • /
    • pp.113-119
    • /
    • 2022
  • In software fault prediction, a multi classification model that predicts the fault severity category of a module can be much more useful than a binary classification model that simply predicts the presence or absence of faults. A small number of severity-based fault prediction models have been proposed, but no classifier using deep learning techniques has been proposed. In this paper, we construct MLP models with 3 or 5 hidden layers, and they have a structure with a fixed or variable number of hidden layer nodes. As a result of the model evaluation experiment, MLP-based deep learning models shows significantly better performance in both Accuracy and AUC than MLPs, which showed the best performance among models that did not use deep learning. In particular, the model structure with 3 hidden layers, 32 batch size, and 64 nodes shows the best performance.

Fabrication and Experiment of Micromirror with Aluminum Pin-joint (알루미늄 핀-조인트를 사용한 마이크로 미러의 제작과 측정)

  • Ji, Chang-Hyeon;Kim, Yong-Gwon
    • The Transactions of the Korean Institute of Electrical Engineers C
    • /
    • v.49 no.8
    • /
    • pp.487-494
    • /
    • 2000
  • This paper describes the design, fabrication and experiments of surface-micromachined aluminum micromirror array with hidden pin-joints. Instead of the conventional elastic spring components as connection between mirror plate and supporting structure, we used pin-joint composed of pin and staples to support the mirror plate. The placement of pin-joint under the mirror plate makes large active surface area possible. These flexureless micromirrors are driven by electrostatic force. As the mirror plate has discrete deflection angles, the device can be ap;lied to adaptive optics and digitally-operating optical applications. Four-level metal structural layers and semi-cured photoresist sacrificial layers were used in the fabrication process and sacrificial layers were removed by oxygen plasma ashing. Static characteristics of fabricated samples were measured and compared with modeling results.

  • PDF

Weight Distribution of Neural Networks in Computer Vision (컴퓨터 비전에서 신경망의 가중치 분포)

  • Wu, Chenmou;Lee, Hyo-Jon
    • Proceedings of the Korea Information Processing Society Conference
    • /
    • 2022.11a
    • /
    • pp.594-596
    • /
    • 2022
  • Over the last decades, deep neural networks have demonstrated significant success in various tasks. To address the special vision task, choosing a hot network as backbone to extract feature is a common way in both research and industry project. However, the choice of backbone usually requires the expert experience and affects the performance of the classification task. In this work, we propose a novel idea to support backbone decision-making by exploring the feature attribution and weights distribution of hidden layers from various backbones. We first analyze the visualization of feature maps on different size object and different depth layers to observe learning ability. Then, we compared the variance of weights and feature in last three layers. Based on analysis of the feature and wights, we summarize the traits and commonalities of existing networks.

Prediction of Asphalt Pavement Service Life using Deep Learning (딥러닝을 활용한 일반국도 아스팔트포장의 공용수명 예측)

  • Choi, Seunghyun;Do, Myungsik
    • International Journal of Highway Engineering
    • /
    • v.20 no.2
    • /
    • pp.57-65
    • /
    • 2018
  • PURPOSES : The study aims to predict the service life of national highway asphalt pavements through deep learning methods by using maintenance history data of the National Highway Pavement Management System. METHODS : For the configuration of a deep learning network, this study used Tensorflow 1.5, an open source program which has excellent usability among deep learning frameworks. For the analysis, nine variables of cumulative annual average daily traffic, cumulative equivalent single axle loads, maintenance layer, surface, base, subbase, anti-frost layer, structural number of pavement, and region were selected as input data, while service life was chosen to construct the input layer and output layers as output data. Additionally, for scenario analysis, in this study, a model was formed with four different numbers of 1, 2, 4, and 8 hidden layers and a simulation analysis was performed according to the applicability of the over fitting resolution algorithm. RESULTS : The results of the analysis have shown that regardless of the number of hidden layers, when an over fitting resolution algorithm, such as dropout, is applied, the prediction capability is improved as the coefficient of determination ($R^2$) of the test data increases. Furthermore, the result of the sensitivity analysis of the applicability of region variables demonstrates that estimating service life requires sufficient consideration of regional characteristics as $R^2$ had a maximum of between 0.73 and 0.84, when regional variables where taken into consideration. CONCLUSIONS : As a result, this study proposes that it is possible to precisely predict the service life of national highway pavement sections with the consideration of traffic, pavement thickness, and regional factors and concludes that the use of the prediction of service life is fundamental data in decision making within pavement management systems.

Comparison of the BOD Forecasting Ability of the ARIMA model and the Artificial Neural Network Model (ARIMA 모형과 인공신경망모형의 BOD예측력 비교)

  • 정효준;이홍근
    • Journal of Environmental Health Sciences
    • /
    • v.28 no.3
    • /
    • pp.19-25
    • /
    • 2002
  • In this paper, the water quality forecast was performed on the BOD of the Chungju Dam using the ARIMA model, which is a nonlinear statistics model, and the artificial neural network model. The monthly data of water quality were collected from 1991 to 2000. The most appropriate ARIMA model for Chungju dam was found to be the multiplicative seasonal ARIMA(1,0,1)(1,0,1)$_{12}$, model. While the artificial neural network model, which is used relatively often in recent days, forecasts new data by the strength of a learned matrix like human neurons. The BOD values were forecasted using the back-propagation algorithm of multi-layer perceptrons in this paper. Artificial neural network model was com- posed of two hidden layers and the node number of each hidden layer was designed fifteen. It was demonstrated that the ARIMA model was more appropriate in terms of changes around the overall average, but the artificial neural net-work model was more appropriate in terms of reflecting the minimum and the maximum values.s.