• Title/Summary/Keyword: MLP.

Search Result 676, Processing Time 0.082 seconds

Recognition of dog's front face using deep learning and machine learning (딥러닝 및 기계학습 활용 반려견 얼굴 정면판별 방법)

  • Kim, Jong-Bok;Jang, Dong-Hwa;Yang, Kayoung;Kwon, Kyeong-Seok;Kim, Jung-Kon;Lee, Joon-Whoan
    • Journal of the Korea Academia-Industrial cooperation Society
    • /
    • v.21 no.12
    • /
    • pp.1-9
    • /
    • 2020
  • As pet dogs rapidly increase in number, abandoned and lost dogs are also increasing in number. In Korea, animal registration has been in force since 2014, but the registration rate is not high owing to safety and effectiveness issues. Biometrics is attracting attention as an alternative. In order to increase the recognition rate from biometrics, it is necessary to collect biometric images in the same form as much as possible-from the face. This paper proposes a method to determine whether a dog is facing front or not in a real-time video. The proposed method detects the dog's eyes and nose using deep learning, and extracts five types of directional face information through the relative size and position of the detected face. Then, a machine learning classifier determines whether the dog is facing front or not. We used 2,000 dog images for learning, verification, and testing. YOLOv3 and YOLOv4 were used to detect the eyes and nose, and Multi-layer Perceptron (MLP), Random Forest (RF), and the Support Vector Machine (SVM) were used as classifiers. When YOLOv4 and the RF classifier were used with all five types of the proposed face orientation information, the face recognition rate was best, at 95.25%, and we found that real-time processing is possible.

Development of Marine Debris Monitoring Methods Using Satellite and Drone Images (위성 및 드론 영상을 이용한 해안쓰레기 모니터링 기법 개발)

  • Kim, Heung-Min;Bak, Suho;Han, Jeong-ik;Ye, Geon Hui;Jang, Seon Woong
    • Korean Journal of Remote Sensing
    • /
    • v.38 no.6_1
    • /
    • pp.1109-1124
    • /
    • 2022
  • This study proposes a marine debris monitoring methods using satellite and drone multispectral images. A multi-layer perceptron (MLP) model was applied to detect marine debris using Sentinel-2 satellite image. And for the detection of marine debris using drone multispectral images, performance evaluation and comparison of U-Net, DeepLabv3+ (ResNet50) and DeepLabv3+ (Inceptionv3) among deep learning models were performed (mIoU 0.68). As a result of marine debris detection using satellite image, the F1-Score was 0.97. Marine debris detection using drone multispectral images was performed on vegetative debris and plastics. As a result of detection, when DeepLabv3+ (Inceptionv3) was used, the most model accuracy, mean intersection over union (mIoU), was 0.68. Vegetative debris showed an F1-Score of 0.93 and IoU of 0.86, while plastics showed low performance with an F1-Score of 0.5 and IoU of 0.33. However, the F1-Score of the spectral index applied to generate plastic mask images was 0.81, which was higher than the plastics detection performance of DeepLabv3+ (Inceptionv3), and it was confirmed that plastics monitoring using the spectral index was possible. The marine debris monitoring technique proposed in this study can be used to establish a plan for marine debris collection and treatment as well as to provide quantitative data on marine debris generation.

Predicting Probability of Precipitation Using Artificial Neural Network and Mesoscale Numerical Weather Prediction (인공신경망과 중규모기상수치예보를 이용한 강수확률예측)

  • Kang, Boosik;Lee, Bongki
    • KSCE Journal of Civil and Environmental Engineering Research
    • /
    • v.28 no.5B
    • /
    • pp.485-493
    • /
    • 2008
  • The Artificial Neural Network (ANN) model was suggested for predicting probability of precipitation (PoP) using RDAPS NWP model, observation at AWS and upper-air sounding station. The prediction work was implemented for flood season and the data period is the July, August of 2001 and June of 2002. Neural network input variables (predictors) were composed of geopotential height 500/750/1000 hPa, atmospheric thickness 500-1000 hPa, X & Y-component of wind at 500 hPa, X & Y-component of wind at 750 hPa, wind speed at surface, temperature at 500/750 hPa/surface, mean sea level pressure, 3-hr accumulated precipitation, occurrence of observed precipitation, precipitation accumulated in 6 & 12 hrs previous to RDAPS run, precipitation occurrence in 6 & 12 hrs previous to RDAPS run, relative humidity measured 0 & 12 hrs before RDAPS run, precipitable water measured 0 & 12 hrs before RDAPS run, precipitable water difference in 12 hrs previous to RDAPS run. The suggested ANN has a 3-layer perceptron (multi layer perceptron; MLP) and back-propagation learning algorithm. The result shows that there were 6.8% increase in Hit rate (H), especially 99.2% and 148.1% increase in Threat Score (TS) and Probability of Detection (POD). It illustrates that the suggested ANN model can be a useful tool for predicting rainfall event prediction. The Kuipers Skill Score (KSS) was increased 92.8%, which the ANN model improves the rainfall occurrence prediction over RDAPS.

Multi-View 3D Human Pose Estimation Based on Transformer (트랜스포머 기반의 다중 시점 3차원 인체자세추정)

  • Seoung Wook Choi;Jin Young Lee;Gye Young Kim
    • Smart Media Journal
    • /
    • v.12 no.11
    • /
    • pp.48-56
    • /
    • 2023
  • The technology of Three-dimensional human posture estimation is used in sports, motion recognition, and special effects of video media. Among various methods for this, multi-view 3D human pose estimation is essential for precise estimation even in complex real-world environments. But Existing models for multi-view 3D human posture estimation have the disadvantage of high order of time complexity as they use 3D feature maps. This paper proposes a method to extend an existing monocular viewpoint multi-frame model based on Transformer with lower time complexity to 3D human posture estimation for multi-viewpoints. To expand to multi-viewpoints our proposed method first generates an 8-dimensional joint coordinate that connects 2-dimensional joint coordinates for 17 joints at 4-vieiwpoints acquired using the 2-dimensional human posture detector, CPN(Cascaded Pyramid Network). This paper then converts them into 17×32 data with patch embedding, and enters the data into a transformer model, finally. Consequently, the MLP(Multi-Layer Perceptron) block that outputs the 3D-human posture simultaneously updates the 3D human posture estimation for 4-viewpoints at every iteration. Compared to Zheng[5]'s method the number of model parameters of the proposed method was 48.9%, MPJPE(Mean Per Joint Position Error) was reduced by 20.6 mm (43.8%) and the average learning time per epoch was more than 20 times faster.

  • PDF

Energy Demand/Supply Prediction and Simulator UI Design for Energy Efficiency in the Industrial Complex (산업단지 에너지 효율화를 위한 에너지 수요/공급 예측 및 시뮬레이터 UI 설계)

  • Hyungah Lee;Jong-hyeok Park;Woojin Cho;Dongju Kim;Jae-hoi Gu
    • The Journal of the Convergence on Culture Technology
    • /
    • v.10 no.4
    • /
    • pp.693-700
    • /
    • 2024
  • As of the end of March 2022, the total area of domestic industrial complexes is 606 km2, which is only about 0.6% of the total land area. However, as of 2018, the annual energy consumption of domestic industrial complexes is 110,866.1 thousand TOE, accounting for 53.5% of the country's total energy consumption and 83.1% of the entire industrial sector energy consumption. In addition, industrial complexes have a significant impact on the environment, accounting for 45.1% of the country's total greenhouse gas emissions and 76.8% of industrial sector greenhouse gas emissions. Under this background, in this study, in order to contribute to the energy efficiency of industrial complexes, a prediction study on energy demand and supply for an industrial complex in Korea using machine learning was conducted. In addition, a simulator UI screen was designed to more efficiently convey information on energy demand/supply prediction results and energy consumption status. Among the machine learning algorithms, Multi-Layer Perceptron (MLP) was used, and Bayesian Optimization was applied as an optimization technique for the prediction model. The energy prediction model for the industrial complex built in this study showed a prediction accuracy of 87.90% for compressed air demand and 99.54% for the flow rate available for the public air compressor.

A Study on Artificial Intelligence Models for Predicting the Causes of Chemical Accidents Using Chemical Accident Status and Case Data (화학물질 사고 현황 및 사례 데이터를 이용한 인공지능 사고 원인 예측 모델에 관한 연구)

  • KyungHyun Lee;RackJune Baek;Hyeseong Jung;WooSu Kim;HeeJeong Choi
    • The Journal of the Convergence on Culture Technology
    • /
    • v.10 no.5
    • /
    • pp.725-733
    • /
    • 2024
  • This study aims to develop an artificial intelligence-based model for predicting the causes of chemical accidents, utilizing data on 865 chemical accident situations and cases provided by the Chemical Safety Agency under the Ministry of Environment from January 2014 to January 2024. The research involved training the data using six artificial intelligence models and compared evaluation metrics such as accuracy, precision, recall, and F1 score. Based on 356 chemical accident cases from 2020 to 2024, additional training data sets were applied using chemical accident cause investigations and similar accident prevention measures suggested by the Chemical Safety Agency from 2021 to 2022. Through this process, the Multi-Layer Perceptron (MLP) model showed an accuracy of 0.6590 and a precision of 0.6821. the Multi-Layer Perceptron (MLP) model showed an accuracy of 0.6590 and a precision of 0.6821. The Logistic Regression model improved its accuracy from 0.6647 to 0.7778 and its precision from 0.6790 to 0.7992, confirming that the Logistic Regression model is the most effective for predicting the causes of chemical accidents.

EFFICIENT COMPUTATION OF COMPRESSIBLE FLOW BY HIGHER-ORDER METHOD ACCELERATED USING GPU (고차 정확도 수치기법의 GPU 계산을 통한 효율적인 압축성 유동 해석)

  • Chang, T.K.;Park, J.S.;Kim, C.
    • Journal of computational fluids engineering
    • /
    • v.19 no.3
    • /
    • pp.52-61
    • /
    • 2014
  • The present paper deals with the efficient computation of higher-order CFD methods for compressible flow using graphics processing units (GPU). The higher-order CFD methods, such as discontinuous Galerkin (DG) methods and correction procedure via reconstruction (CPR) methods, can realize arbitrary higher-order accuracy with compact stencil on unstructured mesh. However, they require much more computational costs compared to the widely used finite volume methods (FVM). Graphics processing unit, consisting of hundreds or thousands small cores, is apt to massive parallel computations of compressible flow based on the higher-order CFD methods and can reduce computational time greatly. Higher-order multi-dimensional limiting process (MLP) is applied for the robust control of numerical oscillations around shock discontinuity and implemented efficiently on GPU. The program is written and optimized in CUDA library offered from NVIDIA. The whole algorithms are implemented to guarantee accurate and efficient computations for parallel programming on shared-memory model of GPU. The extensive numerical experiments validates that the GPU successfully accelerates computing compressible flow using higher-order method.

Artificial neural network calculations for a receding contact problem

  • Yaylaci, Ecren Uzun;Yaylaci, Murat;Olmez, Hasan;Birinci, Ahmet
    • Computers and Concrete
    • /
    • v.25 no.6
    • /
    • pp.551-563
    • /
    • 2020
  • This paper investigates the artificial neural network (ANN) to predict the dimensionless parameters for the maximum contact pressures and contact areas of a contact problem. Firstly, the problem is formulated and solved theoretically by using Theory of Elasticity and Integral Transform Technique. Secondly, the contact problem has been extended based on the ANN. The multilayer perceptron (MLP) with three-layer was used to calculate the contact distances. External load, distance between the two quarter planes, layer heights and material properties were created by giving examples of different values were used at the training and test stages of ANN. Program code was rewritten in C++. Different types of network structures were used in the training process. The accuracy of the trained neural networks for the case was tested using 173 new data which were generated via theoretical solutions so as to determine the best network model. As a result, minimum deviation value (difference between theoretical and C++ ANN results) of was obtained for the network model. Theoretical results were compared with artificial neural network results and well agreements between them were achieved.

Cluster-based Linear Projection and %ixture of Experts Model for ATR System (자동 목표물 인식 시스템을 위한 클러스터 기반 투영기법과 혼합 전문가 구조)

  • 신호철;최재철;이진성;조주현;김성대
    • Journal of the Institute of Electronics Engineers of Korea SP
    • /
    • v.40 no.3
    • /
    • pp.203-216
    • /
    • 2003
  • In this paper a new feature extraction and target classification method is proposed for the recognition part of FLIR(Forwar Looking Infrared)-image-based ATR system. Proposed feature extraction method is "cluster(=set of classes)-based"version of previous fisherfaces method that is known by its robustness to illumination changes in face recognition. Expecially introduced class clustering and cluster-based projection method maximizes the performance of fisherfaces method. Proposed target image classification method is based on the mixture of experts model which consists of RBF-type experts and MLP-type gating networks. Mixture of experts model is well-suited with ATR system because it should recognizee various targets in complexed feature space by variously mixed conditions. In proposed classification method, one expert takes charge of one cluster and the separated structure with experts reduces the complexity of feature space and achieves more accurate local discrimination between classes. Proposed feature extraction and classification method showed distinguished performances in recognition test with customized. FLIR-vehicle-image database. Expecially robustness to pixelwise sensor noise and un-wanted intensity variations was verified by simulation.

Prediction of a winner in PGA tournament using neural network (신경망을 이용한 우승자 예측모형)

  • Min, Dae-Kee;Hyun, Moo-Sung
    • Journal of the Korean Data and Information Science Society
    • /
    • v.20 no.6
    • /
    • pp.1119-1127
    • /
    • 2009
  • In PGA golf, total prize money and average score are good response variable related to golf skills such as driving distance, green in regulation and putts per green in regulation. But it's not easy to predict the winner of coming tournament. Thus I applied Neural Networks which has pretty good advantages for non-linear complex modeling to binary data. In neural network architectures, I applied NRBF and MLP architecture model for binary data which represent who had a win or not.

  • PDF