Search | Korea Science

Deep Learning Model Parallelism (딥러닝 모델 병렬 처리)

Park, Y.M.;Ahn, S.Y.;Lim, E.J.;Choi, Y.S.;Woo, Y.C.;Choi, W.
- Electronics and Telecommunications Trends, v.33 no.4, pp.1-13, 2018
Deep learning (DL) models have been widely applied to AI applications such as image recognition and language translation with big data. Recently, DL models have become larger and more complicated, and have merged together. To accelerate the training of large-scale deep learning models, a few distributed deep learning frameworks provide model parallelism, which partitions the model parameters for non-shared parallel access and updates across multiple machines. Model parallelism as a training acceleration method, however, is not as commonly used as data parallelism, owing to the difficulty of making it efficient. This paper provides a comprehensive survey of the state of the art in model parallelism by comparing the implementation technologies in several deep learning frameworks that support model parallelism, and suggests future research directions for improving model parallelism technology.
https://doi.org/10.22648/ETRI.2018.J.330401
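
The partitioning idea in the abstract above can be illustrated with a toy sketch. It is not taken from the paper or from any framework it surveys: the function names (`split_columns`, `partial_forward`, `model_parallel_forward`) are hypothetical, the "workers" are simulated in a single process, and column-wise splitting of one dense layer is only one of several possible partitioning schemes.

```python
from typing import List

Matrix = List[List[float]]


def split_columns(weights: Matrix, num_workers: int) -> List[Matrix]:
    """Partition a weight matrix column-wise into contiguous shards, one per worker."""
    n_cols = len(weights[0])
    base, extra = divmod(n_cols, num_workers)
    shards, start = [], 0
    for w in range(num_workers):
        width = base + (1 if w < extra else 0)  # spread the remainder over the first workers
        shards.append([row[start:start + width] for row in weights])
        start += width
    return shards


def partial_forward(x: List[float], shard: Matrix) -> List[float]:
    """Compute x @ shard using only the parameters held by one (simulated) worker."""
    n_cols = len(shard[0]) if shard else 0
    return [sum(x[i] * shard[i][j] for i in range(len(x))) for j in range(n_cols)]


def model_parallel_forward(x: List[float], weights: Matrix, num_workers: int) -> List[float]:
    """Concatenate the partial outputs; no worker ever reads another worker's shard."""
    outputs: List[float] = []
    for shard in split_columns(weights, num_workers):
        outputs.extend(partial_forward(x, shard))
    return outputs
```

Because each shard owns a disjoint set of columns, the concatenated result equals the unpartitioned product `x @ weights`. In a real distributed setting the cost of gathering these partial outputs is part of what makes efficient model parallelism difficult.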

Comparison of Deep Learning Loss Function Performance for Medical Video Biomarker Extraction (의료 영상 바이오마커 추출을 위한 딥러닝 손실함수 성능 비교)

Seo, Jin-beom;Cho, Young-bok
- Proceedings of the Korean Institute of Information and Communication Sciences Conference, 2021.05a, pp.72-74, 2021
The deep learning process currently utilized in various fields consists of data preparation, data preprocessing, model generation, model learning, and model evaluation. In the process of model learning, the loss function compares the value of the model with the actual value and outputs the difference. In this paper, we analyze various loss functions used in the deep learning model for biomarker extraction, which measure the degree of loss of neural network output values, and try to find the best loss function through experiments.

A Study on the Establishment of Odor Management System in Gangwon-do Traditional Market

Min-Jae JUNG;Kwang-Yeol YOON;Sang-Rul KIM;Su-Hye KIM
- Journal of Wellbeing Management and Applied Psychology, v.6 no.2, pp.27-31, 2023
Purpose: Establishment of a real-time monitoring system for odor control in traditional markets in Gangwon-do and a system for linking prevention facilities. Research design, data and methodology: Server and system logic were built on data from a sensor-based real-time monitoring device. A temporary data generation program for deep learning was developed in order to build a model for odor data. Results: A REST API was developed for using the model prediction service, and a test was performed to find an algorithm with high prediction probability and parameter values optimized for learning. In the deep learning algorithm for AI modeling development, Pandas was used for data analysis and processing, and TensorFlow V2 (Keras) was used as the deep learning library. The activation function was swish, the model was optimized with Adam, the performance was measured with MSE, the model method was Functional API, and the model storage format was Sequential API (LSTM)/HDF5. Conclusions: The developed system has the potential to effectively monitor and manage odors in traditional markets. By utilizing real-time data, the system can provide timely alerts and facilitate preventive measures to control and mitigate odors. The AI modeling component enhances the system's predictive capabilities, allowing for proactive odor management.
https://doi.org/10.13106/jwmap.2023.Vol6.no2.27

A Study on Development Environments for Machine Learning (머신러닝 자동화를 위한 개발 환경에 관한 연구)

Kim, Dong Gil;Park, Yong-Soon;Park, Lae-Jeong;Chung, Tae-Yun
- IEMEK Journal of Embedded Systems and Applications, v.15 no.6, pp.307-316, 2020
The performance of a machine learning model is highly affected by its data. Preprocessing is needed to enable analysis of various types of data, such as letters, numbers, and special characters. This paper proposes a development environment with three stages: stage 1 processes categorical and continuous data according to the type of missing values, stage 2 selects the best-performing algorithm, and stage 3 automates the checking of model performance. Using this environment, machine learning models can be created without prior knowledge of data preprocessing.
https://doi.org/10.14372/IEMEK.2020.15.6.307

Determination of Optimal Adhesion Conditions for FDM Type 3D Printer Using Machine Learning

Woo Young Lee;Jong-Hyeok Yu;Kug Weon Kim
- Journal of Practical Engineering Education, v.15 no.2, pp.419-427, 2023
This study uses machine learning to investigate optimal adhesion conditions that alleviate defects caused by heat shrinkage in FDM type 3D printers. Machine learning is one of the "statistical methods of extracting the law from data" and can be classified into supervised learning, unsupervised learning, and reinforcement learning. Among them, supervised learning specialized for optimization is used to present a function model for adhesion between the bed and the output, which can be expected to reduce output defects in FDM type 3D printers by deriving conditions for optimum adhesion between the bed and the output. Machine learning code written in Python generates a function model that predicts the effect of operating variables on adhesion, using data obtained through adhesion testing. The adhesion prediction data and verification data were shown to be highly consistent, and the conclusions discuss the potential of this method.
https://doi.org/10.14702/JPEE.2023.419

Forecasting of Rental Demand for Public Bicycles Using a Deep Learning Model (딥러닝 모형을 활용한 공공자전거 대여량 예측에 관한 연구)

Cho, Keun-min;Lee, Sang-Soo;Nam, Doohee
- The Journal of The Korea Institute of Intelligent Transport Systems, v.19 no.3, pp.28-37, 2020
This study developed a deep learning model that predicts rental demand for public bicycles. For this, public bicycle rental data, weather data, and subway usage data were collected. After building an exponential smoothing model, an ARIMA model, and an LSTM-based deep learning model, forecasting errors were compared and evaluated using the MSE and MAE evaluation indicators. The exponential smoothing model produced an MSE of 348.74 and an MAE of 14.15. The ARIMA model produced an MSE of 170.10 and an MAE of 9.30. The deep learning model produced an MSE of 120.22 and an MAE of 6.76. Compared to the exponential smoothing model, the MSE of the ARIMA model decreased by 51% and the MAE by 34%, while the MSE of the deep learning model decreased by 66% and the MAE by 52%, making the deep learning model the one with the least error. These results show that the prediction error in public bicycle rental demand forecasting can be greatly reduced by applying the deep learning model.
https://doi.org/10.12815/kits.2020.19.3.28
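
The percentage reductions quoted in the abstract above can be reproduced from the reported MSE and MAE values. The following is a small check rather than the authors' code: the helper names (`mse`, `mae`, `percent_reduction`) and the dictionary layout are assumptions made for illustration, and only the six error values come from the abstract.

```python
from typing import Sequence


def mse(actual: Sequence[float], predicted: Sequence[float]) -> float:
    """Mean squared error between two equal-length series."""
    return sum((a - p) ** 2 for a, p in zip(actual, predicted)) / len(actual)


def mae(actual: Sequence[float], predicted: Sequence[float]) -> float:
    """Mean absolute error between two equal-length series."""
    return sum(abs(a - p) for a, p in zip(actual, predicted)) / len(actual)


def percent_reduction(baseline: float, value: float) -> float:
    """Relative decrease of `value` with respect to `baseline`, in percent."""
    return (baseline - value) / baseline * 100.0


# Error values as reported in the abstract.
errors = {
    "exponential smoothing": {"MSE": 348.74, "MAE": 14.15},
    "ARIMA": {"MSE": 170.10, "MAE": 9.30},
    "LSTM deep learning": {"MSE": 120.22, "MAE": 6.76},
}

baseline = errors["exponential smoothing"]
for name in ("ARIMA", "LSTM deep learning"):
    for metric in ("MSE", "MAE"):
        drop = percent_reduction(baseline[metric], errors[name][metric])
        print(f"{name} {metric}: {drop:.0f}% lower than exponential smoothing")
```

Rounded to whole percentages, the results agree with the 51%, 34%, 66%, and 52% figures stated in the abstract.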

Prediction of Chest Deflection Using Frontal Impact Test Results and Deep Learning Model (정면충돌 시험결과와 딥러닝 모델을 이용한 흉부변형량의 예측)

Kwon-Hee Lee;Jaemoon Lim
- Journal of Auto-vehicle Safety Association, v.15 no.1, pp.55-62, 2023
In this study, chest deflection is predicted by applying a deep learning technique to the results of the USNCAP frontal impact tests conducted for 110 car models from MY2018 to MY2020. The 120 data points are divided into training data and test data, and the training data is further divided into training data and validation data to determine the hyperparameters. In this process, the deceleration data of each vehicle is averaged in units of 10 ms from crash pulses measured up to 100 ms. The performance of the deep learning model is measured by the mean squared error and the mean absolute error on the test data. A DNN (Deep Neural Network) model can give different predictions on every run, even for the same hyperparameter values. Considering this, the mean and standard deviation of the MSE (Mean Squared Error) and the MAE (Mean Absolute Error) are calculated. In addition, the effect of including CVW (Curb Vehicle Weight) on the deep learning model's performance is also reviewed.
https://doi.org/10.22680/kasa2023.15.1.055
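
Two steps described in the abstract above lend themselves to a short sketch: averaging a crash pulse in 10 ms windows up to 100 ms, and summarizing an error metric over repeated training runs. This is an illustration under assumptions, not the authors' implementation. The function names, the uniform sampling interval, and the use of the sample (n-1) standard deviation are all hypothetical choices, since the abstract does not specify them.

```python
from statistics import mean, stdev
from typing import List, Sequence, Tuple


def window_means(samples: Sequence[float], sample_interval_ms: float,
                 window_ms: float = 10.0, total_ms: float = 100.0) -> List[float]:
    """Average a uniformly sampled deceleration pulse over consecutive windows.

    With the defaults this yields 10 features (one per 10 ms window up to 100 ms).
    """
    per_window = int(round(window_ms / sample_interval_ms))
    n_windows = int(round(total_ms / window_ms))
    if len(samples) < per_window * n_windows:
        raise ValueError("pulse is shorter than the requested time span")
    return [mean(samples[k * per_window:(k + 1) * per_window])
            for k in range(n_windows)]


def summarize_runs(scores: Sequence[float]) -> Tuple[float, float]:
    """Mean and sample standard deviation of a metric (e.g. MSE) over repeated runs."""
    return mean(scores), stdev(scores)
```

For example, a pulse sampled every 1 ms over 100 ms is reduced to ten window averages, and `summarize_runs` can be applied to the MSE (or MAE) values collected from several runs with identical hyperparameters.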

Trend of Utilization of Machine Learning Technology for Digital Healthcare Data Analysis (디지털 헬스케어 데이터 분석을 위한 머신 러닝 기술 활용 동향)

Woo, Y.C.;Lee, S.Y.;Choi, W.;Ahn, C.W.;Baek, O.K.
- Electronics and Telecommunications Trends, v.34 no.1, pp.98-110, 2019
Machine learning has been applied to medical imaging and has shown an excellent recognition rate. Recently, there has been much interest in preventive medicine. If data are accessible, machine learning packages can be used easily in digital healthcare fields. However, it is necessary to prepare the data in advance, and model evaluation and tuning are required to construct a reliable model. On average, these processes take more than 80% of the total effort required. In this study, we describe the basic concepts of machine learning, pre-processing and visualization of datasets, feature engineering for reliable models, model evaluation and tuning, and the latest trends in popular machine learning frameworks. Finally, we survey an explainable machine learning analysis tool and discuss the future direction of machine learning.
https://doi.org/10.22648/ETRI.2019.J.340109

Development of ensemble machine learning model considering the characteristics of input variables and the interpretation of model performance using explainable artificial intelligence (수질자료의 특성을 고려한 앙상블 머신러닝 모형 구축 및 설명가능한 인공지능을 이용한 모형결과 해석에 대한 연구)

Park, Jungsu
- Journal of Korean Society of Water and Wastewater, v.36 no.4, pp.239-248, 2022
The prediction of algal bloom is an important field of study in algal bloom management, and chlorophyll-a concentration (Chl-a) is commonly used to represent the status of algal bloom. In recent years, advanced machine learning algorithms have been increasingly used for the prediction of algal bloom. In this study, XGBoost (XGB), an ensemble machine learning algorithm, was used to develop a model to predict Chl-a in a reservoir. Daily observations of water quality and climate data were used for the training and testing of the model. In the first step of the study, the input variables were clustered into two groups (low and high value groups) based on the observed values of water temperature (TEMP), total organic carbon concentration (TOC), total nitrogen concentration (TN), and total phosphorus concentration (TP). For each of the four water quality items, two XGB models were developed using only the data in each clustered group (Model 1). The results were compared to the prediction of an XGB model developed using the entire data before clustering (Model 2). The model performance was evaluated using three indices, including the root mean squared error-observation standard deviation ratio (RSR). The model performance was improved using Model 1 for TEMP, TN, and TP, as the RSR of each model was 0.503, 0.477, and 0.493, respectively, while the RSR of Model 2 was 0.521. On the other hand, Model 2 shows better performance than Model 1 for TOC, where the RSR was 0.532. Explainable artificial intelligence (XAI) is an ongoing field of research in machine learning. Shapley value analysis, a novel XAI algorithm, was also used for the quantitative interpretation of the performance of the XGB model developed in this study.
https://doi.org/10.11001/jksww.2022.36.4.239
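
The RSR index mentioned in the abstract above is commonly defined as the RMSE divided by the standard deviation of the observations, which reduces to a ratio of two root sums of squares. The sketch below assumes that common definition; the abstract does not spell out the formula, and the function name `rsr` is chosen here for illustration.

```python
from math import sqrt
from typing import Sequence


def rsr(observed: Sequence[float], predicted: Sequence[float]) -> float:
    """RMSE-observation standard deviation ratio (lower is better, 0 is a perfect fit)."""
    if len(observed) != len(predicted) or not observed:
        raise ValueError("observed and predicted must be non-empty and equal in length")
    obs_mean = sum(observed) / len(observed)
    squared_error = sum((o - p) ** 2 for o, p in zip(observed, predicted))
    squared_deviation = sum((o - obs_mean) ** 2 for o in observed)
    if squared_deviation == 0:
        raise ValueError("observations have zero variance; RSR is undefined")
    return sqrt(squared_error) / sqrt(squared_deviation)
```

Under this definition, a model that always predicts the observed mean scores an RSR of 1.0, so the reported values of roughly 0.48 to 0.53 indicate errors of about half the spread of the observations.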

A Study on Adaptive Learning Model for Performance Improvement of Stream Analytics (실시간 데이터 분석의 성능개선을 위한 적응형 학습 모델 연구)

Ku, Jin-Hee
- Journal of Convergence for Information Technology, v.8 no.1, pp.201-206, 2018
Recently, as technologies for realizing artificial intelligence have become more common, machine learning has come into wide use. Machine learning provides insight by collecting large amounts of data, batch processing them, and taking a final action, but the effects of that action are not immediately integrated into the learning process. This paper proposes an adaptive learning model to improve the performance of real-time stream analysis, which is a major business issue. Adaptive learning generates the ensemble by adapting to the complexity of the data set, and the algorithm uses the data needed to determine the optimal data point to sample. In an experiment on six standard data sets, the adaptive learning model outperformed the simple machine learning model for classification in both learning time and accuracy. In particular, the support vector machine showed excellent performance at the end of all ensembles. Adaptive learning is expected to be applicable to a wide range of problems that need to be adaptively updated in the inference of changes in various parameters over time.
https://doi.org/10.22156/CS4SMB.2018.8.1.201
