• Title/Summary/Keyword: machine accuracy

Development of Artificial Intelligence Model for Predicting Citrus Sugar Content based on Meteorological Data (기상 데이터 기반 감귤 당도 예측 인공지능 모델 개발)

  • Seo, Dongmin
    • The Journal of the Korea Contents Association / v.21 no.6 / pp.35-43 / 2021
  • Citrus quality is generally determined by sugar content and acidity. Sugar content is especially important because it determines the taste of the fruit. Currently, the most common ways to measure citrus sugar content on farms are portable juice-based sugar meters and non-destructive sugar meters. Individuals can easily take these measurements, but their accuracy is inferior to that of the official NongHyup citrus machine. In particular, the error is 0.5 Brix or more, which is insufficient for field use. Therefore, in this paper, we propose an AI model that predicts citrus sugar content on unmeasured days to within an error of 0.5 Brix, based on previously collected citrus sugar content and meteorological data (average temperature, humidity, rainfall, solar radiation, and average wind speed). Performance evaluation confirmed that the proposed prediction model had a mean absolute error of 0.1154 for the Seongsan area and 0.1983 for the Hawon area of Jeju Island. Because the proposed model keeps the error below 0.5 Brix and supports predictive measurement, it is expected to be highly usable.
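
The mean absolute error reported in the abstract above can be illustrated with a minimal sketch. The function name and the sample Brix readings are hypothetical and are not taken from the paper; only the 0.5 Brix target and the two reported MAE values come from the abstract.

```python
def mean_absolute_error(measured, predicted):
    """Average of |measured - predicted| over paired readings."""
    if len(measured) != len(predicted) or not measured:
        raise ValueError("inputs must be non-empty and of equal length")
    return sum(abs(m - p) for m, p in zip(measured, predicted)) / len(measured)


# Hypothetical Brix readings for illustration only (not data from the paper).
measured_brix = [10.2, 10.5, 11.0, 11.4]
predicted_brix = [10.3, 10.4, 11.2, 11.3]

mae = mean_absolute_error(measured_brix, predicted_brix)

# The abstract's target is an error of 0.5 Brix or less.
TARGET_BRIX = 0.5
within_target = mae <= TARGET_BRIX

# MAE values reported in the abstract for the two areas.
reported_mae = {"Seongsan": 0.1154, "Hawon": 0.1983}
all_reported_within_target = all(v <= TARGET_BRIX for v in reported_mae.values())
```

Both reported values fall well inside the 0.5 Brix target, which is the basis for the abstract's usability claim.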

A hybrid intrusion detection system based on CBA and OCSVM for unknown threat detection (알려지지 않은 위협 탐지를 위한 CBA와 OCSVM 기반 하이브리드 침입 탐지 시스템)

  • Shin, Gun-Yoon;Kim, Dong-Wook;Yun, Jiyoung;Kim, Sang-Soo;Han, Myung-Mook
    • Journal of Internet Computing and Services / v.22 no.3 / pp.27-35 / 2021
  • With the development of the Internet, IT technologies such as IoT and the cloud have emerged, and countries and companies have built a wide range of systems. Because these systems generate and share vast amounts of data, a variety of systems are needed to detect threats and protect the critical data they contain, and such systems have been actively studied to date. Typical techniques include anomaly detection and misuse detection, which detect threats that are already known or that exhibit behavior different from normal. However, as IT technology advances, so do the technologies that threaten systems and the methods for detecting them. Advanced Persistent Threat (APT) attacks target national or corporate systems to steal important information and cause damage such as system outages. These threats use previously unknown malware and attack technologies. Therefore, in this paper, we propose a hybrid intrusion detection system that combines anomaly detection and misuse detection to detect unknown threats. The two detection techniques together enable the detection of both known and unknown threats, and applying machine learning makes threat detection more accurate. For misuse detection, we applied Classification Based on Association Rules (CBA) to generate rules for known threats, and for anomaly detection, we used One-Class SVM (OCSVM) to detect unknown threats. Experiments show an unknown-threat detection accuracy of about 94%, confirming that unknown threats can be detected.

A study on the 3-step classification algorithm for the diagnosis and classification of refrigeration system failures and their types (냉동시스템 고장 진단 및 고장유형 분석을 위한 3단계 분류 알고리즘에 관한 연구)

  • Lee, Kangbae;Park, Sungho;Lee, Hui-Won;Lee, Seung-Jae;Lee, Seung-hyun
    • Journal of the Korea Convergence Society / v.12 no.8 / pp.31-37 / 2021
  • As buildings grow larger with the urbanization that accompanies industrial development, the need to purify the air and maintain a comfortable indoor environment is also increasing. With the development of monitoring technology for refrigeration systems, it has become possible to manage the amount of electricity consumed in buildings. In particular, refrigeration systems account for about 40% of power consumption in commercial buildings. Therefore, to develop a refrigeration system failure diagnosis algorithm, this study aimed to understand the structure of the refrigeration system, collect and analyze data generated during its operation, and quickly detect and classify failure situations of various types and severities. In particular, to improve the classification accuracy for failure types that are difficult to classify, a three-step diagnosis and classification algorithm was developed and proposed. After a number of experiments and a hyper-parameter optimization process, a model based on SVM and LGBM was presented as the classification model suited to each stage. In this study, the characteristics affecting failure were preserved as much as possible, and all failure types, including refrigerant-related failures that had been difficult in previous studies, were classified with excellent results.

Comparison of Korean Classification Models' Korean Essay Score Range Prediction Performance (한국어 학습 모델별 한국어 쓰기 답안지 점수 구간 예측 성능 비교)

  • Cho, Heeryon;Im, Hyeonyeol;Yi, Yumi;Cha, Junwoo
    • KIPS Transactions on Software and Data Engineering / v.11 no.3 / pp.133-140 / 2022
  • We investigate the performance of deep learning-based Korean language models on the task of predicting the score range of Korean essays written by foreign students. We construct a data set containing a total of 304 essays, which include essays discussing the criteria for choosing a job ('job'), conditions of a happy life ('happ'), relationship between money and happiness ('econ'), and definition of success ('succ'). These essays were labeled according to four letter grades (A, B, C, and D), and a total of eleven essay score range prediction experiments were conducted (i.e., five for predicting the score range of 'job' essays, five for predicting the score range of 'happ' essays, and one for predicting the score range of mixed-topic essays). Three deep learning-based Korean language models, KoBERT, KcBERT, and KR-BERT, were fine-tuned using various training data. In addition, two traditional probabilistic machine learning classifiers, naive Bayes and logistic regression, were evaluated. Experimental results show that the deep learning-based Korean language models performed better than the two traditional classifiers, with KR-BERT performing best at 55.83% overall average prediction accuracy. A close second was KcBERT (55.77%), followed by KoBERT (54.91%). The naive Bayes and logistic regression classifiers achieved 52.52% and 50.28%, respectively. Due to the scarcity of training data and the imbalance in class distribution, the overall prediction performance was not high for any classifier. Moreover, the classifiers' vocabulary did not explicitly capture the error features that were helpful in correctly grading the Korean essays. By overcoming these two limitations, we expect the score range prediction performance to improve.

Deep Learning-based Stock Price Prediction Using Limit Order Books and News Headlines (호가창과 뉴스 헤드라인을 이용한 딥러닝 기반 주가 변동 예측 기법)

  • Ryoo, Euirim;Lee, Ki Yong;Chung, Yon Dohn
    • The Journal of Society for e-Business Studies / v.27 no.1 / pp.63-79 / 2022
  • Recently, various studies have been conducted on stock price prediction using machine learning and deep learning techniques. Among them, the latest studies have attempted to predict stock prices using limit order books, which contain the buy and sell order information of stocks. However, most studies using limit order books consider only the trend over the most recent period of a specified length, and few consider both the medium- and short-term trends of limit order books. Therefore, in this paper, we propose a deep learning-based prediction model that predicts stock prices more accurately by considering both the medium- and short-term trends of limit order books. Moreover, the proposed model considers news headlines from the same period to reflect the qualitative status of the company in the stock price prediction. The proposed model extracts features of changes in limit order books with CNNs and features of news headlines with Word2vec, and combines this information to predict whether a particular company's stock will rise or fall the next day. We conducted experiments to predict the daily stock price fluctuations of five stocks (Amazon, Apple, Facebook, Google, Tesla) with the proposed model using real NASDAQ limit order book data and news headline data, and the proposed model improved accuracy by up to 17.66%p and by 14.47%p on average. In addition, we conducted a simulated investment with the proposed model and earned a minimum of $492.46 and a maximum of $2,840.93, depending on the stock, over 21 business days.

Development and Validation of Digital Twin for Analysis of Plant Factory Airflow (식물공장 기류해석을 위한 디지털트윈 개발 및 실증)

  • Jeong, Jin-Lip;Won, Bo-Young;Yoo, Ho-Dong;Kim, Tag Gon;Kang, Dae-Hyun;Hong, Kyung-Jin
    • Journal of the Korea Society for Simulation / v.31 no.1 / pp.29-41 / 2022
  • As one alternative for addressing unstable food supply and the demand imbalance caused by abnormal climate change, the need for plant factories is increasing. Airflow in a plant factory is recognized as an important factor influencing plant transpiration and heat transfer. Meanwhile, the Digital Twin (DT) is attracting attention as a means of providing services that are impossible with the real system alone, by replicating the real system in the virtual world. This study aimed to develop a digital twin model that can predict airflow in various situations by applying the digital twin concept to a plant factory in operation. To this end, the mathematical formalism of the digital twin model for airflow analysis in plant factories is first presented, and based on it, the information necessary for airflow prediction modeling of a plant factory in operation is specified. Then, the shape of the plant factory is implemented in CAD, and the DT model is developed by combining computational fluid dynamics (CFD) components for airflow behavior analysis. Finally, the DT model for high-accuracy airflow prediction is completed through model validation and a machine learning-based calibration process that compares the simulation results of the DT model with actual airflow values collected from the plant factory.

Development of Cloud-Based Medical Image Labeling System and It's Quantitative Analysis of Sarcopenia (클라우드기반 의료영상 라벨링 시스템 개발 및 근감소증 정량 분석)

  • Lee, Chung-Sub;Lim, Dong-Wook;Kim, Ji-Eon;Noh, Si-Hyeong;Yu, Yeong-Ju;Kim, Tae-Hoon;Yoon, Kwon-Ha;Jeong, Chang-Won
    • KIPS Transactions on Computer and Communication Systems / v.11 no.7 / pp.233-240 / 2022
  • Most recent AI research has focused on developing AI models. Recently, however, artificial intelligence research has gradually shifted from model-centric to data-centric, and with this trend the importance of training data is receiving a great deal of attention. Preparing training data takes considerable time and effort because it occupies a significant part of the entire process, and the way labeling data is generated also differs depending on the purpose of development. Therefore, there is a need to develop a tool with various labeling functions to address existing unmet needs. In this paper, we describe a labeling system for creating precise labeling data of medical images quickly. To implement it, we built a semi-automatic method using Back Projection and GrabCut techniques and an automatic method based on predictions from a machine learning model. We showed not only the running-time advantage of the proposed system for generating labeling data, but also its superiority through a comparative evaluation of accuracy. In addition, by analyzing an image data set of about 1,000 patients, meaningful diagnostic indexes for the diagnosis of sarcopenia were presented for men and women.

Imputation of Missing SST Observation Data Using Multivariate Bidirectional RNN (다변수 Bidirectional RNN을 이용한 표층수온 결측 데이터 보간)

  • Shin, YongTak;Kim, Dong-Hoon;Kim, Hyeon-Jae;Lim, Chaewook;Woo, Seung-Buhm
    • Journal of Korean Society of Coastal and Ocean Engineers / v.34 no.4 / pp.109-118 / 2022
  • Missing sections of the vertex sea surface temperature observation data were imputed using a Bidirectional Recurrent Neural Network (BiRNN). Among artificial intelligence techniques, Recurrent Neural Networks (RNNs), which are commonly used for time series data, estimate only in the direction of time flow, or only in the reverse direction, toward the missing position, so their estimation performance is poor over long missing sections. In contrast, this study improves estimation performance even for long missing sections by estimating in both directions, from before and after the missing section. In addition, by using all available data around the observation point (sea surface temperature, temperature, wind field, atmospheric pressure, humidity) and estimating the imputed values from the correlations among them, the imputation performance was further improved. For performance verification, a statistical model, Multivariate Imputation by Chained Equations (MICE), a machine learning-based Random Forest model, and an RNN model using Long Short-Term Memory (LSTM) were compared. For imputation of a 7-day missing section, the average accuracy of the BiRNN and statistical models is 70.8% and 61.2%, respectively, and the average error is 0.28 degrees and 0.44 degrees, respectively, so the BiRNN model performs better than the other models. By applying a temporal decay factor representing the missing pattern, the BiRNN technique is judged to have better imputation performance than existing methods as the missing section becomes longer.

A Study on Prediction of PM2.5 Concentration Using DNN (Deep Neural Network를 활용한 초미세먼지 농도 예측에 관한 연구)

  • Choi, Inho;Lee, Wonyoung;Eun, Beomjin;Heo, Jeongsook;Chang, Kwang-Hyeon;Oh, Jongmin
    • Journal of Environmental Impact Assessment / v.31 no.2 / pp.83-94 / 2022
  • In this study, DNN-based models were trained using air quality data for 2017, 2019, and 2020 provided by the National Measurement Network (Air Korea), and these models were evaluated using data from 2016 and 2018. Based on a Pearson correlation coefficient of 0.2, four items (SO2, CO, NO2, PM10) were initially modeled as independent variables. To improve prediction accuracy, independent modeling was carried out for each month. The error was calculated as RMSE (Root Mean Square Error), and the RMSE of the initial model was 5.78, about 46% better than the result of the national moving average model (10.77). In addition, the independent monthly models improved on the initial model in every month except November. Therefore, this study confirms that DNN modeling was effective in predicting PM2.5 concentrations from air pollutant concentrations, and that the learning performance of the model could be improved by selecting additional independent variables.
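
The "about 46% better" figure in the abstract above follows from the two RMSE values it reports. The sketch below is only an illustration of that arithmetic; the function names are hypothetical, and reading "better" as the relative reduction in RMSE is an assumption, since the abstract does not state the formula.

```python
import math


def rmse(observed, predicted):
    """Root mean square error over paired observations and predictions."""
    if len(observed) != len(predicted) or not observed:
        raise ValueError("inputs must be non-empty and of equal length")
    squared = [(o - p) ** 2 for o, p in zip(observed, predicted)]
    return math.sqrt(sum(squared) / len(squared))


def relative_improvement(baseline_error, model_error):
    """Fraction by which model_error is lower than baseline_error (assumed definition)."""
    return (baseline_error - model_error) / baseline_error


# RMSE values reported in the abstract.
baseline_rmse = 10.77  # national moving average model
dnn_rmse = 5.78        # initial DNN model

improvement = relative_improvement(baseline_rmse, dnn_rmse)
print(f"{improvement:.1%}")  # prints 46.3%
```

Under that reading, (10.77 - 5.78) / 10.77 is roughly 0.463, which matches the reported figure.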

Development of an intelligent skin condition diagnosis information system based on social media

  • Kim, Hyung-Hoon;Ohk, Seung-Ho
    • Journal of the Korea Society of Computer and Information / v.27 no.8 / pp.241-251 / 2022
  • Diagnosis and management of customers' skin conditions is an essential function in the cosmetics and beauty industry. As the social media environment spreads to all fields of society, questions and answers about varied and delicate concerns and requirements regarding the diagnosis and management of skin conditions are being actively exchanged in social media communities. However, since social media information is highly diverse, unstructured big data, an intelligent skin condition diagnosis system that combines appropriate skin condition information analysis with artificial intelligence technology is necessary. In this paper, we developed the skin condition diagnosis system SCDIS, which intelligently diagnoses and manages customers' skin conditions by processing text analysis information from social media into training data. In SCDIS, an artificial neural network model, AnnTFIDF, was built and used to automatically diagnose skin condition types using artificial neural network technology, a deep learning method. The performance of AnnTFIDF was analyzed using test sample data, and its skin condition type predictions showed a high accuracy of about 95%. Based on the experimental and performance analysis results of this paper, SCDIS can be evaluated as an intelligent tool that can be used efficiently in skin condition analysis and diagnosis management in the cosmetics and beauty industry. This study can also serve as basic research for addressing new technology trends, customized cosmetics manufacturing, and the technology demands of the consumer-oriented beauty industry.