• Title/Summary/Keyword: Anomaly prediction

Study on Anomaly Detection Method of Improper Foods using Import Food Big data (수입식품 빅데이터를 이용한 부적합식품 탐지 시스템에 관한 연구)

  • Cho, Sanggoo;Choi, Gyunghyun
    • The Journal of Bigdata / v.3 no.2 / pp.19-33 / 2018
  • Owing to the growing number of free trade agreements (FTAs), expanding food trade, and diversifying consumer preferences, food imports have increased at a tremendous rate every year. Although inspection covers about 20% of total food imports, the budget and manpower available for the government's import inspection control are reaching their limits. Sudden accidents involving imported food can cause enormous social and economic losses. A predictive system that forecasts the compliance of imported food and supports preemptive measures would therefore greatly improve the efficiency and effectiveness of import safety control. A large volume of data has already been accumulated, and processed foods account for 75% of total food imports. Big data analysis and analytical techniques are used to extract meaningful information from such large amounts of data, yet few studies have analyzed imported food or drawn implications from food import big data. In this context, this study applied a variety of machine learning classification algorithms and proposed a data preprocessing method that generates new derived variables to improve model accuracy. The study also compared the performance of the predictive classification algorithms against general base classifiers. Among the base classifiers, the Gaussian Naïve Bayes model showed the best performance in detecting and predicting nonconforming imported food. The anomaly detection model based on Gaussian Naïve Bayes is expected to be applied in the future. The predictive model should reduce the burden of inspecting imported food and increase the rate at which nonconforming items are detected, which would greatly benefit the efficiency of food import safety control and the speed of import customs clearance.
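
The abstract names Gaussian Naïve Bayes as the best-performing base classifier but gives no implementation details. The following is a minimal from-scratch sketch of that classifier family, not the authors' model. The feature names, toy values, and the `var_floor` parameter are hypothetical and chosen only for illustration.

```python
import math
from collections import defaultdict


def fit_gaussian_nb(X, y, var_floor=1e-9):
    """Estimate a log prior and per-feature mean/variance for each class."""
    rows_by_class = defaultdict(list)
    for row, label in zip(X, y):
        rows_by_class[label].append(row)

    model = {}
    n_total = len(X)
    for label, rows in rows_by_class.items():
        n = len(rows)
        columns = list(zip(*rows))
        means = [sum(col) / n for col in columns]
        # var_floor (an assumption) avoids division by zero for constant features.
        variances = [
            sum((v - m) ** 2 for v in col) / n + var_floor
            for col, m in zip(columns, means)
        ]
        model[label] = (math.log(n / n_total), means, variances)
    return model


def predict(model, row):
    """Return the class with the highest log posterior for one sample."""
    best_label, best_score = None, -math.inf
    for label, (log_prior, means, variances) in model.items():
        score = log_prior
        for v, m, var in zip(row, means, variances):
            # Log of the Gaussian density for this feature given the class.
            score += -0.5 * math.log(2 * math.pi * var) - (v - m) ** 2 / (2 * var)
        if score > best_score:
            best_label, best_score = label, score
    return best_label


# Hypothetical toy features: [declared_weight_ratio, past_rejection_rate]
X = [[1.0, 0.01], [1.1, 0.02], [0.9, 0.00],
     [1.6, 0.30], [1.8, 0.25], [1.7, 0.35]]
y = ["conforming"] * 3 + ["nonconforming"] * 3

model = fit_gaussian_nb(X, y)
print(predict(model, [1.0, 0.01]))
print(predict(model, [1.7, 0.30]))
```

In practice the derived variables described in the abstract would replace the two toy features used here.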

433 MHz Radio Frequency and 2G based Smart Irrigation Monitoring System (433 MHz 무선주파수와 2G 통신 기반의 스마트 관개 모니터링 시스템)

  • Manongi, Frank Andrew;Ahn, Sung-Hoon
    • Journal of Appropriate Technology / v.6 no.2 / pp.136-145 / 2020
  • Agriculture is the backbone of the economy in most developing countries. In these countries, farming is mostly done manually, with little integration of machinery, intelligent systems, or data monitoring. Irrigation is an essential process that directly influences crop production, and fluctuating annual rainfall has led most farms to adopt irrigation systems. The absence of smart sensors, monitoring methods, and control has led to low harvests and depleted water sources. In this paper, we introduce a 433 MHz radio frequency and 2G based Smart Irrigation Meter System and a water prepayment system for rural areas of Tanzania with no reliable internet coverage. Specifically, the Ngurudoto area in the Arusha region will be used as a case study for data collection. The proposed system is hybrid, combining weather data (evapotranspiration) with soil moisture data. Its architecture comprises on-site weather measurement controllers, soil moisture sensors buried in the ground, water flow sensors, a solenoid valve, and a prepayment system. To achieve high precision in linear and nonlinear regression and to improve classification and prediction, this work cascades a Dynamic Regression Algorithm and a Naïve Bayes algorithm.

Development of an intelligent IIoT platform for stable data collection (안정적 데이터 수집을 위한 지능형 IIoT 플랫폼 개발)

  • Woojin Cho;Hyungah Lee;Dongju Kim;Jae-hoi Gu
    • The Journal of the Convergence on Culture Technology / v.10 no.4 / pp.687-692 / 2024
  • The energy crisis is emerging as a serious problem around the world. In Korea, there is great interest in energy efficiency research on industrial complexes, which use more than 53% of the country's total energy and account for more than 45% of its greenhouse gas emissions. One line of this research, called a virtual energy network plant, aims to save energy by sharing facilities between factories that use the same utility within an industrial complex and by enabling transactions between energy-producing and energy-consuming factories. In such energy-saving research, data collection is very important because the data serve various purposes, such as analysis and prediction. However, existing systems have had several shortcomings in reliably collecting time series data. In this study, we propose an intelligent IIoT platform to address these shortcomings. The platform includes a preprocessing system that identifies abnormal data and processes it in a timely manner, classifies abnormal and missing data, and presents interpolation techniques to maintain stable time series data. Additionally, time series data collection is streamlined through database optimization. This paper contributes to increasing data usability in industrial environments through stable data collection and rapid problem response, and to reducing the burden of data collection and optimizing monitoring load by introducing a variety of chatbot notification systems.
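
The preprocessing step described above (classifying abnormal and missing readings, then interpolating) can be illustrated with a short sketch. The abstract does not specify the rules used, so the range check for "abnormal" values, the choice of linear interpolation, and the function name `clean_series` are all assumptions made for illustration.

```python
def clean_series(values, low, high):
    """Label each reading and fill missing/abnormal ones by linear interpolation.

    Assumptions (not from the paper): None marks a missing reading, and a
    reading outside [low, high] is treated as abnormal.
    """
    labels, kept = [], []
    for v in values:
        if v is None:
            labels.append("missing")
            kept.append(None)
        elif not (low <= v <= high):
            labels.append("abnormal")
            kept.append(None)
        else:
            labels.append("ok")
            kept.append(float(v))

    valid = [i for i, v in enumerate(kept) if v is not None]
    filled = list(kept)
    if not valid:
        return labels, filled  # nothing to interpolate from

    # Leading and trailing gaps: hold the nearest valid value.
    for i in range(valid[0]):
        filled[i] = kept[valid[0]]
    for i in range(valid[-1] + 1, len(kept)):
        filled[i] = kept[valid[-1]]

    # Interior gaps: interpolate linearly between neighbouring valid readings.
    for a, b in zip(valid, valid[1:]):
        for i in range(a + 1, b):
            t = (i - a) / (b - a)
            filled[i] = kept[a] + t * (kept[b] - kept[a])
    return labels, filled


# Hypothetical sensor readings sampled at a fixed interval.
readings = [10.0, None, 14.0, 999.0, 18.0]
labels, filled = clean_series(readings, low=0.0, high=100.0)
print(labels)
print(filled)
```

Keeping the labels alongside the filled series lets downstream analysis distinguish measured values from interpolated ones.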

Development of a complex failure prediction system using Hierarchical Attention Network (Hierarchical Attention Network를 이용한 복합 장애 발생 예측 시스템 개발)

  • Park, Youngchan;An, Sangjun;Kim, Mintae;Kim, Wooju
    • Journal of Intelligence and Information Systems / v.26 no.4 / pp.127-148 / 2020
  • A data center is a physical facility that houses computer systems and related components, and it is an essential foundation for next-generation core industries such as big data, smart factories, wearables, and smart homes. With the growth of cloud computing in particular, a proportional expansion of data center infrastructure is inevitable. Monitoring the health of data center facilities is a way to maintain and manage the system and prevent failures. A failure in one element of the facility may affect not only that equipment but also other connected equipment, and may cause enormous damage. Failures in IT facilities, in particular, occur irregularly because of interdependence, and their causes are difficult to identify. Previous studies on data center failure prediction treated each server as a single, isolated state, without assuming that devices interact. In this study, therefore, data center failures were classified into failures occurring inside the server (Outage A) and failures occurring outside the server (Outage B), and the analysis focused on complex failures occurring within servers. Server-external failures include power, cooling, and user errors. Because such failures can be prevented in the early stages of data center construction, various solutions are being developed. The cause of a failure occurring inside a server, on the other hand, is difficult to determine, and adequate prevention has not yet been achieved. This is because server failures do not occur in isolation: a failure may trigger failures on other servers or may itself be triggered by something received from other servers. In other words, whereas existing studies analyzed failures under the assumption that each server is independent of the others, this study analyzes failures under the assumption that servers affect one another.
To define complex failure situations in the data center, failure history data for each piece of equipment in the data center was used. Four major failure types are considered in this study: Network Node Down, Server Down, Windows Activation Services Down, and Database Management System Service Down. Failures for each device were sorted in chronological order, and when a failure on one device was followed by a failure on another device within 5 minutes, the failures were defined as occurring simultaneously. After sequences were built for the devices that failed at the same time, the 5 devices that most frequently failed together within those sequences were selected, and the cases in which the selected devices failed simultaneously were confirmed through visualization. Because the server resource information collected for failure analysis is sequential time series data, we used Long Short-Term Memory (LSTM), a deep learning algorithm that predicts the next state from previous states. In addition, for the multi-server case, a Hierarchical Attention Network model structure was used because each server contributes to complex failures to a different degree. This approach increases prediction accuracy by assigning greater weight to servers that have a greater impact on the failure. The study began by defining the failure types and selecting the analysis targets. In the first experiment, the same collected data was analyzed under a single-server assumption and under a multiple-server assumption, and the results were compared. The second experiment improved prediction accuracy in the multiple-server case by optimizing the threshold for each server.
In the first experiment, under the single-server assumption, three of the five servers were predicted not to have failed even though failures actually occurred. Under the multiple-server assumption, however, all five servers were predicted to have failed. This result supports the hypothesis that servers affect one another. The study confirmed that prediction performance was superior under the multiple-server assumption compared with the single-server assumption. In particular, applying the Hierarchical Attention Network algorithm, which assumes that each server has a different effect, helped improve the analysis. Applying a different threshold to each server further improved prediction accuracy. This study showed that failures whose causes are difficult to determine can be predicted from historical data, and it presents a model that can predict failures occurring in data center servers. The results are expected to help prevent failures in advance.
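
The 5-minute rule for defining simultaneous failures can be sketched as follows. The abstract does not say whether the window is anchored at the first failure or chained from each subsequent one, so this sketch assumes anchoring at the first failure of each group. The device names and timestamps are hypothetical.

```python
from datetime import datetime, timedelta


def group_simultaneous(events, window=timedelta(minutes=5)):
    """Group (timestamp, device) failures that fall within `window` of the
    first failure in the group (an assumed interpretation of the rule).

    Only groups involving more than one distinct device are returned,
    since a single-device group is not a complex failure.
    """
    groups = []
    for ts, device in sorted(events):  # chronological order
        if groups and ts - groups[-1][0][0] <= window:
            groups[-1].append((ts, device))
        else:
            groups.append([(ts, device)])
    return [g for g in groups if len({d for _, d in g}) > 1]


# Hypothetical failure history.
t0 = datetime(2020, 1, 1, 12, 0)
events = [
    (t0 + timedelta(minutes=30), "srv-C"),
    (t0, "srv-A"),
    (t0 + timedelta(minutes=2), "srv-B"),
    (t0 + timedelta(minutes=4), "db-1"),
    (t0 + timedelta(minutes=50), "srv-A"),
    (t0 + timedelta(minutes=53), "srv-C"),
]

complex_failures = group_simultaneous(events)
for group in complex_failures:
    print([device for _, device in group])
```

Counting how often each device appears across the returned groups would give a starting point for selecting the most frequently co-failing devices, as the abstract describes.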

Impacts of Argo temperature in East Sea Regional Ocean Model with a 3D-Var Data Assimilation (동해 해양자료동화시스템에 대한 Argo 자료동화 민감도 분석)

  • KIM, SOYEON;JO, YOUNGSOON;KIM, YOUNG-HO;LIM, BYUNGHWAN;CHANG, PIL-HUN
    • The Sea: Journal of the Korean Society of Oceanography / v.20 no.3 / pp.119-130 / 2015
  • The impacts of Argo temperature assimilation on the analysis fields in the East Sea are investigated using DAESROM, the East Sea Regional Ocean Model with a 3-dimensional variational assimilation module (Kim et al., 2009). Specifically, we produced analysis fields for 2009 in which temperature profiles, sea surface temperature (SST), and sea surface height (SSH) anomalies were assimilated (Exp. AllDa), and carried out an additional experiment that withheld Argo temperature data (Exp. NoArgo). When the two experiments are compared against the assimilated temperature profiles, the Root Mean Square Error (RMSE) of Exp. AllDa is generally lower than that of Exp. NoArgo. The Argo impacts are particularly large in the subsurface layer, where the RMSE difference is about $0.5^{\circ}C$. Argo impacts on the current and temperature fields in the surface layer are investigated using observations from 14 surface drifters. In general, surface currents along the drifter positions are improved in Exp. AllDa, and large RMSE differences (about 2.0 to 6.0 cm/s) between the experiments are found for drifters that observed for longer periods in the southern region, where Argo density was high. Argo impacts on the SST fields, on the other hand, are negligible, and SST assimilation at a 1-day interval is considered to have the dominant effect. As with the surface current fields, the SSH fields of the two experiments also differ significantly in the southern East Sea, for example in the southwestern Yamato Basin, where anticyclonic circulation develops. The comparison of SSH fields implies that SSH assimilation does not correct the SSH difference caused by withholding Argo data. Argo assimilation thus plays an important role in reproducing meso-scale circulation features in the East Sea.
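
The experiment comparison above rests on RMSE against observed profiles. A minimal sketch of that calculation follows. All numbers are made up for illustration and are not values from the study, and the variable names `all_da` and `no_argo` simply mirror the experiment labels.

```python
import math


def rmse(model_values, observed_values):
    """Root mean square error between model output and observations."""
    if len(model_values) != len(observed_values) or not observed_values:
        raise ValueError("inputs must be non-empty and of equal length")
    squared = [(m - o) ** 2 for m, o in zip(model_values, observed_values)]
    return math.sqrt(sum(squared) / len(squared))


# Hypothetical temperatures (deg C) at four depths; not data from the paper.
observed = [10.0, 8.0, 6.0, 4.0]
all_da = [11.0, 7.0, 7.0, 3.0]    # stand-in for Exp. AllDa
no_argo = [12.0, 6.0, 8.0, 2.0]   # stand-in for Exp. NoArgo

rmse_all_da = rmse(all_da, observed)
rmse_no_argo = rmse(no_argo, observed)
print(rmse_all_da, rmse_no_argo, rmse_no_argo - rmse_all_da)
```

A positive difference `rmse_no_argo - rmse_all_da` corresponds to the case where assimilating Argo data brings the analysis closer to the observations.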

Long-term Predictability for El Nino/La Nina using PNU/CME CGCM (PNU/CME CGCM을 이용한 엘니뇨/라니냐 장기 예측성 연구)

  • Jeong, Hye-In;Ahn, Joong-Bae
    • The Sea: Journal of the Korean Society of Oceanography / v.12 no.3 / pp.170-177 / 2007
  • In this study, the long-term predictability of El Nino and La Nina events in the Pusan National University Coupled General Circulation Model (PNU/CME CGCM), developed under a Research and Development Grant funded by the Korea Meteorological Administration (KMA), was examined in terms of skill scores and the correlation coefficients of sea surface temperature between the model and observations in the tropical Pacific. For this purpose, the long-term global climate was hindcast with PNU/CME CGCM for 12 months starting from April, July, October, and January (APR RUN, JUL RUN, OCT RUN, and JAN RUN, respectively) of every year between 1979 and 2004. Each 12-month hindcast consisted of 5 ensemble members. Relatively high correlation was maintained throughout the 12-month lead hindcasts in the equatorial Pacific for all four RUNs starting in different months. The predictability of our CGCM in forecasting equatorial SST anomalies is found to be more pronounced within a 6-month lead time. To assess the model's capability in predicting El Nino and La Nina, skill scores such as the hit rate and false alarm rate were calculated. According to the results, PNU/CME CGCM has good predictability in forecasting warm and cold events, despite a relatively poor capability in predicting the normal state of the equatorial Pacific. The predictability of our CGCM was also compared with that of other CGCMs participating in the DEMETER project. The comparative analysis showed that our CGCM has reasonable long-term predictability, comparable to that of the DEMETER CGCMs. In conclusion, PNU/CME CGCM can predict El Nino and La Nina events at least 12 months ahead in terms of the Nino 3.4 SST anomaly, with much better predictability within a 6-month lead time.
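
The abstract mentions hit rate and false alarm rate without defining them. The sketch below uses the common 2x2 contingency-table definitions (hit rate = hits / (hits + misses); false alarm rate = false alarms / (false alarms + correct negatives)). Whether the paper uses exactly these definitions is an assumption, and the event series is hypothetical.

```python
def skill_scores(forecast, observed):
    """Compute hit rate and false alarm rate for binary event forecasts.

    forecast[i] and observed[i] are True when an event (for example, a warm
    episode) was forecast or observed in period i. Returns None for a score
    whose denominator is zero.
    """
    hits = misses = false_alarms = correct_negatives = 0
    for f, o in zip(forecast, observed):
        if f and o:
            hits += 1
        elif not f and o:
            misses += 1
        elif f and not o:
            false_alarms += 1
        else:
            correct_negatives += 1

    events = hits + misses
    non_events = false_alarms + correct_negatives
    hit_rate = hits / events if events else None
    false_alarm_rate = false_alarms / non_events if non_events else None
    return hit_rate, false_alarm_rate


# Hypothetical event series for six forecast periods.
forecast = [True, True, False, False, True, False]
observed = [True, False, False, False, True, True]

hit_rate, false_alarm_rate = skill_scores(forecast, observed)
print(hit_rate, false_alarm_rate)
```

The same function can be applied separately to warm, cold, and normal categories by recoding each category as its own binary event.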