Search | Korea Science

A Study of Anomaly Detection for ICT Infrastructure using Conditional Multimodal Autoencoder (ICT 인프라 이상탐지를 위한 조건부 멀티모달 오토인코더에 관한 연구)

Shin, Byungjin;Lee, Jonghoon;Han, Sangjin;Park, Choong-Shik
- Journal of Intelligence and Information Systems / v.27 no.3 / pp.57-73 / 2021
Maintaining ICT infrastructure and preventing failures through anomaly detection is becoming increasingly important. System monitoring data is multidimensional time series data, and it is difficult to account for the characteristics of multidimensional data and of time series data at the same time. With multidimensional data, correlations between variables must be considered. Existing probability-based, linear, and distance-based methods degrade because of the curse of dimensionality. In addition, time series data is typically preprocessed with a sliding window and time series decomposition for autocorrelation analysis. These techniques increase the dimension of the data, so they need to be supplemented. Anomaly detection is a long-established research field; statistical methods and regression analysis were used in the early days, and there is now active research on applying machine learning and artificial neural networks. Statistical methods are difficult to apply when the data is non-homogeneous, and they do not detect local outliers well. Regression-based methods learn a regression formula based on parametric statistics and detect anomalies by comparing predicted values with actual values. Their performance drops when the model is not robust or when the data contains noise or outliers, which imposes the restriction that training data free of noise and outliers be used. An autoencoder built on artificial neural networks is trained to produce output as similar as possible to its input. It has many advantages over existing probabilistic and linear models, cluster analysis, and supervised learning: it can be applied to data that does not satisfy a probability distribution or linearity assumption, and it can be trained in an unsupervised manner, without labeled data. However, the autoencoder is limited in identifying local outliers in multidimensional data, and the dimension of the data increases greatly because of the characteristics of time series data. In this study, we propose the CMAE (Conditional Multimodal Autoencoder), which improves anomaly detection performance by considering local outliers and time series characteristics. First, we applied a Multimodal Autoencoder (MAE) to address the limitation in identifying local outliers in multidimensional data. Multimodal models are commonly used to learn from different types of input, such as voice and image. The different modalities share the autoencoder's bottleneck, and the model learns the correlations between them. In addition, a CAE (Conditional Autoencoder) was used to learn the characteristics of time series data effectively without increasing the dimension of the data. Conditional inputs are usually categorical variables, but in this study time was used as the condition in order to learn periodicity. The proposed CMAE model was verified by comparison with the Unimodal Autoencoder (UAE) and the Multimodal Autoencoder (MAE). The reconstruction performance of the autoencoders for 41 variables was checked in the proposed model and the comparison models. Reconstruction performance differs by variable; reconstruction works well for the Memory, Disk, and Network modalities in all three autoencoder models, as their loss values are small. The Process modality showed no significant difference across the three models, and the CPU modality showed excellent performance in CMAE. ROC curves were prepared to evaluate anomaly detection performance in the proposed and comparison models, and AUC, accuracy, precision, recall, and F1-score were compared. On all indicators, performance ranked in the order CMAE, MAE, UAE. In particular, recall was 0.9828 for CMAE, which confirms that it detects almost all anomalies. The accuracy of the model also improved, to 87.12%, and the F1-score was 0.8883, which is considered suitable for anomaly detection. From a practical standpoint, the proposed model has an advantage beyond the performance improvement. Techniques such as time series decomposition and sliding windows add procedures that must be managed, and the dimensional increase they cause can slow down inference. The proposed model is easy to apply in practice in terms of inference speed and model management.
https://doi.org/10.13088/jiis.2021.27.3.057
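The abstract above states that time is used as a condition to learn periodicity without increasing the data dimension, but it does not say how time is encoded. The sketch below is one common way to do this, a sine/cosine encoding of the hour of day, and is an assumption rather than the authors' method; the function names are hypothetical. It only contrasts the input size of a conditioned sample with that of a flattened sliding window.

```python
import math
from typing import List, Tuple


def encode_time_of_day(hour: float, period: float = 24.0) -> Tuple[float, float]:
    """Map a time value onto the unit circle so that 23:00 and 00:00 end up close.

    Assumption: a cyclic sin/cos encoding; the abstract does not specify one.
    """
    angle = 2.0 * math.pi * (hour % period) / period
    return (math.sin(angle), math.cos(angle))


def conditional_input(metrics: List[float], hour: float) -> List[float]:
    """Append the time condition to one monitoring sample (adds only 2 values)."""
    return list(metrics) + list(encode_time_of_day(hour))


def sliding_window_input(history: List[List[float]]) -> List[float]:
    """Flatten a window of past samples; the dimension grows with the window length."""
    return [value for sample in history for value in sample]


sample = [0.5] * 41  # 41 variables, as in the abstract; the values are placeholders
print(len(conditional_input(sample, 13.5)))      # 41 metrics + 2 condition values
print(len(sliding_window_input([sample] * 10)))  # 41 metrics * 10 time steps
```

With a hypothetical window of 10 steps, the flattened input is roughly ten times larger than the conditioned one, which is the dimensional cost the abstract refers to.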

Improving Bidirectional LSTM-CRF model Of Sequence Tagging by using Ontology knowledge based feature (온톨로지 지식 기반 특성치를 활용한 Bidirectional LSTM-CRF 모델의 시퀀스 태깅 성능 향상에 관한 연구)

Jin, Seunghee;Jang, Heewon;Kim, Wooju
- Journal of Intelligence and Information Systems / v.24 no.1 / pp.253-266 / 2018
This paper proposes a method that applies sequence tagging to improve the performance of NER (Named Entity Recognition) in a QA system. To retrieve the correct answers stored in a database, the user's query must be translated into a database language such as SQL (Structured Query Language) so that the computer can interpret it. This requires identifying the class or data names in the database that the query refers to. The existing approach, which looks up the words of the query in the database and recognizes objects from the matches, cannot handle homonyms and multi-word phrases because it does not consider the context of the user's query. If there are multiple search results, all of them are returned, so a query can have many interpretations and the time complexity of the computation becomes large. To overcome these problems, this study reflects the contextual meaning of the query using a Bidirectional LSTM-CRF. We also address a weakness of neural network models, their inability to identify untrained words, by using ontology knowledge-based features. Experiments were conducted on an ontology knowledge base for the music domain, and performance was evaluated. To evaluate the proposed L-Bidirectional LSTM-CRF accurately, we replaced words in the trained queries with untrained words and tested whether the model correctly identified words that are in the database but were not seen in training. The results show that the model recognizes objects in context and recognizes untrained words without retraining the L-Bidirectional LSTM-CRF model, and that overall object recognition performance improves.
https://doi.org/10.13088/jiis.2018.24.1.253
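The abstract above does not describe how the ontology knowledge-based feature is built. As a rough illustration of the general idea, the sketch below marks each query token with a one-hot vector over ontology classes when the token is found in a knowledge base; such a vector could be concatenated with word embeddings before the tagger. The tiny ontology, the class names, and the function name are all hypothetical, and multi-word entries are not handled.

```python
from typing import Dict, List

# Hypothetical music-domain lookup: surface form -> ontology class.
ONTOLOGY: Dict[str, str] = {
    "beatles": "Artist",
    "yesterday": "Track",
    "abbey": "Album",
}
CLASSES: List[str] = ["Artist", "Track", "Album"]


def ontology_features(tokens: List[str],
                      ontology: Dict[str, str],
                      classes: List[str]) -> List[List[int]]:
    """Return one one-hot class vector per token (all zeros if not in the ontology)."""
    features = []
    for token in tokens:
        vector = [0] * len(classes)
        found = ontology.get(token.lower())
        if found in classes:
            vector[classes.index(found)] = 1
        features.append(vector)
    return features


query = ["Play", "Yesterday", "by", "Beatles"]
for token, vector in zip(query, ontology_features(query, ONTOLOGY, CLASSES)):
    print(token, vector)
```

Because the feature comes from a lookup rather than from trained weights, a word added to the knowledge base after training still produces a non-zero vector, which is one plausible reading of how untrained words could be recognized without retraining.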

Implementation of Reporting Tool Supporting OLAP and Data Mining Analysis Using XMLA (XMLA를 사용한 OLAP과 데이타 마이닝 분석이 가능한 리포팅 툴의 구현)

Choe, Jee-Woong;Kim, Myung-Ho
- Journal of KIISE:Computing Practices and Letters / v.15 no.3 / pp.154-166 / 2009
Database query and reporting tools, OLAP tools, and data mining tools are the typical front-end tools of a Business Intelligence (BI) environment, which supports gathering, consolidating, and analyzing data produced by business operations and gives enterprise users access to the results. Traditional reporting tools have the advantage of creating sophisticated dynamic reports that include SQL query result sets and look like documents produced by word processors, and of publishing those reports to the Web, but their data source is limited to an RDBMS. OLAP tools and data mining tools, on the other hand, each provide powerful information analysis functions in their own way, but their built-in visualization components for analysis results are limited to tables and a few charts. This paper therefore presents a system that integrates the three typical front-end tools so that they complement one another in a BI environment. Traditional reporting tools only have a query editor for generating SQL statements that fetch data from an RDBMS. The reporting tool presented in this paper can also extract data from OLAP and data mining servers, because editors for OLAP and data mining query requests are added to it. Traditional systems produce all documents on the server side, which lets reporting tools avoid regenerating a document when many clients access the same dynamic document. Because this system targets a small number of users who generate documents for data analysis, the tool generates documents on the client side. The tool therefore has a processing mechanism for handling large amounts of data despite the limited memory capacity of the report viewer on the client side. The reporting tool also has a data structure for integrating data from the three kinds of data sources into one document. Finally, most traditional front-end tools for BI depend on the data source architecture of a specific vendor. To overcome this problem, the system uses XMLA, a web service-based protocol, to access data sources for OLAP and data mining services from various vendors.

Incremental Ensemble Learning for The Combination of Multiple Models of Locally Weighted Regression Using Genetic Algorithm (유전 알고리즘을 이용한 국소가중회귀의 다중모델 결합을 위한 점진적 앙상블 학습)

Kim, Sang Hun;Chung, Byung Hee;Lee, Gun Ho
- KIPS Transactions on Software and Data Engineering / v.7 no.9 / pp.351-360 / 2018
The LWR (Locally Weighted Regression) model, traditionally a lazy learning model, obtains a prediction for a given input, the query point, by fitting a regression equation over a short interval in which points closer to the query point receive higher weights. We study an incremental ensemble learning approach for LWR, a form of lazy, memory-based learning. The proposed method sequentially generates LWR models over time and integrates them using a genetic algorithm to obtain a solution at a specific query point. A weakness of existing LWR models is that multiple LWR models can be generated depending on the indicator function and the selection of data samples, and the quality of the predictions varies with the model. However, no research has addressed the selection or combination of multiple LWR models. In this study, after generating an initial LWR model from the indicator function and the sample data set, we iterate an evolutionary learning process to obtain a proper indicator function, and we assess the LWR models on other sample data sets to overcome data set bias. We adopt an eager learning method to generate and store LWR models gradually as data is generated for all sections. To obtain a prediction at a specific point in time, an LWR model is generated from newly generated data within a predetermined interval and then combined with the existing LWR models in that section using a genetic algorithm. The proposed method shows better results than combining multiple LWR models by simple averaging. The results are also compared with predictions from multiple regression analysis on real data such as hourly traffic volume in a specific area and hourly sales at a highway rest area.
https://doi.org/10.3745/KTSDE.2018.7.9.351
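As a minimal sketch of the basic LWR prediction step the abstract above describes (a local regression in which points near the query point receive higher weights), the code below fits a weighted straight line around a single query point. It assumes one input variable and a Gaussian weighting kernel with a hypothetical `bandwidth` parameter; the paper's indicator function and its genetic-algorithm ensemble are not reproduced here.

```python
import math
from typing import Sequence


def lwr_predict(xs: Sequence[float], ys: Sequence[float],
                query: float, bandwidth: float) -> float:
    """Predict y at `query` with a locally weighted linear fit (1-D inputs).

    Assumption: Gaussian weights; nearer points get weights closer to 1.
    """
    weights = [math.exp(-((x - query) ** 2) / (2.0 * bandwidth ** 2)) for x in xs]
    sw = sum(weights)
    sx = sum(w * x for w, x in zip(weights, xs))
    sy = sum(w * y for w, y in zip(weights, ys))
    sxx = sum(w * x * x for w, x in zip(weights, xs))
    sxy = sum(w * x * y for w, x, y in zip(weights, xs, ys))
    denom = sw * sxx - sx * sx
    if abs(denom) < 1e-12:
        # Degenerate case (e.g. a single distinct x): fall back to the weighted mean.
        return sy / sw
    slope = (sw * sxy - sx * sy) / denom
    intercept = (sy - slope * sx) / sw
    return intercept + slope * query


# Illustrative data only: points on the line y = 2x + 1.
xs = [0.0, 1.0, 2.0, 3.0, 4.0]
ys = [2.0 * x + 1.0 for x in xs]
print(lwr_predict(xs, ys, query=2.5, bandwidth=1.0))
```

The model is rebuilt for each query point, which is why LWR is called lazy learning; different bandwidths or sample selections give different models for the same query, which is the situation the ensemble in the abstract is meant to handle.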

Development of the Information Delivery System for the Home Nursing Service (가정간호사업 운용을 위한 정보전달체계 개발 I (가정간호 데이터베이스 구축과 뇌졸중 환자의 가정간호 전산개발))

Park, J.H;Kim, M.J;Hong, K.J;Han, K.J;Park, S.A;Yung, S.N;Lee, I.S;Joh, H.;Bang, K.S
- Journal of Home Health Care Nursing / v.4 / pp.5-22 / 1997
The purpose of this study was to develop an information delivery system for the home nursing service (HNS), and to demonstrate and evaluate its efficiency. The research was conducted from September 1996 to August 31, 1997. In the first stage, an assessment tool was first developed through literature review for patients with cerebrovascular disease, who have the highest priority for HNS among patients with various health problems at home. Second, after the home care nurse identified the patient's nursing problems with the assessment tool, the patient classification system developed by Park (1988), which comprises 128 nursing activities under 6 categories, was used to identify the home care nurse's activities for patients with CVA at home. The research team held several workshops with 5 clinical nurse experts to refine it. In the end, 110 nursing activities under 11 categories were derived for patients with CVA. In the second stage, algorithms were developed to connect the 110 nursing activities with the patient nursing problems identified by the assessment tool. The algorithms were computerized as follows. They were implemented as a computer program using software engineering techniques. Development followed the prototyping method, which serves as requirement analysis for the software specifications. Usability, compatibility, adaptability, and maintainability were taken into consideration as basic features. Particular emphasis was given to efficient construction of the database. To enhance database efficiency and establish structural cohesion, the data fields are categorized by weight of relevance to the particular disease. This approach allows easy adaptation when numerous diseases are added in the future. In parallel, expandability and maintainability were stressed throughout program development, which led to a modular design. Because the number of diseases covered increases as the project progresses, and because they are interrelated and coupled with each other, expandability and maintainability should be given high priority. Furthermore, since the system is to be integrated with other medical systems in the future, these properties are very important. The prototype developed in this project is to be evaluated in the system testing stage. There are various evaluation metrics, such as cohesion, coupling, and adaptability. Unfortunately, direct measurement of these metrics is very difficult, and accordingly analytical and quantitative evaluation is almost impossible. Therefore, instead of analytical evaluation, experimental evaluation is to be applied through test runs by various users. This system testing will provide an analysis from the user's point of view, and the detailed and additional requirement specifications arising from users' real situations will be fed back into the system modeling. Also, the degree of freedom of input and output will be improved, and hardware limitations will be investigated. After refinement, the prototype system will be used as a design template for developing a more extensive system. Specifically, the relevant modules will be developed for the various diseases, and the modules will be integrated through a macroscopic design process focusing on inter-modularity, generality of the database, and compatibility with other systems. The Home Care Evaluation System comprises three main modules: (1) general information on a patient, (2) general health status of a patient, and (3) cerebrovascular disease patient. The general health status module has five submodules: physical measurement, vitality, nursing, pharmaceutical description, and emotional/cognitive ability. The CVA patient module is divided into ten submodules, such as subjective sense, consciousness, memory, and language pattern. The typical submodules are described in appendix 3.

A Study on Transcranial Magnetic Electrode Simulation Using Maxwell 3D (Maxwell 3D를 이용한 경두개 자기 전극 시뮬레이션에 관한 연구)

Lee, Geun-Yong;Yoon, Se-Jin;Jeong, Jin-hyoung;Kim, Jun-Tae;Lee, Sang-sik
- The Journal of Korea Institute of Information, Electronics, and Communication Technology / v.12 no.6 / pp.657-665 / 2019
In this study, we investigated the transcranial magnetic electrode, a method for studying muscle pain and dementia, a neurodegenerative disease that is becoming a worldwide problem as societies age. In particular, transcranial magnetic electrodes have been studied as a way to improve abilities impaired by dementia, such as speech, cognition, and memory, by delivering magnetism deep into the brain using coils placed on the scalp. In this study, simulation was performed with the Maxwell 3D program to design the coil, the core of the transcranial magnetic electrode. Comparing simulations of the coil designed in previous research with the coil developed in this research, the output of the new coil was found to be superior to that of the conventionally designed coil. The graphs of the B-field and H-field coil outputs are symmetrical, but the symmetry between the coils is approximate rather than exact. Based on these results, an experiment was conducted to confirm whether output at the scalp through both coils is possible. In the magnitude field of the reverse 2-coil analysis, the maximum H-field output was 3.3920e+004 A/m, and the vector field showed the strongest magnetic field at around 35 to 165 degrees. It was confirmed that the magnetic output was canceled. In the forward 2-coil case, a maximum of 3.2348e+004 A/m, similar to the reverse coil, was observed, and in the vector field the magnetic output in the forward direction and at the scalp was confirmed. However, when the height of the output coil was changed, the magnetic output decreased.
https://doi.org/10.17661/jkiiect.2019.12.6.657

A Spatio-Temporal Clustering Technique for the Moving Object Path Search (이동 객체 경로 탐색을 위한 시공간 클러스터링 기법)

Lee, Ki-Young;Kang, Hong-Koo;Yun, Jae-Kwan;Han, Ki-Joon
- Journal of Korea Spatial Information System Society / v.7 no.3 s.15 / pp.67-81 / 2005
Recently, with the development of Geographic Information Systems, interest in and research on new application services such as Location Based Services and Telematics, which provide emergency services, neighborhood information search, and route search, have been increasing. A user's search in a spatio-temporal database used for Location Based Services or Telematics usually fixes the current time on the time axis and queries the spatial and aspatial attributes. Thus, if the query range on the time axis is wide, it is difficult to process the search efficiently. To solve this problem, the snapshot, a method of summarizing the location data of moving objects, was introduced. However, if the range of data to be stored is wide, more storage space is required, and snapshots are created even for space that is not frequently searched. The snapshot method therefore tends to consume unnecessary storage space and memory. In this paper, we propose the Hash-based Spatio-Temporal Clustering Algorithm (H-STCA), which extends the two-dimensional spatial hash algorithm previously used for spatial clustering to a three-dimensional spatial hash algorithm in order to overcome the disadvantages of the snapshot method. We also propose a knowledge extraction algorithm that, based on H-STCA, extracts knowledge for the path search of moving objects from past location data. In the performance evaluation on a huge amount of moving object data, the snapshot clustering method using H-STCA outperformed the spatio-temporal index methods and the original snapshot method in search time, storage structure construction time, and optimal path search time. In particular, the more moving objects there were, the greater the performance improvement of the snapshot clustering method using H-STCA over the existing spatio-temporal index methods and the original snapshot method.
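The abstract above does not give the details of H-STCA. The sketch below only illustrates the general idea it names, extending a two-dimensional spatial hash with a third, temporal axis: each observation (x, y, t) is mapped to a grid cell key, and observations sharing a key form one bucket. The cell size, time slot length, and function names are hypothetical.

```python
import math
from collections import defaultdict
from typing import Dict, Iterable, List, Tuple

Observation = Tuple[str, float, float, float]  # (object id, x, y, t)
CellKey = Tuple[int, int, int]


def cell_key(x: float, y: float, t: float,
             cell_size: float, time_slot: float) -> CellKey:
    """Hash a position and timestamp to a 3-D grid cell (two spatial axes plus time)."""
    return (math.floor(x / cell_size),
            math.floor(y / cell_size),
            math.floor(t / time_slot))


def cluster_by_cell(observations: Iterable[Observation],
                    cell_size: float, time_slot: float) -> Dict[CellKey, List[str]]:
    """Group object ids by the spatio-temporal cell their observation falls into."""
    buckets: Dict[CellKey, List[str]] = defaultdict(list)
    for obj_id, x, y, t in observations:
        buckets[cell_key(x, y, t, cell_size, time_slot)].append(obj_id)
    return dict(buckets)


# Illustrative observations only: coordinates in metres, time in seconds.
observations = [
    ("a", 1.0, 1.0, 5.0),
    ("b", 2.0, 3.0, 8.0),
    ("c", 15.0, 1.0, 5.0),
    ("a", 1.5, 1.5, 65.0),
]
print(cluster_by_cell(observations, cell_size=10.0, time_slot=60.0))
```

Only cells that actually contain observations get a bucket, so empty regions cost no storage, which is in the spirit of the storage concern the abstract raises about plain snapshots.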

COMPARATIVE STUDY OF BEHAVIOR AND COGNITIVE FUNCTION BY ADMINISTRATION OF METHYLPHENIDATE AND IMIPRAMINE IN ATTENTION DEFICIT-HYPERACTIVITY DISORDER (Methylphenidate와 Imipramine투여에 따른 주의력 결핍·과잉운동장애 환아의 행동 및 인지기능 변화에 대한 연구)

Ahn, D.H;Hong, K.E;Oh, K.J;Shin, M.S;Yoo, B.C;Chung, K.M
- Journal of the Korean Academy of Child and Adolescent Psychiatry / v.3 no.1 / pp.26-45 / 1992
This study presents the behavioral and cognitive changes following administration of methylphenidate (MPH) and imipramine (IMI) for the treatment of attention-deficit hyperactivity disorder (ADHD) in children aged 5½ to 12 years referred to child psychiatric clinics. Behavioral changes were assessed with parents' and teachers' ratings. Drug effects on attention, short-term memory, and impulsivity were evaluated with psychological tests in the laboratory. The changes were assessed twice over an 8-week period. The data were analyzed separately for the 15 subjects on each drug using repeated-measures analysis of variance (ANOVA). The findings indicate that behavioral and cognitive impairments are improved by both drugs, but impulsivity is not. MPH is superior to IMI in improving attentional problems; in particular, the findings indicate important differences between simple tasks and complex, perceptual-search tasks. These data confirm the effectiveness of MPH for the treatment of ADHD and also raise questions about the methods for assessing attention and impulsivity, as well as about the importance of impulsivity in ADHD.

A Study on the Cubism - In it's relation to Bergsonian Philosophy and Simultaneity - (큐비즘에 관한 연구 - 베르그송 철학과 동시성 개념을 중심으로 -)

Ryu, Ji-Seok;Oh, Chan-Ohk
- Archives of design research / v.18 no.3 s.61 / pp.117-128 / 2005
The French Belle Epoque was a period of very active literary and artistic movements. The birth of cubism reflects this atmosphere of the times and the change of paradigm in all fields. Bergsonism is often cited as one of the important backgrounds of cubism. The question is whether Bergsonian ideas had a real influence on the cubist movement, and to what extent. Our analysis shows that the influence is not homogeneous and varies greatly from painter to painter. In the case of Picasso and Braque, it seems to be a simple inspiration from the Zeitgeist. But the influence on Metzinger and Gleizes is explicit: their 1912 text, Du "Cubisme", proves their attachment to his thought. The key concept of cubist theory influenced by Bergsonian philosophy is simultaneity. Cubist simultaneity is on the one hand a reflection of the artist's psychological experience and on the other hand a synthesis of multiple views for grasping the object in itself by way of conceptual representation. Temporal simultaneity can be identified with the notion of memory, a temporal continuity connecting the past to the dynamic present. Spatial simultaneity is a juxtaposition of multiple views obtained by moving around the object. But a close reading of Bergson's text shows that there is a divergence between the notion of cubist simultaneity and his ideas. As history shows, a biased interpretation is often, just as much as a strict understanding, a great source of inspiration and creativity. The cubist movement is not far from this case.

Data collection strategy for building rainfall-runoff LSTM model predicting daily runoff (강수-일유출량 추정 LSTM 모형의 구축을 위한 자료 수집 방안)

Kim, Dongkyun;Kang, Seokkoo
- Journal of Korea Water Resources Association / v.54 no.10 / pp.795-805 / 2021
In this study, an LSTM-based deep learning model for estimating daily runoff in the Soyang River Dam basin was developed, and the accuracy of the model was investigated for various combinations of model structure and input data. A model was built on a database consisting of average daily precipitation, average daily temperature, and average daily wind speed (inputs) and daily average flow rate (output) for the first 12 years (1997.1.1-2008.12.31). The Nash-Sutcliffe Model Efficiency Coefficient (NSE) and RMSE were examined for validation using the flow discharge data of the later 12 years (2009.1.1-2020.12.31). The combination with the highest accuracy used all available input data (12 years of daily precipitation, temperature, and wind speed) with an LSTM structure of 64 hidden units. The NSE and RMSE for the validation period were 0.862 and 76.8 m³/s, respectively. When the number of LSTM hidden units exceeds 500, performance degradation due to overfitting begins to appear, and when it exceeds 1000, the overfitting problem becomes prominent. A model with very high performance (NSE=0.80-0.84) could be obtained when only 12 years of daily precipitation was used for training. A model with reasonably high performance (NSE=0.63-0.85) could be obtained when only one year of input data was used for training. In particular, an accurate model (NSE=0.85) could be obtained if the one year of training data contained a wide range of flow events, such as extreme flows and droughts as well as normal events. If the training data includes both normal and extreme flow rates, input data longer than 5 years did not significantly improve model performance.
https://doi.org/10.3741/JKWRA.2021.54.10.795
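For readers unfamiliar with the two validation metrics named in the abstract above, the sketch below computes them from their standard definitions: NSE is one minus the ratio of the squared simulation error to the variance of the observations around their mean, and RMSE is the root of the mean squared error. The flow values are made up for illustration and are not from the paper; the function names are hypothetical.

```python
import math
from typing import Sequence


def nse(observed: Sequence[float], simulated: Sequence[float]) -> float:
    """Nash-Sutcliffe efficiency: 1 is a perfect fit, 0 matches the observed mean."""
    mean_obs = sum(observed) / len(observed)
    error = sum((o - s) ** 2 for o, s in zip(observed, simulated))
    spread = sum((o - mean_obs) ** 2 for o in observed)
    # Note: undefined (division by zero) if all observed values are identical.
    return 1.0 - error / spread


def rmse(observed: Sequence[float], simulated: Sequence[float]) -> float:
    """Root mean squared error, in the same unit as the flow data (e.g. m³/s)."""
    error = sum((o - s) ** 2 for o, s in zip(observed, simulated))
    return math.sqrt(error / len(observed))


# Illustrative daily flows in m³/s (not from the paper).
observed = [10.0, 20.0, 30.0, 40.0, 50.0]
simulated = [12.0, 18.0, 33.0, 39.0, 47.0]
print(nse(observed, simulated), rmse(observed, simulated))
```

A simulation that always returns the mean observed flow scores NSE = 0, so the values of 0.63 to 0.862 quoted in the abstract measure improvement over that baseline.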
