• Title/Summary/Keyword: Data normalization

Search Result 488, Processing Time 0.026 seconds

LSTM-based Business Process Remaining Time Prediction Model Featured in Activity-centric Normalization Techniques (액티비티별 특징 정규화를 적용한 LSTM 기반 비즈니스 프로세스 잔여시간 예측 모델)

  • Ham, Seong-Hun;Ahn, Hyun;Kim, Kwanghoon Pio
    • Journal of Internet Computing and Services
    • /
    • v.21 no.3
    • /
    • pp.83-92
    • /
    • 2020
  • Recently, many companies and organizations are interested in predictive process monitoring for the efficient operation of business process models. Traditional process monitoring focused on the elapsed execution state of a particular process instance. On the other hand, predictive process monitoring focuses on predicting the future execution status of a particular process instance. In this paper, we implement the function of the business process remaining time prediction, which is one of the predictive process monitoring functions. In order to effectively model the remaining time, normalization by activity is proposed and applied to the predictive model by taking into account the difference in the distribution of time feature values according to the properties of each activity. In order to demonstrate the superiority of the predictive performance of the proposed model in this paper, it is compared with previous studies through event log data of actual companies provided by 4TU.Centre for Research Data.

2D ECG Compression Using Optimal Sorting Scheme (정렬과 평균 정규화를 이용한 2D ECG 신호 압축 방법)

  • Lee, Kyu-Bong;Joo, Young-Bok;Han, Chan-Ho;Huh, Kyung-Moo;Park, Kil-Houm
    • Journal of the Institute of Electronics Engineers of Korea SC
    • /
    • v.46 no.4
    • /
    • pp.23-27
    • /
    • 2009
  • In this paper, we propose an effective compression method for electrocardiogram (ECG) signals. 1-D ECG signals are reconstructed to 2-D ECG data by period and complexity sorting schemes with image compression techniques to increase inter and intra-beat correlation. The proposed method added block division and mean-period normalization techniques on top of conventional 2-D data ECG compression methods. JPEG 2000 is chosen for compression of 2-D ECG data. Standard MIT-BIH arrhythmia database is used for evaluation and experiment. The results show that the proposed method outperforms compared to the most recent literature especially in case of high compression rate.

OrdinalEncoder based DNN for Natural Gas Leak Prediction (천연가스 누출 예측을 위한 OrdinalEncoder 기반 DNN)

  • Khongorzul, Dashdondov;Lee, Sang-Mu;Kim, Mi-Hye
    • Journal of the Korea Convergence Society
    • /
    • v.10 no.10
    • /
    • pp.7-13
    • /
    • 2019
  • The natural gas (NG), mostly methane leaks into the air, it is a big problem for the climate. detected NG leaks under U.S. city streets and collected data. In this paper, we introduced a Deep Neural Network (DNN) classification of prediction for a level of NS leak. The proposed method is OrdinalEncoder(OE) based K-means clustering and Multilayer Perceptron(MLP) for predicting NG leak. The 15 features are the input neurons and the using backpropagation. In this paper, we propose the OE method for labeling target data using k-means clustering and compared normalization methods performance for NG leak prediction. There five normalization methods used. We have shown that our proposed OE based MLP method is accuracy 97.7%, F1-score 96.4%, which is relatively higher than the other methods. The system has implemented SPSS and Python, including its performance, is tested on real open data.

Monitoring of Gene Regulations Using Average Rank in DNA Microarray: Implementation of R

  • Park, Chang-Soon
    • Journal of the Korean Data and Information Science Society
    • /
    • v.18 no.4
    • /
    • pp.1005-1021
    • /
    • 2007
  • Traditional procedures for DNA microarray data analysis are to preprocess and normalize the gene expression data, and then to analyze the normalized data using statistical tests. Drawbacks of the traditional methods are: genuine biological signal may be unwillingly eliminated together with artifacts, the limited number of arrays per gene make statistical tests difficult to use the normality assumption or nonparametric method, and genes are tested independently without consideration of interrelationships among genes. A novel method using average rank in each array is proposed to eliminate such drawbacks. This average rank method monitors differentially regulated genes among genetically different groups and the selected genes are somewhat different from those selected by traditional P-value method. Addition of genes selected by the average rank method to the traditional method will provide better understanding of genetic differences of groups.

  • PDF

Program Development of Integrated Expression Profile Analysis System for DNA Chip Data Analysis (DNA칩 데이터 분석을 위한 유전자발연 통합분석 프로그램의 개발)

  • 양영렬;허철구
    • KSBB Journal
    • /
    • v.16 no.4
    • /
    • pp.381-388
    • /
    • 2001
  • A program for integrated gene expression profile analysis such as hierarchical clustering, K-means, fuzzy c-means, self-organizing map(SOM), principal component analysis(PCA), and singular value decomposition(SVD) was made for DNA chip data anlysis by using Matlab. It also contained the normalization method of gene expression input data. The integrated data anlysis program could be effectively used in DNA chip data analysis and help researchers to get more comprehensive analysis view on gene expression data of their own.

  • PDF

Research on the Normalization Schemes for Insolvent Development Site on Mutual Savings Banks (상호저축은행 부실PF사업장 정상화 방안)

  • Shin, Jong-Chil;Baik, Min-Seok
    • Journal of the Korea Academia-Industrial cooperation Society
    • /
    • v.16 no.1
    • /
    • pp.195-204
    • /
    • 2015
  • This study analyzed the normalization cases of the mutual savings bank insolvency PF (MSBIPF) to suggest the appropriate improvements according to the purpose. The results were as follows. First, the original intention to normalize the MSBIPF was unsuccessful. This may be caused by the daunting situation of the real estate market along with the complex and shared interests. On the other hand, it can be responsible for the lack of evidence and related regulations as well as the lukewarm attitude on public projects. Active institutional settings are warranted to compensate the remaining insolvent businesses to PF even today and in the future. The data related to the recognized sites as the poorest 32 PF sites was compared primarily to normalize by KAMCO and the relevant sites. The area variable was the only significant variable according to the correlation analysis and logit analysis. The direct investment, diverse PF-backed bonds and the activation of the Ritz can be suggested as alternative ways of normalization with respect to the issue of the KAMCO.

The Predictive Factors of the Serum Creatine Kinase Level Normalization Time in Patients with Rhabdomyolysis due to Doxylamine Ingestion (독시라민 중독으로 발생한 횡문근융해증 환자에게서 혈중 크레아틴인산활성화효소 수치가 정상화되는 시기를 예측할 수 있는 인자)

  • Shin, Min-Chul;Kwon, Oh-Young;Lee, Jong-Suk;Choi, Han-Sung;Hong, Hoon-Pyo;Ko, Young-Gwan
    • Journal of The Korean Society of Clinical Toxicology
    • /
    • v.7 no.2
    • /
    • pp.156-163
    • /
    • 2009
  • Purpose: Doxylamine succinate (DS) is frequently used to treat insomnia and it may induce rhabdomyolysis in the overdose cases. The purpose of this study is to evaluate the factors that can predict the serum creatine kinase (CK) level normalization time for patients with rhabdomyolysis due to DS ingestion. Methods: This study was conducted on 71 patients who were admitted with rhabdomyolysis after DS ingestion during the period from January 2000 to July 2009. Rhabdomyolysis was defined as a serum CK level over 1,000 U/L. The collected data included the general characteristics, the anticholinergic symptoms, the ingested dose, the peak serum CK level, the time interval (TI) from the event to the peak CK level and the TI from the event to a CK level below 1,000 U/L. We evaluated the correlation between the patients' variables and the TI from the event to the peak CK level time and the time for a CK level below 1,000 U/L. Results: The mean ingested dose per body weight (BW) was $30.86{\pm}18.63\;mg/kg$ and the mean TI from the event to treatment was $4.04{\pm}3.67$ hours. The TI from the event to the peak CK level was longer for the patients with a larger ingestion dose per BW (r=0.587, p<0.05). The CK normalization time was longer for the patients with a larger ingested dose per BW (r=0.446, p<0.05) and a higher peak CK level (r=0.634, p<0.05). Conclusion: The ingested dose per BW was correlated with the TI from the event to the peak CK level, and the ingested dose per BW and the peak CK level have significant correlations with the CK normalization time. These factors may be used to determine the discharge period of patients who had rhabdomyolysis following a OS overdose.

  • PDF

DEVELOPMENT OF AN AUTOMATIC PROCESSING PROGRAM FOR BOES DATA (BOES 관측데이터의 자동처리 프로그램 개발)

  • Kang, Dong-Il;Park, Hong-Suh;Han, In-Woo;Valyavin, G.;Lee, Byeong-Cheol;Kim, Kang-Min
    • Publications of The Korean Astronomical Society
    • /
    • v.20 no.1 s.24
    • /
    • pp.97-107
    • /
    • 2005
  • We developed a data reduction program (RX) to process BOES data automatically. It processes a whole set of data taken during one night automatically - preprocessing, extraction to one-dimensional spectra and wavelength calibration. The execution is very fast and the performance looks pretty good. We described the performance of this program, comparing its procedure with that of IRAF. RX does not have functions for continuum normalization yet. We will develop those functions in the next works.

Derivation of Design Low Flows by Transformation Method

  • 이순혁;명성진
    • Magazine of the Korean Society of Agricultural Engineers
    • /
    • v.37 no.E
    • /
    • pp.1-9
    • /
    • 1995
  • It is shown that two step power transformation is more efficient for the normalization of frequency distribution with the coefficient of skewness of zero in comparison with others including SMEMAX and power transformations. It is confirmed that the design low flows calculated using power and two step power transformations used in this study are generally nearer to the observed data as compared with those of SMEMAX transformation at all return periods in the applied watersheds of the Kum, Naktong and Yongsan rivers in Korea.

  • PDF

Text-dependent Speaker Verification System Over Telephone Lines (전화망을 위한 어구 종속 화자 확인 시스템)

  • 김유진;정재호
    • Proceedings of the IEEK Conference
    • /
    • 1999.11a
    • /
    • pp.663-667
    • /
    • 1999
  • In this paper, we review the conventional speaker verification algorithm and present the text-dependent speaker verification system for application over telephone lines and its result of experiments. We apply blind-segmentation algorithm which segments speech into sub-word unit without linguistic information to the speaker verification system for training speaker model effectively with limited enrollment data. And the World-mode] that is created from PBW DB for score normalization is used. The experiments are presented in implemented system using database, which were constructed to simulate field test, and are shown 3.3% EER.

  • PDF