• Title/Summary/Keyword: data sets

Search Result 3,763, Processing Time 0.03 seconds

A Review of Practical Use and Research Trends on Nursing Management Minimum Data Sets (NMMDS) (Nursing Management Minimum Data Sets (NMMDS) 연구의 최신 동향)

  • Jung, Myun Sook;Park, Jung In;Delaney, Connie W.;Westra, Bonnie L.
    • Journal of Korean Academy of Nursing Administration
    • /
    • v.20 no.4
    • /
    • pp.405-413
    • /
    • 2014
  • Purpose: The purpose of this study was to review articles on Nursing Management Minimum Data Sets (NMMDS) and to suggest strategies to improve practical use of NMMDS in nursing management. Methods: A systematic search for articles published until 2013 was undertaken using the following biomedical databases: CINAHL, PubMed, and Google scholar. Seventeen articles were fully reviewed. Results: The results showed that studies were related to updating NMMDS reflecting current EHR use, mapping NMMDS to standardized national databases, and validating, translating and evaluating NMMDS for international uses. NMMDS has three dimensions and was developed reflecting the needs of nurse managers. Conclusion: The study findings provide a summary of recent trends in NMMDS. These results can serve as basic information to promote practical use of NMMDS in the healthcare organization to provide nursing management data for nurse managers.

Parameter Estimation and Comparison for SRGMs and ARIMA Model in Software Failure Data

  • Song, Kwang Yoon;Chang, In Hong;Lee, Dong Su
    • Journal of Integrative Natural Science
    • /
    • v.7 no.3
    • /
    • pp.193-199
    • /
    • 2014
  • As the requirement on the quality of the system has increased, the reliability is very important part in terms of enhance stability and to provide high quality services to customers. Many statistical models have been developed in the past years for the estimation of software reliability. We consider the functions for NHPP software reliability model and time series model in software failure data. We estimate parameters for the proposed models from three data sets. The values of SSE and MSE is presented from three data sets. We compare the predicted number of faults with the actual three data sets using the NHPP software reliability model and time series model.

Improving the Error Back-Propagation Algorithm for Imbalanced Data Sets

  • Oh, Sang-Hoon
    • International Journal of Contents
    • /
    • v.8 no.2
    • /
    • pp.7-12
    • /
    • 2012
  • Imbalanced data sets are difficult to be classified since most classifiers are developed based on the assumption that class distributions are well-balanced. In order to improve the error back-propagation algorithm for the classification of imbalanced data sets, a new error function is proposed. The error function controls weight-updating with regards to the classes in which the training samples are. This has the effect that samples in the minority class have a greater chance to be classified but samples in the majority class have a less chance to be classified. The proposed method is compared with the two-phase, threshold-moving, and target node methods through simulations in a mammography data set and the proposed method attains the best results.

A Feature Vector Selection Method for Cancer Classification

  • Yun, Zheng;Keong, Kwoh-Chee
    • Proceedings of the Korean Society for Bioinformatics Conference
    • /
    • 2005.09a
    • /
    • pp.23-28
    • /
    • 2005
  • The high-dimensionality and insufficiency of gene expression profiles and proteomic profiles makes feature selection become a critical step in efficiently building accurate models for cancer problems based on such data sets. In this paper, we use a method, called Discrete Function Learning algorithm, to find discriminatory feature vectors based on information theory. The target feature vectors contain all or most information (in terms of entropy) of the class attribute. Two data sets are selected to validate our approach, one leukemia subtype gene expression data set and one ovarian cancer proteomic data set. The experimental results show that the our method generalizes well when applied to these insufficient and high-dimensional data sets. Furthermore, the obtained classifiers are highly understandable and accurate.

  • PDF

Empirical modeling of flexural and splitting tensile strengths of concrete containing fly ash by GEP

  • Saridemir, Mustafa
    • Computers and Concrete
    • /
    • v.17 no.4
    • /
    • pp.489-498
    • /
    • 2016
  • In this paper, the flexural strength ($f_{fs}$) and splitting tensile strength ($f_{sts}$) of concrete containing different proportions of fly ash have been modeled by using gene expression programming (GEP). Two GEP models called GEP-I and GEP-II are constituted to predict the $f_{fs}$ and $f_{sts}$ values, respectively. In these models, the age of specimen, cement, water, sand, aggregate, superplasticizer and fly ash are used as independent input parameters. GEP-I model is constructed by 292 experimental data and trisected into 170, 86 and 36 data for training, testing and validating sets, respectively. Similarly, GEP-II model is constructed by 278 experimental data and trisected into 142, 70 and 66 data for training, testing and validating sets, respectively. The experimental data used in the validating set of these models are independent from the training and testing sets. The results of the statistical parameters obtained from the models indicate that the proposed empirical models have good prediction and generalization capability.

A Construction of Fuzzy Model for Data Mining

  • Kim, Do-Wan;Joo, Young-Hoon;Park, Jin-Bae
    • Journal of the Korean Institute of Intelligent Systems
    • /
    • v.13 no.2
    • /
    • pp.209-215
    • /
    • 2003
  • A new GA-based methodology using information granules is suggested for the construction of fuzzy classifiers. The proposed scheme consists of three steps: selection of information granules, construction of the associated fuzzy sets, and tuning of the fuzzy rules. First, the genetic algorithm (GA) is applied to the development of the adequate information granules. The fuzzy sets are then constructed from the analysis of the developed information granules. An interpretable fuzzy classifier is designed by using the constructed fuzzy sets. Finally, the GA are utilized for tuning of the fuzzy rules, which can enhance the classification performance on the misclassified data (e.g., data with the strange pattern or on the boundaries of the classes). To show the effectiveness of the proposed method, an example, the classification of the Iris data, is provided.

Comparison and Analysis of P2P Botnet Detection Schemes

  • Cho, Kyungsan;Ye, Wujian
    • Journal of the Korea Society of Computer and Information
    • /
    • v.22 no.3
    • /
    • pp.69-79
    • /
    • 2017
  • In this paper, we propose our four-phase life cycle of P2P botnet with corresponding detection methods and the future direction for more effective P2P botnet detection. Our proposals are based on the intensive analysis that compares existing P2P botnet detection schemes in different points of view such as life cycle of P2P botnet, machine learning methods for data mining based detection, composition of data sets, and performance matrix. Our proposed life cycle model composed of linear sequence stages suggests to utilize features in the vulnerable phase rather than the entire life cycle. In addition, we suggest the hybrid detection scheme with data mining based method and our proposed life cycle, and present the improved composition of experimental data sets through analysing the limitations of previous works.

A Prediction Model Based on Relevance Vector Machine and Granularity Analysis

  • Cho, Young Im
    • International Journal of Fuzzy Logic and Intelligent Systems
    • /
    • v.16 no.3
    • /
    • pp.157-162
    • /
    • 2016
  • In this paper, a yield prediction model based on relevance vector machine (RVM) and a granular computing model (quotient space theory) is presented. With a granular computing model, massive and complex meteorological data can be analyzed at different layers of different grain sizes, and new meteorological feature data sets can be formed in this way. In order to forecast the crop yield, a grey model is introduced to label the training sample data sets, which also can be used for computing the tendency yield. An RVM algorithm is introduced as the classification model for meteorological data mining. Experiments on data sets from the real world using this model show an advantage in terms of yield prediction compared with other models.

Intelligent Intrusion Detection Systems Using the Asymmetric costs of Errors in Data Mining (데이터 마이닝의 비대칭 오류비용을 이용한 지능형 침입탐지시스템 개발)

  • Hong, Tae-Ho;Kim, Jin-Wan
    • The Journal of Information Systems
    • /
    • v.15 no.4
    • /
    • pp.211-224
    • /
    • 2006
  • This study investigates the application of data mining techniques such as artificial neural networks, rough sets, and induction teaming to the intrusion detection systems. To maximize the effectiveness of data mining for intrusion detection systems, we introduced the asymmetric costs with false positive errors and false negative errors. And we present a method for intrusion detection systems to utilize the asymmetric costs of errors in data mining. The results of our empirical experiment show our intrusion detection model provides high accuracy in intrusion detection. In addition the approach using the asymmetric costs of errors in rough sets and neural networks is effective according to the change of threshold value. We found the threshold has most important role of intrusion detection model for decreasing the costs, which result from false negative errors.

  • PDF

Recognition and classification of dimension set for automatic input of mechanical drawings (기계 도면의 자동 입력을 위한 치수 집합의 인식 및 분류)

  • 정윤수;박길흠
    • Journal of the Korean Institute of Telematics and Electronics S
    • /
    • v.34S no.11
    • /
    • pp.114-125
    • /
    • 1997
  • This paper presents a method that automatically recognizes dimension sets from the mechanical drawings, and that classifies 6 types dimension sets according to functional purpose. In the proposed method, the object and closed-loop symbols are separated from the character-free drawings. Then object lines and interpretation lines are vectorized. And, after recognizing dimension sets(consistings of arrowhead, shape line, tail lines, extension lines, text-string, and feature control frame), we classify recognized dimension sets as horizontal, vertical, angular, diametral, radial, and leader dimension sets. Finally the proposed method converts classified dimension sets into AutoCAD data by using AutoLisp language. By using the methods of geometric modeling, the proposed method readily recognized and classifies dimension sets from complex drawings. Experimetnal results are presented, which are obtained by applying the proposed method to drawings drawn in compliance with the KS drafting standard.

  • PDF