• Title/Summary/Keyword: NASA software dataset


Hybrid Fuzzy Neural Networks by Means of Information Granulation and Genetic Optimization and Its Application to Software Process

  • Park, Byoung-Jun;Oh, Sung-Kwun;Lee, Young-Il
    • International Journal of Fuzzy Logic and Intelligent Systems, v.7 no.2, pp.132-137, 2007
  • Experimental software data capturing the essence of software projects (expressed, e.g., in terms of their complexity and development time) have been a subject of intensive modeling. In this study, we introduce a new category of Hybrid Fuzzy Neural Networks (gHFNN) and discuss their comprehensive design methodology. The gHFNN architecture results from highly synergistic linkages between Fuzzy Neural Networks (FNN) and Polynomial Neural Networks (PNN). We develop a rule-based model consisting of a number of "if-then" statements whose antecedents are formed in the input space and linked with the consequents (conclusion parts) formed in the output space. In this framework, FNNs contribute to the formation of the premise part of the overall network structure of the gHFNN. The consequents of the rules are designed with the aid of genetically optimized PNNs. The experiments reported in this study deal with well-known software data such as the NASA dataset. In comparison with previously discussed approaches, the proposed self-organizing networks are more accurate and show significant generalization ability.

Defect Severity-based Dimension Reduction Model using PCA (PCA를 적용한 결함 심각도 기반 차원 축소 모델)

  • Kwon, Ki Tae;Lee, Na-Young
    • Journal of Software Assessment and Valuation, v.15 no.1, pp.79-86, 2019
  • Software dimension reduction identifies what elements have in common and extracts the important features. It thereby reduces complexity through simplification, resolves multicollinearity problems, and reduces redundancy by detecting redundant and noisy elements. In this study, we propose a defect severity-based dimension reduction model. The proposed model is applied to a defect severity-based NASA dataset, and the number of dimensions (columns) that affect defect severity is verified. The dimensions of the data before and after reduction are then compared and analyzed. In the experimental results, the number of dimensions of the PC4 dataset was 2 to 3, showing that the dimension could be reduced.
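The abstract above does not say how many components are kept or by what rule, so the following is only a minimal sketch of PCA-based dimension reduction. It assumes NumPy, standardized metric columns, and a hypothetical `variance_threshold` parameter that keeps the fewest components explaining that share of the variance; none of these choices come from the paper.

```python
import numpy as np

def pca_reduce(X, variance_threshold=0.95):
    """Project X onto the fewest principal components that retain at
    least `variance_threshold` of the total variance (assumed rule)."""
    X = np.asarray(X, dtype=float)
    # Standardize columns so metrics on different scales are comparable.
    std = X.std(axis=0)
    std[std == 0] = 1.0  # leave constant columns unscaled
    Z = (X - X.mean(axis=0)) / std
    # The covariance matrix is symmetric, so eigh is appropriate.
    eigvals, eigvecs = np.linalg.eigh(np.cov(Z, rowvar=False))
    order = np.argsort(eigvals)[::-1]          # largest variance first
    eigvals = np.clip(eigvals[order], 0, None)  # remove tiny negative round-off
    eigvecs = eigvecs[:, order]
    cumulative = np.cumsum(eigvals) / eigvals.sum()
    k = int(np.searchsorted(cumulative, variance_threshold)) + 1
    k = min(k, len(eigvals))
    return Z @ eigvecs[:, :k], k
```

Calling `pca_reduce(X)` on a metrics matrix returns the projected data and the number of retained components, which can be compared with the original column count.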

Defect Severity-based Defect Prediction Model using CL

  • Lee, Na-Young;Kwon, Ki-Tae
    • Journal of the Korea Society of Computer and Information, v.23 no.9, pp.81-86, 2018
  • Software defect severity is very important in new projects or projects with limited historical data. However, in general software defect prediction it is very difficult to collect label information for the training set, and cross-project defect prediction requires a large amount of data. In this paper, an unclassified dataset with defect severity is clustered according to the distribution ratio, and a defect severity-based prediction model is proposed by way of labeling. The proposed model applies CLAMI to JM1 and PC4, the defect severity-based NASA datasets with the least ambiguity, and is evaluated by comparing its ACC value with that obtained on the original data. In the experimental results, the proposed model improves on existing defect severity-based prediction models by 0.15 (15%) for JM1 and 0.12 (12%) for PC4.
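As a rough illustration of labeling by clustering, the sketch below follows the clustering-and-labeling idea usually associated with CLA/CLAMI: count, per instance, how many metrics exceed the per-metric median, group instances by that count, and label the higher groups as defective. It is a simplified sketch, not the paper's severity-based variant, which clusters by distribution ratio. The function name `cla_label` and the cutoff rule (median of the distinct counts) are assumptions.

```python
from statistics import median

def cla_label(rows):
    """Label each instance True (defect-prone) or False without using
    any ground-truth labels. `rows` is a list of metric vectors."""
    columns = list(zip(*rows))
    medians = [median(col) for col in columns]
    # K = number of metrics on which the instance exceeds the metric median.
    scores = [sum(v > m for v, m in zip(row, medians)) for row in rows]
    # Instances sharing the same K form one cluster; clusters whose K lies
    # above the median of the distinct K values are labelled defective
    # (assumed cutoff rule).
    cutoff = median(sorted(set(scores)))
    return [k > cutoff for k in scores]
```

The resulting labels can then serve as training targets for an ordinary classifier when no labelled history is available.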

Rule-Based Fuzzy Polynomial Neural Networks in Modeling Software Process Data

  • Park, Byoung-Jun;Lee, Dong-Yoon;Oh, Sung-Kwun
    • International Journal of Control, Automation, and Systems, v.1 no.3, pp.321-331, 2003
  • Experimental software datasets describing software projects in terms of their complexity and development time have been the subject of intensive modeling. Various modeling methodologies and designs have been proposed, including neural networks, fuzzy models, and fuzzy neural network models. In this study, we introduce the concept of rule-based fuzzy polynomial neural networks (RFPNN) as a hybrid modeling architecture and discuss its comprehensive design methodology. The development of the RFPNN draws on the technologies of Computational Intelligence (CI), namely fuzzy sets, neural networks, and genetic algorithms. The architecture of the RFPNN results from a synergistic use of RFNN and PNN: the RFNN contributes to the formation of the premise part of the rule-based structure of the RFPNN, and the consequence part is designed using PNN. We discuss two kinds of RFPNN architectures and propose a comprehensive learning algorithm. In particular, it is shown that this network exhibits a dynamic structure. The experimental results include well-known software data such as the NASA dataset concerning software cost estimation and the dataset describing software modules of the Medical Imaging System (MIS).
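The abstract does not give the node types used in the PNN part, so the sketch below only illustrates the general idea of a polynomial node: a low-order polynomial of two inputs whose coefficients are estimated by least squares. The bi-quadratic form, NumPy, and the function names `fit_pnn_node` and `eval_pnn_node` are assumptions made for illustration.

```python
import numpy as np

def _design_matrix(x1, x2):
    # Columns: 1, x1, x2, x1*x2, x1^2, x2^2 (assumed bi-quadratic node).
    return np.column_stack([np.ones_like(x1), x1, x2, x1 * x2, x1**2, x2**2])

def fit_pnn_node(x1, x2, y):
    """Least-squares estimate of the six polynomial coefficients."""
    x1, x2, y = (np.asarray(v, dtype=float) for v in (x1, x2, y))
    coeffs, *_ = np.linalg.lstsq(_design_matrix(x1, x2), y, rcond=None)
    return coeffs

def eval_pnn_node(coeffs, x1, x2):
    """Evaluate the fitted polynomial node on new inputs."""
    x1 = np.asarray(x1, dtype=float)
    x2 = np.asarray(x2, dtype=float)
    return _design_matrix(x1, x2) @ coeffs
```

In a layered polynomial network, the outputs of such nodes would feed the next layer, with a selection step (genetic or otherwise) deciding which nodes survive.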

Semi-supervised Software Defect Prediction Model Based on Tri-training

  • Meng, Fanqi;Cheng, Wenying;Wang, Jingdong
    • KSII Transactions on Internet and Information Systems (TIIS), v.15 no.11, pp.4028-4042, 2021
  • To address the difficulty of software defect prediction caused by insufficient labelled defect samples and class imbalance, a semi-supervised software defect prediction model is proposed that combines feature normalization, oversampling, and the Tri-training algorithm. First, feature normalization is used to smooth the feature data and eliminate the influence of excessively large or small feature values on the model's classification performance. Second, oversampling is used to expand the data, which addresses the class imbalance among labelled samples. Finally, the Tri-training algorithm learns from the training samples and establishes a defect prediction model. The novelty of this model is that it effectively combines feature normalization, oversampling, and the Tri-training algorithm to solve both the under-labelled sample and class imbalance problems. Simulation experiments using the NASA software defect prediction dataset show that the proposed method outperforms four existing supervised and semi-supervised learning methods in terms of Precision, Recall, and F-Measure.
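The first two stages described above can be sketched as follows. The abstract does not name the normalization or oversampling method, so min-max scaling and random duplication of minority-class samples are assumptions, as are the function names; the Tri-training stage itself is not shown.

```python
import random

def min_max_normalize(rows):
    """Scale every feature column to [0, 1] (assumed normalization)."""
    columns = list(zip(*rows))
    lows = [min(col) for col in columns]
    highs = [max(col) for col in columns]
    return [
        [(v - lo) / (hi - lo) if hi > lo else 0.0
         for v, lo, hi in zip(row, lows, highs)]
        for row in rows
    ]

def random_oversample(rows, labels, seed=0):
    """Duplicate randomly chosen samples of the smaller classes until
    every class is as large as the largest one (assumed oversampling)."""
    rng = random.Random(seed)
    by_class = {}
    for row, label in zip(rows, labels):
        by_class.setdefault(label, []).append(row)
    target = max(len(members) for members in by_class.values())
    out_rows, out_labels = [], []
    for label, members in by_class.items():
        extra = [rng.choice(members) for _ in range(target - len(members))]
        out_rows.extend(members + extra)
        out_labels.extend([label] * target)
    return out_rows, out_labels
```

The balanced, normalized samples would then be passed to the semi-supervised learner together with the unlabelled pool.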

Design of Neurofuzzy Networks by Means of Linear Fuzzy Inference and Its Application to Software Engineering (선형 퍼지추론을 이용한 뉴로퍼지 네트워크의 설계와 소프트웨어 공학으로의 응용)

  • Park, Byoung-Jun;Park, Ho-Sung;Oh, Sung-Kwun
    • Proceedings of the KIEE Conference, 2002.07d, pp.2818-2820, 2002
  • In this paper, we design a neurofuzzy network architecture by means of linear fuzzy inference. The proposed neurofuzzy networks are equivalent to linear fuzzy rules, and their structure is composed of two main substructures, namely the premise part and the consequence part. The premise part uses fuzzy space partitioning over all variables in order to account for correlation between input variables. The consequence part consists of networks in first-order linear form. In neurofuzzy networks of general structure (for instance, ANFIS networks), the consequence part consists of nodes whose function is a linear combination of the input variables. In the proposed neurofuzzy networks, by contrast, it consists not of nodes but of networks that are built from connection weights and that functionally correspond to a linear combination of the input variables. The connection weights in the consequence part are learned by the back-propagation algorithm. To evaluate the proposed neurofuzzy networks, the experiments use a well-known NASA dataset concerning software cost estimation.
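The inference scheme described above, with a fuzzy premise and a first-order linear consequence, can be sketched as below. This is a generic first-order fuzzy inference sketch and not the paper's network: Gaussian membership functions, product firing strengths, and the name `linear_fuzzy_inference` are assumptions, and learning of the weights by back-propagation is not shown.

```python
import math

def gaussian(x, center, width):
    """Gaussian membership degree of x (assumed membership shape)."""
    return math.exp(-0.5 * ((x - center) / width) ** 2)

def linear_fuzzy_inference(x, rules):
    """Evaluate first-order fuzzy rules on input vector x.

    Each rule is (centers, widths, coeffs), where coeffs = [a0, a1, ..., an]
    defines the linear consequent y = a0 + a1*x1 + ... + an*xn.
    """
    weighted_sum = 0.0
    total_weight = 0.0
    for centers, widths, coeffs in rules:
        # Premise part: firing strength as the product of memberships.
        w = 1.0
        for xi, c, s in zip(x, centers, widths):
            w *= gaussian(xi, c, s)
        # Consequence part: linear combination of the inputs.
        y = coeffs[0] + sum(a * xi for a, xi in zip(coeffs[1:], x))
        weighted_sum += w * y
        total_weight += w
    if total_weight == 0.0:
        raise ValueError("no rule fires for this input")
    # Output: firing-strength-weighted average of the rule consequents.
    return weighted_sum / total_weight
```

With two rules centered at 0 and 1, an input midway between them fires both rules equally, so the output is the plain average of the two linear consequents.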
