• Title/Summary/Keyword: ML techniques

Search Result 341, Processing Time 0.035 seconds

A Pragmatic Framework for Predicting Change Prone Files Using Machine Learning Techniques with Java-based Software

  • Loveleen Kaur;Ashutosh Mishra
    • Asia pacific journal of information systems
    • /
    • v.30 no.3
    • /
    • pp.457-496
    • /
    • 2020
  • This study aims to extensively analyze the performance of various Machine Learning (ML) techniques for predicting version to version change-proneness of source code Java files. 17 object-oriented metrics have been utilized in this work for predicting change-prone files using 31 ML techniques and the framework proposed has been implemented on various consecutive releases of two Java-based software projects available as plug-ins. 10-fold and inter-release validation methods have been employed to validate the models and statistical tests provide supplementary information regarding the reliability and significance of the results. The results of experiments conducted in this article indicate that the ML techniques perform differently under the different validation settings. The results also confirm the proficiency of the selected ML techniques in lieu of developing change-proneness prediction models which could aid the software engineers in the initial stages of software development for classifying change-prone Java files of a software, in turn aiding in the trend estimation of change-proneness over future versions.

Development of ML and IoT Enabled Disease Diagnosis Model for a Smart Healthcare System

  • Mehra, Navita;Mittal, Pooja
    • International Journal of Computer Science & Network Security
    • /
    • v.22 no.7
    • /
    • pp.1-12
    • /
    • 2022
  • The current progression in the Internet of Things (IoT) and Machine Learning (ML) based technologies converted the traditional healthcare system into a smart healthcare system. The incorporation of IoT and ML has changed the way of treating patients and offers lots of opportunities in the healthcare domain. In this view, this research article presents a new IoT and ML-based disease diagnosis model for the diagnosis of different diseases. In the proposed model, vital signs are collected via IoT-based smart medical devices, and the analysis is done by using different data mining techniques for detecting the possibility of risk in people's health status. Recommendations are made based on the results generated by different data mining techniques, for high-risk patients, an emergency alert will be generated to healthcare service providers and family members. Implementation of this model is done on Anaconda Jupyter notebook by using different Python libraries in it. The result states that among all data mining techniques, SVM achieved the highest accuracy of 0.897 on the same dataset for classification of Parkinson's disease.

Machine learning modeling of irradiation embrittlement in low alloy steel of nuclear power plants

  • Lee, Gyeong-Geun;Kim, Min-Chul;Lee, Bong-Sang
    • Nuclear Engineering and Technology
    • /
    • v.53 no.12
    • /
    • pp.4022-4032
    • /
    • 2021
  • In this study, machine learning (ML) techniques were used to model surveillance test data of nuclear power plants from an international database of the ASTM E10.02 committee. Regression modeling was conducted using various techniques, including Cubist, XGBoost, and a support vector machine. The root mean square deviation of each ML model for the baseline dataset was less than that of the ASTM E900-15 nonlinear regression model. With respect to the interpolation, the ML methods provided excellent predictions with relatively few computations when applied to the given data range. The effect of the explanatory variables on the transition temperature shift (TTS) for the ML methods was analyzed, and the trends were slightly different from those for the ASTM E900-15 model. ML methods showed some weakness in the extrapolation of the fluence in comparison to the ASTM E900-15, while the Cubist method achieved an extrapolation to a certain extent. To achieve a more reliable prediction of the TTS, it was confirmed that advanced techniques should be considered for extrapolation when applying ML modeling.

Design and Performance Analysis of ML Techniques for Finger Motion Recognition (손가락 움직임 인식을 위한 웨어러블 디바이스 설계 및 ML 기법별 성능 분석)

  • Jung, Woosoon;Lee, Hyung Gyu
    • Journal of Korea Society of Industrial Information Systems
    • /
    • v.25 no.2
    • /
    • pp.129-136
    • /
    • 2020
  • Recognizing finger movements have been used as a intuitive way of human-computer interaction. In this study, we implement an wearable device for finger motion recognition and evaluate the accuracy of several ML (Machine learning) techniques. Not only HMM (Hidden markov model) and DTW (Dynamic time warping) techniques that have been traditionally used as time series data analysis, but also NN (Neural network) technique are applied to compare and analyze the accuracy of each technique. In order to minimize the computational requirement, we also apply the pre-processing to each ML techniques. Our extensive evaluations demonstrate that the NN-based gesture recognition system achieves 99.1% recognition accuracy while the HMM and DTW achieve 96.6% and 95.9% recognition accuracy, respectively.

Resume Classification System using Natural Language Processing & Machine Learning Techniques

  • Irfan Ali;Nimra;Ghulam Mujtaba;Zahid Hussain Khand;Zafar Ali;Sajid Khan
    • International Journal of Computer Science & Network Security
    • /
    • v.24 no.7
    • /
    • pp.108-117
    • /
    • 2024
  • The selection and recommendation of a suitable job applicant from the pool of thousands of applications are often daunting jobs for an employer. The recommendation and selection process significantly increases the workload of the concerned department of an employer. Thus, Resume Classification System using the Natural Language Processing (NLP) and Machine Learning (ML) techniques could automate this tedious process and ease the job of an employer. Moreover, the automation of this process can significantly expedite and transparent the applicants' selection process with mere human involvement. Nevertheless, various Machine Learning approaches have been proposed to develop Resume Classification Systems. However, this study presents an automated NLP and ML-based system that classifies the Resumes according to job categories with performance guarantees. This study employs various ML algorithms and NLP techniques to measure the accuracy of Resume Classification Systems and proposes a solution with better accuracy and reliability in different settings. To demonstrate the significance of NLP & ML techniques for processing & classification of Resumes, the extracted features were tested on nine machine learning models Support Vector Machine - SVM (Linear, SGD, SVC & NuSVC), Naïve Bayes (Bernoulli, Multinomial & Gaussian), K-Nearest Neighbor (KNN) and Logistic Regression (LR). The Term-Frequency Inverse Document (TF-IDF) feature representation scheme proven suitable for Resume Classification Task. The developed models were evaluated using F-ScoreM, RecallM, PrecissionM, and overall Accuracy. The experimental results indicate that using the One-Vs-Rest-Classification strategy for this multi-class Resume Classification task, the SVM class of Machine Learning algorithms performed better on the study dataset with over 96% overall accuracy. The promising results suggest that NLP & ML techniques employed in this study could be used for the Resume Classification task.

Machine Learning in FET-based Chemical and Biological Sensors: A Mini Review

  • Ahn, Jae-Hyuk
    • Journal of Sensor Science and Technology
    • /
    • v.30 no.1
    • /
    • pp.1-9
    • /
    • 2021
  • This mini review summarizes some of the recent advances in machine-learning (ML)-driven chemical and biological sensors. Specific focus is on field-effect-transistor (FET)-based sensors with a description of their structures and detection mechanisms. Key ML techniques are briefly reviewed for an audience not familiar with the basic principles. We mainly discuss two aspects: (1) data analysis based on ML and (2) ML applied to sensor design. In conclusion, the challenges and opportunities for the advancement of ML-based sensors are briefly considered.

Avoiding Indefiniteness in Criteria for Maximum Likelihood Bearing Estimation with Arbitrary Array Configuration

  • Suzuki, Masakiyo
    • Proceedings of the IEEK Conference
    • /
    • 2002.07c
    • /
    • pp.1807-1810
    • /
    • 2002
  • This paper presents a technique for avoid- ing indefiniteness in Maximum Likelihood (ML) criteria for Direction-of-Arrival (DOA) finding using a sensor ar- ray with arbitrary configuration. The ML criterion has singular points in the solution space where the criterion becomes indefinite. Solutions fly iterative techniques for ML bearing estimation may oscillate because of numerical instability which occurs due to the indefiniteness, when bearings more than one approach to the identical value. The oscillation makes the condition for terminating iterations complex. This paper proposes a technique for avoiding the indefiniteness in ML criteria.

  • PDF

Modeling of AutoML using Colored Petri Net

  • Yo-Seob, Lee
    • International Journal of Advanced Culture Technology
    • /
    • v.10 no.4
    • /
    • pp.420-426
    • /
    • 2022
  • Developing a machine learning model and putting it into production goes through a number of steps. Automated Machine Learning(AutoML) appeared to increase productivity and efficiency by automating inefficient tasks that occur while repeating this process whenever machine learning is applied. The high degree of automation of AutoML models allows non-experts to use machine learning models and techniques without the need to become machine learning experts. Automating the process of applying machine learning end-to-end with AutoML models has the added benefit of creating simpler solutions, generating these solutions faster, and often generating models that outperform hand-designed models. In this paper, the AutoML data is collected and AutoML's Color Petri net model is created and analyzed based on it.

Design of Efficient Storage Exploiting Structural Similarity in Microarray Data (마이크로어레이 데이터의 구조적 유사성을 이용한 효율적인 저장 구조의 설계)

  • Yun, Jong-Han;Shin, Dong-Kyu;Shin, Dong-Il
    • The KIPS Transactions:PartD
    • /
    • v.16D no.5
    • /
    • pp.643-650
    • /
    • 2009
  • As one of typical techniques for acquiring bio-information, microarray has contributed greatly to development of bioinformatics. Although it is established as a core technology in bioinformatics, it has difficulty in sharing and storing data because data from experiments has huge and complex type. In this paper, we propose a new method which uses the feature that microarray data format in MAGE-ML, a standard format for exchanging data, has frequent structurally similar patterns. This method constructs compact database by simplifying MAGE-ML schema. In this method, Inlining techniques and newly proposed classification techniques using structural similarity of elements are used. The structure of database becomes simpler and number of table-joins is reduced, performance is enhanced using this method.

Role of Machine Learning in Intrusion Detection System: A Systematic Review

  • Alhasani, Areej;Al omrani, Faten;Alzahrani, Taghreed;alFahhad, Rehab;Alotaibi, Mohamed
    • International Journal of Computer Science & Network Security
    • /
    • v.22 no.3
    • /
    • pp.155-162
    • /
    • 2022
  • Over the last 10 years, there has been rapid growth in the use of Machine Learning (ML) techniques to automate the process of intrusion threat detection at a scale never imagined before. This has prompted researchers, software engineers, and network specialists to rethink the applications of machine ML techniques particularly in the area of cybersecurity. As a result there exists numerous research documentations on the use ML techniques to detect and block cyber-attacks. This article is a systematic review involving the identification of published scholarly articles as found on IEEE Explore and Scopus databases. The articles exclusively related to the use of machine learning in Intrusion Detection Systems (IDS). Methods, concepts, results, and conclusions as found in the texts are analyzed. A description on the process taken in the identification of the research articles included: First, an introduction to the topic which is followed by a methodology section. A table is used to list identified research articles in the form of title, authors, methodology, and key findings.