• Title/Summary/Keyword: Classification:

Search Result 22,559, Processing Time 0.043 seconds

Resume Classification System using Natural Language Processing & Machine Learning Techniques

  • Irfan Ali;Nimra;Ghulam Mujtaba;Zahid Hussain Khand;Zafar Ali;Sajid Khan
    • International Journal of Computer Science & Network Security
    • /
    • v.24 no.7
    • /
    • pp.108-117
    • /
    • 2024
  • The selection and recommendation of a suitable job applicant from the pool of thousands of applications are often daunting jobs for an employer. The recommendation and selection process significantly increases the workload of the concerned department of an employer. Thus, Resume Classification System using the Natural Language Processing (NLP) and Machine Learning (ML) techniques could automate this tedious process and ease the job of an employer. Moreover, the automation of this process can significantly expedite and transparent the applicants' selection process with mere human involvement. Nevertheless, various Machine Learning approaches have been proposed to develop Resume Classification Systems. However, this study presents an automated NLP and ML-based system that classifies the Resumes according to job categories with performance guarantees. This study employs various ML algorithms and NLP techniques to measure the accuracy of Resume Classification Systems and proposes a solution with better accuracy and reliability in different settings. To demonstrate the significance of NLP & ML techniques for processing & classification of Resumes, the extracted features were tested on nine machine learning models Support Vector Machine - SVM (Linear, SGD, SVC & NuSVC), Naïve Bayes (Bernoulli, Multinomial & Gaussian), K-Nearest Neighbor (KNN) and Logistic Regression (LR). The Term-Frequency Inverse Document (TF-IDF) feature representation scheme proven suitable for Resume Classification Task. The developed models were evaluated using F-ScoreM, RecallM, PrecissionM, and overall Accuracy. The experimental results indicate that using the One-Vs-Rest-Classification strategy for this multi-class Resume Classification task, the SVM class of Machine Learning algorithms performed better on the study dataset with over 96% overall accuracy. The promising results suggest that NLP & ML techniques employed in this study could be used for the Resume Classification task.

A Study for Definition and Classification of Offshore Units (해양시설 용어 정의 및 분류 체계에 관한 일고찰)

  • LIM, Youngsub;KWON, Do Joong;LEE, Chang-Hee
    • Journal of Fisheries and Marine Sciences Education
    • /
    • v.29 no.3
    • /
    • pp.689-701
    • /
    • 2017
  • In recent offshore industries, various ambiguous terms have been used without clear definition or classification, causing difficulties in legal, technical, and educational understanding and usage. For an example, the commonly used term of 'Offshore Plant' in Korea is not an universal word technically. There has been no clear technical or legal definition about the 'Offshore Plant' and its classification is also very ambiguous; sometimes it is used to refer offshore oil and gas production platform or it is used to mean offshore renewable power generation plant in some cases. To build a conceptual framework, therefore, this paper suggests a classification of offshore units (1) using internationally agreed terms, (2) agreed with the technical classification used by the ship classification society and (3) being able to include not only the current but also future concepts of offshore units.

A Comparison Study of Classification Algorithms in Data Mining

  • Lee, Seung-Joo;Jun, Sung-Rae
    • International Journal of Fuzzy Logic and Intelligent Systems
    • /
    • v.8 no.1
    • /
    • pp.1-5
    • /
    • 2008
  • Generally the analytical tools of data mining have two learning types which are supervised and unsupervised learning algorithms. Classification and prediction are main analysis tools for supervised learning. In this paper, we perform a comparison study of classification algorithms in data mining. We make comparative studies between popular classification algorithms which are LDA, QDA, kernel method, K-nearest neighbor, naive Bayesian, SVM, and CART. Also, we use almost all classification data sets of UCI machine learning repository for our experiments. According to our results, we are able to select proper algorithms for given classification data sets.

Comparison of Performance Measures for Credit-Card Delinquents Classification Models : Measured by Hit Ratio vs. by Utility (신용카드 연체자 분류모형의 성능평가 척도 비교 : 예측률과 유틸리티 중심으로)

  • Chung, Suk-Hoon;Suh, Yong-Moo
    • Journal of Information Technology Applications and Management
    • /
    • v.15 no.4
    • /
    • pp.21-36
    • /
    • 2008
  • As the great disturbance from abusing credit cards in Korea becomes stabilized, credit card companies need to interpret credit-card delinquents classification models from the viewpoint of profit. However, hit ratio which has been used as a measure of goodness of classification models just tells us how much correctly they classified rather than how much profits can be obtained as a result of using classification models. In this research, we tried to develop a new utility-based measure from the viewpoint of profit and then used this new measure to analyze two classification models(Neural Networks and Decision Tree models). We found that the hit ratio of neural model is higher than that of decision tree model, but the utility value of decision tree model is higher than that of neural model. This experiment shows the importance of utility based measure for credit-card delinquents classification models. We expect this new measure will contribute to increasing profits of credit card companies.

  • PDF

New Classification System for the Standardization of Power IT Terminologies (새로운 매트릭스분류체제에 의한 전력 IT용어 제정에 관한 연구)

  • Kim, Jung-Hoon;Hwang, Hu-Mor;Won, Jong-Ryul
    • Proceedings of the KIEE Conference
    • /
    • 2008.11a
    • /
    • pp.360-362
    • /
    • 2008
  • Based on classification systems of power and IT standard dictionaries, scientific and technological standard, SPARK, power IT fields of IEC and organization units of corporations, we propose a new classification system for the standardization of power of terminologies. The classification system consists of a hierarchical structure with general classification, application fields and specific technologies while keeping the conventional matrix-type classification system. Interpretation work of the power of terminologies confirms that the proposed classification system is efficient.

  • PDF

IoT Device Classification According to Context-aware Using Multi-classification Model

  • Zhang, Xu;Ryu, Shinhye;Kim, Sangwook
    • Journal of Korea Multimedia Society
    • /
    • v.23 no.3
    • /
    • pp.447-459
    • /
    • 2020
  • The Internet of Things(IoT) paradigm is flourishing strenuously for the last two decades. Researchers around the globe have their dreams to transmute every real-world object to the virtual object. Consequently, IoT devices are escalating exponentially. The abrupt evolution of these IoT devices has caused a major challenge i.e. object classification. In order to classify devices comprehensively and accurately, this paper proposes a context-aware based multi-classification model for devices, which classifies the smart devices according to people's contexts. However, the classification features of contextual data of different contexts are difficult to extract. The deep learning algorithm has the capability to solve this problem. This paper proposes a context-aware based multi-classification model of devices, which classifies the smart devices according to people's contexts.

A Study on the Han-Un Decimal Classification (한은도서분류법에 관한 연구)

  • Yeo, Ji-Suk;Oh, Dong-Geun
    • Journal of Korean Library and Information Science Society
    • /
    • v.37 no.1
    • /
    • pp.329-352
    • /
    • 2006
  • This study investigated the background of the first and revised editions of the Han-Un Decimal Classification(HUDC), and analyzed their relationships to and influences on other major related classification systems. HUDC was compiled in 1954 and revised in 1981. HUDC was influenced by NDC in most classes of main classes and mnemonic schedules, and influenced by KDCP in the classes Religion, Language and Literature.

  • PDF

Classification Accuracy Improvement for Decision Tree (의사결정트리의 분류 정확도 향상)

  • Rezene, Mehari Marta;Park, Sanghyun
    • Proceedings of the Korea Information Processing Society Conference
    • /
    • 2017.04a
    • /
    • pp.787-790
    • /
    • 2017
  • Data quality is the main issue in the classification problems; generally, the presence of noisy instances in the training dataset will not lead to robust classification performance. Such instances may cause the generated decision tree to suffer from over-fitting and its accuracy may decrease. Decision trees are useful, efficient, and commonly used for solving various real world classification problems in data mining. In this paper, we introduce a preprocessing technique to improve the classification accuracy rates of the C4.5 decision tree algorithm. In the proposed preprocessing method, we applied the naive Bayes classifier to remove the noisy instances from the training dataset. We applied our proposed method to a real e-commerce sales dataset to test the performance of the proposed algorithm against the existing C4.5 decision tree classifier. As the experimental results, the proposed method improved the classification accuracy by 8.5% and 14.32% using training dataset and 10-fold crossvalidation, respectively.

Terrain Classification for Enhancing Mobility of Outdoor Mobile Robot (실외 주행 로봇의 이동 성능 개선을 위한 지형 분류)

  • Kim, Ja-Young;Lee, Jong-Hwa;Lee, Ji-Hong;Kweon, In-So
    • The Journal of Korea Robotics Society
    • /
    • v.5 no.4
    • /
    • pp.339-348
    • /
    • 2010
  • One of the requirements for autonomous vehicles on off-road is to move stably in unstructured environments. Such capacity of autonomous vehicles is one of the most important abilities in consideration of mobility. So, many researchers use contact and/or non-contact methods to determine a terrain whether the vehicle can move on or not. In this paper we introduce an algorithm to classify terrains using visual information(one of the non-contacting methods). As a pre-processing, a contrast enhancement technique is introduced to improve classification of terrain. Also, for conducting classification algorithm, training images are grouped according to materials of the surface, and then Bayesian classification are applied to new images to determine membership to each group. In addition to the classification, we can build Traversability map specified by friction coefficients on which autonomous vehicles can decide to go or not. Experiments are made with Load-Cell to determine real friction coefficients of various terrains.

Active Sonar Target/Nontarget Classification Using Real Sea-trial Data (실제 해상 실험 데이터를 이용한 능동소나 표적/비표적 식별)

  • Seok, J.W.
    • Journal of Korea Multimedia Society
    • /
    • v.20 no.10
    • /
    • pp.1637-1645
    • /
    • 2017
  • Target/Nontarget classification can be divided into the study of shape estimation of the target analysing reflected echo signal and of type classification of the target using acoustical features. In active sonar system, the feature vectors are extracted from the signal reflected from the target, and an classification algorithm is applied to determine whether the received signal is a target or not. However, received sonar signals can be distorted in the underwater environments, and the spatio-temporal characteristics of active sonar signals change according to the aspect of the target. In addition, it is very difficult to collect real sea-trial data for research. In this paper, target/non-target classification were performed using real sea-trial data. Feature vectors are extracted using MFCC(Mel-Frequency Cepstral Coefficients), filterbank energy in the Fourier spectrum and wavelet domain. For the performance verification, classification experiments were performed using backpropagation neural network classifiers.