• Title/Summary/Keyword: Classification Algorithms

Search Result 1,182, Processing Time 0.026 seconds

Opera Clustering: K-means on librettos datasets

  • Jeong, Harim;Yoo, Joo Hun
    • Journal of Internet Computing and Services
    • /
    • v.23 no.2
    • /
    • pp.45-52
    • /
    • 2022
  • With the development of artificial intelligence analysis methods, especially machine learning, various fields are widely expanding their application ranges. However, in the case of classical music, there still remain some difficulties in applying machine learning techniques. Genre classification or music recommendation systems generated by deep learning algorithms are actively used in general music, but not in classical music. In this paper, we attempted to classify opera among classical music. To this end, an experiment was conducted to determine which criteria are most suitable among, composer, period of composition, and emotional atmosphere, which are the basic features of music. To generate emotional labels, we adopted zero-shot classification with four basic emotions, 'happiness', 'sadness', 'anger', and 'fear.' After embedding the opera libretto with the doc2vec processing model, the optimal number of clusters is computed based on the result of the elbow method. Decided four centroids are then adopted in k-means clustering to classify unsupervised libretto datasets. We were able to get optimized clustering based on the result of adjusted rand index scores. With these results, we compared them with notated variables of music. As a result, it was confirmed that the four clusterings calculated by machine after training were most similar to the grouping result by period. Additionally, we were able to verify that the emotional similarity between composer and period did not appear significantly. At the end of the study, by knowing the period is the right criteria, we hope that it makes easier for music listeners to find music that suits their tastes.

Deep Learning-Based Model for Classification of Medical Record Types in EEG Report (EEG Report의 의무기록 유형 분류를 위한 딥러닝 기반 모델)

  • Oh, Kyoungsu;Kang, Min;Kang, Seok-hwan;Lee, Young-ho
    • KIPS Transactions on Software and Data Engineering
    • /
    • v.11 no.5
    • /
    • pp.203-210
    • /
    • 2022
  • As more and more research and companies use health care data, efforts are being made to vitalize health care data worldwide. However, the system and format used by each institution is different. Therefore, this research established a basic model to classify text data onto multiple institutions according to the type of the future by establishing a basic model to classify the types of medical records of the EEG Report. For EEG Report classification, four deep learning-based algorithms were compared. As a result of the experiment, the ANN model trained by vectorizing with One-Hot Encoding showed the highest performance with an accuracy of 71%.

DISEASE FORECAST USING MACHINE LEARNING ALGORITHMS

  • HUSSAIN, MOHAMMED MUZAFFAR;DEVI, S. KALPANA
    • Journal of applied mathematics & informatics
    • /
    • v.40 no.5_6
    • /
    • pp.1151-1165
    • /
    • 2022
  • Key drive of information quarrying is to digest liked information starting possible information. With the colossal amount of realities kept in documents, information bases, and stores, in the medical care area, it's inexorably significant, assuming excessive, arising compelling resources aimed at examination besides comprehension like information on behalf of the withdrawal of gen that might assistance in independent direction. Classification is method in information mining; it's characterized as per private, passing on item toward a specific course established happening it is likeness toward past instances of different substances trendy the data collection. In pre-owned recycled four Classification algorithm that incorporate Multi-Layer perception, KSTAR, Bayesian Network and PART to fabricate the grouping replicas arranged the malaria data collection and analyze the replicas, degree their exhibition through Waikato Environment for Knowledge Analysis introduced to Java Development Kit 8, then utilizations outfit's technique trendy promoting presentation of the arrangement methodology. The outcome perceived that Bayesian Network return most elevated exactness of 50.05% when working on followed by Multi-Layer perception, with 49.9% when helping is half, then, at that point, Kstar with precision of 49.44%, 49.5% when supporting individually and PART have lesser precision of 48.1% when helping, The exploration recommended that Bayesian Network is awesome toward remain utilized on Malaria data collection in our sanatoriums.

Classification of Midinfrared Spectra of Colon Cancer Tissue Using a Convolutional Neural Network

  • Kim, In Gyoung;Lee, Changho;Kim, Hyeon Sik;Lim, Sung Chul;Ahn, Jae Sung
    • Current Optics and Photonics
    • /
    • v.6 no.1
    • /
    • pp.92-103
    • /
    • 2022
  • The development of midinfrared (mid-IR) quantum cascade lasers (QCLs) has enabled rapid high-contrast measurement of the mid-IR spectra of biological tissues. Several studies have compared the differences between the mid-IR spectra of colon cancer and noncancerous colon tissues. Most mid-IR spectrum classification studies have been proposed as machine-learning-based algorithms, but this results in deviations depending on the initial data and threshold values. We aim to develop a process for classifying colon cancer and noncancerous colon tissues through a deep-learning-based convolutional-neural-network (CNN) model. First, we image the midinfrared spectrum for the CNN model, an image-based deep-learning (DL) algorithm. Then, it is trained with the CNN algorithm and the classification ratio is evaluated using the test data. When the tissue microarray (TMA) and routine pathological slide are tested, the ML-based support-vector-machine (SVM) model produces biased results, whereas we confirm that the CNN model classifies colon cancer and noncancerous colon tissues. These results demonstrate that the CNN model using midinfrared-spectrum images is effective at classifying colon cancer tissue and noncancerous colon tissue, and not only submillimeter-sized TMA but also routine colon cancer tissue samples a few tens of millimeters in size.

An Analysis of Plant Diseases Identification Based on Deep Learning Methods

  • Xulu Gong;Shujuan Zhang
    • The Plant Pathology Journal
    • /
    • v.39 no.4
    • /
    • pp.319-334
    • /
    • 2023
  • Plant disease is an important factor affecting crop yield. With various types and complex conditions, plant diseases cause serious economic losses, as well as modern agriculture constraints. Hence, rapid, accurate, and early identification of crop diseases is of great significance. Recent developments in deep learning, especially convolutional neural network (CNN), have shown impressive performance in plant disease classification. However, most of the existing datasets for plant disease classification are a single background environment rather than a real field environment. In addition, the classification can only obtain the category of a single disease and fail to obtain the location of multiple different diseases, which limits the practical application. Therefore, the object detection method based on CNN can overcome these shortcomings and has broad application prospects. In this study, an annotated apple leaf disease dataset in a real field environment was first constructed to compensate for the lack of existing datasets. Moreover, the Faster R-CNN and YOLOv3 architectures were trained to detect apple leaf diseases in our dataset. Finally, comparative experiments were conducted and a variety of evaluation indicators were analyzed. The experimental results demonstrate that deep learning algorithms represented by YOLOv3 and Faster R-CNN are feasible for plant disease detection and have their own strong points and weaknesses.

Classification of Construction Worker's Activities Towards Collective Sensing for Safety Hazards

  • Yang, Kanghyeok;Ahn, Changbum R.
    • International conference on construction engineering and project management
    • /
    • 2017.10a
    • /
    • pp.80-88
    • /
    • 2017
  • Although hazard identification is one of the most important steps of safety management process, numerous hazards remain unidentified in the construction workplace due to the dynamic environment of the construction site and the lack of available resource for visual inspection. To this end, our previous study proposed the collective sensing approach for safety hazard identification and showed the feasibility of identifying hazards by capturing collective abnormalities in workers' walking patterns. However, workers generally performed different activities during the construction task in the workplace. Thereby, an additional process that can identify the worker's walking activity is necessary to utilize the proposed hazard identification approach in real world settings. In this context, this study investigated the feasibility of identifying walking activities during construction task using Wearable Inertial Measurement Units (WIMU) attached to the worker's ankle. This study simulated the indoor masonry work for data collection and investigated the classification performance with three different machine learning algorithms (i.e., Decision Tree, Neural Network, and Support Vector Machine). The analysis results showed the feasibility of identifying worker's activities including walking activity using an ankle-attached WIMU. Moreover, the finding of this study will help to enhance the performance of activity recognition and hazard identification in construction.

  • PDF

A Multi-Class Classifier of Modified Convolution Neural Network by Dynamic Hyperplane of Support Vector Machine

  • Nur Suhailayani Suhaimi;Zalinda Othman;Mohd Ridzwan Yaakub
    • International Journal of Computer Science & Network Security
    • /
    • v.23 no.11
    • /
    • pp.21-31
    • /
    • 2023
  • In this paper, we focused on the problem of evaluating multi-class classification accuracy and simulation of multiple classifier performance metrics. Multi-class classifiers for sentiment analysis involved many challenges, whereas previous research narrowed to the binary classification model since it provides higher accuracy when dealing with text data. Thus, we take inspiration from the non-linear Support Vector Machine to modify the algorithm by embedding dynamic hyperplanes representing multiple class labels. Then we analyzed the performance of multi-class classifiers using macro-accuracy, micro-accuracy and several other metrics to justify the significance of our algorithm enhancement. Furthermore, we hybridized Enhanced Convolution Neural Network (ECNN) with Dynamic Support Vector Machine (DSVM) to demonstrate the effectiveness and efficiency of the classifier towards multi-class text data. We performed experiments on three hybrid classifiers, which are ECNN with Binary SVM (ECNN-BSVM), and ECNN with linear Multi-Class SVM (ECNN-MCSVM) and our proposed algorithm (ECNNDSVM). Comparative experiments of hybrid algorithms yielded 85.12 % for single metric accuracy; 86.95 % for multiple metrics on average. As for our modified algorithm of the ECNN-DSVM classifier, we reached 98.29 % micro-accuracy results with an f-score value of 98 % at most. For the future direction of this research, we are aiming for hyperplane optimization analysis.

Improvement of Facial Emotion Recognition Performance through Addition of Geometric Features (기하학적 특징 추가를 통한 얼굴 감정 인식 성능 개선)

  • Hoyoung Jung;Hee-Il Hahn
    • The Journal of the Institute of Internet, Broadcasting and Communication
    • /
    • v.24 no.1
    • /
    • pp.155-161
    • /
    • 2024
  • In this paper, we propose a new model by adding landmark information as a feature vector to the existing CNN-based facial emotion classification model. Facial emotion classification research using CNN-based models is being studied in various ways, but the recognition rate is very low. In order to improve the CNN-based models, we propose algorithms that improves facial expression classification accuracy by combining the CNN model with a landmark-based fully connected network obtained by ASM. By including landmarks in the CNN model, the recognition rate was improved by several percent, and experiments confirmed that further improved results could be obtained by adding FACS-based action units to the landmarks.

Segmentation of LiDAR Point Data Using Contour Tree (Contour Tree를 이용한 LiDAR Point 데이터의 분할)

  • Han Dong-Yeob;Kim Yong-Il
    • Proceedings of the Korean Society of Surveying, Geodesy, Photogrammetry, and Cartography Conference
    • /
    • 2006.04a
    • /
    • pp.463-467
    • /
    • 2006
  • Several segmentation algorithms have been proposed for DTM generation or building modeling from airborne LiDAR data. Three components are important for accurate segmentation: (i) the adjacent relationship of n-nearest points or mesh, etc. (ii) the effective decision parameters of height, slope, curvature, and plane condition, (iii) grouping methods. In this paper, we created the topology of point cloud data using the contour tree and implemented the region-growing Terrain and non-terrain points were classified correctly in the segmented data, which can be used also for feature classification.

  • PDF

Comparative Analyses of Two Algorithms for Region Segmentation of Color Image (칼라이미지의 영역분할을 위한 두 알고리즘의 비교분석)

  • 허민권;성병우;최흥국;김상균;서정욱
    • Proceedings of the Korea Multimedia Society Conference
    • /
    • 1998.04a
    • /
    • pp.83-88
    • /
    • 1998
  • 칼라이미지를 인식 및 분석을 하기 위해서는 이미지에 대한 영역분할이 우선적으로 먼저 이루어져야 되므로, 본 연구에서는 영역분할에 대한 두 개의 알고리즘을 구현하여 비교 분석하였다. 여러 가지 영역분할 방법 중에서 가장 쉽게 적용할 수 있고 또 가장 빠르게 영역을 분할 할 수 있는 Box classification 알고리즘을 이용하여 심근조직 표본의 현미경 영상이미지에 대해서 육안으로 선택한 영역과 histogram을 미분하여 최저 값에 문턱치를 정하여 줌으로써 선택한 영역에 대해 추출하고 이들 각각을 HLS 칼라모델에서 비교 분석하였다.

  • PDF