• Title/Summary/Keyword: CLASSIFICATION KEY

Deterministic and probabilistic analysis of tunnel face stability using support vector machine

  • Li, Bin;Fu, Yong;Hong, Yi;Cao, Zijun
    • Geomechanics and Engineering
    • /
    • v.25 no.1
    • /
    • pp.17-30
    • /
    • 2021
  • This paper develops a convenient approach for deterministic and probabilistic evaluation of tunnel face stability using support vector machine classifiers. The proposed method comprises two major steps: construction of the training dataset and determination of instance-based classifiers. In the first step, an orthogonal design is used to produce representative samples after the ranges and levels of the factors that influence tunnel face stability are specified. The training dataset is then labeled by two-dimensional strength reduction analyses performed in OptumG2. In the second step, the training dataset is used to classify any unknown instance by means of an ad hoc Python program. Classification of an unknown sample starts with the selection of instance-based training samples using the k-nearest neighbors algorithm, followed by the construction of an instance-based SVM-KNN classifier. The classifier then labels the unknown instance without the need to calculate its performance function. Probabilistic evaluations are performed by Monte Carlo simulation based on the SVM-KNN classifier: the ratio of the number of unstable samples to the total number of simulated samples is taken as the failure probability, which is validated against and compared with the response surface method.
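
The Monte Carlo step described in this abstract can be illustrated with a minimal, dependency-free sketch. Everything here is an assumption made for illustration: the two normalized input factors, the toy labeling rule, and the function names are hypothetical, and the local SVM of the paper is replaced by a plain majority vote among the k nearest training samples. It is not the authors' program; it only shows how a failure probability follows from the ratio of unstable labels to simulated samples.

```python
import math
import random


def build_training_set(levels=21):
    """Toy labeled grid standing in for the orthogonal-design samples (hypothetical)."""
    data = []
    for i in range(levels):
        for j in range(levels):
            x = (i / (levels - 1), j / (levels - 1))
            # Assumed rule: the face is stable only when the two factors sum above 1.
            label = "stable" if i + j > levels - 1 else "unstable"
            data.append((x, label))
    return data


def classify(x, training, k=5):
    """Label x by majority vote among its k nearest training samples.

    The paper fits a local SVM on these neighbors; the vote is a simpler stand-in.
    """
    nearest = sorted(training, key=lambda s: math.dist(s[0], x))[:k]
    unstable_votes = sum(1 for _, label in nearest if label == "unstable")
    return "unstable" if 2 * unstable_votes > k else "stable"


def failure_probability(training, n_samples=1000, seed=0):
    """Monte Carlo estimate: number of unstable samples / total simulated samples."""
    rng = random.Random(seed)
    unstable = 0
    for _ in range(n_samples):
        x = (rng.random(), rng.random())  # assumed uniform input distributions
        if classify(x, training) == "unstable":
            unstable += 1
    return unstable / n_samples


if __name__ == "__main__":
    training = build_training_set()
    print("estimated failure probability:", failure_probability(training))
```

Because the classifier only needs labeled neighbors, no performance function is evaluated for the simulated samples, which is the point the abstract makes.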

Study on sloshing simulation in the independent tank for an ice-breaking LNG carrier

  • Ding, Shifeng;Wang, Gang;Luo, Qiuming
    • International Journal of Naval Architecture and Ocean Engineering
    • /
    • v.12 no.1
    • /
    • pp.667-679
    • /
    • 2020
  • When an LNG carrier operates in ice-covered waters, ensuring overall safety is essential, and this safety depends on the coupling between the ice-breaking process and internal liquid sloshing. This paper focuses on sloshing simulation for an ice-breaking LNG carrier and proposes a numerical method that combines the Circumferential Crack Method (CCM) and the Volume of Fluid (VOF) method, with two key factors (velocity νx and force Fx). The ship motion analysis is carried out with CCM for a ship navigating in ice-covered waters at constant propulsion power. This yields the velocity νx, which serves as the initial excitation condition for calculating the internal sloshing force Fx. The ship motion is then modified through iterative computations under the combined action of the ice-breaking force and the liquid sloshing load. Sloshing in the LNG tank is simulated with the modified ship motion. An ice-breaking LNG ship with a three-leaf type tank is used as a case study. The internal LNG sloshing is simulated at three different liquid heights, and the results include the free surface shape and sloshing pressure distribution at a given moment and the pressure curves at monitoring points on the bulkhead. The method is effective for simulating sloshing during the ice-breaking process and could be a good reference for the design of polar ice-breaking LNG carriers.

Towards Improving Causality Mining using BERT with Multi-level Feature Networks

  • Ali, Wajid;Zuo, Wanli;Ali, Rahman;Rahman, Gohar;Zuo, Xianglin;Ullah, Inam
    • KSII Transactions on Internet and Information Systems (TIIS)
    • /
    • v.16 no.10
    • /
    • pp.3230-3255
    • /
    • 2022
  • Causality mining in NLP is a significant area of interest that benefits many everyday applications, including decision making, business risk management, question answering, future event prediction, scenario generation, and information retrieval. Mining causalities from web sources was a challenging and open problem for prior non-statistical and statistical techniques, which required hand-crafted linguistic patterns for feature engineering, depended on domain knowledge, and demanded much human effort. Those studies focused on explicit causality mining and overlooked implicit, ambiguous, and heterogeneous causality. In contrast to these approaches, we present Bidirectional Encoder Representations from Transformers (BERT) integrated with Multi-level Feature Networks (MFN), called BERT+MFN, for causality recognition in noisy and informal web datasets without human-designed features. In our model, MFN consists of a three-column knowledge-oriented network (TC-KN), a bi-LSTM, and a Relation Network (RN) that mine causality information at the segment level, while BERT captures semantic features at the word level. We perform experiments on Alternative Lexicalization (AltLexes) datasets. The experimental results show that our model outperforms baseline causality and text mining techniques.

Empirical Analysis on Product Based Differentiation Strategies in B2C industry (제품 특성과 B2C 차별화 전략의 실증 분석)

  • Joung, Seok-In;Park, Woo-Sung;Han, Hyun-Soo
    • Proceedings of the Korea Society of Management Information Systems Conference (한국경영정보학회:학술대회논문집)
    • /
    • 2007.11a
    • /
    • pp.527-532
    • /
    • 2007
  • Differentiation strategies have been suggested as critical sources of competitive advantage in the B2C industry, where customers can switch internet shopping malls with one click and virtually no transaction cost. Indeed, competing on low prices cannot be a viable strategy in the B2C industry. Moreover, cultivating customer loyalty to attain profitability is still a challenging task for most internet shopping malls. In this study, we provide empirical analysis results on key managerial variables that indicate the differences between product categories in terms of customer perception of relative value importance. We first identified a comprehensive set of managerial variables and organized them by customer decision stage. Next, with reference to the extant literature on e-commerce strategy based on product characteristics, we developed hypotheses to formalize the differences in customer value for the key managerial variables. Empirical testing indicated significant differences between the product groups in the customer-perceived value of the key managerial variables. The findings provide useful insight for further study of e-commerce differentiation strategy.

Malware Application Classification based on Feature Extraction and Machine Learning for Malicious Behavior Analysis in Android Platform (안드로이드 플랫폼에서 악성 행위 분석을 통한 특징 추출과 머신러닝 기반 악성 어플리케이션 분류)

  • Kim, Dong-Wook;Na, Kyung-Gi;Han, Myung-Mook;Kim, Mijoo;Go, Woong;Park, Jun Hyung
    • Journal of Internet Computing and Services
    • /
    • v.19 no.1
    • /
    • pp.27-35
    • /
    • 2018
  • This paper studies the classification of malicious applications in the Android environment. It examines the threats posed by malicious Android applications and analyzes their behavior, and it reports experiments in which malicious apps are classified by machine learning. Android behavior analysis can be carried out with dynamic analysis tools, which extract API calls, runtime logs, system resource usage, and network information for an application. We redefined the extracted properties for machine learning and evaluated the classification results by comparing the full feature set with the main features. The results show that classification with the key features improved by 1 to 4% over the full feature set; the SVM classifier in particular improved by 10%. From these results, we found that using the key features was more effective for the performance of the classification algorithms than using all features. Selecting meaningful features from the datasets was therefore identified as important.

LiDAR Ground Classification Enhancement Based on Weighted Gradient Kernel (가중 경사 커널 기반 LiDAR 미추출 지형 분류 개선)

  • Lee, Ho-Young;An, Seung-Man;Kim, Sung-Su;Sung, Hyo-Hyun;Kim, Chang-Hun
    • Journal of Korean Society for Geospatial Information Science
    • /
    • v.18 no.2
    • /
    • pp.29-33
    • /
    • 2010
  • The purpose of LiDAR ground classification is to achieve two goals: acquiring reliable ground points with high precision and describing the ground shape in detail. Despite many studies on optimized algorithms for this task, it remains very difficult to classify ground points and describe the ground shape from airborne LiDAR data, especially in densely forested areas such as Korea. Misclassification was caused mainly by the complex forest canopy hierarchy in Korea and by a LiDAR point density that is relatively coarse for ground classification. Unfortunately, much LiDAR surveying in South Korea is performed in summer, and for that reason the distribution of LiDAR points is very different from that in Europe. This study therefore proposes an enhanced ground classification method that considers Korean land cover characteristics. First, highly reliable candidate LiDAR points, obtained with the big roller classification algorithm, are designated as the initial ground points. Second, the weighted gradient kernel (WGK) algorithm is applied to find and include highly probable ground points among the remaining candidate points. The method is very useful for reconstructing terrain deformed by misclassification, because it detects and includes key terrain model points that are important for describing the ground shape at a site. In particular, for deformed river bank areas, the WGK algorithm produced markedly improved classification and reconstruction results.

Bioclimatic Classification and Characterization in South Korea (남한의 생물기후권역 구분과 특성 규명)

  • Choi, Yu-Young;Lim, Chul-Hee;Ryu, Ji-Eun;Piao, Dongfan;Kang, Jin-Young;Zhu, Weihong;Cui, Guishan;Lee, Woo-Kyun;Jeon, Seong-Woo
    • Journal of the Korean Society of Environmental Restoration Technology
    • /
    • v.20 no.3
    • /
    • pp.1-18
    • /
    • 2017
  • This study constructed a high-resolution bioclimatic classification map of South Korea, which classifies land into homogeneous zones with similar environmental properties, using statistical techniques that are more advanced than those of existing ecological area classification studies. The climate data provided by WorldClim (1960-1990) were used to generate 27 bioclimatic variables affecting biological habitats, and key environmental variables were derived by correlation analysis and principal component analysis. Clustering analysis was performed using the ISODATA method to construct a bioclimatic classification map at 30-arc-second (~1 km) resolution. South Korea was divided into 21 regions, and the classification results were verified by correlation analysis with Gross Primary Production (GPP) and the Actual Vegetation map produced by the Ministry of Environment. Each zone was described and named according to its environmental characteristics and major vegetation distribution. This study could provide a useful spatial framework to support ecosystem research, monitoring, and policy decisions.

Efficient Implementation of SVM-Based Speech/Music Classification on Embedded Systems (SVM 기반 음성/음악 분류기의 효율적인 임베디드 시스템 구현)

  • Lim, Chung-Soo;Chang, Joon-Hyuk
    • The Journal of the Acoustical Society of Korea
    • /
    • v.30 no.8
    • /
    • pp.461-467
    • /
    • 2011
  • Accurate classification of input signals is the key prerequisite for variable bit-rate coding, which was introduced to make effective use of limited communication bandwidth. In particular, the recent surge of multimedia services has raised the importance of speech/music classification. Among the many speech/music classifiers, those based on the support vector machine (SVM) have a strong selling point in their high classification accuracy, but their computational complexity and memory requirements hinder their adoption in actual implementations. Techniques that reduce the computational complexity and memory requirements are therefore essential, particularly for embedded systems. We first analyze the implementation of an SVM-based classifier on embedded systems in terms of execution time and energy consumption, and then propose two techniques that ease the implementation requirements: one removes support vectors that contribute insignificantly to the final classification, and the other skips the processing of some input signals by exploiting the strong correlations between speech/music frames. These are post-processing techniques that can work with any other optimization techniques applied during the SVM training phase. Through experiments, we validate the proposed algorithms in terms of classification accuracy, execution time, and energy consumption.
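
The two post-processing ideas in this abstract can be sketched in a few lines. This is only an illustration under stated assumptions: the RBF kernel, the coefficient-magnitude pruning criterion, the fixed skip interval, and all function names are hypothetical choices, since the abstract does not specify how "insignificant contribution" is measured or which frames are skipped.

```python
import math


def rbf(u, v, gamma=0.5):
    """RBF kernel between two feature vectors (assumed kernel choice)."""
    return math.exp(-gamma * sum((a - b) ** 2 for a, b in zip(u, v)))


def decision(x, support_vectors, coefs, bias):
    """SVM decision value: sum_i coef_i * K(sv_i, x) + bias."""
    return sum(c * rbf(sv, x) for sv, c in zip(support_vectors, coefs)) + bias


def prune(support_vectors, coefs, threshold):
    """Drop support vectors whose coefficient magnitude is below the threshold.

    Assumed criterion: a small |coef| means a small contribution to the decision.
    """
    kept = [(sv, c) for sv, c in zip(support_vectors, coefs) if abs(c) >= threshold]
    return [sv for sv, _ in kept], [c for _, c in kept]


def classify_stream(frames, support_vectors, coefs, bias, skip=1):
    """Evaluate the SVM on every (skip + 1)-th frame and reuse the last label otherwise."""
    labels = []
    current = None
    for i, frame in enumerate(frames):
        if i % (skip + 1) == 0:
            value = decision(frame, support_vectors, coefs, bias)
            current = "speech" if value >= 0 else "music"
        labels.append(current)
    return labels


if __name__ == "__main__":
    svs = [(0.0, 0.0), (1.0, 1.0), (0.5, 0.5)]
    coefs = [1.0, -1.0, 0.01]  # coefficient = alpha_i * y_i, toy values
    small_svs, small_coefs = prune(svs, coefs, threshold=0.05)
    frames = [(0.0, 0.0), (0.0, 0.0), (1.0, 1.0), (1.0, 1.0)]
    print(classify_stream(frames, small_svs, small_coefs, bias=0.0, skip=1))
```

Both steps trade a little accuracy for fewer kernel evaluations: pruning shortens the sum inside `decision`, and skipping reduces how often that sum is computed at all.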

Predictive Analysis of Problematic Smartphone Use by Machine Learning Technique

  • Kim, Yu Jeong;Lee, Dong Su
    • Journal of the Korea Society of Computer and Information
    • /
    • v.25 no.2
    • /
    • pp.213-219
    • /
    • 2020
  • In this paper, we propose a classification analysis method for diagnosing and predicting problematic smartphone use, in order to provide policy data on a problem that is getting worse year after year. We also attempt to identify the key variables that affect the classification. For this purpose, we compared the classification rates of three machine learning methods: Decision Tree, Random Forest, and Support Vector Machine. The data came from 25,465 people who responded to the '2018 Problematic Smartphone Use Survey' provided by the Korea Information Society Agency and were analyzed using the R statistical package (ver. 3.6.2). The three classification techniques showed similar classification rates, and none of the models showed overfitting. The Support Vector Machine had the highest classification rate of the three methods, followed by Decision Tree and Random Forest. The top three variables affecting the classification rate among smartphone use types were the Life Service type, the Information Seeking type, and the Leisure Activity Seeking type.

Performance Comparison of Korean Dialect Classification Models Based on Acoustic Features

  • Kim, Young Kook;Kim, Myung Ho
    • Journal of the Korea Society of Computer and Information
    • /
    • v.26 no.10
    • /
    • pp.37-43
    • /
    • 2021
  • The acoustic features of speech carry important social and linguistic information about the speaker, and one of the key pieces of information is the dialect. A speaker's use of a dialect is a major barrier to interaction with a computer. Dialects can be distinguished at various levels, such as phonemes, syllables, words, phrases, and sentences, but it is difficult to distinguish dialects by identifying these one by one. In this paper, we therefore propose a lightweight Korean dialect classification model that uses only MFCC among the features of speech data. We study the optimal way to use MFCC features with Korean conversational voice data and compare the classification performance for five Korean dialects (Gyeonggi/Seoul, Gangwon, Chungcheong, Jeolla, and Gyeongsang) across eight machine learning and deep learning classification models. Normalizing the MFCC improved the performance of most classification models, and accuracy improved by 1.07% and F1-score by 2.04% compared with the best classification model before MFCC normalization.
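
The abstract does not say which normalization was applied to the MFCC features. One common choice, shown here purely as an assumed illustration, is per-coefficient mean and variance normalization over the frames of an utterance; the function name and the toy values are hypothetical.

```python
import statistics


def normalize_mfcc(frames):
    """Z-score each MFCC coefficient across frames (assumed normalization scheme).

    frames: list of equal-length lists, one list of coefficients per frame.
    """
    columns = list(zip(*frames))
    means = [statistics.fmean(col) for col in columns]
    # A constant coefficient has zero deviation; fall back to 1.0 to avoid division by zero.
    stds = [statistics.pstdev(col) or 1.0 for col in columns]
    return [
        [(value - m) / s for value, m, s in zip(frame, means, stds)]
        for frame in frames
    ]


if __name__ == "__main__":
    toy_frames = [[1.0, 10.0], [3.0, 10.0], [5.0, 10.0]]  # 3 frames, 2 coefficients
    for row in normalize_mfcc(toy_frames):
        print(row)
```

After this step every coefficient has zero mean and, unless it was constant, unit variance, so classifiers that are sensitive to feature scale see comparable ranges across coefficients.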