• Title/Summary/Keyword: Cross-Validation Approach

Search Result 130, Processing Time 0.026 seconds

Global Big Data Analysis Exploring the Determinants of Application Ratings: Evidence from the Google Play Store

  • Seo, Min-Kyo;Yang, Oh-Suk;Yang, Yoon-Ho
    • Journal of Korea Trade
    • /
    • v.24 no.7
    • /
    • pp.1-28
    • /
    • 2020
  • Purpose - This paper empirically investigates the predictors and main determinants of consumers' ratings of mobile applications in the Google Play Store. Using a linear and nonlinear model comparison to identify the function of users' review, in determining application rating across countries, this study estimates the direct effects of users' reviews on the application rating. In addition, extending our modelling into a sentimental analysis, this paper also aims to explore the effects of review polarity and subjectivity on the application rating, followed by an examination of the moderating effect of user reviews on the polarity-rating and subjectivity-rating relationships. Design/methodology - Our empirical model considers nonlinear association as well as linear causality between features and targets. This study employs competing theoretical frameworks - multiple regression, decision-tree and neural network models - to identify the predictors and main determinants of app ratings, using data from the Google Play Store. Using a cross-validation method, our analysis investigates the direct and moderating effects of predictors and main determinants of application ratings in a global app market. Findings - The main findings of this study can be summarized as follows: the number of user's review is positively associated with the ratings of a given app and it positively moderates the polarity-rating relationship. Applying the review polarity measured by a sentimental analysis to the modelling, it was found that the polarity is not significantly associated with the rating. This result best applies to the function of both positive and negative reviews in playing a word-of-mouth role, as well as serving as a channel for communication, leading to product innovation. Originality/value - Applying a proxy measured by binomial figures, previous studies have predominantly focused on positive and negative sentiment in examining the determinants of app ratings, assuming that they are significantly associated. Given the constraints to measurement of sentiment in current research, this paper employs sentimental analysis to measure the real integer for users' polarity and subjectivity. This paper also seeks to compare the suitability of three distinct models - linear regression, decision-tree and neural network models. Although a comparison between methodologies has long been considered important to the empirical approach, it has hitherto been underexplored in studies on the app market.

Dynamic Bayesian Network based Two-Hand Gesture Recognition (동적 베이스망 기반의 양손 제스처 인식)

  • Suk, Heung-Il;Sin, Bong-Kee
    • Journal of KIISE:Software and Applications
    • /
    • v.35 no.4
    • /
    • pp.265-279
    • /
    • 2008
  • The idea of using hand gestures for human-computer interaction is not new and has been studied intensively during the last dorado with a significant amount of qualitative progress that, however, has been short of our expectations. This paper describes a dynamic Bayesian network or DBN based approach to both two-hand gestures and one-hand gestures. Unlike wired glove-based approaches, the success of camera-based methods depends greatly on the image processing and feature extraction results. So the proposed method of DBN-based inference is preceded by fail-safe steps of skin extraction and modeling, and motion tracking. Then a new gesture recognition model for a set of both one-hand and two-hand gestures is proposed based on the dynamic Bayesian network framework which makes it easy to represent the relationship among features and incorporate new information to a model. In an experiment with ten isolated gestures, we obtained the recognition rate upwards of 99.59% with cross validation. The proposed model and the related approach are believed to have a strong potential for successful applications to other related problems such as sign languages.

Structural performance of timber frame joints - Full scale tests and numerical validation

  • Aejaz, S.A.;Dar, A.R.;Bhat, J.A.
    • Structural Engineering and Mechanics
    • /
    • v.74 no.4
    • /
    • pp.457-470
    • /
    • 2020
  • The force resisting ability of a connection has direct implications on the overall response of a timber framed structure to various actions, thereby governing the integrity and safety of such constructions. The behavior of timber framed structures has been studied by many researchers by testing full-scale-connections in timber frames so as to establish consistent design provisions on the same. However, much emphasis in this approach has been unidirectional, that has focused on a particular connection configuration, with no research output stressing on the refinement of the existing connection details in order to optimize their performance. In this regard, addition of adhesive to dowelled timber connections is an economically effective technique that has a potential to improve their performance. Therefore, a comparative study to evaluate the performance of various full-scale timber frame Nailed connections (Bridled Tenon, Cross Halved, Dovetail Halved and Mortise Tenon) supplemented by adhesive with respect to Nailed-Only counterparts under tensile loading has been investigated in this paper. The load-deformation values measured have been used to calculate stiffness, load capacity and ductility in both the connection forms (with and without adhesion) which in turn have been compared to other configurations along with the observed failure modes. The observed load capacity of the tested models has also been compared to the design strengths predicted by National Design Specifications (NDS-2018) for timber construction. Additionally, the experimental behavior was validated by developing non-linear finite element models in ABAQUS. All the results showed incorporation of adhesive to be an efficient and an economical technique in significantly enhancing the performance of various timber nailed connections under tensile action. Thus, this research is novel in a sense that it not only explores the tensile behavior of different nailed joint configurations common in timber construction but also stresses on improvising the same in a logical manner hence making it distinctive in its approach.

Image Quality Assessment by Combining Masking Texture and Perceptual Color Difference Model

  • Tang, Zhisen;Zheng, Yuanlin;Wang, Wei;Liao, Kaiyang
    • KSII Transactions on Internet and Information Systems (TIIS)
    • /
    • v.14 no.7
    • /
    • pp.2938-2956
    • /
    • 2020
  • Objective image quality assessment (IQA) models have been developed by effective features to imitate the characteristics of human visual system (HVS). Actually, HVS is extremely sensitive to color degradation and complex texture changes. In this paper, we firstly reveal that many existing full reference image quality assessment (FR-IQA) methods can hardly measure the image quality with contrast and masking texture changes. To solve this problem, considering texture masking effect, we proposed a novel FR-IQA method, called Texture and Color Quality Index (TCQI). The proposed method considers both in the masking effect texture and color visual perceptual threshold, which adopts three kinds of features to reflect masking texture, color difference and structural information. Furthermore, random forest (RF) is used to address the drawbacks of existing pooling technologies. Compared with other traditional learning-based tools (support vector regression and neural network), RF can achieve the better prediction performance. Experiments conducted on five large-scale databases demonstrate that our approach is highly consistent with subjective perception, outperforms twelve the state-of-the-art IQA models in terms of prediction accuracy and keeps a moderate computational complexity. The cross database validation also validates our approach achieves the ability to maintain high robustness.

Prediction of the turning and zig-zag maneuvering performance of a surface combatant with URANS

  • Duman, Suleyman;Bal, Sakir
    • Ocean Systems Engineering
    • /
    • v.7 no.4
    • /
    • pp.435-460
    • /
    • 2017
  • The main objective of this study is to investigate the turning and zig-zag maneuvering performance of the well-known naval surface combatant DTMB (David Taylor Model Basin) 5415 hull with URANS (Unsteady Reynolds-averaged Navier-Stokes) method. Numerical simulations of static drift tests have been performed by a commercial RANS solver based on a finite volume method (FVM) in an unsteady manner. The fluid flow is considered as 3-D, incompressible and fully turbulent. Hydrodynamic analyses have been carried out for a fixed Froude number 0.28. During the analyses, the free surface effects have been taken into account using VOF (Volume of Fluid) method and the hull is considered as fixed. First, the code has been validated with the available experimental data in literature. After validation, static drift, static rudder and drift and rudder tests have been simulated. The forces and moments acting on the hull have been computed with URANS approach. Numerical results have been applied to determine the hydrodynamic maneuvering coefficients, such as, velocity terms and rudder terms. The acceleration, angular velocity and cross-coupled terms have been taken from the available experimental data. A computer program has been developed to apply a fast maneuvering simulation technique. Abkowitz's non-linear mathematical model has been used to calculate the forces and moment acting on the hull during the maneuvering motion. Euler method on the other hand has been applied to solve the simultaneous differential equations. Turning and zig-zag maneuvering simulations have been carried out and the maneuvering characteristics have been determined and the numerical simulation results have been compared with the available data in literature. In addition, viscous effects have been investigated using Eulerian approach for several static drift cases.

Reliability-based combined high and low cycle fatigue analysis of turbine blade using adaptive least squares support vector machines

  • Ma, Juan;Yue, Peng;Du, Wenyi;Dai, Changping;Wriggers, Peter
    • Structural Engineering and Mechanics
    • /
    • v.83 no.3
    • /
    • pp.293-304
    • /
    • 2022
  • In this work, a novel reliability approach for combined high and low cycle fatigue (CCF) estimation is developed by combining active learning strategy with least squares support vector machines (LS-SVM) (named as ALS-SVM) surrogate model to address the multi-resources uncertainties, including working loads, material properties and model itself. Initially, a new active learner function combining LS-SVM approach with Monte Carlo simulation (MCS) is presented to improve computational efficiency with fewer calls to the performance function. To consider the uncertainty of surrogate model at candidate sample points, the learning function employs k-fold cross validation method and introduces the predicted variance to sequentially select sampling. Following that, low cycle fatigue (LCF) loads and high cycle fatigue (HCF) loads are firstly estimated based on the training samples extracted from finite element (FE) simulations, and their simulated responses together with the sample points of model parameters in Coffin-Manson formula are selected as the MC samples to establish ALS-SVM model. In this analysis, the MC samples are substituted to predict the CCF reliability of turbine blades by using the built ALS-SVM model. Through the comparison of the two approaches, it is indicated that the reliability model by linear cumulative damage rule provides a non-conservative result compared with that by the proposed one. In addition, the results demonstrate that ALS-SVM is an effective analysis method holding high computational efficiency with small training samples to gain accurate fatigue reliability.

An In Silico Drug Repositioning Strategy to Identify Specific STAT-3 Inhibitors for Breast Cancer

  • Sruthy Sathish
    • Journal of Integrative Natural Science
    • /
    • v.16 no.4
    • /
    • pp.123-131
    • /
    • 2023
  • Breast cancer continues to pose a substantial worldwide health challenge, thereby requiring the development of innovative strategies to discover new therapeutic interventions. Signal Transducer and Activator of Transcription 3 (STAT-3) has been identified as a significant factor in the development of several types of cancer, including breast cancer. This is primarily attributed to its diverse functions in promoting tumour formation and conferring resistance to therapeutic interventions. This study presents an in silico drug repositioning approach that focuses on identifying specific inhibitors of STAT-3 for the purpose of treating breast cancer. We initially examined the structural and functional attributes of STAT-3, thereby elucidating its crucial involvement in cellular signalling cascades. A comprehensive virtual screening was performed on a diverse collection of drugs that have been approved by the FDA from zinc15 database. Various computational techniques, including molecular docking, cross docking, and cDFT analysis, were utilised in order to prioritise potential candidates. This prioritisation was based on their predicted binding energies and outer molecular orbital reactivity. The findings of our study have unveiled a Dihydroergotamine and Paritaprevir that have been approved by the FDA and exhibit considerable promise as selective inhibitors of STAT-3. In conclusion, the utilisation of our in silico drug repositioning approach presents a prompt and economically efficient method for the identification of potential compounds that warrant subsequent experimental validation as selective STAT-3 inhibitors in the context of breast cancer. The present study highlights the considerable potential of employing computational strategies to expedite the drug discovery process. Moreover, it provides valuable insights into novel avenues for targeted therapeutic interventions in the context of breast cancer treatment.

Sentiment Analysis of Product Reviews to Identify Deceptive Rating Information in Social Media: A SentiDeceptive Approach

  • Marwat, M. Irfan;Khan, Javed Ali;Alshehri, Dr. Mohammad Dahman;Ali, Muhammad Asghar;Hizbullah;Ali, Haider;Assam, Muhammad
    • KSII Transactions on Internet and Information Systems (TIIS)
    • /
    • v.16 no.3
    • /
    • pp.830-860
    • /
    • 2022
  • [Introduction] Nowadays, many companies are shifting their businesses online due to the growing trend among customers to buy and shop online, as people prefer online purchasing products. [Problem] Users share a vast amount of information about products, making it difficult and challenging for the end-users to make certain decisions. [Motivation] Therefore, we need a mechanism to automatically analyze end-user opinions, thoughts, or feelings in the social media platform about the products that might be useful for the customers to make or change their decisions about buying or purchasing specific products. [Proposed Solution] For this purpose, we proposed an automated SentiDecpective approach, which classifies end-user reviews into negative, positive, and neutral sentiments and identifies deceptive crowd-users rating information in the social media platform to help the user in decision-making. [Methodology] For this purpose, we first collected 11781 end-users comments from the Amazon store and Flipkart web application covering distant products, such as watches, mobile, shoes, clothes, and perfumes. Next, we develop a coding guideline used as a base for the comments annotation process. We then applied the content analysis approach and existing VADER library to annotate the end-user comments in the data set with the identified codes, which results in a labelled data set used as an input to the machine learning classifiers. Finally, we applied the sentiment analysis approach to identify the end-users opinions and overcome the deceptive rating information in the social media platforms by first preprocessing the input data to remove the irrelevant (stop words, special characters, etc.) data from the dataset, employing two standard resampling approaches to balance the data set, i-e, oversampling, and under-sampling, extract different features (TF-IDF and BOW) from the textual data in the data set and then train & test the machine learning algorithms by applying a standard cross-validation approach (KFold and Shuffle Split). [Results/Outcomes] Furthermore, to support our research study, we developed an automated tool that automatically analyzes each customer feedback and displays the collective sentiments of customers about a specific product with the help of a graph, which helps customers to make certain decisions. In a nutshell, our proposed sentiments approach produces good results when identifying the customer sentiments from the online user feedbacks, i-e, obtained an average 94.01% precision, 93.69% recall, and 93.81% F-measure value for classifying positive sentiments.

Analysis of the Impact of Satellite Remote Sensing Information on the Prediction Performance of Ungauged Basin Stream Flow Using Data-driven Models (인공위성 원격 탐사 정보가 자료 기반 모형의 미계측 유역 하천유출 예측성능에 미치는 영향 분석)

  • Seo, Jiyu;Jung, Haeun;Won, Jeongeun;Choi, Sijung;Kim, Sangdan
    • Journal of Wetlands Research
    • /
    • v.26 no.2
    • /
    • pp.147-159
    • /
    • 2024
  • Lack of streamflow observations makes model calibration difficult and limits model performance improvement. Satellite-based remote sensing products offer a new alternative as they can be actively utilized to obtain hydrological data. Recently, several studies have shown that artificial intelligence-based solutions are more appropriate than traditional conceptual and physical models. In this study, a data-driven approach combining various recurrent neural networks and decision tree-based algorithms is proposed, and the utilization of satellite remote sensing information for AI training is investigated. The satellite imagery used in this study is from MODIS and SMAP. The proposed approach is validated using publicly available data from 25 watersheds. Inspired by the traditional regionalization approach, a strategy is adopted to learn one data-driven model by integrating data from all basins, and the potential of the proposed approach is evaluated by using a leave-one-out cross-validation regionalization setting to predict streamflow from different basins with one model. The GRU + Light GBM model was found to be a suitable model combination for target basins and showed good streamflow prediction performance in ungauged basins (The average model efficiency coefficient for predicting daily streamflow in 25 ungauged basins is 0.7187) except for the period when streamflow is very small. The influence of satellite remote sensing information was found to be up to 10%, with the additional application of satellite information having a greater impact on streamflow prediction during low or dry seasons than during wet or normal seasons.

A Study on Forecasting Accuracy Improvement of Case Based Reasoning Approach Using Fuzzy Relation (퍼지 관계를 활용한 사례기반추론 예측 정확성 향상에 관한 연구)

  • Lee, In-Ho;Shin, Kyung-Shik
    • Journal of Intelligence and Information Systems
    • /
    • v.16 no.4
    • /
    • pp.67-84
    • /
    • 2010
  • In terms of business, forecasting is a work of what is expected to happen in the future to make managerial decisions and plans. Therefore, the accurate forecasting is very important for major managerial decision making and is the basis for making various strategies of business. But it is very difficult to make an unbiased and consistent estimate because of uncertainty and complexity in the future business environment. That is why we should use scientific forecasting model to support business decision making, and make an effort to minimize the model's forecasting error which is difference between observation and estimator. Nevertheless, minimizing the error is not an easy task. Case-based reasoning is a problem solving method that utilizes the past similar case to solve the current problem. To build the successful case-based reasoning models, retrieving the case not only the most similar case but also the most relevant case is very important. To retrieve the similar and relevant case from past cases, the measurement of similarities between cases is an important key factor. Especially, if the cases contain symbolic data, it is more difficult to measure the distances. The purpose of this study is to improve the forecasting accuracy of case-based reasoning approach using fuzzy relation and composition. Especially, two methods are adopted to measure the similarity between cases containing symbolic data. One is to deduct the similarity matrix following binary logic(the judgment of sameness between two symbolic data), the other is to deduct the similarity matrix following fuzzy relation and composition. This study is conducted in the following order; data gathering and preprocessing, model building and analysis, validation analysis, conclusion. First, in the progress of data gathering and preprocessing we collect data set including categorical dependent variables. Also, the data set gathered is cross-section data and independent variables of the data set include several qualitative variables expressed symbolic data. The research data consists of many financial ratios and the corresponding bond ratings of Korean companies. The ratings we employ in this study cover all bonds rated by one of the bond rating agencies in Korea. Our total sample includes 1,816 companies whose commercial papers have been rated in the period 1997~2000. Credit grades are defined as outputs and classified into 5 rating categories(A1, A2, A3, B, C) according to credit levels. Second, in the progress of model building and analysis we deduct the similarity matrix following binary logic and fuzzy composition to measure the similarity between cases containing symbolic data. In this process, the used types of fuzzy composition are max-min, max-product, max-average. And then, the analysis is carried out by case-based reasoning approach with the deducted similarity matrix. Third, in the progress of validation analysis we verify the validation of model through McNemar test based on hit ratio. Finally, we draw a conclusion from the study. As a result, the similarity measuring method using fuzzy relation and composition shows good forecasting performance compared to the similarity measuring method using binary logic for similarity measurement between two symbolic data. But the results of the analysis are not statistically significant in forecasting performance among the types of fuzzy composition. The contributions of this study are as follows. We propose another methodology that fuzzy relation and fuzzy composition could be applied for the similarity measurement between two symbolic data. That is the most important factor to build case-based reasoning model.