• Title/Summary/Keyword: inference model

Search Result 1,167, Processing Time 0.027 seconds

Causal Inference Network of Genes Related with Bone Metastasis of Breast Cancer and Osteoblasts Using Causal Bayesian Networks

  • Park, Sung Bae;Chung, Chun Kee;Gonzalez, Efrain;Yoo, Changwon
    • Journal of Bone Metabolism
    • /
    • v.25 no.4
    • /
    • pp.251-266
    • /
    • 2018
  • Background: The causal networks among genes that are commonly expressed in osteoblasts and during bone metastasis (BM) of breast cancer (BC) are not well understood. Here, we developed a machine learning method to obtain a plausible causal network of genes that are commonly expressed during BM and in osteoblasts in BC. Methods: We selected BC genes that are commonly expressed during BM and in osteoblasts from the Gene Expression Omnibus database. Bayesian Network Inference with Java Objects (Banjo) was used to obtain the Bayesian network. Genes registered as BC related genes were included as candidate genes in the implementation of Banjo. Next, we obtained the Bayesian structure and assessed the prediction rate for BM, conditional independence among nodes, and causality among nodes. Furthermore, we reported the maximum relative risks (RRs) of combined gene expression of the genes in the model. Results: We mechanistically identified 33 significantly related and plausibly involved genes in the development of BC BM. Further model evaluations showed that 16 genes were enough for a model to be statistically significant in terms of maximum likelihood of the causal Bayesian networks (CBNs) and for correct prediction of BM of BC. Maximum RRs of combined gene expression patterns showed that the expression levels of UBIAD1, HEBP1, BTNL8, TSPO, PSAT1, and ZFP36L2 significantly affected development of BM from BC. Conclusions: The CBN structure can be used as a reasonable inference network for accurately predicting BM in BC.

A study on improving self-inference performance through iterative retraining of false positives of deep-learning object detection in tunnels (터널 내 딥러닝 객체인식 오탐지 데이터의 반복 재학습을 통한 자가 추론 성능 향상 방법에 관한 연구)

  • Kyu Beom Lee;Hyu-Soung Shin
    • Journal of Korean Tunnelling and Underground Space Association
    • /
    • v.26 no.2
    • /
    • pp.129-152
    • /
    • 2024
  • In the application of deep learning object detection via CCTV in tunnels, a large number of false positive detections occur due to the poor environmental conditions of tunnels, such as low illumination and severe perspective effect. This problem directly impacts the reliability of the tunnel CCTV-based accident detection system reliant on object detection performance. Hence, it is necessary to reduce the number of false positive detections while also enhancing the number of true positive detections. Based on a deep learning object detection model, this paper proposes a false positive data training method that not only reduces false positives but also improves true positive detection performance through retraining of false positive data. This paper's false positive data training method is based on the following steps: initial training of a training dataset - inference of a validation dataset - correction of false positive data and dataset composition - addition to the training dataset and retraining. In this paper, experiments were conducted to verify the performance of this method. First, the optimal hyperparameters of the deep learning object detection model to be applied in this experiment were determined through previous experiments. Then, in this experiment, training image format was determined, and experiments were conducted sequentially to check the long-term performance improvement through retraining of repeated false detection datasets. As a result, in the first experiment, it was found that the inclusion of the background in the inferred image was more advantageous for object detection performance than the removal of the background excluding the object. In the second experiment, it was found that retraining by accumulating false positives from each level of retraining was more advantageous than retraining independently for each level of retraining in terms of continuous improvement of object detection performance. After retraining the false positive data with the method determined in the two experiments, the car object class showed excellent inference performance with an AP value of 0.95 or higher after the first retraining, and by the fifth retraining, the inference performance was improved by about 1.06 times compared to the initial inference. And the person object class continued to improve its inference performance as retraining progressed, and by the 18th retraining, it showed that it could self-improve its inference performance by more than 2.3 times compared to the initial inference.

Building a Traffic Accident Frequency Prediction Model at Unsignalized Intersections in Urban Areas by Using Adaptive Neuro-Fuzzy Inference System (적응 뉴로-퍼지를 이용한 도시부 비신호교차로 교통사고예측모형 구축)

  • Kim, Kyung Whan;Kang, Jung Hyun;Kang, Jong Ho
    • KSCE Journal of Civil and Environmental Engineering Research
    • /
    • v.32 no.2D
    • /
    • pp.137-145
    • /
    • 2012
  • According to the National Police Agency, the total number of traffic accidents which occurred in 2010 was 226,878. Intersection accidents accounts for 44.8%, the largest portion of the entire traffic accidents. An research on the signalized intersection is constantly made, while an research on the unsignalized intersection is yet insufficient. This study selected traffic volume, road width, and sight distance as the input variables which affect unsignalized intersection accidents, and number of accidents as the output variable to build a model using ANFIS(Adaptive Neuro-Fuzzy Inference System). The forecast performance of this model is evaluated by comparing the actual measurement value with the forecasted value. The compatibility is evaluated by R2, the coefficient of determination, along with Mean Absolute Error (MAE) and Mean Square Error (MSE), the indicators which represent the degree of error and distribution. The result shows that the $R^2$ is 0.9817, while MAE and MSE are 0.4773 and 0.3037 respectively, which means that the explanatory power of the model is quite decent. This study is expected to provide the basic data for establishment of safety measure for unsignalized intersection and the improvement of traffic accidents.

Analysis of Access Authorization Conflict for Partial Information Hiding of RDF Web Document (RDF 웹 문서의 부분적인 정보 은닉과 관련한 접근 권한 충돌 문제의 분석)

  • Kim, Jae-Hoon;Park, Seog
    • Journal of the Korea Institute of Information Security & Cryptology
    • /
    • v.18 no.2
    • /
    • pp.49-63
    • /
    • 2008
  • RDF is the base ontology model which is used in Semantic Web defined by W3C. OWL expands the RDF base model by providing various vocabularies for defining much more ontology relationships. Recently Jain and Farkas have suggested an RDF access control model based on RDF triple. Their research point is to introduce an authorization conflict problem by RDF inference which must be considered in RDF ontology data. Due to the problem, we cannot adopt XML access control model for RDF, although RDF is represented by XML. However, Jain and Farkas did not define the authorization propagation over the RDF upper/lower ontology concepts when an RDF authorization is specified. The reason why the authorization specification should be defined clearly is that finally, the authorizatin conflict is the problem between the authorization propagation in specifying an authorization and the authorization propagation in inferencing authorizations. In this article, first we define an RDF access authorization specification based on RDF triple in detail. Next, based on the definition, we analyze the authoriztion conflict problem by RDF inference in detail. Next, we briefly introduce a method which can quickly find an authorization conflict by using graph labeling techniques. This method is especially related with the subsumption relationship based inference. Finally, we present a comparison analysis with Jain and Farkas' study, and some experimental results showing the efficiency of the suggested conflict detection method.

Survival Analysis for White Non-Hispanic Female Breast Cancer Patients

  • Khan, Hafiz Mohammad Rafiqullah;Saxena, Anshul;Gabbidon, Kemesha;Stewart, Tiffanie Shauna-Jeanne;Bhatt, Chintan
    • Asian Pacific Journal of Cancer Prevention
    • /
    • v.15 no.9
    • /
    • pp.4049-4054
    • /
    • 2014
  • Background: Race and ethnicity are significant factors in predicting survival time of breast cancer patients. In this study, we applied advanced statistical methods to predict the survival of White non-Hispanic female breast cancer patients, who were diagnosed between the years 1973 and 2009 in the United States (U.S.). Materials and Methods: Demographic data from the Surveillance Epidemiology and End Results (SEER) database were used for the purpose of this study. Nine states were randomly selected from 12 U.S. cancer registries. A stratified random sampling method was used to select 2,000 female breast cancer patients from these nine states. We compared four types of advanced statistical probability models to identify the best-fit model for the White non-Hispanic female breast cancer survival data. Three model building criterion were used to measure and compare goodness of fit of the models. These include Akaike Information Criteria (AIC), Bayesian Information Criteria (BIC), and Deviance Information Criteria (DIC). In addition, we used a novel Bayesian method and the Markov Chain Monte Carlo technique to determine the posterior density function of the parameters. After evaluating the model parameters, we selected the model having the lowest DIC value. Using this Bayesian method, we derived the predictive survival density for future survival time and its related inferences. Results: The analytical sample of White non-Hispanic women included 2,000 breast cancer cases from the SEER database (1973-2009). The majority of cases were married (55.2%), the mean age of diagnosis was 63.61 years (SD = 14.24) and the mean survival time was 84 months (SD = 35.01). After comparing the four statistical models, results suggested that the exponentiated Weibull model (DIC= 19818.220) was a better fit for White non-Hispanic females' breast cancer survival data. This model predicted the survival times (in months) for White non-Hispanic women after implementation of precise estimates of the model parameters. Conclusions: By using modern model building criteria, we determined that the data best fit the exponentiated Weibull model. We incorporated precise estimates of the parameter into the predictive model and evaluated the survival inference for the White non-Hispanic female population. This method of analysis will assist researchers in making scientific and clinical conclusions when assessing survival time of breast cancer patients.

A Hierarchical Bayesian Model for Survey Data with Nonresponse

  • Han, Geunshik
    • Journal of the Korean Statistical Society
    • /
    • v.30 no.3
    • /
    • pp.435-451
    • /
    • 2001
  • We describe a hierarchical bayesian model to analyze multinomial nonignorable nonresponse data. Using a Dirichlet and beta prior to model the cell probabilities, We develop a complete hierarchical bayesian analysis for multinomial proportions without making any algebraic approximation. Inference is sampling based and Markove chain Monte Carlo methods are used to perform the computations. We apply our method to the dta on body mass index(BMI) and show the model works reasonably well.

  • PDF

A Closed-Form Bayesian Inferences for Multinomial Randomized Response Model

  • Heo, Tae-Young;Kim, Jong-Min
    • Communications for Statistical Applications and Methods
    • /
    • v.14 no.1
    • /
    • pp.121-131
    • /
    • 2007
  • In this paper, we examine the problem of estimating the sensitive characteristics and behaviors in a multinomial randomized response model using Bayesian approach. We derived a posterior distribution for parameter of interest for multinomial randomized response model. Based on the posterior distribution, we also calculated a credible intervals and mean squared error (MSE). We finally compare the maximum likelihood estimator and the Bayes estimator in terms of MSE.

Interval Estimation for Sum of Variance Components in a Simple Linear Regression Model with Unbalanced Nested Error Structure

  • Park, Dong-Joon
    • Communications for Statistical Applications and Methods
    • /
    • v.10 no.2
    • /
    • pp.361-370
    • /
    • 2003
  • Those who are interested in making inferences concerning linear combination of valiance components in a simple linear regression model with unbalanced nested error structure can use the confidence intervals proposed in this paper. Two approximate confidence intervals for the sum of two variance components in the model are proposed. Simulation study is peformed to compare the methods. The methods are applied to a numerical example and recommendations are given for choosing a proper interval.

Inference of Parameters for Superposition with Goel-Okumoto model and Weibull model Using Gibbs Sampler

  • Heecheul Kim
    • Communications for Statistical Applications and Methods
    • /
    • v.6 no.1
    • /
    • pp.169-180
    • /
    • 1999
  • A Markov Chain Monte Carlo method with development of computation is used to be the software system reliability probability model. For Bayesian estimator considering computational problem and theoretical justification we studies relation Markov Chain with Gibbs sampling. Special case of GOS with Superposition for Goel-Okumoto and Weibull models using Gibbs sampling and Metropolis algorithm considered. In this paper discuss Bayesian computation and model selection using posterior predictive likelihood criterion. We consider in this paper data using method by Cox-Lewis. A numerical example with a simulated data set is given.

  • PDF

Fuzzy regression using regularlization method based on Tanaka's model

  • Hong Dug-Hun;Kim Kyung-Tae
    • Journal of the Korean Institute of Intelligent Systems
    • /
    • v.16 no.4
    • /
    • pp.499-505
    • /
    • 2006
  • Regularlization approach to regression can be easily found in Statistics and Information Science literature. The technique of regularlization was introduced as a way of controlling the smoothness properties of regression function. In this paper, we have presented a new method to evaluate linear and non-linear fuzzy regression model based on Tanaka's model using the idea of regularlization technique. Especially this method is a very attractive approach to model non -linear fuzzy data.