• Title/Summary/Keyword: Model-Log Comparison

Search Result 81, Processing Time 0.024 seconds

Modeling of Breast Cancer Prognostic Factors Using a Parametric Log-Logistic Model in Fars Province, Southern Iran

  • Zare, Najaf;Doostfatemeh, Marzieh;Rezaianzadeh, Abass
    • Asian Pacific Journal of Cancer Prevention
    • /
    • v.13 no.4
    • /
    • pp.1533-1537
    • /
    • 2012
  • In general, breast cancer is the most common malignancy among women in developed as well as some developing countries, often being the second leading cause of cancer mortality after lung cancer. Using a parametric log-logistic model to consider the effects of prognostic factors, the present study focused on the 5-year survival of women with the diagnosis of breast cancer in Southern Iran. A total of 1,148 women who were diagnosed with primary invasive breast cancer from January 2001 to January 2005 were included and divided into three prognosis groups: poor, medium, and good. The survival times as well as the hazard rates of the three different groups were compared. The log-logistic model was employed as the best parametric model which could explain survival times. The hazard rates of the poor and the medium prognosis groups were respectively 13 and 3 times greater than in the good prognosis group. Also, the difference between the overall survival rates of the poor and the medium prognosis groups was highly significant in comparison to the good prognosis group. Use of the parametric log-logistic model - also a proportional odds model - allowed assessment of the natural process of the disease based on hazard and identification of trends.

Disjunctive Process Patterns Refinement and Probability Extraction from Workflow Logs

  • Kim, Kyoungsook;Ham, Seonghun;Ahn, Hyun;Kim, Kwanghoon Pio
    • Journal of Internet Computing and Services
    • /
    • v.20 no.3
    • /
    • pp.85-92
    • /
    • 2019
  • In this paper, we extract the quantitative relation data of activities from the workflow event log file recorded in the XES standard format and connect them to rediscover the workflow process model. Extract the workflow process patterns and proportions with the rediscovered model. There are four types of control-flow elements that should be used to extract workflow process patterns and portions with log files: linear (sequential) routing, disjunctive (selective) routing, conjunctive (parallel) routing, and iterative routing patterns. In this paper, we focus on four of the factors, disjunctive routing, and conjunctive path. A framework implemented by the authors' research group extracts and arranges the activity data from the log and converts the iteration of duplicate relationships into a quantitative value. Also, for accurate analysis, a parallel process is recorded in the log file based on execution time, and algorithms for finding and eliminating information distortion are designed and implemented. With these refined data, we rediscover the workflow process model following the relationship between the activities. This series of experiments are conducted using the Large Bank Transaction Process Model provided by 4TU and visualizes the experiment process and results.

The Comparative Software Reliability Cost Model of Considering Shape Parameter (형상모수를 고려한 소프트웨어 신뢰성 비용 모형에 관한 비교 연구)

  • Kim, Kyung-Soo;Kim, Hee-Cheul
    • Journal of Digital Convergence
    • /
    • v.12 no.3
    • /
    • pp.219-226
    • /
    • 2014
  • In this study, reliability software cost model considering shape parameter based on life distribution from the process of software product testing was studied. The shape parameter using the Erlang and Log-logistic model that is widely used in the field of reliability problems presented. The software failure model was used finite failure non-homogeneous Poisson process model, the parameters estimation using maximum likelihood estimation was conducted. In comparison result of software cost model based on the Erlang distribution and the log-logistic distribution software cost model, because Erlang model is to predict the optimal release time can be software, but the log-logistic model to predict to optimal release time can not be, Erlang distribution than the log-logistic distribution appears to be effective. In this research, software developers to identify software development cost some extent be able to help is considered.

An Application of the Clustering Threshold Gradient Descent Regularization Method for Selecting Genes in Predicting the Survival Time of Lung Carcinomas

  • Lee, Seung-Yeoun;Kim, Young-Chul
    • Genomics & Informatics
    • /
    • v.5 no.3
    • /
    • pp.95-101
    • /
    • 2007
  • In this paper, we consider the variable selection methods in the Cox model when a large number of gene expression levels are involved with survival time. Deciding which genes are associated with survival time has been a challenging problem because of the large number of genes and relatively small sample size (n<

Comparison of Chlorine, Chlorine Dioxide and Ozone as Disinfectants in Drinking Water (정수소독공정에 이용되는 염소, 이산화염소, 오존 소독제의 비교, 고찰에 관한 연구)

  • Lee, Yoon-Jin;Lee, Sun-Jong;Lee, Dong-Chan;Kim, Hyun;Lee, Hwan;Lee, Cheol-Hyo;Nam, Sang-Ho
    • Journal of Environmental Health Sciences
    • /
    • v.28 no.3
    • /
    • pp.1-8
    • /
    • 2002
  • The experiments for the characterization of inactivation were performed in a series of batch processes with the total coliform as a general indicator organism based on chlorine, chlorine dioxide and ozone as disinfectants. The water sam-ples were taken from the outlet of settling basin in a conventional surface water treatment system that is provided with the raw water drawn from the mid-stream of the Han River. The inactivation of total coliform was experimentally ana-lyzed for the dose of disinfectant contact time, pH, Temperature and DOC. The nearly 2.4,3.0,3.9 log inactivation of total coliform killed by injecting 1 mか1 at 5 minutes for chlorine, chlorine dioxide and ozone. For the inactivation of 99.9%(3 log), Disinfectants required were 1.70, 1.00 and 0.60 mか1 for chlorine, chlorine dioxide and ozone, respec-tively. The higher the pH is, the poorer the disinfections effects are in the range of pH 6-9 by using chlorine and ozone. But the irfluence of pH value on killing effects of chlorine dioxide is weak. The parameters estimated by the models of Chick-Watson, Hom, and Selleck from our experimental data obtained for chlorine are: log(N/ $N_{0}$ )=-0.16 CT with n= 1, log(N/ $N_{0}$ )=-0.71 $C^{0.87}$T with n$\neq$1, for Chicks-Watson model, log (N/ $N_{0}$ )= -1.87 $C^{0.47}$ $T^{0.36}$ for Hom model. For chlorine dioxide are: log(N/ $N_{0}$ )= -1.53 CT with n = 1, log(N/ $N_{0}$ )= -2.29 $C^{0.94}$T with n$\neq$1,, for Chicks-Watson model, log(N/ $N_{0}$ )= -3.64 $C^{0.43}$ $T^{0.24}$ for Hom model and for ozone are: log(N/ $N_{0}$ )= -2.59 CT with n = 1, log(N/ $N_{0}$ )= -2.28 $C^{0.36}$T with n$\neq$1, for Chicks-Watson model, log(N/ $N_{0}$ )= -4.53 $C^{0.26}$ $T^{0.19}$ for Hom model.19/ for Hom model.

Comparison of accuracy between LC model and 4-PFM when COVID-19 impacts mortality structure

  • Choi, Janghoon
    • Communications for Statistical Applications and Methods
    • /
    • v.28 no.3
    • /
    • pp.233-250
    • /
    • 2021
  • This paper studies if the accuracies of mortality models (LC model vs. 4-parametric model) are aggravated if a mortality structure changes due to the impact of COVID-19. LC model (LCM) uses dimension reduction for fitting to the log mortality matrix so that the performance of the dimension reduction method may not be good when the matrix structure changes. On the other hand, 4-parametric factor model (4-PFM) is designed to use factors for fitting to log mortality data by age groups so that it would be less affected by the change of the mortality structure. In fact, the forecast accuracies of LCM are better than those of 4-PFM when life-tables are used whereas those of 4-PFM are better when the mortality structure changes. Thus this result shows that 4-PFM is more reliable in performance to the structural changes of the mortality. To support the accuracy changes of LCM the functional aspect is explained by computing eigenvalues produced by singular vector decomposition

Comparison of head-related transfer function models based on principal components analysis (주성분 분석법을 이용한 머리전달함수 모형화 기법의 성능 비교)

  • Hwang, Sung-Mok;Park, Young-Jin;Park, Youn-Sik
    • Proceedings of the Korean Society for Noise and Vibration Engineering Conference
    • /
    • 2008.04a
    • /
    • pp.920-927
    • /
    • 2008
  • This study deals with modeling of Head-Related Transfer Functions (HRTFs) using Principal Components Analysis (PCA) in the time and frequency domains. Four PCA models based on Head-Related Impulse Responses (HRIRs), complex-valued HRTFs, augmented HRTFs, and log-magnitudes of HRTFs are investigated. The objective of this study is to compare modeling performances of the PCA models in the least-squares sense and to show the theoretical relationship between the PCA models. In terms of the number of principal components needed for modeling, the PCA model based on HRIR or augmented HRTFs showed more efficient modeling performance than the PCA model based on complex-valued HRTFs. The PCA model based on HRIRs in the time domain and that based on augmented HRTFs in the frequency domain are shown to be theoretically equivalent. Modeling performance of the PCA model based on log-magnitudes of HRTFs cannot be compared with that of other PCA models because the PCA model deals with log-scaled magnitude components only, whereas the other PCA models consider both magnitude and phase components in linear scale.

  • PDF

Empirical Comparisons of Disparity Measures for Partial Association Models in Three Dimensional Contingency Tables

  • Jeong, D.B.;Hong, C.S.;Yoon, S.H.
    • Communications for Statistical Applications and Methods
    • /
    • v.10 no.1
    • /
    • pp.135-144
    • /
    • 2003
  • This work is concerned with comparison of the recently developed disparity measures for the partial association model in three dimensional categorical data. Data are generated by using simulation on each term in the log-linear model equation based on the partial association model, which is a proposed method in this paper. This alternative Monte Carlo methods are explored to study the behavior of disparity measures such as the power divergence statistic I(λ), the Pearson chi-square statistic X$^2$, the likelihood ratio statistic G$^2$, the blended weight chi-square statistic BWCS(λ), the blended weight Hellinger distance statistic BWHD(λ), and the negative exponential disparity statistic NED(λ) for moderate sample sizes. We find that the power divergence statistic I(2/3) and the blended weight Hellinger distance family BWHD(1/9) are the best tests with respect to size and power.

Is it Possible to Predict the ADI of Pesticides using the QSAR Approach?

  • Kim, Jae Hyoun
    • Journal of Environmental Health Sciences
    • /
    • v.38 no.6
    • /
    • pp.550-560
    • /
    • 2012
  • Objectives: QSAR methodology was applied to explain two different sets of acceptable daily intake (ADI) data of 74 pesticides proposed by both the USEPA and WHO in terms of setting guidelines for food and drinking water. Methods: A subset of calculated descriptors was selected from Dragon$^{(R)}$ software. QSARs were then developed utilizing a statistical technique, genetic algorithm-multiple linear regression (GA-MLR). The differences in each specific model in the prediction of the ADI of the pesticides were discussed. Results: The stepwise multiple linear regression analysis resulted in a statistically significant QSAR model with five descriptors. Resultant QSAR models were robust, showing good utility across multiple classes of pesticide compounds. The applicability domain was also defined. The proposed models were robust and satisfactory. Conclusions: The QSAR model could be a feasible and effective tool for predicting ADI and for the comparison of logADIEPA to logADIWHO. The statistical results agree with the fact that USEPA focuses on more subtle endpoints than does WHO.

Appropriate identification of optimum number of hidden states for identification of extreme rainfall using Hidden Markov Model: Case study in Colombo, Sri Lanka

  • Chandrasekara, S.S.K.;Kwon, Hyun-Han
    • Proceedings of the Korea Water Resources Association Conference
    • /
    • 2019.05a
    • /
    • pp.390-390
    • /
    • 2019
  • Application of Hidden Markov Model (HMM) to the hydrological time series would be an innovative way to identify extreme rainfall events in a series. Even though the optimum number of hidden states can be identify based on maximizing the log-likelihood or minimizing Bayesian information criterion. However, occasionally value for the log-likelihood keep increasing with the state which gives false identification of the optimum hidden state. Therefore, this study attempts to identify optimum number of hidden states for Colombo station, Sri Lanka as fundamental approach to identify frequency and percentage of extreme rainfall events for the station. Colombo station consisted of daily rainfall values between 1961 and 2015. The representative station is located at the wet zone of Sri Lanka where the major rainfall season falls on May to September. Therefore, HMM was ran for the season of May to September between 1961 and 2015. Results showed more or less similar log-likelihood which could be identified as maximum for states between 4 to 7. Therefore, measure of central tendency (i.e. mean, median, mode, standard deviation, variance and auto-correlation) for observed and simulated daily rainfall series was carried to each state to identify optimum state which could give statistically compatible results. Further, the method was applied for the second major rainfall season (i.e. October to February) for the same station as a comparison.

  • PDF