• Title/Summary/Keyword: Probability and statistics

Search Result 1,183, Processing Time 0.035 seconds

Model-Based Survival Estimates of Female Breast Cancer Data

  • Khan, Hafiz Mohammad Rafiqullah;Saxena, Anshul;Gabbidon, Kemesha;Rana, Sagar;Ahmed, Nasar Uddin
    • Asian Pacific Journal of Cancer Prevention
    • /
    • v.15 no.6
    • /
    • pp.2893-2900
    • /
    • 2014
  • Background: Statistical methods are very important to precisely measure breast cancer patient survival times for healthcare management. Previous studies considered basic statistics to measure survival times without incorporating statistical modeling strategies. The objective of this study was to develop a data-based statistical probability model from the female breast cancer patients' survival times by using the Bayesian approach to predict future inferences of survival times. Materials and Methods: A random sample of 500 female patients was selected from the Surveillance Epidemiology and End Results cancer registry database. For goodness of fit, the standard model building criteria were used. The Bayesian approach is used to obtain the predictive survival times from the data-based Exponentiated Exponential Model. Markov Chain Monte Carlo method was used to obtain the summary results for predictive inference. Results: The highest number of female breast cancer patients was found in California and the lowest in New Mexico. The majority of them were married. The mean (SD) age at diagnosis (in years) was 60.92 (14.92). The mean (SD) survival time (in months) for female patients was 90.33 (83.10). The Exponentiated Exponential Model found better fits for the female survival times compared to the Exponentiated Weibull Model. The Bayesian method is used to obtain predictive inference for future survival times. Conclusions: The findings with the proposed modeling strategy will assist healthcare researchers and providers to precisely predict future survival estimates as the recent growing challenges of analyzing healthcare data have created new demand for model-based survival estimates. The application of Bayesian will produce precise estimates of future survival times.

Theoretical Analysis of MIMO Antenna Selection & Switching System to Spatial Channel Correlation using Channel Statistics (공간적 채널 상관도에 따른 통계적인 채널 특성을 이용한 다중 안테나 선택 및 스위칭 시스템의 성능 분석)

  • Lee Hakju;Park Seungil;Lee Chungyong;Park Hyuncheol;Hong Daesik
    • Journal of the Institute of Electronics Engineers of Korea TC
    • /
    • v.42 no.4 s.334
    • /
    • pp.15-20
    • /
    • 2005
  • Multi-Input, Multi-Output system suffers for the spatial channel correlation due to lack of spatial diversity. To overcome this defect, the antenna selection and switching system is proposed which selects the adequate antenna subset with highest channel diversity gain and switches the trasmission techniques according to channel environments. However. its performance analysis is insufficient due to the difficulty of modeling the spatial channel correlation. In this paper, the theoretical upper bound of symbol error probability is derived by using the statistical properties of Frobenius norm and minimum eigen-value of channel matrix. By computer simulation, it is shown that the derived theoretical upper bound is similar to the simulation results.

Decision Making from the 5th Grade' III-Structured Problem of Data Analysis (자료분석에 관한 비구조화된 문제해결모형 적용에서 나타난 초등학교 5학년 학생들의 의사결정에 관한 연구)

  • Kim, Min-Kyeong;Lee, Ji-Young;Hong, Jee-Yun;Joo, Hyun-Jung
    • Communications of Mathematical Education
    • /
    • v.26 no.2
    • /
    • pp.221-249
    • /
    • 2012
  • The purpose of this study is to investigate students decision-making progress through ill-structured problem solving process. For this study, 25 fifth graders in an elementary school were observed by applying ABCDE model (Analyze - Browse - Create - Decision making - Evaluate), and analyzed their decision-making progress analyzing framework which follows 3 steps - making their own decision, discussing/revising with peers, and lastly decision making/solving problem. Upper two groups with better performance in ill-structured problem solving model among 6 groups showed active discussion in group and decision making process with 3 steps (making their own decision, discussing/revising with peers). Even though their decisions are not good-fit to mathematical reasoning result, development and application of ill-structured problems would bring better ability of high level thinking and problem solving to students.

The Effect of Social Support on Youth Student-Athletes' Stress and Deviant Behavior (사회적지지가 청소년 운동선수들의 스트레스 및 일탈행동에 미치는 영향)

  • Kwon, Wook-Dong
    • Journal of the Korea Academia-Industrial cooperation Society
    • /
    • v.16 no.8
    • /
    • pp.5198-5206
    • /
    • 2015
  • The purpose of this study was to examine the effect of social support from friend, senior junior, and coach on student-athletes' stress and deviant behavior. By using convenience sampling method of non-probability sampling, a total of 350 student-athletes from D and G city were selected. Of 217 copies of the questionnaire gathered, 35 were discarded owing to having excessive missing values. Thus, by analysing a total of 182 surveys with structural equation modeling through AMOS 20.0 statistics program, this study found the followings. First, social support has a negative influence on stress. Second, stress has a positive influence on deviant behavior. Third, social support has a negative influence on deviant behavior mediated by stress.

Redesigning KNSO s Household Survey Sample (통계청 가구부문 조사의 표본설계)

  • 윤연옥;김규영;이명호
    • Survey Research
    • /
    • v.5 no.1
    • /
    • pp.103-130
    • /
    • 2004
  • Main monthly household surveys conducted by Korea National Statistical Office are economically active population survey(EAPS) and household income and expenditure survey(HIES). Samples of these two surveys are redesigned every 5 years based on Census. This paper is about sample redesign of household survey conducted in 2002 based on 2000 Census. Main improvements of 2002 sample redesign are the introduction of rotation sampling system, the expansion of HIES survey area from urban to whole country and the foundation of basement to make small area estimation for the unemployment statistics. Also the number of sample households within a enumeration district(ED) is reduced from 24 to 20. That makes it possible to select more ED samples which provides better precision for EAPS and HIES. To select representative samples for the population, different classification index is used for each metropolitan area and provinces.

  • PDF

Dynamic recomposition of document category using user intention tree (사용자 의도 트리를 사용한 동적 카테고리 재구성)

  • Kim, Hyo-Lae;Jang, Young-Cheol;Lee, Chang-Hoon
    • The KIPS Transactions:PartB
    • /
    • v.8B no.6
    • /
    • pp.657-668
    • /
    • 2001
  • It is difficult that web documents are classified with exact user intention because existing document classification systems are based on word frequency number using single keyword. To improve this defect, first, we use keyword, a query, domain knowledge. Like explanation based learning, first, query is analyzed with knowledge based information and then structured user intention information is extracted. We use this intention tree in the course of existing word frequency number based document classification as user information and constraints. Thus, we can classify web documents with more exact user intention. In classifying document, structured user intention information is helpful to keep more documents and information which can be lost in the system using single keyword information. Our hybrid approach integrating user intention information with existing statistics and probability method is more efficient to decide direction and range of document category than existing word frequency approach.

  • PDF

Bandwidth selections based on cross-validation for estimation of a discontinuity point in density (교차타당성을 이용한 확률밀도함수의 불연속점 추정의 띠폭 선택)

  • Huh, Jib
    • Journal of the Korean Data and Information Science Society
    • /
    • v.23 no.4
    • /
    • pp.765-775
    • /
    • 2012
  • The cross-validation is a popular method to select bandwidth in all types of kernel estimation. The maximum likelihood cross-validation, the least squares cross-validation and biased cross-validation have been proposed for bandwidth selection in kernel density estimation. In the case that the probability density function has a discontinuity point, Huh (2012) proposed a method of bandwidth selection using the maximum likelihood cross-validation. In this paper, two forms of cross-validation with the one-sided kernel function are proposed for bandwidth selection to estimate the location and jump size of the discontinuity point of density. These methods are motivated by the least squares cross-validation and the biased cross-validation. By simulated examples, the finite sample performances of two proposed methods with the one of Huh (2012) are compared.

Weighted Least Squares Based on Feature Transformation using Distance Computation for Binary Classification (이진 분류를 위하여 거리계산을 이용한 특징 변환 기반의 가중된 최소 자승법)

  • Jang, Se-In;Park, Choong-Shik
    • Journal of the Korea Institute of Information and Communication Engineering
    • /
    • v.24 no.2
    • /
    • pp.219-224
    • /
    • 2020
  • Binary classification has been broadly investigated in machine learning. In addition, binary classification can be easily extended to multi class problems. To successfully utilize machine learning methods for classification tasks, preprocessing and feature extraction steps are essential. These are important steps to improve their classification performances. In this paper, we propose a new learning method based on weighted least squares. In the weighted least squares, designing weights has a significant role. Due to this necessity, we also propose a new technique to obtain weights that can achieve feature transformation. Based on this weighting technique, we also propose a method to combine the learning and feature extraction processes together to perform both processes simultaneously in one step. The proposed method shows the promising performance on five UCI machine learning data sets.

Seismic Fragility Analysis of Reinforced Concrete Bridge Piers According to Damage State (철근콘크리트 교량 교각의 손상상태에 따른 지진취약도 해석)

  • Jeon, Jeong Moon;Shin, Jae Kwan;Shim, Jae Yeob;Lee, Do Hyung
    • KSCE Journal of Civil and Environmental Engineering Research
    • /
    • v.34 no.6
    • /
    • pp.1695-1705
    • /
    • 2014
  • In the present study, a total of 275 tested specimens (149 of non-seismically designed and 126 of seismically designed) for reinforced concrete bridge piers with circular section have been investigated in order to suggest drift limits probabilistically according to damage states in seismic fragility analysis. Thus, quantitative damage states of the piers have been evaluated depending on details of the piers. Nonlinear time-history analyses have been conducted for a damaged bridge in terms of using the suggested drift limits. Then, seismic fragility analysis for a reinforced concrete bridge structure has been conducted using both suggested and existing drift limits. Comparative analyses have revealed that median values by the suggested limits is smaller than those by the existing limits. This implies that seismic performance of the structure can be overestimated when the existing limits are used.

A-priori Comparative Assessment of the Performance of Adjustment Models for Estimation of the Surface Parameters against Modeling Factors (표면 파라미터 계산시 모델링 인자에 따른 조정계산 추정 성능의 사전 비교분석)

  • Seo, Su-Young
    • Spatial Information Research
    • /
    • v.19 no.2
    • /
    • pp.29-36
    • /
    • 2011
  • This study performed quantitative assessment of the performance of adjustment models by a-priori analysis of the statistics of the surface parameter estimates against modeling factors. Lidar, airborne imagery, and SAR imagery have been used to acquire the earth surface elevation, where the shape properties of the surface need to be determined through neighboring observations around target location. In this study, parameters which are selected to be estimated are elevation, slope, second order coefficient. In this study, several factors which are needed to be specified to compose adjustment models are classified into three types: mathematical functions, kernel sizes, and weighting types. Accordingly, a-priori standard deviations of the parameters are computed for varying adjustment models. Then their corresponding confidence regions for both the standard deviation of the estimate and the estimate itself are calculated in association with probability distributions. Thereafter, the resulting confidence regions are compared to each other against the factors constituting the adjustment models and the quantitative performance of adjustment models are ascertained.