• Title/Summary/Keyword: Population data estimation

Search Result 385, Processing Time 0.037 seconds

Estimation of Product Reliability with Incomplete Field Warranty Data (불완전한 사용현장 보증 데이터를 이용한 제품 신뢰도 추정)

  • Lim, Tae-Jin
    • Journal of Korean Institute of Industrial Engineers
    • /
    • v.28 no.4
    • /
    • pp.368-378
    • /
    • 2002
  • As more companies are equipped with data aquisition systems for their products, huge amount of field warranty data has been accumulated. We focus on the case when the field data for a given product comprise with the number of sales and the number of the first failures for each period. The number of censored items and their ages are assumed to be given. This type of data are incomplete in the sense that the age of a failed item is unknown. We construct a model for this type of data and propose an algorithm for nonparametric maximum likelihood estimation of the product reliability. Unlike the nonhomogeneous Poisson process(NHPP) model, our method can handle the data with censored items as well as those with small population. A few examples are investigated to characterize our model, and a real field warranty data set is analyzed by the method.

A Validation of Estimating the National Cancer Incidence in Korea using the Databases of 7 Population-based Regional Cancer Registries except Seoul (서울을 제외한 7개 지역암등록본부 자료를 활용한 국가 암통계 추정의 타당성)

  • Bae, Jong-Myeon
    • Journal of Preventive Medicine and Public Health
    • /
    • v.42 no.2
    • /
    • pp.130-134
    • /
    • 2009
  • Objectives : A method of estimation using 8 populationbased cancer registries databases in Korea(KRCR DB) has been introduced as another strategy for validly estimating the national cancer incidence(NCI) in Korea. The purpose of this study was to evaluate the validity of this method with using the 7 KRCR DBs, excluding Seoul covering 21% of the total population of Korea. Methods : We designed the study method(NCSE_7) as same as the estimating method with using 8 KRCR DBs (NCSE_8) in order to ensure maximal comparability. We defined the expected number of cancer cases in each registry as the number of observed cases and then we added the weighted observed cases according to gender, age and the proportion of the population covered by each registry for the population of the seven regions and the population of all areas, with excluding these seven regions. From the expected number of total cancer incidents, the estimated NCI was calculated by dividing the expected number of cancer cases by the number of the total population. The standard error(SE) of the estimated incidence was also taken from the expected number of total cancer incidents. Results : Compared with the results of the NCSE_8, the overall age-standardized rates(ASR) in men and women became over-estimated and under-estimated, respectively. Primary sites that showed statistically significant differences were the colo-rectum, prostate, breast and thyroid. The index of death certificate only(DCO)and microscopically verified(MV)% indicating levels of data quality were decreased, especially for the brain in DCO% and kidney in the MV%. Conclusions : The database of Seoul regional cancer registry has a key role for the method to estimate the valid nationwide cancer statistics in Korea with using the population-based cancer registries databases.

A Comparison of NLSY and CPS Data

  • Jo, Yoon-Ae
    • Journal of the Korean Data and Information Science Society
    • /
    • v.17 no.3
    • /
    • pp.851-859
    • /
    • 2006
  • The family income distributions of NLSY97 and CPS youth data are compared by using the generalized beta distribution of the second kind. The null hypothesis that the two data sets represent the same underlying population is rejected. The ML estimation suggests that NLSY97 data are oversampled in an income group of $11,308 or less, by about 15.7% compared to CPS data.

  • PDF

Estimation of Population Mean Using Centered Modified Systematic Sampling and Interpolation

  • Kim, Hyuk-Joo;Choi, Byoung-Chul
    • 한국데이터정보과학회:학술대회논문집
    • /
    • 2001.10a
    • /
    • pp.17-24
    • /
    • 2001
  • A method is proposed for efficiently estimating the mean of a population which has a linear trend. The proposed estimator is based on the centered modified systematic sampling method and the concept or interpolation. Using the expected mean square error criterion, it is shown that the proposed method is more efficient than conventional methods in most real cases.

  • PDF

Study of Sample Design for Improving the Method of Distribution of Singing Rooms Royalty (노래연습장 저작권사용료 분배방법 개선을 위한 표본설계연구)

  • Lee, Kay-O;Choi, Jung-Bai
    • Communications for Statistical Applications and Methods
    • /
    • v.16 no.5
    • /
    • pp.763-779
    • /
    • 2009
  • Sample design is studied to estimate the number of records from singing rooms. The definition of survey population, stratification, sample size and allocation of sample are discussed with using the collected real data. The validity of sample design is testified by the log data collected from online singing room which is sampled with suggested design. Estimation of the number of hitting of off-line singing room is proposed and calculated from the log data collected from sampled off-line singing rooms. The difference of online and off-line singing rooms is analyzed and the method of distribution of singing room royalty is suggested.

Assessment of Adequacy of Urban Water Supply (도시 상수도 공급량 산정의 적정성 평가)

  • Kim, Jang Jin;Chang, Hyung Joon;Lee, Ho Jin
    • Journal of Korean Society of Disaster and Security
    • /
    • v.14 no.4
    • /
    • pp.61-67
    • /
    • 2021
  • In this study, the adequacy of water supply critical factors of urban design was examined. The supply of water supply is analyzed in various ways as the design progresses. Starting with basic data collection and analysis a supply and demand plan is established to calculate the amount of water supply and in this study the adequacy of population estimation and original unit calculation was evaluated. Among the second new cities where actual data can be secured Wirye New Town was selected as the study target area. Related data were analyzed to confirm the future population and the original unit and compared with the measured data. As of September 2020, the population of Wirye New Town was 93,977, showing the appropriateness of about 84% with a planned population of 110,990 confirming that the planned population and the actual population were almost similar. In the case of the original unit, it was calculated as 314 liters per person in Seoul and 320 liters per person in Seongnam at the time of design. As a result, it was found that there was some agreement in the population estimation while examining the supply in the planned city. In the case of Korea, there is a lot of interest in revitalizing the existing city, away from continuous development. Therefore it is judged that there is a need for further research on the adequacy of supply for the old city center.

Comparison of Three Optimization Methods Using Korean Population Data

  • Oh, Deok-Kyo
    • Korean System Dynamics Review
    • /
    • v.13 no.2
    • /
    • pp.47-71
    • /
    • 2012
  • The purpose of this research is the examination of validity of data as well as simulation model, i.e. to simulate the real data in the SD model with the least error using the adjustments for the faithful reflection of real data to the simulation. In general, SD programs (e.g. VENSIM) utilize the Euler or Runge-Kutta method as an algorithm. It is possible to reflect the trend of real data via these two estimation methods however can cause the validity problem in case of the simulation requiring the accuracy as they have endogenous errors. In this article, the future population estimated by the Korea National Statistical Office (KNSO) to 2050 is simulated by the aging chain model, dividing the population into three cohorts, 0-14, 15-64, 65 and over cohorts by age and offering the adjustments to them. Adjustments are calculated by optimization with three different methods, optimization in EXCEL, manual optimization with iterative calculation, and optimization in VENSIM DSS, the results are compared, and at last the optimal adjustment set with the least error are found among them. The simulation results with the pre-determined optimal adjustment set are validated by methods proposed by Barlas (1996) and other alternative methods. It is concluded that the result of simulation model in this research has no significant difference from the real data and reflects the real trend faithfully.

  • PDF

A Post-stratified Estimation in Multivariate Stratified Sampling Surveys

  • Park, Jinwoo
    • Communications for Statistical Applications and Methods
    • /
    • v.6 no.3
    • /
    • pp.755-760
    • /
    • 1999
  • In multivariate stratified sampling surveys it is general to use a few stratification variables which are highly correlated with the important variables at design stage. But there might be some secondary study variables which are not so highly correlated with those stratification variables. In that case it is not efficient to use the same type of estimator due to the secondary variables as the one base on the important variables. A post-stratified estimation is proposed to increase the efficiency of the estimator with existence of secondary variables. The proposed method is illustrated with a set of fishery household population survey data.

  • PDF

Comparative assessment of the effective population size and linkage disequilibrium of Karan Fries cattle revealed viable population dynamics

  • Shivam Bhardwaj;Oshin Togla;Shabahat Mumtaz;Nistha Yadav;Jigyasha Tiwari;Lal Muansangi;Satish Kumar Illa;Yaser Mushtaq Wani;Sabyasachi Mukherjee;Anupama Mukherjee
    • Animal Bioscience
    • /
    • v.37 no.5
    • /
    • pp.795-806
    • /
    • 2024
  • Objective: Karan Fries (KF), a high-producing composite cattle was developed through crossing indicine Tharparkar cows with taurine bulls (Holstein Friesian, Brown Swiss, and Jersey), to increase the milk yield across India. This composite cattle population must maintain sufficient genetic diversity for long-term development and breed improvement in the coming years. The level of linkage disequilibrium (LD) measures the influence of population genetic forces on the genomic structure and provides insights into the evolutionary history of populations, while the decay of LD is important in understanding the limits of genome-wide association studies for a population. Effective population size (Ne) which is genomically based on LD accumulated over the course of previous generations, is a valuable tool for e valuation of the genetic diversity and level of inbreeding. The present study was undertaken to understand KF population dynamics through the estimation of Ne and LD for the long-term sustainability of these breeds. Methods: The present study included 96 KF samples genotyped using Illumina HDBovine array to estimate the effective population and examine the LD pattern. The genotype data were also obtained for other crossbreds (Santa Gertrudis, Brangus, and Beefmaster) and Holstein Friesian cattle for comparison purposes. Results: The average LD between single nucleotide polymorphisms (SNPs) was r2 = 0.13 in the present study. LD decay (r2 = 0.2) was observed at 40 kb inter-marker distance, indicating a panel with 62,765 SNPs was sufficient for genomic breeding value estimation in KF cattle. The pedigree-based Ne of KF was determined to be 78, while the Ne estimates obtained using LD-based methods were 52 (SNeP) and 219 (genetic optimization for Ne estimation), respectively. Conclusion: KF cattle have an Ne exceeding the FAO's minimum recommended level of 50, which was desirable. The study also revealed significant population dynamics of KF cattle and increased our understanding of devising suitable breeding strategies for long-term sustainable development.

Implementation of Efficient Weather Forecasting Model Using the Selecting Concentration Learning of Neural Network (신경망의 선별학습 집중화를 이용한 효율적 온도변화예측모델 구현)

  • 이기준;강경아;정채영
    • The Journal of Korean Institute of Communications and Information Sciences
    • /
    • v.25 no.6B
    • /
    • pp.1120-1126
    • /
    • 2000
  • Recently, in order to analyze the time series problems that occur in the nature word, and analyzing method using a neural electric network is being studied more than a typical statistical analysis method. A neural electric network has a generalization performance that is possible to estimate and analyze about non-learning data through the learning of a population. In this paper, after collecting weather datum that was collected from 1987 to 1996 and learning a population established, it suggests the weather forecasting system for an estimation and analysis the future weather. The suggested weather forecasting system uses 28*30*1 neural network structure, raises the total learning numbers and accuracy letting the selecting concentration learning about the pattern, that is not collected, using the descending epsilon learning method. Also, the weather forecasting system, that is suggested through a comparative experiment of the typical time series analysis method shows more superior than the existing statistical analysis method in the part of future estimation capacity.

  • PDF