• Title/Summary/Keyword: 집락자료

Search Result 108, Processing Time 0.028 seconds

Testing Independence in Contingency Tables with Clustered Data (집락자료의 분할표에서 독립성검정)

  • 정광모;이현영
    • The Korean Journal of Applied Statistics
    • /
    • v.17 no.2
    • /
    • pp.337-346
    • /
    • 2004
  • The Pearson chi-square goodness-of-fit test and the likelihood ratio tests are usually used for testing independence in two-way contingency tables under random sampling. But both of these tests may provide false results for the contingency table with clustered observations. In this case we consider the generalized linear mixed model which includes random effects of clustering in addition to the fixed effects of covariates. Both the heterogeneity between clusters and the dependency within a cluster can be explained via generalized linear mixed model. In this paper we introduce several types of generalized linear mixed model for testing independence in contingency tables with clustered observations. We also discuss the fitting of these models through a real dataset.

A Comparative Study on the Statistical Methodology to Determine the Optimal Aggregation Interval for Travel Time Estimation of the Interrupted Traffic Flow (단속류 통행시간 추정을 위한 적정 집락간격 결정에 관한 통계적 방법론 비교 연구)

  • Lim, Houng-Seok;Lee, Seung-Hwan;Lee, Hyun-Jae
    • Journal of Korean Society of Transportation
    • /
    • v.23 no.3 s.81
    • /
    • pp.109-123
    • /
    • 2005
  • The goals of this paper are two folds: i) to evaluate whether the data collected by a license plate matching AVI equipment being operated on some segment of a national highway are suitable or not for use in travel time estimation of interrupted traffic flows; ii) to study the statistical methodologies to be used for the determination of the optimal aggregation interval for travel time estimation. In this study it was found that the AVI data are not representative because the data are collected on some selected lanes of a roadway where main traffic is thru-traffic and, thus the AVI data are different from those collected from all lanes in traffic characteristics. For the determination of the optimal aggregation interval for travel time estimation. two statistical methods. namely point estimation and interval estimation. were tested. The test shows that the point estimation method is more sensitive and gives more desirable results in determing the optimal aggregation interval than the interval estimation method. And it turned out that the optimal aggregation interval on interrupted traffic flows has been calculated as 5 minute and thus the existing aggregation interval. 5 minute is proper.

A Study on the Optimal Aggregation Interval for Travel Time Estimation on the Rural Arterial Interrupted Traffic flow (지방부 간선도로 단속류 통행시간 추정을 위한 적정 집락간격 결정에 관한 연구)

  • Lim Houng-Seak;Lee Seung-Hwan;Lee Hyun-Jae
    • The Journal of The Korea Institute of Intelligent Transport Systems
    • /
    • v.3 no.2 s.5
    • /
    • pp.129-140
    • /
    • 2004
  • In this paper, we conduct the research about optimal aggregation interval of travel time data on interrupted traffic flow and verify the reliability of AVI collected data by using car plate matching method in RTMS for systematic collection and analysis of link travel time data on interrupted traffic flow rural arterial. We perform Kolmosorov-Smirnov test on AVT collected sample data and on entire population data, and conclude that the sample data does not represent pure random sampling and hence includes sample collection error. We suggest that additional review is necessary to investigate the effectiveness of AVI collected sample data as link representative data. We also develop statistical model by applying two estimation techniques namely point estimation and interval estimation for calculating optimal aggregation interval. We have implemented our model and determine that point estimate is preferable over interval estimate for exactly selecting and deciding optimal aggregation interval. Our final conclusion is that 5-minute aggregation interval is optimal to estimate travel time in RTMS, as is currently being used our investigation is based on AVI data collected from Yang-ji to Yong-in $42^{nd}$ National road.

  • PDF

Choosing clusters for two-stage household surveys (가구조사를 위한 이단추출 표본설계에서의 집락선택)

  • Park, Inho
    • Journal of the Korean Data and Information Science Society
    • /
    • v.27 no.2
    • /
    • pp.363-372
    • /
    • 2016
  • Two-stage sample designs are commonly used for household surveys in Korea using as clusters the enumeration districts (EDs). Since clustering decomposes the population variation into within- and between-cluster variations, the sample sizes allocated in stages can affect the overall precision. Alternative clusters are often considered due to diverse reasons such as the EDs' limitation in size, being out-of-date, and in-assessibility to their household lists. In addition, the EDs are currently under development by the Statistics Korea as an joint effort toward their transition from the traditional practice to the register census from 2015. We present an approach for evaluating the difference in the precision of the mean estimators of the sets of the cluster units in between a hierachical and nested form, where the design effect is used to reflect the effect of the clustering and the sample allocation. We also demonstrate our approach using the U.S. Census counts from the year 2000 for Anne Arundel County in Maryland. Our research shows that the within-cluster variance can be significantly different for survey variables and thus the choice of cluster units and the associated sample allocation scheme should reflect the corresponding variance decomposition due to clustering.

A study on the relation between dissimilarity and hierarchical agglomerative in clust analysis (집락분석법에 있어서 비유사도와 계층적 응집법의 관계에 관한 연구)

  • 조완현
    • The Korean Journal of Applied Statistics
    • /
    • v.5 no.2
    • /
    • pp.211-227
    • /
    • 1992
  • In this paper we consider the definition and mathematical properties of similarity or dissimilarity which have often used in clust analysis, and we apply a hierarchical agglomerative cluster algorithm to a dissimilarity metrx generated by these distance. Here we investigate the effect of relation between distance function and cluster algorithm on the retrieval ability of natural clusters. We present an empirical results for qualitative data as well as quantitative data.

  • PDF

Development of a Forest Inventory System for the Sustainable Forest Management (지속가능한 산림경영에 적합한 표본조사 방법의 개발)

  • Shin, Man Yong;Han, Won Sung
    • Journal of Korean Society of Forest Science
    • /
    • v.95 no.3
    • /
    • pp.370-377
    • /
    • 2006
  • This study was conducted to develop an efficient method of sampling design appropriate for the sustainable forest management. For this, data were collected in Yangpyung-Gun, Gyunggi Province based on three different sampling designs such as systematic design, systematic cluster design, and stratified cluster design. Based on evaluation statistics, the sampling designs were compared to select a sampling method fitted to sustainable forest management. It was found that the systematical cluster sampling is the most efficient sampling method in terms of feasibility for sustainable forest management. It was also recommended that the sample plots should be made as a cluster of triangle-shape. The clusters should be consisted of a main plot and three sub-plots. And the sub-plots should be arranged with a distance of 50m from the main plot in the center of cluster.

Megakaryocyte Colony Formation of Fetal Liver Cells (태아 간세포의 거핵구 집락형성)

  • Kwon, Byung O;Ju, Hye Young;Kim, Chun Soo;Jeon, Dong Seok;Kim, Jong In;Kim, Heung Sik
    • Clinical and Experimental Pediatrics
    • /
    • v.45 no.2
    • /
    • pp.247-255
    • /
    • 2002
  • Purpose : This study was undertaken to obtain basic data about the megakaryocyte colony formation of fetal liver cells by using immunocytochemical staining and ex vivo culture with growth factors. Methods : The mononuclear cells were isolated from fetal liver and bone marrow with idiopathic thrombocytopenic purpura(ITP) and pancytopenia. These mononuclear cells were cultured in $MegaCult^{TM}-C$(Stem Cell Tech, Canada) media in the presence of growth factors and CFU-Megakaryocyte( CFU-Mk) colonies were counted on day 12. The expansion of CD34+ and CD41+ cell was analyzed by flow cytometry after 5 days incubation using flask culture. Results : The numbers of CFU-Mk colonies of mononuclear cells obtained from fetal liver in the 11th week gestational age were more than those in the 19th week specimens; growth factors could not enhance the colony expansion in all cases. Total numbers of CFU-Mk colony of fetal liver cells were higher than bone marrow from ITP or pancytopenia groups. The numbers of pure or large CFU-Mk colonies of fetal liver cells were also higher than bone marrow specimens. The rate of CD34+ cell expression of fetal liver was increased after flask culture and the enhancement effect of epression was seen only in cases which added thrombopoietin. The rate of CD41+ cell expression of fetal liver was increased after incubation, but the enhancement effect of growth factors was unclear. Conclusion : This study revealed good results about the megakaryocyte colony assay of fetal liver mononuclear cells using $MegaCult^{TM}-C$ media. This study suggests that the fetal liver could be a good source of megakaryocytic progenitor cells for clinical application in hematopoietic stem cell transplantation.

A Sampling Design of the Non-consignment Fishery Products (수산물 비계통 생산량 조사를 위한 표본설계 연구)

  • 박진우
    • The Korean Journal of Applied Statistics
    • /
    • v.12 no.1
    • /
    • pp.1-15
    • /
    • 1999
  • 수산물 비계통 생산량 조사는 수산물 생산량 조사 중 어가부분에서 비계통 출하된 양에 대한 표본조사이다. 본연구는 1995년 어업 총조사 자료에 근거하여 새로운 표본 설계를 제안하는 것을 목적으로 한다. 표본은 층화 2단 집락 추출방식에 의해 추출되었으며 층화변수로는 어업조사구 내의 일반어류 어가 비율을 사용하였다.