• 제목/요약/키워드: Statistics data

검색결과 13,789건 처리시간 0.035초

Zooming Statistics: Inference across scales

  • Hannig, Jan;Marron, J.S.;Riedi, R.H.
    • Journal of the Korean Statistical Society
    • /
    • 제30권2호
    • /
    • pp.327-345
    • /
    • 2001
  • New statistical methods are ended to analyzed data in a multi-scale way. Some multi-scale extensions of stand methods, including novel visualization using dynamic graphics are proposed. These tools are used to explore non-standard structure in internet traffic data.

  • PDF

Jackknife Estimation for Mean in Exponential Model with Grouped and Censored Data

  • Kil Ho Cho;Yong Ku Kim;Seong Kwa Jeong
    • Communications for Statistical Applications and Methods
    • /
    • 제5권3호
    • /
    • pp.869-878
    • /
    • 1998
  • In this paper, we propose some jackknife estimators for mean in the exponential model with grouped and censored data. Also, we compare the proposed jackknife estimators to other approximate estimators in terms of the mean square error and bias.

  • PDF

Test for the Presence of Seasonality in Time Series Models

  • 이성덕
    • Journal of the Korean Data and Information Science Society
    • /
    • 제12권1호
    • /
    • pp.71-78
    • /
    • 2001
  • Three test statistics are proposed for the presence of seasonality in multiplicative seasonal time series models. Further their common limiting distribution is derived under some assumptions.

  • PDF

The Teaching of Statistics using Excel VBA

  • Choi, Hyun-Seok
    • Journal of the Korean Data and Information Science Society
    • /
    • 제17권3호
    • /
    • pp.811-820
    • /
    • 2006
  • We introduce a program that enhances the interest and understanding of students in Statistics. This program explains various statistical concepts and procedures by showing detailed steps of calculations with graphs and simulations. This program utilizes a readily accessible Excel VBA.

  • PDF

유용성과 노출 위험성 지표를 이용한 재현자료 기법 비교 연구 (A comparison of synthetic data approaches using utility and disclosure risk measures)

  • 안성빈;트랑 도안;이주희;김지우;김용재;김윤지;윤창원;정성규;김동하;권성훈;김항준;안정연;박철우
    • 응용통계연구
    • /
    • 제36권2호
    • /
    • pp.141-166
    • /
    • 2023
  • 재현자료를 생성하여 배포하는 것은 데이터 공개에 따른 정보 유출의 위험을 방지하는 대표적인 방법이다. 최근 산업에서 데이터의 활용이 중요해진 만큼 한국을 포함한 많은 국가 및 기관에서 재현자료에 관한 연구가 활발히 진행되고 있다. 본 논문에서는 대표적인 재현자료 생성 기법들과 평가 지표들을 소개한다. 전통적인 재현자료 생성 방법인 다중대체와 최근 제시된 인공신경망 기반의 재현자료 생성 방법 등을 활용하여 재현자료를 생성하는 과정을 기술함에 따라 재현자료 생성 방법에 대한 전반적인 이해를 돕는다. 이에 더해 다양한 재현자료 평가 지표를 바탕으로 생성된 재현자료들을 분석 및 비교함에 따라 앞으로의 연구에 대한 방향을 제시하고 그에 대한 토대를 마련하고자 한다.

Cox 비례위험모형을 따르는 중도절단자료 생성 (Generating censored data from Cox proportional hazards models)

  • 김지현;김봉성
    • 응용통계연구
    • /
    • 제31권6호
    • /
    • pp.761-769
    • /
    • 2018
  • 통계학 연구에 모의실험이 중요하게 쓰이며 중도절단자료를 다루는 생존분석에서도 마찬가지다. 생존분석에서 Cox 모형이 널리 쓰이는데, Cox 모형을 따르는 중도절단자료를 생성하는 방법에 대해 살펴보았다. Bender 등 (Statistics in Medicine, 24, 1713-1723, 2005)은 생존시간을 생성하는 모수적 방법을 제시하였으나 생존시간뿐만 아니라 중도절단시간도 생성해야 중도절단자료를 얻게 된다. 중도절단자료를 생성하기 위한 모수적 방법과 함께 비모수적 방법도 제시하였으며 실제 자료에도 적용해 보았다.

Investigations into Coarsening Continuous Variables

  • Jeong, Dong-Myeong;Kim, Jay-J.
    • 응용통계연구
    • /
    • 제23권2호
    • /
    • pp.325-333
    • /
    • 2010
  • Protection against disclosure of survey respondents' identifiable and/or sensitive information is a prerequisite for statistical agencies that release microdata files from their sample surveys. Coarsening is one of popular methods for protecting the confidentiality of the data. Grouped data can be released in the form of microdata or tabular data. Instead of releasing the data in a tabular form only, having microdata available to the public with interval codes with their representative values greatly enhances the utility of the data. It allows the researchers to compute covariance between the variables and build statistical models or to run a variety of statistical tests on the data. It may be conjectured that the variance of the interval data is lower that of the ungrouped data in the sense that the coarsened data do not have the within interval variance. This conjecture will be investigated using the uniform and triangular distributions. Traditionally, midpoint is used to represent all the values in an interval. This approach implicitly assumes that the data is uniformly distributed within each interval. However, this assumption may not hold, especially in the last interval of the economic data. In this paper, we will use three distributional assumptions - uniform, Pareto and lognormal distribution - in the last interval and use either midpoint or median for other intervals for wage and food costs of the Statistics Korea's 2006 Household Income and Expenditure Survey(HIES) data and compare these approaches in terms of the first two moments.

Statistical Properties of News Coverage Data

  • Lim, Eunju;Hahn, Kyu S.;Lim, Johan;Kim, Myungsuk;Park, Jeongyeon;Yoon, Jihee
    • Communications for Statistical Applications and Methods
    • /
    • 제19권6호
    • /
    • pp.771-780
    • /
    • 2012
  • In the current analysis, we examine news coverage data widely used in media studies. News coverage data is usually time series data to capture the volume or the tone of the news media's coverage of a topic. We first describe the distributional properties of autoregressive conditionally heteroscadestic(ARCH) effects and compare two major American newspaper's coverage of U.S.-North Korea relations. Subsequently, we propose a change point detection model and apply it to the detection of major change points in the tone of American newspaper coverage of U.S.-North Korea relations.

Testing and Adjustment for Inhomogeneity Temperature Series Using the SNHT Method

  • Lee, Yung-Seop;Kim, Hee-Kyung;Lee, Jung-In;Lee, Jae-Won;Kim, Hee-Soo
    • 응용통계연구
    • /
    • 제25권6호
    • /
    • pp.977-985
    • /
    • 2012
  • Data quality and climate forecasting performance deteriorates because of long climate data contaminated by non-climatic factors such as the station relocation or new instrument replacement. For a trusted climate forecast, it is necessary to implement data quality control and test inhomogeneous data. Before the inhomogeneity test, a reference series was created by $d$ index to measure the temperature series relationship between the candidate and surrounding stations. In this study, a inhomogeneity test to each season and climatological station was performed on the daily mean temperatures, daily minimum temperatures and daily maximum temperatures. After comparing two inhomogeneity tests, the traditional and the adjusted SNHT method, we found the adjusted SNHT method was slightly superior to the traditional one.