• Title/Abstract/Keywords: small data

Search results: 10,764 items (processing time: 0.031 seconds)

Bayesian pooling for contingency tables from small areas

  • Jo, Aejung;Kim, Dal Ho
    • Journal of the Korean Data and Information Science Society
    • /
    • Vol. 27, No. 6
    • /
    • pp.1621-1629
    • /
    • 2016
  • This paper studies Bayesian pooling for the analysis of categorical data from small areas. Many surveys consist of categorical data collected on a contingency table in each area. Statistical inference for small areas requires considerable care because the subpopulation sample sizes are usually very small. Typically, a hierarchical Bayesian model is used to pool subpopulation data. However, the customary hierarchical Bayesian models may specify more exchangeability than is warranted. We therefore investigate the effects of pooling in hierarchical Bayesian modeling for contingency tables from small areas. Specifically, this paper focuses on methods of direct or indirect pooling of categorical data collected on a contingency table in each area through Dirichlet priors. We compare the pooling effects of hierarchical Bayesian models by fitting them to simulated data. The analysis is carried out using Markov chain Monte Carlo methods.
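As a rough illustration of what pooling through a Dirichlet prior means, the sketch below uses the standard conjugate Dirichlet-multinomial update rather than the hierarchical MCMC models of the paper. The function name `pooled_posterior_mean`, the prior strength `tau`, and the choice of the pooled proportions as the prior mean are assumptions made for this example only.

```python
def pooled_posterior_mean(area_counts, tau=10.0):
    """Posterior mean cell proportions per area under a Dirichlet(tau * mu) prior.

    area_counts: list of per-area lists of cell counts (flattened contingency tables).
    tau: hypothetical prior strength; larger values pull areas harder toward mu.
    mu is taken to be the cell proportions pooled over all areas (an assumption).
    """
    n_cells = len(area_counts[0])
    # Pooled cell totals across all areas define the prior mean mu.
    totals = [sum(row[k] for row in area_counts) for k in range(n_cells)]
    grand_total = sum(totals)
    mu = [t / grand_total for t in totals]

    estimates = []
    for row in area_counts:
        n = sum(row)
        # Conjugate update: (count + tau * mu_k) / (n + tau) for each cell k.
        estimates.append([(row[k] + tau * mu[k]) / (n + tau) for k in range(n_cells)])
    return estimates


# Example: one very small area and one large area, each with four cells.
counts = [[3, 1, 0, 1], [40, 30, 20, 10]]
shrunk = pooled_posterior_mean(counts, tau=10.0)
```

In this sketch the small area's empty cell receives a positive estimate borrowed from the pooled table, while the large area's estimates stay close to its own observed proportions.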

End-to-End Delay Analysis of a Dynamic Mobile Data Traffic Offload Scheme using Small-cells in HetNets

  • 김세진
    • Journal of Internet Computing and Services
    • /
    • Vol. 22, No. 5
    • /
    • pp.9-16
    • /
    • 2021
  • The traffic volume of mobile communications has recently been increasing rapidly, and the small-cell is one of the solutions: it uses two offload schemes, local IP access (LIPA) and selected IP traffic offload (SIPTO), to reduce the end-to-end delay and the amount of mobile data traffic in the core network (CN). However, 3GPP describes only the concept of LIPA and SIPTO, and there is no algorithm for deciding the path from source nodes (SNs) to destination nodes (DNs). Therefore, this paper proposes a dynamic mobile data traffic offload scheme using small-cells that decides the path based on the SN and DN (macro user equipment, small-cell user equipment (SUE), or multimedia server) and on the type of mobile data traffic (real-time or non-real-time). Analytical models show that the proposed offload scheme outperforms the conventional small-cell network in terms of the end-to-end delay of mobile data communications and the probability of mobile data traffic in the CN for heterogeneous networks.

Overview of Reliability Rank Measures for Small Sample

  • 최성운
    • Journal of the Korea Safety Management and Science
    • /
    • Vol. 9, No. 2
    • /
    • pp.161-169
    • /
    • 2007
  • This paper presents three methods for expressing reliability measures for large and small data. The first method is parametric estimation of cardinal reliability measure data for large samples, which requires a large number of samples. The second is nonparametric distribution classification of ordinal reliability measure data for small samples; however, this method is difficult for field users to understand. The last method is parametric estimation of ordinal reliability measure data for small data. Because this method requires only a small sample and is comprehensible, we recommend it among the proposed methods. Various reliability rank measures are presented.

Small Domain Estimation of the Proportion Using Survey Weights

  • Kim, Dal-Ho
    • Journal of the Korean Data and Information Science Society
    • /
    • Vol. 18, No. 4
    • /
    • pp.1179-1189
    • /
    • 2007
  • In this paper, we estimate the proportion of individuals having health insurance in a given year for several small domains cross-classified by age, sex, and other demographic characteristics, using data provided by the National Center for Health Statistics (NCHS). We employ Bayesian as well as frequentist methodology to obtain small domain estimates and the associated measures of precision. One of the new features of our study is that we use the survey weights along with the model to derive the small domain estimates.

Derivation and Validation of Aerodynamic Parameters of Small Airplanes Using Design Software and Subjective Tests

  • 이숙경;공지영;최유환;윤석준
    • The Korea Society for Simulation: Conference Proceedings
    • /
    • Proceedings of the Korea Society for Simulation 2004 Spring Conference
    • /
    • pp.142-147
    • /
    • 2004
  • It is very difficult to acquire high-fidelity flight test data for small airplanes such as typical unmanned aerial vehicles, because the small MEMS-type sensors used in the tests generally do not provide reliable data. Moreover, it is not practical to conduct expensive flight tests for low-cost small airplanes merely to simulate their flight characteristics. This study proposes a practical approach to obtaining acceptable flight data, including stability and control derivatives and weight and balance data. Aircraft design software such as Darcorp's AAA is used to generate aerodynamic data for small airplanes, and moments of inertia are calculated using CATIA, a structural design software package. The flight data from the simulation software are evaluated subjectively and tailored through simulated flights by experienced pilots, based on the certified procedures in FAA AC 120-45A and 40B, which are used for manned airplane simulators.

Designing an Automated Production Information Platform for Small and Medium-sized Businesses

  • 정윤수;김용태;박길철
    • Journal of Convergence for Information Technology
    • /
    • Vol. 9, No. 1
    • /
    • pp.116-122
    • /
    • 2019
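  • To become globally competitive, small and medium-sized enterprises (SMEs) have recently been shifting rapidly toward an industrial structure in which process, quality, and energy data are aggregated automatically or in real time. In particular, the analysis of real-time information generated in SME production processes is evolving into a new form of process that analyzes, predicts, prescribes, and implements meaningful outcomes for SMEs. In this paper, we propose a platform construction model that turns the automated production information systems of SMEs into big data so that the data produced by SMEs can be enhanced. To support smart data collection, the proposed model uses a variety of data on the basic information of products manufactured by SMEs to support operational efficiency (consulting, training, etc.) and strategic decision-making. In addition, the proposed model provides smooth information sharing and system integration, enabling close cooperation among SMEs with different regional characteristics and fields.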

A Bayesian model for two-way contingency tables with nonignorable nonresponse from small areas

  • Woo, Namkyo;Kim, Dal Ho
    • Journal of the Korean Data and Information Science Society
    • /
    • Vol. 27, No. 1
    • /
    • pp.245-254
    • /
    • 2016
  • Many surveys provide categorical data, and there may be one or more missing categories. We describe a nonignorable nonresponse model for the analysis of two-way contingency tables from small areas. There are both item and unit nonresponse. One approach to analyzing these data is to construct several tables corresponding to the missing categories. We describe a hierarchical Bayesian model to analyze two-way categorical data from different areas. This allows a "borrowing of strength" from the data of larger areas to improve the reliability of the estimates of the model parameters corresponding to the small areas. We also use a nonignorable nonresponse model with Bayesian uncertainty analysis, placing priors on the nonidentifiable parameters instead of performing a sensitivity analysis for them. We use the griddy Gibbs sampler to fit our models and compute DIC and BPP for model diagnostics. We illustrate our method using NHANES III data on thirteen states to obtain the finite population proportions.

Estimating small area proportions with kernel logistic regressions models

  • Shim, Jooyong;Hwang, Changha
    • Journal of the Korean Data and Information Science Society
    • /
    • Vol. 25, No. 4
    • /
    • pp.941-949
    • /
    • 2014
  • The unit-level logistic regression model with mixed effects has been used for estimating small area proportions; it treats the spatial effects as random effects and assumes linearity between the logistic link and the covariates. However, when the functional form of the relationship between the logistic link and the covariates is not linear, this model may lead to biased estimators of the small area proportions. In this paper, we relax the linearity assumption and propose two types of kernel-based logistic regression models for estimating small area proportions. We also demonstrate the efficiency of our proposed models using simulated data and real data.

A Study on Korean Sentiment Analysis Rate Using Neural Network and Ensemble Combination

  • Sim, YuJeong;Moon, Seok-Jae;Lee, Jong-Youg
    • International Journal of Advanced Culture Technology
    • /
    • Vol. 9, No. 4
    • /
    • pp.268-273
    • /
    • 2021
  • In this paper, we propose a sentiment analysis model that improves performance on small-scale data and verify it through experiments. To this end, we propose Bagging-Bi-GRU, which combines Bi-GRU with the bagging technique, one of the ensemble learning methods. Bi-GRU learns in both directions with the GRU, a variant of LSTM (Long Short-Term Memory), which performs well on sequential data. To verify the performance of the proposed model, we apply it to both small-scale and large-scale data. A comparison with an existing machine learning algorithm, Bi-GRU, shows that the proposed model improves performance not only on small data but also on large data.

Small Area Estimation of Unemployment Rate for the Economically Active Population Survey

  • Kim, Young-Won;Jo, Ran
    • Journal of the Korean Data and Information Science Society
    • /
    • Vol. 15, No. 1
    • /
    • pp.1-10
    • /
    • 2004
  • In the Korean Economically Active Population Survey (EAPS), the sample sizes for small areas are typically too small to provide reliable estimators, because the EAPS was designed to produce unemployment statistics for large areas such as Metropolitan Cities and Provinces. In this study, we consider the synthetic and composite estimators for the unemployment rate of small areas and apply them to real data on Choongbook province from the Korean EAPS of December 2000. The mean square errors of these estimators were estimated by the Jackknife method, and the efficiencies of the small area estimators were evaluated in terms of the relative standard errors and the relative root mean square errors. The results show that the composite estimator is much more efficient than the other estimators and that it can produce reliable estimates of the unemployment rate of small areas under the current EAPS system.
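The abstract does not give the estimator formulas, so the sketch below uses common textbook forms as an assumption: a synthetic estimate applies large-area rates by demographic group to the small area's population composition, and a composite estimate is a weighted average of the direct and synthetic estimates. The function names, group labels, rates, and weight are hypothetical values chosen only for illustration.

```python
def synthetic_estimate(group_population, large_area_rates):
    """Population-weighted average of large-area group rates for one small area."""
    total = sum(group_population.values())
    weighted = sum(group_population[g] * large_area_rates[g] for g in group_population)
    return weighted / total


def composite_estimate(direct, synthetic, weight):
    """Weighted average: weight on the direct estimate, (1 - weight) on the synthetic one."""
    if not 0.0 <= weight <= 1.0:
        raise ValueError("weight must lie in [0, 1]")
    return weight * direct + (1.0 - weight) * synthetic


# Hypothetical small area: population by group and large-area unemployment rates.
population = {"male": 600, "female": 400}
rates = {"male": 0.04, "female": 0.03}

synthetic = synthetic_estimate(population, rates)                      # about 0.036
composite = composite_estimate(direct=0.06, synthetic=synthetic, weight=0.25)
```

A small weight leans on the stable but possibly biased synthetic estimate, while a weight near 1 leans on the unbiased but noisy direct estimate; how that weight is chosen in the paper is not stated in the abstract.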
