• Title/Summary/Keyword: Count Data

Search Result 1,116, Processing Time 0.027 seconds

Weighted zero-inflated Poisson mixed model with an application to Medicaid utilization data

  • Lee, Sang Mee;Karrison, Theodore;Nocon, Robert S.;Huang, Elbert
    • Communications for Statistical Applications and Methods
    • /
    • v.25 no.2
    • /
    • pp.173-184
    • /
    • 2018
  • In medical or public health research, it is common to encounter clustered or longitudinal count data that exhibit excess zeros. For example, health care utilization data often have a multi-modal distribution with excess zeroes as well as a multilevel structure where patients are nested within physicians and hospitals. To analyze this type of data, zero-inflated count models with mixed effects have been developed where a count response variable is assumed to be distributed as a mixture of a Poisson or negative binomial and a distribution with a point mass of zeros that include random effects. However, no study has considered a situation where data are also censored due to the finite nature of the observation period or follow-up. In this paper, we present a weighted version of zero-inflated Poisson model with random effects accounting for variable individual follow-up times. We suggested two different types of weight function. The performance of the proposed model is evaluated and compared to a standard zero-inflated mixed model through simulation studies. This approach is then applied to Medicaid data analysis.

A Study on Phon Call Big Data Analytics (전화통화 빅데이터 분석에 관한 연구)

  • Kim, Jeongrae;Jeong, Chanki
    • Journal of Information Technology and Architecture
    • /
    • v.10 no.3
    • /
    • pp.387-397
    • /
    • 2013
  • This paper proposes an approach to big data analytics for phon call data. The analytical models for phon call data is composed of the PVPF (Parallel Variable-length Phrase Finding) algorithm for identifying verbal phrases of natural language and the word count algorithm for measuring the usage frequency of keywords. In the proposed model, we identify words using the PVPF algorithm, and measure the usage frequency of the identified words using word count algorithm in MapReduce. The results can be interpreted from various viewpoints. We design and implement the model based HDFS (Hadoop Distributed File System), verify the proposed approach through a case study of phon call data. So we extract useful results through analysis of keyword correlation and usage frequency.

Design of Zigbee based Portable ECG monitoring system (지그비 기반의 휴대형 심전도 모니터링 시스템 설계)

  • Hong, Joo-Hyun;Kim, Nam-Jin;Cha, Eun-Jong;Lee, Tae-Soo
    • Proceedings of the KIEE Conference
    • /
    • 2006.04a
    • /
    • pp.51-53
    • /
    • 2006
  • This paper proposes a portable ECG monitoring system, which integrates uptodate PDA and RF communication technology. The aim of the study is to acquire the subject's biomedical signal without any constraint. It has two types of transmission mode, which are total signal transmission mode and HR(heart rate)/SC(step count) transmission mode. In audition, wireless communication technology uses Zigbee Wireless PAN and can work in low-power mode, which is one of the advantages of ZiBbee communication technology. The developed system is composed of a transmitter and a receiver. The transmitter has three-axial acceleration sensor. ECG amplifier and Zigbee communication controller. In total signal transmission mode, it can send data 50 packets per second whose transmission speed corresponds to 300 ECG samples and 60 acceleration samples. In HR/SG transmission mode, it can calculate heart rate from EEG data with 216 samples per second and step count from acceleration data and send a packet every cardiac cycle. The receiver forwards the received data to PDA, where the data can be stored and displayed. Therefore, the developed device enables to continuous monitoring for Activities of Daily Living(ADL). Also, this method will reduce medical costs in the aged society.

  • PDF

Statistical Classification of Highway Segments for Improving the Efficiency of Short-term Traffic Count Planning (효율적인 교통량 조사를 계획하기 위한 조사구간의 통계적 특성 분류 연구)

  • Jung, YooSeok;Oh, JuSam
    • International Journal of Highway Engineering
    • /
    • v.18 no.3
    • /
    • pp.109-114
    • /
    • 2016
  • PURPOSES : The demand for extending national highways is increasing, but traffic monitoring is hindered because of resource limitations. Hence, this study classified highway segments into 5 types to improve the efficiency of short-term traffic count planning. METHODS : The traffic volume trends of 880 highway segments were classified through R-squared and linear regression analyses; the steadiness of traffic volume trends was evaluated through coefficient of variance (COV), and the normality of the data were determined through the Shapiro-Wilk W-test. RESULTS : Of the 880 segments, 574 segments had relatively low COV and were classified as type 1 segments, and 123 and 64 segments with increasing and decreasing traffic volume trends were classified as type 2 and type 3 segments, respectively; 80 segments that failed the normality test were classified as type 4, and the remaining 39 were classified as type 5 segments. CONCLUSIONS : A theoretical basis for biennial count planning was established. Biennial count is recommended for types 1~4 because their mean absolute percentage errors (MAPEs) are approximately 10%. For type 5 (MAPE =19.26%), the conventional annual count can be continued. The results of this analysis can reduce the traffic monitoring budget.

Integer-Valued GARCH Models for Count Time Series: Case Study (계수 시계열을 위한 정수값 GARCH 모델링: 사례분석)

  • Yoon, J.E.;Hwang, S.Y.
    • The Korean Journal of Applied Statistics
    • /
    • v.28 no.1
    • /
    • pp.115-122
    • /
    • 2015
  • This article is concerned with count time series taking values in non-negative integers. Along with the first order mean of the count time series, conditional variance (volatility) has recently been paid attention to and therefore various integer-valued GARCH(generalized autoregressive conditional heteroscedasticity) models have been suggested in the last decade. We introduce diverse integer-valued GARCH(INGARCH, for short) processes to count time series and a real data application is illustrated as a case study. In addition, zero inflated INGARCH models are discussed to accommodate zero-inflated count time series.

Trends and Methodological Issues in Spatial Cluster Analysis for Count Data (카운트 데이터 기반 공간 군집 분석 연구의 동향과 방법론적 이슈)

  • Cho, Daeheon
    • Journal of the Korean Geographical Society
    • /
    • v.48 no.5
    • /
    • pp.768-785
    • /
    • 2013
  • Count data aggregated into areal units such as administrative boundaries are the most important sources of information for geographic research. Despite of ongoing research on spatial cluster analysis of count data, it has received relatively little attention and besides, it is difficult to comprehend research trends as well as major outcomes and challenges. This study aims to review the research literature conducted during the last two decades, to examine methodological characteristics, and finally to discuss some issues and challenges. Methods for indentifying spatial clusters have been used in various fields including geography, criminology, and epidemiology. However, their methodological features are not only quite distinct from each other, but there are issues related to the statistical reliability. Therefore, these have to be taken into account carefully when particular methods are used, and further empirical research about methodological issues and the development of analysis tools is needed.

  • PDF

Determinants of the Performance of Government Assistance to R&D Activities

  • Kwak, So-Yoon;Yoo, Seung-Hoon
    • Asian Journal of Innovation and Policy
    • /
    • v.3 no.1
    • /
    • pp.94-116
    • /
    • 2014
  • The technological innovation is considered as an important factor and there is a positive externality in developing technology in the form of technology spillover. In this context, it is argued that government should play an active role in advancing technology development and several means have been introduced. This study attempts to analyze manufacturing firms' evaluation for the performance of government assistance programs to their R&D activities. Considering that the performance evaluation takes the form of a count outcome, we apply several kinds of count data models. Some interesting findings emerge from the analysis. For example, we found that a firm's sales amount, dummy for the firm's having an R&D department, dummy for the firm's being a venture one, and the number of the firm's innovative activities have positive relationships with the degree that the firm evaluates government assistance as being useful.

Analysis of Types of Gather Drape with Visual Evaluation (시각적 평가에 의한 개더 드레이프 형상 분석)

  • Lee Myung-Hee;Jung Hee-Kyeong
    • Journal of the Korea Fashion and Costume Design Association
    • /
    • v.7 no.1
    • /
    • pp.33-40
    • /
    • 2005
  • Gathering is method used to control fullness along a seam line. The purpose of this study was to investigate the relationship between the quantitative research and qualitative method; the effect of gather and the types of gather drape. The experimental design consists of four factors: (l) three kinds of different weight and thickness of fabrics (2) three kinds of stitch densities (3) five kinds of ratio of gathers (4) three kinds of grain directions. Therefore one hundred thirty five (135) samples were made. And utilized SPSS WIN 10.0 Package in data analysis. The results of this study were as follows; First, after frequency analysis, side height, hem line width, node depth, node count, node width accorded with these result data recording. Second, after correlation analysis, side height related with front statements. Side height and entire visual was negative correlation. Hem line width, node depth, node count with section statements was negative correlation but node width at section statements was positive correlation. Third, after $k^2$ analysis, front picture parts getting excellent evaluation were 1st side height, 3rd hem line width, 4th node depth, 3rd node count, 3rd node width. And section illustration parts getting excellent evaluation were 4th side height, 1st hem line width, 2nd node depth, 3rd node count, 4th node width.

  • PDF

Assessing Hematological Change Associated with Cardiovascular Disease Risk among Korean Taxi Drivers Using Data from the Second (2012-2014) Korean National Environmental Health Survey: A Propensity Score Matching Approach (제2기(2012-2014) 국민환경보건 기초조사 자료를 활용한 국내 남성 택시 기사의 심혈관계 위험도 관련 혈액학적 변화에 대한 연구: 성향점수 매칭을 활용하여)

  • Baek, Kiook
    • Journal of Korean Society of Occupational and Environmental Hygiene
    • /
    • v.31 no.4
    • /
    • pp.367-377
    • /
    • 2021
  • Objectives: Taxi drivers are exposed to various hazards, such as long periods of sedentary work and traffic-related air pollutants. However, studies on the health effects among taxi drivers in South Korea are insufficient. Methods: To assess subclinical hematologic change related to cardiovascular disease among male taxi drivers, we analyzed data from the second Korean National Environmental Health Survey. Fifty-nine taxi drivers and 1,912 controls were included in the analysis. Propensity score matching was performed to adjust for age, body mass index, and urinary cotinine. A total of 295 subjects were matched with 59 taxi drivers. Leukocyte count, platelet count, hematocrit, triglyceride, total cholesterol, HDL cholesterol land total IgE of the taxi drivers were compared with the control groups. Results: Taxi drivers showed significantly elevated blood leukocytes and platelets. Serum total IgE was significantly reduced in taxi drivers. However, blood leukocytes, platelets, and serum total IgE were not significantly correlated with work period among taxi drivers. Conclusions: Regarding the change of the blood leukocyte count, platelet count, and serum total IgE, taxi driving has the possibility to be associated with peripheral inflammation, humoral immunity and cardiovascular risk.

Improving Flexibility of External Data Exchange in Count-fire Operation System by Adapting Dynamic Parser Software (동적 구문처리기 소프트웨어 적용을 통한 대화력전 수행체계 연동의 유연성 향상 방안)

  • Hong, Won-Eui
    • Journal of the Korea Institute of Military Science and Technology
    • /
    • v.11 no.1
    • /
    • pp.51-56
    • /
    • 2008
  • The counter-fire operation system performs its mission exchanging information with other related systems such as command & control systems and military information systems. In the process of exchanging information, the counter-fire operation system uses a type of data message which contains exchange data information in the format of KMTF. The requirement of data exchange of count-fire operation will continue to evolve. But the EDX(External Data eXchange) configuration item of the current counter-fire operation system can not effectively cope with the variation of data exchange requirements due to its fixed software structure. In the paper, a solution for improving flexibility of external data exchange in counter-fire operation system is proposed.