• Title/Summary/Keyword: Cluster estimation

Search Result 210, Processing Time 0.032 seconds

On Nonparametric Estimation of Data Edges

  • Park, Byeong U.
    • Journal of the Korean Statistical Society
    • /
    • v.30 no.2
    • /
    • pp.265-280
    • /
    • 2001
  • Estimation of the edge of a distribution has many important applications. It is related to classification, cluster analysis, neural network, and statistical image recovering. The problem also arises in measuring production efficiency in economic systems. Three most promising nonparametric estimators in the existing literature are introduced. Their statistical properties are provided, some of which are new. Themes of future study are also discussed.

  • PDF

Wireless sensor network design for large-scale infrastructures health monitoring with optimal information-lifespan tradeoff

  • Xiao-Han, Hao;Sin-Chi, Kuok;Ka-Veng, Yuen
    • Smart Structures and Systems
    • /
    • v.30 no.6
    • /
    • pp.583-599
    • /
    • 2022
  • In this paper, a multi-objective wireless sensor network configuration optimization method is proposed. The proposed method aims to determine the optimal information and lifespan wireless sensor network for structural health monitoring of large-scale infrastructures. In particular, cluster-based wireless sensor networks with multi-type of sensors are considered. To optimize the lifetime of the wireless sensor network, a cluster-based network optimization algorithm that optimizes the arrangement of cluster heads and base station is developed. On the other hand, based on the Bayesian inference, the uncertainty of the estimated parameters can be quantified. The coefficient of variance of the estimated parameters can be obtained, which is utilized as a holistic measure to evaluate the estimation accuracy of sensor configurations with multi-type of sensors. The proposed method provides the optimal wireless sensor network configuration that satisfies the required estimation accuracy with the longest lifetime. The proposed method is illustrated by designing the optimal wireless sensor network configuration of a cable-stayed bridge and a space truss.

Traffic based Estimation of Optimal Number of Super-peers in Clustered P2P Environments

  • Kim, Ju-Gyun;Lee, Jun-Soo
    • Journal of Korea Multimedia Society
    • /
    • v.11 no.12
    • /
    • pp.1706-1715
    • /
    • 2008
  • In a super-peer based P2P network, the network is clustered and each cluster is managed by a special peer, which is called a super-peer. A Super-peer has information of all the peers in its cluster. This type of clustered P2P model is known to have efficient information search and less traffic load than unclustered P2P model. In this paper, we compute the message traffic cost incurred by peers' query, join and update actions within a cluster as well as between the clusters. With these values, we estimate the optimal number of super-peers that minimizes the traffic cost for the various size of super-peer based P2P networks.

  • PDF

Estimation of PM10 source locations in Busan using PSCF model (PSCF 모델을 활용한 부산지역 PM10의 발생원 추정)

  • Do, Woo-Gon;Jung, Woo-Sik
    • Journal of Environmental Science International
    • /
    • v.24 no.6
    • /
    • pp.793-806
    • /
    • 2015
  • The purpose of this study is to find out the air flow patterns affecting the PM10 concentration in Busan and the potential sources within each trajectory pattern. The synoptic air flow trajectories are classified into four clusters by HYSPLIT model and the potential sources of PM10 are estimated by PSCF model for each cluster from 2008 to 2012. The potential source locations of PM10 are compared with the distribution of PM10 anthropogenic emissions in east Asia developed in 2006 for the NASA INTEX-B mission. The annual mean concentrations of PM10 in Busan decreased from $51ug/m^3$ in 2008 to $43ug/m^3$ in 2012. The monthly mean concentrations of PM10 were high during a spring season, March to May and low during a summer season, August and September. The cluster2 composed of the air trajectories from the eastern China to Busan through the west sea showed the highest frequency, 44 %. The cluster1 composed of the air trajectories from the inner Mongolia region to Busan through the northeast area of China showed the second high frequency, 26 %. The cluster3 and 4 were composed of the trajectories originated in the southeast sea and the east sea of Busan respectively and showed low frequencies. The concentrations of in each cluster were $47ug/m^3$ in cluster1, $56ug/m^3$ in cluster2, $42ug/m^3$ in cluster3 and $37ug/m^3$ in cluster4. From these results, it was proved that the cluster1 and 2 composed of the trajectories originated in the east and northeast area of China were the causes of high PM10 concentrations in Busan. The results of PSCF and CWT model showed that the potential sources of the high PM10 concentrations were the areas of the around Mongolia and the eastern China having high emissions of PM10 from Beijing, Hebei to Shanghai through Shandong, Jiangsu.

Identification of Cluster with Composite Mean and Variance (합성된 평균과 분산을 가진 군집 식별)

  • Kim, Seung-Gu
    • Communications for Statistical Applications and Methods
    • /
    • v.18 no.3
    • /
    • pp.391-401
    • /
    • 2011
  • Consider a cluster, so called a 'son cluster', whose mean and variance is composed of the means and variances of both clusters called as a 'father cluster' and a 'mother cluster'. In this paper, a method for identifying each of three clusters is provided by modeling the relationship with father and mother clusters. Under the normal mixture model, the parameters are estimated via EM algorithm. We were able to overcome the problems of estimation using ECM approximation. Numerical examples show that our method can effectively identify the three clusters, so called a 'family of clusters'.

Analyzing Clustered and Interval-Censored Data based on the Semiparametric Frailty Model

  • Kim, Jin-Heum;Kim, Youn-Nam
    • The Korean Journal of Applied Statistics
    • /
    • v.25 no.5
    • /
    • pp.707-718
    • /
    • 2012
  • We propose a semi-parametric model to analyze clustered and interval-censored data; in addition, we plugged-in a gamma frailty to the model to measure the association of members within the same cluster. We propose an estimation procedure based on EM algorithm. Simulation results showed that our estimation procedure may result in unbiased estimates. The standard error is smaller than expected and provides conservative results to estimate the coverage rate; however, this trend gradually disappeared as the number of members in the same cluster increased. In addition, our proposed method was illustrated with data taken from diabetic retinopathy studies to evaluate the effectiveness of laser photocoagulation in delaying or preventing the onset of blindness in individuals with diabetic retinopathy.

An Improved Estimation Model of Server Power Consumption for Saving Energy in a Server Cluster Environment (서버 클러스터 환경에서 에너지 절약을 위한 향상된 서버 전력 소비 추정 모델)

  • Kim, Dong-Jun;Kwak, Hu-Keun;Kwon, Hui-Ung;Kim, Young-Jong;Chung, Kyu-Sik
    • The KIPS Transactions:PartA
    • /
    • v.19A no.3
    • /
    • pp.139-146
    • /
    • 2012
  • In the server cluster environment, one of the ways saving energy is to control server's power according to traffic conditions. This is to determine the ON/OFF state of servers according to energy usage of data center and each server. To do this, we need a way to estimate each server's energy. In this paper, we use a software-based power consumption estimation model because it is more efficient than the hardware model using power meter in terms of energy and cost. The traditional software-based power consumption estimation model has a drawback in that it doesn't know well the computing status of servers because it uses only the idle status field of CPU. Therefore it doesn't estimate consumption power effectively. In this paper, we present a CPU field based power consumption estimation model to estimate more accurate than the two traditional models (CPU/Disk/Memory utilization based power consumption estimation model and CPU idle utilization based power consumption estimation model) by using the various status fields of CPU to get the CPU status of servers and the overall status of system. We performed experiments using 2 PCs and compared the power consumption estimated by the power consumption model (software) with that measured by the power meter (hardware). The experimental results show that the traditional model has about 8-15% average error rate but our proposed model has about 2% average error rate.

Bayesian analysis of finite mixture model with cluster-specific random effects (군집 특정 변량효과를 포함한 유한 혼합 모형의 베이지안 분석)

  • Lee, Hyejin;Kyung, Minjung
    • The Korean Journal of Applied Statistics
    • /
    • v.30 no.1
    • /
    • pp.57-68
    • /
    • 2017
  • Clustering algorithms attempt to find a partition of a finite set of objects in to a potentially predetermined number of nonempty subsets. Gibbs sampling of a normal mixture of linear mixed regressions with a Dirichlet prior distribution calculates posterior probabilities when the number of clusters was known. Our approach provides simultaneous partitioning and parameter estimation with the computation of classification probabilities. A Monte Carlo study of curve estimation results showed that the model was useful for function estimation. Examples are given to show how these models perform on real data.