• Title/Summary/Keyword: data partition

Search Result 416, Processing Time 0.025 seconds

Nonparametric Bayesian Multiple Comparisons for Geometric Populations

  • Ali, M. Masoom;Cho, J.S.;Begum, Munni
    • Journal of the Korean Data and Information Science Society
    • /
    • v.16 no.4
    • /
    • pp.1129-1140
    • /
    • 2005
  • A nonparametric Bayesian method for calculating posterior probabilities of the multiple comparison problem on the parameters of several Geometric populations is presented. Bayesian multiple comparisons under two different prior/ likelihood combinations was studied by Gopalan and Berry(1998) using Dirichlet process priors. In this paper, we followed the same approach to calculate posterior probabilities for various hypotheses in a statistical experiment with a partition on the parameter space induced by equality and inequality relationships on the parameters of several geometric populations. This also leads to a simple method for obtaining pairwise comparisons of probability of successes. Gibbs sampling technique was used to evaluate the posterior probabilities of all possible hypotheses that are analytically intractable. A numerical example is given to illustrate the procedure.

  • PDF

A Two-Stage Method for Near-Optimal Clustering (최적에 가까운 군집화를 위한 이단계 방법)

  • 윤복식
    • Journal of the Korean Operations Research and Management Science Society
    • /
    • v.29 no.1
    • /
    • pp.43-56
    • /
    • 2004
  • The purpose of clustering is to partition a set of objects into several clusters based on some appropriate similarity measure. In most cases, clustering is considered without any prior information on the number of clusters or the structure of the given data, which makes clustering is one example of very complicated combinatorial optimization problems. In this paper we propose a general-purpose clustering method that can determine the proper number of clusters as well as efficiently carry out clustering analysis for various types of data. The method is composed of two stages. In the first stage, two different hierarchical clustering methods are used to get a reasonably good clustering result, which is improved In the second stage by ASA(accelerated simulated annealing) algorithm equipped with specially designed perturbation schemes. Extensive experimental results are given to demonstrate the apparent usefulness of our ASA clustering method.

A Novel Reconfigurable Processor Using Dynamically Partitioned SIMD for Multimedia Applications

  • Lyuh, Chun-Gi;Suk, Jung-Hee;Chun, Ik-Jae;Roh, Tae-Moon
    • ETRI Journal
    • /
    • v.31 no.6
    • /
    • pp.709-716
    • /
    • 2009
  • In this paper, we propose a novel reconfigurable processor using dynamically partitioned single-instruction multiple-data (DP-SIMD) which is able to process multimedia data. The SIMD processor and parallel SIMD (P-SIMD) processor, which is composed of a number of SIMD processors, are usually used these days. But these processors are inefficient because all processing units (PUs) should process the same operations all the time. Moreover, the PUs can process different operations only when every SIMD group operation is predefined. We propose a processor control method which can partition parallel processors into multiple SIMD-based processors dynamically to enhance efficiency. For performance evaluation of the proposed method, we carried out the inverse transform, inverse quantization, and motion compensation operations of H.264 using processors based on SIMD, P-SIMD, and DP-SIMD. Experimental results show that the DP-SIMD control method is more efficient than SIMD and P-SIMD control methods by about 15% and 14%, respectively.

Dynamic Hysteresis Model Based on Fuzzy Clustering Approach

  • Mourad, Mordjaoui;Bouzid, Boudjema
    • Journal of Electrical Engineering and Technology
    • /
    • v.7 no.6
    • /
    • pp.884-890
    • /
    • 2012
  • Hysteretic behavior model of soft magnetic material usually used in electrical machines and electronic devices is necessary for numerical solution of Maxwell equation. In this study, a new dynamic hysteresis model is presented, based on the nonlinear dynamic system identification from measured data capabilities of fuzzy clustering algorithm. The developed model is based on a Gustafson-Kessel (GK) fuzzy approach used on a normalized gathered data from measured dynamic cycles on a C core transformer made of 0.33mm laminations of cold rolled SiFe. The number of fuzzy rules is optimized by some cluster validity measures like 'partition coefficient' and 'classification entropy'. The clustering results from the GK approach show that it is not only very accurate but also provides its effectiveness and potential for dynamic magnetic hysteresis modeling.

Semidefinite Spectral Clustering (준정부호 스펙트럼의 군집화)

  • Kim, Jae-Hwan;Choi, Seung-Jin
    • Proceedings of the Korean Information Science Society Conference
    • /
    • 2005.07a
    • /
    • pp.892-894
    • /
    • 2005
  • Graph partitioning provides an important tool for data clustering, but is an NP-hard combinatorial optimization problem. Spectral clustering where the clustering is performed by the eigen-decomposition of an affinity matrix [1,2]. This is a popular way of solving the graph partitioning problem. On the other hand, semidefinite relaxation, is an alternative way of relaxing combinatorial optimization. issuing to a convex optimization[4]. In this paper we present a semidefinite programming (SDP) approach to graph equi-partitioning for clustering and then we use eigen-decomposition to obtain an optimal partition set. Therefore, the method is referred to as semidefinite spectral clustering (SSC). Numerical experiments with several artificial and real data sets, demonstrate the useful behavior of our SSC. compared to existing spectral clustering methods.

  • PDF

Mechanisms of Cu(II) Sorption at Several Mineral/Water Interfaces: An EPR Study

  • Cho, Young-Hwan;Hyun, Sung-Pil;Pilsoo Hahn
    • Proceedings of the Korean Magnetic Resonance Society Conference
    • /
    • 2002.08a
    • /
    • pp.72-72
    • /
    • 2002
  • In most traditional sorption study in environmental conditions, experimental sorption data have been measured and interpreted by empirical ways such as partition coefficient and sorption isotherms. A mechanistic understanding of heavy metal interactions with various minerals (metal oxides, clay minerals) in aqueous medium is required to describe the behavior of radioactive metal ions in the environment. Various spectroscopic methods provide direct or indirect information on sorption mechanisms involved. We applied EPR (Electron Paramagnetic Resonance) spectroscopy to investigate the nature of metal ion sorption at water/mineral interfaces using Cu(II) as a spin probe. The major sorbed species and their motional state was identified by their EPR spectra. They showed distinct signals due to their strength of binding, local structure and motional state. The EPR results together with macroscopic sorption data show that sorption involved at least three different mechanisms depending on chemical environments (1).

  • PDF

Multi-mode Radar Signal Sorting by Means of Spatial Data Mining

  • Wan, Jian;Nan, Pulong;Guo, Qiang;Wang, Qiangbo
    • Journal of Communications and Networks
    • /
    • v.18 no.5
    • /
    • pp.725-734
    • /
    • 2016
  • For multi-mode radar signals in complex electromagnetic environment, different modes of one emitter tend to be deinterleaved into several emitters, called as "extension", when processing received signals by use of existing sorting methods. The "extension" problem inevitably deteriorates the sorting performance of multi-mode radar signals. In this paper, a novel method based on spatial data mining is presented to address above challenge. Based on theories of data field, we describe the distribution information of feature parameters using potential field, and makes partition clustering of parameter samples according to revealed distribution features. Additionally, an evaluation criterion based on cloud model membership is established to measure the relevance between different cluster-classes, which provides important spatial knowledge for the solution of the "extension" problem. It is shown through numerical simulations that the proposed method is effective on solving the "extension" problem in multi-mode radar signal sorting, and can achieve higher correct sorting rate.

An efficient VLSI Implementation of the 2-D DCT with the Algorithm Decomposition (알고리즘 분해를 이용한 2-D DCT)

  • Jeong, Jae-Gil
    • The Journal of Natural Sciences
    • /
    • v.7
    • /
    • pp.27-35
    • /
    • 1995
  • This paper introduces a VLSI (Very Large Scale Integrated Circuit) implementation of the 2-D Discrete Cosine Transform (DCT) with an application to image and video coding. This implementation, which is based upon a state space model, uses both algorithm and data partitioning to achieve high efficiency. With this implementation, the amount of data transfers between the processing elements (PEs) are reduced and all the data transfers are limitted to be local. This system accepts the input as a progressively scanned data stream which reduces the hardware required for the input data control module. With proper ordering of computations, a matrix transposition between two matrix by matrix multiplications, which is required in many 2-D DCT systems based upon a row-column decomposition, can be also removed. The new implementation scheme makes it feasible to implement a single 2-D DCT VLSI chip which can be easily expanded for a larger 2-D DCT by cascading these chips.

  • PDF

An Efficient Parallel Construction Scheme of An R-Tree using Hadoop (Hadoop을 이용한 R-트리의 효율적인 병렬 구축 기법)

  • Cong, Viet-Ngu Huynh;Kim, Jongmin;Kwon, Oh-Heum;Song, Ha-Joo
    • Journal of Korea Multimedia Society
    • /
    • v.22 no.2
    • /
    • pp.231-241
    • /
    • 2019
  • Bulk-loading an R-tree can be a good approach to build an efficient one. However, it takes a lot of time to bulk-load an R-tree for huge amount of data. In this paper, we propose a parallel R-tree construction scheme based on a Hadoop framework. The proposed scheme divides the data set into a number of partitions for which local R-trees are built in parallel via Map-Reduce operations. Then the local R-trees are merged into an global R-tree that covers the whole data set. While generating the partitions, it considers the spatial distribution of the data into account so that each partition has nearly equal amounts of data. Therefore, the proposed scheme gives an efficient index structure while reducing the construction time. Experimental tests show that the proposed scheme builds an R-tree more efficiently than the existing approaches.

Shape Reconstruction from Large Amount of Point Data using Repetitive Domain Decomposition Method (반복적 영역분할법을 이용한 대용량의 점데이터로부터의 형상 재구성)

  • Yoo, Dong-Jin
    • Journal of the Korean Society for Precision Engineering
    • /
    • v.23 no.11 s.188
    • /
    • pp.93-102
    • /
    • 2006
  • In this study an advanced domain decomposition method is suggested in order to construct surface models from very large amount of points. In this method the spatial domain of interest that is occupied by the input set of points is divided in repetitive manner. First, the space is divided into smaller domains where the problem can be solved independently. Then each subdomain is again divided into much smaller domains where the problem can be solved locally. These local solutions of subdivided domains are blended together to obtain a solution of each subdomain using partition of unity function. Then the solutions of subdomains are merged together in order to construct whole surface model. The suggested methods are conceptually very simple and easy to implement. Since RDDM(Repetitive Domain Decomposition Method) is effective in the computation time and memory consumption, the present study is capable of providing a fast and accurate reconstructions of complex shapes from large amount of point data containing millions of points. The effectiveness and validity of the suggested methods are demonstrated by performing numerical experiments for the various types of point data.