• 제목/요약/키워드: Cluster Modeling

검색결과 200건 처리시간 0.025초

A Table Integration Technique Using Query Similarity Analysis

  • Choi, Go-Bong;Woo, Yong-Tae
    • 한국컴퓨터정보학회논문지
    • /
    • 제24권3호
    • /
    • pp.105-112
    • /
    • 2019
  • In this paper, we propose a technique to analyze similarity between SQL queries and to assist integrating similar tables. First, the table information was extracted from the SQL queries through the query structure analyzer, and the similarity between the tables was measured using the Jacquard index technique. Then, similar table clusters are generated through hierarchical cluster analysis method and the co-occurence probability of the table used in the query is calculated. The possibility of integrating similar tables is classified by using the possibility of co-occurence of similarity table and table, and classifying them into an integrable cluster, a cluster requiring expert review, and a cluster with low integration possibility. This technique analyzes the SQL query in practice and analyse the possibility of table integration independent of the existing business, so that the existing schema can be effectively reconstructed without interruption of work or additional cost.

합성된 평균과 분산을 가진 군집 식별 (Identification of Cluster with Composite Mean and Variance)

  • 김승구
    • Communications for Statistical Applications and Methods
    • /
    • 제18권3호
    • /
    • pp.391-401
    • /
    • 2011
  • 본 논문에서는 자료 내의 군집 중에 '부(父) 군집'과 모(母) 군집'이라 부르는 두 군집 사이에, 합성된 평균 분산을 가지는 '합성군집' 즉 '자식 군집'이라 부르는 한 군집이 있을 경우에 주목하여, 그들의 관계를 평균과 분산에 관해 모형화하고 각각의 군집을 식별하는 방법을 제공하였다. 관측치는 정규혼합모형을 따른다고 가정하고, EM 알고리즘을 통해 모형 추정을 시도하였다. 추정 과정에 여러 난제가 있었으나, 근사적 방법으로 비교적 잘 극복할수 있었다. 그리고 수치실험을 통해 제안방법은 성공적으로 주어진 세 군집 즉 '군집족(族)'을 식별할수 있음을 보였다.

클러스터 기반 퍼지 모델트리를 이용한 데이터 모델링 (Data Modeling using Cluster Based Fuzzy Model Tree)

  • 이대종;박진일;박상영;정남정;전명근
    • 한국지능시스템학회논문지
    • /
    • 제16권5호
    • /
    • pp.608-615
    • /
    • 2006
  • 본 논문에서는 퍼지 클러스터 기법을 이용하여 구간 분할된 퍼지 모델트리의 제안과 이를 이용한 데이터 모델링 기법을 다룬다. 제안된 방법은 먼저 입력과 출력변수의 속성을 고려한 퍼지 클러스터링에 의해 중심벡터를 계산한 후, 중심벡터들과 입력속성간의 소속도를 이용하여 구간 분할된 영역별로 각각의 선형모델을 구축한다. 노드의 확장은 부모노드(parent node)에서 만들어진 모델에서 계산된 오차값과 자식노드(child node)에서 계산된 오차값을 비교하여 이루어진다. 출력값 예측 단계에서는 입력된 데이터와 잎노드에서 계산된 클러스터 중심값과 비교하여 소속도가 높은 선형모델을 선택하여 데이터에 대한 출력값을 예측하게 된다. 제안된 방법의 우수성을 보이기 위해 다양한 데이터를 대상으로 실험한 결과, 기존의 모델트리방식 및 뉴럴 네트워크 기반의 신경회로망 보다 향상된 성능을 보임을 알 수 있었다.

Environmental Survey Data Modeling Using K-means Clustering Techniques

  • Park, Hee-Chang;Cho, Kwang-Hyun
    • Journal of the Korean Data and Information Science Society
    • /
    • 제16권3호
    • /
    • pp.557-566
    • /
    • 2005
  • Clustering is the process of grouping the data into clusters so that objects within a cluster have high similarity in comparison to one another. In this paper we used k-means clustering of several clustering techniques. The k-means Clustering Is classified as a partitional clustering method. We analyze 2002 Gyeongnam social indicator survey data using k-means clustering techniques for environmental information. We can use these outputs given by k-means clustering for environmental preservation and environmental improvement.

  • PDF

퍼지컬러 모델을 이용한 컬러 데이터 클러스터링 알고리즘1 (Color Data Clustering Algorithm using Fuzzy Color Model)

  • Kim, Dae-Won;Lee, Kwang H.
    • 한국지능시스템학회:학술대회논문집
    • /
    • 한국퍼지및지능시스템학회 2002년도 춘계학술대회 및 임시총회
    • /
    • pp.119-122
    • /
    • 2002
  • The research Interest of this paper is focused on the efficient clustering task for an arbitrary color data. In order to tackle this problem, we have tiled to model the inherent uncertainty and vagueness of color data using fuzzy color model. By laking a fuzzy approach to color modeling, we could make a soft decision for the vague regions between neighboring colors. The proposed fuzzy color model defined a three dimensional fuzzy color ball and color membership computation method with the two inter-color distance measures. With the fuzzy color model, we developed a new fuzzy clustering algorithm for an efficient partition of color data. Each fuzzy cluster set has a cluster prototype which is represented by fuzzy color centroid.

  • PDF

분산 서버 클러스터 시스템의 부하 분산 및 성능 분석 시뮬레이션 (Workload Distribution and Performance Analysis Simulation for a Distributed Server Cluster System)

  • 최은미;이원규
    • 한국시뮬레이션학회논문지
    • /
    • 제12권4호
    • /
    • pp.103-111
    • /
    • 2003
  • A distributed sewer cluster system is a cost-effective system to provide a service application for clients with reliable, scalable, available, and fault-tolerant features. In order to provide high quality services, it is necessary to evaluate service performances, tune the server system, and analyze performances. In this paper, we propose a simulator to generate workloads based on statistic configuration according to estimated application traffics, apply workload scheduling algorithms, and evaluate the simulation results. We introduce the simulator design modelling and architecture. By using flexible parameters, the simulator is able to generate various patterns of workloads with different statistics, and configure system environments such as the number of server nodes, system resources considered, and their capacities. With this simulator, we introduce two scenarios: one is to find appropriate thresholds for the best performance of cluster system, and the other is to find the suitable scheduling algorithm for workload characteristics of service applications.

  • PDF

Weak-Lensing Study of Galaxy Cluster PLCKG287.0+32.9

  • Finner, Kyle;Jee, Myungkook James
    • 천문학회보
    • /
    • 제41권1호
    • /
    • pp.71.2-71.2
    • /
    • 2016
  • Merging galaxy clusters, such as PLCKG287.0+32.9, provide a window into the formation process of the large scale structure of the universe. PLCKG287.0+32.9 is an enormous merging galaxy cluster with mass estimated to be ~10^15 Msun. It hosts a pair of mega-parsec sized radio relics with projected offsets from the X-ray center of approximately 350kpc and 2.7Mpc, suggesting a NW-SE merging scenario with relics originating from two separate passes (Bonafede et al. 2014). A detected radio halo coincides with the center of x-ray emission. We present the motivation for our weak lensing study of the merging galaxy cluster PLCKG287.0+32.9 using recent Subaru optical imaging. We discuss the basics of weak-lensing and the criteria for source selection. In addition, we describe our method of PSF modeling and mass reconstruction.

  • PDF

Development of Simulator with Cluster System for Towing Fisheries

  • Park Myeong-Chul;Ha Seok-Wun
    • Journal of information and communication convergence engineering
    • /
    • 제3권2호
    • /
    • pp.84-89
    • /
    • 2005
  • Goal of this study is to implement 3-dimensional underwater appearance graphical display, fishery measured information display, sonar data representation and display, and 3-dimensional underwater appearance animation based on coefficient data of chaos behavior and fishing modeling of fishing gears from PC cluster system. In order to accomplish the goals of this study, it is essential to compose user interfacing and realistic description of image scenes in the towing-net fishery simulator, and techniques to describe sand cloud effects under water using particle systems are necessary. In this study, we implemented graphical representations and animations of the simulator by using OpenGL together with C routines.

분산 서버 클러스터 시스템의 부하 분산 및 성능 분석 시뮬레이션 (Workload Distribution and Performance Analysis Simulation for a Distributed Server Cluster System)

  • 최은미;이원규
    • 한국시뮬레이션학회:학술대회논문집
    • /
    • 한국시뮬레이션학회 2003년도 추계학술대회 및 정기총회
    • /
    • pp.27-34
    • /
    • 2003
  • A distributed server cluster system is a cost-effective system to provide a service application for clients with reliable, scalable, available, and fault-tolerant features. In order to provide high quality services, it is necessary to evaluate service performances, tune the server system, and analyze performances. In this paper, we propose a simulator to generate workloads based on statistic configuration according to estimated application traffics, apply workload scheduling algorithms, and evaluate the simulation results. We introduce the simulator design modelling and architecture. By using flexible parameters, the simulator is able to generate various patterns of workloads with different statistics, and configure system environments such as the number of server nodes, system resources considered, and their capacities. With this simulator, we introduce two scenarios: one is to find appropriate thresholds for the best performance of cluster system, and the other is to find the suitable scheduling algorithm for workload characteristics of service applications.

  • PDF

Obstacles modeling method in cluttered environments using satellite images and its application to path planning for USV

  • Shi, Binghua;Su, Yixin;Zhang, Huajun;Liu, Jiawen;Wan, Lili
    • International Journal of Naval Architecture and Ocean Engineering
    • /
    • 제11권1호
    • /
    • pp.202-210
    • /
    • 2019
  • The obstacles modeling is a fundamental and significant issue for path planning and automatic navigation of Unmanned Surface Vehicle (USV). In this study, we propose a novel obstacles modeling method based on high resolution satellite images. It involves two main steps: extraction of obstacle features and construction of convex hulls. To extract the obstacle features, a series of operations such as sea-land segmentation, obstacles details enhancement, and morphological transformations are applied. Furthermore, an efficient algorithm is proposed to mask the obstacles into convex hulls, which mainly includes the cluster analysis of obstacles area and the determination rules of edge points. Experimental results demonstrate that the models achieved by the proposed method and the manual have high similarity. As an application, the model is used to find the optimal path for USV. The study shows that the obstacles modeling method is feasible, and it can be applied to USV path planning.