• Title/Summary/Keyword: cubing

Search Result 8, Processing Time 0.033 seconds

H*-tree/H*-cubing-cubing: Improved Data Cube Structure and Cubing Method for OLAP on Data Stream (H*-tree/H*-cubing: 데이터 스트림의 OLAP를 위한 향상된 데이터 큐브 구조 및 큐빙 기법)

  • Chen, Xiangrui;Li, Yan;Lee, Dong-Wook;Kim, Gyoung-Bae;Bae, Hae-Young
    • The KIPS Transactions:PartD
    • /
    • v.16D no.4
    • /
    • pp.475-486
    • /
    • 2009
  • Data cube plays an important role in multi-dimensional, multi-level data analysis. Meeting on-line analysis requirements of data stream, several cube structures have been proposed for OLAP on data stream, such as stream cube, flowcube, S-cube. Since it is costly to construct data cube and execute ad-hoc OLAP queries, more research works should be done considering efficient data structure, query method and algorithms. Stream cube uses H-cubing to compute selected cuboids and store the computed cells in an H-tree, which form the cuboids along popular-path. However, the H-tree layoutis disorderly and H-cubing method relies too much on popular path.In this paper, first, we propose $H^*$-tree, an improved data structure, which makes the retrieval operation in tree structure more efficient. Second, we propose an improved cubing method, $H^*$-cubing, with respect to computing the cuboids that cannot be retrieved along popular-path when an ad-hoc OLAP query is executed. $H^*$-tree construction and $H^*$-cubing algorithms are given. Performance study turns out that during the construction step, $H^*$-tree outperforms H-tree with a more desirable trade-off between time and memory usage, and $H^*$-cubing is better adapted to ad-hoc OLAP querieswith respect to the factors such as time and memory space.

An Efficient Algorithm for Multi-dimensional Sequential Pattern Mining (다차원 순차패턴 마이닝을 위한 효율적 알고리즘)

  • 이순신;김은주;김명원
    • Proceedings of the Korean Information Science Society Conference
    • /
    • 2004.10a
    • /
    • pp.214-216
    • /
    • 2004
  • 순차패턴 마이닝은 데이터들 속에서 어떤 순차 관계가 들어 있는 패턴을 찾는 것이다. 순차 패턴은 다양한 분야에서 중요하게 쓰인다. 예를 들어, 소비자가 구입한 물품들 간의 순차적인 관계성은 다음에 구입할 물건을 예측하는데 쓰일 수 있다. 또한 방문 웹 페이지의 순차 패턴은 사용자가 방문하고자 하는 다음 페이지를 예측하는데 중요할 수 있다. 본 논문에서는 다차원 순차패턴을 마이닝하는 새로운 효율적인 알고리즘의 구현에 대해 설명한다 다차원 순차 패턴 마이닝은 속성-값(attribute-value) 기술을 포함하는 순차 패턴의 연관 규칙을 찾는 것이다. 다음의 두 가지의 현존하는 효율적 알고리즘을 융합하였다. 순차패턴 마이닝을 위한 PrefixSpan 알고리즘과 비 순차패턴 마이닝을 위한 StarCubing 알고리즘. 새로운 알고리즘은 다차원 데이터를 마이닝 하는 StarCubing알고리즘의 효율성을 이용하므로 다차원 순차 데이터를 마이닝 하는데 효율적일 것이다. 실험결과는 제안한 알고리즘이 특히 작은 최소지지도와 작은 cardinality에서 Seq-Dim과 Dim-Seq 같은 현존하는 알고리즘보다 나은 성능임을 보여준다.

  • PDF

Spatio-temporal Query Clustering: A Data Cubing Approach (시공간 질의 클러스터링: 데이터 큐빙 기법)

  • Chen, Xiangrui;Baek, Sung-Ha;Bae, Hae-Young
    • Proceedings of the Korea Information Processing Society Conference
    • /
    • 2009.11a
    • /
    • pp.287-288
    • /
    • 2009
  • Multi-query optimization (MQO) is a critical research issue in the real-time data stream management system (DSMS). We propose to address this problem in the ubiquitous GIS (u-GIS) environment, focusing on grouping 'similar' spatio-temporal queries incrementally into N clusters so that they can be processed virtually as N queries. By minimizing N, the overlaps in the data requirements of the raw queries can be avoided, which implies the reducing of the total disk I/O cost. In this paper, we define the spatio-temporal query clustering problem and give a data cubing approach (Q-cube), which is expected to be implemented in the cloud computing paradigm.

Data Cube Generation Method Using Hash Table in Spatial Data Warehouse (공간 데이터 웨어하우스에서 해쉬 테이블을 이용한 데이터큐브의 생성 기법)

  • Li, Yan;Kim, Hyung-Sun;You, Byeong-Seob;Lee, Jae-Dong;Bae, Hae-Young
    • Journal of Korea Multimedia Society
    • /
    • v.9 no.11
    • /
    • pp.1381-1394
    • /
    • 2006
  • Generation methods of data cube have been studied for many years in data warehouse which supports decision making using stored data. There are two previous studies, one is multi-way array algorithm and the other is H-cubing algorithm which is based on the hyper-tree. The multi-way array algorithm stores all aggregation data in arrays, so if the base data is increased, the size of memory is also grow. The H-cubing algorithm which is based on the hyper-tree stores all tuples in one tree so the construction cost is increased. In this paper, we present an efficient data cube generation method based on hash table using weight mapping table and record hash table. Because the proposed method uses a hash table, the generation cost of data cube is decreased and the memory usage is also decreased. In the performance study, we shows that the proposed method provides faster search operation time and make data cube generation operate more efficiently.

  • PDF

Effect of Moisture Content on Physical and Chemical Characteristics of Italian Ryegrass Cube (수분 함량이 이탈리안 라이그라스 큐브의 물리적 및 화학적 성상에 미치는 영향)

  • Moon, Byeong Heoun;Park, Hyung Soo;Shin, Jong Seo;Park, Byeong Ki;Kim, Jong Geun
    • Journal of The Korean Society of Grassland and Forage Science
    • /
    • v.36 no.1
    • /
    • pp.34-40
    • /
    • 2016
  • The objective of this study was to determine the effect of moisture content on the physical and chemical characteristics of Italian ryegrass cube. Cube quality according to moisture contents (15, 20, 25, and 30%) was determined. Cubes made with 15 to 20% moisture showed a little cracks. But, the amount of powder generate from these cubes were lower by 10 to 16% compared to other cubes made with 25 to 30% moisture contents. The highest hardness at 159 kg/f was obtained when the cube was made with 15% moisture content and the lowest was 70 kg/f when the cube was made with 30% moisture content. The electrical loading and surface temperature were increased when moisture content was decreased. The chemical compositions of cube were differ from those of raw materials. Crude protein (CP) and ether extract (EE) contents were increased after cubing works. However, crude fiber (CF), acid detergent fiber (ADF), and neutral detergent fiber (NDF) contents were decreased after cubing. The crude ash content was not significantly (p > 0.05) different between raw material and cube. Higher moisture content resulted in higher crude protein content. However, crude fiber and crude ash content were not significantly (p > 0.05) different between each other. The contents of ADF and NDF were the lowest in cubes made with 30% moisture content. Our results suggest that the proper moisture content of Italian ryegrass cubing is recommended to be 15 to 20% and that cubing works should help increase forage quality.

Efficient Creation of Data Cube Using Hash Table in Data Warehouse (데이터 웨어하우스에서 해쉬 테이블을 이용한 효율적인 데이터 큐브 생성 기법)

  • Kim Hyungsun;You Byeongseob;Lee JaeDong;Bae Haeyoung
    • Proceedings of the Korean Information Science Society Conference
    • /
    • 2005.11b
    • /
    • pp.211-213
    • /
    • 2005
  • 데이터 웨어하우스는 축적된 대량의 데이터를 분석하여 의사결정을 지원하는 시스템이다. 의사결정을 위한 대량의 데이터 분석은 많은 비용을 요구하므로, 질의 처리 성능을 높이고 의사 결정자에게 빠른 응답을 제공하는 효율적인 데이터 큐브 생성 기법이 연구되었다. 기존 기법으로는 Multiway Array 기법과 H-Cubing 기법이 있다. Multiway Array 기법은 다차원 집계 연산에 필요한 모든 데이터를 배열로 저장하는 것으로 데이터의 양이 많아질수록 메모리 사용이 증가한다. H-Cubing 기법은 Hyper-Tree를 기반으로 튜플을 트리로 구축하므로 모든 튜플을 트리로 구축해야 하는 비용이 증가한다. 본 논문에서는 데이터 웨어하우스에서 해쉬 테이블을 이용한 효율적인 데이터 큐브 생성 기법을 제안한다. 제안 기법은 데이터 큐브 생성 시 필드 해쉬 테이블과 레코드 해쉬 테이블을 사용한다. 필드 해쉬 테이블은 저장될 레코드 순서 계산을 위하여 각 필드에 대해 레벨 값을 해쉬 테이블로 관리한다. 레코드 해쉬 테이블은 데이터 큐브 테이블에 저장될 레코드의 순서와 데이터 큐브 테이블에 저장하기 위한 임시 레코드의 위치를 관리한다. 필드 해쉬 테이블을 이용하여 다차원 데이터의 저장될 레코드 순서를 빠르게 찾아 저장함으로서 데이터 큐브의 생성속도가 향상된다. 또한 해쉬 테이블 만을 유지하면 되므로 메모리 사용량이 감소한다. 따라서 해쉬 테이블의 사용으로 데이터의 빠른 검색과 데이터 큐브 생성 요청에 빠른 응답이 가능하다.

  • PDF

A Stereo Image Recognition-Based Method for measuring the volume of 3D Object (스테레오 영상 인식에 기반한 3D 물체의 부피계측방법)

  • Jeong, Yun-Su;Lee, Hae-Won;Kim, Jin-Seok;Won, Jong-Un
    • The KIPS Transactions:PartB
    • /
    • v.9B no.2
    • /
    • pp.237-244
    • /
    • 2002
  • In this paper, we propose a stereo image recognition-based method for measuring the volume of the rectangular parallelepiped. The method measures the volume from two images captured with two CCD (charge coupled device) cameras by sequential processes such as ROI (region of interest) extraction, feature extraction, and stereo matching-based vortex recognition. The proposed method makes it possible to measure the volume of the 3D object at high speed because only a few features are used in the process of stereo matching. From experimental results, it is demonstrated that this method is very effective for measuring the volume of the rectangular parallelepiped at high speed.

A Single Camera based Method for Cubing Rectangular Parallelepiped Objects (한대의 카메라에 기반한 직육면체의 부피 계측 방법)

  • Won, Jong-Won;Chung, Yun-Su;Kim, Woo-Seob;You, Kwang-Hun;Lee, Yong-Joon;Park, Kil-Houm
    • Journal of KIISE:Computing Practices and Letters
    • /
    • v.8 no.5
    • /
    • pp.562-573
    • /
    • 2002
  • In this paper, we propose a method for measuring the volume of packages for the efficient handling of the packages. Using the geometrical characteristics of the rectangular parallelepiped type objects, the method measures the volume of packages with one camera only in real time. In preprocessing of volume measurement, the method extracts outer lines of the object and then crossing points of the lines as feature points or vertexes. From these cross points(-feature points-), the volume of the package is calculated. Compared to the direct feature extraction, the proposed method shows especially the blurring robust result by using the line for feature extraction. Additionally, the method can get the stable result by considering object's direction. From experimental results, it is demonstrated that this method is very effective for the real time volume measurement of the rectangular parallelepiped.