Search | Korea Science

Efficient Processing method of OLAP Range-Sum Queries in a dynamic warehouse environment (다이나믹 데이터 웨어하우스 환경에서 OLAP 영역-합 질의의 효율적인 처리 방법)

Chun, Seok-Ju;Lee, Ju-Hong
- The KIPS Transactions:PartD
- /
- v.10D no.3
- /
- pp.427-438
- /
- 2003
In a data warehouse, users typically search for trends, patterns, or unusual data behaviors by issuing queries interactively. The OLAP range-sum query is widely used in finding trends and in discovering relationships among attributes in the data warehouse. In a recent environment of enterprises, data elements in a data cube are frequently changed. The problem is that the cost of updating a prefix sum cube is very high. In this paper, we propose a novel algorithm which reduces the update cost significantly by an index structure called the Δ-tree. Also, we propose a hybrid method to provide either approximate or precise results to reduce the overall cost of queries. It is highly beneficial for various applications that need quick approximate answers rather than time consuming accurate ones, such as decision support systems. An extensive experiment shows that our method performs very efficiently on diverse dimensionalities, compared to other methods.
https://doi.org/10.3745/KIPSTD.2003.10D.3.427 인용 PDF KSCI

Efficient Storage Techniques for Materialized Views Using Multi-Zoned Disks in OLAP Environment (OLAP 환경에서 다중 존 디스크를 활용한 실체뷰의 효율적 저장 기법)

Chang, Jae-Young
- The Journal of Society for e-Business Studies
- /
- v.14 no.1
- /
- pp.143-160
- /
- 2009
In determining the performance of OLAP database applications, the structure and the effective access methods to the underlying disk system is a significant factor. In recent years, hard disks are designed with multiple physical zones where seek times and data transfer rates vary across the zones. However, there is little consideration of multi-zone disks in previous works. Instead, they assumed a traditional disk model that comes with many simplifying assumptions such as an average seek-time and a single data transfer rate. In this paper, we propose a technique storing a set of materialized views into the multi-zoned disks in OLAP environment dealing with large sets of data. We first present the disk zoning algorithm of materialized views according to the access probabilities of each views. Also, we address the problem of storing views in the dynamic environment where data are updated continuously. Finally, through experiments, we prove the performance improvement of the proposed algorithm against the conventional methods.
PDF

PC-SAN: Pretraining-Based Contextual Self-Attention Model for Topic Essay Generation

Lin, Fuqiang;Ma, Xingkong;Chen, Yaofeng;Zhou, Jiajun;Liu, Bo
- KSII Transactions on Internet and Information Systems (TIIS)
- /
- v.14 no.8
- /
- pp.3168-3186
- /
- 2020
Automatic topic essay generation (TEG) is a controllable text generation task that aims to generate informative, diverse, and topic-consistent essays based on multiple topics. To make the generated essays of high quality, a reasonable method should consider both diversity and topic-consistency. Another essential issue is the intrinsic link of the topics, which contributes to making the essays closely surround the semantics of provided topics. However, it remains challenging for TEG to fill the semantic gap between source topic words and target output, and a more powerful model is needed to capture the semantics of given topics. To this end, we propose a pretraining-based contextual self-attention (PC-SAN) model that is built upon the seq2seq framework. For the encoder of our model, we employ a dynamic weight sum of layers from BERT to fully utilize the semantics of topics, which is of great help to fill the gap and improve the quality of the generated essays. In the decoding phase, we also transform the target-side contextual history information into the query layers to alleviate the lack of context in typical self-attention networks (SANs). Experimental results on large-scale paragraph-level Chinese corpora verify that our model is capable of generating diverse, topic-consistent text and essentially makes improvements as compare to strong baselines. Furthermore, extensive analysis validates the effectiveness of contextual embeddings from BERT and contextual history information in SANs.
https://doi.org/10.3837/tiis.2020.08.001 인용 PDF KSCI HTML

A Cyclic Sliced Partitioning Method for Packing High-dimensional Data (고차원 데이타 패킹을 위한 주기적 편중 분할 방법)

김태완;이기준
- Journal of KIISE:Databases
- /
- v.31 no.2
- /
- pp.122-131
- /
- 2004
Traditional works on indexing have been suggested for low dimensional data under dynamic environments. But recent database applications require efficient processing of huge sire of high dimensional data under static environments. Thus many indexing strategies suggested especially in partitioning ones do not adapt to these new environments. In our study, we point out these facts and propose a new partitioning strategy, which complies with new applications' requirements and is derived from analysis. As a preliminary step to propose our method, we apply a packing technique on the one hand and exploit observations on the Minkowski-sum cost model on the other, under uniform data distribution. Observations predict that unbalanced partitioning strategy may be more query-efficient than balanced partitioning strategy for high dimensional data. Thus we propose our method, called CSP (Cyclic Spliced Partitioning method). Analysis on this method explicitly suggests metrics on how to partition high dimensional data. By the cost model, simulations, and experiments, we show excellent performance of our method over balanced strategy. By experimental studies on other indices and packing methods, we also show the superiority of our method.
PDF KSCI

Term Clustering and Duplicate Distribution for Efficient Parallel Information Retrieval (효율적인 병렬정보검색을 위한 색인어 군집화 및 분산저장 기법)

강재호;양재완;정성원;류광렬;권혁철;정상화
- Journal of KIISE:Software and Applications
- /
- v.30 no.1_2
- /
- pp.129-139
- /
- 2003
The PC cluster architecture is considered as a cost-effective alternative to the existing supercomputers for realizing a high-performance information retrieval (IR) system. To implement an efficient IR system on a PC cluster, it is essential to achieve maximum parallelism by having the data appropriately distributed to the local hard disks of the PCs in such a way that the disk I/O and the subsequent computation are distributed as evenly as possible to all the PCs. If the terms in the inverted index file can be classified to closely related clusters, the parallelism can be maximized by distributing them to the PCs in an interleaved manner. One of the goals of this research is the development of methods for automatically clustering the terms based on the likelihood of the terms' co-occurrence in the same query. Also, in this paper, we propose a method for duplicate distribution of inverted index records among the PCs to achieve fault-tolerance as well as dynamic load balancing. Experiments with a large corpus revealed the efficiency and effectiveness of our method.
PDF KSCI

Dynamic Recommendation System for a Web Library by Using Cluster Analysis and Bayesian Learning (군집분석과 베이지안 학습을 이용한 웹 도서 동적 추천 시스템)

Choi, Jun-Hyeog;Kim, Dae-Su;Rim, Kee-Wook
- Journal of the Korean Institute of Intelligent Systems
- /
- v.12 no.5
- /
- pp.385-392
- /
- 2002
Collaborative filtering method for personalization can suggest new items and information which a user hasn t expected. But there are some problems. Not only the steps for calculating similarity value between each user is complex but also it doesn t reflect user s interest dynamically when a user input a query. In this paper, classifying users by their interest makes calculating similarity simple. We propose the a1gorithm for readjusting user s interest dynamically using the profile and Bayesian learning. When a user input a keyword searching for a item, his new interest is readjusted. And the user s profile that consists of used key words and the presence frequency of key words is designed and used to reflect the recent interest of users. Our methods of adjusting user s interest using the profile and Bayesian learning can improve the real satisfaction of users through the experiment with data set, collected in University s library. It recommends a user items which he would be interested in.
https://doi.org/10.5391/JKIIS.2002.12.5.385 인용 PDF KSCI

An Efficient Dynamic Path Query Processing Method for Digital Road Map Databases (디지털 로드맵 데이터베이스에서 효율적인 동적 경로 질의어 처리 방안)

Jung, Sung-Won
- Journal of KIISE:Databases
- /
- v.28 no.3
- /
- pp.430-448
- /
- 2001
In navigation system, a primary task is to compute the minimum cost route from the current location to the destination. One of major problems for navigation systems is that a significant amount of computation time is required when the digital road map is large. Since navigation systems are real time systems, it is critical that the path be computed while satisfying a time constraint. In this paper, we have developed a HiTi(Hierarchical MulTi) graph model for hierarchically structuring large digital road maps to speedup the minimum cost path computation. We propose a new shortest path algorithm named SPAH, which utilizes HiTi graph model of a digital road map for its computation. We prove that the shortest path computed by SPAH is the optimal. Our performance analysis of SPAH also showed that it significantly reduces the computation time over exiting methods. We present an in-depth experimental analysis of HiTi graph method by comparing it with other similar works.
PDF

Efficient RFID Search Protocols Providing Enhanced User Privacy (강화된 사용자 프라이버시를 보장하는 효율적인 RFID 검색 프로토콜)

Lim, Ji-Hwan;Oh, Hee-Kuck;Nyang, Dae-Hun;Lee, Mun-Kyu;Kim, Sang-Jin
- The KIPS Transactions:PartC
- /
- v.16C no.3
- /
- pp.347-356
- /
- 2009
In an RFID search protocol, a reader uses designated query to determine whether a specific tag is in the vicinity of the reader. This fundamental difference makes search protocol more vulnerable to replay attacks than authentication protocols. Due to this, techniques used in existing RFID authentication protocols may not be suitable for RFID search protocols. In this paper, we propose two RFID search protocols, one based on static ID and the other based on dynamic ID, which use counter to prevent replay attacks. Moreover, we propose a security model for RFID search protocols that includes forward/backward traceability, de-synchronization and forgery attack. Based on this model, we analyze security of our protocols and related works.
https://doi.org/10.3745/KIPSTC.2009.16-C.3.347 인용 PDF KSCI

Development of Moving Object Management System for Vehicle Monitoring/Control Management in e-Logistics Environment (e-Logistics 환경에서 차량관제를 위한 이동체 관리 시스템 개발)

Kim, Dong-Ho;Lee, Hye-Jin;Lee, Hyun-Ah;Kim, Jin-Suk
- The KIPS Transactions:PartD
- /
- v.11D no.6
- /
- pp.1231-1238
- /
- 2004
By virtue of the advanced Internet technology, there are lots of research works for e-Logistics which means virtual business activities or service architecture based on the Internet among the logistics companies. Because e-Logistics environment requires more dynamic and global service area, conventional vehicle monitoring and control technologies innate many problems in terms of Integrating, storing and sharing the location data. It needs the development of the moving object technology in order to resolve efficiently the limitations. In this paper, we propose the whole components of the moving object management system which supports the advanced sharing the location information as well as the integration of location data. We are sure the suggested system can be adopted to construct the next generation-logistics vehicle monitoring and control system by reducing the overall cost and time.
https://doi.org/10.3745/KIPSTD.2004.11D.6.1231 인용 PDF KSCI

A Study on XMDR-DSM System Design for Cooperative (협업을 위한 XMDR-DSM 시스템 설계에 관한 연구)

Moon, Seok-Jae;Jung, Kye-Dong;Choi, Young-Keun
- The KIPS Transactions:PartD
- /
- v.16D no.5
- /
- pp.701-714
- /
- 2009
In the enterprises the data integration based on service requires integrated data management as the change in the environment of enterprises accelerates. Cooperation among enterprises is accomplished through accessing distributed database using business process. As this approach is performed based on the global query, problems such as data heterogeneity, schema heterogeneity, and verification of validity have to be solved in advance for the interoperability among the heterogeneous system. Thus, cooperation requires dynamic and reliable construction. In this paper, we propose XMDR-DSM (eXtended MetaData Registry-Data Service Mediator) system for cooperation. XMDR-DSM, which is comprised of XMDR-DS, XMDR-DQP, and XMDR-DAI, supports the mapping between global schema and local schema and provides data access and integration service. Therefore, XMDR-DSM enables the mutual support of business operations among heterogeneous database. In addition, it can secure information as reusable asset and the standardization of interchange. Also it can manage unified information since it provides business process based on workflow; therefore, it will be able to increase the life span of information and reduce the cost.
https://doi.org/10.3745/KIPSTD.2009.16D.5.701 인용 PDF KSCI

Search Result 177, Processing Time 0.018 seconds

이메일무단수집거부

이용약관

제 1 장 총칙

제 2 장 이용계약의 체결

제 3 장 계약 당사자의 의무

제 4 장 서비스의 이용

제 5 장 계약 해지 및 이용 제한

제 6 장 손해배상 및 기타사항

Detail Search

Image Search (β)