• Title/Summary/Keyword: indexing technique

Search Result 204, Processing Time 0.024 seconds

A Theoretical Study of Designing Thesaurus Browser by Clustering Algorithm (클러스터링을 이용한 시소러스 브라우저의 설계에 대한 이론적 연구)

  • Seo, Hwi
    • Journal of Korean Library and Information Science Society
    • /
    • v.30 no.3
    • /
    • pp.427-456
    • /
    • 1999
  • This paper deals with the problems of information retrieval through full-test database which arise from both the deficiency of searching strategies or methods by information searcher and the difficulties of query representation, generation, extension, etc. In oder to solve these problems, we should use automatic retrieval instead of manual retrieval in the past. One of the ways to make the gap narrow between the terms by the writers and query by the searchers is that the query should be searched with the terms which the writers use. Thus, the preconditions which should be taken one accorded way to solve the problems are that all areas of information retrieval such as should taken one accorded way to solve the problems are that all areas of information retrieval such as contents analysis, information structure, query formation, query evaluation, etc. should be solved as a coherence way. We need to deal all the ares of automatic information retrieval for the efficiency of retrieval thought this paper is trying to solve the design of thesaurus browser. Thus, this paper shows the theoretical analyses about the form of information retrieval, automatic indexing, clustering technique, establishing and expressing thesaurus, and information retrieval technique. As the result of analyzing them, this paper shows us theoretical model, that is to say, the thesaurus browser by clustering algorithm. The result in the paper will be a theoretical basis on new retrieval algorithm.

  • PDF

An XPath Accelerator on Relational Databases: An Implementation and Its Tuning (관계형 데이터베이스를 이용한 XPath Accelerator: 구현과 튜닝)

  • Shin Jin-Ho;Na Gap-Joo;Lee Sang-Won
    • The KIPS Transactions:PartD
    • /
    • v.12D no.2 s.98
    • /
    • pp.189-198
    • /
    • 2005
  • XML is rapidly becoming the standard for data representation and exchange, and XML documents are being adopted in various applications. Since the late 1990s, some native XML database management systems(DBMSs) have been developed. More recently, commercial relational DBMS vendors try to incorporate full functionalities of XML into their products, such as Oracle, MS SQL and IBM DB2. In this paper, we implement a well-known RDBMS-based XML data storage and indexing technique, called XPath Accelerator, and tune it in an industry-leading RDBMS. Our contributions are two-folds: 1) an in-depth implementation of the XPath Accelerator technique and 2) its tuning to exploit the advanced query processing techniques of an RDBMS.

Estimation of Flood Risk Index for the Nakdong River Watershed (낙동강 유역의 홍수위험지수 산정)

  • Song, Jae Ha;Kim, Sangdan;Park, Moo Jong;Choi, Hyun Il
    • Journal of Korea Water Resources Association
    • /
    • v.46 no.1
    • /
    • pp.35-45
    • /
    • 2013
  • The aim of study is to present how to estimate and use the FRI (Flood Risk Index) for classifying area zones based on regional flooding risk in terms of the integrated flood risk management. To estimate the FRI at a spatial resolution of city/county/town units for the Nakdong River Watershed, the 17 representative flood indexing factors are carefully selected for the three flood indexes, such as PI (Pressure Index), SI (State Index), and RI (Response Index) under the P-S-R (Pressure-State-Response) classification system. Because flood indexing factors are measured at different scales and units, they are transformed into a common domain by the T-Score normalization technique. The entropy weight coefficient method is also applied to calculate the weight of flood indexing factors in order to reduce subjective judgement on the effect of weight coefficients. The three flood indexes of PI, SI, and RI are integrated for an overall value of the FRI to evaluate the flood risk of districts. To examine the practical application of the proposed FRI, the FRI results with/without the weight coefficients are compared with flooding zones of natural disaster risk areas officially announced in 2010. It is expected that the FRI ensured by full verification can make regional protection plans against flooding disasters with respect to causes and characteristics of past floods.

A Web GPS based Logistics Vehicle Control Management System using MVC Design Patterns (MVC 디자인 패턴을 활용한 Web GPS 기반의 물류차량 출하 관제 시스템)

  • Sim, Choon Bo;Kim, Kyoung Jong
    • Journal of Korea Society of Digital Industry and Information Management
    • /
    • v.6 no.1
    • /
    • pp.131-142
    • /
    • 2010
  • In this paper, we propose a web GPS based logistics vehicle control management system using MVC design patterns. The proposed system is designed by applying design patterns of object oriented modeling called mini-architecture to enhance reliability of software as well as promote stability of overall system design. In addition, we can get a position information by means of the GPS embedded in PDA and communicate between client and monitoring server using CDMA network so that the position of client can be identified directly by the map service. The system provides an moving object indexing technique which extends the existing TB-tree to manage and retrieve a transporting trajectory of logistics efficiently. Finally, with development of the logistics vehicle control service called WG-LOGICS system, we can verify the usefulness of our system which is able for monitoring a vehicle preparation, allocating registration, loading a burden, transfer path, and destination arrival in real world.

Content-Based Retrieval using MPEG-7 Visual Descriptor and Hippocampal Neural Network (MPEG-7 시각 기술자와 해마 신경망을 이용한 내용기반 검색)

  • Kim Young Ho;Kang Dae-Seong
    • Journal of Institute of Control, Robotics and Systems
    • /
    • v.11 no.12
    • /
    • pp.1083-1087
    • /
    • 2005
  • As development of digital technology, many kinds of multimedia data are used variously and requirements for effective use by user are increasing. In order to transfer information fast and precisely what user wants, effective retrieval method is required. As existing multimedia data are impossible to apply the MPEG-1, MPEG-2 and MPEG-4 technologies which are aimed at compression, store and transmission. So MPEG-7 is introduced as a new technology for effective management and retrieval of multimedia data. In this paper, we extract content-based features using color descriptor among the MPEG-7 standardization visual descriptor, and reduce feature data applying PCA(Principal Components Analysis) technique. We model the cerebral cortex and hippocampal neural network in engineering domain, and team content-based feature vectors fast and apply the hippocampal neural network algorithm to compose of optimized feature. And then we present fast and precise retrieval effect when indexing and retrieving.

An Efficient Indexing Technique for Wireless Data Broadcast in Multiple Channels (멀티채널 환경에서 데이터 브로드 캐스트를 위한 효율적인 인덱싱 방법)

  • 이병규;정성원
    • Proceedings of the Korean Information Science Society Conference
    • /
    • 2002.04a
    • /
    • pp.586-588
    • /
    • 2002
  • 본 논문에서는 높은 대역폭(Bandwidth)을 가지는 싱글채널(Single Channel)로 결합될 수 없는 낮은 대역폭의 멀티채널(Multi Channel) 환경에서의 브로드캐스트(Broadcast)를 위한 효과적인 인덱스 방법을 제안한다. 최근에 들어 멀티채널에서 데이터 브로드캐스트의 인덱스 방법에 대한 많은 연구가 행하여 졌는데, 트리구조의 사용 여부에 따라 두 가지 방법으로 구분해 볼 수 있다. 트리 구조를 이용한 방법은 서버의 부담을 증가시키지만 트리 구조를 이용하지 않은 방법보다 모바일 유저에게 낮은 대기시간을 제공할 수 있다. 이 논문에서는 기존의 방법들이 가지는 채널의 수나 데이터의 크기에 제한을 두는 단점을 해결하고 접근 빈도가 높은 데이터에 대한 인덱스들의 반복 정도를 높여서 모든 인덱스에 대한 대기시간(Latency)의 평균값을 현저하게 줄일 수 있는 효과적인 인덱스 스케줄 생성 방법을 제시한다.

  • PDF

An Efficient Indexing Technique for Time-evolving Moving Spatial Data (시공간 이동객체들을 위한 효율적인 색인 기법)

  • Kang, So-Young;Yang, Sung-Bong
    • Proceedings of the Korean Information Science Society Conference
    • /
    • 2006.10c
    • /
    • pp.205-209
    • /
    • 2006
  • 무선 네트워크의 기술 발달과 이동기기들의 많은 보급에 따라 GIS, LBS 등의 위치기반 서비스 등이 널리 사용되고 있다. 이러한 서비스를 수행하기위하여 방대한 양의 이동 객체들의 효율적인 관리, 검색에 대한 연구가 많이 이루어지고 있으며 이동 객체들의 특성과 사용목적에 따라 다양한 연구결과가 제시되고 있다. 본 논문에서는 시간에 따라 다양한 속도와 방향으로 이동하는 다차원 이동 공간 데이터를 위한 효율적인 색인 기법을 제시한다. 제안된 방법은 KDB-Tree와 QSF-Tree에 기반을 두고 있으며 색인 구조의 검색 성능을 향상시키기 위한 분할, 갱신 방법을 제시하여 시간에 따라 현재 속도와 방향이 다양하게 변화하는 데이터를 신속히 데이터를 색인하고 영역 질의 뿐만 아니라 미래 예측 질의에 대하여 현재 보유한 데이터 정보를 기반으로 미래 예측 결과를 신속히 산출할 수 있도록 한다.

  • PDF

An Indexing Method to Prevent Attacks based on Frequency in Database as a Service (서비스로의 데이터베이스에서 빈도수 기반의 추론공격 방지를 위한 인덱싱 기법)

  • Jung, Kang-Soo;Park, Seog
    • Journal of KIISE:Computing Practices and Letters
    • /
    • v.16 no.8
    • /
    • pp.878-882
    • /
    • 2010
  • DaaS model that surrogates their data has a problem of privacy leakage by service provider. In this paper, we analyze inference attack that can occur on encrypted data that consist of multiple column through index, and we suggest b-anonymity to protect data against inference attack. We use R+-tree technique to minimize false-positive that can happen when we use an index for efficiency of data processing.

Distributed Indexing Methods for Moving Objects based on Spark Stream

  • Lee, Yunsou;Song, Seokil
    • International Journal of Contents
    • /
    • v.11 no.1
    • /
    • pp.69-72
    • /
    • 2015
  • Generally, existing parallel main-memory spatial index structures to avoid the trade-off between query freshness and CPU cost uses light-weight locking techniques. However, still, the lock based methods have some limits such as thrashing which is a well-known problem in lock based methods. In this paper, we propose a distributed index structure for moving objects exploiting the parallelism in multiple machines. The proposed index is a lock free multi-version concurrency technique based on the D-Stream model of Spark Stream. The proposed method exploits the multiversion nature of D-Stream of Spark Streaming.

A Study on Automatic Indexing System Using natural language Processing, Statistical Technique, Relevance Verification (자연어 처리, 통계적 기법, 적합성 검증을 이용한 자동색인 시스템에 관한 연구)

  • Yu, Chun-Sik;U, Seon-Mi;Yu, Cheol-Jung;Lee, Jong-Deuk;Gwon, O-Bong;Kim, Yong-Seong
    • The Transactions of the Korea Information Processing Society
    • /
    • v.5 no.6
    • /
    • pp.1552-1562
    • /
    • 1998
  • 형태소 분석(Morphological Analysis)과 같은 언어학적 처리에 의존하는 기존의 한국어 문헌에 대한 자동색인 기법들은 품사의 애매모호함이나 복합명사의 처리 등으로 부담(overhead)이 크다. 또한 불용어 처리에 사용되는 불용어 리스트가 대상 문헌의 주제 분야별로 따로 구축되어야 하며 그 크기가 방대하다는 문제점이 있다. 이러한 문제점들을 해결하기 위해, 본 논문에서는 각 문헌의 텍스트에 대해 복합명사 처리나 애매모호함에 대한 엄격한 분석을 수행하지 않는 간단한 형태의 형태소 분석을 수행하여 단순명사들을 추출한다. 그런 후 이들 단순명사들을 이용하여 유한 오토마타(Finite Automata)를 구성하고, 구성된 유한 오토마타와 각 명사의 단어빈도(Term Frequency)에 의해 각 색인어 후보들의 중요도를 계산하는 자동색인 기법을 제안한다. 그 결과 품사의 애매모호함에 대한 처리나 복합명사의 처리에따른 부담을 줄일 수 있었으며, 선정된 색인어들과 수작업으로 선정한 색인어들의 비교 실험에 의해 제안한 자동색인 기법의 성능을 검증하였다.

  • PDF