• Title/Summary/Keyword: Databases


The Distribution of Citations in Online Databases (온라인 데이터베이스정보의 분포특성 분석)

  • 이효숙
    • Proceedings of the Korean Society for Information Management Conference / 1995.08a / pp.41-44 / 1995
  • Twenty-six databases relevant to 'trade' were searched to test Bradford's law of scattering. Citations in the databases adhere to the linearity of Bradford's distribution; however, the concentration of citations in the core databases is not as great as expected.


Problems with Chinese Ideographs Search in Unicode and Solutions to Them (유니코드 한자 검색의 문제점 및 개선방안)

  • Lee, Jeong-hyeon
    • Informatization Policy / v.19 no.3 / pp.50-63 / 2012
  • This study analyzes how searches for Chinese ideographs are handled in Koreanology-related domestic databases, domestic library databases, domestic academic databases, and overseas library databases, with a view to identifying problems and suggesting solutions. The major obstacles to Chinese ideograph search in Unicode are classified as 'multicode characters', 'simplified characters', and 'variant characters', and three characters are chosen as samples to describe current practice. Thirteen Koreanology-related databases, five domestic library databases, five domestic academic databases, and two overseas library databases are analyzed in terms of Chinese ideograph search. To support search for multicode characters, the open-source resources of the Unicode Consortium must be applied. To improve search for simplified and variant characters, a matching table must be standardized and proposed to the Unicode Consortium.
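
The 'multicode character' problem in the abstract above can be illustrated with a minimal sketch. It assumes that 'multicode characters' refers to CJK compatibility ideographs, which duplicate a unified ideograph at a second code point; the abstract does not spell this out. The helper names `normalize_hanja` and `matches` are hypothetical. Simplified and variant characters are not covered by normalization, which is consistent with the abstract's call for a separate matching table.

```python
import unicodedata


def normalize_hanja(text: str) -> str:
    """Fold canonically equivalent code points (including CJK
    compatibility ideographs) onto a single form using NFC."""
    return unicodedata.normalize("NFC", text)


def matches(query: str, document: str) -> bool:
    """Substring search that normalizes both sides first (hypothetical helper)."""
    return normalize_hanja(query) in normalize_hanja(document)


compat = "\uF900"   # CJK COMPATIBILITY IDEOGRAPH-F900
unified = "\u8C48"  # CJK UNIFIED IDEOGRAPH-8C48, its canonical equivalent

# A raw comparison fails because the code points differ,
# while the normalized comparison succeeds.
raw_hit = unified in compat
normalized_hit = matches(unified, compat)
```

Normalizing at index time as well as at query time keeps the two sides consistent; whether a given database does either is exactly what the study surveys.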


Histogram-based Selectivity Estimation Method in Spatio-Temporal Databases (시공간 데이터베이스를 위한 히스토그램 기반 선택도 추정 기법)

  • Lee Jong-Yun;Shin Byoung-Cheol
    • The KIPS Transactions:PartD / v.12D no.1 s.97 / pp.43-50 / 2005
  • The processing domains of spatio-temporal databases are divided into time-series databases for moving objects and sequence databases for discrete historical objects. Selectivity estimation techniques for query optimization in spatio-temporal databases have recently been studied, but they have focused on time-series databases; there has been no previous work on selectivity estimation for sequence databases. Therefore, we construct the T-Minskew histogram for query optimization in sequence databases and propose a selectivity estimation method that uses it. Furthermore, we propose an effective histogram maintenance technique to maintain good performance of the histogram.
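
The general idea of histogram-based selectivity estimation can be sketched as follows. This is not the T-Minskew histogram, whose construction the abstract does not describe; it is a plain one-dimensional equi-width histogram, and the function names and sample values are assumptions made for illustration. It also assumes every value lies in the range `[lo, hi)`.

```python
def build_histogram(values, lo, hi, num_buckets):
    """Count values per equi-width bucket over [lo, hi)."""
    width = (hi - lo) / num_buckets
    counts = [0] * num_buckets
    for v in values:
        # Clamp so that rounding at the upper edge stays in the last bucket.
        idx = min(int((v - lo) / width), num_buckets - 1)
        counts[idx] += 1
    return counts, width


def estimate_selectivity(counts, lo, width, q_lo, q_hi):
    """Estimate the fraction of values in [q_lo, q_hi), assuming values
    are uniformly distributed inside each bucket."""
    total = sum(counts)
    if total == 0:
        return 0.0
    estimated_rows = 0.0
    for i, count in enumerate(counts):
        b_lo = lo + i * width
        b_hi = b_lo + width
        overlap = max(0.0, min(q_hi, b_hi) - max(q_lo, b_lo))
        estimated_rows += count * overlap / width
    return estimated_rows / total


# Illustrative timestamps only; not data from the paper.
timestamps = [1, 2, 3, 11, 12, 25]
counts, width = build_histogram(timestamps, lo=0, hi=30, num_buckets=3)
selectivity = estimate_selectivity(counts, 0, width, q_lo=5, q_hi=15)
```

A query optimizer would multiply such an estimate by the table cardinality to predict result size; the uniformity assumption inside buckets is the main source of error that skew-aware histograms try to reduce.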

Application Plan of Document Databases in the Big Data Environment (빅데이터환경에서의 문서데이터베이스 활용방안)

  • Park, Sungbum;Lee, Sangwon;Ahn, Hyunsup;Jung, In-Hwan
    • Proceedings of the Korean Institute of Information and Communication Sciences Conference / 2013.10a / pp.230-232 / 2013
  • Many enterprises create and handle huge amounts of data in their business administration. However, it would be impossible for general-purpose databases such as relational, hierarchical, and network databases to manage and analyze this large amount of document data efficiently and effectively. In this paper, we therefore define document databases and examine their characteristics, such as consistency and transactions. We also identify subjects for which the application of document databases is appropriate or inappropriate.


Genomic and Proteomic Databases: Foundations, Current Status and Future Applications

  • Navathe, Shamkant B.;Patil, Upen;Guan, Wei
    • Journal of Computing Science and Engineering / v.1 no.1 / pp.1-30 / 2007
  • In this paper we provide an extensive survey of the databases and other resources related to current research in bioinformatics and the issues that confront the database researcher in helping biologists. Initially we give an overview of the concepts and principles that are fundamental to understanding the basis of the data captured in these databases. We briefly trace the evolution of biological advances and point out the importance of capturing data about genes, the fundamental building blocks that encode the characteristics of life, and proteins, the essential ingredients for sustaining life. The study of genes and proteins is becoming extremely important and is known as genomics and proteomics, respectively. Whereas there are numerous databases related to various subfields of biology, we have maintained a focus on genomic and proteomic databases, which are the crucial stepping stones for other fields and are expected to play an important role in future applications of biology and medicine. A detailed listing of these databases with information about their sizes, formats, and current status is presented. Related databases such as molecular pathway and interconnection network databases are mentioned, but their full coverage would be beyond the scope of a single paper. We comment on the peculiar nature of the data in biology, which presents special problems in organizing and accessing these databases. We also discuss the capabilities needed for database development and information management in the bioinformatics arena, with particular attention to ontology development. Two case studies based on our own research are summarized, dealing with the development of a new genome database called Mitomap and the creation of a framework for the discovery of relationships among genes from the biomedical literature. The paper concludes with an overview of the applications in medicine and healthcare that will be driven by these databases. A glossary of important terms is provided at the end of the paper.

An Evaluation Study on Foreign Science & Technology Online Databases (과학기술분야 해외 온라인 데이터베이스 평가 연구)

  • Kim, Suk-Young
    • Journal of Information Management / v.33 no.1 / pp.61-80 / 2002
  • The purpose of this study was to investigate the current state of foreign online databases in the field of science and technology. Evaluation of databases is becoming increasingly important for information users and information providers alike. Previous research on the evaluation of science and technology databases is summarized. This paper examines major online science and technology databases, each with a short annotation. The results of analyzing 618 titles by subject field, publisher, frequency, and online availability are also described.

Reversible Watermarking For Relational Databases using DE (Difference Expansion) Algorithm (DE 알고리즘을 사용한 관계형 데이터베이스를 위한 가역 워터마킹)

  • Kim, Cheonshik
    • The Journal of the Institute of Internet, Broadcasting and Communication / v.15 no.3 / pp.7-13 / 2015
  • Generally, watermarking can be used for copyright protection of content such as audio, video, images, and text. With the development of the Internet, many malicious attackers illegally copy relational databases synchronized with applications. Copyright protection for databases is therefore needed, because databases hold various kinds of sensitive information such as personal information, industrial information, and secret national intelligence. The protection of relational databases is thus a major topic in database research. In this paper, we review previous research on the protection of relational databases and propose a new method for relational data. In particular, we propose a watermarking scheme for databases that uses a reversible method. The experimental results show that the proposed scheme is highly robust against malicious attacks. In addition, we show that the proposed scheme can be applied to real applications.
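
The core step of difference expansion can be sketched on a single pair of integers. The following is the generic textbook form of the transform, not the paper's scheme for relational data, which the abstract does not detail. The function names are hypothetical, and overflow and expandability checks that a practical scheme would need are omitted.

```python
def de_embed(x: int, y: int, bit: int):
    """Hide one bit in the pair (x, y) by expanding their difference."""
    avg = (x + y) // 2            # integer average, unchanged by embedding
    diff = x - y
    expanded = 2 * diff + bit     # shift the difference left, append the bit
    return avg + (expanded + 1) // 2, avg - expanded // 2


def de_extract(x_marked: int, y_marked: int):
    """Recover the original pair and the hidden bit from a marked pair."""
    expanded = x_marked - y_marked
    bit = expanded & 1            # least significant bit is the payload
    diff = expanded >> 1          # floor division by 2 restores the difference
    avg = (x_marked + y_marked) // 2
    return avg + (diff + 1) // 2, avg - diff // 2, bit


# Illustrative values only, e.g. two numeric attributes of one tuple.
marked = de_embed(206, 201, 1)
restored = de_extract(*marked)
```

Because extraction returns the original values exactly, the watermark can be removed without permanently distorting the data, which is what makes the method reversible.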

Intelligent Data Governance for the Federated Integration of Air Quality Databases in the Railway Industry (철도 산업의 공기 질 데이터베이스 연합형 통합을 위한 지능형 데이터 거버넌스)

  • Minjeong, Kim;Jong-Un, Won;Sangchan, Park;Gayoung, Park
    • Journal of Korean Society for Quality Management / v.50 no.4 / pp.811-830 / 2022
  • Purpose: In this paper, we discuss 1) prioritizing the databases to be integrated; 2) which data elements should be emphasized in federated database integration; and 3) the degree of efficiency in the integration. This paper aims to lay the groundwork for building data governance by presenting guidelines for database integration, using metrics to identify and evaluate the capabilities of the UK's air quality databases. Methods: This paper performs a relative efficiency analysis using Data Envelopment Analysis, one of the multi-criteria decision-making methods. In federated database integration, it is important to identify databases with high integration efficiency when prioritizing the databases to be integrated. Results: The outcome of this paper is not a set of performance indicators for the implementation and evaluation of data governance, but rather a discussion of what criteria should be used when performing 'federated integration'. Using Data Envelopment Analysis in the process of implementing intelligent data governance, the authors establish and present practical strategies for discovering databases with high integration efficiency. Conclusion: Through this study, it was possible to establish internal guidelines from an integrated point of view of data governance. The flexibility of federated database integration under the practice of data governance makes it possible to integrate databases quickly, easily, and effectively. The authors anticipate that, by utilizing the guidelines presented in this study, the process of integrating multiple databases, including the air quality databases, will evolve into intelligent data governance based on federated database integration when data governance practice is established in the railway industry.

On the Characteristics and Information Retrieval Performance of Full-Text Databases (전문데이터베이스의 특성과 정보검색성능)

  • Cho Myung-Hi
    • Journal of the Korean Society for Library and Information Science / v.17 / pp.339-366 / 1989
  • The appearance of full-text online databases is the most encouraging phenomenon in the development of databases. Today's full-text databases are derived as by-products of the electronic publication of printed materials. There are now also some movements toward the electronic production of documents in Korea, although they are not yet strong. The present study examines the characteristics and effective retrieval methods of full-text databases now commercially available through various vendors. The outline of this paper is as follows. First, the background and present situation of existing full-text database services, both national and worldwide, are examined. Second, the free-text searching system of full-text databases is compared with the controlled vocabulary system. The factors influencing free-text retrieval performance, the searching thesaurus, and the hybrid or compromise system, which uses a limited controlled vocabulary in conjunction with natural language for the enrichment needed for practical operation of the system, are examined. Third, user demands are identified through an analysis of preceding studies on various types of full-text databases. Fourth, the application of CD-ROM full-text databases in libraries and information centers is examined as a prospective resource for them. Finally, some problems and prospects of full-text databases are presented.


Semantic Correspondence of Database Schema from Heterogeneous Databases using Self-Organizing Map

  • Dumlao, Menchita F.;Oh, Byung-Joo
    • Journal of IKEEE / v.12 no.4 / pp.217-224 / 2008
  • This paper provides a framework for semantic correspondence of heterogeneous databases using a self-organizing map. It addresses the problem of overlap between different databases caused by their different schemas. A clustering technique using self-organizing maps (SOM) is tested and evaluated to assess its performance on different kinds of data. Before clustering, the databases are preprocessed using an edit distance algorithm, principal component analysis (PCA), and a normalization function to identify the features necessary for clustering.
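
The edit distance step mentioned in the abstract above can be sketched with the standard Levenshtein dynamic program. The abstract does not say which variant the authors use or what it is applied to, so both the choice of Levenshtein distance and the column names below are assumptions for illustration.

```python
def edit_distance(a: str, b: str) -> int:
    """Levenshtein distance: the minimum number of single-character
    insertions, deletions, and substitutions turning a into b."""
    prev = list(range(len(b) + 1))      # distances from "" to prefixes of b
    for i, ch_a in enumerate(a, start=1):
        curr = [i]                      # distance from a[:i] to ""
        for j, ch_b in enumerate(b, start=1):
            cost = 0 if ch_a == ch_b else 1
            curr.append(min(prev[j] + 1,          # delete ch_a
                            curr[j - 1] + 1,      # insert ch_b
                            prev[j - 1] + cost))  # substitute or match
        prev = curr
    return prev[-1]


# Hypothetical attribute names from two schemas.
distance = edit_distance("cust_name", "customer_name")
```

Pairwise distances of this kind could serve as one input feature for a later clustering stage, alongside the normalized and PCA-reduced features the abstract lists.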
