• Title/Summary/Keyword: XML Keyword Search

Search Result 28, Processing Time 0.028 seconds

A Study for the Effective Classification and Retrieval of Software Component (효과적인 소프트웨어 컴포넌트 분류 및 검색에 관한 연구)

  • Cho, Byung-Ho
    • Journal of Internet Computing and Services
    • /
    • v.7 no.6
    • /
    • pp.1-10
    • /
    • 2006
  • A software development using components reuse is an useful method to reduce the software development cost. But a retrieval method by the keyword and category classifications is difficult to search an exact matching component due to components complexity in component reuse. Therefore, after different existing methods are examined and analyzed, an effective classification and retrieval method using XML specifications and the system architecture of components integrated management based on it are presented. Many discording elements of DTD which is component meta-expression exist in components retrieval. To compensate it, this retrieval method using estimations of precision and concision is effective one to catch considerable matching preference components. This method makes possible to retrieve suitable components having better priority due to searching similar matching components that are difficult in an existing keyword matching method.

  • PDF

The design of Intelligent and Integrated Registries System for e-Business (e-비즈니스를 위한 지능형 통합 레지스트리 시스템 설계)

  • 유정연;김계용;이규철
    • The Journal of Society for e-Business Studies
    • /
    • v.8 no.2
    • /
    • pp.63-76
    • /
    • 2003
  • The fundamental technology to the b2b e-commerce framework is Registry. Although Registries have developed, it is yet difficult to apply in actual e-business . That is, the e-business information was stored in physically and/or logically distributed and heterogeneous Registries. And Registry uses the keyword-based search to discovery the information stored. But, the keyword-based search technology can't provide the discovery the business information necessary for parties and trading partners. As spreading the understand of this problem, it requires the technologies for the integration of distributed and various Registries and the systematic definition and intelligent discovery of the e-business information. In this paper we propose the architecture of intelligent and integrated e-business registry system for solving these problems . This system composed of the Registry Integration Query Manager for integrating various registries and the Intelligent Registry Agent providing the systematic organization and discovery of e-business information.

  • PDF

A New Keyword Search Algorithm for RDF/S and OWL Documents (RDF/S 및 OWL 문서에 대한 키워드 검색 알고리즘)

  • Kim, Hak Soo;Son, Jin Hyun
    • Proceedings of the Korea Information Processing Society Conference
    • /
    • 2009.04a
    • /
    • pp.321-324
    • /
    • 2009
  • XML 또는 RDBMS 에서의 키워드 검색은 기존의 정보 검색처럼 데이터의 구조 또는 질의 언어에 대한 사전 지식 없이 질의 처리를 수행하는 연구 분야 중의 하나이다. 오늘날 키워드 검색을 효율적으로 처리하기 위해 제안된 연구들은 그래프 기반의 질의 처리에 기반한 기법들에 초점을 두고 있다. 이러한 접근들은 XML 또는 RDBMS 안에 존재하는 데이터를 그래프 구조에 기반한 데이터로 변환한 다음에 그래프 탐색을 통해서 모든 질의 키워드를 포함하는 결과들을 찾는다. 그러나 기존의 기법들을 RDF/S 또는 OWL 문서와 같은 복잡한 그래프 구조에 적용하기에는 질의 성능 측면에서 많은 문제점을 가지고 있다. 또한, 온톨로지 언어의 의미적 단위로서의 RDF 트리플을 고려하지 않기 때문에 질의 결과에 대한 신뢰성을 보장할 수 없다. 이러한 관점에서 본 논문은 RDF/S 또는 OWL 저장소에서 효율적이고 의미적인 키워드 검색을 위한 인덱싱 기법 및 알고리즘을 설계한다.

A Study on Design and Analysis of Metadata and Ontology based on Humanities and Social Sciences (기초학문자료 메타데이터 설계 분석 및 온톨로지 적용 방안 연구)

  • Lee, Jung-Yeoun;Kim, Jung-Min;Choi, Suk-Doo;Kim, Lee-Kyum
    • Journal of the Korean Society for Library and Information Science
    • /
    • v.41 no.2
    • /
    • pp.291-316
    • /
    • 2007
  • The purpose of this study is to design metadata model for describing different kinds of concepts, properties, and semantic relationships of result materials of researches. We examine our metadata model to evaluate correctness and efficiency of the model through contents analysis of a constructed database. From the results of examination, we suggest more effective structure of metadata schema. Domain ontology could constructed by the enlarged thesaurus in order to overcome the limitation of the keyword search, therefore we design a philosophy and religion ontology based on subject classification to improve information retrieval and implement it using XML/Topic Maps to improve retrieval functionality of our database.

Content based data search using semantic annotation (시맨틱 주석을 이용한 내용 기반 데이터 검색)

  • Kim, Byung-Gon;Oh, Sung-Kyun
    • Journal of Digital Contents Society
    • /
    • v.12 no.4
    • /
    • pp.429-436
    • /
    • 2011
  • Various documents, images, videos and other materials on the web has been increasing rapidly. Efficient search of those things has become an important topic. From keyword-based search, internet search has been transformed to semantic search which finds the implications and the relations between data elements. Many annotation processing systems manipulating the metadata for semantic search have been proposed. However, annotation data generated by different methods and forms are difficult to process integrated search between those systems. In this study, in order to resolve this problem, we categorized levels of many annotation documents, and we proposed the method to measure the similarity between the annotation documents. Similarity measure between annotation documents can be used for searching similar or related documents, images, and videos regardless of the forms of the source data.

Study on Model Case of Ideal Digitization of Korean Ancient Books (국학고전자료의 디지털화를 위한 모범적인 방안 연구)

  • Lee, Hee-Jae
    • Journal of the Korean Society for information Management
    • /
    • v.22 no.1 s.55
    • /
    • pp.105-123
    • /
    • 2005
  • The most of all, this study is planned to search an ideal methods to develop the digital library system for our korean ancient books for their safe preservation and, at the same time, for their perusal of transcendental time and space : first. to offer the various access points like traditional oriental Four parts Classics classification, current subject classification and index keyword, etc. : second, to program a digital library system using MARC or XML, but with all bibliographic descriptive elements as possible; third, to prepare the more easy annotated bibliography and index for users' better comprehension, and last, to build original text database for practical reading to avoid the damage of original text. This type of korean ancient books digital library will be developed to the real international bibliographic control by networking enter the same kinds of internal and external organizations.

Document Analysis based Main Requisite Extraction System (문서 분석 기반 주요 요소 추출 시스템)

  • Lee, Jongwon;Yeo, Ilyeon;Jung, Hoekyung
    • Journal of the Korea Institute of Information and Communication Engineering
    • /
    • v.23 no.4
    • /
    • pp.401-406
    • /
    • 2019
  • In this paper, we propose a system for analyzing documents in XML format and in reports. The system extracts the paper or reports of keywords, shows them to the user, and then extracts the paragraphs containing the keywords by inputting the keywords that the user wants to search within the document. The system checks the frequency of keywords entered by the user, calculates weights, and removes paragraphs containing only keywords with the lowest weight. Also, we divide the refined paragraphs into 10 regions, calculate the importance of the paragraphs per region, compare the importance of each region, and inform the user of the main region having the highest importance. With these features, the proposed system can provide the main paragraphs with higher compression ratio than analyzing the papers or reports using the existing document analysis system. This will reduce the time required to understand the document.

Construct ion of Keyword Index and Improved Search Methods for e-Catalogs Eased on Semantic Relationship (의미적 연결 관계에 기반한 전자 카탈로그에서의 확장된 어휘 인덱스 구축 및 이를 이용한 검색 성능 향상 기법)

  • Lee Dongjoo;Lee Taehee;Lee Sang-goo
    • Proceedings of the Korean Information Science Society Conference
    • /
    • 2005.07b
    • /
    • pp.67-69
    • /
    • 2005
  • 본 논문에서는 기 구축된 전자 카탈로그를 의미적 연결 관계에 기초한 확장된 전자 카탈로그로 변환하는 방법을 제안한다. 이를 통해 구축된 확장된 전자 카탈로그에서 의미적 태깅에 의한 확장된 어휘 인덱스 구축 방안과, 이를 이용한 검색 성능 향상 기법을 제안한다. 기존의 전자 카탈로그는 상품 정보가 분류별로 생성된 테이블에 저장되고 저장된 테이블로부터 생성된 키워드 인덱스로부터 검색이 이루어 졌다. 이러한 검색은 상품이 가지는 정보를 데이터베이스에 구축된 테이블에만 한정하게 되어 전자 카탈로그에 포함된 상품이나 분류간의 의미적 연결 관계들을 충분히 이용하지 못하였다 전자 카탈로그에 내재된 의미적 요소를 충분히 활용하기 위해서는 전자 카탈로그를 의미적 연결 관계에 기초한 모델로 구성할 필요가 있다. 본 논문에서는 의미적 모델 기반 전자 카탈로그 시스템으로의 전환 과정을 XML형태의 명세를 이용해 반자동적으로 전환할 수 있는 툴을 구현하며, 단순 키워드 어휘 인덱스 구축이 아닌, 어휘 인덱스의 의미적 확장을 제안하고, 이를 위한 태그 요소로써 어휘에 대한 형태소 분석 결과, 수치 환산 및 확장 요소, 속성간의 도메인 정보 등을 제시하였다. 이를 기반으로 최적의 검색 결과를 얻어 내도록 하는 인접도 평가 함수에 적용하는 방법을 제시한다.

  • PDF