• Title/Summary/Keyword: XML Matching

Search Result 67, Processing Time 0.03 seconds

Semantic Matching Engine for Searching Web Services (웹 서비스 검색을 위한 시맨틱 매칭 엔진)

  • Yang, Seung-Hoon;Lee, Dae-Wook;Kwon, Joon-Ho;Lee, Suk-Ho
    • Proceedings of the Korean Information Science Society Conference
    • /
    • 2006.10c
    • /
    • pp.267-272
    • /
    • 2006
  • 인터넷망의 지속적인 발달과 함께 웹 애플리케이션 개발 방법으로 XML 기반의 웹 서비스가 부각되면서 많은 웹 서비스들이 개발되었고, 점차 더 많은 웹 서비스들이 개발될 것으로 예상된다. 이처럼 급격하게 늘어나는 웹 서비스들 중에서 사용자가 원하는 웹 서비스 찾는 것이 중요한 이슈로 부각되고 있다. 그러나 현재의 웹 서비스 검색 표준인 UDDI 레지스트리는 키워드 기반이기 때문에 검색 성능의 한계점을 갖고 있다. 최근에 이러한 한계를 극복하고자 하는 많은 연구가 진행되고 있지만 아직은 많이 부족한 상황이다. 따라서 본 논문에서는 비록 키워드가 일치하지 않더라도 사용자가 원하는 웹 서비스를 찾을 수 있도록 웹 서비스 표준인 UDDI 레지스트리에 시맨틱 매칭 엔진(semantic matching engine)이라는 추가적인 시맨틱 레이어를 추가하여 재현율(recall)과 정확률(precision)을 모두 향상 시킬 수 있는 시스템을 제안한다.

  • PDF

A Covariance-matching-based Model for Musical Symbol Recognition

  • Do, Luu-Ngoc;Yang, Hyung-Jeong;Kim, Soo-Hyung;Lee, Guee-Sang;Dinh, Cong Minh
    • Smart Media Journal
    • /
    • v.7 no.2
    • /
    • pp.23-33
    • /
    • 2018
  • A musical sheet is read by optical music recognition (OMR) systems that automatically recognize and reconstruct the read data to convert them into a machine-readable format such as XML so that the music can be played. This process, however, is very challenging due to the large variety of musical styles, symbol notation, and other distortions. In this paper, we present a model for the recognition of musical symbols through the use of a mobile application, whereby a camera is used to capture the input image; therefore, additional difficulties arise due to variations of the illumination and distortions. For our proposed model, we first generate a line adjacency graph (LAG) to remove the staff lines and to perform primitive detection. After symbol segmentation using the primitive information, we use a covariance-matching method to estimate the similarity between every symbol and pre-defined templates. This method generates the three hypotheses with the highest scores for likelihood measurement. We also add a global consistency (time measurements) to verify the three hypotheses in accordance with the structure of the musical sheets; one of the three hypotheses is chosen through a final decision. The results of the experiment show that our proposed method leads to promising results.

Comparison and Analysis of Science and Technology Journal Metadata (해외 과학기술 학술논문 메타데이터의 비교 분석)

  • Lee, Min-Ho;Lee, Won-Goo;Yoon, Hwa-Mook;Shin, Sung-Ho;Ryou, Jae-Cheol
    • The Journal of the Korea Contents Association
    • /
    • v.11 no.9
    • /
    • pp.515-523
    • /
    • 2011
  • It is important to manage large amount of information from various information providers for supporting recent information services such as providing global research trends, detecting emerging signal and listing leading researchers. For integrated management, definition of integrated metadata schema, data transformation and schema matching are needed. It is first necessary to analyze existing various metadata for defining integrated metadata schema. In this paper, we have analyzed several metadata of scientific journal papers by classifying semantics, content rules and syntax, and looked around considerations to make integrated schema or transform metadata. We have known that XML is used as a syntax for supporting convenience and various usage condition, and hierarchy element names and common elements in semantics are needed. We also have looked at elements having various content rules and related standards. We hope that this study will be used as basic research material of metadata integrated management, data transform and schema matching for interoperability.

MicroPost: The Design of an Efficient Event Notification Architecture for Distributed Social Applications (MicroPost: 분산형 소셜 애플리케이션을 위한 효율적인 이벤트 통지 아키텍처의 설계)

  • Bae, Joon-Hyun;Kim, Sang-Wook
    • 한국HCI학회:학술대회논문집
    • /
    • 2009.02a
    • /
    • pp.232-239
    • /
    • 2009
  • Emerging social networking services provide a new paradigm for human-to-human communication. However, these services are centralized and managed by single service provider. In this paper, we propose MicroPost, a decentralized event notification service architecture for social applications based on publish/subscribe model. In our design space, event brokers are structured as an overlay network which provides the substrate of distributed peer-to-peer lookup service for storing and retrieving subscriptions with hashed keys. Event clients interact with event brokers to publish or subscribe social messages over the wide-area network. Using XML standards, we present an efficient algorithm to forward events for rendezvous-based matching in this paper. In our design space, the cost of routing is O(${\omega}log_kN$), where N is the number of event brokers, ${\omega}$ is the number of meta-data obtained from event messages, and k is a constant, which is selected by our design, to divide the identifier space and to conquer the lookup of given key. Consequently, what we achieved is an asynchronous social messaging service architecture which is decentralized, efficient, scalable, and flexible.

  • PDF

A Study of Integrating ASP Databases with Customer Databases (ASP 용의 데이터베이스와 고객 데이터베이스 연동에 관한 연구)

  • Kim, Ho-Yoon;Lee, Jae-Won
    • The KIPS Transactions:PartD
    • /
    • v.11D no.5
    • /
    • pp.1063-1072
    • /
    • 2004
  • In the ASP(Application Service Provider) business, applications using database sometimes require some data from clients' databases. These days such data are extracted from client database using manual database operations as an EXCEL file and the ASP, once receiving this file, transfers it into the application's database using manual database operations. This paper describes how to deal with data transmitting between the client database and ASP database on the web without using database manual operations for data extraction and insertion. We propose a framework which enables to transmit client data in a systematical way, to match different attribute names of each database for sharing same attribute values, and to avoid exposing information about the network path of client database to the ASP. This approach consists of two steps of data processing. The first is extracting data from client database as XML format by using a downloaded client program from ASP site, the second is uploading and storing the XML file into the ASP database. The implemented prototype system shows the suggested data integration paradigm is valid and ASP business needing integration of client database can be activated using it.

Chunking Korean and an Application (한국어 낱말 묶기와 그 응용)

  • Un Koaunghi;Hong Jungha;You Seok-Hoon;Lee Kiyong;Choe Jae-Woong
    • Language and Information
    • /
    • v.9 no.2
    • /
    • pp.49-68
    • /
    • 2005
  • Application of chunking to English and some other European languages has shown that it is a viable parsing mechanism for natural languages. Although a small number of attempts have been made to apply chunking to the analysis of the Korean language, it still is not clear enough what criteria there are to identify appropriate units of chunking, and how efficient and valid the chunking algorithms would be when applied to some authentic Korean texts. The purpose of this research is to provide an alternative set of algorithms for chunking Korean, and to implement them, and to test them against some English-Korean parallel corpora, which is English and Korean bibles matched sentence by sentence. It is shown in the paper that aligning related texts and identifying matched phrases between the two languages can be achieved through appropriate chunking and matching algorithms defined on the morphologically-tagged parallel corpus. Chunking and matching processes are based on the content words rather than the function words, and the matching itself is done in terms of the transfer dictionary. The implementation is done in C and XML, and can be accessed through the Internet.

  • PDF

MATERIAL MATCHING PROCESS FOR ENERGY PERFORMANCE ANALYSIS

  • Jung-Ho Yu;Ka-Ram Kim;Me-Yeon Jeon
    • International conference on construction engineering and project management
    • /
    • 2011.02a
    • /
    • pp.213-220
    • /
    • 2011
  • In the current construction industry where various stakeholders take part, BIM Data exchange using standard format can provide a more efficient working environment for related staffs during the life-cycle of the building. Currently, the formats used to exchange the data from 3D-CAD application to structure energy analysis at the design stages are IFC, the international standard format provided by IAI, and gbXML, developed by Autodesk. However, because of insufficient data compatibility, the BIM data produced in the 3D-CAD application cannot be directly used in the energy analysis, thus there needs to be additional data entry. The reasons for this are as follows: First, an IFC file cannot contain all the data required for energy simulation. Second, architects sometimes write material names on the drawings that are not matching to those in the standard material library used in energy analysis tools. DOE-2.2 and Energy Plus are the most popular energy analysis engines. And both engines have their own material libraries. However, our investigation revealed that the two libraries are not compatible. First, the types and unit of properties were different. Second, material names used in the library and the codes of the materials were different. Furthermore, there is no material library in Korean language. Thus, by comparing the basic library of DOE-2, the most commonly used energy analysis engine worldwide, and EnergyPlus regarding construction materials; this study will analyze the material data required for energy analysis and propose a way to effectively enter these using semantic web's ontology. This study is meaningful as it enhances the objective credibility of the analysis result when analyzing the energy, and as a conceptual study on the usage of ontology in the construction industry.

  • PDF

An Improved Conversational Help Agent Using Approximate Pattern Matching (근사 패턴매칭을 이용한 개선된 대화형 도우미 에이전트)

  • 김수영;조성배
    • Proceedings of the Korean Information Science Society Conference
    • /
    • 2001.04b
    • /
    • pp.415-417
    • /
    • 2001
  • 인터넷의 성장에 따라, 많은 웹사이트가 생기고, 더 많은 정보들이 웹사이트에 등록되었다. 웹사이트에서 등록되는 정보가 많을수록, 사용자가 원하는 정보를 얻기가 쉽지 않다. 따라서, 사용자가 찾는 정보를 쉽게 찾을 수 있도록, 웹사이트 내에 전문(full-text) 검색엔진을 도입하기도 한다. 본 논문에서는 사용자가 자연어를 이용한 대화를 통해 웹사이트 내의 정보를 습득할 수 있도록 하는 대화형 도우미 에이전트를 위하여 한글 전처리 방법과 근사패턴매칭기법을 제안한다. 사용자가 문장을 입력하면, 동의어처리와 형태소 분석을 통해 사용자의 문장을 분석하고, 이미 작성되어 있는 지식과의 매칭을 통해 사용자에게 알맞은 대답을 제시한다. 지식은 XML 형식으로 저장되며, 사용자가 입력한 문장과 아주 똑같지 않더라도, 어느 정도의 유사도를 가지고 대답을 이끌어 낼수 있다.

  • PDF

Building Open API Ontologies based (ll Semantics for Smart Mashup (스마트 매쉬업을 위한 시맨틱 기반 Open API 온톨로지 구축 기법)

  • Lee, Yong Ju
    • Journal of Korea Society of Digital Industry and Information Management
    • /
    • v.7 no.3
    • /
    • pp.11-23
    • /
    • 2011
  • Recently, Open APIs are getting attention with the advent of Web 2.0. Open APIs are used to combine services and generate new services by Mashup. However, the growing number of available Open APIs raises a challenging issue how to locate the desired APIs. We automatically build ontologies from WSDL, WADL, HTML, and their underlying semantics. The key ingredient of our method is a technique that clusters input/output parameters in the collection of API methods into semantically meaningful concepts, and captures the hierarchical relationships between the terms contained in a parameter. These semantic ontologies allow search engines to support a similarity search for Open APIs based on various protocols such as SOAP, REST, JavaScript, and XML-RPC, and significantly improve the quality of APIs matching by the clustering and hierarchical relationships mechanism.

Effects of Adopting the Open Document Format in Public Records Management (공문서 컴포넌트 오픈포맷 채택이 기록관리에 미치는 영향 분석)

  • Jung, Mi Ri;Oh, Seh-La;Yim, Jin Hee
    • Journal of Korean Society of Archives and Records Management
    • /
    • v.16 no.2
    • /
    • pp.29-55
    • /
    • 2016
  • Korean public organizations create electronic documents through electronic document management systems under the e-Government scheme. A majority of these public documents are saved in vendor-dependent file formats, mainly HWP. Vendor-dependent formats can be opened only with specific software, which requires purchase. As the license does not guarantee compatibility between past and future versions, interoperability problems occur in long-term preservation, which need to be solved. Any error from the elimination of styles or no matching elements in document definition during conversion from vendor-dependent formats to the XML-based standard exchange format leads to file open failure or the modification of original documents. This study introduces the Open Document Format (ODF) and investigates the effects of adopting ODF in the creation, exchange, management, and preservation of public records.