• Title/Summary/Keyword: XML Information Set

Search Result 94, Processing Time 0.022 seconds

Design and Implementation of personalized recommendation system using Case-based Reasoning Technique (사례기반추론 기법을 이용한 개인화된 추천시스템 설계 및 구현)

  • Kim, Young-Ji;Mun, Hyeon-Jeong;Ok, Soo-Ho;Woo, Yong-Tae
    • The KIPS Transactions:PartD
    • /
    • v.9D no.6
    • /
    • pp.1009-1016
    • /
    • 2002
  • We design and implement a new case-based recommender system using implicit rating information for a digital content site. Our system consists of the User Profile Generation module, the Similarity Evaluation and Recommendation module, and the Personalized Mailing module. In the User Profile Generation Module, we define intra-attribute and inter-attribute weight deriver from own's past interests of a user stored in the access logs to extract individual preferences for a content. A new similarity function is presented in the Similarity Evaluation and Recommendation Module to estimate similarities between new items set and the user profile. The Personalized Mailing Module sends individual recommended mails that are transformed into platform-independent XML document format to users. To verify the efficiency of our system, we have performed experimental comparisons between the proposed model and the collaborative filtering technique by mean absolute error (MAE) and receiver operating characteristic (ROC) values. The results show that the proposed model is more efficient than the traditional collaborative filtering technique.

A Message Conversion System based on MDR for Resolving Metadata Heterogeneity (메타데이타 이질성 해결을 위한 MDR 기반의 메시지 변환 시스템)

  • 김진관;김중일;정동원;백두권
    • Journal of KIISE:Databases
    • /
    • v.31 no.3
    • /
    • pp.232-242
    • /
    • 2004
  • Metadata is a general notion of data about data to improve data sharing and exchanging by definitely describing meaning and representation of data. However, metadata has been created in various ways and It caused another kind of heterogeneity problem named metadata heterogeneity problem. Recently, the research on metadata gateway approach that allows metadata heterogeneity is being more actively progressed. However, the existing commercialized systems that have been implemented with the metadata gateway approach are dependent on a metadata schema. In this paper, we propose a message conversion system which separates the mapping information from the mapping rules between heterogeneous metadata schemas. The proposed system dynamically manages standardized data elements by applying ISO/IEC l1179. Therefore, the proposed system provides the set of standard data elements to create consistently metadata of new databases and provides a fundamental resolution to the metadata heterogeneity problem.

A Multiagent System for Workflow-Based Bioinformatics Tool Integration

  • Sohn, Bong-Ki;Lee, Keon-Myung;Kim, Hak-Joon
    • International Journal of Fuzzy Logic and Intelligent Systems
    • /
    • v.3 no.2
    • /
    • pp.133-137
    • /
    • 2003
  • Various bioinformatics tools for biological data processing have been developed and most of them are available in public. Most bioinformatics works are carried out by a composite application of those tools. Several integration approaches have been proposed for easy use of the tools. This paper proposes a new multi agent system to integrate bioinformatics tools in the perspective of workflow since the composite applications of tools can be regarded as workflows. For the easy integration, the proposed system employs wrapper agents for existing tools, uses XML-based messages in the inter-agent communication, and agents are supposed to extract necessary information from the received messages. This allows new tools to be easily added on the integration framework. The proposed method allows various control structures in workflow definition and provides the progress monitoring capability of the on-going workflows. In particular, agents in this system have the rule-based architecture which allows the defined rule set to be a special role agent. This feature provides fast and flexible agent development to aid in managing the complexity of bioinformatics application. This system has been partially implemented and has been proven to be a viable implementation for workflow-based bioinformatics tool integration.

Automatic Generation of Information Extraction Rules Through User-interface Agents (사용자 인터페이스 에이전트를 통한 정보추출 규칙의 자동 생성)

  • 김용기;양재영;최중민
    • Journal of KIISE:Software and Applications
    • /
    • v.31 no.4
    • /
    • pp.447-456
    • /
    • 2004
  • Information extraction is a process of recognizing and fetching particular information fragments from a document. In order to extract information uniformly from many heterogeneous information sources, it is necessary to produce information extraction rules called a wrapper for each source. Previous methods of information extraction can be categorized into manual wrapper generation and automatic wrapper generation. In the manual method, since the wrapper is manually generated by a human expert who analyzes documents and writes rules, the precision of the wrapper is very high whereas it reveals problems in scalability and efficiency In the automatic method, the agent program analyzes a set of example documents and produces a wrapper through learning. Although it is very scalable, this method has difficulty in generating correct rules per se, and also the generated rules are sometimes unreliable. This paper tries to combine both manual and automatic methods by proposing a new method of learning information extraction rules. We adopt the scheme of supervised learning in which a user-interface agent is designed to get information from the user regarding what to extract from a document, and eventually XML-based information extraction rules are generated through learning according to these inputs. The interface agent is used not only to generate new extraction rules but also to modify and extend existing ones to enhance the precision and the recall measures of the extraction system. We have done a series of experiments to test the system, and the results are very promising. We hope that our system can be applied to practical systems such as information-mediator agents.

Multimedia Presentation Authoring and Virtual Collaboration in Medicine

  • Hong, Chul-Eui
    • Journal of information and communication convergence engineering
    • /
    • v.8 no.6
    • /
    • pp.690-696
    • /
    • 2010
  • Web-based virtual collaboration is increasingly gaining popularity in almost every area in our society due to the fact that it can bridge the gap imposed by time and geographical constraints. However, in medical field, such collaboration has been less popular than other fields. Some of the reasons were timeliness, security, and preciseness of the information they are dealing with. In this paper, we are proposing a web-based distributed medical collaboration system called Virtual Collaboration System for Medicine (VCSM) for medical doctors that meet the needs. The proposed system consists of two parts - multimedia presentation and recordable virtual collaboration. The former supports synchronized multimedia presentation using Synchronous Multimedia Integration Language (SMIL.) It allows synchronization of the contents of a PowerPoint presentation file and a video file. The presentation may be provided to the participants before the discussion begins. Next, in the virtual collaboration stage, participants can use text along with associated symbols during the discussion over the presented medical images. The symbols such as arrows or polygons can be set or removed dynamically to represent areas of interest in digital images using so called layered architecture that separates image layer from annotation layer. XML files are used to record participants' opinions along with the symbols over some particular images

A Co-training Method based on Classification Using Unlabeled Data (비분류표시 데이타를 이용하는 분류 기반 Co-training 방법)

  • 윤혜성;이상호;박승수;용환승;김주한
    • Journal of KIISE:Software and Applications
    • /
    • v.31 no.8
    • /
    • pp.991-998
    • /
    • 2004
  • In many practical teaming problems including bioinformatics area, there is a small amount of labeled data along with a large pool of unlabeled data. Labeled examples are fairly expensive to obtain because they require human efforts. In contrast, unlabeled examples can be inexpensively gathered without an expert. A common method with unlabeled data for data classification and analysis is co-training. This method uses a small set of labeled examples to learn a classifier in two views. Then each classifier is applied to all unlabeled examples, and co-training detects the examples on which each classifier makes the most confident predictions. After some iterations, new classifiers are learned in training data and the number of labeled examples is increased. In this paper, we propose a new co-training strategy using unlabeled data. And we evaluate our method with two classifiers and two experimental data: WebKB and BIND XML data. Our experimentation shows that the proposed co-training technique effectively improves the classification accuracy when the number of labeled examples are very small.

Construction of Internet Public Library Asia (아시아 인터넷 공공 도서관(Internet Public Library Asia) 구축에 관한 연구)

  • 이원숙;일본명;일본명;일본명;일본명
    • Journal of the Korean BIBLIA Society for library and Information Science
    • /
    • v.13 no.2
    • /
    • pp.59-73
    • /
    • 2002
  • Libraries, not only research libraries but also public libraries, have been fundamentally affected by the immense spread of the Internet and the World Wide Web. Many public libraries have their Web pages, through which they are providing their new and conventional services. There are also web sites which provide library-like services. This paper shows an experimental project named Internet Public Library Asia, which provides information in multiple languages of information resources published in Chinese, Japanese and Korean languages. This paper firstly overviews how traditional public libraries have been affected by the Internet. Then, it describes a few aspects from the viewpoint of crucial library function on the Internet and also from the viewpoint of Asian resources and users. This paper secondly proposes a model to serve information about valuable resources published in multiple Asian languages, and then shows the metadata schema and a few software tools developed for IPL-Asia The name of IPL is borrowed from Internet Public Library based at University of Michigan, since it is, in part, a collaborative activity with the IPL in Michigan. The metadata schema is defined based both on Dublin Core and IEEE LOM and adapted for parallel description in the four languages, i. e. , Chinese, Japanese, Korean and English. The software tools provide functions to support collaboration among people engaged in development of metadata database and metadata editing. These tools have been developed based on the XML technologies.

  • PDF

Hypertext Retrieval System Using XLinks (XLinks를 이용한 하이퍼텍스트 검색 시스템)

  • Kim, Eun-Jeong;Bae, Jong-Min
    • The KIPS Transactions:PartD
    • /
    • v.8D no.5
    • /
    • pp.483-494
    • /
    • 2001
  • Most of hypertext retrieval models consider documents as independent entities. They ignore relationships between documents of link semantics. in an information retrieval system for hypertext documents, retrieval effectiveness can be improved when ling information is used. Previous link-based hypertext retrieval models ignore link information while indexing. They utilize link information to re-rank the retrieval results. Therefore they are limited that only the documents is result-set utilize link information. This paper utilizes link information when indexing. We present how to use term weighting and inLinks weighting for ranking the relevant documents. Experimental results show that recall and precision evaluation according to the link semantics and the comparison with previously link_based hypertext retrieval model.

  • PDF

Chunking Korean and an Application (한국어 낱말 묶기와 그 응용)

  • Un Koaunghi;Hong Jungha;You Seok-Hoon;Lee Kiyong;Choe Jae-Woong
    • Language and Information
    • /
    • v.9 no.2
    • /
    • pp.49-68
    • /
    • 2005
  • Application of chunking to English and some other European languages has shown that it is a viable parsing mechanism for natural languages. Although a small number of attempts have been made to apply chunking to the analysis of the Korean language, it still is not clear enough what criteria there are to identify appropriate units of chunking, and how efficient and valid the chunking algorithms would be when applied to some authentic Korean texts. The purpose of this research is to provide an alternative set of algorithms for chunking Korean, and to implement them, and to test them against some English-Korean parallel corpora, which is English and Korean bibles matched sentence by sentence. It is shown in the paper that aligning related texts and identifying matched phrases between the two languages can be achieved through appropriate chunking and matching algorithms defined on the morphologically-tagged parallel corpus. Chunking and matching processes are based on the content words rather than the function words, and the matching itself is done in terms of the transfer dictionary. The implementation is done in C and XML, and can be accessed through the Internet.

  • PDF

A Web Service Development Process with MDA Applied (MDA를 적용한 웹서비스 개발 프로세스)

  • Yun Hong-ran;Park Jae-nyun
    • The KIPS Transactions:PartD
    • /
    • v.12D no.4 s.100
    • /
    • pp.583-588
    • /
    • 2005
  • Being able to resolve huge problems deriving from integration of information systems in-house or business to business, the web service that uses the XML standard technology has recently taken a quick dominance the next generation e-business bases. It's one constant concern how to integrate, change, and maintain such systems as based on certain technologies according to the changes to information technology, which is on the ongoing process of evolution. To help solve those problems, OMG suggested a new software architecture called MDA(Model Driven Architecture). MDA runs a process that establishes a platform independent model(PIM), which is an analysis model used as part of the existing development procedures, and automatically converts it into a platform specific model(PSM), a design model, based on the established PIM. Such automatic conversion has lots of benefits including easy support for diverse platforms, reducing the coding time that usually consume a great deal of the developer's effort, and facilitating quality control in the aspect of development processes. By applying the MDA development process to a new web service development, you can choose web service as the target platform at the PIM of MDA and express PSM with a web service model, WSDL. This study set out to classify the web service development or integration processes by the provider md requester to identify the types of web service development processes, and to apply the MDA development process to web service development, thus suggesting a new kind of web service development process that can be referred to by both the web service provider and requester.