• Title/Summary/Keyword: HTML documents

Search Result 149, Processing Time 0.024 seconds

Weighting of XML Tag using User's Query (사용자 질의를 이용한 XML 태그의 가중치 결정)

  • Woo Seon-Mi;Yoo Chun-Sik;Kim Yong-Sung
    • The KIPS Transactions:PartD
    • /
    • v.12D no.3 s.99
    • /
    • pp.439-446
    • /
    • 2005
  • XML is the standard that can manage systematically WWW documents and increase retrieval efficiency. Because XML documents have the information of contents and that of structure in single document, users can get more suitable retrieval result by retrieving the information of content as well as that of logical structure. In this paper, we will propose a method to calculate the weights of XML tags so that the information of XML tag is used to index decision. A proposed method creates term vector and weight vector for XML tags, and calculates weight of tag by reflecting user's retrieval behavior (user's query). And it decides the weights of index terms of XML document by reflecting the weights of tags. And we will perform an evaluation of proposed method by comparison with existing researches using weights of paragraphs.

A Study on Ambiguity Resolving for Pen-based Proofreading of Web Documents (펜 기반 웹 문서 교정을 위한 모호성 문제 해결에 관한 연구)

  • Sohn, Won-Sung
    • Journal of The Korean Association of Information Education
    • /
    • v.11 no.1
    • /
    • pp.107-116
    • /
    • 2007
  • To produce accurate editing results, the ambiguity of editing scopes related to marked correction signs should be solved. Proofreading the web document modifies the document structures, and the modified structures should be robustly valid for the defined DTD. This paper presents a pen-based proof-reading interface in the XML document. In the proposed interface, correction signs are free-drawn, and the editing scopes are recognized and revised based on the contexts of the document to minimize the ambiguity of the editing scopes. The proposed interface provides both implicit and explicit modification methods for document structures. As a result, the editing scopes processed in the proposed interface are more accurate, and the document structures are maintained valid for DTD after the editing.

  • PDF

PDF Publication Solution based on Web (웹을 기반으로 한 PDF 출판 솔류션에 관한 연구)

  • Lee Jae-Deuk
    • Journal of Korean Society of Industrial and Systems Engineering
    • /
    • v.28 no.2
    • /
    • pp.109-116
    • /
    • 2005
  • In the previous C/S publishing system, the editor or contributor can arbitrarily modify the document created by the author, in which case it is difficult to identify the changes made in the document. Another shortcoming is in that when the document is in need of tracking or editing, the client must have the respective editing system. To solve this problem, the gist of the document must be preserved along with the document itself, and the process of handling the document must be standardized. Publishing on the web ensures a more stable and accurate result in processing documents. The significance of web publishing is made clear, when we consider the importance of information per se and the growing demand for immediate publication in the present day. The need for a simple and straightforward apache-based PDF publishing system, in which HTML and CSS are supported, and a converting engine provides PDF standard security application support, is prominent. This provides a library in which one can directly create a PDF via Windows, Linux, or Unix without having to rely on a client, allowing high-speed PDF creation. The development of a web-accessed PDF converting engine forms the basis for e-transactions, online brochures, electronic B/L, and many other industrial sectors.

Implementation of Online Editing System based on Structural Documents (구조문서 기반 온라인 교정 시스템의 설계 및 구현)

  • Jung, Han-Sang;Kim, Jae-Kyung;Sohn, Won-Sung;Lim, Soon-Bum;Choy, Yoon-Chul
    • Proceedings of the Korea Information Processing Society Conference
    • /
    • 2002.11c
    • /
    • pp.2289-2292
    • /
    • 2002
  • 최근 웹을 기반으로 한 문서의 전자화가 이루어지면서 기존의 전통적인 펜기반 교정 시스템 또한 온라인상의 전자 문서 환경에 맞게 변화하고 있다. 이러한 펜기반 입력 기법을 사용하는 교정 시스템에서는 일반문서와 달리 웹 문서의 구조정보를 고려한 편집이 지원되어야 하며 또한 교정부호와 텍스트간의 정확한 영역 인식이 이루어져야 한다. 본 연구에서는 온라인 교정 시스템 모델링을 통하여 온라인 환경에 적합한 교정 부호를 정의하고, 교정 대상 텍스트 영역을 편집 가능한 단위로 구분하여 효율적인 편집 연산이 이루어 질 수 있도록 하였다. 또한 웹 기반의 구조문서(HTML/XML) 편집 환경을 고려하여 편집으로 인한 문서의 구조 정보 변경을 지원하기 위하여 텍스트를 비구조 및 구조정보 텍스트로 분류하여 정의하였다. 본 연구에서는 이러한 모델에 기반하여 교정 부호의 특성에 따른 가변적인 편집 텍스트 영역 인식 규칙 모델을 정의하여 교정 부호와 편집 텍스트 영역간의 모호성을 최소화하고, 편집으로 인한 문서의 구조 정보 변경을 지원하는 시스템을 구현하였다. 결과적으로 온라인 웹 문서 환경에서 펜기반의 모호한 교정 부호의 입력을 인지적인 관점에서 해석하여 보다 정확한 교정 작업 수행을 지원하도록 하였다.

  • PDF

A Web Manual Generator for Courseware Development using CAS and Web Connectivity Technology (컴퓨터 대수 시스템과 웹 연동 기술을 활용한 코스웨어 개발용 웹 매뉴얼 생성기)

  • Park, Hong-Joon;Jun, Young-Cook;Jang, Moon-Suk
    • The Journal of Korean Association of Computer Education
    • /
    • v.8 no.5
    • /
    • pp.97-108
    • /
    • 2005
  • In this paper, we present our prototype of a web manual creator that is based on MSP technology embedded in webMathematica. This tool gives courseware authors more simple ways to make their own mathematical web contents. We first classified authoring models for creating mathematical content development and proposed an advanced model. The final application called phpMath can generate MSP-driven documents automatically using Mathematica commands typed by users. In other words, phpMath users can make interactive dynamic mathematical web contents even though they do not know anything about web server, HTML, and webMathematica. We illustrated the details of the prototype from the user's perspectives followed by comments on usefulness.

  • PDF

Adopting DITA Standard for Developing Technical Documentation Efficiently (효율적인 기술문서 개발을 위한 DITA 표준 적용)

  • Koo, Heung-Seo
    • The Transactions of The Korean Institute of Electrical Engineers
    • /
    • v.65 no.1
    • /
    • pp.171-178
    • /
    • 2016
  • In engineering industry, technical documentation refers to any type of documentation that describes handling, functionality and architecture of a technical product. The intended audiences for product technical documentation is the end user, the administrator, service or maintenance technician. Competition and rapid technological evolutions have created pressure to release new and improved products on an increasingly frequent basis. As the technical capabilities of products advance, the technical documentation for those products is becoming longer and complex. The Darwin Information Typing Architecture (DITA) is and XML-based, end-to-end architecture for authoring, producing, and delivering technical documents. DITA is an OASIS standard that allows documentation groups to single-source document for multiple products and users, to automatically publish that documentation in several media formats including PDF and HTML, and to efficiently maintain and update that documentation. This paper study on the various implementation projects for technical documentation using OASIS DITA standard and examines the potential for using DITA as a solution for reconstructing existing technical documentation. Also We offer an incremental adoption approach to effectively implement DITA standard of technical documentation project for domestic firms, while reducing failures.

A Study on Design and Implementation of Automatic Product Information Indexing and Retrieval System for Online Comparison Shopping on the Web (웹 상의 온라인 비교 쇼핑을 위한 상품 정보 자동 색인 및 검색 시스템의 설계 및 구현에 대한 연구)

  • 강대기;이제선;함호상
    • The Journal of Society for e-Business Studies
    • /
    • v.3 no.2
    • /
    • pp.57-71
    • /
    • 1998
  • In this paper, we describe the approaches of shopping agents and directory services for online comparison shopping on the web, and propose an information indexing and retrieval system, named InfoEye, with a new method for automatic extraction of product information. The developed method is based on the knowledge about presentation of the product information on the Web. The method from the knowledge about presentation of the product information is derived from both the point that online stores display their products to customers in easy-to-browse ways and heuristics made of analyses of product information look-and-feel of domestic online stores. In indexing process, the method is applied to product information extraction from Hypertext Markup Language (HTML) documents collected by a mirroring robot from online stores. We have made InfoEye to a readily usable stage and transferred the technology to Webnara commercial shopping engine. The proposed system is a cutting-edge solution to help customers as a shopping expert by providing information about the reasonable price of a product from dozens of online stores, saving customers shopping time, giving information about new products, and comparing quality factors of products in a same category.

  • PDF

Description-Based Multimedia Clipart Retrieval in WWW

  • Kim, Hion-Gun;Sin, Bong-Kee;Song, Ju-Won
    • Proceedings of the Korean Society of Broadcast Engineers Conference
    • /
    • 1998.06b
    • /
    • pp.111-115
    • /
    • 1998
  • The Internet today is teemed with not only text data but also other media such as sound, still and moving images in a variety of formats. Unlike text, however, that can be retrieved easily with the help of numerous search engines, there has been few way to access data of other media unless the exact location or the URL is known. Multimedia data in the WWW are contained in or linked via anchors in the hyper-documents. They can most reliably be retrieved by analyzing the binary data content, which is far from being practical yet by the current state of the art. Instead we present another technique of searching based on textual descriptions which are found at or around the multimedia objects. The textual description used in this research includes file name (URL), anchor text and its context, alternative descriptions found in ALT HTML tage. These are actually the clues assumedly relevant to the contents. Although not without a possibility of missing or misinterpreting images and sounds, the description-based search is highly practical in terms of computation. The prototype search engine will soon be deployed to the public service through the prestige search engine, InfoDetective, in Korea.

  • PDF

A Study of Document Format for Effective Transmission on the Internet Environments (인터넷환경하에서 효율적 전송을 위한 문서형식에 관한 연구)

  • Cho, Hyun-Yang;Choi, Hung-Sik
    • Journal of the Korean Society for Library and Information Science
    • /
    • v.34 no.1
    • /
    • pp.229-242
    • /
    • 2000
  • Today, we are confronted with huge amount of data which contain complex documents, images and multimedia contents. Therefore a new method is needed to analyze and manage the mathematical expressions and extract new Information from them. It is more and more important to manage the document files including mathematical expressions which are generated by general-purpose word processors. Three major word processors are shared over 90% of domestic market. These are HWP, TeX and MS word. Due to the progress of Internet and digital library, it is necessary to develop a system to manage the document file containing mathematical expressions over the Web.

  • PDF

Design and Implementation of Learning System for Generating Multimedia Contents at On-Line$\cdot$Mobile Environment (온라인$\cdot$모바일 환경에서 멀티미디어 컨텐츠 생성을 위한 학습 시스템의 설계 및 구현에 관한 연구)

  • Lee Hyun Chang;Choi Kwang Don
    • Journal of the Korea Society of Computer and Information
    • /
    • v.10 no.1 s.33
    • /
    • pp.217-222
    • /
    • 2005
  • The on-line and mobile communication technologies provide an environment to make users share information on the rove. However learning on a file received from on-line or mobile internet environment is able to read only, According to this, users cannot use various learning methods to make multimedia contents for learning like coloring and underlining considerable parts. Also, in case of storing, it cannot be stored in a standard file format HTML. Therefore, in this paper, we suggest a new learning platform to be able to change text contents in a web documents and implement a prototype system to process learning system in on-line environment

  • PDF