• Title/Summary/Keyword: 사용자 문서

Search Result 1,729, Processing Time 0.026 seconds

An Equation Retrieval System Based on Weighted Sum of Heterogenous Indexing Terms (이질적 색인어의 가중치 합에 기반한 수식 검색 시스템)

  • Shin, Jun-Soo;Kim, Hark-Soo
    • Journal of KIISE:Software and Applications
    • /
    • v.37 no.10
    • /
    • pp.745-750
    • /
    • 2010
  • To effectively retrieve mathematical documents including various equations, mathaware search engines are needed. In this paper, we propose a equation retrieval system which helps users effectively search structurally similar equations. The proposed system disassembles MathML equations into three types of heterogeneous indexing terms; operators, variables, and partial structures of equations. Then, it independently indexes the disassembled terms. When a user inputs a MathML equation, the proposed system searches and ranks equations using weighted sums of three language models for the heterogeneous indexing terms. In the experiments with 244,744 MathML equations, three proposed system showed reliable performances (a P@1 of 53% in the closed test and a P@1 of 63% in the open test).

Retrieval Model Based on Word Translation Probabilities and the Degree of Association of Query Concept (어휘 번역확률과 질의개념연관도를 반영한 검색 모델)

  • Kim, Jun-Gil;Lee, Kyung-Soon
    • The KIPS Transactions:PartB
    • /
    • v.19B no.3
    • /
    • pp.183-188
    • /
    • 2012
  • One of the major challenge for retrieval performance is the word mismatch between user's queries and documents in information retrieval. To solve the word mismatch problem, we propose a retrieval model based on the degree of association of query concept and word translation probabilities in translation-based model. The word translation probabilities are calculated based on the set of a sentence and its succeeding sentence pair. To validate the proposed method, we experimented on TREC AP test collection. The experimental results show that the proposed model achieved significant improvement over the language model and outperformed translation-based language model.

Design the Scene Coposition-Information using Multi-Source based on MMT-CI (MMT-CI 기반 멀티 소스를 활용한 효율적 장면구성정보 설계)

  • Hong, Ji-Ye;Kim, Kyuheon
    • Proceedings of the Korean Society of Broadcast Engineers Conference
    • /
    • 2014.11a
    • /
    • pp.107-110
    • /
    • 2014
  • 방통융합시대가 도래함에 따라 방송망뿐만 아니라 IP망을 통해 콘텐츠를 소비할 수 있는 스마트TV의 보급이 급속히 확산되고 있다. 또한 영상과 음악 및 광고와 같은 멀티미디어 콘텐츠의 소비 환경이 한 화면에 한 가지의 미디어 서비스만 제공받는 환경에서 다양한 정보를 동시에 소비할 수 있는 환경으로 확대되면서 다양한 정보를 하나의 화면에서 혼용적으로 전달하기 위한 장면구성정보가 필요하게 되었다. 장면구성정보는 미디어가 소비되는 특정 시간과 해당 공간의 정보를 기술하므로써 다수의 미디어를 복합적으로 소비하는 방법으로, IP 기반에서도 여러 개의 화면을 통해 여러 소스의 콘텐츠를 소비하여 다양한 사용자 경험(User Experience)를 제공하고자 ISO/IEC JTC1/SC29/WG11(별칭 MPEG)에서는 방송서비스에서 방송망 및 IP망을 활용하여 멀티미디어 데이터를 전송할 때 사용하는 MPEG Media Transport(MMT)를 기반으로 장면구성정보를 제공하기 위한 MMT-Composition Information(MMT-CI)를 기술표준화하였다. 본 논문에서는 MMT-CI의 표준문서에 따라 W3C의 웹언어인 HTML5와 확장성 언어인 XML을 이용하여 멀티소스를 활용한 장면구성정보를 기술하는 것을 제안하고 장면구성서비스에 적용을 위한 플레이어를 구현 및 검증한다.

  • PDF

A Fast and Powerful Question-answering System using 2-pass Indexing and Rule-based Query Processing Method (2-패스 색인 기법과 규칙 기반 질의 처리기법을 이용한 고속, 고성능 질의 응답 시스템)

  • 김학수;서정연
    • Journal of KIISE:Software and Applications
    • /
    • v.29 no.11
    • /
    • pp.795-802
    • /
    • 2002
  • We propose a fast and powerful Question-answering (QA) system in Korean, which uses a predictive answer indexer based on 2-pass scoring method. The indexing process is as follows. The predictive answer indexer first extracts all answer candidates in a document. Then, using 2-pass scoring method, it gives scores to the adjacent content words that are closely related with each answer candidate. Next, it stores the weighted content words with each candidate into a database. Using this technique, along with a complementary analysis of questions which is based on lexico-syntactic pattern matching method, the proposed QA system saves response time and enhances the precision.

Automatic translation system for hangul's romanization Based on the World Wide Web (웹 기반하의 국어의 로마자 전사 표기 자동 변환 시스템)

  • 김홍섭
    • Journal of the Korea Society of Computer and Information
    • /
    • v.7 no.4
    • /
    • pp.108-114
    • /
    • 2002
  • After automatic translation system for hangul's romanization based on the World Wide Web converting korean-word, sentence, document to Transliteration letters by applying algorithm based phonological principles. even though a user do not know the basic principles of the usage of Korean-to-Romanization notations, It refers to corresponding character table that has been currently adopted the authority's standard proposition for Korean-to-Romanization notation rule concurrently, add to make possible to convert a machinized code as well. It provides font for toggling Korean-English mode, insert-edit mode by assigning ASCII codes and Unicode are hardly used to them. This program could be made in C++ progamming language and Unified Modeling Language to implement various font. font-expanding and condensing. alternative printing.

  • PDF

The Design and Implementation of Template Markup Language Script Processor for Electronic Shopping Mall based on XML (XML기반 전자 쇼핑몰을 위한 템플릿 마크업 언어 스크립트 처리기의 설계 및 구현)

  • 김규태;이수연
    • Journal of the Korea Institute of Information and Communication Engineering
    • /
    • v.6 no.2
    • /
    • pp.169-174
    • /
    • 2002
  • According to the expansion of the E-Commerce based on Internet, the Interoperability between shopping malls and the expansibility of B2B has been needed. Also, Intelligent User Interface has been needed. The XML based Script Processor, the solution of the problem, is good for Interoperability and if a shopping mall is build by it, it is possible to do customer oriented display that an XML document is displayed with different looks by other style sheet according to customer's preference. In the proposed system, the TMP(Template Markup Language), an auto XML generation script, is defined by XML and the script processor is implemented to work on the shopping mall on the Web.

Design and Implementation of a Character Agent based Multimodal Presentation Authoring Tool (캐릭터 에이전트 기반 멀티모달 프리젠테이션 저작도구 설계 및 구현)

  • 정성태;정석태
    • Journal of the Korea Institute of Information and Communication Engineering
    • /
    • v.7 no.5
    • /
    • pp.941-948
    • /
    • 2003
  • Character agent based Multimodal Presentation Markup Language(MPML) has been developed to increase the efficiency of the presentation using a computer. However, authoring of a presentation by using MPML is not simple because MPML describes only the behavior of the character agent. It describes the presentation background by importing HTML documents. This paper proposes EMPML(Extended MPML) which describes not only the behavior of the character agent but also the presentation background. And an authoring tool for the EMPML has been designed and implemented. By integrating the editing of the presentation background and character agent behavior, the proposed authoring tool supports WYSIWIG(What You See Is What You Get) mode designing. By using the proposed authoring tool, users can make a multimodal presentation without knowing the details of EMPML.

A Filtering Method of Spam E-mails by Term Weighting (단어 가중치를 이용한 스팸메일 필터링)

  • Kim, Ho-Seong;Jung, Kyung-Ho;Hwang, Do-Sam
    • Annual Conference on Human and Language Technology
    • /
    • 2001.10d
    • /
    • pp.171-174
    • /
    • 2001
  • 현재, 전자메일을 정보전달의 수단이나 광고등의 목적으로 많이 이용하게 되면서, 메일 수신자는 원치 않는 상업적 광고, 불필요한 정보등의 스팸메일을 여과 없이 수신하게 되는 경우가 많아졌다. 이로 인하여 업무효율성 감퇴, 시간 낭비, 자원 낭비 등의 많은 문제점을 야기시키고 있다. 이러한 문제점을 해결하기 위한 기존의 메일 필터링 시스템들은 송신자의 주소나 도메인, 제목 등의 메일 헤더정보만을 이용하거나, 사용자가 정의한 문장이 본문 내용에 나타날 때 필터링하는 방식들이 주류를 이루고 있다. 그러나 이러한 방식들은 메일의 내용에 대한 근본적인 필터링이 불가능하다. 본 논문에서는 메일의 내용을 파악하기 위해 메일의 내용을 대표할 수 있는 체언정보를 추출하여, 카이제곱 통계량 공식을 통해 단어 가중치를 부여하고, 이를 문서분류를 위한 로그단어 빈도 가중치 공식에 적용하여 스팸메일을 필터링하는 방식을 제시한다. 본 논문에서 제안한 방법으로 실험한 결과, 스팸메일을 필터링하는데 84.61%의 재현율과 83.01%의 정확율을 얻을 수 있었다.

  • PDF

A study of Postscript Converter using XSL-FO (XSL-FO를 이용한 PostScript Converter에 관한 연구)

  • 유동석;최호찬;이진영;김차종
    • Proceedings of the Korean Institute of Information and Commucation Sciences Conference
    • /
    • 2003.10a
    • /
    • pp.109-112
    • /
    • 2003
  • Web documents specified with HTML and CSS is displayed with high quality on the web browser. However, basically, any printed pages don't have the same quality of softcopy. The reason is HTML and CSS is not suit for printing. The XSL-FO(XSL-Formatting Object) is Formatting Language for imaging of web document and The PostScript is one of the most famous PDL(Page Description Language). To get high-quality pages, we propose the design of converter which translate XML-FO into PostScript format. Using the designed converter, we can get hardcopies with high quality.

  • PDF

A study of feature catalogue standard of marine GIS (해양지리정보 피쳐 카탈로그 표준에 관한 연구)

  • Park, Jong-Min;Cho, Young-Po;Suh, Sang-Hyun
    • Journal of Navigation and Port Research
    • /
    • v.28 no.1
    • /
    • pp.91-96
    • /
    • 2004
  • Although features, core element of marine GIS in many application, are same things users have difficulty in using them on account of varying according to method of classification. Accordingly feature cataloguing in accordance with the standard is the trend of the modem world. In this article we have became familiar with ISO 19110 - Methodology for Feature Cataloguing, we was able to discuss element and definition of features for the purpose of Marine GIS's activation. Through the result of study, we presented the methodology of Marine GIS feature cataloguing.