• Title/Abstract/Keywords: textual information

Search results: 241 items (processing time: 0.023 s)

Annotation Modeling and System Implementation for Hand-held Environment

  • 손원성
    • 정보교육학회논문지
    • /
    • Vol. 10, No. 2
    • /
    • pp.219-226
    • /
    • 2006
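  • An annotation is a note that expresses a reader's opinions, organization, summaries, and the like in a document. Annotations are therefore important in electronic documents as well, and they are especially effective in mobile-device environments that use digital inking. However, existing studies do not consider the small display area that is a drawback of hand-held devices, which makes creating and using annotations very inconvenient. This paper therefore proposes an annotation model and system designed for electronic pens and mobile devices. The proposed model takes various contexts into account and defines an annotation markup language on that basis. It covers various annotation types and a semantic model, automatic recognition of pen-based annotations, and correction of annotation regions, and on this basis defines CAML (Context-based Annotation Markup Language). Using the model, we also implement an annotation system for XML-based e-book documents and hand-held devices and examine its potential uses. The results of this study are applicable to eLearning, Cyber-Class, and IETM (Interactive Electronic Technical Manuals).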

Design of a Conceptual Geosemantic Web Service Framework supporting Textual Geospatial Information

  • 하수욱;남광우
    • Spatial Information Research
    • /
    • Vol. 19, No. 4
    • /
    • pp.91-97
    • /
    • 2011
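  • This study proposes a service architecture that applies semantic web technology to the geospatial information field. With recent advances in web service technology and the spread of wireless Internet and smartphones, a variety of services related to geographic information have appeared. In addition, as semantic web technology has been applied to major search portals, the general public can more easily access the diverse information published online. In the geospatial field, however, there has been little research on usable service types other than the linking of geospatial data with heterogeneous storage structures. This study therefore derives the functional components of a semantic geospatial service on the basis of standards for geospatial web services and the semantic web. It also presents applicable use cases and a semantic geospatial service architecture, thereby laying a foundation for future implementations of semantic geospatial services.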

An Exploratory Investigation on Multimedia Information Needs and Searching Behavior among College Students

  • 정은경
    • 한국문헌정보학회지
    • /
    • Vol. 46, No. 3
    • /
    • pp.251-270
    • /
    • 2012
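  • Multimedia searching is an important search activity in everyday life, and multimedia information needs and searching behavior have characteristics of their own that differ from text-based information needs and searching behavior. This study presents an exploratory investigation of the multimedia information needs and searching behavior of 20 male and female college students in Seoul. The students' multimedia information needs and the resulting search processes were video- and audio-recorded, and in-depth post-search interviews were also conducted. The results of analyzing these data are presented in four categories: multimedia search sources, information needs, grounds for relevance judgments, and search barriers. As for search sources, domestic portal sites and Google were mainly used, but sources reflecting the characteristics of the individual media (image, audio, and video) were also identified. Specific information needs were pursued more often than general information needs, but the share of general information needs varied with the characteristics of the individual media. Relevance criteria and search barriers varied considerably across the individual media. These results show that, within the existing integrated multimedia search environment, there is a need to implement search environments that reflect the information needs and characteristics of the individual media.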

A Knowledge-based Model for Semantic Oriented Contextual Advertising

  • Maree, Mohammed;Hodrob, Rami;Belkhatir, Mohammed;Alhashmi, Saadat M.
    • KSII Transactions on Internet and Information Systems (TIIS)
    • /
    • Vol. 14, No. 5
    • /
    • pp.2122-2140
    • /
    • 2020
  • Proper and precise embedding of commercial ads within Webpages requires ad hoc analysis and understanding of their content. When this step is implemented successfully, both publishers and advertisers benefit: revenues increase on the one hand, and user experience improves on the other. In this research work, we propose a novel multi-level context-based ad serving approach through which ads are served at generic publisher websites based on their contextual relevance. In the proposed approach, knowledge encoded in domain-specific and generic semantic repositories is exploited in order to analyze and segment Webpages into sets of contextually-relevant segments. Semantically-enhanced indexes are also constructed to index ads based on their textual descriptions provided by advertisers. A modified cosine similarity matching algorithm is employed to embed each ad from the Ads repository into one or more contextually-relevant segments. In order to validate our proposal, we have implemented a prototype of an ad serving system with two datasets that consist of (11429 ads and 93 documents) and (11000 documents and 15 ads), respectively. To demonstrate the effectiveness of the proposed techniques, we experimentally tested the proposed method and compared the produced results against five baseline metrics that can be used in the context of ad serving systems. In addition, we compared the results produced by our system with other state-of-the-art models. Findings demonstrate that the accuracy of conventional ad matching techniques is improved by exploiting the proposed semantically-enhanced context-based ad serving model.
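
The matching step in the abstract above can be illustrated with a minimal sketch. It uses plain cosine similarity over bag-of-words term counts, not the authors' modified algorithm or semantic indexes, whose details the abstract does not give. The function names `term_vector`, `cosine_similarity`, and `best_segment` are hypothetical.

```python
import math
from collections import Counter


def term_vector(text):
    """Lowercased bag-of-words term-frequency vector (an assumed representation)."""
    return Counter(text.lower().split())


def cosine_similarity(a, b):
    """Plain cosine similarity between two sparse term vectors."""
    dot = sum(count * b[term] for term, count in a.items())
    norm_a = math.sqrt(sum(c * c for c in a.values()))
    norm_b = math.sqrt(sum(c * c for c in b.values()))
    if norm_a == 0 or norm_b == 0:
        return 0.0  # an empty text is similar to nothing
    return dot / (norm_a * norm_b)


def best_segment(ad_text, segments):
    """Return (index, score) of the page segment most similar to the ad."""
    ad_vec = term_vector(ad_text)
    scores = [cosine_similarity(ad_vec, term_vector(s)) for s in segments]
    best = max(range(len(segments)), key=lambda i: scores[i])
    return best, scores[best]
```

For instance, `best_segment("discount marathon running shoes", segments)` would pick whichever segment shares the most weighted terms with the ad text; a real system would likely add stemming, stop-word removal, and the semantic enrichment the abstract describes.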

Digital Marketing Tools for Managing the Development of Park and Recreation Complexes

  • Chaikovska, Maryna;Mashika, Hanna;Mankovska, Ruslana;Liulchak, Zoreslava;Haida, Pavlo;Diakova, Yana
    • International Journal of Computer Science & Network Security
    • /
    • Vol. 22, No. 5
    • /
    • pp.154-162
    • /
    • 2022
  • Digital marketing tools are actively used in managing the development of park and recreation complexes to familiarize the population with natural heritage sites. This article aims to empirically evaluate digital marketing tools for popularizing park and recreation complexes. The methodology was based on the concept of the ecosystem value of park and recreation complexes as natural heritage sites. The methods included identifying and selecting websites with information about park and recreation complexes in Slovakia and Ukraine, structural analysis of the main channels of online information about natural parks, and assessing the current state of the online identity of the studied sites from the perspective of Internet users. The results indicate that, to manage their development, park and recreation complexes have developed their own official websites, whose sections structure the information and set out data on tourism and recreational potential. The article identifies additional digital marketing tools for managing the development of park and recreation complexes, particularly social networks and tourist websites. There is a sufficient amount of information about tourist recreation sites within these natural parks and about tourist routes. The main problems of the websites are that the information is entirely textual; data on social networks are insufficient despite the official pages that have been created; there is no video content, which would attract more tourists and visitors by allowing a visual assessment of the tourist potential; and there is a problem of many communication channels for presenting the natural heritage of the countries. The research shows that the website is the primary and most common digital marketing tool for natural heritage, structuring information about tourism potential and recreation.

Human Action Recognition Using Pyramid Histograms of Oriented Gradients and Collaborative Multi-task Learning

  • Gao, Zan;Zhang, Hua;Liu, An-An;Xue, Yan-Bing;Xu, Guang-Ping
    • KSII Transactions on Internet and Information Systems (TIIS)
    • /
    • Vol. 8, No. 2
    • /
    • pp.483-503
    • /
    • 2014
  • In this paper, human action recognition using pyramid histograms of oriented gradients and collaborative multi-task learning is proposed. First, we accumulate global activities and construct a motion history image (MHI) for the RGB and depth channels respectively to encode the dynamics of one action in different modalities, and then different action descriptors are extracted from the depth and RGB MHIs to represent the global textual and structural characteristics of these actions. Specifically, the average value in hierarchical blocks, GIST, and pyramid histograms of oriented gradients descriptors are employed to represent human motion. To demonstrate the superiority of the proposed method, we evaluate the descriptors with KNN, SVM with linear and RBF kernels, and SRC and CRC models on the DHA dataset, a well-known dataset for human action recognition. Large-scale experimental results show that our descriptors are robust, stable, and efficient, and outperform the state-of-the-art methods. In addition, we further investigate the performance of our descriptors by combining them on the DHA dataset, and observe that combined descriptors perform much better than any single descriptor. With multimodal features, we also propose a collaborative multi-task learning method for model learning and inference based on transfer learning theory. The main contributions lie in four aspects: 1) the proposed encoding scheme can filter out the stationary part of the human body and reduce noise interference; 2) different kinds of features and models are assessed, and the neighbor gradient information and pyramid layers are very helpful for representing these actions; 3) the proposed model can fuse the features from different modalities regardless of the sensor types, the value ranges, and the dimensions of the different features; 4) the latent common knowledge among different modalities can be discovered by transfer learning to boost performance.
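
To make the motion history image (MHI) step concrete, here is a minimal sketch of one common MHI formulation: pixels that changed between consecutive frames are set to a maximum value `tau`, and all other pixels decay by 1 per frame. The abstract does not specify the authors' exact encoding scheme, so the update rule, the `tau` and `threshold` parameters, and the function names are assumptions for illustration. Frames are plain lists of rows of grayscale intensities.

```python
def update_mhi(mhi, prev_frame, frame, tau=10, threshold=30):
    """One MHI update step (assumed rule, not taken from the paper).

    A pixel whose intensity changed by more than `threshold` is set to `tau`;
    every other pixel decays by 1, floored at 0.
    """
    new_mhi = []
    for h_row, p_row, f_row in zip(mhi, prev_frame, frame):
        new_mhi.append([
            tau if abs(f - p) > threshold else max(0, h - 1)
            for h, p, f in zip(h_row, p_row, f_row)
        ])
    return new_mhi


def motion_history_image(frames, tau=10, threshold=30):
    """Accumulate an MHI over a sequence of equally sized grayscale frames."""
    rows, cols = len(frames[0]), len(frames[0][0])
    mhi = [[0] * cols for _ in range(rows)]
    for prev_frame, frame in zip(frames, frames[1:]):
        mhi = update_mhi(mhi, prev_frame, frame, tau, threshold)
    return mhi
```

Recent motion ends up brighter than older motion, and stationary regions stay at 0, which matches the abstract's remark that the encoding filters out the stationary part of the body.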

CASPER: Congestion Aware Selection of Path with Efficient Routing in Multimedia Networks

  • Obaidat, Mohammad S.;Dhurandher, Sanjay K.;Diwakar, Khushboo
    • Journal of Information Processing Systems
    • /
    • Vol. 7, No. 2
    • /
    • pp.241-260
    • /
    • 2011
  • In earlier days, most of the data carried on communication networks was textual data requiring limited bandwidth. With the rise of multimedia and network technologies, the bandwidth requirements of data have increased considerably. If a network link at any time is not able to meet the minimum bandwidth requirement of the data, transmission over that path becomes difficult, which leads to network congestion. This causes delay in data transmission and might also lead to packet drops in the network. The retransmission of these lost packets would aggravate the situation and jam the network. In this paper, we aim at providing a solution to the problem of network congestion in mobile ad hoc networks [1, 2] by designing a protocol that performs routing intelligently and minimizes the delay in data transmission. Our objective is to move the traffic away from the shortest path obtained by a suitable shortest path calculation algorithm to a less congested path so as to minimize the number of packet drops during data transmission and to avoid unnecessary delay. For this we have proposed a protocol named Congestion Aware Selection Of Path With Efficient Routing (CASPER). Here, a router runs the shortest path algorithm after pruning those links that violate a given set of constraints. The proposed protocol has been compared with two link state protocols, namely OSPF [3, 4] and OLSR [5, 6, 7, 8]. The results achieved show that our protocol performs better in terms of network throughput and transmission delay in the case of bulky data transmission.
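
The idea of running a shortest path algorithm after pruning constraint-violating links can be sketched as follows. This is not the CASPER protocol itself: the single minimum-bandwidth constraint, the graph layout (`{node: {neighbor: (cost, available_bandwidth)}}`), and the function names are assumptions chosen for illustration, and Dijkstra's algorithm stands in for whichever shortest path algorithm the authors use.

```python
import heapq


def prune(graph, min_bandwidth):
    """Drop links whose available bandwidth is below the (assumed) constraint."""
    return {
        u: {v: cost for v, (cost, bw) in nbrs.items() if bw >= min_bandwidth}
        for u, nbrs in graph.items()
    }


def shortest_path(graph, source, target):
    """Dijkstra on a pruned graph {u: {v: cost}}; returns a node list or None."""
    dist = {source: 0}
    prev = {}
    heap = [(0, source)]
    visited = set()
    while heap:
        d, u = heapq.heappop(heap)
        if u in visited:
            continue
        visited.add(u)
        if u == target:
            break
        for v, cost in graph.get(u, {}).items():
            nd = d + cost
            if nd < dist.get(v, float("inf")):
                dist[v] = nd
                prev[v] = u
                heapq.heappush(heap, (nd, v))
    if target not in dist:
        return None  # no route satisfies the constraint
    path = [target]
    while path[-1] != source:
        path.append(prev[path[-1]])
    return path[::-1]
```

With a low bandwidth requirement the cheapest route is returned unchanged; raising `min_bandwidth` removes the congested links first, so the same call then yields a longer but less congested route, or `None` when no route qualifies.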

Relation Extraction based on Composite Kernel combining Pattern Similarity of Predicate-Argument Structure

  • 정창후;최성필;최윤수;송사광;전홍우
    • 인터넷정보학회논문지
    • /
    • Vol. 12, No. 5
    • /
    • pp.73-85
    • /
    • 2011
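  • When automatically extracting relations between key entities in the literature, various kinds of document analysis results can be used. This paper proposes a composite kernel that simultaneously uses the phrase-structure similarity information of the convolution parse tree kernel, which was developed earlier and has shown relatively high performance, and the similarity information of predicate-argument structure patterns, which express meaningful associations between two entities. We built a complementary composite kernel by combining the existing convolution parse tree kernel, which uses syntactic structure, with a predicate-argument structure pattern similarity kernel, which uses the semantic structure between predicates and arguments, and measured the performance of the resulting kernel through experiments on various test collections. The experiments confirmed that the composite kernel combining predicate-argument structure pattern information performs better than the convolution parse tree kernel, which uses phrase-structure information, on its own. They also confirmed that it outperforms existing systems.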

Development of MSDS Map for Visual Safety Management of Hazardous and Chemical Materials

  • 신명우;서용윤
    • 한국안전학회지
    • /
    • Vol. 34, No. 2
    • /
    • pp.48-55
    • /
    • 2019
  • To prevent accidents caused by chemical materials, MSDS (Material Safety Data Sheet) data have so far been produced to explain how to use and manage hazardous and chemical materials safely. However, it is difficult for users who handle these materials to understand the MSDS data because they are listed only in alphabetical order, not by specific factors such as similarity of characteristics. This limits how well the types of chemical materials can be represented with respect to their characteristics. Thus, in this study, a large amount of MSDS data is visualized based on the relationships among the characteristics of the chemical materials in order to support safety managers. For this, we used a text-mining algorithm, which extracts text keywords contained in documents, and the Self-Organizing Map (SOM) algorithm, which presents textual data visually. The target data are the guide texts contained in the MSDS documents of the Occupational Safety and Health Administration (OSHA) in the United States, which include usage information such as the reactivity and potential risks of materials. First, using the text-mining algorithm, information on the chemicals is extracted from these guide texts. Next, the MSDS map is developed using SOM in terms of the similarity of the text information of the chemical materials. The MSDS map is helpful for effectively classifying chemical materials by mapping prohibited and hazardous substances on the developed SOM map. As a result, using the MSDS map, it is easy for safety managers to detect prohibited and hazardous substances with respect to the Industrial Safety and Health Act standards.

English Predicate Inversion: Towards Data-driven Learning

  • Kim, Jong-Bok;Kim, Jin-Young
    • 영어영문학
    • /
    • Vol. 56, No. 6
    • /
    • pp.1047-1065
    • /
    • 2010
  • English inversion constructions are not only hard for non-native speakers to learn but also difficult to teach, mainly because of their intriguing grammatical and discourse properties. This paper addresses grammatical issues in learning or teaching the so-called 'predicate inversion (PI)' construction (e.g., Equally important in terms of forest depletion is the continuous logging of the forests). In particular, we chart the grammatical (distributional, syntactic, semantic, pragmatic) properties of the PI construction, and argue for data-driven teaching of English grammar. To depart from the armchair style of grammar teaching (relying on author-made simple sentences), our teaching method introduces data-driven teaching. A total of 25 university students in a grammar-related class together analyzed the British Component of the International Corpus of English (ICE-GB), containing about one million words distributed across a variety of textual categories. We have identified total 290 PI sentences (206 from spoken and 87 from written texts). The preposed syntactic categories of the PI involve five main types: AdvP, PP, VP(ed/ing), NP, AP, and so, all of which function as the complement of the copula. In terms of discourse, we have observed, supporting Birner and Ward's (1998) observation, that these preposed phrases represent more familiar information than the postposed subject. The corpus examples gave us three possible types: The preposed element is discourse-old whereas the postposed one is discourse-new, as in Putting wire mesh over a few bricks is a good idea. Both preposed and postposed elements can also be discourse-new, as in But a fly in the ointment is inflation. These two elements can also be discourse-old, as in Racing with him on the near-side is Rinus.
The dominant occurrence of the PI in the spoken texts also supports the view that the balance (or scene-setting) in information structure is the main trigger for the use of the PI construction. After being exposed to the real data and to in-depth syntactic as well as information-structure analysis of the PI construction, the students were found to have a far clearer understanding of the construction in question and to have realized that grammar does not exist in isolation but interacts tightly with other important grammatical components such as information structure. The study directs us toward grammar teaching that is both data-driven and interactive.