Search | Korea Science

Automated annotation of web page contents for rapid creation of Semantic web contents

Phuong Tu Minh;Duy Pham Hoang;Kien Trinh Huu
- Proceedings of the IEEK Conference / summer / pp.376-381 / 2004
The Semantic Web is an extension of the current Web in which information is given formal and explicit meaning. The Semantic Web enables computer programs to understand information contents and thus facilitates more efficient discovery, automation, integration and sharing of data. To create Semantic Web contents one needs appropriate tools. In this paper, we describe such a toolkit we have constructed. The most important feature of the toolkit is that it makes use of information extraction techniques for automatically annotating web page contents. Experiments with a real life application show promising results and demonstrate the usefulness of the toolkit.

An Image Retrieving Scheme Using Salient Features and Annotation Watermarking

Wang, Jenq-Haur;Liu, Chuan-Ming;Syu, Jhih-Siang;Chen, Yen-Lin
- KSII Transactions on Internet and Information Systems (TIIS) / v.8 no.1 / pp.213-231 / 2014
Existing image search systems allow users to search for images by keywords, or by example images through content-based image retrieval (CBIR). However, users might learn more relevant textual information about an image from its text captions or the surrounding context within documents or Web pages. Without such context, it is difficult to extract a semantic description directly from the image content. In this paper, we propose an annotation watermarking system that lets users embed text descriptions and retrieve more relevant textual information from similar images. First, tags associated with an image are converted into a two-dimensional code and embedded into the image by discrete wavelet transform (DWT). Next, for images without annotations, similar images can be obtained by CBIR techniques and their embedded annotations can be extracted. Specifically, we use global features such as color ratios and dominant sub-image colors for preliminary filtering. Then, local features such as Scale-Invariant Feature Transform (SIFT) descriptors are extracted for similarity matching. This design can achieve good effectiveness with reasonable processing time in practical systems. Our experimental results showed good accuracy in retrieving similar images and extracting relevant tags from them.
https://doi.org/10.3837/tiis.2014.01.013

Design and Implementation of Domain Ontology to Overcome Conceptual Heterogeneity in Annotation-based Image Retrieval (주석기반 이미지 검색에서 개념적 이질성 극복을 위한 도메인 온톨로지 설계 및 구현)

Kim Won-Pil;Kim Pan-Koo
- Journal of Internet Computing and Services / v.4 no.4 / pp.1-8 / 2003
As multimedia information retrieval systems advance, research in the field is shifting from low-level content-based image retrieval to semantic, concept-based retrieval. In this paper, we apply ontology theory to overcome conceptual heterogeneity in annotation-based image retrieval, and we solve some of the problems that arise when the ontology is applied. Specifically, we apply a domain ontology to resolve the conceptual heterogeneity. The experimental results show that the semantic distance among words is considerably closer with the domain ontology than with WordNet. We thereby show the feasibility of semantic image retrieval when a domain ontology is applied to annotation-based image retrieval.

Automatic Generation of RDF Metadata for Semantic Search in Semantic Web (시맨틱 웹에서 의미 검색을 위한 RDF 메타데이타 자동 생성)

강상구;양재영;양승섭;최원종;최중민
- Proceedings of the Korea Intelligent Information System Society Conference / 2002.11a / pp.311-320 / 2002
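The aim of the Semantic Web is to enable computers to process the meaning of web documents as humans understand them. However, as the volume of information has surged with advances in information and communication technologies such as the Internet, it has become difficult to search these information resources effectively. To address this problem, this paper proposes a method for automatically generating RDF metadata for papers using an annotation editor. When a user annotates a paper, the system extracts features of the document and classifies it using an ontology interface. With the implemented system, the user can inspect the extracted metadata in the metadata view and edit it manually in the HTML view. The metadata can be stored in an RDF Repository, and the generation of the RDF metadata can be verified in the annotation view. RDF metadata generated in this way lets web robots easily grasp the meaning of the content and its category information. This paper focuses on a method for searching papers efficiently through a search engine using only the RDF metadata rather than the full content.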

Lifting a Metadata Model to the Semantic Multimedia World

Martens, Gaetan;Verborgh, Ruben;Poppe, Chris;Van De Walle, Rik
- Journal of Information Processing Systems / v.7 no.1 / pp.199-208 / 2011
This paper describes best practices in lifting an image metadata standard to the Semantic Web. We provide guidelines on how an XML-based metadata format can be converted into an OWL ontology. Additionally, we discuss how this ontology can be mapped to the W3C's Media Ontology, a standardization effort of the W3C to provide a core vocabulary for multimedia annotations. The approach presented here can be applied to other XML-based metadata standards.
https://doi.org/10.3745/JIPS.2011.7.1.199

Conversation Context Annotation using Speaker Detection (화자인식을 이용한 대화 상황정보 어노테이션)

Park, Seung-Bo;Kim, Yoo-Won;Jo, Geun-Sik
- Journal of Korea Multimedia Society / v.12 no.9 / pp.1252-1261 / 2009
One notable challenge in video search and summarization is extracting semantics from video content and annotating its context. Video semantics or context can be obtained by two methods: extracting objects, and extracting the contexts between objects in the video. However, a method that extracts only objects does not express enough semantics for a shot or scene, because it does not describe the relations and interactions between objects. To be more effective, after some objects are extracted, context such as the relations and interactions between objects needs to be extracted from the conversation situation. This paper studies how to detect the speaker and how to compose the conversational context in order to annotate it. We propose a method in which characters are recognized through face recognition, the speaker is detected through mouth motion, the conversation context is extracted using rules based on the presence of a speaker, the number of characters, and the presence of subtitles, and, finally, the scene context is converted to an XML file and saved.

A WWW Images Automatic Annotation Based On Multi-cues Integration (멀티-큐 통합을 기반으로 WWW 영상의 자동 주석)

Shin, Seong-Yoon;Moon, Hyung-Yoon;Rhee, Yang-Won
- Journal of the Korea Society of Computer and Information / v.13 no.4 / pp.79-86 / 2008
With the rapid development of the Internet, images embedded in HTML web pages have become predominant. Because they are so effective at describing content and attracting attention, images have become substantially important in web pages. Together, these images constitute a considerable database. Moreover, the semantic meanings of images are well represented by the surrounding text and links. However, only a small minority of these images have precisely assigned keyphrases, and manually assigning keyphrases to existing images is very laborious. It is therefore highly desirable to automate the keyphrase extraction process. In this paper, we first introduce WWW image annotation methods based on low-level features, page tags, overall word frequency, and local word frequency. We then put forward our method of multi-cue integration for image annotation. Finally, we show through an experiment that the multi-cue image annotation method is superior to the other methods.

Korean Nominal Bank, Using Language Resources of Sejong Project (세종계획 언어자원 기반 한국어 명사은행)

Kim, Dong-Sung
- Language and Information / v.17 no.2 / pp.67-91 / 2013
This paper describes Korean Nominal Bank, a project that provides argument structure for instances of the predicative nouns in the Sejong parsed Corpus. We use the language resources of the Sejong project so that the same set of data is annotated with more and more levels of annotation, since a new type of language-resource building project could bring new information of separate and isolated processing. We have based the annotation scheme on the Sejong electronic dictionary, the semantically tagged corpus, and the syntactically analyzed corpus. Our work also involves deep linguistic knowledge of the syntax-semantics interface in general. We consider semantic theories including the Frame Semantics of Fillmore (1976), the argument structure of Grimshaw (1990), and the argument alternation of Levin (1993) and Levin and Rappaport Hovav (2005). Various syntactic theories are needed to explain various sentence types, including empty categories, raising, and left (or right) dislocation. We also need an explanation of idiosyncratic lexical features such as collocation.

Improving a CNN-based Image Annotation System Using Multi-Labeled Images (다중 레이블 이미지를 활용한 CNN기반 이미지 어노테이션 시스템의 개선)

Kim, Taeksoo;Kim, Sangbum
- Annual Conference on Human and Language Technology / 2015.10a / pp.99-103 / 2015
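Driven by recent advances in deep learning, research on automatically generating relevant words or sentences from images is under way, but much of this work requires a well-curated training set in which images and words correspond one-to-one. Meanwhile, as the spread of smartphones has fueled the rapid growth of image-based social networking services such as Instagram and Pholar, data in which multiple words (tags) are attached to a single image are exploding on the Internet. In this paper, we propose an improved CNN architecture and learning algorithm that generate tags from images by using such large-scale multi-label data together with a small, well-curated training set. Adding a hidden layer to an existing classification-based model and introducing a new training method improved annotation performance by more than 11% over the existing model.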

Design and Implementation of Ontology-Annotation System for Semantic Web (시맨틱웹을 위한 온톨로지 주석 시스템의 설계와 구현)

Ryu Yeong-Hyeon;Yong Wang;Han Sung-Kook
- Proceedings of the Korean Information Science Society Conference / 2006.06b / pp.226-228 / 2006
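Alongside the development of the Web, research on applications of the Semantic Web is actively under way. However, such research has yet to yield results, which can be attributed to the lack of a system that can accurately analyze and manage information resources semantically. In this paper, we analyze existing annotators and implement an ontology annotation system needed for Semantic Web applications, presenting a way for users to retrieve exactly the information they want and to manage and store it conveniently.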

