Search | Korea Science

A Study on Focused Crawling of Web Document for Building of Ontology Instances (온톨로지 인스턴스 구축을 위한 주제 중심 웹문서 수집에 관한 연구)

Chang, Moon-Soo
- Journal of the Korean Institute of Intelligent Systems
- /
- v.18 no.1
- /
- pp.86-93
- /
- 2008
The construction of ontology defines as complicated semantic relations needs precise and expert skills. For the well defined ontology in real applications, plenty of information of instances for ontology classes is very critical. In this study, crawling algorithm which extracts the fittest topic from the Web overflowing over by a great number of documents has been focused and developed. Proposed crawling algorithm made a progress to gather documents at high speed by extracting topic-specific Link using URL patterns. And topic fitness of Link block text has been represented by fuzzy sets which will improve a precision of the focused crawler.
https://doi.org/10.5391/JKIIS.2008.18.1.086 인용 PDF KSCI

Developing Subject Headings for Subject Access of Children's Picture Books (어린이 그림책의 주제접근을 위한 주제명표 개발)

Park, Ziyoung
- Proceedings of the Korean Society for Information Management Conference
- /
- 2012.08a
- /
- pp.105-108
- /
- 2012
어린이 그림책의 효과적인 주제접근을 위해서는 주제명의 부여가 필수적이다. 그러나 어린이 그림책은 표제나 목차 등 자료 자체에서 주제명을 발췌하기가 어렵다. 또한 일반도서를 위한 주제명표를 그대로 사용하기도 적합하지 않다. 이에 본 연구에서는 어린이 그림책을 위한 별도의 주제명표를 개발하였다. 주제명표 개발을 위해서는 영미권의 그림책 주제명표를 참고하였으며, 이 과정에서 우리 문화와 언어에 맞도록 기존의 표목을 수정.추가하였다. 또한 그림책의 주요 독자층인 어린이에게 적합한 표목을 선정하기 위해 초등국어사전 등을 참고하였다. 그리고 시범적으로 구축된 주제명표를 어린이도서연구회의 추천그림책을 대상으로 적용하였다. 앞으로는 지속적인 주제명 부여 작업을 통해 주제명표를 정련하고, 그림책의 주제분석을 위한 지침을 추가적으로 제공할 필요가 있을 것이다.
PDF

Understanding Topical Relevance of Multimedia based on EEG Techniques (뇌파측정기술(EEG)에 기초한 멀티미디어 자료의 주제 적합성에 관한 연구)

Kim, Hyun-Hee;Kim, Yong-Ho
- Journal of the Korean Society for Library and Information Science
- /
- v.50 no.3
- /
- pp.361-381
- /
- 2016
This study proposed two topical relevance models, simple and complex models, using EEG/ERP techniques. In the simple model regarding simple search tasks, N300 and P3b components are used. The N300 is specific to the semantic processing of pictures and the P3b reflects mechanisms involved in the decision about whether an external stimulus matches or does not match an internal representation of a specific category. In the complex model regarding complex search tasks, on the other hand, N400 and P600 components are used. The N400 reflects activation of an amodel system that integrates both image-based and conceptual representations into a context, whereas the P600 is related to complex cognitive processes. Our research results can be used as a source to design an EEG-based interactive multimedia system.
https://doi.org/10.4275/KSLIS.2016.50.3.361 인용 PDF KSCI

A Topic Classification System in cQA Services Based on Semi-Automatic Learning Using Wikipedia (위키피디아를 이용한 반자동 학습 기반의 cQA 서비스 주제 분류 시스템)

Kim, Taehyun
- Annual Conference on Human and Language Technology
- /
- 2015.10a
- /
- pp.139-141
- /
- 2015
본 논문은 커뮤니티 기반의 질의-응답 서비스에서 사용자 질의의 주제를 분류하는 시스템을 소개한다. 커뮤니티 기반의 질의-응답 서비스는 분야에 따라 다양한 주제를 가질 수 있으며 오늘 날 사용자 질의의 주제 분류에는 통계 기반의 분류 방법이 많이 이용되고 있다. 통계 기반의 분류 방법으로 사용자 질의를 분류하기 위해서는 주제에 적합한 대량의 학습 말뭉치가 필요하다. 주제에 적합한 대량의 학습 말뭉치를 사람이 직접 구축하는 것은 많은 시간과 비용이 든다. 따라서 본 논문에서는 이러한 문제를 해결하기 위해 위키피디아 문서를 Supervised K-means Clustering 기법으로 주제별로 분류함으로써 학습 말뭉치를 반자동으로 구축하는 방법을 제안한다. 그 다음, 생성된 학습 말뭉치로 지지 벡터 기계를 학습하여 사용자 질의의 주제를 분류하게 된다. 위키피디아 문서와 사용자 질의는 다른 도메인의 문서임에도 불구하고 본 논문의 시스템으로 사용자 질의의 주제를 분류한 결과 77.33%의 정확도를 보였다.
PDF

Korean Generative Chatbot using Topic Embedding (주제 임베딩을 활용한 한국어 생성 기반 챗봇)

Oh, Shinhyeok;Kim, Harksoo
- Annual Conference on Human and Language Technology
- /
- 2020.10a
- /
- pp.524-528
- /
- 2020
챗봇은 발화에 대해 컴퓨터가 자동으로 응답하는 시스템이다. 현재 챗봇은 전체 주제에 대한 잡담(chit-chat)보다는 특정 주제에 관한 대화를 목적으로 많이 개발되고 있다. 하지만 개개인이 필요로 하는 챗봇 용도에 적합한 학습 데이터는 부족하다. 이러한 상황에서 챗봇 학습을 위해 필요한 주제의 말뭉치를 대량으로 구축하는 것은 시간과 비용이 많이 소모되어 현실적으로 어렵다. 따라서 학습에 필요한 소량의 말뭉치만 사용하더라도 주제에 적합한 응답을 할 수 있는 챗봇이 필요하다. 이에 본 논문은 챗봇의 목적과 관련 없는 대량의 말뭉치와 소량의 주제 기반 말뭉치를 이용하여 높은 성능을 끌어낼 수 있는 주제 임베딩 방법을 제안한다.
PDF

User-centered relevance judgement model for information retrieval (정보검색에서의 사용자 중심 적합성 판단 모형)

Park, Jung-Ah;Sohn, Young-Woo
- Science of Emotion and Sensibility
- /
- v.12 no.4
- /
- pp.489-500
- /
- 2009
This research takes a user-centered approach to define relevance, the core concept in information retrieval. The literature on relevance has identified numerous factors affecting such a judgment. We examined the model of user relevance judgment that describes the relationship between user relevance criteria and different types of relevance with information search task. We consider 7 criteria of user relevance-topicality, novelty, reliability, understandability, specificity, richness, and interest-and 3 type of user relevance-cognitive relevance, situational relevance, and affective relevance. Data were collected from a semi-controlled survey and analyzed by a structural equation modeling. As a result, topicality and reliability were found to be the essential relevance criteria in all information retrieval tasks. In the fact search task, topicality, reliability, novelty, richness, and interest were found to be significant. In the problem solving search task, topicality, reliability, understandability, and specificity were found to be significant. In the decision making search task, topicality, reliability, novelty, understandability, richness, specificity, and interest were found to be significant. In addition, the relationships between types of user relevance were determined. This research made theoretical and practical contributions to the field of information retrieval by identifying a definite model of user relevance judgment.
PDF

An Experimental Study on the Effect of Domain Expertise on the Consistency of Relevance Judgements (주제전문지식이 적합성판정의 일관성에 미치는 영향에 관한 실험적 연구)

Scholten, Stacey;Moon, Sung-Been
- Journal of the Korean Society for information Management
- /
- v.38 no.3
- /
- pp.1-22
- /
- 2021
An online experiment was conducted to test the subject-knowledge view of relevance theory in order to find evidence of a conceptual basis for relevance. Six experts in Library and Information Science (LIS), nine Master's students of LIS, and twelve non-experts judged the relevance of 14 abstracts within and outside of the LIS domain. Consistency among the judges was calculated by joint-probability agreement (PA) and interclass correlation coefficients (ICC). When using PA to analyze the judgements, non-experts had a higher consensus regardless of the task or division of groups. However, ICC calculations found Master's candidates had a higher level of consensus than non-experts within LIS, although the experts did not; and the agreement rates on the non-LIS task for all groups were only poor to moderate. It was only when the groups were analyzed as two groups (experts including Master's candidates and non-experts) that the expected trend of higher consistency among experts in the LIS task was seen.
https://doi.org/10.3743/KOSIM.2021.38.3.001 인용 PDF KSCI

Developing Subject Headings for Children's Picture Books based on A to Zoo (어린이 그림책을 위한 주제명표 개발 연구: 『A to Zoo』를 바탕으로)

Park, Ziyoung
- Journal of the Korean Society for information Management
- /
- v.29 no.4
- /
- pp.251-271
- /
- 2012
Subject headings support the effective access of children's picture books. However, it is difficult to select subject terms from titles or table of contents in children's picture books because of their relatively little textual information. Therefore, it is necessary to assign subject terms to each picture book. However, it is not adequate to use general subject headings because the types and levels of general subject headings are different from special subject headings for the children's materials. For this reason, this study aims to develop subject headings for children's picture books. The subject terms in A to Zoo were selected, and the selected terms were translated into Korean and modified for the Korean culture and language. Other reference books, such as Elementary Korean Dictionary, were also used to determine adequate terms for children. The resulting subject headings were assigned to the recommended picture books for children and used to search by subject, browse, and recommend books.
https://doi.org/10.3743/KOSIM.2012.29.4.251 인용 PDF KSCI

Deciding The Relevance of Web Documents Using WordNet and BPN (WordNet과 BPN을 이용한 웹 문서 적합성 판단)

김원우;변영태
- Proceedings of the Korean Information Science Society Conference
- /
- 2001.10b
- /
- pp.91-93
- /
- 2001
본 논문은 웹 문서가 특정 주제와 관련된 정보를 담고 있는지를 특정 주제의 단어와 다른 주제의 단어들 사이의 관계를 이용해 평가할 수 있는 방법을 제시하고자 한다. 특정 주제와 관련된 웹 문서에 단어$_{A}$와 단어$_{B}$가 그렇지 않은 웹 문서보다 나온 수가 더 많다면, 단어$_{A}$와 단어$_{B}$의 연결 관계는 특정 주제에 대해 Positive하다고 볼 수 있다. 반대의 경우에는 Negative하다고 볼 수 있다. 이러한 단어와 단어의 연결 관계를 수치화하여 특정 주제와 관련된 웹 문서의 평가에 사용할 수 있도록 WordNet과 BFN을 이용해 보고자 한다.
PDF

Assessment of English Essay Topic Suitability using Keyword of Instruction (문제 핵심 어휘를 이용한 영어 논술 주제 적합성 평가)

Goh, Dae-Ohk;Kim, Minjeong;Rim, Hae-Chang
- Annual Conference on Human and Language Technology
- /
- 2012.10a
- /
- pp.148-153
- /
- 2012
본 논문에서는 그동안 영어 자동 평가에서 다루지 않은 문제와 답안의 적합성에 대한 평가를 시도한다. 답안이 주어진 문제에 적합한지를 평가하기 위해 문제에서 내용어를 중심으로 핵심어를 추출하며, 이렇게 추출한 핵심어와 각 답안의 적합성을 코사인 상관계수를 이용하여 구해본다. 한 문제에서 추출 가능한 핵심어가 매우 한정되어 있으므로 추가적으로 워드넷의 관련어나 예시 답안을 활용하여 확장한 핵심어 목록으로 실험을 하였으며, 실험 결과를 통해 핵심어를 이용한 답안과 문제의 적합성 평가가 가능함을 보였다.
PDF

Search Result 470, Processing Time 0.024 seconds

이메일무단수집거부

이용약관

제 1 장 총칙

제 2 장 이용계약의 체결

제 3 장 계약 당사자의 의무

제 4 장 서비스의 이용

제 5 장 계약 해지 및 이용 제한

제 6 장 손해배상 및 기타사항

Detail Search

Image Search (β)