• Title/Summary/Keyword: 텍스트 연구

Search Result 3,492, Processing Time 0.028 seconds

A Study on Automatic Database Selection Technique Using the Maximal Concept Strength Recognition Method (최대 개념강도 인지기법을 이용한 데이터베이스 자동선택 방법에 관한 연구)

  • Jeong, Do-Heon
    • Journal of the Korean Society for information Management
    • /
    • v.27 no.3
    • /
    • pp.265-281
    • /
    • 2010
  • The proposed method in this study is the Maximal Concept-Strength Recognition Method(MCR). In case that we don't know which database is the most suitable for automatic-classification when new database is imported, MCR method can support to select the most similar database among many databases in the legacy system. For experiments, we constructed four heterogeneous scholarly databases and measured the best performance with MCR method. In result, we retrieved the exact database expected and the precision value of MCR based automatic-classification was close to the best performance.

Procedural Entity Extraction for Procedural Knowledge on Medline Abstracts (의료 문헌에서의 절차적 지식 추출을 위한 단위 절차 추출 연구)

  • Song, Sa-Kwang;Oh, Heung-Seon;Choi, Yoon-Jung;Jang, He-Ju;Myaeng, Sung-Hyon;Choi, Sung-Pil;Choi, Yun-Soo
    • Proceedings of the Korean Information Science Society Conference
    • /
    • 2011.06a
    • /
    • pp.154-157
    • /
    • 2011
  • 본 연구는 2인의 전문의와 함께 의료 문헌의 초록을 분석하여 의료문서에서의 절차적 지식을 모델링하고 텍스트 마이닝 기법을 적용하여 절차적 지식을 추출하는 방법론에 대해 기술한다. 절차적 지식은 목적과 해법의 묶음으로, 해법은 다시 단위 절차 지식의 네트워크로 정의 하였고, 목적과 해법 정보 추출과 단위 절차 지식의 구성요소인 대상/행위/방법 개체를 인식하기 위해, 품사태깅, 구문분석, 술어-논항구조(Predicate-Argument Structure), 온톨로지 용어 매핑 정보 등에 기반한 기계학습 방법을 사용하였다. 실험을 위해 전문의와 함께 위함과 척추질환에 대한 1309 문서에 절차적 지식 태깅을 수행하였고, 이 문서 집합을 기반으로 목적/해법 추출 작업과 단위 절차 지식(대상질병/행위/적용방법) 추출 실험을 수행하여, 각각 82% 와 63%의 F-measure 값을 얻을 수 있었다.

Mobile Context Visualizer Design (모바일 컨텍스트 Visualizer 설계)

  • Kim, Wan-Ki;Kim, Moon-Kwon;Cheun, Du-Wan;Bae, Hyun-Joo;Keum, Chang-Sup;Kim, Soo-Dong
    • Proceedings of the Korean Information Science Society Conference
    • /
    • 2011.06a
    • /
    • pp.108-111
    • /
    • 2011
  • 최근 스마트폰을 이용하여 위치, 움직임, 주위 환경 정보 등과 같은 사용자의 상황을 인지하고, 인지한 컨텍스트를 기반으로 어플리케이션이 능동적으로 기능을 제공하기 위한 연구가 활발하다. 그러나 현재 켠텍스트 인지 모바일 어플케이션에서는 단일 사용자의 현재 컨텍스트에 국한되어 컨텍스트를 활용하고 있다. 본 논문에서는 제안하는 컨텍스트 Visualizer에서는 포괄적인 컨텍스트 정보의 활용을 위해 컨텍스트 정보를 개인, 그룹별로 분류하여 보여준다. 또한, 과거, 현재, 미래 컨텍스트 정보를 고려하여 화면에 보여줌으로써 컨텍스트의 활용 범위를 향상시킨다. 이를 위하여 온 논문에서는 모바일 컨텍스트 종류를 설명하고, 이를 보여주기 위한 컨텍스트 Visualizer의 설계를 제시한다. 또한 설계 모델을 기반으로 구현한 프로토타입을 보여주고, Visualizer의 활용을 제시함으로써 연구의 실효성을 보여준다.

A Study on the Reclassification of Author Keywords for Automatic Assignment of Descriptors (디스크립터 자동 할당을 위한 저자키워드의 재분류에 관한 실험적 연구)

  • Kim, Pan-Jun;Lee, Jae-Yun
    • Journal of the Korean Society for information Management
    • /
    • v.29 no.2
    • /
    • pp.225-246
    • /
    • 2012
  • This study purported to investigate the possibility of automatic descriptor assignment using the reclassification of author keywords in domestic scholarly databases. In the first stage, we selected optimal classifiers and parameters for the reclassification by comparing the characteristics of machine learning classifiers. In the next stage, learning the author keywords that were assigned to the selected articles on readings, the author keywords were automatically added to another set of relevant articles. We examined whether the author keyword reclassifications had the effect of vocabulary control just as descriptors collocate the documents on the same topic. The results showed the author keyword reclassification had the capability of the automatic descriptor assignment.

Clustering of Web Document Exploiting with the Co-link in Hypertext (동시링크를 이용한 웹 문서 클러스터링 실험)

  • 김영기;이원희;권혁철
    • Journal of Korean Library and Information Science Society
    • /
    • v.34 no.2
    • /
    • pp.233-253
    • /
    • 2003
  • Knowledge organization is the way we humans understand the world. There are two types of information organization mechanisms studied in information retrieval: namely classification md clustering. Classification organizes entities by pigeonholing them into predefined categories, whereas clustering organizes information by grouping similar or related entities together. The system of the Internet information resources extracts a keyword from the words which appear in the web document and draws up a reverse file. Term clustering based on grouping related terms, however, did not prove overly successful and was mostly abandoned in cases of documents used different languages each other or door-way-pages composed of only an anchor text. This study examines infometric analysis and clustering possibility of web documents based on co-link topology of web pages.

  • PDF

Measuring the Confidence of Human Disaster Risk Case based on Text Mining (텍스트마이닝 기반의 인적재난사고사례 신뢰도 측정연구)

  • Lee, Young-Jai;Lee, Sung-Soo
    • The Journal of Information Systems
    • /
    • v.20 no.3
    • /
    • pp.63-79
    • /
    • 2011
  • Deducting the risk level of infrastructure and buildings based on past human disaster risk cases and implementing prevention measures are important activities for disaster prevention. The object of this study is to measure the confidence to proceed quantitative analysis of various disaster risk cases through text mining methodology. Indeed, by examining confidence calculation process and method, this study suggests also a basic quantitative framework. The framework to measure the confidence is composed into four stages. First step describes correlation by categorizing basic elements based on human disaster ontology. Secondly, terms and cases of Term-Document Matrix will be created and the frequency of certain cases and terms will be quantified, the correlation value will be added to the missing values. In the third stage, association rules will be created according to the basic elements of human disaster risk cases. Lastly, the confidence value of disaster risk cases will be measured through association rules. This kind of confidence value will become a key element when deciding a risk level of a new disaster risk, followed up by preventive measures. Through collection of human disaster risk cases related to road infrastructure, this study will demonstrate a case where the four steps of the quantitative framework and process had been actually used for verification.

Mobile Web Magazine Design based on Kansei Engineering and Universal Design (간사이 공학을 적용한 유니버설 모바일웹 매거진 디자인)

  • Lee, Hyun-Ki;Yang, Janghoon
    • Journal of Digital Contents Society
    • /
    • v.18 no.7
    • /
    • pp.1227-1237
    • /
    • 2017
  • In this research, a design of mobile web magazine based on Kansei engineering is studied. Following the procedure of Kansei engineering, we executed user survey to discover latent emotion elements in mobile web magazine, which were 'attractive', 'open', and 'unique'. Corresponding design elements were found to be slide design, text layout, and proportion of image. The methodology of universal design was adopted to improves on those design elements so that the proposed design can be applicable to wide range of users. We could verify the potential of the proposed design method through expert interviews on the developed prototype mobile web magazine.

Visualization Techniques for Massive Source Code (대용량 소스코드 시각화기법 연구)

  • Seo, Dong-Su
    • The Journal of Korean Association of Computer Education
    • /
    • v.18 no.4
    • /
    • pp.63-70
    • /
    • 2015
  • Program source code is a set of complex syntactic information which are expressed in text forms, and contains complex logical structures. Structural and logical complexity inside source code become barriers in applying visualization techniques shown in traditional big-data approaches when the volume of source code become over ten-thousand lines of code. This paper suggests a procedure for making visualization of structural characteristics in source code. For this purpose, this paper defines internal data structures as well as inter-procedural relationships among functions. The paper also suggests a means of outlining the structural characteristics of source code by visualizing the source codes with network forms The result of the research work can be used as a means of controling and understanding the massive volume of source code.

A Study on the Design of Cyber lecture Component (가상강의 Component 설계에 관한 연구)

  • 강정배;김선경
    • Proceedings of the Korea Society of Information Technology Applications Conference
    • /
    • 2002.11a
    • /
    • pp.171-177
    • /
    • 2002
  • E-Loaming is a modem main teaching method starting from the concept of remote education. This research is aimed for proposing cyber education library system, and designing a cyber education component that becomes a basis for e-Learning system. Cyber education library is a storage system of cyber lectures that can supply high quality data to the needed developers. Cyber education component consists of 5 categories and those are text, voice, image, animation, and flash. By using this system, the developers can save the necessary time and effort in education development. This system also helps students. The students can access various lecture data on a given subject and select the best fit for them.

  • PDF

An Efficient BitmapInvert Index based on Relative Position Coordinate for Retrieval of XML documents (효율적인 XML검색을 위한 상대 위치 좌표 기반의 BitmapInvert Index 기법)

  • Kim, Tack-Gon;Kim, Woo-Saeng
    • Journal of the Institute of Electronics Engineers of Korea CI
    • /
    • v.43 no.1 s.307
    • /
    • pp.35-44
    • /
    • 2006
  • Recently, a lot of index techniques for storing and querying XML document have been studied so far and many researches of them used coordinate-based methods. But update operation and query processing to express structural relations among elements, attributes and texts make a large burden. In this paper, we propose an efficient BitmapInvert index technique based on Relative Position Coordinate (RPC). RPC has good preformance even if there are frequent update operations because it represents relationship among parent node and left, right sibling nodes. BitmapInvert index supports tort query with bitwise operations and does not casue serious performance degradations on update operations using PostUpdate algerian. Overall, the performance could be improved by reduction of the number of times for traversing nodes.