• Title/Summary/Keyword: 분야별 분류 (classification by field)

Search Result 818, Processing Time 0.034 seconds

A Passage Retrieval Method by Using Field-Associated Information (연상정보를 이용한 단락분할 방법)

  • Hong, Sung-Og;Lee, Samuel Sang-Kon
    • Proceedings of the Korea Information Processing Society Conference / 2003.05a / pp.497-500 / 2003
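  • Identifying the cues that signal each topic in a document that mixes several topics, and extracting a passage for each topic, plays an important role in information retrieval. We attempt passage segmentation using field-associated terms built on a well-defined field hierarchy. A field-associated term is a word that accurately evokes a specific field, and such terms can be built from a well-classified document collection. Using these terms, we propose a meaning-based passage extraction method that extracts parts of a document by related field. Focusing on topic continuity, the method tracks topic cues with a continuity score computed from the level (scope) of the field-associated terms and their consecutive occurrence, and it also takes topic shifts into account. The method clarifies the passage boundaries of each topic in a document and extracts passages by topic field. An experiment on 50 documents yielded 82% precision and 63% recall, which suggests that the method is practical.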


Text Mining-Based Emerging Trend Analysis for e-Learning Contents Targeting for CEO (텍스트마이닝을 통한 최고경영자 대상 이러닝 콘텐츠 트렌드 분석)

  • Kyung-Hoon Kim;Myungsin Chae;Byungtae Lee
    • Information Systems Review / v.19 no.2 / pp.1-19 / 2017
  • Original scripts of e-learning lectures for the CEOs of corporation S were analyzed using topic analysis, a text mining method. Twenty-two topics were extracted based on keywords chosen from five years of records, from 2011 to 2015. Research analysis was then conducted on various issues. Promising topics were selected through evaluation and element analysis of the members of each topic. In management and economics, members demonstrated high satisfaction and interest toward topics in marketing strategy, human resource management, and communication. Philosophy, history of war, and history demonstrated high interest and satisfaction in the field of humanities, whereas mind health showed high interest and satisfaction in the field of lifestyle. Studies were also conducted to identify topics on the proportion of content, but these studies failed to increase member satisfaction. In the field of IT, educational content responds sensitively to changes of the times, but it may not increase the interest and satisfaction of members. The present study found that content production for CEOs should draw out deep implications for value innovation through technology application instead of simply ending at the technical aspect of information delivery. Previous studies classified contents superficially, based on the name of the content program, when analyzing the status of content operation. However, text mining can derive deep content and subject classification based on the contents of unstructured script data. This approach can reveal current shortages and necessary fields if the service contents of the themes are displayed by year. This study was based on data obtained from influential e-learning companies in Korea. Obtaining practical results was difficult because data were not acquired from portal sites or social networking services. The content of e-learning trends of CEOs was analyzed. Data analysis was also conducted on the intellectual interests of CEOs in each field.

Classification and Allocation method of e-mail using possibility distribution and prediction (확률 분포와 추론에 의한 이메일 분류 및 정리 방법)

  • Go, Nam-Hyeon;Kim, Ji-Yun;Choi, Man-Kyu
    • Proceedings of the Korean Society of Computer Information Conference / 2016.07a / pp.95-96 / 2016
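  • This paper proposes a method for classifying and organizing e-mail using the Dirichlet distribution and a Bayesian inference model. E-mail classification began with the detection of spam, that is, unwanted advertising e-mail, but the steady growth in the volume sent and received and the diversification of content have blurred the criteria for distinguishing advertising from informational mail. What is needed is a method that classifies mail automatically by the topic of its content rather than a binary scheme such as spam detection. The proposed technique provides a way to predict the kinds of topics that e-mail content may cover, so the topic of a sent or received message can be assigned automatically. Beyond e-mail classification, the technique is expected to be applicable to analyzing business and market trends and, in information security, to classifying malware.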
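The abstract above does not give the model's details, so the following is only a minimal sketch of one way to combine a Dirichlet prior with Bayesian inference for topic assignment: a multinomial classifier whose word and topic probabilities are smoothed by a symmetric Dirichlet prior. It is a simplified, supervised stand-in and not the authors' method; the function names `train` and `classify`, the `alpha` value, and whitespace tokenization are all assumptions made for illustration.

```python
import math
from collections import Counter, defaultdict


def train(labeled_docs, alpha=1.0):
    """Count words per topic; alpha is an assumed symmetric Dirichlet prior."""
    word_counts = defaultdict(Counter)
    topic_counts = Counter()
    vocab = set()
    for topic, text in labeled_docs:
        tokens = text.lower().split()
        word_counts[topic].update(tokens)
        topic_counts[topic] += 1
        vocab.update(tokens)
    return {"words": word_counts, "topics": topic_counts,
            "vocab": vocab, "alpha": alpha}


def classify(model, text):
    """Return the topic with the highest posterior log-probability."""
    alpha = model["alpha"]
    vocab_size = len(model["vocab"])
    total_docs = sum(model["topics"].values())
    n_topics = len(model["topics"])
    best_topic, best_score = None, -math.inf
    for topic, n_docs in model["topics"].items():
        # Posterior mean of the topic probability under the Dirichlet prior.
        score = math.log((n_docs + alpha) / (total_docs + alpha * n_topics))
        total_words = sum(model["words"][topic].values())
        for token in text.lower().split():
            if token not in model["vocab"]:
                continue  # ignore words never seen in training
            count = model["words"][topic][token]
            score += math.log((count + alpha) / (total_words + alpha * vocab_size))
        if score > best_score:
            best_topic, best_score = topic, score
    return best_topic
```

With a handful of hand-labeled messages, `classify(train(docs), "project meeting tomorrow")` picks whichever topic's smoothed word distribution best explains the message. An unsupervised topic model would be needed if no labeled mail were available.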


Feature Selection and Extraction for Document Classifier for IT documents based on SVM (SVM기반 정보기술 문서분류를 위한 특성 선택 및 추출 기법)

  • 강윤희
    • Proceedings of the KAIS Fall Conference / 2001.11a / pp.75-78 / 2001
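  • This paper describes feature selection and extraction techniques for the automatic classification of web documents. With the rapid growth and spread of the Internet, the amount of information delivered through e-mail and the web is increasing exponentially, and so is the need for efficient document classification. We train an SVM on a set of terms extracted from documents in a web directory and then classify documents. The experimental documents were collected from itfind, a directory service system for the information and communications field. Experiments were run under three scenarios, and recall/precision and the misclassification rate were computed as performance measures for each scenario. The experiments evaluated how noise introduced while constructing the training vectors affects the classification of documents in other classes and showed that SVM-based document classification is robust.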
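The performance measures named in the abstract above can be computed per class from true and predicted labels. The sketch below uses the standard definitions of precision and recall. The abstract does not define its misclassification rate, so treating it as the fraction of all documents labeled incorrectly is an assumption, as are the function name `evaluate` and the example labels.

```python
def evaluate(true_labels, predicted_labels, target):
    """Return (precision, recall, misclassification rate) for one target class."""
    pairs = list(zip(true_labels, predicted_labels))
    tp = sum(1 for t, p in pairs if t == target and p == target)
    fp = sum(1 for t, p in pairs if t != target and p == target)
    fn = sum(1 for t, p in pairs if t == target and p != target)
    precision = tp / (tp + fp) if tp + fp else 0.0
    recall = tp / (tp + fn) if tp + fn else 0.0
    # Assumed definition: share of all documents whose predicted label is wrong.
    error_rate = sum(1 for t, p in pairs if t != p) / len(pairs)
    return precision, recall, error_rate


# Hypothetical labels for five documents in two classes.
truth = ["net", "net", "db", "db", "net"]
guess = ["net", "db", "db", "net", "net"]
precision, recall, error_rate = evaluate(truth, guess, target="net")
```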

A Study on the Developing Standard Classification of the National Knowledge and Information Resources (국가지식정보 자원 분류 체계 표준화 연구)

  • Ko Young-Man;Seo Tae-Sul;Cho Sun-Yeong
    • Journal of the Korean Society for Library and Information Science / v.40 no.3 / pp.151-173 / 2006
  • The purpose of this study is to draft a standard classification for the National Knowledge and Information Resources. As a result of the study, a standard classification system for national knowledge and information resources, named "Knowledge Classification" (KC), is suggested. KC consists of three classification systems: classification by subject, by type of resources, and by type of media. The classification by subject has 12 main classes, and each main class has divisions. Each main class consists of a major discipline or a group of related disciplines. The type of resources is classified into 10 types of content, numbered 0-9, and the media of knowledge are classified into 8 types, likewise numbered 0-7. In practice, the notation always consists of 2 characters and 2 digits. The first character designates the main class and the second character designates the division. The first digit designates the type of resources and the second digit designates the type of media.
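The notation rule described above (two characters followed by two digits) can be illustrated with a small parser. This is a sketch under stated assumptions: the abstract does not say which characters are used, so Latin letters are assumed, and the names `KCNotation` and `parse_kc` are hypothetical. Only the digit ranges 0-9 and 0-7 come from the abstract.

```python
import re
from typing import NamedTuple


class KCNotation(NamedTuple):
    main_class: str     # first character
    division: str       # second character
    resource_type: int  # first digit, 0-9
    media_type: int     # second digit, 0-7


# Assumption: both characters are Latin letters; the abstract does not specify.
_PATTERN = re.compile(r"([A-Za-z])([A-Za-z])([0-9])([0-7])")


def parse_kc(notation: str) -> KCNotation:
    """Split a four-symbol KC notation into its four facets."""
    match = _PATTERN.fullmatch(notation)
    if match is None:
        raise ValueError(f"not a valid KC notation: {notation!r}")
    main_class, division, resource, media = match.groups()
    return KCNotation(main_class, division, int(resource), int(media))
```

For example, `parse_kc("AB37")` yields main class `A`, division `B`, resource type 3, and media type 7, while `parse_kc("AB38")` raises `ValueError` because only eight media types (0-7) exist.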

The Construction of a Domain-Specific Sentiment Dictionary Using Graph-based Semi-supervised Learning Method (그래프 기반 준지도 학습 방법을 이용한 특정분야 감성사전 구축)

  • Kim, Jung-Ho;Oh, Yean-Ju;Chae, Soo-Hoan
    • Science of Emotion and Sensibility / v.18 no.1 / pp.103-110 / 2015
  • A sentiment lexicon is an essential element for expressing sentiment in a text or recognizing sentiment from a text. We propose a graph-based semi-supervised learning method to construct a sentiment dictionary as a set of sentiment lexicons. In particular, we focus on the construction of a domain-specific sentiment dictionary. The proposed method builds a graph from lexicons and the proximity among them, and the sentiments of lexicons whose sentiment values are already known are propagated to all of the lexicons on the graph. There are two typical types of sentiment lexicon, sentiment words and sentiment phrases, and we construct a sentiment dictionary by creating a graph for each type and inferring the sentiment of all sentiment lexicons. To verify the proposed method, we constructed a sentiment dictionary specific to the movie domain and conducted sentiment classification experiments with it. The results show that classification performance using this sentiment dictionary is better than that obtained with a typical general-purpose sentiment dictionary.
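The propagation step described above can be sketched as iterative label propagation on a weighted graph. The abstract does not give the update rule, so the weighted-average update, the clamping of seed words, the fixed iteration count, and the name `propagate` are assumptions for illustration. Edge weights stand in for the proximity between lexicons and are assumed to be positive.

```python
def propagate(edges, seeds, iterations=50):
    """Spread seed polarities over an undirected weighted graph.

    edges: {(word_a, word_b): positive weight}
    seeds: {word: known polarity in [-1.0, 1.0]}
    """
    neighbors = {}
    for (a, b), weight in edges.items():
        neighbors.setdefault(a, {})[b] = weight
        neighbors.setdefault(b, {})[a] = weight
    scores = {word: seeds.get(word, 0.0) for word in neighbors}
    for _ in range(iterations):
        updated = {}
        for word, adjacent in neighbors.items():
            if word in seeds:
                updated[word] = seeds[word]  # keep known labels fixed
                continue
            total = sum(adjacent.values())
            # Weighted average of the neighbors' current scores.
            updated[word] = sum(w * scores[n] for n, w in adjacent.items()) / total
        scores = updated
    return scores
```

Words that end with a positive score would go into the dictionary as positive entries and negative scores as negative entries. Any threshold for leaving near-zero words unlabeled is a further design choice that the abstract does not address.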

A THEMATIC SURVEY ON THE REPORTS PUBLISHED IN THE JOURNAL OF THE KOREAN ACADEMY OF PEDIATRIC DENTISTRY (역대 대한소아치과학회지 게재논문의 분야별 분포에 대한 조사)

  • Kim, Jae-Moon;Jeong, Tae-Sung;Kim, Shin
    • Journal of the Korean Academy of Pediatric Dentistry / v.29 no.2 / pp.270-277 / 2002
  • Since its founding in 1959, the KAPD has been well known as a pioneer in the research and clinical aspects of pediatric dentistry in Korea. Its official journal, the Journal of the KAPD, was first published in 1974 and has published a total of 956 articles up to now (March 2001). In this study, all the articles published in this journal were surveyed, focusing on their main themes and their chronological and thematic distribution. The thematic classification was made with reference to previous studies and renowned textbooks in pediatric dentistry. The results were as follows: 1. Research on dental materials and dental equipment has shown a continuous increase throughout the period. 2. Research on dental caries, caries prevention, and systemic disorders has consistently occupied a relatively high proportion. 3. Research on malocclusions and cysts/minor surgery showed an increasing tendency in the second period but is decreasing in the third period. 4. Research on craniofacial growth/development, tooth development/eruption, developmental disorders of teeth, and management of eruption space has shown a decreasing tendency. 5. Research on behavior, oral habits, and occlusion of primary-mixed dentition has shown a very low proportion, reaching no more than 1% throughout the period.


A Study on Improvement for Classification of Fiction to Enhance to Accessibility for Middle School Students (중학생의 소설 접근성을 증진시키기 위한 소설 분야 분류 개선 방안에 관한 연구)

  • Cho, Hye Chon;Chung, Yeon-Kyoung
    • Journal of the Korean Society for Information Management / v.35 no.1 / pp.61-82 / 2018
  • Fiction is the collection that students read and borrow most in school libraries. KDC has several limitations when students look for the fiction books they need. We therefore surveyed various cases of fiction classification used in libraries, bookstores, and publishers, as well as the fiction use behaviors of middle school students. Based on the results of the surveys, we proposed a better way of classifying fiction books according to user needs. In addition to the KDC number, color bands were attached according to genre so that users could easily find the desired books. These suggestions and other information will enhance the accessibility and discoverability of fiction books for middle school students and may be used as reference material for fiction classification in libraries, bookstores, and publishers in the future.

A Structure on Classification Service System of Internet Documents (인터넷 문서의 자동분류 서비스 시스템에 관한 구현)

  • Hwang Sung-Ha;Choi Kwang-Nam;Lee Dae-Kyu;Lee Sang-Ho
    • Proceedings of the Korea Contents Association Conference / 2005.11a / pp.66-71 / 2005
  • Using Internet information can be easy or difficult. Efforts to obtain useful information have led to the development of various techniques, such as search as well as information storage, classification, processing, and utilization. In particular, such developments are remarkable in agents for various uses and in classification and conversion among processing techniques. This study introduces a classification service system for Internet documents that covers processing from the repository of Internet information through automatic classification to the search service.


A Study on the Classification Guidelines of Modern Culture Heritages in Building and Facilities (근대 건축 및 시설물 문화유산 분류방안 연구)

  • Lee, Jeong-Soo;Yang, Seung-Hee
    • Journal of the Korea Academia-Industrial cooperation Society / v.16 no.9 / pp.6333-6344 / 2015
  • This study focused on the classification systems of modern architecture and facilities, reviewing the characteristics of domestic and foreign cultural heritage classification systems. The results are as follows: (1) A new classification system is necessary for recently emerging architecture and facilities that contain new functions, and it should reflect the new scope of cultural heritage, for example cultural landscapes. (2) Reviewing related spheres that can produce future cultural heritage, such as KDC and the Industrial Classification, together with foreign trends in cultural heritage, we classified 6 main categories: Politics & Diplomatics, Industry & Economy, Society & Life, Culture & Art, Technology & Science, and Military & Public Safety. (3) Under the main categories, we divided sub- and subject-categories according to the usages of objects to reflect the registered appreciations.