• 제목/요약/키워드: Topic Classification

검색결과 254건 처리시간 0.028초

공공기관 기록물 분류체계 재정비를 위한 지능화 방안: L 기관 사례를 중심으로 (An Intelligent Approach for Reorganization Record Classification Schemes in Public Institutions: Case Study on L Institution)

  • 임진솔;한희정;오효정
    • 정보관리학회지
    • /
    • 제40권2호
    • /
    • pp.137-156
    • /
    • 2023
  • 사회·정치적 패러다임의 변화에 따라 공공기관의 기관업무 및 직제는 시시각각 신설되거나 통합 또는 폐지된다. 효과적인 기록관리 관점에서는 이러한 변화를 반영하여 이전에 구축된 기록물 분류체계와 현행 업무 맥락이 적정한지 검토할 필요가 있다. 그러나 대부분 기관에서는 분류체계 재정비 과정이 실무담당자나 기관 기록물 담당자의 실무 경험적 판단에 의존한 수작업으로 진행되고 있어, 기업의 변화가 적시에 반영되거나 전체 큰 맥락을 통합적으로 파악하기가 어렵다. 이에 본 연구는 이러한 문제를 보완하고 나아가 기록의 효율적인 관리를 위해 자동화 및 지능화 기술을 활용한 기록물 분류체계 재정비 방안을 제안한다. 또한 제안된 방법론을 실제 공공기관에 적용하고, 도출된 결과물을 기관의 기능분류 담당 실무자와 면담을 수행하여 그 실효성과 한계점을 검증하였다. 이를 통해 재정비한 기록물 분류체계의 정확도와 신뢰도를 높여 기록물 관리의 표준화 실현을 도모하고자 한다.

A Comparative Study on the Intransitive Verb Alternation of English and Korean in the Aspectual Event Syntax

  • Khym, Han-Gyoo
    • International journal of advanced smart convergence
    • /
    • 제6권4호
    • /
    • pp.41-49
    • /
    • 2017
  • In this paper I applies Borer (1993)'s way of classifying English intransitive action verbs such as 'run', walk, among many others, to the corresponding Korean intransitive action verbs such as 'tali-ta' and 'keət-ta', and show how they are different from - or similar with - each other in terms of syntactic structures and verb classification. Unlike the English verb 'run' which can be classified into an unaccusative verb as well as an unergative verb in Borer's theory, the corresponding Korean verbs 'tali-ta' or 't'wi-ta' can behave not only as an unergative and unauucsative verb, but also it can behave as a transitive verb. Though Borer's perspective on classification of verb types may be thought of as somewhat radical mostly due to its heavy dependency on aspectual representation of a whole sentence which a verb is just part of, it is clearly suggesting a new and great insight into the controversial topic of classification of verb types. So it is worth adopting this insightful perspective for the analysis of corresponding Korean verbs and seeing if it also works for the Korean ones.

Detecting Malicious Social Robots with Generative Adversarial Networks

  • Wu, Bin;Liu, Le;Dai, Zhengge;Wang, Xiujuan;Zheng, Kangfeng
    • KSII Transactions on Internet and Information Systems (TIIS)
    • /
    • 제13권11호
    • /
    • pp.5594-5615
    • /
    • 2019
  • Malicious social robots, which are disseminators of malicious information on social networks, seriously affect information security and network environments. The detection of malicious social robots is a hot topic and a significant concern for researchers. A method based on classification has been widely used for social robot detection. However, this method of classification is limited by an unbalanced data set in which legitimate, negative samples outnumber malicious robots (positive samples), which leads to unsatisfactory detection results. This paper proposes the use of generative adversarial networks (GANs) to extend the unbalanced data sets before training classifiers to improve the detection of social robots. Five popular oversampling algorithms were compared in the experiments, and the effects of imbalance degree and the expansion ratio of the original data on oversampling were studied. The experimental results showed that the proposed method achieved better detection performance compared with other algorithms in terms of the F1 measure. The GAN method also performed well when the imbalance degree was smaller than 15%.

Biomedical Ontologies and Text Mining for Biomedicine and Healthcare: A Survey

  • Yoo, Ill-Hoi;Song, Min
    • Journal of Computing Science and Engineering
    • /
    • 제2권2호
    • /
    • pp.109-136
    • /
    • 2008
  • In this survey paper, we discuss biomedical ontologies and major text mining techniques applied to biomedicine and healthcare. Biomedical ontologies such as UMLS are currently being adopted in text mining approaches because they provide domain knowledge for text mining approaches. In addition, biomedical ontologies enable us to resolve many linguistic problems when text mining approaches handle biomedical literature. As the first example of text mining, document clustering is surveyed. Because a document set is normally multiple topic, text mining approaches use document clustering as a preprocessing step to group similar documents. Additionally, document clustering is able to inform the biomedical literature searches required for the practice of evidence-based medicine. We introduce Swanson's UnDiscovered Public Knowledge (UDPK) model to generate biomedical hypotheses from biomedical literature such as MEDLINE by discovering novel connections among logically-related biomedical concepts. Another important area of text mining is document classification. Document classification is a valuable tool for biomedical tasks that involve large amounts of text. We survey well-known classification techniques in biomedicine. As the last example of text mining in biomedicine and healthcare, we survey information extraction. Information extraction is the process of scanning text for information relevant to some interest, including extracting entities, relations, and events. We also address techniques and issues of evaluating text mining applications in biomedicine and healthcare.

Assisted Magnetic Resonance Imaging Diagnosis for Alzheimer's Disease Based on Kernel Principal Component Analysis and Supervised Classification Schemes

  • Wang, Yu;Zhou, Wen;Yu, Chongchong;Su, Weijun
    • Journal of Information Processing Systems
    • /
    • 제17권1호
    • /
    • pp.178-190
    • /
    • 2021
  • Alzheimer's disease (AD) is an insidious and degenerative neurological disease. It is a new topic for AD patients to use magnetic resonance imaging (MRI) and computer technology and is gradually explored at present. Preprocessing and correlation analysis on MRI data are firstly made in this paper. Then kernel principal component analysis (KPCA) is used to extract features of brain gray matter images. Finally supervised classification schemes such as AdaBoost algorithm and support vector machine algorithm are used to classify the above features. Experimental results by means of AD program Alzheimer's Disease Neuroimaging Initiative (ADNI) database which contains brain structural MRI (sMRI) of 116 AD patients, 116 patients with mild cognitive impairment, and 117 normal controls show that the proposed method can effectively assist the diagnosis and analysis of AD. Compared with principal component analysis (PCA) method, all classification results on KPCA are improved by 2%-6% among which the best result can reach 84%. It indicates that KPCA algorithm for feature extraction is more abundant and complete than PCA.

A Review of Domestic Research Trends Related to the International Classification of Functioning, Disability and Health (ICF): 2015-2020

  • Song, Ju-Min
    • 대한물리의학회지
    • /
    • 제16권3호
    • /
    • pp.65-80
    • /
    • 2021
  • PURPOSE: This study was conducted as a literature review to analyze the research trends related to the International Classification of Functioning, Disability and Health (ICF) in Korea from 2015 to 2020. METHODS: Precedent studies were searched with the search term "ICF" or "international classification of functioning, disability and health" from the databases of RISS, KISS, DBpia, and Pubmed. The inclusion criteria are that the studies have been carried out in Korea from 2015 to 2020 using ICF by researchers consisting of one or more Koreans and have been peer-reviewed. RESULTS: Of the total 269 studies, 107 that met the inclusion criteria were analyzed. It was found that these studies were published at a similar frequency each year. The most common area of expertise was identified as the clinical area (n = 67), followed by special education (n = 21) and social welfare (n = 13). The study subject groups were mostly patients (n = 39), disabled people (n = 25), and related experts (n = 13). The most common research topic was functioning evaluation (n = 49) and followed by a literature review (n = 29), and the most frequently used components in all the areas of expertise were activity and participation (n = 98), body function and structure (n = 73), and environmental factors (n = 61). CONCLUSION: For the past six years, domestic ICF-related research has been conducted in a wider range of expertise areas on more subdivised subject groups. Continuous research, development of standardized curricula and contents, and development of coding tools are considered to be important in vitalizing the use of the ICF.

Novel Category Discovery in Plant Species and Disease Identification through Knowledge Distillation

  • Jiuqing Dong;Alvaro Fuentes;Mun Haeng Lee;Taehyun Kim;Sook Yoon;Dong Sun Park
    • 스마트미디어저널
    • /
    • 제13권7호
    • /
    • pp.36-44
    • /
    • 2024
  • Identifying plant species and diseases is crucial for maintaining biodiversity and achieving optimal crop yields, making it a topic of significant practical importance. Recent studies have extended plant disease recognition from traditional closed-set scenarios to open-set environments, where the goal is to reject samples that do not belong to known categories. However, in open-world tasks, it is essential not only to define unknown samples as "unknown" but also to classify them further. This task assumes that images and labels of known categories are available and that samples of unknown categories can be accessed. The model classifies unknown samples by learning the prior knowledge of known categories. To the best of our knowledge, there is no existing research on this topic in plant-related recognition tasks. To address this gap, this paper utilizes knowledge distillation to model the category space relationships between known and unknown categories. Specifically, we identify similarities between different species or diseases. By leveraging a fine-tuned model on known categories, we generate pseudo-labels for unknown categories. Additionally, we enhance the baseline method's performance by using a larger pre-trained model, dino-v2. We evaluate the effectiveness of our method on the large plant specimen dataset Herbarium 19 and the disease dataset Plant Village. Notably, our method outperforms the baseline by 1% to 20% in terms of accuracy for novel category classification. We believe this study will contribute to the community.

지질용어 시소러스 시스템의 설계 및 구축 (Design and Implementation of Thesaurus System for Geological Terms)

  • 황재홍;지광훈;한종규;연영광;류근호
    • 한국지리정보학회지
    • /
    • 제10권2호
    • /
    • pp.23-35
    • /
    • 2007
  • 최근 정보 검색 분야에서 시맨틱 웹 기술에 따른 인터넷 용어사전과 더불어 시소러스의 필요성이 더욱 중요시되고 있다. 시소러스는 분류와 사전의 결합으로 상위 및 하위개념 사이의 전후관계를 명확히 하기 위해서 공식적으로 조직, 통제된 색인어의 어휘로 인간의 학습, 탐구활동 등 제반 지식활동의 대상이 되는 개념(용어)간의 관계를 표현한 지식구조의 토픽 맵이다. 하지만 시소러스가 용어의 통제 및 표준화와 더불어 정보를 능률적으로 처리하고 검색하는데 필수적인 수단으로 평가되고 있음에도 불구하고 아직까지 지질분야에서 우리말 시소러스가 없는 실정이다. 시소러스를 구축하기 위해서는 표준화되고 잘 정의된 지침이 필요하다. 이러한 표준화된 지침은 보다 효율적인 정보 관리를 가능하게 할 것이며, 정보 이용자 또한 보다 정확한 정보를 쉽고 편리하게 이용할 수 있게 될 것이다. 본 연구는 지질정보 중 가장 기본이 되는 용어 시소러스 시스템 구축 연구이다. 이를 위해서 첫째, 국내외 지질용어 표준화 동향을 살펴보았다. 둘째, 15개 분야에 대한 지질학적 주제를 정하고 각 주제에 대한 분류체계(안)를 마련하였다. 셋째, 지질용어 시소러스 분류체계를 바탕으로 지질용어 시소러스 명세서를 작성하였다. 마지막으로 이 명세서를 이용하여 인터넷기반 지질용어 시소러스 시스템을 설계하고 구축하였다.

  • PDF

근육의 경근 배속에 대한 국내 연구 고찰 (Classification of Muscles into Meridian Sinew: A Literature Review)

  • 문수정;김성하;이상훈
    • 한방재활의학과학회지
    • /
    • 제24권4호
    • /
    • pp.83-96
    • /
    • 2014
  • Objectives Although many studies explored the topic of meridian sinew in various perspectives and the term "meridian sinew" is widely used, the theory of meridian sinew is not applied for precise diagnosis and in-depth treatment in clinical practice. The aim of the study is to provide basic data classifying muscles into meridian sinew for future studies that investigate meridian sinew based on an anatomical basis. Methods Studies were identified with searches of six major Korean databases: OASIS, KoreaMed, KMBASE, KISS, NDSL and KoreanTK. Published primary studies classifying muscles into meridian sinew were included. Results A total of 20 studies met the inclusion criteria and were included in the analysis. Twelve studies conducted the classification of muscles into meridian sinew based on meridian/ acupoints distribution and six based on meridian sinew distribution, and two based on both. Muscles with fidelity level of 50 or more were 54 (85.7%) and muscles with 100 fidelity level were 7 (11.3%): occipitalis, adductor digiti minimi, frontalis, biceps femoris, rectus femoris, vatus lateralis and extensor digitorum longus. Conclusions Classification results of muscles into meridian sinew varied according to the classification criteria and interpretation of meridian sinew and acupoints distribution. To develop muscle sinew as a more useful theory in diagnosis and treatment, efforts should be made to reduce the gap between study results and build consensus on the anatomical entity of meridian sinew.

Crowd Activity Classification Using Category Constrained Correlated Topic Model

  • Huang, Xianping;Wang, Wanliang;Shen, Guojiang;Feng, Xiaoqing;Kong, Xiangjie
    • KSII Transactions on Internet and Information Systems (TIIS)
    • /
    • 제10권11호
    • /
    • pp.5530-5546
    • /
    • 2016
  • Automatic analysis and understanding of human activities is a challenging task in computer vision, especially for the surveillance scenarios which typically contains crowds, complex motions and occlusions. To address these issues, a Bag-of-words representation of videos is developed by leveraging information including crowd positions, motion directions and velocities. We infer the crowd activity in a motion field using Category Constrained Correlated Topic Model (CC-CTM) with latent topics. We represent each video by a mixture of learned motion patterns, and predict the associated activity by training a SVM classifier. The experiment dataset we constructed are from Crowd_PETS09 bench dataset and UCF_Crowds dataset, including 2000 documents. Experimental results demonstrate that accuracy reaches 90%, and the proposed approach outperforms the state-of-the-arts by a large margin.