• Title/Summary/Keyword: 용어중요도 (term importance)

Search Result 29, Processing Time 0.032 seconds

Detection of Porno Sites on the Web using Fuzzy Inference (퍼지추론을 적용한 웹 음란문서 검출)

  • 김병만;최상필;노순억;김종완
    • Journal of the Korean Institute of Intelligent Systems / v.11 no.5 / pp.419-425 / 2001
  • This paper presents a method for detecting pornographic documents on the internet. The proposed method applies a fuzzy inference mechanism to conventional information retrieval techniques. First, users provide several example pornographic sites, and candidate words representing pornographic documents are extracted from these documents. In this process, lexical analysis and stemming are performed. Then, several values, namely the term frequency (TF), the document frequency (DF), and the heuristic information (HI), are computed for each candidate word. Finally, fuzzy inference is performed on these three values to weight the candidate words. The weights of the candidate words are used to determine whether a given site is sexual or not. Experiments on a small test collection showed that the proposed method is useful for detecting sexual sites automatically.

Automatic Korean to English Cross Language Keyword Assignment Using MeSH Thesaurus (MeSH 시소러스를 이용한 한영 교차언어 키워드 자동 부여)

  • Lee Jae-Sung;Kim Mi-Suk;Oh Yong-Soon;Lee Young-Sung
    • The KIPS Transactions:PartB / v.13B no.2 s.105 / pp.155-162 / 2006
  • The medical thesaurus MeSH (Medical Subject Headings) has long been used as a controlled vocabulary for indexing English medical papers. In this paper, we propose an automatic cross-language keyword assignment method that assigns English MeSH index terms to the abstract of a Korean medical paper. We compare its performance with the indexing performance of human indexers and the authors. Index terms are assigned by first extracting Korean MeSH terms from the text, then converting them into the corresponding English MeSH terms, and finally calculating the importance of the terms to select the highest-ranked ones as keywords. For this process, an effective method for solving the spacing variant problem is proposed. Experiments showed that the method solved the spacing variant problem and reduced the thesaurus space by about 42%. The experiments also showed that the performance of automatic keyword assignment is much lower than that of human indexers but as good as that of the authors.
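
The three-step procedure in this abstract (extract Korean terms, map them to English MeSH terms, rank by importance) can be sketched roughly as follows. This is only an illustration under stated assumptions: the three-entry dictionary is a hypothetical stand-in for the thesaurus, whitespace stripping is one simple way to make spacing variants compare equal (the paper's actual technique is not described in the abstract), and raw occurrence count stands in for the unspecified importance score.

```python
from collections import Counter
from typing import List

# Hypothetical toy Korean-to-English mapping; a real system would load the full thesaurus.
KO_TO_EN_MESH = {
    "당뇨병": "Diabetes Mellitus",
    "고혈압": "Hypertension",
    "심근 경색": "Myocardial Infarction",
}


def strip_spaces(text: str) -> str:
    """Remove all whitespace so that spacing variants compare equal."""
    return "".join(text.split())


# Index the thesaurus by space-free keys (assumed normalization, not the paper's method).
NORMALIZED = {strip_spaces(ko): en for ko, en in KO_TO_EN_MESH.items()}


def assign_keywords(abstract: str, top_k: int = 3) -> List[str]:
    """Return up to top_k English terms, ranked by how often their Korean form occurs."""
    compact = strip_spaces(abstract)
    counts = Counter()
    for ko, en in NORMALIZED.items():
        occurrences = compact.count(ko)
        if occurrences:
            counts[en] += occurrences
    return [term for term, _ in counts.most_common(top_k)]


keywords = assign_keywords("심근경색 환자의 고혈압과 당뇨병. 고혈압 관리.")
```

Matching on space-free text lets "심근 경색" and "심근경색" hit the same entry, at the cost of possible false matches across word boundaries, which a real system would have to guard against.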

A Study on Automatic Indexing System Using natural language Processing, Statistical Technique, Relevance Verification (자연어 처리, 통계적 기법, 적합성 검증을 이용한 자동색인 시스템에 관한 연구)

  • Yu, Chun-Sik;U, Seon-Mi;Yu, Cheol-Jung;Lee, Jong-Deuk;Gwon, O-Bong;Kim, Yong-Seong
    • The Transactions of the Korea Information Processing Society / v.5 no.6 / pp.1552-1562 / 1998
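  • Existing automatic indexing techniques for Korean documents that rely on linguistic processing such as morphological analysis incur heavy overhead from part-of-speech ambiguity and compound-noun handling. In addition, the stopword lists used for stopword removal must be built separately for each subject field of the target documents and are very large. To address these problems, this paper performs a simplified morphological analysis on the text of each document, without compound-noun processing or strict disambiguation, to extract simple nouns. It then constructs a finite automaton from these simple nouns and proposes an automatic indexing technique that computes the importance of each index-term candidate from the constructed finite automaton and the term frequency of each noun. As a result, the overhead of handling part-of-speech ambiguity and compound nouns was reduced, and the performance of the proposed technique was verified in an experiment comparing the selected index terms with manually selected ones.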

Importance-Performance Analysis to Evaluate Historic Culture Festival -The Case of The Yeoncheon Jeongok Paleolithic Festival- (역사체험축제의 중요도-실행도 분석에 관한 연구 -연천전곡리 구석기축제를 중심으로-)

  • Park, Sang-Hyeon
    • The Journal of the Korea Contents Association / v.7 no.10 / pp.321-329 / 2007
  • Travel is shifting toward dynamic, experience-oriented family trips. Festivals are popular, and historic culture festivals in particular, whose themes are a historical event, period, or people, are preferred by families because they give visitors opportunities for education and experience. Evaluating a festival is important because it diagnoses problems and identifies areas for improvement. Among evaluation methods, Importance-Performance Analysis (IPA) is useful because priorities can easily be read from a visual matrix without complex statistical techniques or technical terminology. This research used IPA to evaluate the Yeoncheon Jeongok Paleolithic Festival, one of the popular historic culture festivals. In the results, 'unique food', 'rest facility', 'other convenience facility', 'hygiene', and 'crowding' fell in the 'concentrate' quadrant of the IPA matrix. Therefore, the festival manager should make efforts to develop unique food, build more rest and other convenience facilities, improve hygiene, and reduce crowding.
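
As a rough illustration of how an IPA matrix assigns attributes to quadrants, the sketch below splits the plane at the mean importance and mean performance. The scores are hypothetical and are not the study's data, the mean-based cut points are an assumption (some studies use the scale midpoint or median instead), and the quadrant labels follow the conventional IPA naming.

```python
from statistics import mean


def ipa_quadrants(scores):
    """Classify attributes; scores maps attribute -> (importance, performance)."""
    importance_cut = mean(imp for imp, _ in scores.values())
    performance_cut = mean(perf for _, perf in scores.values())
    result = {}
    for name, (imp, perf) in scores.items():
        if imp >= importance_cut and perf < performance_cut:
            result[name] = "concentrate here"
        elif imp >= importance_cut:
            result[name] = "keep up the good work"
        elif perf < performance_cut:
            result[name] = "low priority"
        else:
            result[name] = "possible overkill"
    return result


# Hypothetical survey means on a 5-point scale (illustrative only).
example_scores = {
    "unique food": (4.5, 2.8),
    "hygiene": (4.6, 3.0),
    "program variety": (4.4, 4.2),
    "souvenirs": (3.0, 2.9),
    "parking signage": (3.1, 4.1),
}
quadrants = ipa_quadrants(example_scores)
```

Attributes that land in "concentrate here" are rated important but perform poorly, which is why the abstract treats that quadrant as the manager's priority list.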

A Proofreader Matching Method Based on Topic Modeling Using the Importance of Documents (문서 중요도를 고려한 토픽 기반의 논문 교정자 매칭 방법론)

  • Son, Yeonbin;An, Hyeontae;Choi, Yerim
    • Journal of Internet Computing and Services / v.19 no.4 / pp.27-33 / 2018
  • When submitting a manuscript to a journal to present research results, researchers often have the manuscript proofread because proofreading helps communicate the results more effectively. Currently, most manuscript proofreading companies assign proofreaders manually, according to the subjective judgment of a matching manager. Therefore, in this paper, we propose a topic-based proofreader matching method for effective proofreading results. The proposed method consists of two steps. First, topic modeling is performed using Latent Dirichlet Allocation. In this process, the frequency of each document constituting the representative document of a user is determined according to the importance of the document. Second, user similarity is calculated using cosine similarity. Experiments on a real-world dataset confirmed that the proposed method outperforms the comparison method, and the validity of the matching results was verified through qualitative evaluation.
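
The second step, matching by cosine similarity, can be sketched as below. The sketch assumes each user is already represented by a topic-distribution vector such as an LDA model would produce. The vectors and the names `best_match`, `proofreader_a`, and `proofreader_b` are hypothetical, and the importance-weighted construction of the representative document is not shown because the abstract does not give its formula.

```python
import math


def cosine_similarity(a, b):
    """Cosine of the angle between two equal-length vectors; 0.0 if either is all zeros."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0 or norm_b == 0:
        return 0.0
    return dot / (norm_a * norm_b)


def best_match(author_topics, proofreaders):
    """Return the proofreader whose topic vector is most similar to the author's."""
    return max(
        proofreaders,
        key=lambda name: cosine_similarity(author_topics, proofreaders[name]),
    )


# Hypothetical three-topic distributions (illustrative only).
author = [0.7, 0.2, 0.1]
candidates = {
    "proofreader_a": [0.6, 0.3, 0.1],
    "proofreader_b": [0.1, 0.1, 0.8],
}
chosen = best_match(author, candidates)
```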

Term Weighting Using Date Information and Its Appliance in Automatic Text Classification (날짜 정보를 이용한 가중치 계산 방법을 적용한 자동 문서분류)

  • Shim, Bojun;Park, Jinwoo;Seo, Jungyun
    • Annual Conference on Human and Language Technology / 2007.10a / pp.169-173 / 2007
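  • The words that make up a sentence do not all carry equal importance in expressing its meaning. The information retrieval field has therefore long studied strategies for assigning different weights to words. Very common function words are classified as stopwords and excluded from consideration, a named-entity extractor is used to give proper nouns high weights, or weights are computed from the pattern and frequency with which a word appears in a document collection, as in TF-IDF. In such approaches, the same word has a weight that does not change regardless of the situation. This paper proposes a strategy in which the same word receives a high weight on dates when it is important and a low weight on other dates. The method is a general-purpose strategy that can be used in any information retrieval task. In particular, experiments confirmed that applying the proposed method to text classification improves classification accuracy over a baseline system that does not use it.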
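
For reference, the date-independent TF-IDF weighting that this abstract contrasts itself with can be sketched as follows. This uses one common variant (raw term count times the natural log of N/df). The toy documents are hypothetical, and the paper's date-dependent adjustment is not shown because the abstract does not specify its formula.

```python
import math
from collections import Counter


def tf_idf(docs):
    """docs: list of token lists. Returns one {term: weight} dict per document."""
    n_docs = len(docs)
    document_frequency = Counter()
    for tokens in docs:
        # Count each term once per document.
        document_frequency.update(set(tokens))
    weights = []
    for tokens in docs:
        term_frequency = Counter(tokens)
        weights.append({
            term: count * math.log(n_docs / document_frequency[term])
            for term, count in term_frequency.items()
        })
    return weights


# Hypothetical tokenized documents (illustrative only).
toy_docs = [["election", "vote", "vote"], ["election", "weather"]]
toy_weights = tf_idf(toy_docs)
```

Here "election" appears in every document and so receives weight zero no matter when the document was written, which is the fixed behavior the proposed date-aware strategy is meant to relax.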

The Study on Decision-making for Articles for the Tramper Ship (부정기선의 선용품 보급지 결정에 관한 연구)

  • Yun, Seok-Hwan;Park, Jin-Hee
    • Journal of Navigation and Port Research / v.44 no.4 / pp.354-361 / 2020
  • The term "articles for ship" is a general term for all relevant mechanical accessories (SPARE) and consumable materials (STORE) commonly used in ships. Ships are usually at sea, so it is difficult to respond rapidly to demand for these articles in an emergency. In particular, it is harder to determine the boarding location for tramper ships because their next sailing route is harder to predict in advance. The purpose of this study was to identify the important factors to be considered in determining the boarding location of tramper ships through a survey of ship owners and ship management companies. This information on the proposed supply procedures for each country and port would help supply articles for ships efficiently.

A Study on the Application to Network analysis on Importance of Author keyword based on Sequence of keyword (네트워크 분석을 통한 저자키워드 출현순서에 대한 의미 분석)

  • Kwon, Sun-young
    • Journal of the Korea Convergence Society / v.9 no.9 / pp.9-14 / 2018
  • This study investigates the importance of author keywords by analyzing their position in the keyword list. We examined keyword importance using degree centrality, closeness centrality, betweenness centrality, and eigenvector centrality. Next, we analyzed the correlation between the network centrality measures and keyword position. As a result, all four centrality measures had high values for the author keyword in fourth position. Eigenvector centrality separated author keyword order comparatively more effectively than the other three centrality measures. The correlation analysis shows that the network analysis values increase with keyword order. This study is significant in that it examined author keyword behavior. Future research is needed to identify and supplement situational factors, behavior, and psychology.
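
Of the four measures named here, degree centrality is the simplest to illustrate. The sketch below assumes an undirected co-occurrence network in which two keywords are linked when they appear on the same paper. That construction and the sample keyword lists are assumptions, since the abstract does not say how the network was built.

```python
from itertools import combinations


def degree_centrality(keyword_lists):
    """Degree centrality (neighbor count / (n - 1)) in a keyword co-occurrence network."""
    neighbors = {}
    for keywords in keyword_lists:
        unique = set(keywords)
        for keyword in unique:
            neighbors.setdefault(keyword, set())
        # Link every pair of keywords that share a paper.
        for first, second in combinations(unique, 2):
            neighbors[first].add(second)
            neighbors[second].add(first)
    node_count = len(neighbors)
    if node_count < 2:
        return {keyword: 0.0 for keyword in neighbors}
    return {
        keyword: len(adjacent) / (node_count - 1)
        for keyword, adjacent in neighbors.items()
    }


# Hypothetical author-keyword lists, one per paper (illustrative only).
papers = [
    ["network analysis", "centrality", "keyword"],
    ["keyword", "indexing"],
]
centrality = degree_centrality(papers)
```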

Comparison on Nursing Importance and Performance of Nursing Interventions linked to Nursing Diagnoses-focused on 5 NANDA Nursing Diagnoses (간호진단과 연계된 간호중재의 중요도와 수행도 분석 - 5개 간호진단을 중심으로 -)

  • 이은주;최인희
    • Journal of Korean Academy of Nursing / v.33 no.2 / pp.210-219 / 2003
  • Purpose: The purpose of this study was to identify the nursing importance and performance of nursing interventions linked to five nursing diagnoses and to identify core nursing interventions for each of the five nursing diagnoses. The five nursing diagnoses were Pain, Diarrhea, Constipation, Hyperthermia, and Infection: Risk for. Method: Data were collected from nurses working in four different hospitals. Data were analyzed using mean, SD, and paired t-test to compare the difference between the importance and performance of each intervention. Result: In general, interventions related to medication, such as Medication Administration: IV, Medication Administration: IM, Medication Administration: Oral, and Medication Management, were all considered highly important and were performed very often regardless of nursing diagnosis. The level of importance was higher than the level of performance in most of the interventions linked to the five nursing diagnoses. Only two interventions, Medication Administration and Intravenous (IV) insertion, had a higher level of performance than importance, in the diagnoses of Pain and Diarrhea respectively. Conclusion: These findings show which interventions should be performed more frequently to solve nursing problems and which interventions are more critically important to each nursing diagnosis. This information can be very helpful for developing a nursing information system.
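
The paired t-test mentioned in the Method compares each nurse's importance rating with the same nurse's performance rating for one intervention. A minimal sketch of the test statistic follows. The ratings are hypothetical and are not the study's data, and turning the statistic into a p-value (using a t distribution with n - 1 degrees of freedom) is left out.

```python
import math
from statistics import mean, stdev


def paired_t_statistic(importance, performance):
    """t = mean(d) / (sd(d) / sqrt(n)) for the paired differences d = importance - performance."""
    if len(importance) != len(performance) or len(importance) < 2:
        raise ValueError("need two equal-length samples with at least two pairs")
    differences = [imp - perf for imp, perf in zip(importance, performance)]
    spread = stdev(differences)  # sample standard deviation of the differences
    if spread == 0:
        raise ValueError("differences have zero variance")
    return mean(differences) / (spread / math.sqrt(len(differences)))


# Hypothetical ratings from four nurses for one intervention (illustrative only).
t_value = paired_t_statistic([4, 5, 4, 5], [3, 3, 4, 4])
```

A positive statistic means importance was rated above performance on average, which is the pattern the abstract reports for most interventions.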

Measurement of CSF's Maturity for Korean e-Biz Market (한국 e-Biz 시장의 핵심성공요인 성숙도 측정)

  • Hong, Hyun-Gi
    • The Journal of the Korea Contents Association / v.7 no.7 / pp.161-170 / 2007
  • E-Business has become a common form of commercial transaction. Initially known as electronic commerce, e-Biz has expanded to department store shopping malls, travel, finance, stocks, and even luxury goods such as the car sales market. Considering these trends, this paper examined the environment of the Korean e-Biz market and outlined a picture of a mature and sound e-Biz market in Korea. We surveyed the maturity level of the Critical Success Factors (CSFs) of e-Biz in terms of management. We also surveyed time-based Critical Success Factors to analyze the level of the Korean e-Biz market. These studies may provide knowledge for predicting and preparing for future changes in the e-Biz market.