• Title/Summary/Keyword: search keyword extraction (검색 키워드 추출)

Search Result 293, Processing Time 0.029 seconds

Design and Implementation of web Document Visualization System using FastMap (FastMap을 이용한 웹 문서 시각화 시스템의 설계 및 구현)

  • 문진석;손기락;김차성
    • Proceedings of the Korean Information Science Society Conference
    • /
    • 1999.10a
    • /
    • pp.33-35
    • /
    • 1999
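  • With the growth of the Internet, extracting only the information one needs from the vast amount provided every day takes considerable time and effort. To make finding information easier and more efficient, we implemented a web document visualization system. The system is motivated by the observation that users searching for information often revisit web documents they have searched before. To this end, it extracts the URL, keywords, and inter-document similarity of the web documents being visited through Internet Explorer and visualizes them. FastMap is used as the visualization algorithm. In this paper, FastMap is an algorithm that represents the similarity between web documents, that is, objects described by their relative distances, in a 2-dimensional space. By zooming in on the web document objects mapped nearby in the 2-dimensional space, documents similar to the one being visited can be found easily.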
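
The FastMap projection described in this abstract can be sketched roughly as follows. This is a minimal illustration, not the authors' implementation: the function name `fastmap`, the precomputed symmetric distance matrix `dist`, and the simple "farthest object" pivot heuristic are all assumptions made for the example.

```python
import math

def fastmap(dist, k=2):
    """Embed n objects into k dimensions from a symmetric distance matrix."""
    n = len(dist)
    coords = [[0.0] * k for _ in range(n)]

    def d2(i, j, dim):
        # Squared distance left over after removing the first `dim` coordinates.
        r = dist[i][j] ** 2
        for c in range(dim):
            r -= (coords[i][c] - coords[j][c]) ** 2
        return max(r, 0.0)  # clamp small negative values from rounding

    for dim in range(k):
        # Pivot heuristic (assumed): start at object 0, jump to the farthest twice.
        a = 0
        b = max(range(n), key=lambda j: d2(a, j, dim))
        a = max(range(n), key=lambda j: d2(b, j, dim))
        dab2 = d2(a, b, dim)
        if dab2 == 0.0:
            break  # every residual distance is zero; nothing left to project
        dab = math.sqrt(dab2)
        for i in range(n):
            # Cosine-law projection of object i onto the line through a and b.
            coords[i][dim] = (d2(a, i, dim) + dab2 - d2(b, i, dim)) / (2 * dab)
    return coords
```

With `k=2`, each document receives an (x, y) position, so documents with small pairwise distances end up near each other on screen.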

A Study on Radiological Image Retrieval System (방사선 의료영상 검색 시스템에 관한 연구)

  • Park, Byung-Rae;Shin, Yong-Won
    • Journal of radiological science and technology
    • /
    • v.28 no.1
    • /
    • pp.19-24
    • /
    • 2005
  • The purpose of this study was to design and implement a practical annotation-based radiological image retrieval system that supports accurate judgments on educational and image information for radiological technologists. To improve retrieval performance on large image databases, we present an indexing technique that combines the $B^+$-tree proposed by Bayer, used to index simple attributes, with an inverted file structure for medical text keywords taken from the additional descriptive information about radiological images. The proposed retrieval system was implemented in Delphi under Windows XP. End users, namely radiological technologists, can store simple attribute information (doctor name, operator name, body part, disease, and so on), additional text-based descriptions, and the radiological image itself, and can retrieve the desired results from large image databases by simple attributes and text keywords through a graphical user interface. Consequently, the proposed system can support effective clinical decisions on radiological images, reduce education time by organizing knowledge, and provide well-organized education in clinical fields. In addition, it could be developed into a decision support system by building a web-based integrated imaging system that includes general images and special contrast images.

Hierarchical Automatic Classification of News Articles based on Association Rules (연관규칙을 이용한 뉴스기사의 계층적 자동분류기법)

  • Joo, Kil-Hong;Shin, Eun-Young;Lee, Joo-Il;Lee, Won-Suk
    • Journal of Korea Multimedia Society
    • /
    • v.14 no.6
    • /
    • pp.730-741
    • /
    • 2011
  • With the development of the internet and computer technology, the amount of information available through the internet is increasing rapidly, and it is managed in document form. For this reason, research into methods for managing large numbers of documents effectively is necessary. Conventional document categorization methods use only the keywords of related documents for classification. This paper instead proposes a keyword extraction method based on association rules. The method extracts a set of related keywords associated with a document's category and classifies representative keywords using the classification rule proposed in this paper. In addition, this paper proposes a preprocessing method for efficient keyword creation and predicts the category of new documents. A classifier is designed and its performance is measured through experiments in order to improve the profile's classification performance. When predicting a category, substituting all the classification rules one by one is the major cause of reduced processing performance in a profile. Finally, this paper suggests an automatic categorization scheme that can be applied to a hierarchical category architecture, extended from a simple category architecture.

A Term Weight Mensuration based on Popularity for Search Query Expansion (검색 질의 확장을 위한 인기도 기반 단어 가중치 측정)

  • Lee, Jung-Hun;Cheon, Suh-Hyun
    • Journal of KIISE:Software and Applications
    • /
    • v.37 no.8
    • /
    • pp.620-628
    • /
    • 2010
  • With Internet use pervasive in everyday life, people can now retrieve a great deal of information through the web. However, exponential growth in the quantity of information on the web has limited the search performance of online search engines, which return piles of unwanted information. As a result, web users now need more time and effort than in the past to find the information they need. This paper suggests a method that uses query expansion to bring wanted information to web users quickly. In experiments where the search subject does not change, Popularity-based Term Weight Mensuration performs better than TF-IDF and Simple Popularity Term Weight Mensuration. When the subject changes during a search, the performance of Popularity-based Term Weight Mensuration changes less than that of the other methods.
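
The abstract does not define the popularity-based weighting, so the sketch below illustrates only the TF-IDF baseline it is compared against, applied to a simple form of query expansion. The function names `tf_idf` and `expand_query`, the pre-tokenized document lists, and the rule of appending the highest-weighted terms are assumptions made for illustration.

```python
import math
from collections import Counter

def tf_idf(docs):
    """Return one {term: weight} dict per tokenized document."""
    n = len(docs)
    # Document frequency: in how many documents each term appears.
    df = Counter(term for doc in docs for term in set(doc))
    weights = []
    for doc in docs:
        tf = Counter(doc)
        weights.append(
            {t: (c / len(doc)) * math.log(n / df[t]) for t, c in tf.items()}
        )
    return weights

def expand_query(query, docs, top_n=2):
    """Append the top_n highest-weighted terms that are not already in the query."""
    totals = Counter()
    for doc_weights in tf_idf(docs):
        for term, w in doc_weights.items():
            if term not in query:
                totals[term] += w
    # Sort by descending weight, then alphabetically so ties are deterministic.
    ranked = sorted(totals.items(), key=lambda kv: (-kv[1], kv[0]))
    return list(query) + [term for term, _ in ranked[:top_n]]
```

A term that appears in every document gets weight zero under this scheme, so it is unlikely to be chosen as an expansion term.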

The Development of Automatic Ontology Generation System Using Extended Search Keywords (검색 키워드 확장을 이용한 온톨로지 자동 생성 시스템 개발)

  • Shim, Joon;Lee, Hong-Chul
    • Journal of the Korea Academia-Industrial cooperation Society
    • /
    • v.10 no.6
    • /
    • pp.1220-1228
    • /
    • 2009
  • Ontologies, which are the core of the Semantic Web, are usually limited to specific domains or created by defining meanings and relationships heuristically. Creating an ontology is both very difficult and very time-consuming. In contrast with ontologies used in specific fields, an ontology for the Web entails an unlimited scope of knowledge and expression of information. Hence, it is hard to express information in the same way that ontologies are created in specific fields, and the automatic generation of ontologies plays a very important role in the Semantic Web. To generate ontologies automatically, this paper suggests methods to create and update ontologies by expanding keywords related to index terms, which are extracted through morphological analysis of the search keywords that users enter into search engines.

Extracting Alternative Word Candidates for Patent Information Search (특허 정보 검색을 위한 대체어 후보 추출 방법)

  • Baik, Jong-Bum;Kim, Seong-Min;Lee, Soo-Won
    • Journal of KIISE:Computing Practices and Letters
    • /
    • v.15 no.4
    • /
    • pp.299-303
    • /
    • 2009
  • Patent information search is used to check for the existence of earlier works. In patent information search, there are many reasons why a search fails to return appropriate information. This research proposes a method for extracting alternative word candidates in order to minimize search failures caused by keyword mismatch. Assuming that two words have similar meanings if they have similar co-occurring words, the proposed method uses the concept of concentration, association word sets, cosine similarity between association word sets, and a ranking modification technique. The performance of the proposed method is evaluated using a manually extracted list of alternative word candidates. Evaluation results show that the proposed method outperforms the document vector space model in recall.
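
The core idea of comparing association word sets by cosine similarity can be sketched as below. The concentration measure and the ranking modification step are not specified in the abstract and are left out. The function names, the document-level co-occurrence window, and the binary (set-based) form of cosine similarity are assumptions made for this example.

```python
import math
from collections import defaultdict

def association_sets(docs):
    """Map each word to the set of words it co-occurs with in any document."""
    assoc = defaultdict(set)
    for doc in docs:
        words = set(doc)
        for w in words:
            assoc[w] |= words - {w}
    return dict(assoc)

def set_cosine(a, b):
    """Cosine similarity of two sets treated as binary vectors."""
    if not a or not b:
        return 0.0
    return len(a & b) / math.sqrt(len(a) * len(b))

def alternative_candidates(word, assoc, top_n=3):
    """Rank other words by how similar their association sets are to `word`'s."""
    target = assoc.get(word, set())
    scored = [
        (other, set_cosine(target, others))
        for other, others in assoc.items()
        if other != word
    ]
    scored.sort(key=lambda s: (-s[1], s[0]))
    return [(other, score) for other, score in scored[:top_n] if score > 0]
```

Two words that never appear together can still score highly here, as long as they appear alongside the same neighbors, which is the situation keyword mismatch creates.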

A Study on Optimized Information Search Algorithm Using Java (Java를 이용한 정보 검색 최적화 알고리즘에 관한 연구)

  • 김용호;정종근;이윤배
    • Journal of the Korea Institute of Information and Communication Engineering
    • /
    • v.6 no.6
    • /
    • pp.797-804
    • /
    • 2002
  • As internet use has recently become widespread, centered on the multimedia-based WWW (World Wide Web) service, we can acquire a great deal of information from computer networks all over the world. Gathering information was the important problem before the internet became widespread, but in modern society, where internet use is common, acquiring correct information quickly has become the important problem. This paper examines the structure of internet search engines, designs a search engine that extracts optimized URLs, and establishes an implementation technique using Java, an object-oriented language. The proposed search engine maintains user convenience by offering keyword search and a simplified user interface. Although the number of sites it retrieves is small, the bad link rate of its search results is improved compared with existing domestic search engines.

A study on Similarity analysis of National R&D Programs using R&D Project's technical classification (R&D과제의 기술분류를 이용한 사업간 유사도 분석 기법에 관한 연구)

  • Kim, Ju-Ho;Kim, Young-Ja;Kim, Jong-Bae
    • Journal of Digital Contents Society
    • /
    • v.13 no.3
    • /
    • pp.317-324
    • /
    • 2012
  • Recently, coordinating similar national R&D programs has been emphasized from the viewpoint of R&D investment efficiency. However, existing similarity search methods, such as text-based similarity search using the keywords of R&D projects, have reached their limit because of variation in document quality. To overcome the limitations of text-based similarity search using keyword extraction, this study discusses using the technical classification of R&D projects as a new similarity search method for analyzing similarity between national R&D programs. To this end, it extracts the Science and Technology Standard Classification of R&D projects, which is collected during the national R&D survey and analysis, and creates a distinct vector model for each R&D program. The reliability of the approach is verified by calculating cosine-based and Euclidean distance-based similarity and comparing them with text-based similarity.
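
The comparison of programs by classification vectors can be sketched as follows. The classification codes in the usage note, the count-based vector, and the `1 / (1 + distance)` conversion from Euclidean distance to a similarity score are assumptions made for illustration. The abstract does not state how the vectors or the distance-based similarity were actually defined.

```python
import math
from collections import Counter

def classification_vector(project_codes, all_codes):
    """Count how many of a program's projects fall under each classification code."""
    counts = Counter(project_codes)
    return [counts[c] for c in all_codes]

def cosine_similarity(u, v):
    """Cosine of the angle between two vectors; 0.0 if either is all zeros."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    return dot / (norm_u * norm_v) if norm_u and norm_v else 0.0

def euclidean_similarity(u, v):
    """Map Euclidean distance into (0, 1]; identical vectors score 1.0."""
    distance = math.sqrt(sum((a - b) ** 2 for a, b in zip(u, v)))
    return 1.0 / (1.0 + distance)
```

For example, with hypothetical codes `["EA", "EB", "EC"]`, a program whose projects are classified `["EA", "EA", "EB"]` becomes the vector `[2, 1, 0]`, which can then be compared against every other program's vector.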

Development of geo-coding module prototype on water hazard information (수재해 정보 지오코딩 모듈 프로토타입 개발)

  • BAECK, Seung Hyub;PARK, Gwang-Ha;HWANG, Eui-Ho;CHAE, Hyo-Sok
    • Proceedings of the Korea Water Resources Association Conference
    • /
    • 2017.05a
    • /
    • pp.476-476
    • /
    • 2017
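  • When disasters such as levee failures and flooding caused by sudden heavy rain, or earthquakes, have occurred recently, what prevented further damage and helped residents evacuate was the continuous spread of on-site information and warning messages through SNS. SNS has evolved to the point where it can be used for disaster information. We aimed to develop a web service for providing disaster information that extracts water-hazard-related information from national disaster information and presents it as spatial information overlaid on various thematic maps. Filtering water hazard information first requires selecting relevant keywords; as basic keywords, the six regions and river names were selected with reference to the river list (하천일람표). In addition, about 300 water-hazard-related terms were added with reference to the water resources glossary of the Han River Flood Control Office and the water glossary published by (사)한국물학술단체연합회. The selected terms are first used to filter water-hazard-related information from the loaded database; unstructured data are filtered and then structured by searching for and extracting address information. The developed geocoding module is applied to the extracted address information to update the coordinate information of each water hazard item. The information is grouped and structured by water hazard type (drought, heavy rain, flood, and so on) and by date, and can be visualized on a map using the spatial information open platform API. Using the developed geocoding module, a water hazard geocoding table was built in the database from actual table information and tested. The service was verified with an XML web service test that takes as parameters the selected disaster type (flood or drought) and time information. Through this study, users can search for and check spatial disaster information selected by type and by date. The prototype water hazard geocoding module can be applied to the disaster information provision system within the core target system being developed by 수재해 정보 플랫폼 융합기술 연구단 (the water hazard information platform convergence technology research group), and is expected to enable a public service for water hazard information.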

Design and Implementation of Tag Coupling-based Boolean Query Matching System for Ranked Search Result (태그결합을 이용한 불리언 검색에서 순위화된 검색결과를 제공하기 위한 시스템 설계 및 구현)

  • Kim, Yong;Joo, Won-Kyun
    • Journal of the Korean Society for Information Management
    • /
    • v.29 no.4
    • /
    • pp.101-121
    • /
    • 2012
  • Since IR systems that adopt only the Boolean IR model cannot provide ranked search results, users have to check huge result sets one by one, which is time-consuming. This study proposes a method that provides ranked search results in the Boolean IR model by using coupling information between tags instead of index weight information. Because the proposed method uses document queries instead of general user queries, the key tags of a relevant document are extracted and used as queries. Various groups of Boolean queries based on tag couplings are created in the process of extracting queries. Ranked search results are obtained by matching that uses the differential information among the query groups and tag significance information. To demonstrate the usability of the proposed method, an experiment was conducted to find research trend analysis information for selected research information. Also, a service based on the proposed method was provided for a year to collect user feedback. The results showed high user satisfaction.