• Title/Summary/Keyword: Information retrieval techniques

Search Result 276, Processing Time 0.03 seconds

A Survey of Information Searches on Internet (인터넷에서 정보 탐색에 대한 연구 조사)

  • 강병주;백혜승;최기선
    • Proceedings of the Korean Society for Information Management Conference
    • /
    • 1997.08a
    • /
    • pp.37-53
    • /
    • 1997
  • The huge size of Internet does not allow ordinary information seekers to search information with ease. Now, it is almost impossible to navigate the ocean of information without effective search tools. Web search engine has been the most effective technology for information retrieval on WWW. But recently, the need for new search tools on WWW or Internet has increased drastically. Currently, there are many on-going researches on the related topics. In this survey, we categorize the new search tools into four types: monitoring systems, filtering systems, browsing assistant systems, recommending systems. These example systems are examined. We are especially interested in WWW information filtering. It is studied how to apply the information filtering techniques to WWW, The application is not so straightforward like Email, Newswire filtering systems. As a result of this study, a simple WWW information filtering system is proposed.

  • PDF

Towards Improving Causality Mining using BERT with Multi-level Feature Networks

  • Ali, Wajid;Zuo, Wanli;Ali, Rahman;Rahman, Gohar;Zuo, Xianglin;Ullah, Inam
    • KSII Transactions on Internet and Information Systems (TIIS)
    • /
    • v.16 no.10
    • /
    • pp.3230-3255
    • /
    • 2022
  • Causality mining in NLP is a significant area of interest, which benefits in many daily life applications, including decision making, business risk management, question answering, future event prediction, scenario generation, and information retrieval. Mining those causalities was a challenging and open problem for the prior non-statistical and statistical techniques using web sources that required hand-crafted linguistics patterns for feature engineering, which were subject to domain knowledge and required much human effort. Those studies overlooked implicit, ambiguous, and heterogeneous causality and focused on explicit causality mining. In contrast to statistical and non-statistical approaches, we present Bidirectional Encoder Representations from Transformers (BERT) integrated with Multi-level Feature Networks (MFN) for causality recognition, called BERT+MFN for causality recognition in noisy and informal web datasets without human-designed features. In our model, MFN consists of a three-column knowledge-oriented network (TC-KN), bi-LSTM, and Relation Network (RN) that mine causality information at the segment level. BERT captures semantic features at the word level. We perform experiments on Alternative Lexicalization (AltLexes) datasets. The experimental outcomes show that our model outperforms baseline causality and text mining techniques.

The Retrieval of Abnormal TL Glow Curves Using Modified Glow Curve Analysis Method

  • Lee, Sang-Yoon;Lee, Kun-Jai;Kim, Jang-Lyul;Chang, Si-Young
    • Nuclear Engineering and Technology
    • /
    • v.29 no.5
    • /
    • pp.385-392
    • /
    • 1997
  • The shape of TL glow curve is a useful indicator for assurance of correct reading of the personal dosimeter. Since the reading procedure of TLD is irreversible, however, an analytic remedy should be considered to procure reliable dosimetric information for the readings with irregular glow con shape. In this study, kinetic trapping parameters of CaSO$_4$ : Dy Teflon personal dosimeter(Teledyne PB-6A) were analyzed by Halperin and Braner's model for general-order kinetics. From these kinetic tapping parameters, we also developed a simple procedure to retrieve the dosimetric information from abnormally distorted glow curves. The computerized glow curve deconvolution(CGCD) fitting of the reference glow curve with kinetic parameters from this study yields relative errors of about 5% from the expected integral. It was also found that the glow curve remedial procedure developed could retrieve the distorted TL glow curves within ewer ranges of 1575. With the glow curve retrieval techniques, doses incurred by gamma radiation can now be successfully re-constructed for the CaSO$_4$ : Dy Teflon dosimeter resulting abnormal glow curves.

  • PDF

A Dynamic Segmentation Method for Representative Key-frame Extraction from Video data (동적 분할 기법을 이용한 비디오 데이터의 대표키 프레임 추출)

  • Lee, Soon-Hee;Kim, Young-Hee;Ryu, Keun-Ho
    • Journal of the Institute of Electronics Engineers of Korea CI
    • /
    • v.38 no.1
    • /
    • pp.46-57
    • /
    • 2001
  • To access the multimedia data, such as video data with temporal properties, the content-based image retrieval technique is required. Moreover, one of the basic techniques for content-based image retrieval is an extraction of representative key-frames. Not only did we implement this method, but also by analyzing the video data, we have proven the proposed method to be both effective and accurate. In addition, this method is expected to solve the real world problem of building video databases, as it is very useful in building an index.

  • PDF

Semantic Ontology Speech Information Extraction using Non-parametric Correlation Coefficient (비모수적 상관계수를 이용한 시맨틱 온톨로지 음성 정보 추출)

  • Lee, Byungwook
    • Journal of Digital Convergence
    • /
    • v.11 no.9
    • /
    • pp.147-151
    • /
    • 2013
  • On retrieving high frequency keywords in information retrieval system, mismatchings to user's request are problems because of the various meanings of keywords in the existing ontology configuration. In this paper, it is to construct personnel selection ontology and rules in personnel management which are composed of various concepts and knowledges based on semantic web technology and suggest selection procedures to support these rules and knowledge retrieval system to verify suitability of selection results. This system utilizes a method of extraction of speech features by using non-parametric correlation coefficient. This proposed method has been validated by showing that the result average SNR of the experiment evaluation of the proposed techniques was shown to be decreased by .752dB.

Retrieval of video images based on Co-occurrence matrix (Co-occurrence matrix 기반 비데오 영상 검색)

  • 김규헌;정세윤;전병태;이재연;배영래
    • Proceedings of the Korean Information Science Society Conference
    • /
    • 1998.10c
    • /
    • pp.482-484
    • /
    • 1998
  • Abstract : Multimedia data now one of the widely used information in all the fields as the fast developments of computer techniques have been made. Traditional database systems based on textual information have limitations when applied to multimedia information. This is because simple textual descriptions are ambiguous and inadequate for searching multimedia information for multimedia databases and digital libraries. Thus, especially for image data, which is one of the important multimedia information types, which can retrieve and browse image data on the basis of pictorial queries. Therefore, this paper presents an efficient method for describing texture information in image data.

  • PDF

Efficient Linear Path Query Processing using Information Retrieval Techniques for Large-Scale Heterogeneous XML Documents (정보 검색 기술을 이용한 대규모 이질적인 XML 문서에 대한 효율적인 선형 경로 질의 처리)

  • 박영호;한욱신;황규영
    • Journal of KIISE:Databases
    • /
    • v.31 no.5
    • /
    • pp.540-552
    • /
    • 2004
  • We propose XIR-Linear, a novel method for processing partial match queries on large-scale heterogeneous XML documents using information retrieval (IR) techniques. XPath queries are written in path expressions on a tree structure representing an XML document. An XPath query in its major form is a partial match query. The objective of XIR-Linear is to efficiently support this type of queries for large-scale documents of heterogeneous schemas. XIR-Linear has its basis on the schema-level methods using relational tables and drastically improves their efficiency and scalability using an inverted index technique. The method indexes the labels in label paths as key words in texts, and allows for finding the label paths that match the queries far more efficiently than string match used in conventional methods. We demonstrate the efficiency and scalability of XIR-Linear by comparing it with XRel and XParent using XML documents crawled from the Internet. The results show that XIR-Linear is more efficient than both XRel and XParent by several orders of magnitude for linear path expressions as the number of XML documents increases.

Knowledge Representation in Knowledge-based Systems of Library and Information Science Field (문헌정보학 영역 지식기반시스템에서의 지식표현)

  • Jeong, Yeong-Mi
    • Journal of the Korean Society for information Management
    • /
    • v.7 no.2
    • /
    • pp.35-57
    • /
    • 1990
  • Knowledge-based system is interpreted from the viewpoint of library and information science, and the concept of knowledge is defined in relation to information and data. Knowledge representation techniques are illustrated with examples from intelligent information retrieval systems and expert systems developed in library and information science field.

  • PDF

A Proposal of Methods for Extracting Temporal Information of History-related Web Document based on Historical Objects Using Machine Learning Techniques (역사객체 기반의 기계학습 기법을 활용한 웹 문서의 시간정보 추출 방안 제안)

  • Lee, Jun;KWON, YongJin
    • Journal of Internet Computing and Services
    • /
    • v.16 no.4
    • /
    • pp.39-50
    • /
    • 2015
  • In information retrieval process through search engine, some users want to retrieve several documents that are corresponding with specific time period situation. For example, if user wants to search a document that contains the situation before 'Japanese invasions of Korea era', he may use the keyword 'Japanese invasions of Korea' by using searching query. Then, search engine gives all of documents about 'Japanese invasions of Korea' disregarding time period in order. It makes user to do an additional work. In addition, a large percentage of cases which is related to historical documents have different time period between generation date of a document and record time of contents. If time period in document contents can be extracted, it may facilitate effective information for retrieval and various applications. Consequently, we pursue a research extracting time period of Joseon era's historical documents by using historic literature for Joseon era in order to deduct the time period corresponding with document content in this paper. We define historical objects based on historic literature that was collected from web and confirm a possibility of extracting time period of web document by machine learning techniques. In addition to the machine learning techniques, we propose and apply the similarity filtering based on the comparison between the historical objects. Finally, we'll evaluate the result of temporal indexing accuracy and improvement.

Novel Speech Web Architecture Based on Information Selection Agent

  • Kwon, Hyeong-Joon;Kinoshita, Tetsuo
    • International Journal of Advanced Culture Technology
    • /
    • v.1 no.1
    • /
    • pp.11-14
    • /
    • 2013
  • In this paper, we propose a prototype of the SpeechWeb application using the information selection agent. We describe its design and implementation method and illustrated the processing results with the aid of some screenshots. Proposed SpeechWeb application presents the associated contents to the user by the aid of dynamic voice-anchors. These contents are presented using the apriori algorithm, which is one of data mining techniques. The application is better than the existing user-initiative structure from the viewpoint of making the user's interesting induction. Moreover, we believe that our proposed application is effective in information retrieval through wired and wireless telephone networks.

  • PDF