• Title/Summary/Keyword: Indexing Model

Search Result 169, Processing Time 0.028 seconds

A Study on Automatic Text Categorization of Web-Based Query Using Synonymy List (유사어 사전을 이용한 웹기반 질의문의 자동 범주화에 관한 연구)

  • Nam, Young-Joon;Kim, Gyu-Hwan
    • Journal of Information Management
    • /
    • v.35 no.4
    • /
    • pp.81-105
    • /
    • 2004
  • In this study, the way of the automatic text categorization on web-based query was implemented. X2 methods based on the Supported Vector Machine were used to test the efficiency of text categorization on queries. This test is carried out by the model using the Synonymy List. 713 synonyms were extracted manually from the tested documents. As the result of this test, the precision ratio and the recall ratio were decreased by -0.01% and by 8.53%, respectively whether the synonyms were assigned or not. It also shows that the Value of F1 Measure was increased by 4.58%. The standard deviation between the recall and precision ratio was improve by 18.39%.

3-D Building Reconstruction from Standard IKONOS Stereo Products in Dense Urban Areas (IKONOS 컬러 입체영상을 이용한 대규모 도심지역의 3차원 건물복원)

  • Lee, Suk Kun;Park, Chung Hwan
    • KSCE Journal of Civil and Environmental Engineering Research
    • /
    • v.26 no.3D
    • /
    • pp.535-540
    • /
    • 2006
  • This paper presented an effective strategy to extract the buildings and to reconstruct 3-D buildings using high-resolution multispectral stereo satellite images. Proposed scheme contained three major steps: building enhancement and segmentation using both BDT (Background Discriminant Transformation) and ISODATA algorithm, conjugate building identification using the object matching with Hausdorff distance and color indexing, and 3-D building reconstruction using photogrammetric techniques. IKONOS multispectral stereo images were used to evaluate the scheme. As a result, the BDT technique was verified as an effective tool for enhancing building areas since BDT suppressed the dominance of background to enhance the building as a non-background. In building recognition, color information itself was not enough to identify the conjugate building pairs since most buildings are composed of similar materials such as concrete. When both Hausdorff distance for edge information and color indexing for color information were combined, most segmented buildings in the stereo images were correctly identified. Finally, 3-D building models were successfully generated using the space intersection by the forward RFM (Rational Function Model).

A Study on the Classification Scheme of the Internet Search Engine (인터넷 탐색엔진에 관한 연구)

  • 김영보
    • Journal of the Korean BIBLIA Society for library and Information Science
    • /
    • v.8 no.1
    • /
    • pp.197-227
    • /
    • 1997
  • The main purpose of this study is ① to settle and to analyze the classification of the Internet Search Engine comparitively, and ② to build the compatible model of Internet Search Engine classification in order to seek information on the Internet resources. specially in the branch of the Computers and Internet areas. For this study, four Internet Search Engine (Excite, 1-Detect, Simmany, Yahoo Korea!), Inspec Classification and two distionaries were used. The major findings and result of analysis are summarized as follows : 1. The basis of the classification is the scope of topics, the system logic, the clearness, the efficiency. 2. The scope of topics is analyzed comparitively by the number of items from each Search Engine. In the result, Excite is the most superior of the four 3. The system logic is analyzed comparitively by the casuality balance and consistency of the items from each Search Engine. In the result, Excite is the most superior of the four 4. The clearness is analyzed comparitively by the clearness and accuracy of items, the recognition of the searchers. In the result, Excite is the most superior of the four. 5 The efficiency is analyzed comparitively by the exactness of indexing and decreasing the effort of the searchers. In the result, Yahoo Korea! is the most superior of the four. 6 The compatible model of Internet Search Engine classification is estavlished to uplift the scope of topics, the system logic, the clearness, and the efficiency. The model divides the area mainly based upon the topics and resources using‘bookmark’and‘shadow’concept.

  • PDF

XML Structured Model of Tree-type for Efficient Retrieval (효율적인 검색을 위한 Tree 형태의 XML 문서 구조 모델)

  • Kim Young-Ran
    • Journal of the Korea Society of Computer and Information
    • /
    • v.9 no.4 s.32
    • /
    • pp.27-32
    • /
    • 2004
  • A XML Document has a structure which may be irregular The irregular document structure is difficult for users to know exactly. In this paper, we propose the XML document model and the structure retrieval method for efficient management and structure retrieval of XML documents. So we use fixed-sized LETID having the information of element, describe the structured information retrieval algorithm for parent and child element to represent the structured information of XML documents. Using this method, we represent the structured information of XML document efficiently. We can directly access to specific clement by simple operation, and process various queries. We expect the method to support various structured retrieval of specific element such as parent, child. and sibling elements.

  • PDF

A Syudy on the Biomedical Information Processing for Biomedicine and Healthcare (의료보건을 위한 의료정보처리에 관한 연구)

  • Jeong, Hyun-Cheol;Park, Byung-Jun;Bae, Sang-Hyun
    • Journal of Integrative Natural Science
    • /
    • v.2 no.4
    • /
    • pp.243-251
    • /
    • 2009
  • This paper surveys some researches to accomplish on bioinformatics. These researches wish to propose a database architecture combining a general view of bioinformatics data as a graph of data objects and data relationships, with the efficiency and robustness of data management and query provided by indexing and generic programming techniques. Here, these invert the role of the index, and make it a first-class citizen in the query language. It is possible to do this in a structured way, allowing users to mention indexes explicitly without yielding to a procedural query model, by converting functional relations into explicit functions. In the limit, the database becomes a graph, in which the edges are these indexes. Function composition can be specified either explicitly or implicitly as path queries. The net effect of the inversion is to convert the database into a hyperdatabase: a database of databases, connected by indexes or functions. The inversion approach was motivated by their work in biological databases, for which hyperdatabases are a good model. The need for a good model has slowed progress in bioinformatics.

  • PDF

Automatic extraction of similar poetry for study of literary texts: An experiment on Hindi poetry

  • Prakash, Amit;Singh, Niraj Kumar;Saha, Sujan Kumar
    • ETRI Journal
    • /
    • v.44 no.3
    • /
    • pp.413-425
    • /
    • 2022
  • The study of literary texts is one of the earliest disciplines practiced around the globe. Poetry is artistic writing in which words are carefully chosen and arranged for their meaning, sound, and rhythm. Poetry usually has a broad and profound sense that makes it difficult to be interpreted even by humans. The essence of poetry is Rasa, which signifies mood or emotion. In this paper, we propose a poetry classification-based approach to automatically extract similar poems from a repository. Specifically, we perform a novel Rasa-based classification of Hindi poetry. For the task, we primarily used lexical features in a bag-of-words model trained using the support vector machine classifier. In the model, we employed Hindi WordNet, Latent Semantic Indexing, and Word2Vec-based neural word embedding. To extract the rich feature vectors, we prepared a repository containing 37 717 poems collected from various sources. We evaluated the performance of the system on a manually constructed dataset containing 945 Hindi poems. Experimental results demonstrated that the proposed model attained satisfactory performance.

A Model to Forecast Rice Blast Disease Based on Weather Indexing (기상지수에 의한 벼도열병 예찰의 한 모델)

  • Kim Choong-Hoe;MacKenzie D. R.;Rush M. C.
    • Korean Journal Plant Pathology
    • /
    • v.3 no.3
    • /
    • pp.210-216
    • /
    • 1987
  • A computer program written to predict blast occurrence based on micro climatic events was developed and tested as an on-site microcomputer in field plots in 1984 and 1985. A microcomputer unit operating on alkaline batteries; continuously monitored air temperature, leaf wetness, and relative humidity; interpreted the microclimate information in relation to rice blast development and displayed daily values (0-8) of blast units of severity (BUS). Cumulative daily BUS values (CBUS) were highly correlated with blast development on the two susceptible cultivars, M-201 and Brazos grown in field plots. When CBUS values were used to predict the logit of disease proportions, the average coefficients of determination $(R^2)$ between these two factors were 71 to $91\%$, depending on cultivar and year. This was a significant improvement when compared to 61 to $79\%$ when days were used as a predictor of logit disease severity. The ability of CBUS to predict logit disease severity was slightly less with Brazos than M-201. This is significant inasmuch as Brazos showed field resistance at mid-sea­son. The results in this study indicate that the model has the potential for future use and that the model could be improved by incorporating other variables associated with host plants and pathogen races in addition to the key environmental variables.

  • PDF

The Study for the Realtime Noise Simulation Integration Model Applied to Traffic Simulation and Spatial Modeling (교통 시뮬레이션과 공간 모델링 기법을 적용한 실시간 소음 시뮬레이션 통합 모델에 대한 연구)

  • Kang, Tae-Wook;Cho, Yoon-Ho;Kim, In-Tai
    • International Journal of Highway Engineering
    • /
    • v.13 no.3
    • /
    • pp.111-119
    • /
    • 2011
  • The noise prediction model, KRON-2006, in South Korea has been developed for obtaining the average noise level. The model is based on an outdoor sound propagation method based on ISO9613 and ASJ Model-1998 and supports the analysis of the linear noise source, such as highway, for obtaining Leq. Because of that, the model can't obtain Lmax, Lmin from the time series noise profile based on traffic at every moment. In order to address this problem, the real time noise prediction model based on traffic simulation using GIS model and algorithm is proposed. It can predict the vehicle point noise level based on vehicle type, speed generated from traffic simulation by using headway and obtain Lmax, Lmin as integrating the noise profile generated from it at every moment. An evalution of the noise prediciton model using field measurements finds good agreement between predicted and measured noise levels at 1m, 8m, 15m from curb of the near side lane.

Designing and Evaluating Digital Video Storyboard Surrogates (디지털 영상 초록의 설계와 평가에 관한 연구)

  • Kim, Hyun-Hee;Kim, Yong-Ho;Ko, Su-Hyun
    • Journal of Korean Library and Information Science Society
    • /
    • v.38 no.4
    • /
    • pp.463-480
    • /
    • 2007
  • This study examines the design and utilization of video storyboard surrogates in the digital video libraries. To do this, first we constructed the arrangement model of key-frames for storyboard based on the FRBR model, image communication and PRECIS Indexing theories and evaluated the model using 6 sample videos and 26 participants. The study results show that the video storyboard surrogates based on the arrangement model has a higher accuracy value in terms of summary extraction than that of the sequential video storyboard. Moreover, watching both types of video storyboard one after another, especially browsing the sequential video storyboard first and then the arrangement model-based one, produces a remarkable increase in accuracy value of summary extraction. The study proposes two methods of utilizing the video storyboard surrogates in the digital video libraries: Designing a video browsing interface where users can use the sequential storyboard as a default and then the arrangement model-based one for re-watching; and utilizing the arrangement model-based storyboard as structured match sources of image-based queries.

  • PDF

Deep-Learning Approach for Text Detection Using Fully Convolutional Networks

  • Tung, Trieu Son;Lee, Gueesang
    • International Journal of Contents
    • /
    • v.14 no.1
    • /
    • pp.1-6
    • /
    • 2018
  • Text, as one of the most influential inventions of humanity, has played an important role in human life since ancient times. The rich and precise information embodied in text is very useful in a wide range of vision-based applications such as the text data extracted from images that can provide information for automatic annotation, indexing, language translation, and the assistance systems for impaired persons. Therefore, natural-scene text detection with active research topics regarding computer vision and document analysis is very important. Previous methods have poor performances due to numerous false-positive and true-negative regions. In this paper, a fully-convolutional-network (FCN)-based method that uses supervised architecture is used to localize textual regions. The model was trained directly using images wherein pixel values were used as inputs and binary ground truth was used as label. The method was evaluated using ICDAR-2013 dataset and proved to be comparable to other feature-based methods. It could expedite research on text detection using deep-learning based approach in the future.