• Title/Summary/Keyword: search similarity

WordNet-Based Category Utility Approach for Author Name Disambiguation (저자명 모호성 해결을 위한 개념망 기반 카테고리 유틸리티)

  • Kim, Je-Min;Park, Young-Tack
    • The KIPS Transactions: Part B / v.16B no.3 / pp.225-232 / 2009
  • Author name disambiguation is essential for improving the performance of document indexing, retrieval, and web search. It resolves the conflict that arises when multiple authors share the same name label. This paper introduces a novel approach that exploits ontologies and WordNet-based category utility for author name disambiguation. Our method uses author knowledge in the form of a populated ontology with various types of properties: titles, abstracts, and co-authors of papers, and authors' affiliations. The author ontology was constructed semi-automatically for the artificial intelligence and semantic web areas using the OWL API and heuristics. Author name disambiguation determines the correct author from the candidate authors in the populated author ontology. Candidate authors are evaluated using the proposed WordNet-based category utility to resolve the ambiguity. Category utility is a tradeoff between intra-class similarity and inter-class dissimilarity of author instances, where author instances are described in terms of attribute-value pairs. WordNet-based category utility is proposed to exploit concept information in WordNet for semantic analysis during disambiguation. In experiments, the WordNet-based category utility increases the number of disambiguations by about 10% compared with plain category utility, and increases overall accuracy by around 98%.
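The abstract describes category utility only in words. As a rough illustration, the sketch below computes the standard (non-WordNet) category utility over instances given as attribute-value pairs. The function name, the data layout, and the sample attribute `area` are assumptions made for this example; the paper's WordNet-based variant is not reproduced here.

```python
from collections import Counter


def category_utility(clusters):
    """Standard category utility for a partition of instances.

    `clusters` is a list of non-empty clusters; each cluster is a list of
    instances, and each instance is a dict mapping attribute -> value.
    (This layout is an assumption made for this sketch.)
    """
    instances = [inst for cluster in clusters for inst in cluster]
    n = len(instances)

    def sum_sq_probs(group):
        # Sum over all attribute-value pairs of P(A_i = V_ij)^2 within `group`.
        counts = Counter((attr, val) for inst in group for attr, val in inst.items())
        return sum((c / len(group)) ** 2 for c in counts.values())

    baseline = sum_sq_probs(instances)
    total = 0.0
    for cluster in clusters:
        # Weight each cluster's gain in predictability by P(C_k).
        total += (len(cluster) / n) * (sum_sq_probs(cluster) - baseline)
    return total / len(clusters)


# Hypothetical author instances: a clean split scores higher than a mixed one.
clean = [[{"area": "ai"}, {"area": "ai"}], [{"area": "bio"}, {"area": "bio"}]]
mixed = [[{"area": "ai"}, {"area": "bio"}], [{"area": "ai"}, {"area": "bio"}]]
print(category_utility(clean), category_utility(mixed))
```

A higher score means that knowing the cluster makes attribute values more predictable than they are in the pooled data, which is the intra-class similarity versus inter-class dissimilarity tradeoff mentioned above.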

A study on reduction of sensibility dimension for selection of wallpaper (벽지 선택을 위한 감성 차원 축소에 관한 연구)

  • Chun Young-Min;Kim Soon-Young;Kim Sung-Hwan;Chung Sung-Suk
    • Science of Emotion and Sensibility / v.8 no.4 / pp.333-344 / 2005
  • Sensitivity adjectives describing wallpaper were collected in order to develop a model that can recommend wallpaper to customers. A large number of adjectives describing affective responses were collected from diverse sources such as questionnaire surveys, field surveys, and internet surveys. To find representative adjectives among those collected, we used several statistical analysis methods. We attempted to name the dimension axes through MDS (multidimensional scaling) analysis using the similarity matrix, and to find three or four reduced factors through factor analysis with varimax rotation. The analysis showed that the reduced factors could account for about 82% when the number of factors is three (popular, elegance, and passable) and about 93% when the number of factors is four (elegance, passable, beautiful, and affectionate). On the basis of this result, we expect that it can be used to develop a model for recommending wallpaper.

Construction of a Full-length cDNA Library from Korean Stewartia (Stewartia koreana Nakai) and Characterization of EST Dataset (노각나무(Stewartia koreana Nakai)의 cDNA library 제작 및 EST 분석)

  • Im, Su-Bin;Kim, Joon-Ki;Choi, Young-In;Choi, Sun-Hee;Kwon, Hye-Jin;Song, Ho-Kyung;Lim, Yong-Pyo
    • Horticultural Science & Technology / v.29 no.2 / pp.116-122 / 2011
  • In this study, we report the generation and analysis of 1,392 expressed sequence tags (ESTs) from Korean Stewartia (Stewartia koreana Nakai). A cDNA library was generated from young leaf tissue, and a total of 1,392 cDNAs were partially sequenced. EST and unigene sequence quality was determined by computational filtering, manual review, and BLAST analyses. After removing vector sequence and filtering for a minimum length of 100 nucleotides, 1,301 ESTs were retained. After assembly, a total of 893 unigenes, consisting of 150 contigs and 743 singletons, were identified. We also identified 95 new microsatellite-containing sequences among the unigenes and classified their structure according to repeat unit. According to a homology search with BLASTX against the NCBI database, 65% of ESTs were homologous to sequences with known function and 11.6% of ESTs matched sequences with putative or unknown function. The remaining 23.2% of ESTs showed no significant similarity to any protein sequence in the public database. Annotation-based searches against multiple databases, including wine grape and Populus sequences, helped to identify putative functions of ESTs and unigenes. Gene ontology (GO) classification showed that the most abundant GO terms were transport, nucleotide binding, and plastid, in terms of biological process, molecular function, and cellular component, respectively. The sequence data will be used to characterize potential roles of new genes in Stewartia and will serve as a useful genetic resource.

Bacterial Community Structure Shift Driven by Salinity: Analysis of DGGE Band Patterns from Freshwater to Seawater of Hyeongsan River, Korea (염도의 변화에 따른 미생물 군집의 변화: 경북 형산강 하류 미생물 군집 변화의 DGGE pattern 분석)

  • Beck, Bo Ram;Holzapfel, Wilhelm;Hwang, Cher Won;Do, Hyung Ki
    • Journal of Life Science / v.23 no.3 / pp.406-414 / 2013
  • The influence of a gradual increase in salinity on the diversity of aquatic bacteria in rivers was demonstrated. Denaturing gradient gel electrophoresis (DGGE) was used to analyze the shift in the bacterial community downstream in the Hyeongsan River until it joins the open ocean. Four water samples were taken from the river, showing a salinity gradient of 0.02%, 1.48%, 2.63%, and 3.62%. The samples were collected from four arbitrary stations at intervals of 2.91 km on average, and a DGGE analysis was performed. Based on the results of this analysis, phylogenetic similarity identification, tree analysis, and a comparison of the stations were performed. The results strongly suggested that the bacterial community response was concomitant with gradual changes in salinity, which implies that salt concentration is a major factor in shifting the microbiota in aquatic habitats. The results also imply a huge diversity in a relatively small area upstream from the river mouth, compared to that in open oceans or coastal regions. Therefore, areas downstream towards a river mouth or delta could be good starting points in the search for new bacterial species and strains ("biotypes").

MOLECULAR CLONING AND SEQUENCE ANALYSIS OF THE GENE FOR THE HEMIN-BINDING PROTEIN FROM Prevotella intermedia (Prevotella intermedia에서의 Hemin 결합 단백질 유전자의 분리 및 염기서열 분석)

  • Kim, Shin;Kim, Sung-Jo
    • Journal of the Korean Academy of Pediatric Dentistry / v.33 no.2 / pp.304-310 / 2006
  • Prevotella intermedia is one of the most frequently implicated pathogens in human periodontal disease and requires hemin for growth. This study identified a hemin-binding P. intermedia protein by expressing a P. intermedia genomic library in Escherichia coli, a bacterium that does not require or transport exogenous hemin. The genomic library of P. intermedia was constructed in plasmid pUC18, transformed into Escherichia coli strain $DH5{\alpha}$, and screened for recombinant clones with hemin-binding activity by plating onto hemin-containing agar. Approximately 5,000 recombinant E. coli colonies were screened on LB-amp-hemin agar, and a single clone (pHem1) exhibited a clearly pigmented phenotype. The 2.5 kb insert DNA of pHem1 was characterized by restriction enzyme mapping. Southern blot analysis of BamHI-, BglII-, EcoRI-, HindIII-, and PstI-digested P. intermedia DNA indicated that a single copy of the gene was present in the genome. Northern blot analysis revealed that the size of the transcript was approximately 1.8 kb. The cloned gene contained a single ORF encoding approximately 850 amino acid residues. A BLAST search of The Institute for Genomic Research database for genes with similar nucleotide sequences revealed no significant similarity. Further investigation is needed to clarify the mechanisms of heme uptake in P. intermedia.

Haplotype Diversity and Gene Flow of the Diamondback Moth, Plutella xylostella(L.) (Lepidoptera: Yponomeutidae), in Korea (배추좀나방(나비목: 집나방과)의 haplotype 다양성과 유전자 이동률)

  • 김익수;배진식;최광호;진병래;이경로;손흥대
    • Korean Journal of Applied Entomology / v.39 no.1 / pp.43-52 / 2000
  • A portion of the mitochondrial COI gene (438 bp) was sequenced from samples of Plutella xylostella from four localities in Korea to investigate the population genetic structure and its characteristics by measuring the magnitude of genetic diversity and the degree of gene flow among populations. Thirteen haplotypes, ranging in nucleotide divergence from 0.3% to 1.4%, were obtained from 21 individuals. The nucleotide divergence was similar to that in other related studies, but haplotype diversity was substantially higher (mean h = 0.81). The genetic distance between the geographically remote Cheju Island population and the two Kimhae populations, distant 1 lkm from each other, was not statistically significant (p<0.05). Instead, substantial to high female gene flow was detected (Nm = 2-30). One Hawaiian haplotype of the diamondback moth, obtained through a GenBank search, was also genetically similar to those obtained in this study. Collectively, the population genetic structure of the diamondback moth in Korea can be characterized by two aspects. First, diamondback moths in Korea possess overall moderate genetic divergence based on a high number of haplotypes. Second, the structure is characterized by high haplotype diversity within each population, due to long-distance dispersal with substantial dispersal power, and by the resulting genetic similarity among geographic populations.
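The abstract reports a mean haplotype diversity (h) without stating the estimator. A common choice is Nei's unbiased estimator, h = n/(n-1) * (1 - sum of squared haplotype frequencies). The sketch below assumes that estimator; the haplotype counts are hypothetical and are not the study's data.

```python
def haplotype_diversity(counts):
    """Nei's unbiased haplotype diversity for one population.

    `counts` lists how many sampled individuals carry each haplotype.
    Using this estimator is an assumption; the abstract does not name one.
    """
    n = sum(counts)
    if n < 2:
        raise ValueError("need at least two sampled individuals")
    sum_sq = sum((c / n) ** 2 for c in counts)
    return n / (n - 1) * (1.0 - sum_sq)


# Hypothetical population: five individuals carrying four haplotypes.
print(round(haplotype_diversity([2, 1, 1, 1]), 2))
```

Values near 1 mean that almost every sampled individual carries a different haplotype, and 0 means that all individuals share one.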

An Ontology-Driven Mapping Algorithm between Heterogeneous Product Classification Taxonomies (이질적인 쇼핑몰 환경을 위한 온톨로지 기반 상품 매핑 방법론)

  • Kim Woo-Ju;Choi Nam-Hyuk;Choi Dae-Woo
    • Journal of Intelligence and Information Systems / v.12 no.2 / pp.33-48 / 2006
  • The Semantic Web and its related technologies have been opening an era of information sharing via the Web. There are, however, several hurdles still to overcome in this new era, and one of the major hurdles is information integration, unless a single, huge, unified ontology that addresses everything in the world could be built and used. Particularly in the e-business area, the problem of information integration is of great concern for product search and comparison across various Internet shopping sites and e-marketplaces. To overcome this problem, we propose an ontology-driven mapping algorithm between heterogeneous product classification and description frameworks. We also performed a comparative evaluation of the proposed mapping algorithm against a well-known ontology mapping tool, PROMPT.

Development of Identity-Provider Discovery System leveraging Geolocation Information (위치정보 기반 식별정보제공자 탐색시스템의 개발)

  • Jo, Jinyong;Jang, Heejin;Kong, JongUk
    • Journal of the Korea Institute of Information and Communication Engineering / v.21 no.9 / pp.1777-1787 / 2017
  • Federated authentication (FA) is a multi-domain authentication and authorization infrastructure that enables users to access nationwide R&D resources with their home-organization accounts. To log in, an FA-enabled user selects a home organization from an identity-provider (IdP) discovery service and is then redirected to that organization. The discovery service allows a user to search for his/her home organization among all FA-enabled organizations. As the federation grows, users have more trouble finding their home organization. Therefore, a discovery service has to provide an intuitive way to make a fast IdP selection. In this paper, we propose a discovery system that leverages geographical information. The proposed system calculates the geographical proximity and text similarity between a user and organizations, which determine the order in which organizations are shown by the system. We also introduce server redundancy and a status monitoring method for non-stop service provision and improved federation management. Finally, we deployed the proposed system in a real service environment and verified its feasibility.
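The ordering idea in this abstract (geographical proximity combined with text similarity) can be sketched as below. Everything specific here is an assumption for illustration: the haversine distance, the `difflib` string ratio, the `weight` and `scale_km` parameters, and the sample organizations are not taken from the paper.

```python
import math
from difflib import SequenceMatcher


def haversine_km(lat1, lon1, lat2, lon2):
    """Great-circle distance in kilometres between two latitude/longitude points."""
    radius_km = 6371.0
    phi1, phi2 = math.radians(lat1), math.radians(lat2)
    dphi = phi2 - phi1
    dlmb = math.radians(lon2 - lon1)
    a = math.sin(dphi / 2) ** 2 + math.cos(phi1) * math.cos(phi2) * math.sin(dlmb / 2) ** 2
    return 2 * radius_km * math.asin(math.sqrt(a))


def rank_idps(user_pos, query, idps, weight=0.5, scale_km=100.0):
    """Order identity providers by a weighted mix of proximity and name similarity.

    `weight` and `scale_km` are hypothetical tuning knobs for this sketch.
    """
    def score(idp):
        distance = haversine_km(user_pos[0], user_pos[1], idp["lat"], idp["lon"])
        proximity = 1.0 / (1.0 + distance / scale_km)  # 1.0 at zero distance
        text = SequenceMatcher(None, query.lower(), idp["name"].lower()).ratio()
        return weight * proximity + (1.0 - weight) * text

    return sorted(idps, key=score, reverse=True)


# Hypothetical organizations and a user located in Seoul.
idps = [
    {"name": "Busan Example University", "lat": 35.23, "lon": 129.08},
    {"name": "Seoul Example University", "lat": 37.46, "lon": 126.95},
]
print([idp["name"] for idp in rank_idps((37.57, 126.98), "", idps)])
```

With an empty query only proximity contributes, so the nearest organization is listed first; as the user types, the text term can promote a more distant organization.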

An Analysis of Science Magazine in the View of Infographic (인포그래픽 관점을 이용한 과학 잡지 분석)

  • Jeon, Seongsoo;Jung, Jinkyu;Park, Jong-Ho
    • Journal of The Korean Association For Science Education / v.34 no.6 / pp.601-611 / 2014
  • The purpose of this study is to analyze, through time series analysis, how the Korean science magazine Science Donga uses infographics to present scientific facts, phenomena, and issues to readers, and to explore the application of infographics to science education. The criteria for the infographic analysis of Science Donga consisted of three categories (storytelling type, visual perception, and framework level), because infographics present complex information quickly and clearly by integrating various images, words, and graphics. We found that articles emphasizing science issues with images were published from 1986 to 2014. In particular, the number of articles including infographics rose sharply after 2008, so we set 2008 as $T_c$ (the critical time point). Compared with before 2008, articles including infographics after 2008 were more variously distributed and more frequently used in the storytelling type category (location-, time-, number-, connection-, function-, and process-based infographics) and in the visual perception category of Gestalt theory (proximity, similarity, continuation, and closure). Lastly, in the framework level category, location-, time-, number-, and process-based infographics mainly had the total range level, whereas function- and connection-based infographics changed in framework level. These three features (storytelling type, visual perception, and framework level) are important changes that influence $T_c$ in the infographic analysis of Science Donga. Through this study, we analyzed how the features of infographics changed from 1986 to 2014. We hope that the results suggest basic criteria for making materials that include infographics in science education.

Intelligent Diagnosis Assistant System of Capsule Endoscopy Video Through Analysis of Video Frames (영상 프레임 분석을 통한 대용량 캡슐내시경 영상의 지능형 판독보조 시스템)

  • Lee, H.G.;Choi, H.K.;Lee, D.H.;Lee, S.C.
    • Journal of Intelligence and Information Systems / v.15 no.2 / pp.33-48 / 2009
  • Capsule endoscopy is one of the most remarkable inventions of the last ten years. Because it causes less pain for patients, it has been considered a more convenient method than a conventional endoscope for diagnosing the entire digestive system. However, the diagnosis process is known to require a very long inspection time from clinical experts, because uncontrollable movement of the capsule endoscope produces many duplicate images of the same areas of the digestive system. In this paper, we propose a method that helps clinical diagnosticians obtain highly valuable information from capsule endoscopy video. Our software system builds three global maps in the temporal domain over the entire input video sequence: a movement map, a characteristic map, and a brightness map. The movement map can be used to effectively remove duplicated adjacent images. The characteristic and brightness maps provide frame content analyses that can be quickly used for segmenting regions or locating features (such as blood) in the stream. Our experiments show results for four patients with different health conditions. The resulting maps clearly capture the movements and characteristics of the image frames. Our method may help diagnosticians quickly find the locations of lesions, bleeding, or other areas of interest.
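The abstract does not say how the movement map detects duplicated adjacent images. One simple stand-in, shown purely as an assumption, is to drop a frame when its mean absolute pixel difference from the last kept frame falls below a threshold. The frame representation (flat lists of grayscale intensities) and the threshold value are hypothetical.

```python
def mean_abs_diff(frame_a, frame_b):
    """Mean absolute intensity difference between two equally sized frames."""
    return sum(abs(a - b) for a, b in zip(frame_a, frame_b)) / len(frame_a)


def drop_adjacent_duplicates(frames, threshold=5.0):
    """Return the indices of frames to keep.

    A frame is kept only if it differs from the most recently kept frame
    by more than `threshold` (a hypothetical tuning value).
    """
    kept = []
    for index, frame in enumerate(frames):
        if not kept or mean_abs_diff(frames[kept[-1]], frame) > threshold:
            kept.append(index)
    return kept


# Four tiny hypothetical frames: two near-identical dark ones, then two bright ones.
frames = [
    [10, 10, 10, 10],
    [11, 10, 9, 10],
    [200, 200, 200, 200],
    [199, 201, 200, 200],
]
print(drop_adjacent_duplicates(frames))
```

Comparing against the last kept frame, rather than the immediately preceding one, prevents a slow drift across many near-identical frames from going unnoticed.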
