• Title/Summary/Keyword: Similarity comparison

Characterization and Expression Profile Analysis of a New cDNA Encoding Taxadiene Synthase from Taxus media

  • Kai, Guoyin;Zhao, Lingxia;Zhang, Lei;Li, Zhugang;Guo, Binhui;Zhao, Dongli;Sun, Xiaofen;Miao, Zhiqi;Tang, Kexuan
    • BMB Reports / v.38 no.6 / pp.668-675 / 2005
  • A full-length cDNA encoding taxadiene synthase (designated TmTXS), which catalyzes the first committed step in the Taxol biosynthetic pathway, was isolated from young leaves of Taxus media by rapid amplification of cDNA ends (RACE). The full-length cDNA of TmTXS had a 2586 bp open reading frame (ORF) encoding a protein of 862 amino acid residues. The deduced protein had an isoelectric point (pI) of 5.32 and a calculated molecular weight of about 98 kDa, similar to previously cloned diterpene cyclases from other Taxus species such as T. brevifolia and T. chinensis. Sequence comparison showed that TmTXS had high similarity to other members of the plant terpene synthase family. Tissue expression pattern analysis revealed that TmTXS was expressed strongly in leaves and weakly in stems, and that no expression could be detected in fruits. This is the first report on the mRNA expression profile of genes encoding key enzymes of the Taxol biosynthetic pathway in different tissues of Taxus plants. Phylogenetic tree analysis showed that TmTXS was most closely related to taxadiene synthase from T. baccata, followed by those from T. chinensis and T. brevifolia. Expression profiles revealed by RT-PCR under treatment with different chemical elicitors, namely methyl jasmonate (MJ), silver nitrate (SN), and ammonium ceric sulphate (ACS), were also compared for the first time. Expression of TmTXS was induced by all three treatments, most strongly by MJ, implying that TmTXS is highly elicitor-responsive.

Intelligent Web Crawler for Supporting Big Data Analysis Services (빅데이터 분석 서비스 지원을 위한 지능형 웹 크롤러)

  • Seo, Dongmin;Jung, Hanmin
    • The Journal of the Korea Contents Association / v.13 no.12 / pp.575-584 / 2013
  • The data types used for big-data analysis vary widely and include news, blogs, SNS, papers, patents, and sensed data. In particular, the use of web documents that offer reliable data in real time is gradually increasing. Web crawlers that collect web documents automatically have grown in importance because big data is being used in many different fields and web data are growing exponentially every year. However, existing web crawlers cannot collect all the web documents in a web site because they follow only the URLs included in documents already collected from that site. In addition, existing web crawlers may re-collect web documents already collected by other crawlers because information about the documents collected by each crawler is not managed efficiently across crawlers. Therefore, this paper proposes a distributed web crawler. To resolve the problems of existing web crawlers, the proposed crawler collects web documents through the RSS feed of each web site and the Google search API. The crawler provides fast crawling performance through a client-server model based on RMI and NIO that minimizes network traffic. Furthermore, it extracts the core content of a web document by a keyword similarity comparison on the tags included in the document. Finally, to verify the superiority of our web crawler, we compare it with existing web crawlers in various experiments.
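
The abstract above does not say how the keyword similarity comparison on tags is computed, so the following is only a minimal sketch of one plausible reading. It assumes that the keywords come from the page `<title>`, that similarity is the cosine of term counts, and that the core content is the block-level tag whose own text scores highest. The names `BLOCK_TAGS`, `BlockCollector`, and `extract_core_content` are hypothetical and not taken from the paper.

```python
import math
import re
from collections import Counter
from html.parser import HTMLParser

# Assumption: only these tags are treated as candidate content blocks.
BLOCK_TAGS = {"p", "div", "article", "section", "td", "li"}


def tokens(text):
    """Lower-case alphanumeric word tokens."""
    return re.findall(r"[a-z0-9]+", text.lower())


def cosine(a, b):
    """Cosine similarity between two term-count Counters."""
    dot = sum(a[t] * b[t] for t in a.keys() & b.keys())
    norm = math.sqrt(sum(v * v for v in a.values())) * math.sqrt(sum(v * v for v in b.values()))
    return dot / norm if norm else 0.0


class BlockCollector(HTMLParser):
    """Collects the <title> text and the text directly inside each block tag."""

    def __init__(self):
        super().__init__()
        self.title = ""
        self.blocks = []   # finished (tag, text) pairs
        self._stack = []   # currently open block tags with their text buffers
        self._in_title = False

    def handle_starttag(self, tag, attrs):
        if tag == "title":
            self._in_title = True
        elif tag in BLOCK_TAGS:
            self._stack.append((tag, []))

    def handle_endtag(self, tag):
        if tag == "title":
            self._in_title = False
        elif self._stack and self._stack[-1][0] == tag:
            name, parts = self._stack.pop()
            self.blocks.append((name, " ".join(parts)))

    def handle_data(self, data):
        if self._in_title:
            self.title += data
        elif self._stack and data.strip():
            # Text is credited to the innermost open block only.
            self._stack[-1][1].append(data.strip())


def extract_core_content(html):
    """Return the text of the block most similar to the title keywords."""
    parser = BlockCollector()
    parser.feed(html)
    parser.close()
    keywords = Counter(tokens(parser.title))
    scored = [(cosine(keywords, Counter(tokens(text))), text)
              for _, text in parser.blocks if text]
    return max(scored, key=lambda s: s[0], default=(0.0, ""))[1]
```

In this sketch, navigation bars and copyright footers usually share no terms with the title and score zero, which is why they drop out. A real crawler would likely need a richer keyword source and tag-specific weighting.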

A Combined Hough Transform based Edge Detection and Region Growing Method for Region Extraction (영역 추출을 위한 Hough 변환 기반 에지 검출과 영역 확장을 통합한 방법)

  • N.T.B., Nguyen;Kim, Yong-Kwon;Chung, Chin-Wan;Lee, Seok-Lyong;Kim, Deok-Hwan
    • Journal of KIISE:Databases / v.36 no.4 / pp.263-279 / 2009
  • Shape features in a content-based image retrieval (CBIR) system are divided into two classes: contour-based and region-based. Contour-based shape features are simple, but they are not as efficient as region-based shape features. Most systems that use region-based shape features have to extract the regions first. Prior work on region-based systems still has shortcomings: the systems are complex to implement, particularly with respect to region extraction, and they do not make sufficient use of the spatial relationships between regions in the distance model. In this paper, a region extraction method that combines an edge-based method and a region growing method is proposed to accurately extract the regions inside an object. Edges inside an object are accurately detected using the Canny edge detector and the Hough transform. A modified Integrated Region Matching (IRM) scheme that includes the adjacency relationships of regions is also proposed. It is used to compute the distance between images for similarity search using shape features. The experimental results show the effectiveness of our region extraction method as well as of the modified IRM. In comparison with other works, the new region extraction method is shown to outperform the others.

Characterization of a Multimodular Endo-β-1,4-Glucanase (Cel9K) from Paenibacillus sp. X4 with a Potential Additive for Saccharification

  • Lee, Jae Pil;Kim, Yoon A;Kim, Sung Kyum;Kim, Hoon
    • Journal of Microbiology and Biotechnology / v.28 no.4 / pp.588-596 / 2018
  • An endo-β-1,4-glucanase gene, cel9K, was cloned using the shotgun method from Paenibacillus sp. X4, which was isolated from alpine soil. The gene was 2,994 bp in length, encoding a protein of 997 amino acid residues with a predicted signal peptide of 32 amino acid residues. Cel9K was a multimodular enzyme, and the molecular mass and theoretical pI of the mature Cel9K were 103.5 kDa and 4.81, respectively. Cel9K contains the GGxxDAGD, PHHR, GAxxGG, YxDDI, and EVxxDYN motifs found in most glycoside hydrolase family 9 (GH9) members. Among enzymes with reported properties, the protein sequence showed the highest similarity (88%) to the cellulase of Bacillus sp. BP23. The enzyme was purified by chromatography using HiTrap Q, CHT-II, and HiTrap Butyl HP. Using SDS-PAGE/activity staining, the molecular mass of Cel9K was estimated to be 93 kDa, which corresponds to a truncated form produced by proteolytic cleavage of its C-terminus. Cel9K was optimally active at pH 5.5 and 50°C and showed a half-life of 59.2 min at 50°C. The CMCase activity increased to more than 150% in the presence of 2 mM Na⁺, K⁺, and Ba²⁺, but decreased significantly to less than 50% in the presence of Mn²⁺ and Co²⁺. The addition of Cel9K to a commercial enzyme set (Celluclast 1.5L + Novozym 188) increased the saccharification of pretreated reed and rice straw powders by 30.4% and 15.9%, respectively. The results suggest that Cel9K can be used as an additive to enhance the enzymatic conversion of lignocellulosic biomass to reducing sugars.

Efficient Multi-Step k-NN Search Methods Using Multidimensional Indexes in Large Databases (대용량 데이터베이스에서 다차원 인덱스를 사용한 효율적인 다단계 k-NN 검색)

  • Lee, Sanghun;Kim, Bum-Soo;Choi, Mi-Jung;Moon, Yang-Sae
    • Journal of KIISE / v.42 no.2 / pp.242-254 / 2015
  • In this paper, we address the problem of improving the performance of multi-step k-NN search using multidimensional indexes. Because lower-dimensional transformations lose information, existing multi-step k-NN search solutions produce a large tolerance (i.e., a large search range) and thus a large number of candidates, which are retrieved by a range query. These candidates incur heavy I/O and CPU overhead in the postprocessing step. To overcome this problem, we propose two efficient solutions that improve search performance by reducing the tolerance of the range query and, accordingly, the number of candidates. First, we propose a tolerance reduction-based (approximate) solution that forcibly decreases the tolerance, which is determined by a k-NN query on the index, by the average ratio of high- and low-dimensional distances. Second, we propose a coefficient control-based (exact) solution that uses c·k instead of k in the k-NN query to obtain a tighter tolerance and performs the range query using this tighter tolerance. Experimental results show that the proposed solutions significantly reduce the number of candidates and, accordingly, improve search performance in comparison with the existing multi-step k-NN solution.
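
The multi-step k-NN pipeline described in this abstract (low-dimensional k-NN query, tolerance, range query, postprocessing) can be sketched as follows. This is an illustration under stated assumptions, not the authors' implementation: the lower-dimensional transformation is assumed to be a projection onto the first `m` coordinates (which lower-bounds the Euclidean distance), a linear scan stands in for the multidimensional index, and the coefficient-control variant is assumed to take the tolerance as the k-th smallest true distance among c times k low-dimensional neighbours. The function names and the parameter `m` are hypothetical.

```python
import heapq
import math
import random


def project(vec, m):
    """Assumed lower-dimensional transformation: keep the first m coordinates."""
    return vec[:m]


def dist(a, b):
    """Euclidean distance."""
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))


def multi_step_knn(data, query, k, m, c=1):
    """Return (indices of the k nearest neighbours, number of range-query candidates).

    c=1 corresponds to the basic multi-step search; c>1 is the assumed
    coefficient-control variant with a tighter tolerance.
    """
    q_low = project(query, m)
    # Stand-in for the multidimensional index: low-dimensional distances by linear scan.
    low = [dist(project(v, m), q_low) for v in data]

    # Step 1: k-NN query with c*k neighbours in the low-dimensional space.
    seeds = heapq.nsmallest(min(c * k, len(data)), range(len(data)), key=low.__getitem__)

    # Step 2: tolerance = k-th smallest true (high-dimensional) distance among the seeds.
    tolerance = sorted(dist(data[i], query) for i in seeds)[k - 1]

    # Step 3: range query in the low-dimensional space using the tolerance.
    candidates = [i for i, d in enumerate(low) if d <= tolerance]

    # Step 4: postprocessing with true distances on the candidates only.
    result = heapq.nsmallest(k, candidates, key=lambda i: dist(data[i], query))
    return result, len(candidates)


# Small demonstration on random data (values are arbitrary).
random.seed(0)
data = [[random.random() for _ in range(8)] for _ in range(200)]
query = [random.random() for _ in range(8)]
for c in (1, 4):
    neighbours, n_candidates = multi_step_knn(data, query, k=5, m=2, c=c)
    print(f"c={c}: neighbours={neighbours}, range-query candidates={n_candidates}")
```

Because the projected distance never exceeds the true distance, every true neighbour passes the range query, so the result stays exact for any c of at least 1. A larger c can only shrink the tolerance and therefore the candidate set, at the cost of more true-distance computations in step 2.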

Comparison of Community Structure of Fish Larvae in the Northern East China Sea in Normal and El Niño/La Niña Periods (엘리뇨/라니냐와 정상 기간 동중국해 북부해역의 자치어의 군집구조 비교)

  • Yoo, Joon-Taek;Choi, Jung-Hwa;Kim, Jin-Yeong;Kim, Jong-Bin;Choi, Kwang-Ho
    • Korean Journal of Fisheries and Aquatic Sciences / v.46 no.6 / pp.907-916 / 2013
  • The aim of this study was to compare community structure of larval fish species in the northern East China Sea during normal meteorological conditions in autumn 2009, during the El Niño period in 2009-2010, and during the La Niña period in 2010. Fifty taxa were recorded during the study period; the most dominant species were Benthosema pterotum and Gobiidae spp. In October 2008 during the normal period, warm water from the Tsushima Warm Current (TWC) intruded more into the surface and middle layers, and cold water affected by the Yellow Sea Cold Water (YSCW) intruded into the bottom layer. In October 2009 during the El Niño period, intrusion of the China Coastal Water (CCW), which has low salinity (<32.2 psu), was more apparent than intrusion of the TWC or YSCW. In October 2010 during the La Niña period, intrusion of the TWC and CCW was relatively weak, resulting in the lowest temperature and highest salinity observed during the study period in the eastern part of the study area. Hierarchical cluster, one-way ANOSIM (analysis of similarities), and SIMPER (similarity-percentages procedure) analyses provided two main results. First, the abundance of the most dominant larval fish species in autumn of the normal period was greater than that in autumn of the El Niño/La Niña periods, resulting in a significant difference in ichthyoplankton community structure between the periods. The abundance of Benthosema pterotum increased in the normal period, possibly influenced by the intrusion of cold water from the YSCW; the abundance of species residing in Korean waters (e.g., Gobiidae spp.) probably decreased during the El Niño/La Niña periods. The second finding was that the abundance of subtropical larval fish in autumn of the normal period was generally larger than that during autumn of the El Niño/La Niña periods. This could have been induced by the stronger intrusion of warm water from the TWC during the normal period. Although differences in oceanographic conditions between El Niño and La Niña periods were observed, the differences in ichthyoplankton community structure between the two periods were not significant.

Identification of Lactobacillus spp. associated with nematodes in peach farm soil (복숭아 농장 토양에서 Nematodes와 연관된 Lactobacillus spp.의 분리 및 동정)

  • Lee, Woo-Hyun;Choi, Jae Im;Lee, Jin Il;Lee, Won-Pyo;Yoon, Sung-Sik
    • Korean Journal of Microbiology / v.53 no.3 / pp.163-169 / 2017
  • Strains D4 and D5 were isolated from peach-rotten soil during the peach harvest season. The isolates were characterized morphologically and biochemically, and their identity was determined by 16S rRNA gene sequencing. The results showed that D4 has high similarity to Lactobacillus plantarum ATCC 14917ᵀ and Lactobacillus pentosus ATCC 8041ᵀ, at 99.05% and 98.98%, respectively. D5 was also similar to Lactobacillus pentosus ATCC 8041ᵀ and Lactobacillus plantarum ATCC 14917ᵀ, at 98.71% and 98.64%, respectively. In contrast, the isolates differed in carbohydrate utilization from Lactobacillus plantarum ATCC 14917ᵀ and Lactobacillus pentosus ATCC 8041ᵀ. In view of this, we performed VITEK MS matrix-assisted laser desorption ionization-time of flight mass spectrometry (MALDI-TOF MS) analysis, multiplex PCR fingerprinting, and random amplified polymorphic DNA (RAPD)-PCR to further confirm the identification of D4 and D5. The results of these analyses showed that both strains were most similar to Lactobacillus plantarum.

Geographical comparison on different methods for identification of Streptococcus parauberis isolated from cultured olive flounder, Paralichthys olivaceus (양식 넙치에서 분리한 Streptococcus parauberis의 동정방법에 따른 지역적 비교)

  • Cho, Mi-Young;Oh, Yun-Kyeong;Lee, Deok-Chan;Kim, Jae-Hoon;Park, Myoung-Ae
    • Journal of fish pathology / v.20 no.1 / pp.49-60 / 2007
  • Non-hemolytic Streptococcus parauberis strains isolated from diseased olive flounder, Paralichthys olivaceus, on the south coast of Korea were identified by physiological, biochemical, and genetic analyses in order to define their geographical differences. First, twelve strains of S. parauberis were identified among catalase-negative, Gram-positive cocci by a multiplex PCR assay. Phenotypic identification was performed with commercially available kits (API 20 Strep and API ZYM systems). Analysis of the API profiles identified the isolates as Lactococcus lactis, S. constellatus, or S. uberis. Moreover, S. parauberis isolated from olive flounder differed from the turbot isolate (X89967) in the β-glucuronidase test, but not in the Voges-Proskauer, arginine, hippurate, alkaline phosphatase, or pyrrolidonyl arylamidase tests. All S. parauberis isolates were sensitive to florfenicol, ampicillin, ofloxacin, and vancomycin but resistant to oxolinic acid, flumequine, nalidixic acid, and sulfisoxazole. However, the 16S rDNA sequences of the isolates showed 99% similarity to S. parauberis KCTC 3651 (AY584477) and great homogeneity among the flounder isolates.

Manufacturing and Characteristics of Korean Traditional Liquor, Hahyangju Prepared by Saccharomyces cerevisiae HA3 Isolated from Traditional Nuruk (전통 누룩으로부터 분리된 Saccharomyces cerevisiae HA3을 이용한 하향주의 제조 및 특성)

  • Jung, Hee-Kyoung;Park, Chi-Duck;Park, Hwan-Hee;Lee, Gee-Dong;Lee, In-Seon;Hong, Joo-Heon
    • Korean Journal of Food Science and Technology / v.38 no.5 / pp.659-667 / 2006
  • In order to standardize the manufacturing process of Hahyangju, a traditional Korean liquor, 29 yeast strains were isolated from traditional Nuruk. Strain N8 exhibited particularly strong resistance to sugar. Strains HA2, HA3, and HA4 grew successfully in medium containing 10% ethanol. In comparison with the growth exhibited by these strains in a yeast malt extract medium, the ethanol production rates for the three strains were 10.8%, 10.45%, and 10%, respectively, in a yeast malt extract medium containing 25% glucose. Based on these results, HA3 was selected for use in the manufacturing process of Hahyangju, and it was identified as a Saccharomyces cerevisiae strain with 97% ITS sequence similarity. The use of Saccharomyces cerevisiae HA3 caused a decrease in the lactic acid content, acidity, and growth of lactic acid bacteria in the fermentation mash. Following the addition of Saccharomyces cerevisiae HA3 to the manufacturing process of Hahyangju, the second fermentation mash showed a 22% increase in the alcohol production rate relative to traditional fermentation; however, the amino acidity, pH, and reducing sugar content showed little change. In sensory evaluation, Hahyangju fermented with S. cerevisiae HA3 also scored better than Hahyangju mashed by the traditional method.

On-line Handwriting Chinese Character Recognition for PDA Using a Unit Reconstruction Method (유닛 재구성 방법을 이용한 PDA용 온라인 필기체 한자 인식)

  • Chin, Won;Kim, Ki-Doo
    • Journal of the Institute of Electronics Engineers of Korea SP / v.39 no.1 / pp.97-107 / 2002
  • In this paper, we propose an implementation of on-line handwritten Chinese character recognition for mobile personal digital assistants (PDAs). We focus on developing an algorithm with high recognition performance under the restriction that a PDA has less memory and computing capacity than a PC. We therefore use an index matching method, which has a computational advantage for fast recognition, and we suggest a unit reconstruction method to minimize the memory needed to store the character models and to accommodate individual variations in stroke order and stroke number in handwritten Chinese characters. We set up a standard model consisting of 1800 characters using a set of pre-defined units. After preprocessing and feature extraction, the input data are compared by similarity with candidate characters selected on the basis of stroke number and region features. We consider the 1800 Chinese characters taught in middle and high schools in Korea. We use character sets from five persons, written in printed style, irrespective of stroke order and stroke number. In our experiments, we obtained an average recognition time of 0.16 seconds per character and a recognition rate of 94.3% on a PDA with a MIPS R4000 CPU.