• Title/Summary/Keyword: Genome similarity

Search Result 204, Processing Time 0.019 seconds

Heat Shock RNA 1, Known as a Eukaryotic Temperature-Sensing Noncoding RNA, Is of Bacterial Origin

  • Choi, Dongjin;Oh, Hye Ji;Goh, Chul Jun;Lee, Kangseok;Hahn, Yoonsoo
    • Journal of Microbiology and Biotechnology
    • /
    • v.25 no.8
    • /
    • pp.1234-1240
    • /
    • 2015
  • Heat shock RNA 1 (HSR1) is described as a "eukaryotic heat-sensing noncoding RNA" that regulates heat shock response in human and other eukaryotic cells. Highly conserved HSR1 sequences have been identified from humans, hamsters, Drosophila, Caenorhabditis elegans, and Arabidopsis. In a previous study, however, it was suggested that HSR1 had originated from a bacterial genome. HSR1 showed no detectible nucleotide sequence similarity to any eukaryotic sequences but harbored a protein coding region that showed amino-acid sequence similarity to bacterial voltage-gated chloride channel proteins. The bacterial origin of HSR1 was not convincible because the nucleotide sequence similarity was marginal. In this study, we have found that a genomic contig sequence of Comamonas testosteroni strain JL14 contained a sequence virtually identical to that of HSR1, decisively confirming the bacterial origin of HSR1. Thus, HSR1 is an exogenous RNA, which can ectopically trigger heat shock response in eukaryotes. Therefore, it is no longer appropriate to cite HSR1 as a "eukaryotic functional noncoding RNA."

Characteristics in Molecular Vibrational Frequency Patterns between Agonists and Antagonists of Histamine Receptors

  • Oh, S. June
    • Genomics & Informatics
    • /
    • v.10 no.2
    • /
    • pp.128-132
    • /
    • 2012
  • To learn the differences between the structure-activity relationship and molecular vibration-activity relationship in the ligand-receptor interaction of the histamine receptor, 47 ligands of the histamine receptor were analyzed by structural similarity and molecular vibrational frequency patterns. The radial tree that was produced by clustering analysis of molecular vibrational frequency patterns shows its potential for the functional classification of histamine receptor ligands.

Global Sequence Homology Detection Using Word Conservation Probability

  • Yang, Jae-Seong;Kim, Dae-Kyum;Kim, Jin-Ho;Kim, Sang-Uk
    • Interdisciplinary Bio Central
    • /
    • v.3 no.4
    • /
    • pp.14.1-14.9
    • /
    • 2011
  • Protein homology detection is an important issue in comparative genomics. Because of the exponential growth of sequence databases, fast and efficient homology detection tools are urgently needed. Currently, for homology detection, sequence comparison methods using local alignment such as BLAST are generally used as they give a reasonable measure for sequence similarity. However, these methods have drawbacks in offering overall sequence similarity, especially in dealing with eukaryotic genomes that often contain many insertions and duplications on sequences. Also these methods do not provide the explicit models for speciation, thus it is difficult to interpret their similarity measure into homology detection. Here, we present a novel method based on Word Conservation Score (WCS) to address the current limitations of homology detection. Instead of counting each amino acid, we adopted the concept of 'Word' to compare sequences. WCS measures overall sequence similarity by comparing word contents, which is much faster than BLAST comparisons. Furthermore, evolutionary distance between homologous sequences could be measured by WCS. Therefore, we expect that sequence comparison with WCS is useful for the multiple-species-comparisons of large genomes. In the performance comparisons on protein structural classifications, our method showed a considerable improvement over BLAST. Our method found bigger micro-syntenic blocks which consist of orthologs with conserved gene order. By testing on various datasets, we showed that WCS gives faster and better overall similarity measure compared to BLAST.

New Approach to Predict microRNA Gene by using data Compression technique

  • Kim, Dae-Won;Yang, Joshua SungWoo;Kim, Pan-Jun;Chu, In-Sun;Jeong, Ha-Woong;Park, Hong-Seog
    • Proceedings of the Korean Society for Bioinformatics Conference
    • /
    • 2005.09a
    • /
    • pp.361-365
    • /
    • 2005
  • Over the past few years, the complex and subtle roles of microRNA (miRNA) in gene regulation have been increasingly appreciated. Computational approaches have played one of important roles in identifying miRNAs from plant and animals, as well as in predicting their putative gene target. We present a new approach of comprehensive analysis of the evolutionarily conserved element scores and applied data compression technique to detect putative miRNA genes. We used the evolutionarily conserved elements [19] (see more detail on method and material) to calculate for base-by-base along the candidate pre-miRNA gene region by detecting common conserved pattern from target sequence. We applied the data compression technique [20] to detect unknown miRNA genes. This zipping method devises, without loss of generality with respect to the nature of the character strings, a method to measure the similarity between the strings under consideration [20]. Our experience to using our new computational method for detecting miRNA gene identification (or miRNA gene prediction) has been stratified and we were able to find 28 putative miRNA genes.

  • PDF

Comparative Analysis of Large Genome in Human-Chimpanzee (인간-침팬지간 대량의 지놈서열 비교분석)

  • Kim, Tae-Hyung;Kim, Dae-Soo;Jeon, Yeo-Jin;Cho, Hwan-Gue;Kim, Heui-Soo
    • Proceedings of the Korean Society for Bioinformatics Conference
    • /
    • 2003.10a
    • /
    • pp.183-192
    • /
    • 2003
  • With the availability of complete whole-genomes such as the human, mouse, fugu and chimpanzee chromosome 22, comparative analysis of large genomes from cross-species at varying evolutionary distances is considered one of a powerful approach for identifying coding and functional non-coding sequences. Here we describe a fast and efficient global alignment method especially for large genomic regions over mega bases pair. We used an approach for identifying all similarity regions by HSP (Highest Segment Pair) regions using local alignments and then large syntenic genome based on the both extension of anchors at HSP regions in two species and global conservation map. Using this alignment approach, we examined rearrangement loci in human chromosome 21 and chimpanzee chromosome 22. Finally, we extracted syntenic genome 30 Mb of human chromosome 21 with chimpanzee chromosome 22, and then identified genomic rearrangements (deletions and insertions ranging h size from 0.3 to 200 kb). Our experiment shows that all jnsertion/deletion (indel) events in excess of 300 bp within chimpanzee chromosome 22 and human chromosome 21 alignments in order to identify new insertions that had occurred over the last 7 million years of evolution. Finally we also discussed evolutionary features throughout comparative analyses of Ka/ks (non-synonymous / synonymous substitutions) rate in orthologous 119 genes of chromosome 21 and 53 genes of MHC-I class in human and chimpanzee genome.

  • PDF

Genomic Analysis of the Extremely Halophilic Archaeon Halobacterium noricense CBA1132 Isolated from Solar Salt That Is an Essential Material for Fermented Foods

  • Lim, Seul Ki;Kim, Joon Yong;Song, Hye Seon;Kwon, Min-Sung;Lee, Jieun;Oh, Young Jun;Nam, Young-Do;Seo, Myung-Ji;Lee, Dong-Gi;Choi, Jong-Soon;Yoon, Changmann;Sohn, Eunju;Rahman, MD. Arif-Ur;Roh, Seong Woon;Choi, Hak-Jong
    • Journal of Microbiology and Biotechnology
    • /
    • v.26 no.8
    • /
    • pp.1375-1382
    • /
    • 2016
  • The extremely halophilic archaeon Halobacterium noricense is a member of the genus Halobacterium. Strain CBA1132 (= KCCM 43183, JCM 31150) was isolated from solar salt. The genome of strain CBA1132 assembled with 4 contigs, including three rRNA genes, 44 tRNA genes, and 3,208 open reading frames. Strain CBA1132 had nine putative CRISPRs and the genome contained genes encoding metal resistance determinants: copper-translocating P-type ATPase (CtpA), arsenical pump-driving ATPase (ArsA), arsenate reductase (ArsC), and arsenical resistance operon repressor (ArsR). Strain CBA1132 was related to Halobacterium noricense, with 99.2% 16S rRNA gene sequence similarity. Based on the comparative genomic analysis, strain CBA1132 has distinctly evolved; moreover, essential genes related to nitrogen metabolism were only detected in the genome of strain CBA1132 among the reported genomes in the genus Halobacterium. This genome sequence of Halobacterium noricense CBA1132 may be of use in future molecular biological studies.

Complete genome sequence of Gordonia sp. MMS17-SY073, a soil actinobacterium (토양 방선균인 Gordonia sp. MMS17-SY073 균주의 유전체 분석)

  • Kim, Yeong Seok;Kim, Seung Bum
    • Korean Journal of Microbiology
    • /
    • v.55 no.3
    • /
    • pp.303-305
    • /
    • 2019
  • An actinobacterial strain designated Gordonia sp. MMS17-SY073 (=KCTC 49257) was isolated from a coastal soil of an island, and its complete genome was analyzed. A single contig consisting of 5,962,176 bp with the G + C content of 67.4% was obtained, and the annotation resulted in 5,201 protein-coding genes, 6 rRNA genes and 45 tRNA genes. Strain MMS17-SY073 was closest to the type strain of Gordonia soli based on the 16S rRNA gene sequence comparison, sharing 98.5% sequence similarity. A number of biosynthetic gene clusters for secondary metabolites, non-ribosomal peptide synthetase types in particular, could be identified from the genome.

Molecular characterization and functional annotation of a hypothetical protein (SCO0618) of Streptomyces coelicolor A3(2)

  • Ferdous, Nadim;Reza, Mahjerin Nasrin;Emon, Md. Tabassum Hossain;Islam, Md. Shariful;Mohiuddin, A.K.M.;Hossain, Mohammad Uzzal
    • Genomics & Informatics
    • /
    • v.18 no.3
    • /
    • pp.28.1-28.9
    • /
    • 2020
  • Streptomyces coelicolor is a gram-positive soil bacterium which is well known for the production of several antibiotics used in various biotechnological applications. But numerous proteins from its genome are considered hypothetical proteins. Therefore, the present study aimed to reveal the functions of a hypothetical protein from the genome of S. coelicolor. Several bioinformatics tools were employed to predict the structure and function of this protein. Sequence similarity was searched through the available bioinformatics databases to find out the homologous protein. The secondary and tertiary structure were predicted and further validated with quality assessment tools. Furthermore, the active site and the interacting proteins were also explored with the utilization of CASTp and STRING server. The hypothetical protein showed the important biological activity having with two functional domain including POD-like_MBL-fold and rhodanese homology domain. The functional annotation exposed that the selected hypothetical protein could show the hydrolase activity. Furthermore, protein-protein interactions of selected hypothetical protein revealed several functional partners those have the significant role for the bacterial survival. At last, the current study depicts that the annotated hypothetical protein is linked with hydrolase activity which might be of great interest to the further research in bacterial genetics.

The Complete Mitochondrial Genome of Nysius plebeius Distant, 1883 (Heteroptera: Lygaeidae) from Korea (한국에 서식하는 애긴노린재(노린재목: 긴노린재과)의 미토콘드리아 전장 유전체)

  • Jiyeong Shin;Rameswor Maharjan;Hwijong Yi;Minkyu Jeong;Juil Kim
    • Korean journal of applied entomology
    • /
    • v.62 no.2
    • /
    • pp.83-87
    • /
    • 2023
  • Nysius plebeius is a major lygaeid pest of various cereal crops and ornamental plants in East Asian countries, including Korea. The complete mitochondrial genome of N. plebeius was characterized and found to comprise a total of 17,367 bp, which included 13 protein-coding genes, NADH dehydrogenase components (complex I, ND), cytochrome oxidase subunits (complex VI, COX), cytochrome oxidase b (CYPB), two ATP synthases, two ribosomal RNA genes, and 22 transfer RNAs. The GC content of 23%. It showed high sequence similarity to other Lygaeidae species, such as N. cymoides (94.5%), N. fuscovittatus (91.7%), and an unknown Nysius species (94.1%). This new N. plebeius mitochondrial genome can be widely used for evolutionary studies of Lygaeidae and to improve pest management practices.

Heterogeneity Analysis of the 16S rRNA Gene Sequences of the Genus Vibrio (Vibrio 속 16S rRNA 유전자 염기서열의 이질성 분석)

  • Ki, Jang-Seu
    • Korean Journal of Microbiology
    • /
    • v.45 no.4
    • /
    • pp.430-434
    • /
    • 2009
  • Bacterial 16S rRNA gene sequences have been widely used for the studies on molecular phylogeny, evolutional history, and molecular detections. Bacterial genomes have multiple rRNA operons, of which gene sequences sometimes are variable. In the present study, heterogeneity of the Vibrio 16S rRNA gene sequences were investigated. Vibrio 16S rRNA sequences were obtained from GenBank databases, considering the completion of gene annotation of Vibrio genome sequences. These included V. cholerae, V. harveyi, V. parahaemolyticus, V. splendidus, and V. vulnificus. Chromosome 1 of the studied Vibrio had 7~10 copies of the 16S rRNA gene, and their intragenomic variations were less than 0.9% dissimilarity (more than 99.1% DNA similarity). Chromosome 2 had none or single 16S rRNA gene. Intragenomic 16S rRNA genotypes were detected at least 5 types (V. vulnificus #CMCP6) to 8 types (V. parahaemolyticus #RIMD 2210633, V. harveyi #ATCC BAA-1116). These suggest that Vibrio has high heterogeneity of the 16S rRNA gene sequences.