• Title/Summary/Keyword: COG (cluster of orthologous proteins)

Search Result 7, Processing Time 0.025 seconds

Conservative Genes among 1,309 Species of Prokaryotes (원핵생물 1,309종의 보존적 유전자)

  • Lee, Dong-Geun
    • Journal of Life Science
    • /
    • v.32 no.6
    • /
    • pp.463-467
    • /
    • 2022
  • As a result of applying the COG (Cluster of Orthologous Groups of Protein) algorithm to 1,309 species to confirm the conserved genes of prokaryotes, ribosomal protein S11 (COG0100) was identified. The numbers of conservative genes were 2, 5, 5, and 6 in 1,308, 1,307, 1,306, and 1,305 species, respectively. Twenty-nine genes were conserved in over 1,302 species, and they encoded 23 ribosomal proteins, 3 tRNA synthetases, 2 translation factors, and 1 RNA polymerase subunit. Most of them were related to protein production, suggesting the importance of protein expression in prokaryotes. The highest conservative COG was COG0048 (ribosomal protein S12) among the 29 COGs. The 29 conserved genes usually have one protein for each prokaryote. COG0090 (ribosomal protein L2) had not only the lowest conservation value but also the largest standard deviation of phylogenetic distance value. As COG0090 is not only a member of the ribosome, but also a regulator of replication and transcription, it could be inferred that prokaryotes have large variations in COG0090 to survive in various environments. This study could provide data necessary for basic science, tumor control, and development of antibacterial agents.

Metabolic Pathways of 1309 Prokaryotic Species in Relation to COGs (COG pathways에서 원핵생물 1,309종의 대사경로)

  • Lee, Dong-Geun;Kim, Ju-Hui;Lee, Sang-Hyeon
    • Journal of Life Science
    • /
    • v.32 no.3
    • /
    • pp.249-255
    • /
    • 2022
  • Metabolism is essential for survival and reproduction, and there is a metabolic pathways entry in the clusters of orthologous groups of proteins (COGs) database, updated in 2020. In this study, the metabolic pathways of 1309 prokaryotes were analyzed using COGs. There were 822 COGs associated with 63 metabolic pathways, and the mean for each taxon was between 200.50 (mollicutes) and 527.07 (cyanobacteria) COGs. The metabolic pathway composition ratio (MPCR) was defined as the number of COGs present in one genome in relation to the total number of COGs constituting each metabolic pathway, and the number of pathways with 100% MPCR ranged from 0 to 26 in each prokaryote. Among 1309 species, the 100% MPCR pathways included murein biosynthesis associated with cell wall synthesis (922 species); glycine cleavage (918); and ribosomal 30S subunit synthesis (903). The metabolic pathways with 0% MPCR were those involving photosystem I (1263 species); archaea/vacuolar-type ATP synthase (1028); and Na+-translocation NADH dehydrogenase (976). Depending on the prokaryote, three to 49 metabolic pathways could not be performed at all. The sequence of most highly conserved metabolic pathways was ribosome 30S subunit synthesis (96.1% of 1309 species); murein biosynthesis (86.8%); arginine biosynthesis (80.4%); serine biosynthesis (80.3%); and aminoacyl-tRNA synthesis (82.2%). Protein and cell wall synthesis have been shown to be important metabolic pathways in prokaryotes, and the results of this study of COGs related to such pathways can be utilized in, for example, the development of antibiotics and artificial cells.

Conservative Genes of Less Orthologous Prokaryotes (Orthologs 수가 적은 원핵생물들의 보존적 유전자)

  • Lee, Dong-Geun
    • Journal of Life Science
    • /
    • v.27 no.6
    • /
    • pp.694-701
    • /
    • 2017
  • Mycoplasma genitalium represents the smallest genome among mono-cultivable prokaryotes. To discover and compare the orthologs (conservative genes) among M. genitalium and 14 prokaryotes that are uncultivable and have less orthologs than M. genitalium, COG (clusters of orthologous groups of protein) analyses were applied. The analyzed prokaryotes were M. genitalium, one hyperthermophilic exosymbiotic archaeon Nanoarchaeum equitans, four intracellular plant pathogenic eubacteria of Candidatus Phytoplasma genus, and nine endosymbiotic eubacteria of phloem- and xylem-feeding insects. Among 367 orthologs of M. genitalium, 284 orthologs were conservative between M. genitalium and at least one other prokaryote. All 15 prokaryotes commonly have 29 orthologs, representing the significance of proteins in life. They belong to 25 translation-related, including 22 ribosomal proteins, 3 subunits of RNA polymerase, and 1 protein-folding-related. Among the 15 prokaryotes, 40 orthologs were only found in all four Candidatus Phytoplasma. The other nine Candidatus, all endosymbionts with insects, showed only a single common COG0539 (ribosomal protein S1), representing the diversity of orthologs among them. These results might provide clues to understand conservative genes in uncultivable prokaryotes, and may be helpful in industrial areas, such as handling prokaryotes producing amino acids and antibiotics, and as precursors of organic synthesis.

Conserved COG Pathways and Genes of 122 Species of Archaea (고세균 122종의 보존적 COG pathways와 유전자)

  • Dong-Geun Lee ;Sang-Hyeon Lee
    • Journal of Life Science
    • /
    • v.33 no.11
    • /
    • pp.944-949
    • /
    • 2023
  • The purpose of this study was to identify conserved metabolic pathways and conserved genes in 122 archaeal species. Using the Clusters of Orthologous Groups of Proteins (COG) database of conserved genes, we analyzed whether 122 species had 63 COG metabolic pathways, the 822 COGs that compose them, and a total of 4,877 COGs. Archaeal ribosomal proteins were the most conserved in metabolic pathways. 46 COGs in seven COG pathways among 63 COG pathways and 20 COGs in others were conserved in 122 species. Some genes involved in cell wall and extracellular matrix synthesis, replication, transcription, translation, and protein metabolism were common to all 122 species. When the distance value of the phylogenetic tree was analyzed at the phylum level or class level, the average was the lowest at the class Halobacteria of the phylum Euryarchaeota. Standard deviation was high for the class Nitosospharia of the phylum Thaumarchaeota, the unclassified members of phylum Thaumarchaeota, the class Halobacteria of the phylum Euryarchaeota, the class Thermoprotei of the phylum Crenarchaeota, and other archaea. Furthermore, the phylogenetic tree analysis revealed six commonalities. The results of this study, along with data on conserved genes, could be used for drug development and gene selection for strain improvement.

Investigation of Conservative Genes in 711 Prokaryotes (원핵생물 711종의 보존적 유전자 탐색)

  • Lee, Dong-Geun;Lee, Sang-Hyeon
    • Journal of Life Science
    • /
    • v.25 no.9
    • /
    • pp.1007-1013
    • /
    • 2015
  • A COG (Cluster of Orthologous Groups of proteins) algorithm was applied to detect conserved genes in 711 prokaryotes. Only COG0080 (ribosomal protein L11) was common among all the 711 prokaryotes analyzed and 58 COGs were common in more than 700 prokaryotes. Nine COGs among 58, including COG0197 (endonuclease III) and COG0088 (ribosomal protein L4), were conserved in a form of one gene per one organism. COG0008 represented 1356 genes in 709 of the prokaryotes and this was the highest number of genes among 58 COGs. Twenty-two COGs were conserved in more than 708 prokaryotes. Of these, two were transcription related, four were tRNA synthetases, eight were large ribosomal subunits, seven were small ribosomal subunits, and one was translation elongation factor. Among 58 conserved COGs in more than 700 prokaryotes, 50 (86.2%) were translation related, and four (6.9%) were transcription related, pointing to the importance of protein-synthesis in prokaryotes. Among these 58 COGs, the most conserved COG was COG0060 (isoleucyl tRNA synthetase), and the least conserved was COG0143 (methionyl tRNA synthetase). Archaea and eubacteria were discriminated in the genomic analysis by the average distance and variation in distance of common COGs. The identification of these conserved genes could be useful in basic and applied research, such as antibiotic development and cancer therapeutics.

Investigation of Conservative Genes in 168 Archaebacterial Strains (168개 고세균 균주들의 보존적 유전자에 관한 연구)

  • Lee, Dong-Geun;Lee, Sang-Hyeon
    • Journal of Life Science
    • /
    • v.30 no.9
    • /
    • pp.813-818
    • /
    • 2020
  • The archaeal clusters of orthologous genes (arCOG) algorithm, which identifies common genes among archaebacterial genomes, was used to identify conservative genes among 168 archaebacterial strains. The numbers of conserved orthologs were 14, 10, 9, and 8 arCOGs in 168, 167, 166, and 165 strains, respectively. Among 41 conserved arCOGs, 13 were related to function J (translation, ribosomal structure, and biogenesis), and 10 were related to function L (replication, recombination, and repair). Among the 14 conserved arCOGs in all 168 strains, 6 arCOGs of tRNA synthetase comprised the highest proportion. Of the remaining 8 arCOGs, 2 are involved in reactions with ribosomes, 2 for tRNA synthesis, 2 for DNA replication, and 2 for transcription. These results showed the importance of protein expression in archaea. For the classes or orders having 3 or more members, genomic analysis was performed by averaging the distance values of the conservative arCOGs. Classes Archaeoglobi and Thermoplasmata of the phylum Euryarchaeota showed the lowest and the highest average of distance value, respectively. This study can provides data necessary for basic scientific research and the development of antibacterial agents and tumor control.

Draft Genome Assembly and Annotation for Cutaneotrichosporon dermatis NICC30027, an Oleaginous Yeast Capable of Simultaneous Glucose and Xylose Assimilation

  • Wang, Laiyou;Guo, Shuxian;Zeng, Bo;Wang, Shanshan;Chen, Yan;Cheng, Shuang;Liu, Bingbing;Wang, Chunyan;Wang, Yu;Meng, Qingshan
    • Mycobiology
    • /
    • v.50 no.1
    • /
    • pp.66-78
    • /
    • 2022
  • The identification of oleaginous yeast species capable of simultaneously utilizing xylose and glucose as substrates to generate value-added biological products is an area of key economic interest. We have previously demonstrated that the Cutaneotrichosporon dermatis NICC30027 yeast strain is capable of simultaneously assimilating both xylose and glucose, resulting in considerable lipid accumulation. However, as no high-quality genome sequencing data or associated annotations for this strain are available at present, it remains challenging to study the metabolic mechanisms underlying this phenotype. Herein, we report a 39,305,439 bp draft genome assembly for C. dermatis NICC30027 comprised of 37 scaffolds, with 60.15% GC content. Within this genome, we identified 524 tRNAs, 142 sRNAs, 53 miRNAs, 28 snRNAs, and eight rRNA clusters. Moreover, repeat sequences totaling 1,032,129 bp in length were identified (2.63% of the genome), as were 14,238 unigenes that were 1,789.35 bp in length on average (64.82% of the genome). The NCBI non-redundant protein sequences (NR) database was employed to successfully annotate 11,795 of these unigenes, while 3,621 and 11,902 were annotated with the Swiss-Prot and TrEMBL databases, respectively. Unigenes were additionally subjected to pathway enrichment analyses using the Gene Ontology (GO), Kyoto Encyclopedia of Genes and Genomes (KEGG), Cluster of Orthologous Groups of proteins (COG), Clusters of orthologous groups for eukaryotic complete genomes (KOG), and Non-supervised Orthologous Groups (eggNOG) databases. Together, these results provide a foundation for future studies aimed at clarifying the mechanistic basis for the ability of C. dermatis NICC30027 to simultaneously utilize glucose and xylose to synthesize lipids.