• Title/Summary/Keyword: COGs

Search Result 31, Processing Time 0.023 seconds

Investigation of COGs (Clusters of Orthologous Groups of proteins) in 1,309 Species of Prokaryotes (원핵생물 1,309종에 분포된 COGs (Clusters of Orthologous Groups of proteins) 연구)

  • Lee, Dong-Geun;Lee, Sang-Hyeon
    • Journal of Life Science
    • /
    • v.31 no.9
    • /
    • pp.834-839
    • /
    • 2021
  • Authors previously reported the results of analyses of COGs (Clusters of Orthologous Groups of proteins) in 711 prokaryotes. The data of COGs were significantly updated for 2020 using 1,309 prokaryotic genomes. Here, we report the results of analyses of 3,455,853 proteins comprising 4,877 updated COGs in terms of COGs and prokaryotes. The numbers of COGs in each prokaryote ranged from 97 to 2,281, with an average of 1,430.0 and a standard deviation of 414.2. Mean numbers of COGs at the phylum level were minimal 497.86 for Mollicutes and maximal 1,642.90 for Cyanobacteria. The top 10 species with the highest COG retention numbers were all Proteobacteria, and 9 out of the bottom 10 were those that could not be cultured in vitro. The numbers of proteins belonging to each COG ranged from 2 to 22,048, with over 12,000 proteins up to the top 11. Five of the top 11 were COGs that bind to DNA and were involved in the gene expression, indicating the importance of regulating gene expression in prokaryotes in a changing environment. COG data are expected to be widely utilized as they can be used for the identification of genes included in the genome and the selection of genes for the strain improvement.

Korean and English affricates in bilingual children

  • Yu, Hye Jeong
    • Phonetics and Speech Sciences
    • /
    • v.9 no.3
    • /
    • pp.1-6
    • /
    • 2017
  • This study examined how early bilingual children produce sounds in their two languages articulated with the same manner of articulation but at different places of articulation. English affricates are palato-alveolar and Korean affricates are alveolar. This study analyzed the frequencies of center of gravity (COG), spectral peak (SP), and the second formant (F2) of word-initial affricates in English and Korean produced by twenty-four early Korean-English bilingual children (aged 4 to 7), and compared them with those of monolingual counterparts in the two languages. If early Korean-English bilingual children produce palato-alveolar affricates in English and alveolar affricates in Korean, they may produce Korean affricates with higher COGs, SPs, and F2s than English affricates. The early Korean-English bilingual children at the age of 4 produced English and Korean affricates with similar COGs, SPs, and F2s, and the COGs, SPs, and F2s of their Korean affricates were similar to those of the Korean monolingual counterparts. However, the early bilingual children at the age of 5 to 7 had lower COGs and SPs for English affricates with higher F2s compared to Korean affricates, and the COGs, SPs, and F2s of their English affricates were similar to those of the English monolingual counterparts.

Metabolic Pathways of 1309 Prokaryotic Species in Relation to COGs (COG pathways에서 원핵생물 1,309종의 대사경로)

  • Lee, Dong-Geun;Kim, Ju-Hui;Lee, Sang-Hyeon
    • Journal of Life Science
    • /
    • v.32 no.3
    • /
    • pp.249-255
    • /
    • 2022
  • Metabolism is essential for survival and reproduction, and there is a metabolic pathways entry in the clusters of orthologous groups of proteins (COGs) database, updated in 2020. In this study, the metabolic pathways of 1309 prokaryotes were analyzed using COGs. There were 822 COGs associated with 63 metabolic pathways, and the mean for each taxon was between 200.50 (mollicutes) and 527.07 (cyanobacteria) COGs. The metabolic pathway composition ratio (MPCR) was defined as the number of COGs present in one genome in relation to the total number of COGs constituting each metabolic pathway, and the number of pathways with 100% MPCR ranged from 0 to 26 in each prokaryote. Among 1309 species, the 100% MPCR pathways included murein biosynthesis associated with cell wall synthesis (922 species); glycine cleavage (918); and ribosomal 30S subunit synthesis (903). The metabolic pathways with 0% MPCR were those involving photosystem I (1263 species); archaea/vacuolar-type ATP synthase (1028); and Na+-translocation NADH dehydrogenase (976). Depending on the prokaryote, three to 49 metabolic pathways could not be performed at all. The sequence of most highly conserved metabolic pathways was ribosome 30S subunit synthesis (96.1% of 1309 species); murein biosynthesis (86.8%); arginine biosynthesis (80.4%); serine biosynthesis (80.3%); and aminoacyl-tRNA synthesis (82.2%). Protein and cell wall synthesis have been shown to be important metabolic pathways in prokaryotes, and the results of this study of COGs related to such pathways can be utilized in, for example, the development of antibiotics and artificial cells.

Investigation of Conservative Genes in 711 Prokaryotes (원핵생물 711종의 보존적 유전자 탐색)

  • Lee, Dong-Geun;Lee, Sang-Hyeon
    • Journal of Life Science
    • /
    • v.25 no.9
    • /
    • pp.1007-1013
    • /
    • 2015
  • A COG (Cluster of Orthologous Groups of proteins) algorithm was applied to detect conserved genes in 711 prokaryotes. Only COG0080 (ribosomal protein L11) was common among all the 711 prokaryotes analyzed and 58 COGs were common in more than 700 prokaryotes. Nine COGs among 58, including COG0197 (endonuclease III) and COG0088 (ribosomal protein L4), were conserved in a form of one gene per one organism. COG0008 represented 1356 genes in 709 of the prokaryotes and this was the highest number of genes among 58 COGs. Twenty-two COGs were conserved in more than 708 prokaryotes. Of these, two were transcription related, four were tRNA synthetases, eight were large ribosomal subunits, seven were small ribosomal subunits, and one was translation elongation factor. Among 58 conserved COGs in more than 700 prokaryotes, 50 (86.2%) were translation related, and four (6.9%) were transcription related, pointing to the importance of protein-synthesis in prokaryotes. Among these 58 COGs, the most conserved COG was COG0060 (isoleucyl tRNA synthetase), and the least conserved was COG0143 (methionyl tRNA synthetase). Archaea and eubacteria were discriminated in the genomic analysis by the average distance and variation in distance of common COGs. The identification of these conserved genes could be useful in basic and applied research, such as antibiotic development and cancer therapeutics.

COG 알고리즘으로 파악한 Proteobacteria의 보존적 유전자

  • Lee, Dong-Geun;Lee, Jin-Ok;Lee, Jae-Hwa
    • 한국생물공학회:학술대회논문집
    • /
    • 2003.04a
    • /
    • pp.715-718
    • /
    • 2003
  • A COG (clusters of orthologous groups of proteins) algorithm, protein similarities among genomes, was used to detect conserved genes and to figure out their relationships within 42 procaryote, 33 Bacteria and 16 Proteobacteria All analyzed procaryotes shared 75 COGs. COG0195, COG0358 and COG0528 were only represented by the 42 procaryotes. Sixty-four COGs were added as conserved genes in 33 eubacteria. Each Proteobacteria group has a unique repertoire of COGs. Metabolic COGs were more diverse in the beta-Proteobacteria group than in the other groups. The possibilities of detecting new biological molecules is high in phylogenetically related organisms, hence the identification of useful proteins by using this algorithm is possible.

  • PDF

Investigation of Conservative Genes in 168 Archaebacterial Strains (168개 고세균 균주들의 보존적 유전자에 관한 연구)

  • Lee, Dong-Geun;Lee, Sang-Hyeon
    • Journal of Life Science
    • /
    • v.30 no.9
    • /
    • pp.813-818
    • /
    • 2020
  • The archaeal clusters of orthologous genes (arCOG) algorithm, which identifies common genes among archaebacterial genomes, was used to identify conservative genes among 168 archaebacterial strains. The numbers of conserved orthologs were 14, 10, 9, and 8 arCOGs in 168, 167, 166, and 165 strains, respectively. Among 41 conserved arCOGs, 13 were related to function J (translation, ribosomal structure, and biogenesis), and 10 were related to function L (replication, recombination, and repair). Among the 14 conserved arCOGs in all 168 strains, 6 arCOGs of tRNA synthetase comprised the highest proportion. Of the remaining 8 arCOGs, 2 are involved in reactions with ribosomes, 2 for tRNA synthesis, 2 for DNA replication, and 2 for transcription. These results showed the importance of protein expression in archaea. For the classes or orders having 3 or more members, genomic analysis was performed by averaging the distance values of the conservative arCOGs. Classes Archaeoglobi and Thermoplasmata of the phylum Euryarchaeota showed the lowest and the highest average of distance value, respectively. This study can provides data necessary for basic scientific research and the development of antibacterial agents and tumor control.

Analysis of Conservative Genes in Thermophilic and Hyperthermophilic Bacteria (고온성과 초고온성 세균의 보존적 유전자 분석)

  • Lee Dong-Geun;Lee Jae-Hwa;Ha Bae Jin;Ha Jong-Myung;Lee Jung-Hyun;Kim Sang-Jin;Lee Sang Hyeon
    • KSBB Journal
    • /
    • v.20 no.5 s.94
    • /
    • pp.387-391
    • /
    • 2005
  • Totally 16,299 conservative genes, commonly found in 13 thermophilic and hyperthermophilic bacteria, were analyzed. All genes were belong to W 67 COGs (clusters of orthologous groups of proteins). COGs related to protein metabolism were 80 among 167 COGs. Conservative genes were not limited only thermophiles and hyperthermophiles, meaning thermal stability is independent of specific protein. However reverse gyrase was only found in all hyperthermophilic archaebacteria and eubacteria, meaning DNA stability is important in hyperthermophiles. Hyperthermophilic eubacteria and thermophilic archaebacteria had different position between phylogenetic tree of gene content and 165 rRNA gene. Thermophilic archaebacteria hyperthermophilic eubacteria and archaebacteria had similar values by the statistical analysis of distance values with 167 COGs in each organism.

Detection of Conserved Genes in Proteobacteria by using a COG Algorithm (COG 알고리즘을 통한 Proteobacteria의 보존적 유전자 파악)

  • 이동근;강호영;이재화;김철민
    • KSBB Journal
    • /
    • v.17 no.6
    • /
    • pp.560-565
    • /
    • 2002
  • A COG(clusters of orthologous groups of proteins) algorithm was used to detect conserved genes within Proteobacteria and to figure out their relationships. Restricting comparison to the sequences of 42 procaryotes, 33 eubacteria and 16 Proteobacteria, the number of conserved genes was increased. All analyzed procaryotes shared 75 COGs. COG0195, COG0358 and COG0528 were only represented by the 42 procaryotes. Sixtyfour COGs were added as conserved genes in 33 eubacteria. Each Proteobacteria group has a unique repertoire of COGs. Metabolic COGs were more diverse in the beta Proteobacteria group than in the other groups. These results could be used to determine the origins and the evolutionary relationships of Proteobacteria. The possibilities of detecting new biological molecules is high in phylogenetically related organisms, hence the identification of useful proteins by using this algorithm is possible.

Genetic Composition Analysis of Marine-Origin Euryarchaeota by using a COG Algorithm (COG 알고리즘을 통한 해양성 Euryarchaeota의 유전적 조성 분석)

  • 이재화;이동근;김철민;이은열
    • Journal of Life Science
    • /
    • v.13 no.3
    • /
    • pp.298-307
    • /
    • 2003
  • To figure out the conserved genes and newly added genes at each phylogenetic level of Archaea, COG (clusters of orthologous groups of proteins) algorithm was applied. The number of conserved genes within 9 species of Archaea was 340 and that of 8 species of Euryarchaeota was 388. Many of conserved 265 COGs, which are specific to Archaea and absent in Bacteria and S. cerevisiae, were concerned with 'information storage and processing' (94 COG, 35.5%) and 'metabolism' (82 COG, 30.9%). COGs related to these functions were assumed as highly conserved and permit peculiar life form to Archaea. It seemed that there was some difference in 'nucleotide transport and metabolism' and there was little difference in 'information storage and processing' between Euryarchaeota and Crenarchaeota. Marine-origin Euryarchaeota showed different conserved COGs with terrestrial Euryarchaeota. Conserved COGs, related to carbohydrate transport and metabolism and others, were different between marine- and terrestrial-origin Euryarchaeota. Hence it was assumed that their physiology might be different. This study may help to understand the origin and conserved genes at each phylogenetic level of marine-origin Euryarchaeota and may help in the mining of useful genes in marine Archaea as Manco et al. (Arch. Biochem. Biophy. 373, 182 (2000)).

Conserved COG Pathways and Genes of 122 Species of Archaea (고세균 122종의 보존적 COG pathways와 유전자)

  • Dong-Geun Lee ;Sang-Hyeon Lee
    • Journal of Life Science
    • /
    • v.33 no.11
    • /
    • pp.944-949
    • /
    • 2023
  • The purpose of this study was to identify conserved metabolic pathways and conserved genes in 122 archaeal species. Using the Clusters of Orthologous Groups of Proteins (COG) database of conserved genes, we analyzed whether 122 species had 63 COG metabolic pathways, the 822 COGs that compose them, and a total of 4,877 COGs. Archaeal ribosomal proteins were the most conserved in metabolic pathways. 46 COGs in seven COG pathways among 63 COG pathways and 20 COGs in others were conserved in 122 species. Some genes involved in cell wall and extracellular matrix synthesis, replication, transcription, translation, and protein metabolism were common to all 122 species. When the distance value of the phylogenetic tree was analyzed at the phylum level or class level, the average was the lowest at the class Halobacteria of the phylum Euryarchaeota. Standard deviation was high for the class Nitosospharia of the phylum Thaumarchaeota, the unclassified members of phylum Thaumarchaeota, the class Halobacteria of the phylum Euryarchaeota, the class Thermoprotei of the phylum Crenarchaeota, and other archaea. Furthermore, the phylogenetic tree analysis revealed six commonalities. The results of this study, along with data on conserved genes, could be used for drug development and gene selection for strain improvement.