• Title/Summary/Keyword: pan-genome

Search Result 43, Processing Time 0.022 seconds

Correlation-based and feature-driven mutation signature analyses to identify genetic features associated with DNA mutagenic processes in cancer genomes

  • Jeong, Hye Young;Yoo, Jinseon;Kim, Hyunwoo;Kim, Tae-Min
    • Genomics & Informatics
    • /
    • v.19 no.4
    • /
    • pp.40.1-40.11
    • /
    • 2021
  • Mutation signatures represent unique sequence footprints of somatic mutations resulting from specific DNA mutagenic and repair processes. However, their causal associations and the potential utility for genome research remain largely unknown. In this study, we performed PanCancer-scale correlative analyses to identify the genomic features associated with tumor mutation burdens (TMB) and individual mutation signatures. We observed that TMB was correlated with tumor purity, ploidy, and the level of aneuploidy, as well as with the expression of cell proliferation-related genes representing genomic covariates in evaluating TMB. Correlative analyses of mutation signature levels with genes belonging to specific DNA damage-repair processes revealed that deficiencies of NHEJ1 and ALKBH3 may contribute to mutations in the settings of APOBEC cytidine deaminase activation and DNA mismatch repair deficiency, respectively. We further employed a strategy to identify feature-driven, de novo mutation signatures and demonstrated that mutation signatures can be reconstructed using known causal features. Using the strategy, we further identified tumor hypoxia-related mutation signatures similar to the APOBEC-related mutation signatures, suggesting that APOBEC activity mediates hypoxia-related mutational consequences in cancer genomes. Our study advances the mechanistic insights into the TMB and signature-based DNA mutagenic and repair processes in cancer genomes. We also propose that feature-driven mutation signature analysis can further extend the categories of cancer-relevant mutation signatures and their causal relationships.

Comparative Genomic and Genetic Functional Analysis of Industrial L-Leucine- and L-Valine-Producing Corynebacterium glutamicum Strains

  • Ma, Yuechao;Chen, Qixin;Cui, Yi;Du, Lihong;Shi, Tuo;Xu, Qingyang;Ma, Qian;Xie, Xixian;Chen, Ning
    • Journal of Microbiology and Biotechnology
    • /
    • v.28 no.11
    • /
    • pp.1916-1927
    • /
    • 2018
  • Corynebacterium glutamicum is an excellent platform for the production of amino acids, and is widely used in the fermentation industry. Most industrial strains are traditionally obtained by repeated processes of random mutation and selection, but the genotype of these strains is often unclear owing to the absence of genomic information. As such, it is difficult to improve the growth and amino acid production of these strains via metabolic engineering. In this study, we generated a complete genome map of an industrial L-valine-producing strain, C. glutamicum XV. In order to establish the relationship between genotypes and physiological characteristics, a comparative genomic analysis was performed to explore the core genome, structural variations, and gene mutations referring to an industrial L-leucine-producing strain, C. glutamicum CP, and the widely used C. glutamicum ATCC 13032. The results indicate that a 36,349 bp repeat sequence in the CP genome contained an additional copy each of lrp and brnFE genes, which benefited the export of L-leucine. However, in XV, the kgd and panB genes were disrupted by nucleotide insertion, which increase the availability of precursors to synthesize L-valine. Moreover, the specific amino acid substitutions in key enzymes increased their activities. Additionally, a novel strategy is proposed to remodel central carbon metabolism and reduce pyruvate consumption without having a negative impact on cell growth by introducing the CP-derived mutant $H^+$/citrate symporter. These results further our understanding regarding the metabolic networks in these strains and help to elucidate the influence of different genotypes on these processes.

Methylation-sensitive high-resolution melting analysis of the USP44 promoter can detect early-stage hepatocellular carcinoma in blood samples

  • Si-Cho, Kim;Jiwon, Kim;Da-Won, Kim;Yanghee, Choi;Kyunghyun, Park;Eun Ju, Cho;Su Jong, Yu;Jeongsil, Kim-Ha;Young-Joon, Kim
    • BMB Reports
    • /
    • v.55 no.11
    • /
    • pp.553-558
    • /
    • 2022
  • Hepatocellular carcinoma (HCC) is dangerous cancer that often evades early detection because it is asymptomatic and an effective detection method is lacking. For people with chronic liver inflammation who are at high risk of developing HCC, a sensitive detection method for HCC is needed. In a meta-analysis of The Cancer Genome Atlas pan-cancer methylation database, we identified a CpG island in the USP44 promoter that is methylated specifically in HCC. We developed methylation-sensitive high-resolution melting (MS-HRM) analysis to measure the methylation levels of the USP promoter in cell-free DNA isolated from patients. Our MS-HRM assay correctly identified 40% of patients with early-stage HCC, whereas the α-fetoprotein test, which is currently used to detect HCC, correctly identified only 25% of early-stage HCC patients. These results demonstrate that USP44 MS-HRM analysis is suitable for HCC surveillance.

Selection signature reveals genes associated with susceptibility loci affecting respiratory disease due to pleiotropic and hitchhiking effect in Chinese indigenous pigs

  • Xu, Zhong;Sun, Hao;Zhang, Zhe;Zhang, Cheng-Yue;Zhao, Qing-bo;Xiao, Qian;Olasege, Babatunde Shittu;Ma, Pei-Pei;Zhang, Xiang-Zhe;Wang, Qi-Shan;Pan, Yu-Chun
    • Asian-Australasian Journal of Animal Sciences
    • /
    • v.33 no.2
    • /
    • pp.187-196
    • /
    • 2020
  • Objective: Porcine respiratory disease is one of the most important health problems causing significant economic losses. To understand the genetic basis for susceptibility to swine enzootic pneumonia (EP) in pigs, we detected 102,809 single nucleotide polymorphisms in a total of 249 individuals based on genome-wide sequencing data. Methods: Genome comparison of susceptibility to swine EP in three pig breeds (Jinhua, Erhualian, and Meishan) with two western lines that are considered more resistant (Duroc and Landrace) using cross-population extended haplotype homozygosity and F-statistic (FST) statistical approaches identified 691 positively selected genes. Based on quantitative trait loci, gene ontology terms and literature search, we selected 14 candidate genes that have convincible biological functions associated with swine EP or human asthma. Results: Most of these genes were tested by several methods including transcription analysis and candidate genes association study. Among these genes: cytochrome P450 1A1 and catenin beta 1 (CTNNB1) are involved in fertility; transforming growth factor beta receptor 3 plays a role in meat quality traits; Wnt family member 2, CTNNB1 and transcription factor 7 take part in adipogenesis and fat deposition simultaneously; plasminogen activator, urokinase receptor (completely linked to AXL receptor tyrosine kinase, r2 = 1) plays an essential role in the successful ovulation of matured oocytes in pigs; colipase like 2 (strongly linked to SAM pointed domain containing ETS transcription factor, r2 = 0.848) is involved in male fertility. Conclusion: These adverse genes susceptible to swine EP may be selected while selecting for economic traits (especially reproduction traits) due to pleiotropic and hitchhiking effect of linked genes. Our study provided a completely new point of view to understand the genetic basis for susceptibility or resistance to swine EP in pigs thereby, provides insight for designing sustainable breed selection programs. Finally, the candidate genes are crucial due to their potential roles in respiratory diseases in a large number of species, including human.

Detection of genome-wide structural variations in the Shanghai Holstein cattle population using next-generation sequencing

  • Liu, Dengying;Chen, Zhenliang;Zhang, Zhe;Sun, Hao;Ma, Peipei;Zhu, Kai;Liu, Guanglei;Wang, Qishan;Pan, Yuchun
    • Asian-Australasian Journal of Animal Sciences
    • /
    • v.32 no.3
    • /
    • pp.320-333
    • /
    • 2019
  • Objective: The Shanghai Holstein cattle breed is susceptible to severe mastitis and other diseases due to the hot weather and long-term humidity in Shanghai, which is the main distribution centre for providing Holstein semen to various farms throughout China. Our objective was to determine the genetic mechanisms influencing economically important traits, especially diseases that have huge impact on the yield and quality of milk as well as reproduction. Methods: In our study, we detected the structural variations of 1,092 Shanghai Holstein cows by using next-generation sequencing. We used the DELLY software to identify deletions and insertions, cn.MOPS to identify copy-number variants (CNVs). Furthermore, we annotated these structural variations using different bioinformatics tools, such as gene ontology, cattle quantitative trait locus (QTL) database and ingenuity pathway analysis (IPA). Results: The average number of high-quality reads was 3,046,279. After filtering, a total of 16,831 deletions, 12,735 insertions and 490 CNVs were identified. The annotation results showed that these mapped genes were significantly enriched for specific biological functions, such as disease and reproduction. In addition, the enrichment results based on the cattle QTL database showed that the number of variants related to milk and reproduction was higher than the number of variants related to other traits. IPA core analysis found that the structural variations were related to reproduction, lipid metabolism, and inflammation. According to the functional analysis, structural variations were important factors affecting the variation of different traits in Shanghai Holstein cattle. Our results provide meaningful information about structural variations, which may be useful in future assessments of the associations between variations and important phenotypes in Shanghai Holstein cattle. Conclusion: Structural variations identified in this study were extremely different from those of previous studies. Many structural variations were found to be associated with mastitis and reproductive system diseases; these results are in accordance with the characteristics of the environment that Shanghai Holstein cattle experience.

Cloning and Characterization of Squalene Synthase (SQS) Gene from Ganoderma lucidum

  • Zhao, Ming-Wen;Liang, Wan-Qi;Zhang, Da-Bing;Wang, Nan;Wang, Chen-Guang;Pan, Ying-Jie
    • Journal of Microbiology and Biotechnology
    • /
    • v.17 no.7
    • /
    • pp.1106-1112
    • /
    • 2007
  • This report provides the complete nucleotide sequences of the full-length cDNA encoding squalene synthase (SQS) and its genomic DNA sequence from a triterpene-producing fungus, Ganoderma lucidum. The cDNA of the squalene synthase (SQS) (GenBank Accession Number: DQ494674) was found to contain an open reading frame (ORF) of 1,404 bp encoding a 468-amino-acid polypeptide, whereas the SQS genomic DNA sequence (GenBank Accession Number: DQ494675) consisted of 1,984 bp and contained four exons and three introns. Only one gene copy was present in the G. lucidum genome. The deduced amino acid sequence of Ganoderma lucidum squalene synthase (GI-SQS) exhibited a high homology with other fungal squalene synthase genes and contained six conserved domains. A phylogenetic analysis revealed that G. lucidum SQS belonged to the fungi SQS group, and was more closely related to the SQS of U. maydis than to those of other fungi. A gene expression analysis showed that the expression level was relatively low in mycelia incubated for 12 days, increased after 14 to 20 days of incubation, and reached a relatively high level in the mushroom primordia. Functional complementation of GI-SQS in a SQS-deficient strain of Saccharomyces cerevisiae confirmed that the cloned cDNA encoded a squalene synthase.

Study on Microbial Community Succession and Protein Hydrolysis of Donkey Meat during Refrigerated Storage Based on Illumina NOVA Sequencing Technology

  • Wei, Zixiang;Chu, Ruidong;Li, Lanjie;Zhang, Jingjing;Zhang, Huachen;Pan, Xiaohong;Dong, Yifan;Liu, Guiqin
    • Food Science of Animal Resources
    • /
    • v.41 no.4
    • /
    • pp.701-714
    • /
    • 2021
  • In this study, the microbial community succession and the protein hydrolysis of donkey meat during refrigerated (4℃) storage were investigated. 16S rDNA sequencing method was used to analyze the bacteria community structure and succession in the level of genome. Meanwhile, the volatile base nitrogen (TVB-N) was measured to evaluate the degradation level of protein. After sorting out the sequencing results, 1,274,604 clean data were obtained, which were clustered into 2,064 into operational taxonomic units (OTUs), annotated to 32 phyla and 527 genus. With the prolonging of storage time, the composition of microorganism changed greatly. At the same time, the diversity and richness of microorganism decreased and then increased. During the whole storage period, Proteobacteria was the dominant phyla, and the Photobacterium, Pseudompnas, and Acinetobacter were the dominant genus. According to correlation analysis, it was found that the abundance of these dominant bacteria was significantly positively correlated with the variation of TVB-N. And Pseudomonas might play an important role in the production of TVB-N during refrigerated storage of donkey meat. The predicted metabolic pathways, based on PICRUSt analysis, indicated that amino metabolism in refrigerated donkey meat was the main metabolic pathways. This study provides insight into the process involved in refrigerated donkey meat spoilage, which provides a foundation for the development of antibacterial preservative for donkey meat.

Ginsenoside Rg3 increases gemcitabine sensitivity of pancreatic adenocarcinoma via reducing ZFP91 mediated TSPYL2 destabilization

  • Pan, Haixia;Yang, Linhan;Bai, Hansong;Luo, Jing;Deng, Ying
    • Journal of Ginseng Research
    • /
    • v.46 no.5
    • /
    • pp.636-645
    • /
    • 2022
  • Background: Ginsenoside Rg3 and gemcitabine have mutual enhancing antitumor effects. However, the underlying mechanisms are not clear. This study explored the influence of ginsenoside Rg3 on Zinc finger protein 91 homolog (ZFP91) expression in pancreatic adenocarcinoma (PAAD) and their regulatory mechanisms on gemcitabine sensitivity. Methods: RNA-seq and survival data from The Cancer Genome Atlas (TCGA)-PAAD and Genotype-Tissue Expression (GTEx) were used for in-silicon analysis. PANC-1, BxPC-3, and PANC-1 gemcitabine-resistant (PANC-1/GR) cells were used for in vitro analysis. PANC-1 derived tumor xenograft nude mice model was used to assess the influence of ginsenoside Rg3 and ZFP91 on tumor growth in vivo. Results: Ginsenoside Rg3 reduced ZFP91 expression in PAAD cells in a dose-dependent manner. ZFP91 upregulation was associated with significantly shorter survival of patients with PAAD. ZFP91 overexpression induced gemcitabine resistance, which was partly conquered by ginsenoside Rg3 treatment. ZFP91 depletion sensitized PANC-1/GR cells to gemcitabine treatment. ZFP91 interacted with Testis-Specific Y-Encoded-Like Protein 2 (TSPYL2), induced its poly-ubiquitination, and promoted proteasomal degradation. Ginsenoside Rg3 treatment weakened ZFP91-induced TSPYL2 poly-ubiquitination and degradation. Enforced TSPYL2 expression increased gemcitabine sensitivity of PAAD cells and partly reversed induced gemcitabine resistance in PANC-1/GR cells. Conclusion: Ginsenoside Rg3 can increase gemcitabine sensitivity of pancreatic adenocarcinoma at least via reducing ZFP91 mediated TSPYL2 destabilization.

Introduction of the Korea BioData Station (K-BDS) for sharing biological data

  • Byungwook Lee;Seungwoo Hwang;Pan-Gyu Kim;Gunwhan Ko;Kiwon Jang;Sangok Kim;Jong-Hwan Kim;Jongbum Jeon;Hyerin Kim;Jaeeun Jung;Byoung-Ha Yoon;Iksu Byeon;Insu Jang;Wangho Song;Jinhyuk Choi;Seon-Young Kim
    • Genomics & Informatics
    • /
    • v.21 no.1
    • /
    • pp.12.1-12.8
    • /
    • 2023
  • A wave of new technologies has created opportunities for the cost-effective generation of high-throughput profiles of biological systems, foreshadowing a "data-driven science" era. The large variety of data available from biological research is also a rich resource that can be used for innovative endeavors. However, we are facing considerable challenges in big data deposition, integration, and translation due to the complexity of biological data and its production at unprecedented exponential rates. To address these problems, in 2020, the Korean government officially announced a national strategy to collect and manage the biological data produced through national R&D fund allocations and provide the collected data to researchers. To this end, the Korea Bioinformation Center (KOBIC) developed a new biological data repository, the Korea BioData Station (K-BDS), for sharing data from individual researchers and research programs to create a data-driven biological study environment. The K-BDS is dedicated to providing free open access to a suite of featured data resources in support of worldwide activities in both academia and industry.

Definition of the peptide mimotope of cellular receptor for hepatitis C virus E2 protein using random peptide library (Random peptide library를 이용한 C형 간염바이러스 E2 단백질 세포막 수용체의 peptide mimotope 규명)

  • Lee, In-Hee;Paik, Jae-Eun;Seol, Sang-Yong;Seog, Dae-Hyun;Park, Sae-Gwang;Choi, In-Hak
    • IMMUNE NETWORK
    • /
    • v.1 no.1
    • /
    • pp.77-86
    • /
    • 2001
  • Background: Hepatitis C virus(HCV), a family of Flaviviridae, has a host cell-derived envelope containing a positive-stranded RNA genome, and has been known as the maj or etiological agent for chronic hepatitis, hepatic cirrhosis, and hepatocellular carcinoma. There remains a need to dissect a molecular mechanism of pathogenesis for the development of therapeutic and effective preventive measure for HCV. Identification of cellular receptor is of central importance not only to understand the viral pathogenesis, but also to exploit strategies for prevention of HCV. This study was aimed at identifying peptide mimotopes inhibiting the binding of E2 protein of HCV to MOLT-4 cell. Methods: In this study, phage peptide library displaying a random peptides consisting of 7 or 12 random peptides was employed in order to pan against E2 protein. Free HCV particles were separated from the immune complex forms by immunoprecipitation using anti-human IgG antibody, and used for HCV-capture ELISA. To identify the peptides inhibiting E2-binding to MOLT-4 cells, E2 protein was subj ect to bind to MOLT-4 cells under the competition with phage peptides. Results: Several phage peptides were selected for their specific binding to E2 protein, which showed the conserved sequence of SHFWRAP from 3 different peptide sequences. They were also able to recognize the HCV particles in the sera of HCV patients captured by monoclonal antibody against E2 protein. Two of them, showing peptide sequence of HLGPWMSHWFQR and WAPPLERSSLFY respectively, were revealed to inhibit the binding of E2 protein to MOLT-4 cell efficiently in dose dependent mode. However, few membrane-associated receptor candidates were seen using Fasta3 programe for homology search with these peptides. Conclusion: Phage peptides containing HLGPWMSHWFQR and WAPPLERSSLFY respectively, showed the inhibition of E2-binding to MOLT-4 cells. However, they did not reveal any homologues to cellular receptors from GenBank database. In further study, cellular receptor could be identified through the screening of cDNA library from MOLT-4 or hepatocytes using antibodies against these peptide mimotopes.

  • PDF