• Title/Summary/Keyword: Genomic prediction

Search Result 110, Processing Time 0.026 seconds

Genomic Analysis of 13 Putative Active Prophages Located in the Genomes of Walnut Blight Pathogen Xanthomonas arboricola pv. juglandis

  • Cao, Zheng;Cuiying, Du;Benzhong, Fu
    • Microbiology and Biotechnology Letters
    • /
    • v.50 no.4
    • /
    • pp.563-573
    • /
    • 2022
  • Xanthomonas arboricola pv. juglandis (Xaj) is a globally important bacterial pathogen of walnut trees that causes substantial economic losses in commercial walnut production. Although prophages are common in bacterial plant pathogens and play important roles in bacterial diversity and pathogenicity, there has been limited investigation into the distribution and function of prophages in Xaj. In this study, we identified and characterized 13 predicted prophages from the genomes of 12 Xaj isolates from around the globe. These prophages ranged in length from 11.8 kb to 51.9 kb, with between 11-75 genes and 57.82-64.15% GC content. The closest relatives of these prophages belong to the Myoviridae and Siphoviridae families of the Caudovirales order. The phylogenetic analysis allowed the classification of the prophages into five groups. The gene constitution of these predicted prophages was revealed via Roary analysis. Amongst 126 total protein groups, the most prevalent group was only present in nine prophages, and 22 protein groups were present in only one prophage (singletons). Also, bioinformatic analysis of the 13 identified prophages revealed the presence of 431 genes with an average length of 389.7 bp. Prokka annotation of these prophages identified 466 hypothetical proteins, 24 proteins with known function, and six tRNA genes. The proteins with known function mainly comprised prophage integrase IntA, replicative DNA helicase, tyrosine recombinase XerC, and IS3 family transposase. There was no detectable insertion site specificity for these prophages in the Xaj genomes. The identified Xaj prophage genes, particularly those of unknown function, merit future investigation.

A comparison of five sets of overlapping and non-overlapping sliding windows for semen production traits in the Thai multibreed dairy population

  • Mattaneeya Sarakul;Mauricio A. Elzo;Skorn Koonawootrittriron;Thanathip Suwanasopee;Danai Jattawa;Thawee Laodim
    • Animal Bioscience
    • /
    • v.37 no.3
    • /
    • pp.428-436
    • /
    • 2024
  • Objective: This study compared five distinct sets of biological pathways and associated genes related to semen volume (VOL), number of sperm (NS), and sperm motility (MOT) in the Thai multibreed dairy population. Methods: The phenotypic data included 13,533 VOL records, 12,773 NS records, and 12,660 MOT records from 131 bulls. The genotypic data consisted of 76,519 imputed and actual single nucleotide polymorphisms (SNPs) from 72 animals. The SNP additive genetic variances for VOL, NS, and MOT were estimated for SNP windows of one SNP (SW1), ten SNP (SW10), 30 SNP (SW30), 50 SNP (SW50), and 100 SNP (SW100) using a single-step genomic best linear unbiased prediction approach. The fixed effects in the model were contemporary group, ejaculate order, bull age, ambient temperature, and heterosis. The random effects accounted for animal additive genetic effects, permanent environment effects, and residual. The SNPs explaining at least 0.001% of the additive genetic variance in SW1, 0.01% in SW10, 0.03% in SW30, 0.05% in SW50, and 0.1% in SW100 were selected for gene identification through the NCBI database. The pathway analysis utilized genes associated with the identified SNP windows. Results: Comparison of overlapping and non-overlapping SNP windows revealed notable differences among the identified pathways and genes associated with the studied traits. Overlapping windows consistently yielded a larger number of shared biological pathways and genes than non-overlapping windows. In particular, overlapping SW30 and SW50 identified the largest number of shared pathways and genes in the Thai multibreed dairy population. Conclusion: This study yielded valuable insights into the genetic architecture of VOL, NS, and MOT. It also highlighted the importance of assessing overlapping and non-overlapping SNP windows of various sizes for their effectiveness to identify shared pathways and genes influencing multiple traits.

Correlation between MR Image-Based Radiomics Features and Risk Scores Associated with Gene Expression Profiles in Breast Cancer (유방암에서 자기공명영상 근거 영상표현형과 유전자 발현 프로파일 근거 위험도의 관계)

  • Ga Ram Kim;You Jin Ku;Jun Ho Kim;Eun-Kyung Kim
    • Journal of the Korean Society of Radiology
    • /
    • v.81 no.3
    • /
    • pp.632-643
    • /
    • 2020
  • Purpose To investigate the correlation between magnetic resonance (MR) image-based radiomics features and the genomic features of breast cancer by focusing on biomolecular intrinsic subtypes and gene expression profiles based on risk scores. Materials and Methods We used the publicly available datasets from the Cancer Genome Atlas and the Cancer Imaging Archive to extract the radiomics features of 122 breast cancers on MR images. Furthermore, PAM50 intrinsic subtypes were classified and their risk scores were determined from gene expression profiles. The relationship between radiomics features and biomolecular characteristics was analyzed. A penalized generalized regression analysis was performed to build prediction models. Results The PAM50 subtype demonstrated a statistically significant association with the maximum 2D diameter (p = 0.0189), degree of correlation (p = 0.0386), and inverse difference moment normalized (p = 0.0337). Among risk score systems, GGI and GENE70 shared 8 correlated radiomic features (p = 0.0008-0.0492) that were statistically significant. Although the maximum 2D diameter was most significantly correlated to both score systems (p = 0.0139, and p = 0.0008), the overall degree of correlation of the prediction models was weak with the highest correlation coefficient of GENE70 being 0.2171. Conclusion Maximum 2D diameter, degree of correlation, and inverse difference moment normalized demonstrated significant relationships with the PAM50 intrinsic subtypes along with gene expression profile-based risk scores such as GENE70, despite weak correlations.

Construction of Web-Based Database for Anisakis Research (고래회충 연구를 위한 웹기반 데이터베이스 구축)

  • Lee, Yong-Seok;Baek, Moon-Ki;Jo, Yong-Hun;Kang, Se-Won;Lee, Jae-Bong;Han, Yeon-Soo;Cha, Hee-Jae;Yu, Hak-Sun;Ock, Mee-Sun
    • Journal of Life Science
    • /
    • v.20 no.3
    • /
    • pp.411-415
    • /
    • 2010
  • Anisakis simplex is one of the parasitic nematodes, and has a complex life cycle in crustaceans, fish, squid or whale. When people eat under-processed or raw fish, it causes anisakidosis and also plays a critical role in inducing serious allergic reactions in humans. However, no web-based database on A. simplex at the level of DNA or protein has been so far reported. In this context, we constructed a web-based database for Anisakis research. To build up the web-based database for Anisakis research, we proceeded with the following measures: First, sequences of order Ascaridida were downloaded and translated into the multifasta format which was stored as database for stand-alone BLAST. Second, all of the nucleotide and EST sequences were clustered and assembled. And EST sequences were translated into amino acid sequences for Nuclear Localization Signal prediction. In addition, we added the vector, E. coli, and repeat sequences into the database to confirm a potential contamination. The web-based database gave us several advantages. Only data that agrees with the nucleotide sequences directly related with the order Ascaridida can be found and retrieved when searching BLAST. It is also very convenient to confirm contamination when making the cDNA or genomic library from Anisakis. Furthermore, BLAST results on the Anisakis sequence information can be quickly accessed. Taken together, the Web-based database on A. simplex will be valuable in developing species specific PCR markers and in studying SNP in A. simplex-related researches in the future.

Molecular Cloning and Characterization of the Yew Gene Encoding Squalene Synthase from Taxus cuspidata

  • Huang, Zhuoshi;Jiang, Keji;Pi, Yan;Hou, Rong;Liao, Zhihua;Cao, Ying;Han, Xu;Wang, Qian;Sun, Xiaofen;Tang, Kexuan
    • BMB Reports
    • /
    • v.40 no.5
    • /
    • pp.625-635
    • /
    • 2007
  • The enzyme squalene synthase (EC 2.5.1.21) catalyzes a reductive dimerization of two farnesyl diphosphate (FPP) molecules into squalene, a key precursor for the sterol and triterpene biosynthesis. A full-length cDNA encoding squalene synthase (designated as TcSqS) was isolated from Taxus cuspidata, a kind of important medicinal plants producing potent anti-cancer drug, taxol. The full-length cDNA of TcSqS was 1765 bp and contained a 1230 bp open reading frame (ORF) encoding a polypeptide of 409 amino acids. Bioinformatic analysis revealed that the deduced TcSqS protein had high similarity with other plant squalene synthases and a predicted crystal structure similar to other class I isoprenoid biosynthetic enzymes. Southern blot analysis revealed that there was one copy of TcSqS gene in the genome of T. cuspidata. Semi-quantitative RT-PCR analysis and northern blotting analysis showed that TcSqS expressed constitutively in all tested tissues, with the highest expression in roots. The promoter region of TcSqS was also isolated by genomic walking and analysis showed that several cis-acting elements were present in the promoter region. The results of treatment experiments by different signaling components including methyl-jasmonate, salicylic acid and gibberellin revealed that the TcSqS expression level of treated cells had a prominent diversity to that of control, which was consistent with the prediction results of TcSqS promoter region in the PlantCARE database.

Cloning of the posterior silk glands specific-expressed gene of silkworm (누에 후부실샘 특이 발현 유전자 클로닝)

  • Piao, Yulan;Kim, Seong-Ryul;Kim, Sung-Wan;Kang, Seok-Woo;Goo, Tae-Won;Choi, Kwang-Ho
    • Journal of Sericultural and Entomological Science
    • /
    • v.53 no.1
    • /
    • pp.44-49
    • /
    • 2015
  • We characterized tissue specific-expressed genes in the posterior silk gland of Bombyx mori using by the Annealing Control Primer based differential display-PCR manner. In this study, we isolated 34 differentially expressed PCR amplicons, which one of these was identified as a novel transcript named as ACP-16 (366 bp), its expression was observed only in the posterior silk glands by Northern blot analysis. To determine promoter region of the ACP-16, we isolated and analyzed a phage DNA having 1.7 kb-long genome DNA including the open reading flame and 5'- upstream untranslated region of the ACP-16 gene from a genomic DNA library. We have estimated a promoter region of the ACP-16 gene by a web promoter prediction engine, which locates -750 ~ -165 from translation initiation site (ATG, +1). ACP-16 gene is necessary to more studies about critical biological role in order to apply the silkworm's transgenic system.

Workflow for Building a Draft Genome Assembly using Public-domain Tools: Toxocara canis as a Case Study (개 회충 게놈 응용 사례에서 공개용 분석 툴을 사용한 드래프트 게놈 어셈블리 생성)

  • Won, JungIm;Kong, JinHwa;Huh, Sun;Yoon, JeeHee
    • KIISE Transactions on Computing Practices
    • /
    • v.20 no.9
    • /
    • pp.513-518
    • /
    • 2014
  • It has become possible for small scale laboratories to interpret large scale genomic DNA, thanks to the reduction of the sequencing cost by the development of next generation sequencing (NGS). De novo assembly is a method which creates a putative original sequence by reconstructing reads without using a reference sequence. There have been various study results on de novo assembly, however, it is still difficult to get the desired results even by using the same assembly procedures and the analysis tools which were suggested in the studies reported. This is mainly because there are no specific guidelines for the assembly procedures or know-hows for the use of such analysis tools. In this study, to resolve these problems, we introduce steps to finding whole genome of an unknown DNA via NGS technology and de novo assembly, while providing the pros and cons of the various analysis tools used in each step. We used 350Mbp of Toxocara canis DNA as an application case for the detailed explanations of each stated step. We also extend our works for prediction of protein-coding genes and their functions from the draft genome sequence by comparing its homology with reference sequences of other nematodes.

DNA Methylation changes in Human Cancers (인체 암의 DNA 메틸화 변화)

  • Kwon, Hyeong-Ju;Kang, Gyeong-Hoon
    • Journal of Genetic Medicine
    • /
    • v.6 no.1
    • /
    • pp.1-7
    • /
    • 2009
  • Epigenetic changes represented by promoter CpG island hypermethylation and histone modification are an important carcinogenetic mechanism, which is found in virtually all histologic types of human cancer. About 60-70% of human genes harbor CpG islands in their promoters and 5' exonal sequences, and some of them undergo aberrant promoter CpG island hypermethylation and subsequent downregulation of gene expression. The loss of expression in tumor suppressor or tumor-related genes results in acceleration of tumorigenic processes. In addition to regional CpG island hypermethylation, diffuse genomic hypomethylation represents an important aspect of DNA methylation changes occurring in human cancer cells and contributes to chromosomal instability. These apparently contrasting methylation changes occur not only in human cancer cells, but also in premalignant cells. CpG island hypermethylation has gained attention for not only the tumorigenic mechanistic process, but also its potential utilization as a tumor biomarker. DNA methylation markers are actively investigated for their potential uses as tumor biomarkers for diagnosis of tumors in body fluids, prognostication of cancer patients, or prediction of chemotherapeutic drug response. In this review, these aspects will be discussed in detail.

  • PDF

Application of Transposable Elements as Molecular-marker for Cancer Diagnosis (암 진단 분자 마커로서 이동성 유전인자의 응용)

  • Kim, Hyemin;Gim, Jeong-An;Woo, Hyojeong;Hong, Jeonghyeon;Kim, Jinyeop;Kim, Heui-Soo
    • Journal of Life Science
    • /
    • v.27 no.10
    • /
    • pp.1215-1224
    • /
    • 2017
  • Until now, various oncogenic pathways were idenfied. The accumulation of DNA mutation induces genomic instability in the cell, and it makes cancer. The development of bioinformatics and genomics, to find the precise and reliable biomarker is available. This biomarker could be applied the early-dignosis, prediction and convalescence of cancer. Recently, Transposable elements (TEs) have been attracted as the regulator of genes, because they occupy a half of human genome, and the cause of various diseases. TEs induce DNA mutation, as well as the regulation of gene expression, that makes to cancer development. So, we confirmed the relationship between TEs and colon cancer, and provided the clue for colon cancer biomarker. First, we confirmed long interspersed nuclear element-1 (LINE-1), Alu, and long terminal repeats (LTRs) and their relationship to colon cancer. Because these elements have large composition and enormous effect to the human genome. Interestingly, colon cancer specific patterns were detected, such as the hypomethylation of LINE-1, LINE-1 insertion in the APC gene, hypo- or hypermethylation of Alu, and isoform derived from LTR insertion. Moreover, hypomethylation of LINE-1 in proto-oncogene is used as the biomarker of colon cancer metastasis, and MLH1 mutation induced by Alu is detected in familial or hereditary colon cancer. The genes, effected by TEs, were analyzed their expression patterns by in silico analysis. Then, we provided tissue- and gender-specific expression patterns. This information can provide reliable cancer biomarker, and apply to prediction and diagnosis of colon cancer.

Evolutionary Explanation for Beauveria bassiana Being a Potent Biological Control Agent Against Agricultural Pests

  • Han, Jae-Gu
    • 한국균학회소식:학술대회논문집
    • /
    • 2014.05a
    • /
    • pp.27-28
    • /
    • 2014
  • Beauveria bassiana (Cordycipitaceae, Hypocreales, Ascomycota) is an anamorphic fungus having a potential to be used as a biological control agent because it parasitizes a wide range of arthropod hosts including termites, aphids, beetles and many other insects. A number of bioactive secondary metabolites (SMs) have been isolated from B. bassiana and functionally verified. Among them, beauvericin and bassianolide are cyclic depsipeptides with antibiotic and insecticidal effects belonging to the enniatin family. Non-ribosomal peptide synthetases (NRPSs) play a crucial role in the synthesis of these secondary metabolites. NRPSs are modularly organized multienzyme complexes in which each module is responsible for the elongation of proteinogenic and non-protein amino acids, as well as carboxyl and hydroxyacids. A minimum of three domains are necessary for one NRPS elongation module: an adenylation (A) domain for substrate recognition and activation; a tholation (T) domain that tethers the growing peptide chain and the incoming aminoacyl unit; and a condensation (C) domain to catalyze peptide bond formation. Some of the optional domains include epimerization (E), heterocyclization (Cy) and oxidation (Ox) domains, which may modify the enzyme-bound precursors or intermediates. In the present study, we analyzed genomes of B. bassiana and its allied species in Hypocreales to verify the distribution of NRPS-encoding genes involving biosynthesis of beauvericin and bassianolide, and to unveil the evolutionary processes of the gene clusters. Initially, we retrieved completely or partially assembled genomic sequences of fungal species belonging to Hypocreales from public databases. SM biosynthesizing genes were predicted from the selected genomes using antiSMASH program. Adenylation (A) domains were extracted from the predicted NRPS, NRPS-like and NRPS-PKS hybrid genes, and used them to construct a phylogenetic tree. Based on the preliminary results of SM biosynthetic gene prediction in B. bassiana, we analyzed the conserved gene orders of beauvericin and bassianolide biosynthetic gene clusters among the hypocrealean fungi. Reciprocal best blast hit (RBH) approach was performed to identify the regions orthologous to the biosynthetic gene cluster in the selected fungal genomes. A clear recombination pattern was recognized in the inferred A-domain tree in which A-domains in the 1st and 2nd modules of beauvericin and bassianolide synthetases were grouped in CYCLO and EAS clades, respectively, suggesting that two modules of each synthetase have evolved independently. In addition, inferred topologies were congruent with the species phylogeny of Cordycipitaceae, indicating that the gene fusion event have occurred before the species divergence. Beauvericin and bassianolide synthetases turned out to possess identical domain organization as C-A-T-C-A-NM-T-T-C. We also predicted precursors of beauvericin and bassianolide synthetases based on the extracted signature residues in A-domain core motifs. The result showed that the A-domains in the 1st module of both synthetases select D-2-hydroxyisovalerate (D-Hiv), while A-domains in the 2nd modules specifically activate L-phenylalanine (Phe) in beauvericin synthetase and leucine (Leu) in bassianolide synthetase. antiSMASH ver. 2.0 predicted 15 genes in the beauvericin biosynthetic gene cluster of the B. bassiana genome dispersed across a total length of approximately 50kb. The beauvericin biosynthetic gene cluster contains beauvericin synthetase as well as kivr gene encoding NADPH-dependent ketoisovalerate reductase which is necessary to convert 2-ketoisovalarate to D-Hiv and a gene encoding a putative Gal4-like transcriptional regulator. Our syntenic comparison showed that species in Cordycipitaceae have almost conserved beauvericin biosynthetic gene cluster although the gene order and direction were sometimes variable. It is intriguing that there is no region orthologous to beauvericin synthetase gene in Cordyceps militaris genome. It is likely that beauvericin synthetase was present in common ancestor of Cordycipitaceae but selective gene loss has occurred in several species including C. militaris. Putative bassianolide biosynthetic gene cluster consisted of 16 genes including bassianolide synthetase, cytochrome P450 monooxygenase, and putative Gal4-like transcriptional regulator genes. Our synteny analysis found that only B. bassiana possessed a bassianolide synthetase gene among the studied fungi. This result is consistent with the groupings in A-domain tree in which bassianolide synthetase gene found in B. bassiana was not grouped with NRPS genes predicted in other species. We hypothesized that bassianolide biosynthesizing cluster genes in B. bassiana are possibly acquired by horizontal gene transfer (HGT) from distantly related fungi. The present study showed that B. bassiana is the only species capable of producing both beauvericin and bassianolide. This property led to B. bassiana infect multiple hosts and to be a potential biological control agent against agricultural pests.

  • PDF