• Title/Summary/Keyword: Genome analysis

Search Result 2,360, Processing Time 0.035 seconds

Comparative Genomic Analysis of Food-Originated Coagulase-Negative Staphylococcus: Analysis of Conserved Core Genes and Diversity of the Pan-Genome

  • Heo, Sojeong;Lee, Jung-Sug;Lee, Jong-Hoon;Jeong, Do-Won
    • Journal of Microbiology and Biotechnology
    • /
    • v.30 no.3
    • /
    • pp.341-351
    • /
    • 2020
  • To shed light on the genetic differences among food-originated coagulase-negative Staphylococcus (CNS), we performed pan-genome analysis of five species: Staphylococcus carnosus (two strains), Staphylococcus equorum (two strains), Staphylococcus succinus (three strains), Staphylococcus xylosus (two strains), and Staphylococcus saprophyticus (one strain). The pan-genome size increases with each new strain and currently holds about 4,500 genes from 10 genomes. Specific genes were shown to be strain dependent but not species dependent. Most specific genes were of unknown function or encoded restriction-modification enzymes, transposases, or prophages. Our results indicate that unique genes have been acquired or lost by convergent evolution within individual strains.

The complete chloroplast genome sequence of Hibiscus sabdariffa (Malvaceae)

  • KWON, Soon-Ho;PARK, Yunmi;JANG, You Lim;KWON, Hae-Yun
    • Korean Journal of Plant Taxonomy
    • /
    • v.52 no.2
    • /
    • pp.123-126
    • /
    • 2022
  • Hibiscus sabdariffa L., (roselle) in the Malvaceae family is an erect subshrub known to be native to India and Malaysia. It is widely used as a food or tea material around the world, and its therapeutic effects have been widely studied. In this study, the sequencing of the complete chloroplast genome of H. sabdariffa was carried out. The result indicates a genome size of 162,428 bp, which is composed of a large single copy of 90,327 bp, two inverted repeats of 26,242 bp each, and a small single copy of 19,617 bp. Overall, a total of 131 genes were predicted, including 86 coding sequences, 37 tRNAs, and 8 rRNAs. According to a phylogenic analysis, it was clearly distinguished from outgroups such as other species of the genus Hibiscus used in the analysis.

Generation and analysis of whole-genome sequencing data in human mammary epithelial cells

  • Jong-Lyul Park;Jae-Yoon Kim;Seon-Young Kim;Yong Sun Lee
    • Genomics & Informatics
    • /
    • v.21 no.1
    • /
    • pp.11.1-11.5
    • /
    • 2023
  • Breast cancer is the most common cancer worldwide, and advanced breast cancer with metastases is incurable mainly with currently available therapies. Therefore, it is essential to understand molecular characteristics during the progression of breast carcinogenesis. Here, we report a dataset of whole genomes from the human mammary epithelial cell system derived from a reduction mammoplasty specimen. This system comprises pre-stasis 184D cells, considered normal, and seven cell lines along cancer progression series that are immortalized or additionally acquired anchorage-independent growth. Our analysis of the whole-genome sequencing (WGS) data indicates that those seven cancer progression series cells have somatic mutations whose number ranges from 8,393 to 39,564 (with an average of 30,591) compared to 184D cells. These WGS data and our mutation analysis will provide helpful information to identify driver mutations and elucidate molecular mechanisms for breast carcinogenesis.

Comparison of Normalization Methods for Defining Copy Number Variation Using Whole-genome SNP Genotyping Data

  • Kim, Ji-Hong;Yim, Seon-Hee;Jeong, Yong-Bok;Jung, Seong-Hyun;Xu, Hai-Dong;Shin, Seung-Hun;Chung, Yeun-Jun
    • Genomics & Informatics
    • /
    • v.6 no.4
    • /
    • pp.231-234
    • /
    • 2008
  • Precise and reliable identification of CNV is still important to fully understand the effect of CNV on genetic diversity and background of complex diseases. SNP marker has been used frequently to detect CNVs, but the analysis of SNP chip data for identifying CNV has not been well established. We compared various normalization methods for CNV analysis and suggest optimal normalization procedure for reliable CNV call. Four normal Koreans and NA10851 HapMap male samples were genotyped using Affymetrix Genome-Wide Human SNP array 5.0. We evaluated the effect of median and quantile normalization to find the optimal normalization for CNV detection based on SNP array data. We also explored the effect of Robust Multichip Average (RMA) background correction for each normalization process. In total, the following 4 combinations of normalization were tried: 1) Median normalization without RMA background correction, 2) Quantile normalization without RMA background correction, 3) Median normalization with RMA background correction, and 4) Quantile normalization with RMA background correction. CNV was called using SW-ARRAY algorithm. We applied 4 different combinations of normalization and compared the effect using intensity ratio profile, box plot, and MA plot. When we applied median and quantile normalizations without RMA background correction, both methods showed similar normalization effect and the final CNV calls were also similar in terms of number and size. In both median and quantile normalizations, RMA backgroundcorrection resulted in widening the range of intensity ratio distribution, which may suggest that RMA background correction may help to detect more CNVs compared to no correction.

Genetic Polymorphisms of the Bovine NOV Gene Are Significantly Associated with Carcass Traits in Korean Cattle

  • Kim, B.S.;Kim, S.C.;Park, C.M.;Lee, S.H.;Cho, S.H.;Kim, N.K.;Jang, G.W.;Yoon, D.H.;Yang, B.S.;Hong, S.K.;Seong, H.H.;Choi, B.H.
    • Asian-Australasian Journal of Animal Sciences
    • /
    • v.26 no.6
    • /
    • pp.780-787
    • /
    • 2013
  • The objective of this study was to investigate single nucleotide polymorphisms (SNPs) in the bovine nephroblastoma overexpressed (NOV) gene and to evaluate whether these polymorphisms affect carcass traits in the Korean cattle population. We resequenced to detect SNPs from 24 unrelated individuals and identified 19 SNPs within the full 8.4-kb gene, including the 1.5-kb promoter region. Of these 19 SNPs, four were selected for genotyping based on linkage disequilibrium (LD). We genotyped 429 steers to assess the associations of these four SNPs with carcass traits. Statistical analysis revealed that g.7801T>C and g.8379A>C polymorphisms in the NOV gene were associated with carcass weight (p = 0.012 and 0.008, respectively), and the g.2005A>G polymorphism was associated with the back fat thickness (BF) trait (p = 0.0001). One haplotype of the four SNPs (GGTA) was significantly associated with BF (p = 0.0005). Our findings suggest that polymorphisms in the NOV gene may be among the important genetic factors affecting carcass yield in beef cattle.