• Title/Summary/Keyword: Genome analysis

Search Result 2,394, Processing Time 0.024 seconds

Complete Genome Sequence of Bacillus subtilis NIB353 Isolated from Nuruk

  • Jeong-Ah Yoon;Se-Young Kwun;Eun-Hee Park;Myoung-Dong Kim
    • Microbiology and Biotechnology Letters
    • /
    • v.51 no.3
    • /
    • pp.289-292
    • /
    • 2023
  • Thermotolerant Bacillus subtilis NIB353 was isolated from Nuruk, a traditional Korean fermentation starter. The complete B. subtilis NIB353 genome sequence was obtained using MinION and Illumina (MiSeq) platforms. The B. subtilis NIB353 genome sequence was 4,247,447 bp with a GC content of 43%. The B. subtilis NIB353 strain exhibited orthologous average nucleotide identity values of 98.39% and 98.38% with B. subtilis 168 and B. subtilis ATCC6051a, respectively. The genome has been deposited in GenBank under the accession number NZ_CP089148.1.

Complete Genomic Characterization of Two Beet Soil-Borne Virus Isolates from Turkey: Implications of Comparative Analysis of Genome Sequences

  • Moradi, Zohreh;Maghdoori, Hossein;Nazifi, Ehsan;Mehrvar, Mohsen
    • The Plant Pathology Journal
    • /
    • v.37 no.2
    • /
    • pp.152-161
    • /
    • 2021
  • Sugar beet (Beta vulgaris L.) is known as a key product for agriculture in several countries across the world. Beet soil-borne virus (BSBV) triggers substantial economic damages to sugar beet by reducing the quantity of the yield and quality of the beet sugars. We conducted the present study to report the complete genome sequences of two BSBV isolates in Turkey for the first time. The genome organization was identical to those previously established BSBV isolates. The tripartite genome of BSBV-TR1 and -TR3 comprised a 5,835-nucleotide (nt) RNA1, a 3,454-nt RNA2, and a 3,005-nt RNA3 segment. According to sequence identity analyses, Turkish isolates were most closely related to the BSBV isolate reported from Iran (97.83-98.77% nt identity). The BSBV isolates worldwide (n = 9) were phylogenetically classified into five (RNA-coat protein read through gene [CPRT], TGB1, and TGB2 segments), four (RNA-rep), or three (TGB3) lineages. In genetic analysis, the TGB3 revealed more genetic variability (Pi = 0.034) compared with other regions. Population selection analysis revealed that most of the codons were generally under negative selection or neutral evolution in the BSBV isolates studied. However, positive selection was detected at codon 135 in the TGB1, which could be an adaptation in order to facilitate the movement and overcome the host plant resistance genes. We expect that the information on genome properties and genetic variability of BSBV, particularly in TGB3, TGB1, and CPRT genes, assist in developing effective control measures in order to prevent severe losses and make amendments in management strategies.

Generation of Whole-Genome Sequencing Data for Comparing Primary and Castration-Resistant Prostate Cancer

  • Park, Jong-Lyul;Kim, Seon-Kyu;Kim, Jeong-Hwan;Yun, Seok Joong;Kim, Wun-Jae;Kim, Won Tae;Jeong, Pildu;Kang, Ho Won;Kim, Seon-Young
    • Genomics & Informatics
    • /
    • v.16 no.3
    • /
    • pp.71-74
    • /
    • 2018
  • Because castration-resistant prostate cancer (CRPC) does not respond to androgen deprivation therapy and has a very poor prognosis, it is critical to identify a prognostic indicator for predicting high-risk patients who will develop CRPC. Here, we report a dataset of whole genomes from four pairs of primary prostate cancer (PC) and CRPC samples. The analysis of the paired PC and CRPC samples in the whole-genome data showed that the average number of somatic mutations per patients was 7,927 in CRPC tissues compared with primary PC tissues (range, 1,691 to 21,705). Our whole-genome sequencing data of primary PC and CRPC may be useful for understanding the genomic changes and molecular mechanisms that occur during the progression from PC to CRPC.

Chromosome-specific polymorphic SSR markers in tropical eucalypt species using low coverage whole genome sequences: systematic characterization and validation

  • Patturaj, Maheswari;Munusamy, Aiswarya;Kannan, Nithishkumar;Kandasamy, Ulaganathan;Ramasamy, Yasodha
    • Genomics & Informatics
    • /
    • v.19 no.3
    • /
    • pp.33.1-33.10
    • /
    • 2021
  • Eucalyptus is one of the major plantation species with wide variety of industrial uses. Polymorphic and informative simple sequence repeats (SSRs) have broad range of applications in genetic analysis. In this study, two individuals of Eucalyptus tereticornis (ET217 and ET86), one individual each from E. camaldulensis (EC17) and E. grandis (EG9) were subjected to whole genome resequencing. Low coverage (10×) genome sequencing was used to find polymorphic SSRs between the individuals. Average number of SSR loci identified was 95,513 and the density of SSRs per Mb was from 157.39 in EG9 to 155.08 in EC17. Among all the SSRs detected, the most abundant repeat motifs were di-nucleotide (59.6%-62.5%), followed by tri- (23.7%-27.2%), tetra- (5.2%-5.6%), penta- (5.0%-5.3%), and hexa-nucleotide (2.7%-2.9%). The predominant SSR motif units were AG/CT and AAG/TTC. Computational genome analysis predicted the SSR length variations between the individuals and identified the gene functions of SSR containing sequences. Selected subset of polymorphic markers was validated in a full-sib family of eucalypts. Additionally, genome-wide characterization of single nucleotide polymorphisms, InDels and transcriptional regulators were carried out. These variations will find their utility in genome-wide association studies as well as understanding of molecular mechanisms involved in key economic traits. The genomic resources generated in this study would provide an impetus to integrate genomics in marker-trait associations and breeding of tropical eucalypts.

Construction of a Computation Web Server for Genome Analysis (제놈 분석용 계산 Web 서버의 구성)

  • Park, Kie-Jung;Lee, Byung-Wook;Park, Yong-Ha
    • Microbiology and Biotechnology Letters
    • /
    • v.24 no.1
    • /
    • pp.132-136
    • /
    • 1996
  • A comutation server is needed to provide analysis programs to Korean biologists, especially genome researchers, on GINet. For each analysis program, we implmented an input form with HTML and a CGI program for interface between an input form and an analysis program with C language on GINet computatin Web server. We made two construction methods of CGI programs for analysis programs, and implemented all CGI programs based on the methods followed by modifying each CGI program for specific processing of each program. On the server ten programs are availabel now, which include most frequently used ones and those developed by our team, and most programs with will be ported or developed by our team will be available on the Web server.

  • PDF

Complete Sequence Analysis of a Korean Isolate of Chinese Yam Necrotic Mosaic Virus and Generation of the Virus Specific Primers for Molecular Detection

  • Kwon, Sun-Jung;Cho, In-Sook;Choi, Seung-Kook;Yoon, Ju-Yeon;Choi, Gug-Seoun
    • Research in Plant Disease
    • /
    • v.22 no.3
    • /
    • pp.194-197
    • /
    • 2016
  • Chinese yam necrotic mosaic virus (CYNMV) is one of the most widespread viruses in Chinese yam (Dioscorea opposita Thunb.) and causes serious yield losses. Currently, genetic information of CYNMV is very restricted and complete genome sequences of only two isolates (one from Japan and another from China) have been reported. In this study, we determined complete genome sequence of the CYNMV isolate AD collected from Andong, Korea. Genetic analysis of the polyprotein amino acid sequence revealed that the Korean isolate AD has high similarity with the Japanese isolate PES3 (97%) but relatively low similarity with the Chinese isolate FX1 (78%). Phylogenetic analysis using the CYNMV 3' proximal nucleotide sequences harboring the coat protein and 3' untranslated region further supported genetic relationship among the CYNMV isolates. Based on comparative analysis of the CYNMV genome sequences determined in this study and other previous studies, we generated molecular detection primers that are highly specific and efficient for CYNMV diagnosis.

Dietary Patterns and Prevalence Odds Ratio in Middle-aged Adults of Rural and Mid-size City in Korean Genome Epidemiology Study (40대 이상 농촌 및 중소도시 성인의 식품섭취 패턴 (Pattern)과 질환별 유병위험도 - 한국인유전체역학조사사업 일부 대상자에 대해 -)

  • Ahn, Youn-Jhin;Park, Yun-Ju;Park, Seon-Joo;Min, Hae-Sook;Kwak, Hye-Kyoung;Oh, Kyung-Soo;Park, Chan
    • Journal of Nutrition and Health
    • /
    • v.40 no.3
    • /
    • pp.259-269
    • /
    • 2007
  • Recently, dietary pattern analysis was emerged as an approach to examine the relationships between diet and risk of chronic diseases. This study was to identify groups with population who report similar dietary pattern in Korean genome epidemiology study (KoGES) and association with several chronic diseases. The cohort participants living in Ansung and Ansan (Gyeonggi province) were totally 10,038. Among those, 6,873 subjects with no missing values in food frequency questionnaire were included in this analysis. After combining 103 food items into 17 food groups, 4 dietary factors were obtained by factor analysis based on their weights. Factor 1 showed high factor loadings in vegetables, mushrooms, meats, fish, beverages, and oriental-cereals. Factor 2 had high factor loadings in vegetables, fruits, fish, and factor 3 had high factor loadings in cereal-oriental, cerial-western and snacks. Factor 4 showed positive high factor loadings in rice and Kimchi and negative factor loadings in mushrooms and milk and dairy products. Using factor scores of four factors, subjects were classified into 3 clusters by K-means clustering. We named those 'Rice and Kimchi eating' group, 'Contented eating' group, and 'Healthy and light eating' group depending on their eating characteristics. 'Rice and Kimchi eating' group showed high prevalence in men, farmers and 60s. 'Contented eating' group and 'Healthy and light eating' group had high prevalence in women, people living in urban area (Ansan Citizen), with high-school education and above, and a monthly income of one million won and more. 'Contented eating' group appeared lower distribution proportion in the sixties and 'Healthy and light eating' group does higher in the fifties. 'Contented eating' versus 'Rice and Kimchi eating', odds ratio for hypertension, diabetes, metabolic syndrome and obesity significantly decreased after adjusting age and sex (OR=0.64, 0.73, and 0.85 respectively, 95% CI). Although our results were from a cross-sectional study, these imply that the dietary patterns were related to diseases.

Analysis of Nuclear Mitochondrial DNA Segments of Nine Plant Species: Size, Distribution, and Insertion Loci

  • Ko, Young-Joon;Kim, Sangsoo
    • Genomics & Informatics
    • /
    • v.14 no.3
    • /
    • pp.90-95
    • /
    • 2016
  • Nuclear mitochondrial DNA segment (Numt) insertion describes a well-known phenomenon of mitochondrial DNA transfer into a eukaryotic nuclear genome. However, it has not been well understood, especially in plants. Numt insertion patterns vary from species to species in different kingdoms. In this study, the patterns were surveyed in nine plant species, and we found some tip-offs. First, when the mitochondrial genome size is relatively large, the portion of the longer Numt is also larger than the short one. Second, the whole genome duplication event increases the ratio of the shorter Numt portion in the size distribution. Third, Numt insertions are enriched in exon regions. This analysis may be helpful for understanding plant evolution.

A Primer for Disease Gene Prioritization Using Next-Generation Sequencing Data

  • Wang, Shuoguo;Xing, Jinchuan
    • Genomics & Informatics
    • /
    • v.11 no.4
    • /
    • pp.191-199
    • /
    • 2013
  • High-throughput next-generation sequencing (NGS) technology produces a tremendous amount of raw sequence data. The challenges for researchers are to process the raw data, to map the sequences to genome, to discover variants that are different from the reference genome, and to prioritize/rank the variants for the question of interest. The recent development of many computational algorithms and programs has vastly improved the ability to translate sequence data into valuable information for disease gene identification. However, the NGS data analysis is complex and could be overwhelming for researchers who are not familiar with the process. Here, we outline the analysis pipeline and describe some of the most commonly used principles and tools for analyzing NGS data for disease gene identification.

The complete chloroplast genome of Limonium tetragonum (Plumbaginaceae) isolated in Korea

  • KIM, Yongsung;XI, Hong;PARK, Jongsun
    • Korean Journal of Plant Taxonomy
    • /
    • v.51 no.3
    • /
    • pp.337-344
    • /
    • 2021
  • The chloroplast genome of Limonium tetragonum (Thunb.) Bullock, a halophytic species, was sequenced to understand genetic differences based on its geographical distribution. The cp genome of L. tetragonum was 154,689 bp long (GC ratio is 37.0%) and has four subregions: 84,572 bp of large single-copy (35.3%) and 12,813 bp of small single-copy (31.5%) regions were separated by 28,562 bp of inverted repeat (40.9%) regions. It contained 128 genes (83 protein-coding genes, eight rRNAs, and 37 tRNAs). Thirty-five single-nucleotide polymorphisms and 33 INDEL regions (88 bp in length) were identified. Maximum-likelihood and Bayesian inference phylogenetic trees showed that L. tetragonum formed a sister group with L. aureum, which is incongruent with certain previous studies, including a phylogenetic analysis.