• Title/Summary/Keyword: Whole genome sequence

Current status of whole-genome sequences of Korean angiosperms

  • Jongsun PARK;Yunho YUN;Hong XI;Woochan KWON;Janghyuk SON
    • Korean Journal of Plant Taxonomy / v.53 no.3 / pp.181-200 / 2023
  • Owing to the rapid development of sequencing technologies, more than 1,000 plant genomes have been sequenced and released. Among them, 69 Korean plant taxa (85 genome sequences) have at least one whole-genome sequence, although some samples were not collected in Korea. The sequencing-by-synthesis method (next-generation sequencing) and the PacBio method (third-generation sequencing) were the most commonly used in the studies, which appeared in 65 publications. Several scaffolding methods, such as Hi-C and 10x, have also been used for pseudo-chromosomal assembly. The most abundant families among the 69 taxa are Rosaceae (10 taxa), Brassicaceae (7 taxa), Fabaceae (7 taxa), and Poaceae (7 taxa). Given the rapid release of plant genomes, it is necessary to consolidate the current understanding of Korean plant species, not only to understand their whole genomes as our own plant resources but also to establish new tools for utilizing plant resources efficiently with various analysis pipelines, including AI-based engines.

Basic Concept of Gene Microarray

  • Hwang, Seung Yong
    • Korean Journal of Biological Psychiatry / v.8 no.2 / pp.203-207 / 2001
  • The genome sequencing project has generated and will continue to generate enormous amounts of sequence data, including 5 eukaryotic and about 60 prokaryotic genomes. Given these ever-increasing amounts of sequence information, new strategies are necessary to efficiently pursue the next phase of the genome project: the elucidation of gene expression patterns and gene product function on a whole-genome scale. In order to assign functional information to the genome sequence, DNA chip (or gene microarray) technology was developed to efficiently identify the differential expression patterns of independent biological samples. The DNA chip provides a new tool for genome expression analysis that may revolutionize many aspects of biotechnology, including new drug discovery and disease diagnostics.

Complete genome sequence of Clostridium perfringens B20, a bacteriocin-producing pathogen

  • Elnar, Arxel G.;Kim, Geun-Bae
    • Journal of Animal Science and Technology / v.63 no.6 / pp.1468-1472 / 2021
  • Clostridium perfringens B20 was isolated from chicken feces collected from a local farm associated with Chung-Ang University (Anseong, Korea). The whole genome of C. perfringens B20 was sequenced using the PacBio RS II platform and assembled de novo. The genome is 2,982,563 bp long and assembled in two contigs. Annotation analyses revealed 2,668 protein-coding sequences, 30 rRNA genes, and 94 tRNA genes, with 28.2% G + C (guanine + cytosine) content. In silico genomic analysis revealed the presence of genes encoding a class IId bacteriocin, lactococcin A, and associated ABC transporter and immunity proteins, as well as a putative bacteriocin gene.

Whole genome sequence analysis of Ligilactobacillus agilis C7 isolated from pig feces revealed three bacteriocin gene clusters

  • Jeong Min, Yoo;Remilyn M., Mendoza;In-Chan, Hwang;Dae-Kyung, Kang
    • Journal of Animal Science and Technology / v.64 no.5 / pp.1008-1011 / 2022
  • Here we report the whole genome sequence of Ligilactobacillus agilis C7, a strain with anti-listerial activity that was isolated from pig feces. The genome of L. agilis C7 (~3.0 Mb) is larger than those of other L. agilis strains. L. agilis C7 carries three bacteriocin gene clusters encoding garvicin Q, salivaricin A, and a Blp family class II bacteriocin. Garvicin Q and salivaricin A are reported to be active against Listeria monocytogenes and Micrococcus luteus, respectively, as well as against other Gram-positive bacteria. Meanwhile, the bacteriocin encoded in the blp cassette was shown to be active against pneumococci, mediating intraspecies competition. This report highlights the potential of L. agilis C7 for the production of bacteriocins that inhibit pathogenic bacteria.

A Survey of the Brassica rapa Genome by BAC-End Sequence Analysis and Comparison with Arabidopsis thaliana

  • Hong, Chang Pyo;Plaha, Prikshit;Koo, Dal-Hoe;Yang, Tae-Jin;Choi, Su Ryun;Lee, Young Ki;Uhm, Taesik;Bang, Jae-Wook;Edwards, David;Bancroft, Ian;Park, Beom-Seok;Lee, Jungho;Lim, Yong Pyo
    • Molecules and Cells / v.22 no.3 / pp.300-307 / 2006
  • Brassica rapa ssp. pekinensis (Chinese cabbage) is an economically important crop and a model plant for studies on polyploidization and phenotypic evolution. To gain insight into the structure of the B. rapa genome, we analyzed 12,017 BAC-end sequences for the presence of transposable elements (TEs), SSRs, centromeric satellite repeats and genes, and for similarity to the closely related genome of Arabidopsis thaliana. TEs were estimated to occupy 14% of the genome, with 12.3% of the genome represented by retrotransposons. It was estimated that the B. rapa genome contains 43,000 genes, 1.6 times as many as the genome of A. thaliana. A number of centromeric satellite sequences, representing variations of a 176-bp consensus sequence, were identified. This sequence has undergone rapid evolution within the B. rapa genome and has diverged among the related species of Brassicaceae. A study of SSRs demonstrated a non-random distribution with a greater abundance within predicted intergenic regions. Our results provide an initial characterization of the genome of B. rapa and a basis for detailed analysis through whole-genome sequencing.

Draft Genome Sequence of Weissella koreensis Strain HJ, a Probiotic Bacterium Isolated from Kimchi

  • Seung-Min Yang;Eiseul Kim;So-Yun Lee;Soyeong Mun;Hae Choon Chang;Hae-Yeong Kim
    • Microbiology and Biotechnology Letters / v.51 no.1 / pp.128-131 / 2023
  • Here we report the draft genome sequence of Weissella koreensis strain HJ and a genomic analysis of its key features. The genome consists of 1,427,571 bp with a GC content of 35.5% and comprises 1,376 coding genes. In silico analysis revealed the absence of pathogenic factors within the genome. The genome harbors several genes that play an important role in survival in the gastrointestinal tract. In addition, a type III polyketide synthase cluster was identified. Pangenome analysis identified 68 unique genes in W. koreensis strain HJ. The genome information of this strain provides the basis for understanding its probiotic properties.

Chromosome-specific polymorphic SSR markers in tropical eucalypt species using low coverage whole genome sequences: systematic characterization and validation

  • Patturaj, Maheswari;Munusamy, Aiswarya;Kannan, Nithishkumar;Kandasamy, Ulaganathan;Ramasamy, Yasodha
    • Genomics & Informatics / v.19 no.3 / pp.33.1-33.10 / 2021
  • Eucalyptus is one of the major plantation species, with a wide variety of industrial uses. Polymorphic and informative simple sequence repeats (SSRs) have a broad range of applications in genetic analysis. In this study, two individuals of Eucalyptus tereticornis (ET217 and ET86) and one individual each of E. camaldulensis (EC17) and E. grandis (EG9) were subjected to whole genome resequencing. Low-coverage (10×) genome sequencing was used to find polymorphic SSRs between the individuals. The average number of SSR loci identified was 95,513, and the density of SSRs per Mb ranged from 155.08 in EC17 to 157.39 in EG9. Among all the SSRs detected, the most abundant repeat motifs were di-nucleotide (59.6%-62.5%), followed by tri- (23.7%-27.2%), tetra- (5.2%-5.6%), penta- (5.0%-5.3%), and hexa-nucleotide (2.7%-2.9%). The predominant SSR motif units were AG/CT and AAG/TTC. Computational genome analysis predicted the SSR length variations between the individuals and identified the gene functions of SSR-containing sequences. A selected subset of polymorphic markers was validated in a full-sib family of eucalypts. Additionally, genome-wide characterization of single nucleotide polymorphisms, InDels and transcriptional regulators was carried out. These variations will be useful in genome-wide association studies as well as in understanding the molecular mechanisms involved in key economic traits. The genomic resources generated in this study should provide an impetus to integrate genomics into marker-trait association studies and breeding of tropical eucalypts.

A Simple Java Sequence Alignment Editing Tool for Resolving Complex Repeat Regions

  • Ham, Seong-Il;Lee, Kyung-Eun;Park, Hyun-Seok
    • Genomics & Informatics / v.7 no.1 / pp.46-48 / 2009
  • Finishing is the most time-consuming step in sequencing, and many genome projects are left unfinished due to complex repeat regions. Here, we have developed BACContigEditor, a prototype shotgun sequence finishing tool. It is essentially an editor that visualizes assemblies of shotgun sequence fragment reads as gapped multiple alignments. The program offers some flexibility that is needed to rapidly resolve complex regions within a working session. The sole purpose of the release is to promote collaborative creation of extensible software for fragment assembly editors, foster collaborative development, and reduce barriers to initial tool development effort. We describe our software architecture and identify current challenges. The program is available under an Open Source license.

Whole genome sequencing of foot-and-mouth disease virus using benchtop next generation sequencing (NGS) system

  • Moon, Sung-Hyun;Oh, Yeonsu;Tark, Dongseob;Cho, Ho-Seong
    • Korean Journal of Veterinary Service / v.42 no.4 / pp.297-300 / 2019
  • In countries that vaccinate against foot-and-mouth disease (FMD), such as Korea, typical clinical signs do not appear, and even in FMD-positive cases it is difficult to isolate FMD virus (FMDV) or obtain its whole genome sequence. To overcome this problem, a more rapid and simple NGS system is required to control FMD in Korea. FMDV (O/Boeun/SKR/2017) RNA was extracted and sequenced using Ion Torrent's benchtop sequencer with an amplicon panel and optimized bioinformatics pipelines. Whole genome sequencing generated 1,839,864 reads (mean read length 283 bp) comprising a total of 521,641,058 bases (≥Q20: 475,327,721). Compared with FMDV (GenBank accession No. MG983730), the FMDV sequence in this study showed 99.83% nucleotide identity. Further study is needed to identify the differences. In this study, a fast and robust method based on a benchtop next-generation sequencing (NGS) system was developed for the analysis of FMDV whole genome sequences.

The Complete Genome Sequence of Southern rice black-streaked dwarf virus Isolated from Vietnam

  • Dinh, Thi-Sau;Zhou, Cuiji;Cao, Xiuling;Han, Chenggui;Yu, Jialin;Li, Dawei;Zhang, Yongliang
    • The Plant Pathology Journal / v.28 no.4 / pp.428-432 / 2012
  • We determined the complete genome sequence of a Vietnamese isolate of Southern rice black-streaked dwarf virus (SRBSDV). Whole genome comparisons and phylogenetic analysis showed that the genome of the Vietnamese isolate shared nucleotide sequence identities of over 97.5% with those of the reported Chinese isolates, confirming their common origin. Moreover, the greatest divergence between different SRBSDV isolates was found in segments S1, S3, S4 and S6, which differs from the sequence alignment results between SRBSDV and Rice black-streaked dwarf virus (RBSDV), implying that SRBSDV evolved in a unique way independent of RBSDV. This is the first report of a complete nucleotide sequence of SRBSDV from Vietnam, and our data provide new clues for further understanding of the molecular variation and epidemiology of SRBSDV in Southeast Asia.