• Title/Summary/Keyword: Whole genome sequence

Search Result 223, Processing Time 0.019 seconds

Whole-genome sequence analysis through online web interfaces: a review

  • Gunasekara, A.W.A.C.W.R.;Rajapaksha, L.G.T.G.;Tung, T.L.
    • Genomics & Informatics
    • /
    • 제20권1호
    • /
    • pp.3.1-3.10
    • /
    • 2022
  • The recent development of whole-genome sequencing technologies paved the way for understanding the genomes of microorganisms. Every whole-genome sequencing (WGS) project requires a considerable cost and a massive effort to address the questions at hand. The final step of WGS is data analysis. The analysis of whole-genome sequence is dependent on highly sophisticated bioinformatics tools that the research personal have to buy. However, many laboratories and research institutions do not have the bioinformatics capabilities to analyze the genomic data and therefore, are unable to take maximum advantage of whole-genome sequencing. In this aspect, this study provides a guide for research personals on a set of bioinformatics tools available online that can be used to analyze whole-genome sequence data of bacterial genomes. The web interfaces described here have many advantages and, in most cases exempting the need for costly analysis tools and intensive computing resources.

Whole Mitochondrial Genome Sequence of an Indian Plasmodium falciparum Field Isolate

  • Tyagi, Suchi;Pande, Veena;Das, Aparup
    • Parasites, Hosts and Diseases
    • /
    • 제52권1호
    • /
    • pp.99-103
    • /
    • 2014
  • Mitochondrial genome sequence of malaria parasites has served as a potential marker for inferring evolutionary history of the Plasmodium genus. In Plasmodium falciparum, the mitochondrial genome sequences from around the globe have provided important evolutionary understanding, but no Indian sequence has yet been utilized. We have sequenced the whole mitochondrial genome of a single P. falciparum field isolate from India using novel primers and compared with the 3D7 reference sequence and 1 previously reported Indian sequence. While the 2 Indian sequences were highly divergent from each other, the presently sequenced isolate was highly similar to the reference 3D7 strain.

전유전체(Whole gerlome) 서열 분석과 가시화를 위한 워크벤치 개발 (Development of Workbench for Analysis and Visualization of Whole Genome Sequence)

  • 최정현;진희정;김철민;장철훈;조환규
    • 정보처리학회논문지A
    • /
    • 제9A권3호
    • /
    • pp.387-398
    • /
    • 2002
  • 최근 활발한 소단위 게놈 프로젝트의 수행으로 많은 생물체의 유전체 전체 서열이 밝혀짐에 따라서 전유전체(whole genome)를 기본 단위로 하여 개별 유전자나 그에 관련된 기능 연구가 매우 활발히 이루어지고 있다. 전유전체의 염기 서열은 수백만 bp(base pairs)에서 수백억 bp(base pairs) 정도의 대용량 텍스트 데이터이기 때문에 단순한 온라인 문자 일치(on-line string matching) 알고리즘으로 분석하는 것은 매우 비효율적이다. 본 논문에서는 대용량의 유전체 서열을 분석하는데 적합한 자료 구조인 스트링 B-트리를 사용하여 유전체 서열의 분석과 가시화를 위한 워크벤치를 개발한 과정을 소개한다. 본 연구에서 개발한 시스템은 크게 질의문 부분과 가시화 부분으로 나뉘어 진다. 질의문 부분에는 유전체 서열에 특정 서열이 나타나는 부분의 위치와 횟수를 알아보거나 k번 나타나는 서열을 조사하는 것과 같은 기본적인 패턴 검색 부분과 k-mer 분석을 위한 질의어가 다양하게 준비되어 있다. 가시화 부분은 전유전체 서열과 주석(annotation)을 보여주거나, 유전체 분석을 용이하도록 여러 가시화 방법, CGR(Chaos Game Representation), k-mer graph, RWP(Random Walk Plot) 등으로 생물학자들이 쉽게 전체 구조와 특성 파악할 수 있도록 도와준다. 본 논문이 제안하는 분석 시스템은 생물체의 진화적 관계를 밝히고, 염색체 내에 아직 알려지지 않은 새로운 유전자나 기능이 밝혀지지 않은 junk DNA들의 기능 등을 연구하는데 사용할 수 있다.

Complete Genome Sequence of the Enterobacter asburiae IK3 Isolated from a Soybean (Glycine max) Rhizosphere

  • Sihyun Park;GyuDae Lee;Ikwhan Kim;Yeongyu Jeong;Jae-Ho Shin
    • 한국미생물·생명공학회지
    • /
    • 제51권3호
    • /
    • pp.306-308
    • /
    • 2023
  • This research presents the whole-genome sequence of Enterobacter asburiae strain IK3, which was isolated from the rhizosphere soil of soybean (Glycine max). The genome of the strain is composed of a single chromosome with 4 plasmids, total size of 5,084,040 bp, and the GC content is 55.5%.

High-Quality Whole Genome Sequence of a Linezolid-Resistant and Vancomycin-Susceptible Enterococcus faecalis Isolate ES-2-1 from a Pig Stool in South Korea

  • Jun Bong Lee;Nguyen Thi Mai Tho;Se Kye Kim;Jang Won Yoon
    • 한국미생물·생명공학회지
    • /
    • 제52권1호
    • /
    • pp.88-90
    • /
    • 2024
  • We report the whole genome sequence of a linezolid-resistant and vancomycin-susceptible Enterococcus faecalis strain, ES-2-1, which was isolated from a pig stool in South Korea. The assembled genome of ES-2-1 consists of a 2,648,168-bp circular chromosome containing the optrA gene (encoding the ABC-F type ribosomal protection protein), an 84,891-bp plasmid containing numerous antimicrobial resistance genes, and an 82,106-bp cryptic plasmid. The ES-2-1 strain belongs to sequence type 1024 (ST1024) and carries multidrug resistant genes including the optrA (oxazolidinone phenicol transferable resistance A) gene, which confers linezolid resistance.

Complete Genome Sequence of Bifidobacterium bifidum DS0908, Isolated from Human Fecal Sample

  • Haneol Yang;Yong-Sik Kim;Doo-Sang Park
    • 한국미생물·생명공학회지
    • /
    • 제51권4호
    • /
    • pp.566-568
    • /
    • 2023
  • In this report, we present the whole-genome sequence of Bifidobacterium bifidum DS0908 isolated from the human fecal sample. The genome composed of a single circular chromosome is 2,223,317 bp long and the DNA G+C content is 62.65%. No virulence genes were detected in the genomic sequences of B. bifidum DS0908.

Survey of the Applications of NGS to Whole-Genome Sequencing and Expression Profiling

  • Lim, Jong-Sung;Choi, Beom-Soon;Lee, Jeong-Soo;Shin, Chan-Seok;Yang, Tae-Jin;Rhee, Jae-Sung;Lee, Jae-Seong;Choi, Ik-Young
    • Genomics & Informatics
    • /
    • 제10권1호
    • /
    • pp.1-8
    • /
    • 2012
  • Recently, the technologies of DNA sequence variation and gene expression profiling have been used widely as approaches in the expertise of genome biology and genetics. The application to genome study has been particularly developed with the introduction of the nextgeneration DNA sequencer (NGS) Roche/454 and Illumina/ Solexa systems, along with bioinformation analysis technologies of whole-genome $de$ $novo$ assembly, expression profiling, DNA variation discovery, and genotyping. Both massive whole-genome shotgun paired-end sequencing and mate paired-end sequencing data are important steps for constructing $de$ $novo$ assembly of novel genome sequencing data. It is necessary to have DNA sequence information from a multiplatform NGS with at least $2{\times}$ and $30{\times}$ depth sequence of genome coverage using Roche/454 and Illumina/Solexa, respectively, for effective an way of de novo assembly. Massive shortlength reading data from the Illumina/Solexa system is enough to discover DNA variation, resulting in reducing the cost of DNA sequencing. Whole-genome expression profile data are useful to approach genome system biology with quantification of expressed RNAs from a wholegenome transcriptome, depending on the tissue samples. The hybrid mRNA sequences from Rohce/454 and Illumina/Solexa are more powerful to find novel genes through $de$ $novo$ assembly in any whole-genome sequenced species. The $20{\times}$ and $50{\times}$ coverage of the estimated transcriptome sequences using Roche/454 and Illumina/Solexa, respectively, is effective to create novel expressed reference sequences. However, only an average $30{\times}$ coverage of a transcriptome with short read sequences of Illumina/Solexa is enough to check expression quantification, compared to the reference expressed sequence tag sequence.

Whole Genome Resequencing of Heugu (Korean Black Cattle) for the Genome-Wide SNP Discovery

  • Choi, Jung-Woo;Chung, Won-Hyong;Lee, Kyung-Tai;Choi, Jae-Won;Jung, Kyoung-Sub;Cho, Yongmin;Kim, Namshin;Kim, Tae-Hun
    • 한국축산식품학회지
    • /
    • 제33권6호
    • /
    • pp.715-722
    • /
    • 2013
  • Heugu (Korea Black Cattle) is one of the indigenous cattle breeds in Korea; however there has been severe lack of genomic studies on the breed. In this study, we report the first whole genome resequencing of Heugu at higher sequence coverage using Illumina HiSeq 2000 platform. More than 153.6 Giga base pairs sequence was obtained, of which 97% of the reads were mapped to the bovine reference sequence assembly (UMD 3.1). The number of non-redundantly mapped sequence reads corresponds to approximately 28.9-fold coverage across the genome. From these data, we identified a total of over six million single nucleotide polymorphisms (SNPs), of which 29.4% were found to be novel using the single nucleotide polymorphism database build 137. Extensive annotation was performed on all the detected SNPs, showing that most of SNPs were located in intergenic regions (70.7%), which is well corresponded with previous studies. Of the total SNPs, we identified substantial numbers of non-synonymous SNPs (13,979) in 5,999 genes, which could potentially affect meat quality traits in cattle. These results provide genome-wide SNPs that can serve as useful genetic tools and as candidates in searches for phenotype-altering DNA difference implicated with meat quality traits in cattle. The importance of this study can be further pronounced with the first whole genome sequencing of the valuable local genetic resource to be used in further genomic comparison studies with diverse cattle breeds.

Whole-Genome Analysis of Salmonella Enterica subsp. Enterica serovar Gallinarum biovar Gallinarum Strain IJES3-1 Isolated from a Retail Chicken Shell Egg in Korea

  • Beom Soon Jang;Kun Taek Park
    • 한국식품위생안전성학회지
    • /
    • 제39권4호
    • /
    • pp.353-355
    • /
    • 2024
  • Salmonella enterica subsp. enterica serovar Gallinarum biovar Gallinarum causes fowl typhoid in poultry. In this study, we isolated Salmonella from a Korean retail chicken shell egg and performed whole-genome sequencing, from which we identified one chromosome (4,659,977-bp) and two plasmids (plasmid_1: 87,506 bp and plasmid_2: 2,331 bp). The isolate serotype was confirmed to be Gallinarum, with a biovar type of Gallinarum, which was finally identified as Salmonella enterica subsp. enterica serovar Gallinarum biovar Gallinarum. Multilocus sequence typing confirmed that the isolate was that of sequence type 78. The antimicrobial resistance gene, aac(6')-laa, was identified on the chromosome, and 166 virulence genes were detected on the chromosome and plasmid_1.