• 제목/요약/키워드: Genome sequence

검색결과 1,399건 처리시간 0.025초

Complete Genome Sequence Analysis of Two Divergent Groups of Sweet potato chlorotic fleck virus Isolates Collected from Korea

  • Kwak, Hae-Ryun;Kim, Jaedeok;Kim, Mikyeong;Seo, Jang-Kyun;Kim, Jeong-Soo;Choi, Hong-Soo
    • The Plant Pathology Journal
    • /
    • 제34권5호
    • /
    • pp.451-457
    • /
    • 2018
  • The Sweet potato chlorotic fleck virus (SPCFV), of the genus Carlavirus (family Betaflexiviridae), was first detected as one of several viruses infecting sweet potatoes (Ipomea batatas L.) in Korea. Out of 154 sweet potato samples collected in 2012 that were showing virus-like symptoms, 47 (31%) were infected with SPCFV, along with other viruses. The complete genome sequences of four SPCFV isolates were determined and analyzed using previously reported genome sequences. The complete genomes were found to contain 9,104-9,108 nucleotides, excluding the poly-A tail, containing six putative open reading frames (ORFs). Further, the SPCFV Korean isolates were divided into two groups (Group I and Group II) by phylogenetic analysis based on the complete nucleotide sequences; Group I and Group II had low nucleotide sequence identities of about 73%. For the first time, we determined the complete genome sequence for the Group II SPCFV isolates. The amino acid sequence identity in coat proteins (CP) between the two groups was over 90%, whereas the amino acid sequence identity in other proteins was less than 80%. In addition, SPCFV Korean isolates had a low amino acid sequence identity (61% CPs and 47% in the nucleotide-binding protein [NaBp] region) to that of Melon yellowing-associated virus (MYaV), a typical Carlavirus.

웹 기반 통합 유전체 분석 시스템의 설계 및 구현 (The Design and Implementation of Web-Based Integrated Genome Analysis Tools)

  • 최범순;이경희;권해룡;조완섭;이충세;김영창
    • 한국멀티미디어학회논문지
    • /
    • 제7권3호
    • /
    • pp.408-417
    • /
    • 2004
  • 유전체 분석 과정은 여러 단계를 걸쳐 다양한 소프트웨어 분석 도구가 사용되는 복잡한 작업을 수반한다. 유전체 분석과 관련된 기존의 소프트웨어 도구들은 대부분 리눅스나 유닉스 기반 프로그램 이므로 생물학자가 이들을 설치하고 사용하는데 어려움과 불편함이 많은 실정이다. 또한, 분석의 각 단계별로 생산되는 파일은 수작업을 통한 변환을 거쳐야만 다음 단계의 입력으로 사용될 수 있다. 근래에 웹을 기반으로한 도구들이 개발되고 있으나 한번에 하나의 서열을 처리하는 방식이므로 대량의 실험 데이터를 분석하는 경우에는 반복 작업으로 인한 시간과 노력이 요구되는 단점을 갖고 있다. 본 논문에서는 유전체 분석에 필요한 여러 도구들을 웹 환경에서 하나의 그래픽 사용자 인터페이스로 통합하여 생물학자들이 보다 쉽게 서열과 기능을 분석할 수 있도록 한 WGAT(Web-based Genome Analysis Tool)를 제안한다. WGAT는 리눅스 서버에 유전체 데이터 분석프로그램을 구동하고, 클라이언트 웹 (web)에서 데이터 파일과 분석에 필요한 선택사항들을 입력함으로써 한번에 여러 단계의 분석 작업을 수작업 없이 자동으로 처리할 수 있다. WGAT시스템의 생산성을 분석하기 위하여 기존의 방식과 WGAT를 이용한 방식의 서열분석 처리 시간을 비교한 결과 서열 단편의 개수가 1000개인 경우 기존의 방식보다 20배 이상 분석 능력이 향상됨을 확인할 수 있었다.

  • PDF

The complete chloroplast genome sequence of Avena sterilis L. using Illumina sequencing

  • Raveendar, Sebastin;Lee, Gi-An;Lee, Kyung Jun;Shin, Myoung-Jae;Cho, Yang-Hee;Ma, Kyung-Ho;Chung, Jong-Wook;Lee, Jung-Ro
    • 한국작물학회:학술대회논문집
    • /
    • 한국작물학회 2017년도 9th Asian Crop Science Association conference
    • /
    • pp.139-139
    • /
    • 2017
  • The complete chloroplast genome sequence of Avena sterilis L., a dominant wild oat species in the family Poaceae, is first reported in this study. The complete cp genome sequence of A. sterilis is 135,887 bp in length with 38.5% overall GC content and exhibits a typical quadripartite structure comprising one pair of inverted repeats (21, 603 bp) separated by a small single-copy region (12,575 bp) and a large single-copy region (80,106). The A. sterilis cp genome encodes 111 unique genes, 76 of which are protein-coding genes, 4 rRNA genes, 30 tRNA genes and 18 duplicated genes in the inverted repeat region. Nine genes contain one or two introns. Pair-wise alignments of cp genome were performed for genome-wide comparison. This newly determined cp genome sequence of A. sterilis will provide valuable information for the future breeding programs of valuable cereal crops in the family Poaceae.

  • PDF

Whole Genome Resequencing of Heugu (Korean Black Cattle) for the Genome-Wide SNP Discovery

  • Choi, Jung-Woo;Chung, Won-Hyong;Lee, Kyung-Tai;Choi, Jae-Won;Jung, Kyoung-Sub;Cho, Yongmin;Kim, Namshin;Kim, Tae-Hun
    • 한국축산식품학회지
    • /
    • 제33권6호
    • /
    • pp.715-722
    • /
    • 2013
  • Heugu (Korea Black Cattle) is one of the indigenous cattle breeds in Korea; however there has been severe lack of genomic studies on the breed. In this study, we report the first whole genome resequencing of Heugu at higher sequence coverage using Illumina HiSeq 2000 platform. More than 153.6 Giga base pairs sequence was obtained, of which 97% of the reads were mapped to the bovine reference sequence assembly (UMD 3.1). The number of non-redundantly mapped sequence reads corresponds to approximately 28.9-fold coverage across the genome. From these data, we identified a total of over six million single nucleotide polymorphisms (SNPs), of which 29.4% were found to be novel using the single nucleotide polymorphism database build 137. Extensive annotation was performed on all the detected SNPs, showing that most of SNPs were located in intergenic regions (70.7%), which is well corresponded with previous studies. Of the total SNPs, we identified substantial numbers of non-synonymous SNPs (13,979) in 5,999 genes, which could potentially affect meat quality traits in cattle. These results provide genome-wide SNPs that can serve as useful genetic tools and as candidates in searches for phenotype-altering DNA difference implicated with meat quality traits in cattle. The importance of this study can be further pronounced with the first whole genome sequencing of the valuable local genetic resource to be used in further genomic comparison studies with diverse cattle breeds.

Draft Genome Sequence of the Reference Strain of the Korean Medicinal Mushroom Wolfiporia cocos KMCC03342

  • Bogun Kim;Byoungnam Min;Jae-Gu Han;Hongjae Park;Seungwoo Baek;Subin Jeong;In-Geol Choi
    • Mycobiology
    • /
    • 제50권4호
    • /
    • pp.254-257
    • /
    • 2022
  • Wolfiporia cocos is a wood-decay brown rot fungus belonging to the family Polyporaceae. While the fungus grows, the sclerotium body of the strain, dubbed Bokryeong in Korean, is formed around the roots of conifer trees. The dried sclerotium has been widely used as a key component of many medicinal recipes in East Asia. Wolfiporia cocos strain KMCC03342 is the reference strain registered and maintained by the Korea Seed and Variety Service for commercial uses. Here, we present the first draft genome sequence of W. cocos KMCC03342 using a hybrid assembly technique combining both short- and long-read sequences. The genome has a total length of 55.5 Mb comprised of 343 contigs with N50 of 332 kb and 95.8% BUSCO completeness. The GC ratio was 52.2%. We predicted 14,296 protein-coding gene models based on ab initio gene prediction and evidence-based annotation procedure using RNAseq data. The annotated genome was predicted to have 19 terpene biosynthesis gene clusters, which was the same number as the previously sequenced W. cocos strain MD-104 genome but higher than Chinese W. cocos strains. The genome sequence and the predicted gene clusters allow us to study biosynthetic pathways for the active ingredients of W. cocos.

Complete genome sequence of candidate probiotic Limosilactobacillus fermentum KUFM407

  • Bogun Kim;Ji yu Heo;Xiaoyue Xu;Hyunju Lee;Duleepa Pathiraja;Jae-Young Kim;Yi Hyun Choi;In-Geol Choi;Sae Hun Kim
    • Journal of Animal Science and Technology
    • /
    • 제66권4호
    • /
    • pp.859-862
    • /
    • 2024
  • It has been reported that the administration of Limosilactobacillus fermentum alleviates diseases such as osteoporosis and colitis. In this study, we report the complete genome sequence of Limosilactobacillus fermentum KUFM407, a probiotic strain of LAB isolated from Korean traditional fermented food, Kimchi. Whole genome sequencing of L. fermentum KUFM407 was performed on the Illumina MiSeq and Oxford Nanopore MinION platform. The genome consisted of one circular chromosome (2,077,616 base pair [bp]) with a guanine cytosine (GC) content of 51.5% and one circular plasmid sequence (13,931 bp). Genome annotation identified 1,932 protein-coding genes, 15 rRNAs, and 58 tRNAs in the assembly. The function annotation of the predicted proteins revealed genes involved in the biosynthesis of bacteriocin and fatty acids. The complete genome of L. fermentum KUFM407 could provide valuable information for the development of new probiotic food and health supplements.

Whole Genome Sequence of Streptomyces sp. from Novel Marine Actinomycetes

  • Hyeon Kyeong Lee;Heung-Soon Park;Eung-Soo Kim;Si-Sun Choi
    • 한국미생물·생명공학회지
    • /
    • 제51권3호
    • /
    • pp.325-327
    • /
    • 2023
  • This paper presents the complete genome sequence of a novel marine actinomycete, Streptomyces sp. MMBL 11-1. The genome of Streptomyces sp. MMBL 11-1 was obtained through next-generation sequencing using the PacBio Sequel system and Illumina platform provided by Macrogen, Korea. The assembled genome consists of five contigs, with a total length of 8,496,900 bp and a G+C content of 71.6%. The genome harbors multiple biosynthetic gene clusters (BGCs) associated with producing microbial natural products (MNPs). The comprehensive genomic information of this type of strain will serve as a valuable resource for identifying other marine actinomycetes strains.

A Short History of the Genome-Wide Association Study: Where We Were and Where We Are Going

  • Ikegawa, Shiro
    • Genomics & Informatics
    • /
    • 제10권4호
    • /
    • pp.220-225
    • /
    • 2012
  • Recent rapid advances in genetic research are ushering us into the genome sequence era, where an individual's genome information is utilized for clinical practice. The most spectacular results of the human genome study have been provided by genome-wide association studies (GWASs). This is a review of the history of GWASs as related to my work. Further efforts are necessary to make full use of its potential power to medicine.

A data management system for microbial genome projects

  • Ki-Bong Kim;Hyeweon Nam;Hwajung Seo and Kiejung Park
    • 한국생물정보학회:학술대회논문집
    • /
    • 한국생물정보시스템생물학회 2000년도 International Symposium on Bioinformatics
    • /
    • pp.83-85
    • /
    • 2000
  • A lot of microbial genome sequencing projects is being done in many genome centers around the world, since the first genome, Haemophilus influenzae, was sequenced in 1995. The deluge of microbial genome sequence data demands new and highly automatic data flow system in order for genome researchers to manage and analyze their own bulky sequence data from low-level to high-level. In such an aspect, we developed the automatic data management system for microbial genome projects, which consists mainly of local database, analysis programs, and user-friendly interface. We designed and implemented the local database for large-scale sequencing projects, which makes systematic and consistent data management and retrieval possible and is tightly coupled with analysis programs and web-based user interface, That is, parsing and storage of the results of analysis programs in local database is possible and user can retrieve the data in any level of data process by means of web-based graphical user interface. Contig assembly, homology search, and ORF prediction, which are essential in genome projects, make analysis programs in our system. All but Contig assembly program are open as public domain. These programs are connected with each other by means of a lot of utility programs. As a result, this system will maximize the efficiency in cost and time in genome research.

  • PDF