• Title/Summary/Keyword: Nucleotide sequencing

Search Result 638, Processing Time 0.024 seconds

Genome-Wide SNP Calling Using Next Generation Sequencing Data in Tomato

  • Kim, Ji-Eun;Oh, Sang-Keun;Lee, Jeong-Hee;Lee, Bo-Mi;Jo, Sung-Hwan
    • Molecules and Cells
    • /
    • v.37 no.1
    • /
    • pp.36-42
    • /
    • 2014
  • The tomato (Solanum lycopersicum L.) is a model plant for genome research in Solanaceae, as well as for studying crop breeding. Genome-wide single nucleotide polymorphisms (SNPs) are a valuable resource in genetic research and breeding. However, to do discovery of genome-wide SNPs, most methods require expensive high-depth sequencing. Here, we describe a method for SNP calling using a modified version of SAMtools that improved its sensitivity. We analyzed 90 Gb of raw sequence data from next-generation sequencing of two resequencing and seven transcriptome data sets from several tomato accessions. Our study identified 4,812,432 non-redundant SNPs. Moreover, the workflow of SNP calling was improved by aligning the reference genome with its own raw data. Using this approach, 131,785 SNPs were discovered from transcriptome data of seven accessions. In addition, 4,680,647 SNPs were identified from the genome of S. pimpinellifolium, which are 60 times more than 71,637 of the PI212816 transcriptome. SNP distribution was compared between the whole genome and transcriptome of S. pimpinellifolium. Moreover, we surveyed the location of SNPs within genic and intergenic regions. Our results indicated that the sufficient genome-wide SNP markers and very sensitive SNP calling method allow for application of marker assisted breeding and genome-wide association studies.

Genome re-sequencing to identify single nucleotide polymorphism markers for muscle color traits in broiler chickens

  • Kong, H.R.;Anthony, N.B.;Rowland, K.C.;Khatri, B.;Kong, B.C.
    • Asian-Australasian Journal of Animal Sciences
    • /
    • v.31 no.1
    • /
    • pp.13-18
    • /
    • 2018
  • Objective: Meat quality including muscle color in chickens is an important trait and continuous selective pressures for fast growth and high yield have negatively impacted this trait. This study was conducted to investigate genetic variations responsible for regulating muscle color. Methods: Whole genome re-sequencing analysis using Illumina HiSeq paired end read method was performed with pooled DNA samples isolated from two broiler chicken lines divergently selected for muscle color (high muscle color [HMC] and low muscle color [LMC]) along with their random bred control line (RAN). Sequencing read data was aligned to the chicken reference genome sequence for Red Jungle Fowl (Galgal4) using reference based genome alignment with NGen program of the Lasergene software package. The potential causal single nucleotide polymorphisms (SNPs) showing non-synonymous changes in coding DNA sequence regions were chosen in each line. Bioinformatic analyses to interpret functions of genes retaining SNPs were performed using the ingenuity pathways analysis (IPA). Results: Millions of SNPs were identified and totally 2,884 SNPs (1,307 for HMC and 1,577 for LMC) showing >75% SNP rates could induce non-synonymous mutations in amino acid sequences. Of those, SNPs showing over 10 read depths yielded 15 more reliable SNPs including 1 for HMC and 14 for LMC. The IPA analyses suggested that meat color in chickens appeared to be associated with chromosomal DNA stability, the functions of ubiquitylation (UBC) and quality and quantity of various subtypes of collagens. Conclusion: In this study, various potential genetic markers showing amino acid changes were identified in differential meat color lines, that can be used for further animal selection strategy.

Development of an efficient genotyping-by-sequencing (GBS) library construction method for genomic analysis of grapevine

  • Jang, Hyun A;Oh, Sang-Keun
    • Korean Journal of Agricultural Science
    • /
    • v.44 no.4
    • /
    • pp.495-503
    • /
    • 2017
  • Genotyping-by-sequencing (GBS) is an outstanding technology for genotyping and single nucleotide polymorphism (SNP) discovery compared to next generation sequencing (NGS) because it can save time when analyzing large-scale samples and carries a low cost per sample. Recently, studies using GBS have been conducted on major crops and, to a greater extent, on fruit crops. However, many researchers have some problems due to low GBS efficiency resulting from low quality GBS libraries. To overcome this limitation, we developed an efficient GBS library construction method that regulates important conditions such as restriction enzymes (RE) digestion and a PCR procedure for grapevine. For RE digestion, DNA samples are digested with ApeKI (3.6U) at $75^{\circ}C$ for 5 hours and adapters are ligated to the ends of gDNA products. To produce suitable PCR fragments for sequencing, we modified the PCR amplification conditions; temperature cycling consisted of $72^{\circ}C$ (5 min), $98^{\circ}C$ (30 s), followed by 16 cycles of $98^{\circ}C$ (30 s), $65^{\circ}C$ (30 s), $72^{\circ}C$ (20 s) with a final extension step. As a result, we had obtained optimal library construct sizes (200 to 400 bp) for GBS analysis. Furthermore, it not only increased the mapping efficiency by approximately 10.17% compared to the previous method, but also produced mapped reads which were distributed equally on the19 chromosomes in the grape genome. Therefore, we suggest that this system can be used for various fruit crops and is expected to increase the efficiency of various genomic analysis performed.

Application of Whole Exome Sequencing to Identify Disease-Causing Variants in Inherited Human Diseases

  • Goh, Gerald;Choi, Murim
    • Genomics & Informatics
    • /
    • v.10 no.4
    • /
    • pp.214-219
    • /
    • 2012
  • The recent advent of next-generation sequencing technologies has dramatically changed the nature of biomedical research. Human genetics is no exception-it has never been easier to interrogate human patient genomes at the nucleotide level to identify disease-associated variants. To further facilitate the efficiency of this approach, whole exome sequencing (WES) was first developed in 2009. Over the past three years, multiple groups have demonstrated the power of WES through robust disease-associated variant discoveries across a diverse spectrum of human diseases. Here, we review the application of WES to different types of inherited human diseases and discuss analytical challenges and possible solutions, with the aim of providing a practical guide for the effective use of this technology.

Molecular cloning and nucleotide sequencing of bovine viral diarrhea virus gp53 antigenic region (소 바이러스성 설사병 바이러스 gp53 항원부위 유전자의 재조합 및 염기서열 연구)

  • Yeo, Sang-geon;Cho, HJ;Masri, SA
    • Korean Journal of Veterinary Research
    • /
    • v.35 no.2
    • /
    • pp.287-295
    • /
    • 1995
  • Molecular cloning and nucleotide sequencing were undertaken for the RNA genome of gp53 antigenic region in cytopathic Singer strain of bovine viral diarrhea virus. The cloned cDNA was 939 nucleotides in length having a base composition of 31.0% A, 19.6% C, 25.5% G and 24.0% T. The sequence was corresponded to approximately 77.8%(817 bases) of predicted gp53 region and 122 bases after 3'end of gp53 region in the Singer strain when compared with NADL strain of known sequence. A single open reading frame was found in the sequence of 2nd frame and was deduced as encoding 312 amino acids.

  • PDF

LGMD2E with a novel nonsense variant in SGCB gene: a case of LGMD2E with a novel variant

  • La, Yun Kyung;Oh, Eun Kyoung;Lyou, Hyun Ji;Hong, Ji Man;Choi, Young-Chul
    • Annals of Clinical Neurophysiology
    • /
    • v.22 no.1
    • /
    • pp.29-32
    • /
    • 2020
  • Sarcoglycanopathies are a rare group of autosomal recessive limb-girdle muscular dystrophies (LGMDs) caused by genetic variants in α-, β-, γ-, or δ-sarcoglycan that maintain membrane integrity and contribute to molecular signal processing. High-throughput nucleotide sequencing was performed in patients with slowly progressive proximal muscle weakness from early childhood with respiratory involvement, which detected a novel homozygous nonsense variant (c.601C>T;p.Gln201Ter) in SGCB. This report informs about the clinical characteristics of LGMD2E (type-2E LGMD) in Korea and provides genetic confirmation of the disease.

Trend and Technology of Gene and Genome Research (유전자 및 유전체 연구 기술과 동향)

  • 이진성;김기환;서동상;강석우;황재삼
    • Journal of Sericultural and Entomological Science
    • /
    • v.42 no.2
    • /
    • pp.126-141
    • /
    • 2000
  • A major step towards understanding of the genetic basis of an organism is the complete sequence determination of all genes in target genome. The nucleotide sequence encoded in the genome contains the information that specifies the amino acid sequence of every protein and functional RNA molecule. In principle, it will be possible to identify every protein resposible for the structure and function of the body of the target organism. The pattern of expression in different cell types will specify where and when each protein is used. The amino acid sequence of the proteins encoded by each gene will be derived from the conceptional translation of the nucleotide sequence. Comparison of these sequences with those of known proteins, whose sequences are sorted in database, will suggest an approximate function for many proteins. This mini review describes the development of new sequencing methods and the optimization of sequencing strategies for whole genome, various cDNA and genomic analysis.

  • PDF

Cloning and Sequencing of Nonstructural Protein (NSP4) Gene of Bovine Rotavirus Isolated in Korea (국내분리 소 로타바이러스 NSP4 유전자의 크로닝 및 염기서열 분석)

  • Back, Myoung-Soon;Kim, Won-Yong;Kang, Shien-Young
    • The Journal of Korean Society of Virology
    • /
    • v.30 no.1
    • /
    • pp.1-10
    • /
    • 2000
  • The nonstructural protein NSP4, encoded by gene 10 of rotavirus, has been shown to playa role in viral assembly and known to be an enterotoxin, causing diarrhea in mouse pups. NSP4 gene was cloned from CBNU-2 (virulent bovine rotavirus/diarrheic fecal sample) and CBNU-1 (cell-culture adapted bovine rotavirus/isolated from CBNU-2 and 75 times passaged on MA104 cells), respectively, by reverse transcriptase-polymerase chain reaction (RT-PCR) and sequenced and compared. The sequence data indicated that the NSP4 genes of bovine rotavirus (BRV) were 751 bases in length and encoded one open reading frame of 175 amino acids beginning at base 42 and terminating at base 569. Differences in nucleotide sequence between CBNU-2 and CBNU-1 were observed at 6 positions (base 274, 296, 391, 394, 396 and 579). NSP4 gene of BRV exhibited a high degree of nucleotide (90% and 94%) and amino acid sequence (91% and 97%) homology with those of SA11 and UK but a low degree of nucleotide (77% and 79%) and amino acids sequence (81% and 85%) homology with those of Wa and OSU.

  • PDF

MAP: Mutation Arranger for Defining Phenotype-Related Single-Nucleotide Variant

  • Baek, In-Pyo;Jeong, Yong-Bok;Jung, Seung-Hyun;Chung, Yeun-Jun
    • Genomics & Informatics
    • /
    • v.12 no.4
    • /
    • pp.289-292
    • /
    • 2014
  • Next-generation sequencing (NGS) is widely used to identify the causative mutations underlying diverse human diseases, including cancers, which can be useful for discovering the diagnostic and therapeutic targets. Currently, a number of single-nucleotide variant (SNV)-calling algorithms are available; however, there is no tool for visualizing the recurrent and phenotype-specific mutations for general researchers. In this study, in order to support defining the recurrent mutations or phenotype-specific mutations from NGS data of a group of cancers with diverse phenotypes, we aimed to develop a user-friendly tool, named mutation arranger for defining phenotype-related SNV (MAP). MAP is a user-friendly program with multiple functions that supports the determination of recurrent or phenotype-specific mutations and provides graphic illustration images to the users. Its operation environment, the Microsoft Windows environment, enables more researchers who cannot operate Linux to define clinically meaningful mutations with NGS data from cancer cohorts.

Genetic Stock Identification of Common Carp (Cyprinus carpio) by Detection of Intraspecific DNA Sequence Variation in the Mitochondrial 12S rRNA Gene (미토콘드리아 12S rRNA 유전자 변이 조사를 통한 잉어(Cyprinus carpio)의 유전학적 동정)

  • 남윤권;주수동;정창화;노충환;조재윤;김동수
    • Journal of Aquaculture
    • /
    • v.10 no.4
    • /
    • pp.403-407
    • /
    • 1997
  • Intraspecific sequence variation was detected by polymerase chain reaction (PCR) and direct sequencing of a 350-nucleotide region of the mitochondrial 12S rRNA gene of two natural populations (Han River and Nakdong River) and one hatchery stock (Jinhae Inland Fisheries Institute) of local strain common carp, one Israeli strain of common carp stock from Pukyong National University (PKU), and one hybrid between Israeli strain of common carp female and local strain common carp male from PKU stock. There is little variation in 350 bases of the mitochondrial 12S rRNA gene sequences among 2 natural and 1 hatchery local strain common carp populatins, representing abut 7 to 20 nucleotide differences (less than 6%). The sequence of specimens from Han River was more similar to that from Nakdong River (identity=98.0%) than to that from Jinhae Inland Fisheries Institute (identity=96.3%). Sequence variation between Israeli strain and wild local strain common carp was higher than the variation within natural stocks. The level of variation was ranged from 15.7 to 17.7%. The hybrid showed very similar nucleotide4 sequence of 12S rRNA gene to the sequence of Israeli strain with the identity of 98.9%.

  • PDF