• 제목/요약/키워드: whole genome analysis

검색결과 328건 처리시간 0.025초

유전체 코호트 연구를 위한 대용량 염기서열 분석 (High Throughput Genotyping for Genomic Cohort Study)

  • 박웅양
    • Journal of Preventive Medicine and Public Health
    • /
    • 제40권2호
    • /
    • pp.102-107
    • /
    • 2007
  • Human Genome Project (HGP) could unveil the secrets of human being by a long script of genetic codes, which enabled us to get access to mine the cause of diseases more efficiently. Two wheels for HGP, bioinformatics and high throughput technology are essential techniques for the genomic medicine. While microarray platforms are still evolving, we can screen more than 500,000 genotypes at once. Even we can sequence the whole genome of an organism within a day. Because the future medicne will focus on the genetic susceptibility of individuals, we need to find genetic variations of each person by efficient genotyping methods.

Analysis of whole genome sequencing and virulence factors of Vibrio vulnificus 1908-10 isolated from sea water at Gadeok island coast

  • Hee-kyung Oh;Nameun Kim;Do-Hyung Kim;Hye-Young Shin;Eun-Woo Lee;Sung-Hwan Eom;Young-Mog Kim
    • Fisheries and Aquatic Sciences
    • /
    • 제26권9호
    • /
    • pp.558-568
    • /
    • 2023
  • Vibrio vulnificus is an aquatic bacterium causing septicemia and wound infection in humans. To understand this pathogen at the genomic level, it was performed whole genome sequencing of a cefoxitin-resistant strain, V. vulnificus 1908-10 possessing virulence-related genes (vvhA, viuB, and vcgC) isolated from Gadeok island coastal seawater in South Korea. The genome of V. vulnificus 1908-10 consisted of two circular contigs and no plasmid. The total genome size was estimated to be 5,018,425 bp with a guanine-cytosine (GC) content of 46.9%. We found 119 tRNA and 34 rRNA genes respectively in the genome, along with 4,352 predicted protein sequences. Virulence factor (VF) analysis further revealed that V. vulnificus 1908-10 possess various virulence genes in classes of adherence, antiphagocytosis, chemotaxis and motility, iron uptake, quorum sensing, secretion system, and toxin. In the comparison of the presence/absence of virulence genes, V. vulnificus 1908-10 had fur, hlyU, luxS, ompU, pilA, pilF, rtxA, rtxC, and vvhA. Of the 30 V. vulnificus comparative strains, 80% of the C-genotype strains have all of these genes, whereas 40% of the E-genotype strains have all of them. In particular, pilA were identified in 80% of the C-type strains and 40% of the E-type strains, showing more difference than other genes. Therefore, V. vulnificus 1908-10 had similar VF characteristics to those of type C strains. Multifunctional-autoprocessing repeats-in-toxin (MARTX) toxin of V. vulnificus 1908-10 contained 8 A-type repeats (GXXGXXXXXG), 25 B.1-type repeats (TXVGXGXX), 18 B2-type repeats (GGXGXDXXX), and 7 C-type repeats (GGXGXDXXX). The National Center for Biotechnology Information (NCBI) Basic Local Alignment Search Tool (BLAST) showed that the RtxA protein of V. vulnificus 1908-10 had the effector domain in the order of cross-liking domain (ACD)-C58_PaToxP-like domain- α/β hydrolase-C58_PaToxP-like domain.

Characterization of a Potential Probiotic Lactiplantibacillus plantarum LRCC5310 by Comparative Genomic Analysis and its Vitamin B6 Production Ability

  • Yunjeong Lee;Nattira Jaikwang;Seong keun Kim;Jiseon Jeong;Ampaitip Sukhoom;Jong-Hwa Kim;Wonyong Kim
    • Journal of Microbiology and Biotechnology
    • /
    • 제33권5호
    • /
    • pp.644-655
    • /
    • 2023
  • Safety assessment and functional analysis of probiotic candidates are important for their industrial applications. Lactiplantibacillus plantarum is one of the most widely recognized probiotic strains. In this study we aimed to determine the functional genes of L. plantarum LRCC5310, isolated from kimchi, using next-generation, whole-genome sequencing analysis. Genes were annotated using the Rapid Annotations using Subsystems Technology (RAST) server and the National Center for Biotechnology Information (NCBI) pipelines to establish the strain's probiotic potential. Phylogenetic analysis of L. plantarum LRCC5310 and related strains showed that LRCC5310 belonged to L. plantarum. However, comparative analysis revealed genetic differences between L. plantarum strains. Carbon metabolic pathway analysis based on the Kyoto Encyclopedia of Genes and Genomes database showed that L. plantarum LRCC5310 is a homofermentative bacterium. Furthermore, gene annotation results indicated that the L. plantarum LRCC5310 genome encodes an almost complete vitamin B6 biosynthetic pathway. Among five L. plantarum strains, including L. plantarum ATCC 14917T , L. plantarum LRCC5310 detected the highest concentration of pyridoxal 5'-phosphate with 88.08 ± 0.67 nM in MRS broth. These results indicated that L. plantarum LRCC5310 could be used as a functional probiotic for vitamin B6 supplementation.

Comparison of Normalization Methods for Defining Copy Number Variation Using Whole-genome SNP Genotyping Data

  • Kim, Ji-Hong;Yim, Seon-Hee;Jeong, Yong-Bok;Jung, Seong-Hyun;Xu, Hai-Dong;Shin, Seung-Hun;Chung, Yeun-Jun
    • Genomics & Informatics
    • /
    • 제6권4호
    • /
    • pp.231-234
    • /
    • 2008
  • Precise and reliable identification of CNV is still important to fully understand the effect of CNV on genetic diversity and background of complex diseases. SNP marker has been used frequently to detect CNVs, but the analysis of SNP chip data for identifying CNV has not been well established. We compared various normalization methods for CNV analysis and suggest optimal normalization procedure for reliable CNV call. Four normal Koreans and NA10851 HapMap male samples were genotyped using Affymetrix Genome-Wide Human SNP array 5.0. We evaluated the effect of median and quantile normalization to find the optimal normalization for CNV detection based on SNP array data. We also explored the effect of Robust Multichip Average (RMA) background correction for each normalization process. In total, the following 4 combinations of normalization were tried: 1) Median normalization without RMA background correction, 2) Quantile normalization without RMA background correction, 3) Median normalization with RMA background correction, and 4) Quantile normalization with RMA background correction. CNV was called using SW-ARRAY algorithm. We applied 4 different combinations of normalization and compared the effect using intensity ratio profile, box plot, and MA plot. When we applied median and quantile normalizations without RMA background correction, both methods showed similar normalization effect and the final CNV calls were also similar in terms of number and size. In both median and quantile normalizations, RMA backgroundcorrection resulted in widening the range of intensity ratio distribution, which may suggest that RMA background correction may help to detect more CNVs compared to no correction.

Development of A Monkey Kidney Cell Line Which Expresses Poliovirus Capsid Protein

  • Choi, Weon-Sang
    • 대한바이러스학회지
    • /
    • 제28권4호
    • /
    • pp.295-302
    • /
    • 1998
  • The RNA genome of poliovirus encodes a long polyprotein precursor and this polyprotein is cleaved proteolytically by viral protease to yield mature proteins. The mature proteins derived from the P1 polyprotein precursor are the component of capsids. To further delineate the process of capsid assembly and encapsidation, in a first attempt, a cell line which expresses the authentic P1 polyprotein was established. CV-1 cells were transfected with the pRCRSVS1P1 plasmid DNA which contains 5'ncr sequences, whole authentic capsid gene of poliovirus and neomycin resistance gene. These cells were treated with G418 for 3 months, and eventually G418 resistant cells were selected and formed colonies. Each colony was picked and grown in the media containing G418. DNA analysis indicated that 1 of 13 neomycin resistant cell lines (R2-18) contains whole poliovirus P1 capsid gene segment which was incorporated into the genome. Immuneprecipitation of cell lysates with sera from rabbit immunized with inactivateded Sabin type 1 particles demonstrated the constitutive expression of the poliovirus P1 capsid protein from R2-18.

  • PDF

Flanking Sequence and Copy-Number Analysis of Transformation Events by Integrating Next-Generation Sequencing Technology with Southern Blot Hybridization

  • Qin, Yang;Woo, Hee-Jong;Shin, Kong-Sik;Lim, Myung-Ho;Cho, Hyun-Suk;Lee, Seong-Kon
    • Plant Breeding and Biotechnology
    • /
    • 제5권4호
    • /
    • pp.269-281
    • /
    • 2017
  • With the continual development of genetically modified (GM) crops, it has become necessary to develop detailed and effective molecular characterization methods to select candidate events from a large pool of transformation events. Relative to traditional molecular analysis methods such as the polymerase chain reaction (PCR) and Southern blot hybridization, next generation sequencing (NGS) technology for whole-genome sequencing of complex crop genomes had proven comparatively useful for in-depth molecular characterization. In this study, four transformation events, including one in Bacillus thuringiensis (Bt)-resistant rice, one in resveratrol-producing rice, and two in beta-carotene-enhanced soybeans, were selected for molecular characterization. To merge NGS analysis and Southern blot-hybridization results, we confirmed the transgene insertion sites, insertion construction, and insertion numbers of these four transformation events. In addition, the read-coverage depth assessed by NGS analysis for inserted genes might provide consistent results in terms of inserted T-DNA numbers in case of complex insertion structures and highly duplicated donor genomes; however, PCR-based methods can produce incorrect conclusions. Our combined method provides an effective and complete analytical approach for whole-genome visual inspection of transformation events that require biosafety assessment.

Next-generation sequencing for the genetic characterization of Maedi/Visna virus isolated from the northwest of China

  • Zhao, Ling;Zhang, Liang;Shi, Xiaona;Duan, Xujie;Li, Huiping;Liu, Shuying
    • Journal of Veterinary Science
    • /
    • 제22권6호
    • /
    • pp.66.1-66.9
    • /
    • 2021
  • Background: Maedi/Visna virus (MVV) is a contagious viral pathogen that causes considerable economic losses to the sheep industry worldwide. Objectives: In China, MVV has been detected in several regions, but its molecular characteristics and genetic variations were not thoroughly investigated. Methods: Therefore, in this study, we conducted next-generation sequencing on an MVV strain obtained from northwest China to reveal its genetic evolution via phylogenetic analysis. Results: A MVV strain obtained from Inner Mongolia (NM) of China was identified. Sequence analysis indicated that its whole-genome length is 9193 bp. Homology comparison of nucleotides between the NM strain and reference strains showed that the sequence homology of gag and env were 77.1%-86.8% and 67.7%-75.5%, respectively. Phylogenetic analysis revealed that the NM strain was closely related to the reference strains isolated from America, which belong to the A2 type. Notably, there were 5 amino acid insertions in variable region 4 and a highly variable motif at the C-terminal of the surface glycoprotein (SU5). Conclusions: The present study is the first to show the whole-genome sequence of an MVV obtained from China. The detailed analyses provide essential information for understanding the genetic characteristics of MVV, and the results enrich the MVV library.

Molecular characteristics of Budgerigar fledgling disease polyomavirus detected from parrots in South Korea

  • Kim, Sungryong;Kim, Su-Jin;Na, Ki-Jeong
    • Journal of Veterinary Science
    • /
    • 제23권5호
    • /
    • pp.67.1-67.11
    • /
    • 2022
  • Background: Budgerigar fledgling disease polyomavirus (BFDV) is the pathogen that causes budgerigar fledgling disease in psittacine species. The clinical signs of PBFV infection include ascites, hepatitis, and crop stasis. BFDV is associated with a high mortality rate in nestling birds. In contrast, adult birds only have mild symptoms such as feather dystrophy. Objectives: This study aimed to determine the prevalence, genetic characteristics, and phylogenetic analysis of BFDV in pet parrots in Korea. Methods: Fecal and tissue samples were collected from 217 pet parrots from 10 veterinary hospitals including Chungbuk National University Veterinary Hospital. The molecular screening was performed using polymerase chain reaction (PCR) analysis of the small t/large T antigen gene segment. Full-length genome sequencing with the Sanger and phylogenetic analysis were performed on BFDV-positive samples. Results: The PCR results based on the small t/large T antigen gene marker indicated that BFDV DNA was present in 10 out of 217 screened samples. A whole-genome sequence was obtained from six strains and phylogenetic analysis revealed no significant relationship existed between the species and geographical locations amongst them. Conclusions: The prevalence of BFDV infection in South Korea is not high when compared to the prevalence of BFDV in other parts of the world, however, it has been reported sporadically in various species and geographic locations. The whole-genome analysis revealed 0.2%-0.3% variation in intragenomic homogeneity among the six strains analyzed. Korean strains are separately on the phylogenetic tree from their counterparts from China and Japan which might reflect the substantial genetic variation.

Validation of Customized Cancer Panel for Detecting Somatic Mutations and Copy Number Alterations

  • Choi, Su-Hye;Jung, Seung-Hyun;Chung, Yeun-Jun
    • Genomics & Informatics
    • /
    • 제15권4호
    • /
    • pp.136-141
    • /
    • 2017
  • Accurate detection of genomic alterations, especially druggable hotspot mutations in tumors, has become an essential part of precision medicine. With targeted sequencing, we can obtain deeper coverage of reads and handle data more easily with a relatively lower cost and less time than whole-exome or whole-genome sequencing. Recently, we designed a customized gene panel for targeted sequencing of major solid cancers. In this study, we aimed to validate its performance. The cancer panel targets 95 cancer-related genes. In terms of the limit of detection, more than 86% of target mutations with a mutant allele frequency (MAF) <1% can be identified, and any mutation with >3% MAF can be detected. When we applied this system for the analysis of Acrometrix Oncology Hotspot Control DNA, which contains more than 500 COSMIC mutations across 53 genes, 99% of the expected mutations were robustly detected. We also confirmed the high reproducibility of the detection of mutations in multiple independent analyses. When we explored copy number alterations (CNAs), the expected CNAs were successfully detected, and this result was confirmed by target-specific genomic quantitative polymerase chain reaction. Taken together, these results support the reliability and accuracy of our cancer panel in detecting mutations. This panel could be useful for key mutation profiling research in solid tumors and clinical translation.

Systems-Level Analysis of Genome-Scale In Silico Metabolic Models Using MetaFluxNet

  • Lee, Sang-Yup;Woo, Han-Min;Lee, Dong-Yup;Choi, Hyun-Seok;Kim, Tae-Yong;Yun, Hong-Seok
    • Biotechnology and Bioprocess Engineering:BBE
    • /
    • 제10권5호
    • /
    • pp.425-431
    • /
    • 2005
  • The systems-level analysis of microbes with myriad of heterologous data generated by omics technologies has been applied to improve our understanding of cellular function and physiology and consequently to enhance production of various bioproducts. At the heart of this revolution resides in silico genome-scale metabolic model, In order to fully exploit the power of genome-scale model, a systematic approach employing user-friendly software is required. Metabolic flux analysis of genome-scale metabolic network is becoming widely employed to quantify the flux distribution and validate model-driven hypotheses. Here we describe the development of an upgraded MetaFluxNet which allows (1) construction of metabolic models connected to metabolic databases, (2) calculation of fluxes by metabolic flux analysis, (3) comparative flux analysis with flux-profile visualization, (4) the use of metabolic flux analysis markup language to enable models to be exchanged efficiently, and (5) the exporting of data from constraints-based flux analysis into various formats. MetaFluxNet also allows cellular physiology to be predicted and strategies for strain improvement to be developed from genome-based information on flux distributions. This integrated software environment promises to enhance our understanding on metabolic network at a whole organism level and to establish novel strategies for improving the properties of organisms for various biotechnological applications.