• Title/Summary/Keyword: Genome-wide

Search Result 695, Processing Time 0.03 seconds

Sample Size and Statistical Power Calculation in Genetic Association Studies

  • Hong, Eun-Pyo;Park, Ji-Wan
    • Genomics & Informatics
    • /
    • v.10 no.2
    • /
    • pp.117-122
    • /
    • 2012
  • A sample size with sufficient statistical power is critical to the success of genetic association studies to detect causal genes of human complex diseases. Genome-wide association studies require much larger sample sizes to achieve an adequate statistical power. We estimated the statistical power with increasing numbers of markers analyzed and compared the sample sizes that were required in case-control studies and case-parent studies. We computed the effective sample size and statistical power using Genetic Power Calculator. An analysis using a larger number of markers requires a larger sample size. Testing a single-nucleotide polymorphism (SNP) marker requires 248 cases, while testing 500,000 SNPs and 1 million markers requires 1,206 cases and 1,255 cases, respectively, under the assumption of an odds ratio of 2, 5% disease prevalence, 5% minor allele frequency, complete linkage disequilibrium (LD), 1:1 case/control ratio, and a 5% error rate in an allelic test. Under a dominant model, a smaller sample size is required to achieve 80% power than other genetic models. We found that a much lower sample size was required with a strong effect size, common SNP, and increased LD. In addition, studying a common disease in a case-control study of a 1:4 case-control ratio is one way to achieve higher statistical power. We also found that case-parent studies require more samples than case-control studies. Although we have not covered all plausible cases in study design, the estimates of sample size and statistical power computed under various assumptions in this study may be useful to determine the sample size in designing a population-based genetic association study.

Genetic Association Analysis of Fasting and 1- and 2-Hour Glucose Tolerance Test Data Using a Generalized Index of Dissimilarity Measure for the Korean Population

  • Yee, Jaeyong;Kim, Yongkang;Park, Taesung;Park, Mira
    • Genomics & Informatics
    • /
    • v.14 no.4
    • /
    • pp.181-186
    • /
    • 2016
  • Glucose tolerance tests have been devised to determine the speed of blood glucose clearance. Diabetes is often tested with the standard oral glucose tolerance test (OGTT), along with fasting glucose level. However, no single test may be sufficient for the diagnosis, and the World Health Organization (WHO)/International Diabetes Federation (IDF) has suggested composite criteria. Accordingly, a single multi-class trait was constructed with three of the fasting phenotypes and 1- and 2-hour OGTT phenotypes from the Korean Association Resource (KARE) project, and the genetic association was investigated. All of the 18 possible combinations made out of the 3 sets of classification for the individual phenotypes were taken into our analysis. These were possible due to a method that was recently developed by us for estimating genomic associations using a generalized index of dissimilarity. Eight single-nucleotide polymorphisms (SNPs) that were found to have the strongest main effect are reported with the corresponding genes. Four of them conform to previous reports, located in the CDKAL1 gene, while the other 4 SNPs are new findings. Two-order interacting SNP pairs of are also presented. One pair (rs2328549 and rs6486740) has a prominent association, where the two single-nucleotide polymorphism locations are CDKAL1 and GLT1D1. The latter has not been found to have a strong main effect. New findings may result from the proper construction and analysis of a composite trait.

Comparison of linkage disequilibrium levels in Iranian indigenous cattle using whole genome SNPs data

  • Karimi, Karim;Koshkoiyeh, Ali Esmailizadeh;Gondro, Cedric
    • Journal of Animal Science and Technology
    • /
    • v.57 no.12
    • /
    • pp.47.1-47.10
    • /
    • 2015
  • Background: Knowledge of linkage disequilibrium (LD) levels among different populations can be used to detect genetic diversity and to investigate the historical changes in population sizes. Availability of large numbers of SNP through new sequencing technologies has provided opportunities for extensive researches in quantifying LD patterns in cattle breeds. The aim of this study was to compare the extent of linkage disequilibrium among Iranian cattle breeds using high density SNP genotyping data. Results: A total of 70 samples, representing seven Iranian indigenous cattle breeds, were genotyped for 777962 SNPs. The average values of LD based on the $r^2$ criterion were computed by grouping all syntenic SNP pairwises for intermarker distances from 0 Kb up to 1 Mb using three distance sets. Average $r^2$ above 0.3 was observed at distances less than 30 Kb for Sistani and Kermani, 20 Kb for Najdi, Taleshi, Kurdi and Sarabi, and 10 Kb for Mazandarani. The LD levels were considerably different among the Iranian cattle breeds and the difference in LD extent was more detectable between the studied breeds at longer distances. Lower level of LD was observed for Mazandarani breed as compared to other breeds indicating larger ancestral population size in this breed. Kermani breed continued to have more slowly LD decay than all of the other breeds after 3 Kb distances. More slowly LD decay was observed in Kurdi and Sarabi breeds at larger distances (>100 Kb) showing that population decline has been more intense in more recent generations for these populations. Conclusions: A wide genetic diversity and different historical background were well reflected in the LD levels among Iranian cattle breeds. More LD fluctuation was observed in the shorter distances (less than 10 Kb) in different cattle populations. Despite of the sample size effects, High LD levels found in this study were in accordance with the presence of inbreeding and population decline in Iranian cattle breeds.

Expression and Characterization of a New Esterase Cloned Directly from Agrobacterium tumefaciens Genome

  • PARK HYO-JUNG;KIM YOUNG-JUN;KIM HYUNG-KWOUN
    • Journal of Microbiology and Biotechnology
    • /
    • v.16 no.1
    • /
    • pp.145-148
    • /
    • 2006
  • A new functional lipolytic enzyme (AT4) has recently been found from Agrobacterium tumefaciens C58 Cereon using a genome-wide approach. The enzyme has some sequence similarity to E. coli acetyl hydrolase, Emericella nidulans lipase, Moraxella sp. lipase, Acinetobacter lwoffii esterase, and Streptomyces hygroscopicus acetyl hydrolase. However, the sequence similarities are very low (less than $25\%$), suggesting that it is a new lipase/esterase enzyme. ill the present study, intact cell of the A. tumefaciens strain was shown to have lipolytic activity on a tributyrin-LB plate. The AT4 gene was then expressed at a high level in E. coli BL21 (DE3) cells and the enzyme was purified simply by Ni-NTA column chromatography. The purified enzyme showed hydrolytic activity toward p-nitrophenyl caproate, but not toward olive oil, suggesting that the AT4 enzyme was a typical esterase rather than lipase. AT4 esterase had a maximum hydrolytic activity at $45^{\circ}C$ and pH 8.0, when p-nitrophenyl caproate was used as a substrate. It was relatively stable up to $40^{\circ}C$ and at pH 5.0-9.0. Calcium ion and EDT A did not affect the activity and thermal stability of the enzyme. As for substrate specificity, AT4 enzyme could rapidly hydrolyze acetyl and butyl groups from p-nitrophenyl esters and 1-naphthyl esters. In addition, it also released acetyl residues from acetylated glucose and xylose substrates. Therefore, this new esterase enzyme might be used as a biocatalyst in acetylation and deacetylation reactions performed in the fine chemical industry.

Genome-Wide Response of Deinococcus radiodurans on Cadmium Toxicity

  • Joe, Min-Ho;Jung, Sun-Wook;Im, Seong-Hun;Lim, Sang-Yong;Song, Hyun-Pa;Kwon, Oh-Suk;Kim, Dong-Ho
    • Journal of Microbiology and Biotechnology
    • /
    • v.21 no.4
    • /
    • pp.438-447
    • /
    • 2011
  • Deinococcus radiodurans is extremely resistant to various genotoxic conditions and chemicals. In this study, we characterized the effect of a sublethal concentration (100 ${\mu}M$) of cadmium (Cd) on D. radiodurans using a whole-genome DNA microarray. Time-course global gene expression profiling showed that 1,505 genes out of 3,116 total ORFs were differentially expressed more than 2-fold in response to Cd treatment for at least one timepoint. The majority of the upregulated genes are related to iron uptake, cysteine biosynthesis, protein disulfide stress, and various types of DNA repair systems. The enhanced upregulation of genes involved in cysteine biosynthesis and disulfide stress indicate that Cd has a high affinity for sulfur compounds. Provocation of iron deficiency and growth resumption of Cd-treated cells by iron supplementation also indicates that CdS forms in iron-sulfur-containing proteins such as the [Fe-S] cluster. Induction of base excision, mismatch, and recombinational repair systems indicates that various types of DNA damage, especially base excision, were enhanced by Cd. Exposure to sublethal Cd stress reduces the growth rate, and many of the downregulated genes are related to cell growth, including biosynthesis of cell membrane, translation, and transcription. The differential expression of 52 regulatory genes suggests a dynamic operation of complex regulatory networks by Cd-induced stress. These results demonstrate the effect of Cd exposure on D. radiodurans and how the related genes are expressed by this stress.

Global DNA Methylation of Porcine Embryos during Preimplantation Development

  • Yeo, S.E.;Kang, Y.K.;Koo, D.B.;Han, J.S.;Yu, K.;Kim, C.H.;Park, H.;Chang, W.K.;Lee, K.K.;Han, Y.M.
    • Korean Journal of Animal Reproduction
    • /
    • v.27 no.4
    • /
    • pp.309-315
    • /
    • 2003
  • DNA methylation at CpG sites, which is a epigenetic modification, is associated with gene expression without change of DNA sequences. During early mouse embryogenesis, dynamic changes of DNA methylation occur. In this study, DNA methylation patterns of porcine embryos produced in vivo and in vitro were examined at various developmental stages by the immunocytochemical staining method. Interestingly, active demethylation was not observed on the paternal pronucleus of porcine zygotes. However, differences were detected in the passive demethylation process between in vivo and in vitro embryos. There was no change in the DNA methylation state until the blastocyst stage of in vivo embryos, whereas partial demethylation was observed in several blastomeres from a 4 cell stage to a morula stage of in vitro embryos. The whole genome of inner cell mass (ICM) and trophectoderm (TE) cells in porcine blastocysts were evenly methylated without de novo methylation. Our findings demonstrate that genome-wide demethylation does not occur in pig embryos during preimplantation development unlike murine and bovine embryos. It indicates that the machinery regulating epigenetic reprogramming may be different between species.

Loss of Heterozygosity at the Calcium Regulation Gene Locus on Chromosome 10q in Human Pancreatic Cancer

  • Long, Jin;Zhang, Zhong-Bo;Liu, Zhe;Xu, Yuan-Hong;Ge, Chun-Lin
    • Asian Pacific Journal of Cancer Prevention
    • /
    • v.16 no.6
    • /
    • pp.2489-2493
    • /
    • 2015
  • Background: Loss of heterozygosity (LOH) on chromosomal regions is crucial in tumor progression and this study aimed to identify genome-wide LOH in pancreatic cancer. Materials and Methods: Single-nucleotide polymorphism (SNP) profiling data GSE32682 of human pancreatic samples snap-frozen during surgery were downloaded from Gene Expression Omnibus database. Genotype console software was used to perform data processing. Candidate genes with LOH were screened based on the genotype calls, SNP loci of LOH and dbSNP database. Gene annotation was performed to identify the functions of candidate genes using NCBI (the National Center for Biotechnology Information) database, followed by Gene Ontology, INTERPRO, PFAM and SMART annotation and UCSC Genome Browser track to the unannotated genes using DAVID (the Database for Annotation, Visualization and Integration Discovery). Results: The candidate genes with LOH identified in this study were MCU, MICU1 and OIT3 on chromosome 10. MCU was found to encode a calcium transporter and MICU1 could encode an essential regulator of mitochondrial $Ca^{2+}$ uptake. OIT3 possibly correlated with calcium binding revealed by the annotation analyses and was regulated by a large number of transcription factors including STAT, SOX9, CREB, NF-kB, PPARG and p53. Conclusions: Global genomic analysis of SNPs identified MICU1, MCU and OIT3 with LOH on chromosome 10, implying involvement of these genes in progression of pancreatic cancer.

Gpx3-dependent Responses Against Oxidative Stress in Saccharomyces cerevisiae

  • Kho, Chang-Won;Lee, Phil-Young;Bae, Kwang-Hee;Kang, Sung-Hyun;Cho, Sa-Yeon;Lee, Do-Hee;Sun, Choong-Hyun;Yi, Gwan-Su;Park, Byoung-Chul;Park, Sung-Goo
    • Journal of Microbiology and Biotechnology
    • /
    • v.18 no.2
    • /
    • pp.270-282
    • /
    • 2008
  • The yeast Saccharomyces cerevisiae has defense mechanisms identical to higher eukaryotes. It offers the potential for genome-wide experimental approaches owing to its smaller genome size and the availability of the complete sequence. It therefore represents an ideal eukaryotic model for studying cellular redox control and oxidative stress responses. S. cerevisiae Yap1 is a well-known transcription factor that is required for $H_2O_2$-dependent stress responses. Yap1 is involved in various signaling pathways in an oxidative stress response. The Gpx3 (Orp1/PHGpx3) protein is one of the factors related to these signaling pathways. It plays the role of a transducer that transfers the hydroperoxide signal to Yap1. In this study, using extensive proteomic and bioinformatics analyses, the function of the Gpx3 protein in an adaptive response against oxidative stress was investigated in wild-type, gpx3-deletion mutant, and gpx3-deletion mutant overexpressing Gpx3 protein strains. We identified 30 proteins that are related to the Gpx3-dependent oxidative stress responses and 17 proteins that are changed in a Gpx3-dependent manner regardless of oxidative stress. As expected, $H_2O_2$-responsive Gpx3-dependent proteins include a number of antioxidants related with cell rescue and defense. In addition, they contain a variety of proteins related to energy and carbohydrate metabolism, transcription, and protein fate. Based upon the experimental results, it is suggested that Gpx3-dependent stress adaptive response includes the regulation of genes related to the capacity to detoxify oxidants and repair oxidative stress-induced damages affected by Yap1 as well as metabolism and protein fate independent from Yap1.

Transcriptome Analysis in Brassica rapa under the Abiotic Stresses Using Brassica 24K Oligo Microarray

  • Lee, Sang-Choon;Lim, Myung-Ho;Kim, Jin A;Lee, Soo-In;Kim, Jung Sun;Jin, Mina;Kwon, Soo-Jin;Mun, Jeong-Hwan;Kim, Yeon-Ki;Kim, Hyun Uk;Hur, Yoonkang;Park, Beom-Seok
    • Molecules and Cells
    • /
    • v.26 no.6
    • /
    • pp.595-605
    • /
    • 2008
  • Genome wide transcription analysis in response to stresses is essential to provide the basis of effective engineering strategies to improve stress tolerance in crop plants. In order to perform transcriptome analysis in Brassica rapa, we constructed a B. rapa oligo microarray, KBGP-24K, using sequence information from approximately 24,000 unigenes and analyzed cold ($4^{\circ}C$), salt (250 mM NaCl), and drought (air-dry) treated B. rapa plants. Among the B. rapa unigenes represented on the microarray, 417 (1.7%), 202 (0.8%), and 738 (3.1%) were identified as responsive genes that were differently expressed 5-fold or more at least once during a 48-h treatment with cold, salt, and drought, respectively. These results were confirmed by RT-PCR analysis. In the abiotic stress responsive genes identified, we found 56 transcription factor genes and 60 commonly responsive genes. It suggests that various transcriptional regulatory mechanisms and common signaling pathway are working together under the abiotic stresses in B. rapa. In conclusion, our new developed 24K oligo microarray will be a useful tool for transcriptome profiling and this work will provide valuable insight in the response to abiotic stress in B. rapa.

Construction of core collection based on single nucleotide polymorphism analysis in soybean germplasm

  • Jeong, Namhee;Park, Soo-Kwon;Lee, Choonseok;Ok, Hyun-Choong;Kim, Dool-Yi;Kim, Jae-Hyun;Park, Ki-Do;Moon, Jung-Kyung;Kim, Namshin;Choi, Man Soo
    • Proceedings of the Korean Society of Crop Science Conference
    • /
    • 2017.06a
    • /
    • pp.106-106
    • /
    • 2017
  • The soybean [Glycine max (L.) Merr.] is one of the most important crop resources worldwide as food and forage. It is also important and valuable that to hold crop resources to have high genetic diversities. Recently, a core collection has been constructed in many plants to preserve the genetic resources of various plants. A core collection is small population to represent the genetic diversity of the total collection, and is of strategic importance as they allow the use of a small part of a germplasm collection that is representative of the total collection. Here, we developed the core collection consisting of 816 accessions by using approximately 180,000 (180K) single nucleotide polymorphisms (SNPs) developed in previous study. In addition, we performed genetic diversity and population structure analysis to construct the core collection from entire 4,392 collections. there were excluded sample call rates less than 93% and duplicated samples more than 99.9% according to genotype analysis using 180K SNPs from entire collections. Furthermore, we were also excluded natural hybrid resources which Glycine max and Glycine soja are mixed in half through population structure analysis. As a result, we are constructed the core collection of genetic diversity that reflects 99% of the entire collections, including 430 cultivated soybeans (Glycine max) and 386 wild soybeans (Glycine soja). The core collection developed in this study should be to provide useful materials for both soybean breeding programs and genome-wide association studies.

  • PDF