• Title/Summary/Keyword: Single Nucleotide Polymorphism [SNP]

Search Result 570, Processing Time 0.032 seconds

Accuracy of genotype imputation based on reference population size and marker density in Hanwoo cattle

  • Lee, DooHo;Kim, Yeongkuk;Chung, Yoonji;Lee, Dongjae;Seo, Dongwon;Choi, Tae Jeong;Lim, Dajeong;Yoon, Duhak;Lee, Seung Hwan
    • Journal of Animal Science and Technology
    • /
    • v.63 no.6
    • /
    • pp.1232-1246
    • /
    • 2021
  • Recently, the cattle genome sequence has been completed, followed by developing a commercial single nucleotide polymorphism (SNP) chip panel in the animal genome industry. In order to increase statistical power for detecting quantitative trait locus (QTL), a number of animals should be genotyped. However, a high-density chip for many animals would be increasing the genotyping cost. Therefore, statistical inference of genotype imputation (low-density chip to high-density) will be useful in the animal industry. The purpose of this study is to investigate the effect of the reference population size and marker density on the imputation accuracy and to suggest the appropriate number of reference population sets for the imputation in Hanwoo cattle. A total of 3,821 Hanwoo cattle were divided into reference and validation populations. The reference sets consisted of 50k (38,916) marker data and different population sizes (500, 1,000, 1,500, 2,000, and 3,600). The validation sets consisted of four validation sets (Total 889) and the different marker density (5k [5,000], 10k [10,000], and 15k [15,000]). The accuracy of imputation was calculated by direct comparison of the true genotype and the imputed genotype. In conclusion, when the lowest marker density (5k) was used in the validation set, according to the reference population size, the imputation accuracy was 0.793 to 0.929. On the other hand, when the highest marker density (15k), according to the reference population size, the imputation accuracy was 0.904 to 0.967. Moreover, the reference population size should be more than 1,000 to obtain at least 88% imputation accuracy in Hanwoo cattle.

Characterization of Single Nucleotide Polymorphisms in 55 Disease-Associated Genes in a Korean Population

  • Lee, Seung-Ku;Kim, Hyoun-Geun;Kang, Jason-J.;Oh, Won-Il;Oh, Berm-Seok;Kwack, Kyu-Bum
    • Genomics & Informatics
    • /
    • v.5 no.4
    • /
    • pp.152-160
    • /
    • 2007
  • Most common diseases are caused by multiple genetic and environmental factors. Among the genetic factors, single nucleotide polymorphisms (SNPs) are common DNA sequence variations in individuals and can serve as important genetic markers. Recently, investigations of gene-based and whole genome-based SNPs have been applied to association studies for marker discovery. However, SNPs are so population-specific that the association needs to be verified. Fifty-five genes and 384 SNPs were selected based on association with disease. Genotypes of 337 SNPs in candidate genes were determined using Illumina Sentrix Array Matrix (SAM) chips by an allele-specific extension method in 364 unrelated Korean individuals. Allelic frequencies of SNPs were compared with those of other populations obtained from the International HapMap database. Minor allele frequencies, linkage disequilibrium blocks, tagSNPs, and haplotypes of functional candidate SNPs in 55 genetic disease-associated genes were provided. Our data may provide useful information for the selection of genetic markers for gene-based genetic disease-association studies of the Korean population.

Identification of Domesticated Silkworm Varieties Using a Whole Genome Single Nucleotide Polymorphisms-based Decision Tree (전장유전체 SNP 기반 decision tree를 이용한 누에 품종 판별)

  • Park, Jong Woo;Park, Jeong Sun;Jeong, Chan Young;Kwon, Hyeok Gyu;Kang, Sang Kuk;Kim, Seong-Wan;Kim, Nam-Suk;Kim, Kee Young;Kim, Iksoo
    • Journal of Life Science
    • /
    • v.32 no.12
    • /
    • pp.947-955
    • /
    • 2022
  • Silkworms, which have recently shown promise as functional health foods, show functional differences between varieties; therefore, the need for variety identification is emerging. In this study, we analyzed the whole silkworm genome to identify 10 unique silkworm varieties (Baekhwang, Baekok, Daebaek, Daebak, Daehwang, Goldensilk, Hansaeng, Joohwang, Kumkang, and Kumok) using single nucleotide polymorphisms (SNP) present in the genome as biomarkers. In addition, nine SNPs were selected to discriminate between varieties by selecting SNPs specific to each variety. We subsequently created a decision tree capable of cross-verifying each variety and classifying the varieties through sequential analysis. Restriction fragment length polymorphism (RFLP) was used for SNP867 and SNP9183 to differentiate between the varieties of Daehwang and Goldensilk and between Kumkang and Daebak, respectively. A tetra-primer amplification refractory (T-ARMS) mutation was used to analyze the remaining SNPs. As a result, we could isolate the same group or select an individual variety using the nine unique SNPs from SNP780 to SNP9183. Furthermore, nucleotide sequence analysis for the region confirmed that the alleles were identical. In conclusion, our results show that combining SNP analysis of the whole silkworm genome with the decision tree is of high value as a discriminative marker for classifying silkworm varieties.

Association between Single Nucleotide Polymorphisms of Fatty Acid Synthase and Fat Deposition in the Liver of the Overfed Goose

  • Wu, Wei;Guo, Xuan;Zhang, Lei;Hu, Dan
    • Asian-Australasian Journal of Animal Sciences
    • /
    • v.27 no.9
    • /
    • pp.1244-1249
    • /
    • 2014
  • Goose fatty liver is one of the most delicious and popular foods in the world, but there is no reliable genetic marker for the early selection and breeding of geese with good liver-producing potential. In our study, one hundred and twenty-four 78-day-old Landes geese bred in Shunda Landes goose breeding farm, Jiutai, Jilin, China were selected randomly. The fatty livers were sampled each week after overfeeding during a three week period. Polymerase chain reaction-single strand conformation polymorphism and DNA sequencing were used to identify single nucleotide polymorphisms (SNPs) of fatty acid synthase (FAS), which is an important enzyme involved in the synthesis of fat under both physiological and pathological conditions. Least-squares correlation was established between these SNPs and fatty liver weight, abdominal fat weight, and intestinal fat weight of the overfed Landes geese, respectively. The results showed that fatty liver weight of geese with EF and FF genotypes (amplified by primer P1) was significantly higher than that of the EE genotype (p<0.05), and liver weight of CD and DD genotypes (amplified by primer P2) was significantly higher than that of the CC genotype (p<0.05). Different genotype combinations showed different liver weights, and from highest to lowest were ABDD, DDEF, DDFF, DDEE, ABEF, ABFF, AADD, and CDEF. Further analysis of DNA sequencing showed that there were two SNPs within the 5' promoter region the FAS gene. The geese of EF and FF genotypes carried a change of T to C, and the geese of CD and DD genotypes carried a change of A to G. The changes of the bases could potentially influence the binding of some transcription factors to this region as to regulate FAS gene. To our knowledge, this is the first report of SNPs found within the 5' promoter region of the Landes goose FAS gene, and our data will provide an insight for early selection of geese for liver production.

Gene Duplications Revealed during the Process of SNP Discovery in Soybean[Glycine max(L.) Merr.]

  • Cai, Chun Mei;Van, Kyu-Jung;Lee, Suk-Ha
    • Journal of Crop Science and Biotechnology
    • /
    • v.10 no.4
    • /
    • pp.237-242
    • /
    • 2007
  • Genome duplication(i.e. polyploidy) is a common phenomenon in the evolution of plants. The objective of this study was to achieve a comprehensive understanding of genome duplication for SNP discovery by Thymine/Adenine(TA) cloning for confirmation. Primer pairs were designed from 793 EST contigs expressed in the roots of a supernodulating soybean mutant and screened between 'Pureunkong' and 'Jinpumkong 2' by direct sequencing. Almost 27% of the primer sets were failed to obtain sequence data due to multiple bands on agarose gel or poor quality sequence data from a single band. TA cloning was able to identify duplicate genes and the paralogous sequences were coincident with the nonspecific peaks in direct sequencing. Our study confirmed that heterogeneous products by the co-amplification of a gene family member were the main cause of obtaining multiple bands or poor quality sequence data in direct sequencing. Counts of amplified bands on agarose gel and peaks of sequencing trace suggested that almost 27% of nonrepetitive soybean sequences were present in as many as four copies with an average of 2.33 duplications per segment. Copy numbers would be underestimated because of the presence of long intron between primer binding sites or mutation on priming site. Also, the copy numbers were not accurately estimated due to deletion or tandem duplication in the entire soybean genome.

  • PDF

A genome-wide association study (GWAS) for pH value in the meat of Berkshire pigs

  • Park, Jun;Lee, Sang-Min;Park, Ja-Yeon;Na, Chong-Sam
    • Journal of Animal Science and Technology
    • /
    • v.63 no.1
    • /
    • pp.25-35
    • /
    • 2021
  • The purpose of this study is to estimate the single nucleotide polymorphism (SNP) effect for pH values affecting Berkshire meat quality. A total of 39,603 SNPs from 1,978 heads after quality control and 882 pH values were used estimate SNP effect by single step genomic best linear unbiased prediction (ssGBLUP) method. The average physical distance between adjacent SNP pairs was 61.7kbp and the number and proportion of SNPs whose minor allele frequency was below 10% were 9,573 and 24.2%, respectively. The average of observed heterozygosity and polymorphic information content was 0.32 ± 0.16 and 0.26 ± 0.11, respectively and the estimate for average linkage disequilibrium was 0.40. The heritability of pH45m and pH24h were 0.10 and 0.15 respectively. SNPs with an absolute value more than 4 standard deviations from the mean were selected as threshold markers, among the selected SNPs, protein-coding genes of pH45m and pH24h were detected in 6 and 4 SNPs, respectively. The distribution of coding genes were detected at pH45m and were detected at pH24h.

Single Nucleotide Polymorphism in the Coding Region of Bovine Chemerin Gene and Their Associations with Carcass Traits in Japanese Black Cattle

Confirming Single Nucleotide Polymorphisms from Expressed Sequence Tag Datasets Derived from Three Cattle cDNA Libraries

  • Lee, Seung-Hwan;Park, Eung-Woo;Cho, Yong-Min;Lee, Ji-Woong;Kim, Hyoung-Yong;Lee, Jun-Heon;Oh, Sung-Jong;Cheong, Il-Cheong;Yoon, Du-Hak
    • BMB Reports
    • /
    • v.39 no.2
    • /
    • pp.183-188
    • /
    • 2006
  • Using the Phred/Phrap/Polyphred/Consed pipeline established in the National Livestock Research Institute of Korea, we predicted candidate coding single nucleotide polymorphisms (cSNPs) from 7,600 expressed sequence tags (ESTs) derived from three cDNA libraries (liver, M. longissimus dorsi, and intermuscular fat) of Hanwoo (Korean native cattle) steers. From the 7,600 ESTs, 829 contigs comprising more than two EST reads were assembled using the Phrap assembler. Based on the contig analysis, 201 candidate cSNPs were identified in 129 contigs, in which transitions (69%) outnumbered transversions (31%). To verify whether the predicted cSNPs are real, 17 SNPs involved in lipid and energy metabolism were selected from the ESTs. Twelve of these were confirmed to be real while five were identified as artifacts, possibly due to expressed sequence tag sequence error. Further analysis of the 12 verified cSNPs was performed using the program BLASTX. Five were identified as nonsynonymous cSNPs, five were synonymous cSNPs, and two SNPs were located in 3'-UTRs. Our data indicated that a relatively high SNP prediction rate (71%) from a large EST database could produce abundant cSNPs rapidly, which can be used as valuable genetic markers in cattle.

Evolutionary Analyses of Hanwoo (Korean Cattle)-Specific Single-Nucleotide Polymorphisms and Genes Using Whole-Genome Resequencing Data of a Hanwoo Population

  • Lee, Daehwan;Cho, Minah;Hong, Woon-young;Lim, Dajeong;Kim, Hyung-Chul;Cho, Yong-Min;Jeong, Jin-Young;Choi, Bong-Hwan;Ko, Younhee;Kim, Jaebum
    • Molecules and Cells
    • /
    • v.39 no.9
    • /
    • pp.692-698
    • /
    • 2016
  • Advances in next generation sequencing (NGS) technologies have enabled population-level studies for many animals to unravel the relationships between genotypic differences and traits of specific populations. The objective of this study was to perform evolutionary analysis of single nucleotide polymorphisms (SNP) in genes of Korean native cattle Hanwoo in comparison to SNP data from four other cattle breeds (Jersey, Simmental, Angus, and Holstein) and four related species (pig, horse, human, and mouse) obtained from public databases through NGS-based resequencing. We analyzed population structures and differentiation levels for the five cattle breeds and estimated species-specific SNPs with their origins and phylogenetic relationships among species. In addition, we identified Hanwoo-specific genes and proteins, and determined distinct changes in protein-protein interactions among five species (cattle, pig, horse, human, mouse) in the STRING network database by additionally considering indirect protein interactions. We found that the Hanwoo population was clearly different from the other four cattle populations. There were Hanwoo-specific genes related to its meat trait. Protein interaction rewiring analysis also confirmed that there were Hanwoo-specific protein-protein interactions that might have contributed to its unique meat quality.

Development of Cleaved Amplified Polymorphic Sequence Markers for the Identification of Lentinula edodes Cultivars Sanmaru 1ho and Chunjang 3ho (표고버섯 품종 산마루1호, 천장3호를 구분할 수 있는 CAPS Marker 개발)

  • Moon, Suyun;Lee, Hwa-Yong;Kim, Myungkil;Ka, Kang-Hyeon;Ko, Han Kyu;Chung, Jong-Wook;Koo, Chang-Duck;Ryu, Hojin
    • The Korean Journal of Mycology
    • /
    • v.45 no.2
    • /
    • pp.114-120
    • /
    • 2017
  • Lentinula edodes is an edible mushroom that is mainly cultivated in Asian countries. Recently, new cultivars of this mushroom have been developed in Korea; variety protection is very important, so the development of efficient molecular markers that can distinguish each variety is required. In this study, we developed cleaved amplified polymorphic sequence (CAPS) markers for the identification of L. edodes cultivars (Sanmaru 1ho and Chunjang 3ho). These markers were developed from whole genomic sequencing data from L. edodes monokaryon strain B17 and resequencing data from 10 dikaryon strains. A single nucleotide polymorphism changed in scaffold 9 POS 1630048 in Sanmaru 1ho($G{\rightarrow}T$), and in scaffold 13 POS 920681 in Chunjang 3ho ($G{\rightarrow}A$). The restriction enzymes TspR I and Xho I distinguished Sanmaru 1ho and Chunjang 3ho, respectively, from other strains. Thus, we developed 2 CAPS markers for the identification of the L. edodes cultivars Sanmaru 1ho and Chunjang 3ho.