• Title/Summary/Keyword: 일배체형

Search Result 31, Processing Time 0.024 seconds

A New Algorithm of Reducing Candidate Haplotypes for Haplotype Inference (일배체형 추론을 위한 후보군 간소화 알고리즘)

  • Choi, Mun-Ho;Kang, Seung-Ho;Lim, Hyeong-Seok
    • Journal of the Korea Institute of Information and Communication Engineering
    • /
    • v.17 no.7
    • /
    • pp.1732-1739
    • /
    • 2013
  • The identification of haplotypes, which encode SNPs in a single chromosome, makes it possible to perform a haplotype-based association test with diseases. Given a set of genotypes from a population, the process of recovering the haplotypes that explain the genotypes is called haplotype inference. We propose a new preprocessing algorithm for the haplotype inference by pure parsimony (HIPP). The proposed algorithm excludes a large amount of redundant candidate haplotypes by detecting some groups of haplotypes that are dispensable for optimal solutions. For the well-known synthetic and biological data, the experimental results of our method show that our method run much faster than other preprocessing methods. After applying our preprocessing results, the numbers of haplotypes of HIPP solvers are equal to or slightly larger than that of optimal solutions.

Estimation of Haplotype Proportions in Single Necleotide Polymorphism Group Using EM Algorithm (EM 알고리듬을 이용한 단일염기변이 (SNP;SINGLE NUCLEOTIDE POLYMORPHISM)군의 일배체형 (HAPLOTYPE) 비율 추정)

  • 김선우;김종원;이경아
    • The Korean Journal of Applied Statistics
    • /
    • v.16 no.2
    • /
    • pp.195-202
    • /
    • 2003
  • Haplotype analysis in SNP is very useful for the study of complex genetic disease due to low cost and high efficiency comparing to individual analysis of each SNP, and is functionally important in biological view. But, the gametic phase of haplotypes is usually unknown in SNP group, and it is difficult to predict haplotype proportions. In this study, haplotype proportions were estimated using EM algorithm from diploid data of SNP group in solid tumor group and normal group. From these results, linkage disequilibrium among SNPs was analyzed.

Haplotype-Based Association and Linkage Analysis of Angiotensin-I Converting Enzyme(ACE) Gene with a Hypertension (일배체형에 기초한 고혈압과 ACE 유전자의 연관성 분석)

  • Kim Jinheum;Nam Chung Mo;Kang Dae Ryong;Suh Il
    • The Korean Journal of Applied Statistics
    • /
    • v.18 no.2
    • /
    • pp.297-310
    • /
    • 2005
  • In this study we investigate the association between the haplotype block of 4 SNPs in ACE genes and hypertension with a case-control dataset of size of 277 and 40 families data collected from Kangwha studies. To this end we perform a haplotype-based case-control association study and a haplotype-based TDT study. We do the same analysis with tag-SNPs that can identify the haplotype block. Through a cladogram analysis we make the evolution-tree of haplotypes and then classify the haplotypes into a few clades by collecting haplotypes exposed to the disease to the same extent. We also discuss the association between these clades and hypertension.

Solving the Haplotype Assembly Problem for Human Using the Improved Branch and Bound Algorithm (개선된 분기한정 알고리즘을 이용한 인간 유전체의 일배체형 조합문제 해결)

  • Choi, Mun-Ho;Kang, Seung-Ho;Lim, Hyeong-Seok
    • KIPS Transactions on Software and Data Engineering
    • /
    • v.2 no.10
    • /
    • pp.697-704
    • /
    • 2013
  • The identification of haplotypes, which encode SNPs in a single chromosome, makes it possible to perform haplotype-based association tests with diseases. Minimum Error Correction model, one of models to computationally assemble a pair of haplotypes for a given organism from Single Nucleotide Polymorphism fragments, has been known to be NP-hard even for gapless cases. In the previous work, an improved branch and bound algorithm was suggested and showed that it is more efficient than naive branch and bound algorithm by performing experiments for Apis mellifera (honeybee) data set. In this paper, to show the extensibility of the algorithm to other organisms we apply the improved branch and bound algorithm to the human data set and confirm the efficiency of the algorithm.

Difference in the Transcriptional Activity of the Interleukin-4 Promoter Haplotypes (Interleukin-4 유전자의 Promoter 일배체형에 따른 전사능의 차이)

  • Choi, Eun Hwa;Kim, Hee Sup;Chanock, Stephen J.;Lee, Hoan Jong
    • Clinical and Experimental Pediatrics
    • /
    • v.48 no.5
    • /
    • pp.495-499
    • /
    • 2005
  • Purpose : Interleukin-4(IL-4) is a critical component of the Th2 cytokine pathway and contributes to severity of respiratory syncytial virus(RSV) bronchiolitis. Previous studies observed an association between severe RSV bronchiolitis in Korean children with a common haplotype of the IL4 promoter. This study was performed to investigate functional differences of the variant IL4 promoter haplotypes. Methods : Genomic DNA was obtained from 20 children from 6 to 48 months of age in the Department of Pediatrics, Seoul National University Bundang Hospital. The IL4 promoter spanning an 1.2 kb region was amplified and haplotype was determined by cloning and the PHASE reconstruction. Transcriptional activity of Jurkat T cells which were transfected with each IL4 haplotype were analyzed by use of luciferase assay. Results : Three haplotypes of the IL4 promoter have been identified with the frequency of GCC(7 percent), TCC(17 percent), and TTT(76 percent). The TTT haplotype demonstrated the highest luciferase values in both unstimulated and PMA-stimulated Jurkat T cells. Increases in transcriptional activity compared to GCC have been shown in TTT(5.3 fold higher) followed by TCC(4.2 fold higher) in unstimulated Jurkat T cells. Conclusion : We provided evidence that increased transcriptional activity of the TTT haplotype of the IL4 promoter, which has previously been over-represented in Korean children with severe RSV bronchiolitis. Therefore, IL-4 could play a potential role in the pathogenesis of RSV infection, possibly via an altered transcriptional activity of the different IL4 haplotypes.

Study on Effects of Population Stratification on Haplotype Trend Test in Case-Control Studies (환자-대조군 연구에서 인구집단 층화가 일배체형 경향성 검정에 미치는 영향)

  • Kim, Jin-Heum;Kang, Dae-Ryong;Lim, Hyun-Sun;Nam, Chung-Mo
    • The Korean Journal of Applied Statistics
    • /
    • v.22 no.5
    • /
    • pp.1085-1096
    • /
    • 2009
  • Population stratification can cause spurious associations between genetic markers and disease locus. In order to handle this population stratification in haplotype-based case-control association studies, we added population indicators as covariates to the haplotype trend regression model proposed by Zaykin et al. (2002). We investigated through simulations how both population stratification and measurement error in the estimation of true population of each individual affect type I error probabilities of the association tests based on both Zaykin et al.'s (2002) model and the proposed model. Based on those results, in the situation that there exists population stratification but there is no error in population classification of each individual, our proposed model does satisfy a type I error probability whereas Zaykin et al.'s (2002) model does not. However, as the measurement error increases, a type I error probability of our model correspondingly becomes larger than a nominal significance level. It implies that as long as uncertainty in the estimation of true population of each individual still remains, it is nearly impossible to avoid false positive in case-control association studies based on haplotypes.

The Relationship between MDR1 Polymorphisms and the Response to Etoposide/Cisplatin Combination Chemotherapy in Small Cell Lung Cancer (소세포폐암에서 Multidrug Resistance-1 유전자의 다형성과 Etoposide-cisplatin 항암화학요법 반응의 관계)

  • Sohn, Ji Woong;Lee, Shin Yup;Lee, Su Jung;Jeon, Hyo-Sung;Lee, Jae Hee;Park, Jae Hyung;Kim, Eun Jin;Kang, Young Mo;Lee, Jae-Tae;Cha, Seung Ick;Kim, Chang Ho;Jung, Tae Hoon;Park, Jae Yong
    • Tuberculosis and Respiratory Diseases
    • /
    • v.58 no.2
    • /
    • pp.135-141
    • /
    • 2005
  • 배경 및 목적 : Multidrug Resistance-1 (MDR1) 유전자는 다약제내성에 관여하는 P-glycoprotein을 암호화한다. MDR1 유전자의 다형성은 P-glycoprotein의 발현과 기능의 차이를 일으켜 항암화학요법 반응에 영향을 미칠 수 있을 것이다. 저자들은 소세포폐암 환자에서 MDR1 유전자의 다형성과 일배체형에 따른 항암화학요법에 대한 반응을 조사하였다. 대상 및 방법 : 경북대학병원에서 병리적으로 소세포폐암으로 진단받고 etoposide-cisplatin 항암화학요법을 받은 54명을 대상으로 하였다. 전혈 5cc에서 DNA를 추출하고 PCR-RFLP법을 통해 MDR1 유전자 엑손 21의 2677G>T 다형성과, 엑손 26의 3435C>T 다형성을 조사하고 다형성과 일배체형에 따른 항암화학요법의 반응을 조사하였다. 결 과 : 2677G>T 유전자형에 따른 항암화학요법의 반응은 유의한 차이가 없었다. 3435 CC 유전자형은 3435 CT+TT 형에 비해 치료 반응율이 유의하게 높았다 (P = 0.025). 유전자형 분석 결과와 일치되게 2677G/3435C 일배체형은 다른 일배체형에 비해 치료반응을 보이는 경우가 유의하게 많았다 (P = 0.015). 결 론 : 소세포폐암에서 MDR1 유전자의 2677G>T와 3435C>T 다형성 및 이들 다형성의 일배체형은 etoposide-cisplatin 항암화학요법의 반응을 예측할 수 있는 지표로 사용될 수 있을 것으로 생각된다.

A New Method for Imputation of Missing Genotype using Linkage Disequilibrium and Haplotype Information (결측치가 존재하는 유전형 자료에서의 연관불균형과 일배체형을 사용한 결측치 대치 방법)

  • Park Yun-Ju;Kim Young-Jin;Park Jung-Sun;Kim Kuchan;Koh Insong;Jung Ho-Youl
    • Journal of KIISE:Software and Applications
    • /
    • v.32 no.2
    • /
    • pp.99-107
    • /
    • 2005
  • In this paper, wc propose a now missing imputation method for minimizing loss of information linkage disequilibrium-based and haplotype-based imputation method, which estimate missing values of the data based on the specificity of Single Nucleotide Polymorphism(SNP) genotype data. Method for imputing data is needed to minimize the loss of information caused by experimental missing data. In general, missing imputation of biological data has used major allele imputation method. but this approach is not optima]. 1'his method has high error rates of missing values estimation since the characteristics of the genotype data are not considered not take into consideration the specific structure of the data. In this paper, we show the results of the comparative evaluation of our model methods and major imputation method for the estimation of missing values.

The LD based Haplotype Reconstruction System for Large scale Genotype dataset (대용량 유전자형 데이터에 대한 LD기반의 일배체형 재구성 시스템)

  • Kim Sang-Jun;Yeo Sang-Soo;Kim Sung-Kwon
    • Proceedings of the Korean Information Science Society Conference
    • /
    • 2005.07b
    • /
    • pp.271-273
    • /
    • 2005
  • 유전자 분석기술의 발전은 지놈 프로젝트(genome project)와 햅맵 프로젝트(hapmap project)를 가능하게 하였으며 이제는 맞춤형 진단 및 신약 개발 등 실제 사업의 구체화를 가져오게 하였다. 실제 사업에 적용시키기 위해서는 비용 절감의 문제를 해결해야 한다. 그래서 대용량의 유전자형(genotype)데이터를 정확하고 빠르게 일배체형(haplotype)으로 재구성해 줄 수 있는 시스템이 생물 산업 및 제약 산업에서 제기되어 지고 있다. 기존의 연구에서 비록 정확성이 높은 알고리즘들이 개발되어 있지만 기존의 방법들은 계산에 필요한 양이 크기 때문에 대용량 데이터에 대한 처리가 불가능하였다. 우리가 제안하는 시스템은 대용량 데이터를 유동적인 크기로 블록을 분할하여 대용량 데이터 처리 문제를 해결하였다. 또한 나누어진 블록에서 나타나는 모호한 이형접합체(heterozygote)의 위상(phase)의 결정 과정에 LD기반의 블록 분할 방법을 이용함으로써, 추론된 결과의 정확률을 높였다. 구현된 시스템의 성능평가는 ms로 구성한 인공데이터를 사용하여 수행하였다.

  • PDF

Haplotype Assembly from Weighted SNP Fragments and Related Genotype Information (신뢰도를 가진 SNP 단편들과 유전자형으로부터 일배체형 조합)

  • Kang, Seung-Ho;Jeong, In-Seon;Choi, Mun-Ho;Lim, Hyeong-Seok
    • Journal of KIISE:Computer Systems and Theory
    • /
    • v.35 no.11
    • /
    • pp.509-516
    • /
    • 2008
  • The Minimum Letter Flips (MLF) model and the Weighted Minimum Letter Flips (WMLF) model are for solving the haplotype assembly problem. But these two models are effective only when the error rate in SNP fragments is low. In this paper, we first establish a new computational model that employs the related genotype information as an improvement of the WMLF model and show its NP-hardness, and then propose an efficient genetic algorithm to solve the haplotype assembly problem. The results of experiments on random data set and a real data set indicate that the introduction of genotype information to the WMLF model is quite effective in improving the reconstruction rate especially when the error rate in SNP fragments is high. And the results also show that genotype information increases the convergence speed of the genetic algorithm.