• Title/Summary/Keyword: promoter prediction

Search Result 32, Processing Time 0.025 seconds

PromoterWizard: An Integrated Promoter Prediction Program Using Hybrid Methods

  • Park, Kie-Jung;Kim, Ki-Bong
    • Genomics & Informatics
    • /
    • v.9 no.4
    • /
    • pp.194-196
    • /
    • 2011
  • Promoter prediction is a very important problem and is closely related to the main problems of bioinformatics such as the construction of gene regulatory networks and gene function annotation. In this context, we developed an integrated promoter prediction program using hybrid methods, PromoterWizard, which can be employed to detect the core promoter region and the transcription start site (TSS) in vertebrate genomic DNA sequences, an issue of obvious importance for genome annotation efforts. PromoterWizard consists of three main modules and two auxiliary modules. The three main modules include CDRM (Composite Dependency Reflecting Model) module, SVM (Support Vector Machine) module, and ICM (Interpolated Context Model) module. The two auxiliary modules are CpG Island Detector and GCPlot that may contribute to improving the predictive accuracy of the three main modules and facilitating human curator to decide on the final annotation.

A Study On the Application Methods of a Support Vector Machine for Gene Promoter Prediction. (유전자 프로모터 예측을 위한 Support Vector Machine의 응용 방법에 대한 연구)

  • Kim, Ki-Bong
    • Journal of Life Science
    • /
    • v.17 no.5 s.85
    • /
    • pp.714-718
    • /
    • 2007
  • The high-throughput sequencing of a lot of genomes has resulted in the relatively rapid accumulation of an enormous amount of genomic sequence data. In this context, the problem posed by the detection of promoters in genomic DNA sequences via computational methods has attracted considerable attention in recent years since exact promoter prediction can give a clue to the elucidation of overall genetic networks. In this study, applications of support vector machine(SVM) to promoter prediction are explored to show a right approaches to discriminate between promoter and non-promoter regions by means of SVM. The results of various experiments show that encoding method, encoding region and learning data constitution can play an important role in the performance of SVM.

Promoter Prediction using Genetic Algorithm (유전자 알고리즘을 이용한 Promoter 예측)

  • 오민경;김창훈;김기봉;공은배;김승목
    • Proceedings of the Korean Information Science Society Conference
    • /
    • 1999.10b
    • /
    • pp.12-14
    • /
    • 1999
  • Promoter는 transcript start site 앞부분에 위치하여 RNA polymerase가 높은 친화성을 보이며 바인당하는 DNA상의 특별한 부위로서 여기서부터 DNA transcription이 시작된다. function이나 tissue-specific gene들의 그룹별로 그 promoter들의 특이한 패턴들의 조합을 발견함으로써 Specific한 transcription을 조절하는 것으로 알려져 있어 promoter로 인한 그 gene의 정보를 어느 정도 알 수가 있다. 사람의 housekeeping gene promoter들을 EPD(eukaryotic promoter database)와 EMBL nucleic acid sequence database로부터 수집하여 이것들 간에 의미 있게 나타나는 모든 패턴들을 optimization algorithm으로 알려진 genetic algorithm을 이용해서 찾아보았다.

  • PDF

Promoter Classification Using Genetic Algorithm Controlled Generalized Regression Neural Network (유전자 알고리즘과 일반화된 회귀 신경망을 이용한 프로모터 서열 분류)

  • 김성모;김근호;김병환
    • The Transactions of the Korean Institute of Electrical Engineers D
    • /
    • v.53 no.7
    • /
    • pp.531-535
    • /
    • 2004
  • A new method is presented to construct a classifier. This was accomplished by combining a generalized regression neural network (GRNN) and a genetic algorithm (GA). The classifier constructed in this way is referred to as a GA-GRNN. The GA played a role of controlling training factors simultaneously. The GA-GRNN was applied to classify 4 different Promoter sequences. The training and test data were composed of 115 and 58 sequence patterns, respectively. The classifier performance was investigated in terms of the classification sensitivity and prediction accuracy. Compared to conventional GRNN, GA-GRNN significantly improved the total classification sensitivity as well as the total prediction accuracy. As a result, the proposed GA-GRNN demonstrated improved classification sensitivity and prediction accuracy over the convention GRNN.

Promoter classification using genetic algorithm controlled generalized regression neural network

  • Kim, Kun-Ho;Kim, Byun-Gwhan;Kim, Kyung-Nam;Hong, Jin-Han;Park, Sang-Ho
    • 제어로봇시스템학회:학술대회논문집
    • /
    • 2003.10a
    • /
    • pp.2226-2229
    • /
    • 2003
  • A new method is presented to construct a classifier. This was accomplished by combining a generalized regression neural network (GRNN) and a genetic algorithm (GA). The classifier constructed in this way is referred to as a GA-GRNN. The GA played a role of controlling training factors simultaneously. In GA optimization, neuron spreads were represented in a chromosome. The proposed optimization method was applied to a data set, consisted of 4 different promoter sequences. The training and test data were composed of 115 and 58 sequence patterns, respectively. The range of neuron spreads was experimentally varied from 0.4 to 1.4 with an increment of 0.1. The GA-GRNN was compared to a conventional GRNN. The classifier performance was investigated in terms of the classification sensitivity and prediction accuracy. The GA-GRNN significantly improved the total classification sensitivity compared to the conventional GRNN. Also, the GA-GRNN demonstrated an improvement of about 10.1% in the total prediction accuracy. As a result, the proposed GA-GRNN illustrated improved classification sensitivity and prediction accuracy over the conventional GRNN.

  • PDF

Quantitative Assessment of the Diagnostic Role of CDH13 Promoter Methylation in Lung Cancer

  • Zhong, Yun-Hua;Peng, Hao;Cheng, Hong-Zhong;Wang, Ping
    • Asian Pacific Journal of Cancer Prevention
    • /
    • v.16 no.3
    • /
    • pp.1139-1143
    • /
    • 2015
  • In order to explore the association between cadherin 13 (CDH13) gene promoter methylation and lung carcinoma (LC) risk, we carried out a meta-analysis with searching of PubMed, Web of Science. Ultimately, 17 articles were identified and analysised by STATA 12.0 software. Overall, we found a significant relationship between CDH13 promoter methylation and LC risk (odds ratio=6.98, 95% confidence interval: 4.21-11.56, p<0.001). Subgroup analyses further revealed that LC risk was increased for individuals carrying the methylated CDH13 compared with those with unmethylated CDH13. Hence, our study identified a strong association between CDH13 gene promoter methylation and LC and highlighted a promising potential for CDH13 methylation in LC risk prediction.

DNA Sequence Classification Using a Generalized Regression Neural Network and Random Generator (난수발생기와 일반화된 회귀 신경망을 이용한 DNA 서열 분류)

  • 김성모;김근호;김병환
    • The Transactions of the Korean Institute of Electrical Engineers D
    • /
    • v.53 no.7
    • /
    • pp.525-530
    • /
    • 2004
  • A classifier was constructed by using a generalized regression neural network (GRU) and random generator (RG), which was applied to classify DNA sequences. Three data sets evaluated are eukaryotic and prokaryotic sequences (Data-I), eukaryotic sequences (Data-II), and prokaryotic sequences (Data-III). For each data set, the classifier performance was examined in terms of the total classification sensitivity (TCS), individual classification sensitivity (ICS), total prediction accuracy (TPA), and individual prediction accuracy (IPA). For a given spread, the RG played a role of generating a number of sets of spreads for gaussian functions in the pattern layer Compared to the GRNN, the RG-GRNN significantly improved the TCS by more than 50%, 60%, and 40% for Data-I, Data-II, and Data-III, respectively. The RG-GRNN also demonstrated improved TPA for all data types. In conclusion, the proposed RG-GRNN can effectively be used to classify a large, multivariable promoter sequences.

Prediction of promoter by Backpropagation (Backpropagation을 이용한 Promoter 예측 방법)

  • 허미영;김홍기;최진성
    • Proceedings of the IEEK Conference
    • /
    • 2003.07d
    • /
    • pp.1569-1572
    • /
    • 2003
  • 최근 생명공학 분야의 기술이 혁신적으로 발달함에 따라 게놈 프로젝트가 본래 계획보다 2년 앞당겨져 2003 년 4 월 인간 유전자의 완전한 서열을 밝히고 성공적으로 완료됨으로서 관련 연구자들은 인간의 유전자에 대한 대량의 서열 데이터를 얻게 되었다. 그래서 게놈 프로젝트의 다음 단계로서 엄청난 양의서열 정보 분석으로부터 유전자의 기능을 파악하고자 하는 연구들이 이미 세계적으로 활발히 진행되고 있다. 이러한 연구들의 최종적 목표는 질병 치료와 생명연장의 실현이라고 볼 수 있다. 유전자 연구를 위해선 우선 일차적으로 유전자 부위를 파악해야 한다. 유전자는 구조적으로 다시 여러 부분으로 나뉘는데 유전자 발현의 개시에 매우 중요한 요소 중 하나가 바로 프로모터 (Promoter) 이다. 프로모터 내에는 TATA box 가 있는데 이는 프로모터의 핵심 요소이다. 프로모터는 생명체의 종 그리고 RNA 중합효소의 종류에 따라 다르다. 이 논문에서는 다양한 신경망 알고리즘 중의 하나인 Backtpropagation 을 이용하여 밝혀지지 알은 서열에서 인간을 포함하는 원핵생물의 프로모터 서열을 예측할 수 있는 방법을 얻었기에 소개하고자 한다.

  • PDF

Modelling of starch industry wastewater microfiltration parameters by neural network

  • Jokic, Aleksandar I.;Seres, Laslo L.;Milovic, Nemanja R.;Seres, Zita I.;Maravic, Nikola R.;Saranovic, Zana;Dokic, Ljubica P.
    • Membrane and Water Treatment
    • /
    • v.9 no.2
    • /
    • pp.115-121
    • /
    • 2018
  • Artificial neural network (ANN) simulation is used to predict the dynamic change of permeate flux during wheat starch industry wastewater microfiltration with and without static turbulence promoter. The experimental program spans range of a sedimentation times from 2 to 4 h, for feed flow rates 50 to 150 L/h, at transmembrane pressures covering the range of $1{\times}10^5$ to $3{\times}10^5Pa$. ANN predictions of the wastewater microfiltration are compared with experimental results obtained using two different set of microfiltration experiments, with and without static turbulence promoter. The effects of the training algorithm, neural network architectures on the ANN performance are discussed. For the most of the cases considered, the ANN proved to be an adequate interpolation tool, where an excellent prediction was obtained using automated Bayesian regularization as training algorithm. The optimal ANN architecture was determined as 4-10-1 with hyperbolic tangent sigmoid transfer function transfer function for hidden and output layers. The error distributions of data revealed that experimental results are in very good agreement with computed ones with only 2% data points had absolute relative error greater than 20% for the microfiltration without static turbulence promoter whereas for the microfiltration with static turbulence promoter it was 1%. The contribution of filtration time variable to flux values provided by ANNs was determined in an important level at the range of 52-66% due to increased membrane fouling by the time. In the case of microfiltration with static turbulence promoter, relative importance of transmembrane pressure and feed flow rate increased for about 30%.

Prediction of Core Promoter Region with Dependency - Reflecting Decomposition Model (의존성 반영 분해모델에 의한 유전자의 핵심 프로모터 영역 예측)

  • 김기봉;박기정;공은배
    • Journal of KIISE:Software and Applications
    • /
    • v.30 no.3_4
    • /
    • pp.379-387
    • /
    • 2003
  • A lot of microbial genome projects have been completed to pour the enormous amount of genomic sequence data. In this context. the problem of identifying promoters in genomic DNA sequences by computational methods has attracted considerable research attention in recent years. In this paper, we propose a new model of prokaryotic core promoter region including the -10 region and transcription initiation site, that is Dependency-Reflecting Decomposition Model (DRDM), which captures the most significant biological dependencies between positions (allowing for non-adjacent as well as adjacent dependencies). DRDM showed a good result of performance test and it will be employed effectively in predicting promoters in long microbial genomic Contigs.