• Title/Abstract/Keywords: DNA data

Search results: 2,042 items (processing time 0.027 s)

On-off Regulation of 3' Exonuclease Excision to DNA Polymerization by Exo+ Polymerase

  • Zhang, Jia;Li, Kai
    • BMB Reports
    • /
    • Vol. 36, No. 6
    • /
    • pp.525-528
    • /
    • 2003
  • The role of 3' exonuclease excision in DNA polymerization was evaluated in primer extensions using 3' allele-specific primers that had exonuclease-digestible and exonuclease-resistant 3' termini. With exonuclease-digestible unmodified 3' mismatched primers, the exo+ polymerase yielded template-dependent products. Using exonuclease-resistant 3' mismatched primers, no primer-extended product resulted from exo+ polymerase. As a control, polymerase without proofreading activity yielded primer-dependent products from 3' mismatched primers. These data indicated that a successful removal of the mismatch is required for DNA polymerization from the 3' mismatched primers by exo+ polymerase. In addition to the well-known proofreading from this mismatch removal, the premature termination in DNA polymerization, due to the failure of the efficient removal of the mismatched nucleotides, worked as an off-switch in maintaining the high fidelity in DNA replication from exo+ polymerase.

Estimation of nuclear DNA content of various bamboo and rattan species

  • Kumar, Prakash P.;Turner, Ian M.;Rao, A. Nagaraja;Arumuganathan, K.
    • Plant Biotechnology Reports
    • /
    • Vol. 5, No. 4
    • /
    • pp.317-322
    • /
    • 2011
  • We determined the nuclear DNA content (genome size) of over 35 accessions each of bamboo and rattan species from Southeast Asia. The 2C DNA per nucleus was quantified by flow cytometry. The fluorescence of nuclei isolated from the leaves and stained with propidium iodide was measured. The genome size of the bamboo species examined was between 2.5 and 5.9 pg DNA per 2C nucleus. The genome size of the rattan species examined ranged from 1.8 to 10.5 pg DNA per 2C nucleus. This information will be useful for scientists working in diverse areas of plant biology such as biotechnology, biodiversity, genome analysis, plant breeding, physiology and molecular biology. Such data may be utilized to attempt to correlate the genome size with the ploidy status of bamboo species in cases where ploidy status has been reported.
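
Flow cytometry estimates of this kind are conventionally computed as a ratio against an internal reference standard of known genome size. The following is a minimal sketch of that arithmetic, not the authors' procedure: the peak values and the reference standard are hypothetical (the abstract does not report them), and the 978 Mbp per pg factor is the commonly used conversion.

```python
# Sketch: 2C DNA content from propidium iodide fluorescence peaks.
# All numeric inputs below are hypothetical; the abstract does not report them.

MBP_PER_PG = 978.0  # commonly used conversion: 1 pg of DNA is about 978 Mbp


def genome_size_pg(sample_peak, standard_peak, standard_2c_pg):
    """Estimate the sample 2C DNA content (pg) from the G0/G1 peak ratio."""
    if standard_peak <= 0:
        raise ValueError("standard_peak must be positive")
    return sample_peak / standard_peak * standard_2c_pg


def pg_to_mbp(pg):
    """Convert a DNA mass in picograms to megabase pairs."""
    return pg * MBP_PER_PG


size = genome_size_pg(sample_peak=300.0, standard_peak=200.0, standard_2c_pg=2.5)
print(f"{size:.2f} pg per 2C nucleus, about {pg_to_mbp(size):.1f} Mbp")
```

The same ratio applies to any species as long as the sample and the standard are stained and measured together.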

DNA Computing Simulation Using Thermodynamic Data for TSP with 26 Nodes

  • 장하영;신수용;장병탁
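    • Proceedings of the Korean Information Science Society Conference
    • /
    • Proceedings of the Korean Information Science Society 2004 Spring Conference, Vol. 31, No. 1 (B)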
    • /
    • pp.316-318
    • /
    • 2004
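  • As research on DNA computing progresses, various computational models with computing power equivalent to that of a Turing machine have been proposed, and attempts to solve NP problems by exploiting the parallelism of DNA continue. However, unlike algorithms on traditional von Neumann machines, which search the solution set sequentially, existing DNA computing algorithms generate all possible solutions in advance and then pick out the answer among them; as the size of an NP problem grows, even generating the initial solutions becomes impossible. Evolutionary methods are one possible remedy, but they require devising additional operators for evolutionary computation and bring difficulties in applying them. In this paper, we therefore verify, through simulation based on thermodynamic data, whether all possible initial solutions can be generated in DNA computing. This process makes it possible to predict in advance the success of costly real experiments and the validity of an experimental design, and it also suggests that DNA computing may be able to solve larger NP problems.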
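
To make the scaling problem concrete, the sketch below counts the candidate tours a generate-everything approach would have to produce. It assumes a complete graph with a fixed start city and one DNA strand per candidate tour; the paper's actual graph, encoding, and thermodynamic model are not described in the abstract, so this illustrates only the combinatorial growth.

```python
import math

AVOGADRO = 6.02214076e23  # molecules per mole


def candidate_tours(n_cities):
    """Number of orderings of n cities with the start city fixed: (n - 1)!."""
    if n_cities < 1:
        raise ValueError("n_cities must be at least 1")
    return math.factorial(n_cities - 1)


def moles_for_one_copy_each(n_cities):
    """Moles of DNA needed if each candidate tour is encoded by one strand."""
    return candidate_tours(n_cities) / AVOGADRO


for n in (7, 15, 26):
    print(n, candidate_tours(n), f"{moles_for_one_copy_each(n):.3g} mol")
```

Under these assumptions, a single copy of every candidate for 26 cities already exceeds a mole of strands, which is why checking feasibility by simulation before a wet-lab experiment is attractive.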

Comparison of Normalizations for cDNA Microarray Data

  • 김윤희;김호;박웅양;서진영;정진호
    • Proceedings of the Korean Statistical Society Conference
    • /
    • Proceedings of the Korean Statistical Society 2002 Spring Conference
    • /
    • pp.175-181
    • /
    • 2002
  • cDNA microarray experiments allow us to investigate the expression levels of thousands of genes simultaneously and make it easy to compare gene expression across different populations. However, researchers are advised to interpret the results cautiously because of unexpected sources of variation, such as systematic errors from the microarrayer and differences in cDNA dye intensity. In addition, the scanner calculates both the mean and the median of the signal and background pixels, so the analyst must choose which raw data to use. In this paper, we compare the results obtained with the mean and with the median of the raw data, and compare normalization methods for reducing systematic errors, using arm skin cells from old and young males. The median is preferable to the mean because the distribution of the test statistic (t-statistic) based on the median is closer to a normal distribution than that based on the mean. Judged by the distribution of the test statistic, scaled print-tip normalization performs better than global or lowess normalization.
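
The difference between global and print-tip normalization can be sketched with simple median centering of the log ratios M = log2(R/G). This is a simplified illustration with hypothetical intensities: it omits the scale adjustment of scaled print-tip normalization and the intensity-dependent fit of lowess, both of which the paper compares.

```python
import math
from collections import defaultdict
from statistics import median


def log_ratios(red, green):
    """M values: log2(red / green) for each spot."""
    return [math.log2(r / g) for r, g in zip(red, green)]


def global_median_normalize(m_values):
    """Center all M values by subtracting the array-wide median."""
    center = median(m_values)
    return [m - center for m in m_values]


def print_tip_median_normalize(m_values, tip_ids):
    """Center M values separately within each print-tip group."""
    groups = defaultdict(list)
    for m, tip in zip(m_values, tip_ids):
        groups[tip].append(m)
    centers = {tip: median(vals) for tip, vals in groups.items()}
    return [m - centers[tip] for m, tip in zip(m_values, tip_ids)]


# Hypothetical intensities for six spots printed by two tips.
red = [400.0, 800.0, 1600.0, 100.0, 200.0, 400.0]
green = [200.0, 200.0, 200.0, 200.0, 200.0, 200.0]
tips = [1, 1, 1, 2, 2, 2]

m = log_ratios(red, green)
print(global_median_normalize(m))
print(print_tip_median_normalize(m, tips))
```

In this toy data the two tips have different offsets, so global centering leaves a tip-specific bias that per-tip centering removes.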

A DNA Microarray LIMS System for Integral Genomic Analysis of Multi-Platform Microarrays

  • Cho, Mi-Kyung;Kang, Jason Jong-ho;Park, Hyun-Seok
    • Genomics & Informatics
    • /
    • Vol. 5, No. 2
    • /
    • pp.83-87
    • /
    • 2007
  • The analysis of DNA microarray data is a rapidly evolving area of bioinformatics, and various types of microarray are emerging as some of the most exciting technologies for use in biological and clinical research. In recent years, microarray technology has been utilized in various applications such as the profiling of mRNAs, assessment of DNA copy number, genotyping, and detection of methylated sequences. However, the analysis of these heterogeneous microarray platform experiments does not need to be performed separately. Rather, these platforms can be co-analyzed in combination, for cross-validation. There are a number of separate laboratory information management systems (LIMS) that individually address some of the needs for each platform. However, to our knowledge there are no unified LIMS systems capable of organizing all of the information regarding multi-platform microarray experiments, while additionally integrating this information with tools to perform the analysis. In order to address these requirements, we developed a web-based LIMS system that provides an integrated framework for storing and analyzing microarray information generated by the various platforms. This system enables an easy integration of modules that transform, analyze and/or visualize multi-platform microarray data.

DNA Sequences and Identification of Porphyra Cultivated by Natural Seeding on the Southwest Coast of Korea

  • 황미숙;김선미;하동수;백재민;김형섭;최한구
    • ALGAE
    • /
    • Vol. 20, No. 3
    • /
    • pp.183-196
    • /
    • 2005
  • Nuclear SSU and ITS1 rDNA and plastid rbcL sequences were determined to identify seven samples of Porphyra cultivated by natural seeding on the southwest coast of Korea, and were analyzed to assess their phylogenetic relationships with natural populations of P. tenera and P. yezoensis from Korea and Japan. SSU, rbcL and ITS1 data from 18, 21 and 31 samples, respectively, including previously published sequences, were investigated in the study. Results from our individual and combined data indicated that all seven samples were P. yezoensis and that all but one, from the Muan 2 aquafarm, grouped strongly with natural populations of P. yezoensis from the south and west coasts of Korea. Based on DNA sequence data, the sample from Muan 2 seems to be derived from a strain of P. yezoensis introduced from Japan by Porphyra farmers.

A Novel Node Management in Hadoop Cluster by using DNA

  • Balaraju. J;PVRD. Prasada Rao
    • International Journal of Computer Science & Network Security
    • /
    • Vol. 23, No. 9
    • /
    • pp.134-140
    • /
    • 2023
  • Distributed systems play a vital role in storing and processing big data, and data generation from various sources is increasing every second. Hadoop is a scalable, efficient distributed system that supports commodity hardware by combining different networks in a topographical locality. The number of nodes supported in a Hadoop cluster has grown rapidly across versions, which makes clusters difficult to manage. Hadoop does not provide node management features for adding and deleting nodes. Node identification in a cluster depends entirely on DHCP servers, which manage IP addresses and hostnames based on the physical (MAC) address of each node. This leaves scope for hackers to steal data using an IP address or hostname and to disrupt the distributed system by adding a malicious node or assigning a duplicate IP address. This paper proposes a novel node management scheme for distributed systems that uses DNA hiding and generates a unique key from each node's unique physical (MAC) address and hostname. The proposed mechanism provides better node management for the Hadoop cluster, supporting node addition and deletion with limited computation and giving better node security against hackers. The main aim of this paper is to propose an algorithm that hides node information in DNA sequences to secure nodes against hackers.
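
The abstract does not specify how the key is derived or how it is written as DNA. As a purely hypothetical sketch of the general idea, the code below hashes the MAC address and hostname with SHA-256 and writes the digest as nucleotides using an assumed 2-bit mapping (00 to A, 01 to C, 10 to G, 11 to T). The function names, the separator, and the mapping are illustrative assumptions, not the paper's algorithm.

```python
import hashlib

BASES = "ACGT"  # assumed 2-bit mapping: 00->A, 01->C, 10->G, 11->T


def bytes_to_dna(data):
    """Encode bytes as nucleotides, two bits per base, high bits first."""
    out = []
    for byte in data:
        for shift in (6, 4, 2, 0):
            out.append(BASES[(byte >> shift) & 0b11])
    return "".join(out)


def dna_to_bytes(seq):
    """Inverse of bytes_to_dna; len(seq) must be a multiple of 4."""
    if len(seq) % 4:
        raise ValueError("sequence length must be a multiple of 4")
    out = bytearray()
    for i in range(0, len(seq), 4):
        byte = 0
        for base in seq[i:i + 4]:
            byte = (byte << 2) | BASES.index(base)
        out.append(byte)
    return bytes(out)


def node_key(mac, hostname):
    """Hypothetical node key: SHA-256 of MAC and hostname, written as DNA."""
    digest = hashlib.sha256(f"{mac.lower()}|{hostname}".encode("utf-8")).digest()
    return bytes_to_dna(digest)


# Hypothetical node identity for illustration only.
key = node_key("00:1A:2B:3C:4D:5E", "datanode-01")
print(len(key), key[:16])
```

A cluster manager could compare such a key against a registered value before admitting a node, though how the paper performs that check is not stated.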

Development of a Reproducibility Index for cDNA Microarray Experiments

  • 김병수;라선영
    • Proceedings of the Korean Statistical Society Conference
    • /
    • Proceedings of the Korean Statistical Society 2002 Spring Conference
    • /
    • pp.79-83
    • /
    • 2002
  • Since their introduction in 1995 by Schena et al., cDNA microarrays have been established as a potential tool for high-throughput analysis that allows the global monitoring of expression levels for thousands of genes simultaneously. One characteristic of cDNA microarray data is that inherent noise remains even after systematic effects in the experiment have been removed. Replication is therefore crucial to the microarray experiment. The assessment of reproducibility among replicates, however, has drawn little attention. Reproducibility may be assessed at several different endpoints along the data reduction process for microarray data. We define reproducibility as the degree to which replicate arrays duplicate each other. The aim of this note is to develop a novel measure of reproducibility among replicates in the cDNA microarray experiment based on the unprocessed data. Suppose we have p genes and n replicates in a microarray experiment. We first develop a measure of reproducibility between two replicates and then generalize this concept to a measure of reproducibility of one replicate against the remaining n-1 replicates. We used the rank of the outcome variable and employed the concept of a measure of tracking from the blood pressure literature. We applied the reproducibility measure to two sets of microarray experiments, one of which was performed in a more homogeneous environment, and the results validated this novel method. The operational interpretation of this measure is clearer than that of Pearson's correlation coefficient, which might be used as a crude measure of reproducibility of two replicates.
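
The abstract does not give the formula for the proposed index. As a stand-in that shares only its rank-based, one-against-the-rest structure, the sketch below uses Spearman rank correlation between replicates and averages it over the remaining n-1 replicates. This is explicitly not the authors' tracking-based measure.

```python
def ranks(values):
    """Average ranks (1-based); tied values share the mean of their ranks."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    result = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        mean_rank = (i + j) / 2 + 1
        for k in range(i, j + 1):
            result[order[k]] = mean_rank
        i = j + 1
    return result


def pearson(x, y):
    """Pearson correlation of two equal-length sequences."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    cov = sum((a - mx) * (b - my) for a, b in zip(x, y))
    vx = sum((a - mx) ** 2 for a in x)
    vy = sum((b - my) ** 2 for b in y)
    return cov / (vx * vy) ** 0.5


def spearman(x, y):
    """Rank correlation between two replicates measured on the same genes."""
    return pearson(ranks(x), ranks(y))


def one_vs_rest(replicates, index):
    """Mean rank correlation of one replicate against the remaining n - 1."""
    others = [r for i, r in enumerate(replicates) if i != index]
    return sum(spearman(replicates[index], r) for r in others) / len(others)


# Hypothetical unprocessed intensities: three replicates over five genes.
reps = [
    [120.0, 340.0, 90.0, 560.0, 210.0],
    [130.0, 300.0, 95.0, 600.0, 190.0],
    [500.0, 100.0, 300.0, 150.0, 400.0],
]
print([round(one_vs_rest(reps, i), 3) for i in range(len(reps))])
```

A replicate whose one-against-the-rest score is much lower than the others would be a candidate for closer inspection.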

Development and Applications of a Paternity and Kinship Analysis System Based on DNA Data

  • 구교찬;김선욱
    • Journal of the Korea Academia-Industrial cooperation Society
    • /
    • Vol. 16, No. 10
    • /
    • pp.6715-6721
    • /
    • 2015
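  • Genetic analysis data for missing persons, unidentified bodies, and lost children have been increasing steadily, yet the statistical calculations for paternity testing are still mostly done by hand or in Excel. Software is therefore needed that can systematically manage and effectively analyze autosomal short tandem repeat (STR) data. A web-based genetic data analysis system that makes it easy to analyze paternity and kinship under a variety of options was developed through extensive testing and about 20 months of research. A paternity index algorithm is used for paternity analysis, and identity by descent (IBD) formulas are used for kinship analysis. Because its kinship indices and paternity probabilities have been verified against real data, the system is reliable and can also be used effectively to manage and analyze the genetic data generated in mass disasters. The system additionally includes advanced features such as an integrated database and algorithm environment, a user-centered interface, and process automation.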
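
The paternity calculation mentioned above typically ends with two standard quantities: the combined paternity index (CPI), the product of per-locus paternity indices, and the probability of paternity obtained from the CPI by Bayes' rule. The sketch below shows only that final step, assuming independent loci and hypothetical per-locus values; how the system derives each per-locus index from STR genotypes and allele frequencies is not described in the abstract.

```python
from math import prod


def combined_paternity_index(locus_pis):
    """Multiply per-locus paternity indices (assumes independent loci)."""
    if not locus_pis:
        raise ValueError("at least one locus is required")
    return prod(locus_pis)


def probability_of_paternity(cpi, prior=0.5):
    """Posterior probability of paternity from the CPI via Bayes' rule."""
    if not 0.0 < prior < 1.0:
        raise ValueError("prior must be strictly between 0 and 1")
    return cpi * prior / (cpi * prior + (1.0 - prior))


# Hypothetical per-locus paternity indices for three STR loci.
cpi = combined_paternity_index([2.0, 5.0, 10.0])
print(cpi, probability_of_paternity(cpi))
```

With the conventional prior of 0.5, the posterior reduces to CPI / (CPI + 1).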

Ensemble Classifier with Negatively Correlated Features for Cancer Classification

  • 원홍희;조성배
    • Journal of the Korean Information Science Society: Software and Applications
    • /
    • Vol. 30, No. 12
    • /
    • pp.1124-1134
    • /
    • 2003
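  • Recent DNA microarray technology yields large amounts of gene data, and it is expected to be applied to the diagnosis and treatment of cancer and to help greatly in classifying cancer accurately. Because the amount of gene data obtained from DNA is very large, analyzing it effectively is important. Since cancer classification is critical for diagnosis and treatment, it is preferable to derive a result by combining the outputs of multiple specialized classifiers rather than relying on a single classifier. In general, combining classifiers improves both classification performance and the reliability of the result. Despite the many advantages of ensemble classifiers, combining classifiers whose errors are dependent yields only limited improvement. In this paper, we propose a method that combines neural network classifiers trained on negatively correlated features in order to classify cancer accurately, and we systematically analyze the usefulness of the proposed method. Experiments on three benchmark cancer datasets confirmed that the ensemble classifier using negatively correlated features outperforms the other classifiers.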
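
The abstract does not say how the individual classifier outputs are combined. A minimal sketch of one common option, majority voting, is shown below; the voting rule, the threshold classifiers, and the gene names are all hypothetical stand-ins for the paper's neural network classifiers and feature sets.

```python
from collections import Counter


def majority_vote(predictions):
    """Return the most frequent label; ties go to the label seen first."""
    if not predictions:
        raise ValueError("no predictions to combine")
    return Counter(predictions).most_common(1)[0][0]


def ensemble_predict(classifiers, sample):
    """Apply each classifier (a callable) to the sample and combine votes."""
    return majority_vote([clf(sample) for clf in classifiers])


# Hypothetical base classifiers, each looking at a different gene.
classifiers = [
    lambda s: "tumor" if s["gene_a"] > 1.0 else "normal",
    lambda s: "tumor" if s["gene_b"] < 0.5 else "normal",
    lambda s: "tumor" if s["gene_c"] > 2.0 else "normal",
]

sample = {"gene_a": 1.4, "gene_b": 0.9, "gene_c": 2.3}
print(ensemble_predict(classifiers, sample))
```

Voting helps most when the base classifiers make different mistakes, which is the motivation the abstract gives for training them on differently correlated features.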