• Title/Summary/Keyword: 아미노산 서열 확인

Search Result 252, Processing Time 0.022 seconds

In Sitilico Protein Sequencing Based on Mass Spectrometry Using Multiple Pretenses (다중 효소를 이용한 질량분석기법에 기반한 단백질의 아미노산 서열 분석)

  • 문석현;이도헌;이광형
    • Proceedings of the Korean Institute of Intelligent Systems Conference
    • /
    • 2002.12a
    • /
    • pp.473-477
    • /
    • 2002
  • 세포내에서 특정 단백질이 합성되어 이용되는 것을 단백질의 발현이라 한다. 이러한 단백질의발현을 조사하는 작업은 세포내 대사과정을 밝혀내는 데 있어서 매우 중요한 역할을 담당하고 있다. 단백질의 발현을 조사하기 위해서는 세포로부터 추출하여 정제한 단백질이 어떤 단백질인지를 확인하는 작업이 필요한데 현재로써는 확인하고자 하는 단백질 효소로 분해하여 분해된 조각들의 질량을 측정하여 기존에 알려진 단백질들을 분해했을 때 이론상 나을 수 있는 조각들의 무게와 비교하여 가장 근접한 단백질을 찾아내는 질량분석기법(mass Spectrometry)이 널리 사용된다. 그러나 이 방법은 확인하고자 하는 단백질의 아미노산 서열이 알려져 있을 경우에만 사용할 수 있다는 한계점을 가지고 있다. 본 논문에서는 이러한 한계를 계산적인 방법으로 극복하고자 동일단백질을 여러가지 효소로 분해하여 나오는 조각들의 질량을 측정하고 이들을 조합하여 원래 단백질의 아미노산 서열을 알아낼 수 있는 알고리즘을 제안한다.

Signal Sequence Prediction Based on Hydrophobicity and Substitution Matrix (소수성과 치환행렬에 기반한 신호서열 예측)

  • Chi, Sang-Mun
    • Journal of KIISE:Software and Applications
    • /
    • v.34 no.7
    • /
    • pp.595-602
    • /
    • 2007
  • This paper proposes a method that discriminates signal peptide and predicts the cleavage site of the secretory proteins cleaved by the signal peptidase I. The preprocessing stage uses hydrophobicity scales of amino acids in order to predict the presence of signal sequence and the cleavage site. The preprocessing enhances the performance of the prediction method by eliminating the non-secretory proteins in the early stage of prediction. for the effective use of support vector machine for the signal sequence prediction, the biologically relevant distance between the amino acid sequences is defined by using the hydrophobicity and substitution matrix; the hydrophobicity can be used to Predict the location of amino acid in a cell and the substitution matrix represents the evolutionary relationships of amino acids. The proposed method showed 98.9% discrimination rates from signal sequences and 88% correct rate of the cleavage site prediction on Swiss-Prot release 50 protein database using the 5-fold-cross-validation. In the comparison tests, the proposed method has performed significantly better than other prediction methods.

A New Gene of Protein Related to Myoblast Fusion detected by Monoclonal antibidy (근원세포 융합과 관련된 새로운 유전자의 확인)

  • 박수정;이영주
    • The Korean Journal of Zoology
    • /
    • v.38 no.1
    • /
    • pp.49-54
    • /
    • 1995
  • 본 연구자들은 근원세포를 면역시켜 얻은 hybidoma들을 검색하여. 계배 근원세포의 분화와 관련된 단백질을 인지하여 분화를 억제하는 대과가 있는 monoclonal antibody 3H35를 선별하여 그 항원을 확인한 바 있다(Kim et af.. (1992), Korean J. Zool 35 29-36) 본 연구에서는 λZAP에 cloning된 chicken muscle CDNA library들을 lacZ fusion protein으로 발현시켜 항체 3H35로 검색하여 그 유전자를 찾아내었다. 선별한 CDNA clone 중 C59의 삽입 절편은 1.6 kb이었고, 발현시킨 facE fusion protein 은 60 kDa로, f-galactosidase에 대한 항체에 반응하며 3H35와도 반응함을 immunoaffinitv adsorbant와 immunoblot으로 확인하였다 Clone C59의 삽입 절편의 염기서열을 분석한 결과, 실제 유전자는 1.6 kb 이상이며, 알려진 어느 다른 유전자와도 관련이 없는 새로운 근특이 유전자로 판단되었다. 아미노산으로 전환시켰을 때 31개의 특이한 서열이 7차례 반복된 부분이 나타났으며 이 서열의 23개가 일정하게 보존되어있고 나머지 서열의 아미노산의 polarity도 매우 유사하게 효존되어있다. 이들의 보존성이 극히 높은 것으로 보아 독특한 기능을 수행하는 domain으로 추정된다.

  • PDF

Genetic Variation of Cytochrome P450 Genes in Garlic Cultivars (마늘유래 Cytochrome P450 유전자의 변이 분석)

  • Kwon, Soon-Tae;Kamiya, Juli
    • Korean Journal of Plant Resources
    • /
    • v.24 no.5
    • /
    • pp.584-590
    • /
    • 2011
  • Wound inducible P450-Esg cDNA, one of cytochrome P450 gene family, was isolated from shoot of Euiseong garlic cultivar. P450-Esg cDNA possesses highly conserved heme-binding domain in the nucleotide sequence, and 1,419 bp of open reading frame (ORF) coding of 473 amino acids. Based on the nucleotide sequence analysis of P450-Esg homologous from twelve garlic cultivars, two domains, one domain between 472 to 510 bp, and the other between 1,210 to 1,249 bp from start codon (ATG), showed various nucleotide polymorphism among cultivars. Sequence of heme-binding domain in P450-Esg homologous, which is located at the domain between 1,210 to 1,240 bp from start codon, showed various nucleotide polymorphism as well as amino acid sequence polymorphism among twelve garlic cultivars. Anther domain, between 472 to 510 bp from start codon, showed exactly same amino acid sequence in the twelve garlic cultivars, but there were various single nucleotide polymorphism to the cultivars.

Analysis of N- Terminal Amino Acid Sequence of Catechol 2,3-dioxygenase from Aniline Degrading Delftia sp. JK-2 (Aniline 분해세균 Delftia sp. JK-2에서 분리된 Catechol 2,3-dioxygenase의 N-말단 아미노산 서열 분석)

  • Hwang Seon-Young;Kahng Hyung-Yeel;Oh Kye-Heon
    • Korean Journal of Microbiology
    • /
    • v.41 no.1
    • /
    • pp.13-17
    • /
    • 2005
  • The aim of this work was to investigate the N-terminal amino acid sequence of catechol 2,3-dioxygenase isolated from Delftia sp. JK-2, which could utilize aniline as sole carbon, nitrogen and energy source. Molecular weight of the enzyme was determined to approximately 35 kDa by SDS-PAGE. N-terminal amino acid sequence of C2,3O from strain JK-2 was $^1MGVMRIGHASLKVMDMDAAVRHYENV^{26}$, and exhibited high sequence similarity with that of C2,3O from Pseudomonas sp., Comamonas sp. JS765, Comamonas test-osteroni, or Burkholderia sp. RP007. Approximately 950-bp C2,3O was obtained through PCR using the primers derived from N-terminal amino acid sequence. Analysis of the DNA sequence revealed that the deduced 296 amino acid sequences were determined, and it showed $100\%$ identity with C2,3O from Pseudomonas sp. AW-2 and $97\%$ similarity with Comamonas sp. JS765.

Estimating Amino Acid Composition of Protein Sequences Using Position-Dependent Similarity Spectrum (위치 종속 유사도 스펙트럼을 이용한 단백질 서열의 아미노산 조성 추정)

  • Chi, Sang-Mun
    • Journal of KIISE:Software and Applications
    • /
    • v.37 no.1
    • /
    • pp.74-79
    • /
    • 2010
  • The amino acid composition of a protein provides basic information for solving many problems in bioinformatics. We propose a new method that uses biologically relevant similarity between amino acids to determine the amino acid composition, where the BOLOSUM matrix is exploited to define a similarity measure between amino acids. Futhermore, to extract more information from a protein sequence than conventional methods for determining amino acid composition, we exploit the concepts of spectral analysis of signals such as radar and speech signals-the concepts of time-dependent analysis, time resolution, and frequency resolution. The proposed method was applied to predict subcellular localization of proteins, and showed significantly improved performance over previous methods for amino acid composition estimation.

The Complete Amino Acid Sequence of Newborn Dog Prochymosin (강아지 프로카이모신의 전 아미노산 서열)

  • Yoon, Joo-Ok;Kim, Hyun-Ku
    • Journal of the East Asian Society of Dietary Life
    • /
    • v.7 no.3
    • /
    • pp.289-300
    • /
    • 1997
  • Newborn dog chymosin was extracted from the stomachs of dogs of 2 weeks of age, and was purified by ion exchange chromatography. Half of the sequence was determined by amino acid sequencing and the complete sequence was deduced from a cloned chymosin cDNA Results showed that the zymogen showed 79% sequence identity with calf prochymosin and 54% identity with porcine pepsinogen A The size of the propart and location of the residue which becomes the amino-terminus in the active enzyme was the same in the prochymosins. The maximum general proteolytic activity at pH 3.2 of newborn dog chymosin was 3-4% of that of porcine pepsin A at pH 2, whereas the milk clotting activity relative to the general proteolytic activity of newborn dog chymosin was much higher than that of calf chymosin. Agar gel electrophoresis at pH 5.2 of stomach extracts of individual dogs showed the existence of two predominant genetic variants of zymogen and enzyme. The two variants could not be distinguished by amino acid composition or amino-terminal sequencing, and no differences in the enzymatic properties of the genetic variants were observed. It was concluded that of the residues that participate in the substrate binding, calf and newborn dog chymosin differ in the following positions (porcine pepsin numbering, subsites in parentheses) : Ser 12 Thr(S$_4$), Leu 30 Val(S$_1$/S$_3$), His 74 Gln(S'$_2$), Val 111 Ile(S$_1$/S$_3$), Lys 220 Met(S$_4$). With regard to the low general proteolytic activity of newborn dog chymosin, the substitution Asp303 Val relative to calf chymosin may contribute to an explanation of this.

  • PDF

대두단백질 가수분해물의 쓴맛 펩타이드 구조와 특징

  • Lee, Cheol-Ho
    • Proceedings of the Korean Society of Life Science Conference
    • /
    • 2001.06a
    • /
    • pp.3-41
    • /
    • 2001
  • 단백질의 부분 가수분해는 산성 음료에서의 용해도 증가, 환자들의 소화력과 알러지 내성의 개선, 다른 기능적 특성의 개발 등을 위하여 식품산업에 널리 이용되고 있다. 그러나 우유 단백질이나 대두 단백질과 같은 몇 가지 단백질들은 가수분해에 의하여 강한 쓴맛을 형성한다, 단백질 가수분해물의 쓴맛에 관한 연구는 1950년대 초에 시작되었으며, 여러 가지 원료로부터 쓴맛물질이 분리되었다. 이들 단백질 가수분해물의 쓴맛 물질은 올리고펩타이드로 알려져 있으며, 펩타이드 분자를 구성하는 소수성 아미노산의 존재와 밀접한 관계가 있는 것으로 보고되고 있다. 본 연구에서는 최근에 발달된 분석기술과 생명공학적 기법으로 E. coli에서 생산한 콩 단백질 단일 subunit를 이용하여 효소적 가수분해물의 분자구조를 확인하고자 하였다. 탈지대두박으로부터 115 glycinin와 E.coli떼서 발현된 proglycinin을 각각 90%, 97%의 정제도로 분리하여 이들 단백질을 trypsin으로 각각 가수분해하였다. 115 glycinin은 효소/기질 비 3%에서 4시간 가수분해에 의해 $14.0{\times}10^{-5}$ M quinine-HCI equivalent의 강한 쓴맛을 나타내었으며, 12%의 가수분해도(DH)를 나타내었다. 대두 단백질의 쓴맛 성분을 확인 위하여 이미 아미노산 서열이 밝혀진 11S glycinin과 proglycinin 가수분해물에서 GP-HPLC, $C_{18}$ RP-HPLC 등을 통하여 쓴맛 peptide들을 분리하였다. 각각의 분획은 다시 21개의 peptide로 분리되어 그 서열이 결정되었으며 이중 RP와 GI는 이미 알려진 쓴맛 dipeptide였고, LAGNQEQE, SAEFG, NALPE, KLHENIAR, GMIYPG 등이 주된 쓴맛 Peptide로 확인되었다. 이들은 11S glycinin의 5개의 subunit 중에서 그 위치가 확인되었다. Proglycinin 가수분해물에서도 11S glycinin과 같은 방법으로 7개의 쓴맛 peptide가 분리되었다. 이들은 $A_{1a}B_{1b}$의 아미노산 서열 중에서 37-42, 103-110, 164-167, 323-327, 367-373의 위치에 분포하고 있었으며, NALKPD, IYPGCPST, SlDT, HNIGQT, NAMFVPH의 서열을 나타내었다. 분리된 쓴맛 peptide 중에서 가장 쓴 두 분회의 peptide를 합성하여 관능 검사한 결과, NALPE는 매우 쓴맛을 내는 peptide로 확인되었다.

  • PDF

Proteome Data Analysis of Hairy Root of Panax ginseng : Use of Expressed Sequence Tag Data of Ginseng for the Protein Identification (인삼 모상근 프로테옴 데이터 분석 : 인삼 EST database와의 통합 분석에 의한 단백질 동정)

  • Kwon, Kyung-Hoon;Kim, Seung-Il;Kim, Kyung-Wook;Kim, Eun-A;Cho, Kun;Kim, Jin-Young;Kim, Young-Hwan;Yang, Deok-Chun;Hur, Cheol-Goo;Yoo, Jong-Shin;Park, Young-Mok
    • Journal of Plant Biotechnology
    • /
    • v.29 no.3
    • /
    • pp.161-170
    • /
    • 2002
  • For the hairy root of Panax ginseng, we have got mass spectrums from MALDI/TOF/MS analysis and Tandem mass spectrums from ESI/Q-TOF/MS analysis. While mass spectrum provides the molecular weights of peptide fragments digested by protease such as trypsin, tandem mass spectrum produces amino acid sequence of digested peptides. Each amino acid sequences can be a query sequence in BLAST search to identify proteins. For the specimens of animals or plants of which genome sequences were known, we can easily identify expressed proteins from mass spectrums with high accuracy. However, for the other specimens such as ginseng, it is difficult to identify proteins with accuracy since all the protein sequences are not available yet. Here we compared the mass spectrums and the peptide amino acid sequences with ginseng expressed sequence tag (EST) DB. The matched EST sequence was used as a query in BLAST search for protein identification. They could offer the correct protein information by the sequence alignment with EST sequences. 90% of peptide sequences of ESI/Q-TOF/MS are matched with EST sequences. Comparing 68% matches of the same sequences with the nr database of NCBI, we got more matches by 22% from ginseng EST sequence search. In case of peptide mass fingerprinting from MALDI/TOF/MS, only about 19% (9 proteins of 47 spots) among peptide matches from nr DB were correlated with ginseng EST DB. From these results, we suggest that amino acid sequencing using tandem mass spectrum analysis may be necessary for protein identification in ginseng proteome analysis.