• Title/Summary/Keyword: NCBI (National Center for Biotechnology Information)


Patome: Database of Patented Bio-sequences

  • Kim, SeonKyu;Lee, ByungWook
    • Genomics & Informatics / v.3 no.3 / pp.94-97 / 2005
  • We have built a database server called Patome, which contains annotation information for patented bio-sequences from the Korean Intellectual Property Office (KIPO). The aims of Patome are to annotate Korean patent bio-sequences and to provide information on the patent relationships of public database entries. The patent sequences were annotated against the Reference Sequence (RefSeq) database or NCBI's nr database. Both the raw patent data and the annotated data are stored in the database. The annotation information can be used to determine whether a particular RefSeq ID or NCBI nr ID is related to a Korean patent. The Patome infrastructure consists of three components: the database itself, a sequence data loader, and an online database query interface. The database can be queried by submission number, organism, title, applicant name, or accession number. Patome can be accessed at http://www.patome.net. The information will be updated every two months.

Construction of BLAST Server for Mollusks (연체동물 전용 서열 블라스트 서버구축)

  • Lee, Yong-Seok;Jo, Yong-Hun;Kim, Dae-Soo;Kim, Dae-Won;Kim, Min-Young;Choi, Sang-Haeng;Yon, Jei-Oh;Byun, In-Sun;Kang, Bo-Ra;Jeong, Kye-Heon;Park, Hong-Seog
    • The Korean Journal of Malacology / v.20 no.2 / pp.165-169 / 2004
  • The BLAST server for mollusks was built on an Intel Server Platform SC-5250 with dual Xeon 2.8 GHz CPUs and the Linux operating system. After installing the operating system and configuring the web server for CGI (Common Gateway Interface), we installed the NCBI (National Center for Biotechnology Information) WebBLAST package (http://chimp.kribb.re.kr/mollusks). The stand-alone BLAST was built as follows. First, we downloaded the mollusk-related genome information (mitochondrial genomes), DNA sequences, and amino acid sequences available at NCBI. Second, the data were converted into multi-FASTA format and stored as a database using the formatdb program provided by NCBI. Finally, CGI was used to serve the stand-alone BLAST. In addition, we added vector, Escherichia coli, and repeat sequences to the server so that potential contamination can be checked, and we installed the primer3 program so that users can design primers. The stand-alone BLAST offers several advantages: (1) BLAST searches return only results directly related to mollusk sequences; (2) contamination can be checked conveniently when cDNA or genomic libraries are constructed from mollusks; and (3) BLAST results for mollusk sequence information are returned more quickly than from the current NCBI service.
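
The database-building step described in this abstract can be sketched roughly as follows. This is a minimal illustration, not the authors' scripts: the record names, the file name `mollusks.fasta`, and both helper functions are hypothetical. The `formatdb` options shown (`-i` input file, `-p` protein flag, `-o` parse sequence IDs) are those of the legacy NCBI BLAST toolkit; the command is only assembled here, not executed.

```python
from typing import Iterable, List, Tuple


def to_multifasta(records: Iterable[Tuple[str, str]], width: int = 60) -> str:
    """Serialize (header, sequence) pairs as a single multi-FASTA string."""
    lines: List[str] = []
    for header, seq in records:
        lines.append(">" + header)
        # Wrap each sequence at a fixed line width, as FASTA files usually do.
        for i in range(0, len(seq), width):
            lines.append(seq[i:i + width])
    return "\n".join(lines) + "\n"


def formatdb_command(fasta_path: str, is_protein: bool) -> List[str]:
    """Build (but do not run) a legacy formatdb command line for the FASTA file."""
    return ["formatdb", "-i", fasta_path, "-p", "T" if is_protein else "F", "-o", "T"]


# Hypothetical records standing in for sequences downloaded from NCBI.
records = [
    ("mollusk_seq_1 hypothetical", "ATGCATGCAT"),
    ("mollusk_seq_2 hypothetical", "GGCCTTAA"),
]
multifasta = to_multifasta(records, width=5)
command = formatdb_command("mollusks.fasta", is_protein=False)
```

Passing `command` to `subprocess.run` would perform the actual indexing on a machine where the legacy toolkit is installed; current BLAST+ releases provide `makeblastdb` for the same purpose.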


Extended latex proteome analysis deciphers additional roles of the lettuce laticifer

  • Cho, Won-Kyong;Chen, Xiong-Yan;Rim, Yeong-Gil;Chu, Hyo-Sub;Jo, Yeon-Hwa;Kim, Su-Wha;Park, Zee-Yong;Kim, Jae-Yean
    • Plant Biotechnology Reports / v.4 no.4 / pp.311-319 / 2010
  • Lettuce is an economically important leafy vegetable that accumulates a milk-like sap called latex in the laticifer. Previously, we conducted a large-scale lettuce latex proteomic analysis. However, the identified proteins were obtained only from lettuce ESTs and proteins deposited in NCBI databases. To extend the number of known latex proteins, we carried out an analysis identifying 302 additional proteins that were matched to the NCBI non-redundant protein database. Interestingly, the newly identified proteins were not recovered from lettuce EST and protein databases, indicating the usefulness of this hetero system in MudPIT analysis. Gene ontology studies revealed that the newly identified latex proteins are involved in many processes, including many metabolic pathways, binding functions, stress responses, developmental processes, protein metabolism, transport and signal transduction. Application of the non-redundant plant protein database led to the identification of an increased number of latex proteins. These newly identified latex proteins provide a rich source of information for laticifer research.

Computational analysis of SARS-CoV-2, SARS-CoV, and MERS-CoV genome using MEGA

  • Sohpal, Vipan Kumar
    • Genomics & Informatics / v.18 no.3 / pp.30.1-30.7 / 2020
  • The novel coronavirus pandemic originated in China and spread throughout the world within three months. The genomes of severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) and its predecessors, severe acute respiratory syndrome coronavirus (SARS-CoV) and Middle East respiratory syndrome coronavirus (MERS-CoV), play an important role in understanding genetic variation. In this paper, genomic data were obtained from the National Center for Biotechnology Information (NCBI) and analyzed statistically with Molecular Evolutionary Genetics Analysis (MEGA). First, the Bayesian information criterion (BIC) and the corrected Akaike information criterion (AICc) were used to identify the best substitution pattern. Second, the maximum likelihood method was used to estimate the transition/transversion bias (R) under the Kimura-2, Tamura-3, Hasegawa-Kishino-Yano, and Tamura-Nei nucleotide substitution models. Third, nucleotide frequencies were computed from the NCBI genomic data. The results indicate that the general time reversible model has the lowest BIC and AICc scores, 347,394 and 347,287, respectively. The transition/transversion bias for the nucleotide substitution models varies from 0.56 to 0.59 in the MEGA output. The average frequencies of the nitrogenous bases U, C, A, and G are 31.74%, 19.48%, 28.04%, and 20.74%, respectively. Overall, the genomic data analysis of SARS-CoV-2, SARS-CoV, and MERS-CoV highlights their close genetic relationship.
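
To make two of the quantities in this abstract concrete, the sketch below computes base frequencies in percent and counts transitions and transversions between two aligned sequences. It is a simple counting illustration under stated assumptions, not MEGA's maximum likelihood estimate of R: the function names and the two short example sequences are hypothetical, and positions with gaps or ambiguous bases are skipped.

```python
from collections import Counter
from typing import Dict, Tuple

PURINES = {"A", "G"}
PYRIMIDINES = {"C", "U"}


def base_frequencies(seq: str) -> Dict[str, float]:
    """Return the percentage of U, C, A, and G in an RNA (or DNA) sequence."""
    rna = seq.upper().replace("T", "U")
    counts = Counter(b for b in rna if b in "ACGU")
    total = sum(counts.values())
    if total == 0:
        raise ValueError("sequence contains no A, C, G, or U/T bases")
    return {b: 100.0 * counts[b] / total for b in "UCAG"}


def ts_tv_counts(a: str, b: str) -> Tuple[int, int]:
    """Count transitions and transversions between two aligned sequences."""
    ts = tv = 0
    for x, y in zip(a.upper().replace("T", "U"), b.upper().replace("T", "U")):
        if x == y or x not in "ACGU" or y not in "ACGU":
            continue  # identical site, gap, or ambiguous base
        pair = {x, y}
        # A transition stays within purines or within pyrimidines.
        if pair <= PURINES or pair <= PYRIMIDINES:
            ts += 1
        else:
            tv += 1
    return ts, tv


# Hypothetical aligned fragments, used only to exercise the functions.
seq_a = "AUGGCUAC"
seq_b = "AUAGCCAG"
freqs = base_frequencies(seq_a)
ts, tv = ts_tv_counts(seq_a, seq_b)
```

The ratio `ts / tv` from such raw counts is only a crude descriptive statistic; model-based estimates such as those reported above additionally correct for multiple substitutions at the same site.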

A DNA Sequence Alignment Algorithm Using Quality Information and a Fuzzy Inference Method (품질 정보와 퍼지 추론 기법을 이용한 DNA 염기 서열 배치 알고리즘)

  • Kim, Kwang-Baek
    • Journal of Intelligence and Information Systems / v.13 no.2 / pp.55-68 / 2007
  • DNA sequence alignment algorithms in computational molecular biology have been improved by diverse methods. In this paper, we propose a DNA sequence alignment algorithm that combines quality information with a fuzzy inference method, using the characteristics of DNA sequence fragments and a fuzzy logic system to improve conventional alignment methods based on DNA sequence quality information. Conventional algorithms calculate alignment scores with the Needleman-Wunsch global sequence alignment algorithm, applying the quality information of each DNA fragment. However, because the overall sequence quality information is used, errors can arise in the score calculation when the fragment ends are of low quality. The proposed method achieves accurate alignment even when fragment ends are of low quality by improving the conventional quality-based algorithms. In addition, the mapping score parameters used to calculate alignment scores are adjusted dynamically by the fuzzy logic system according to the lengths of the DNA fragments and the frequencies of low-quality bases in them. Experiments on real genome data from NCBI (National Center for Biotechnology Information) showed that the proposed method was more efficient than conventional quality-based algorithms for DNA sequence alignment.
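
For orientation, a Needleman-Wunsch global alignment score with a simple per-base quality weight can be sketched as below. This is an assumption-laden illustration rather than the algorithm proposed in the paper: the scoring values, the rule of weighting each match or mismatch by the lower of the two base qualities (scaled to the range 0 to 1), and the function name are all hypothetical, and no fuzzy inference is included.

```python
from typing import Optional, Sequence


def needleman_wunsch_score(
    a: str,
    b: str,
    match: float = 1.0,
    mismatch: float = -1.0,
    gap: float = -2.0,
    qual_a: Optional[Sequence[float]] = None,
    qual_b: Optional[Sequence[float]] = None,
) -> float:
    """Global alignment score of a and b with a linear gap penalty.

    If both quality vectors are given (values in [0, 1], one per base),
    each match/mismatch score is scaled by the lower of the two qualities.
    """
    n, m = len(a), len(b)
    prev = [j * gap for j in range(m + 1)]  # row 0: b aligned against gaps only
    for i in range(1, n + 1):
        curr = [i * gap] + [0.0] * m
        for j in range(1, m + 1):
            base = match if a[i - 1] == b[j - 1] else mismatch
            weight = 1.0
            if qual_a is not None and qual_b is not None:
                weight = min(qual_a[i - 1], qual_b[j - 1])
            curr[j] = max(
                prev[j - 1] + base * weight,  # align a[i-1] with b[j-1]
                prev[j] + gap,                # gap in b
                curr[j - 1] + gap,            # gap in a
            )
        prev = curr
    return prev[m]


identical = needleman_wunsch_score("GATTACA", "GATTACA")
one_gap = needleman_wunsch_score("GATTACA", "GATACA")
# A low-quality final base softens the penalty for the terminal mismatch.
low_quality_end = needleman_wunsch_score(
    "ACGT", "ACGA", qual_a=[1.0, 1.0, 1.0, 0.2], qual_b=[1.0, 1.0, 1.0, 0.2]
)
```

Down-weighting low-quality positions is one way to keep unreliable fragment ends from dominating the score; the paper instead adjusts its score parameters dynamically through a fuzzy logic system.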


A Comparative Analysis of Bioinformation Website Services (생명정보 분야 웹사이트 서비스에 대한 비교.분석에 관한 연구)

  • Ahn, Bu-Young;Lee, Eung-Bong
    • Journal of Information Management / v.40 no.1 / pp.157-181 / 2009
  • As information technology has evolved and the Human Genome Project has been completed, bioinformatics, the integration of biological science and information technology, has emerged and continues to advance. Alongside the growth of bioinformatics, websites dealing with bioinformation have been set up to provide relevant information to bioscientists. Among the numerous websites worldwide, those preferred by the majority of domestic bioscientists are BRIC (Biological Research Information Center) of POSTECH (Pohang University of Science and Technology) in Korea, CCBB (Center for Computational Biology and Bioinformatics) of KISTI (Korea Institute of Science and Technology Information), KOBIC (Korean Bioinformation Center) of KRIBB (Korea Research Institute of Bioscience and Biotechnology), NCBI (National Center for Biotechnology Information) in the USA, EBI (European Bioinformatics Institute) in Europe, and DDBJ (DNA Data Bank of Japan) in Japan. In this paper, we carried out a comparative analysis by investigating the content status and functions of these six websites. In addition, we conducted a questionnaire survey of bioscience researchers on how they use the six websites and what they need from them.

KUGI: A Database and Search System for Korean Unigene and Pathway Information

  • Yang, Jin-Ok;Hahn, Yoon-Soo;Kim, Nam-Soon;Yu, Ung-Sik;Woo, Hyun-Goo;Chu, In-Sun;Kim, Yong-Sung;Yoo, Hyang-Sook;Kim, Sang-Soo
    • Proceedings of the Korean Society for Bioinformatics Conference / 2005.09a / pp.407-411 / 2005
  • The KUGI (Korean UniGene Information) database contains annotation information for cDNA sequences obtained from samples of diseases prevalent in Koreans. A total of about 157,000 high-throughput 5'-EST sequences collected from cDNA libraries of stomach, liver, and some cancer tissues or established cell lines from Korean patients were clustered into about 35,000 contigs. From each cluster, a representative clone having the longest high-quality sequence or the start codon was selected. We stored the sequences of the representative clones and the clustered contigs in the KUGI database, together with the information obtained by running BLAST against the RefSeq, human mRNA, and UniGene databases from NCBI. We provide a web-based search engine for the KUGI database with two types of user interface: attribute-based search and sequence similarity search. Attribute-based search uses DBMS technology, while similarity search uses BLAST, which supports various similarity search options. The search system allows not only multiple queries but also various query types. The results include: 1) information on clones and libraries; 2) accession keys, genome location, gene ontology, and pathway links to public databases; 3) links to external programs; and 4) sequence information for contigs and the 5'-ends of clones. We believe that the KUGI database and search system may provide very useful information for studies elucidating the causes of diseases that are prevalent in Koreans.
