• Title/Summary/Keyword: Bioinformatics Software

Search Result 127, Processing Time 0.02 seconds

The effective model of the human Acetyl-CoA Carboxylase inhibition by aromatic-structure inhibitors

  • Minh, Nguyen Truong Cong;Thanh, Bui Tho;Truong, Le Xuan;Suong, Nguyen Thi Bang;Thao, Le Thi Xuan
    • Journal of IKEEE
    • /
    • v.21 no.3
    • /
    • pp.309-319
    • /
    • 2017
  • The research investigates the inhibition of fatty acid biosynthesis of the human Acetyl-CoA Carboxylase enzyme by the aromatic-structure inhibitors (also known as ligands) containing variables of substituents, contributing an important role in the treatment of fatty-acid metabolic syndrome expressed by the group of cardiovascular risk factors increasing the incidence of coronary heart disease and type-2 diabetes. The effective interoperability between ligand and enzyme is characterized by a 50% concentration of enzyme inhibitor ($IC_{50}$) which was determined by experiment, and the factor of geometry structure of the ligands which are modeled by quantum mechanical methods using HyperChem 8.0.10 and Gaussian 09W softwares, combining with the calculation of quantum chemical and chemico-physical structural parameters using HyperChem 8.0.10 and Padel Descriptor 2.21 softwares. The result data are processed with the combination of classical statistical methods and modern bioinformatics methods using the statistical softwares of Department of Pharmaceutical Technology - Jadavpur University - India and R v3.3.1 software in order to accomplish a model of the quantitative structure - activity relationship between aromatic-structure ligands inhibiting fatty acid biosynthesis of the human Acetyl-CoA Carboxylase.

Pathway Retrieval for Transcriptome Analysis using Fuzzy Filtering Technique andWeb Service

  • Lee, Kyung-Mi;Lee, Keon-Myung
    • International Journal of Fuzzy Logic and Intelligent Systems
    • /
    • v.12 no.2
    • /
    • pp.167-172
    • /
    • 2012
  • In biology the advent of the high-throughput technology for sequencing, probing, or screening has produced huge volume of data which could not be manually handled. Biologists have resorted to software tools in order to effectively handle them. This paper introduces a bioinformatics tool to help biologists find potentially interesting pathway maps from a transcriptome data set in which the expression levels of genes are described for both case and control samples. The tool accepts a transcriptome data set, and then selects and categorizes some of genes into four classes using a fuzzy filtering technique where classes are defined by membership functions. It collects and edits the pathway maps related to those selected genes without analyst' intervention. It invokes a sequence of web service functions from KEGG, which an online pathway database system, in order to retrieve related information, locate pathway maps, and manipulate them. It maintains all retrieved pathway maps in a local database and presents them to the analysts with graphical user interface. The tool has been successfully used in identifying target genes for further analysis in transcriptome study of human cytomegalovirous. The tool is very helpful in that it can considerably save analysts' time and efforts by collecting and presenting the pathway maps that contain some interesting genes, once a transcriptome data set is just given.

EST Analysis system for panning gene

  • Hur, Cheol-Goo;Lim, So-Hyung;Goh, Sung-Ho;Shin, Min-Su;Cho, Hwan-Gue
    • Proceedings of the Korean Society for Bioinformatics Conference
    • /
    • 2000.11a
    • /
    • pp.21-22
    • /
    • 2000
  • Expressed sequence tags (EFTs) are the partial segments of cDNA produced from 5 or 3 single-pass sequencing of cDNA clones, error-prone and generated in highly redundant sets. Advancement and expansion of Genomics made biologists to generate huge amount of ESTs from variety of organisms-human, microorganisms as well as plants, and the cumulated number of ESTs is over 5.3 million, As the EST data being accumulate more rapidly, it becomes bigger that the needs of the EST analysis tools for extraction of biological meaning from EST data. Among the several needs of EST analyses, the extraction of protein sequence or functional motifs from ESTs are important for the identification of their function in vivo. To accomplish that purpose the precise and accurate identification of the region where the coding sequences (CDSs) is a crucial problem to solve primarily, and it will be helpful to extract and detect of genuine CD5s and protein motifs from EST collections. Although several public tools are available for EST analysis, there is not any one to accomplish the object. Furthermore, they are not targeted to the plant ESTs but human or microorganism. Thus, to correspond the urgent needs of collaborators deals with plant ESTs and to establish the analysis system to be used as general-purpose public software we constructed the pipelined-EST analysis system by integration of public software components. The software we used are as follows - Phred/Cross-match for the quality control and vector screening, NCBI Blast for the similarity searching, ICATools for the EST clustering, Phrap for EST contig assembly, and BLOCKS/Prosite for protein motif searching. The sample data set used for the construction and verification of this system was 1,386 ESTs from human intrathymic T-cells that verified using UniGene and Nr database of NCBI. The approach for the extraction of CDSs from sample data set was carried out by comparison between sample data and protein sequences/motif database, determining matched protein sequences/motifs that agree with our defined parameters, and extracting the regions that shows similarities. In recent future, in addition to these components, it is supposed to be also integrated into our system and served that the software for the peptide mass spectrometry fingerprint analysis, one of the proteomics fields. This pipelined-EST analysis system will extend our knowledge on the plant ESTs and proteins by identification of unknown-genes.

  • PDF

An Easy-to-Use Three-Dimensional Molecular Visualization and Analysis Program: POSMOL

  • Lee, Sang-Joo;Chung, Hae-Yong;Kim, Kwang S.
    • Bulletin of the Korean Chemical Society
    • /
    • v.25 no.7
    • /
    • pp.1061-1064
    • /
    • 2004
  • Molecular visualization software has the common objective of manipulation and interpretation of data from numerical simulations. They visualize many complicated molecular structures with personal computer and workstation, to help analyze a large quantity of data produced by various computational methods. However, users are often discouraged from using these tools for visualization and analysis due to the difficult and complicated user interface. In this regard, we have developed an easy-to-use three-dimensional molecular visualization and analysis program named POSMOL. This has been developed on the Microsoft Windows platform for the easy and convenient user environment, as a compact program which reads outputs from various computational chemistry software without editing or changing data. The program animates vibration modes which are needed for locating minima and transition states in computational chemistry, draws two and three dimensional (2D and 3D) views of molecular orbitals (including their atomic orbital components and these partial sums) together with molecular systems, measures various geometrical parameters, and edits molecules and molecular structures.

Towards Effective Analysis and Tracking of Mozilla and Eclipse Defects using Machine Learning Models based on Bugs Data

  • Hassan, Zohaib;Iqbal, Naeem;Zaman, Abnash
    • Soft Computing and Machine Intelligence
    • /
    • v.1 no.1
    • /
    • pp.1-10
    • /
    • 2021
  • Analysis and Tracking of bug reports is a challenging field in software repositories mining. It is one of the fundamental ways to explores a large amount of data acquired from defect tracking systems to discover patterns and valuable knowledge about the process of bug triaging. Furthermore, bug data is publically accessible and available of the following systems, such as Bugzilla and JIRA. Moreover, with robust machine learning (ML) techniques, it is quite possible to process and analyze a massive amount of data for extracting underlying patterns, knowledge, and insights. Therefore, it is an interesting area to propose innovative and robust solutions to analyze and track bug reports originating from different open source projects, including Mozilla and Eclipse. This research study presents an ML-based classification model to analyze and track bug defects for enhancing software engineering management (SEM) processes. In this work, Artificial Neural Network (ANN) and Naive Bayesian (NB) classifiers are implemented using open-source bug datasets, such as Mozilla and Eclipse. Furthermore, different evaluation measures are employed to analyze and evaluate the experimental results. Moreover, a comparative analysis is given to compare the experimental results of ANN with NB. The experimental results indicate that the ANN achieved high accuracy compared to the NB. The proposed research study will enhance SEM processes and contribute to the body of knowledge of the data mining field.

Imputation Accuracy from 770K SNP Chips to Next Generation Sequencing Data in a Hanwoo (Korean Native Cattle) Population using Minimac3 and Beagle (Minimac3와 Beagle 프로그램을 이용한 한우 770K chip 데이터에서 차세대 염기서열분석 데이터로의 결측치 대치의 정확도 분석)

  • An, Na-Rae;Son, Ju-Hwan;Park, Jong-Eun;Chai, Han-Ha;Jang, Gul-Won;Lim, Dajeong
    • Journal of Life Science
    • /
    • v.28 no.11
    • /
    • pp.1255-1261
    • /
    • 2018
  • Whole genome analysis have been made possible with the development of DNA sequencing technologies and discovery of many single nucleotide polymorphisms (SNPs). Large number of SNP can be analyzed with SNP chips, since SNPs of human as well as livestock genomes are available. Among the various missing nucleotide imputation programs, Minimac3 software is suggested to be highly accurate, with a simplified workflow and relatively fast. In the present study, we used Minimac3 program to perform genomic missing value substitution 1,226 animals 770K SNP chip and imputing missing SNPs with next generation sequencing data from 311 animals. The accuracy on each chromosome was about 94~96%, and individual sample accuracy was about 92~98%. After imputation of the genotypes, SNPs with R Square ($R^2$) values for three conditions were 0.4, 0.6, and 0.8 and the percentage of SNPs were 91%, 84%, and 70% respectively. The differences in the Minor Allele Frequency gave $R^2$ values corresponding to seven intervals (0, 0.025), (0.025, 0.05), (0.05, 0.1), (0.1, 0.2), (0.2, 0.3). (0.3, 0.4) and (0.4, 0.5) of 64~88%. The total analysis time was about 12 hr. In future SNP chip studies, as the size and complexity of the genomic datasets increase, we expect that genomic imputation using Minimac3 can improve the reliability of chip data for Hanwoo discrimination.

A Combined Approach for Locating Box H/ACA snoRNAs in the Human Genome

  • Eo, Hae Seok;Jo, Kwang Sun;Lee, Seung Won;Kim, Chang-Bae;Kim, Won
    • Molecules and Cells
    • /
    • v.20 no.1
    • /
    • pp.35-42
    • /
    • 2005
  • A novel combined method for locating box H/ACA small nucleolar RNAs (snoRNAs) is described, together with a software tool. The method adopts both a probabilistic hidden Markov model (HMM) and a minimum free energy (MFE) rule, and filters possible candidate box H/ACA snoRNAs obtained from genomic DNA sequences. With our novel method 12 known box H/ACA snoRNAs, and one strong candidate were identified in 30 nucleolar protein genomic sequences.

A Computer-aided Design Tool with Semiautomatic Image-Processing Features for Visualizing Biological Pathways

  • Ham, Sung-Il;Yang, San-Duk;Thong, Chin-Ting;Park, Hyun-Seok
    • Genomics & Informatics
    • /
    • v.7 no.3
    • /
    • pp.168-170
    • /
    • 2009
  • The explosion in biological data resulting from high-throughput experiments requires new software tools to manipulate and display pathways in a way that can integrate disparate sources of information. A visual Java-based CAD tool for drawing and annotating biological pathways with semiautomatic image-processing features is described in this paper. The result of the image-editing process is an XML file for the appropriate links. This tool integrates the pathway images and XML file sources. The system has facilities for linking graphical objects to external databases and is capable of reproducing existing visual representations of pathway maps.

Bioinformatic Suggestions on MiSeq-Based Microbial Community Analysis

  • Unno, Tatsuya
    • Journal of Microbiology and Biotechnology
    • /
    • v.25 no.6
    • /
    • pp.765-770
    • /
    • 2015
  • Recent sequencing technology development has revolutionized fields of microbial ecology. MiSeq-based microbial community analysis allows us to sequence more than a few hundred samples at a time, which is far more cost-effective than pyrosequencing. The approach, however, has not been preferably used owing to computational difficulties of processing huge amounts of data as well as known Illumina-derived artefact problems with amplicon sequencing. The choice of assembly software to take advantage of paired-end sequencing and methods to remove Illumina artefacts sequences are discussed. The protocol we suggest not only removed erroneous reads, but also dramatically reduced computational workload, which allows even a typical desktop computer to process a huge amount of sequence data generated with Illumina sequencers. We also developed a Web interface (http://biotech.jejunu.ac.kr/ ~abl/16s/) that allows users to conduct fastq-merging and mothur batch creation. The study presented here should provide technical advantages and supports in applying MiSeq-based microbial community analysis.

Bioinformatics Interpretation of Exome Sequencing: Blood Cancer

  • Kim, Jiwoong;Lee, Yun-Gyeong;Kim, Namshin
    • Genomics & Informatics
    • /
    • v.11 no.1
    • /
    • pp.24-33
    • /
    • 2013
  • We had analyzed 10 exome sequencing data and single nucleotide polymorphism chips for blood cancer provided by the PGM21 (The National Project for Personalized Genomic Medicine) Award program. We had removed sample G06 because the pair is not correct and G10 because of possible contamination. In-house software somatic copy-number and heterozygosity alteration estimation (SCHALE) was used to detect one loss of heterozygosity region in G05. We had discovered 27 functionally important mutations. Network and pathway analyses gave us clues that NPM1, GATA2, and CEBPA were major driver genes. By comparing with previous somatic mutation profiles, we had concluded that the provided data originated from acute myeloid leukemia. Protein structure modeling showed that somatic mutations in IDH2, RASGEF1B, and MSH4 can affect protein structures.