• Title/Summary/Keyword: visualization of genome information

Search Result 36, Processing Time 0.023 seconds

GTVseq: A Web-based Genotyping Tool for Viral Sequences

  • Shin, Jae-Min;Park, Ho-Eun;Ahn, Yong-Ju;Cho, Doo-Ho;Kim, Ji-Han;Kee, Mee-Kyung;Kim, Sung-Soon;Lee, Joo-Shil;Kim, Sang-Soo
    • Genomics & Informatics
    • /
    • v.6 no.1
    • /
    • pp.54-58
    • /
    • 2008
  • Genotyping Tool for Viral SEQuences (GTVseq) provides scientists with the genotype information on the viral genome sequences including HIV-1, HIV-2, HBV, HCV, HTLV-1, HTLV-2, poliovirus, enterovirus, flavivirus, Hantavirus, and rotavirus. GTVseq produces alternative and additive genotype information for the query viral sequences based on two different, but related, scoring methods. The genotype information produced is reported in a graphical manner for the reference genotype matches and each graphical output is linked to the detailed sequence alignments between the query and the matched reference sequences. GTVseq also reports the potential 'repeats' and/or 'recombination' sequence region in a separated window. GTVseq does not replace completely other well-known genotyping tools such as NCBI's virus sequence genotyping tool (http://www.ncbi. nlm.nih.gov/projects/genotyping/formpage.cgi), but provides additional information useful in the confirmation or for further investigation of the genotype(s) for the newly isolated viral sequences.

EvoSNP-DB: A database of genetic diversity in East Asian populations

  • Kim, Young Uk;Kim, Young Jin;Lee, Jong-Young;Park, Kiejung
    • BMB Reports
    • /
    • v.46 no.8
    • /
    • pp.416-421
    • /
    • 2013
  • Genome-wide association studies (GWAS) have become popular as an approach for the identification of large numbers of phenotype-associated variants. However, differences in genetic architecture and environmental factors mean that the effect of variants can vary across populations. Understanding population genetic diversity is valuable for the investigation of possible population specific and independent effects of variants. EvoSNP-DB aims to provide information regarding genetic diversity among East Asian populations, including Chinese, Japanese, and Korean. Non-redundant SNPs (1.6 million) were genotyped in 54 Korean trios (162 samples) and were compared with 4 million SNPs from HapMap phase II populations. EvoSNP-DB provides two user interfaces for data query and visualization, and integrates scores of genetic diversity (Fst and VarLD) at the level of SNPs, genes, and chromosome regions. EvoSNP-DB is a web-based application that allows users to navigate and visualize measurements of population genetic differences in an interactive manner, and is available online at [http://biomi.cdc.go.kr/EvoSNP/].

ChroView: A Trace Viewer for Browsing and Editing Chromatogram files

  • Tae, Hong-Seok;Kong, Eun-Bae;Park, Kie-Jung
    • Genomics & Informatics
    • /
    • v.5 no.1
    • /
    • pp.30-31
    • /
    • 2007
  • Many visualization tools have been designed to aid information processing during whole genome projects. We have developed a trace viewer program, ChroView, which can read a chromatogram file and display the chromatogram traces of the four bases. The program can be used to examine sequencing quality and base-calling errors. It can also help researchers to edit and save base-calling results while browsing the traces. Additionally, this program has a basecalling feature which can produce supplementary data for validation of the results from other base-calling programs.

MP-Lasso chart: a multi-level polar chart for visualizing group Lasso analysis of genomic data

  • Min Song;Minhyuk Lee;Taesung Park;Mira Park
    • Genomics & Informatics
    • /
    • v.20 no.4
    • /
    • pp.48.1-48.7
    • /
    • 2022
  • Penalized regression has been widely used in genome-wide association studies for joint analyses to find genetic associations. Among penalized regression models, the least absolute shrinkage and selection operator (Lasso) method effectively removes some coefficients from the model by shrinking them to zero. To handle group structures, such as genes and pathways, several modified Lasso penalties have been proposed, including group Lasso and sparse group Lasso. Group Lasso ensures sparsity at the level of pre-defined groups, eliminating unimportant groups. Sparse group Lasso performs group selection as in group Lasso, but also performs individual selection as in Lasso. While these sparse methods are useful in high-dimensional genetic studies, interpreting the results with many groups and coefficients is not straightforward. Lasso's results are often expressed as trace plots of regression coefficients. However, few studies have explored the systematic visualization of group information. In this study, we propose a multi-level polar Lasso (MP-Lasso) chart, which can effectively represent the results from group Lasso and sparse group Lasso analyses. An R package to draw MP-Lasso charts was developed. Through a real-world genetic data application, we demonstrated that our MP-Lasso chart package effectively visualizes the results of Lasso, group Lasso, and sparse group Lasso.

The AB05 NIAB Tools Workbench for Building Automatic Biopathway Maps for Agricultural Organisms

  • Cho, Mi-Kyung;Yoon, Kyung-Oh;Park, Hyun-Seok
    • Genomics & Informatics
    • /
    • v.5 no.4
    • /
    • pp.200-202
    • /
    • 2007
  • For the past several years, we have built various tools for automatic construction of biopathways to help biological experts, especially in the field of agriculture. We integrated several systems for constructing web applications for analyzing biological pathway information for agricultural species, constructing optimized pathway maps. In addition to building web applications for agricultural pathway information, we developed several stand-alone software tools, which are publicly downloadable under proper license agreements.

Perspectives on Clinical Informatics: Integrating Large-Scale Clinical, Genomic, and Health Information for Clinical Care

  • Choi, In Young;Kim, Tae-Min;Kim, Myung Shin;Mun, Seong K.;Chung, Yeun-Jun
    • Genomics & Informatics
    • /
    • v.11 no.4
    • /
    • pp.186-190
    • /
    • 2013
  • The advances in electronic medical records (EMRs) and bioinformatics (BI) represent two significant trends in healthcare. The widespread adoption of EMR systems and the completion of the Human Genome Project developed the technologies for data acquisition, analysis, and visualization in two different domains. The massive amount of data from both clinical and biology domains is expected to provide personalized, preventive, and predictive healthcare services in the near future. The integrated use of EMR and BI data needs to consider four key informatics areas: data modeling, analytics, standardization, and privacy. Bioclinical data warehouses integrating heterogeneous patient-related clinical or omics data should be considered. The representative standardization effort by the Clinical Bioinformatics Ontology (CBO) aims to provide uniquely identified concepts to include molecular pathology terminologies. Since individual genome data are easily used to predict current and future health status, different safeguards to ensure confidentiality should be considered. In this paper, we focused on the informatics aspects of integrating the EMR community and BI community by identifying opportunities, challenges, and approaches to provide the best possible care service for our patients and the population.

Development of Whole Genome Visualization Component Ware for Evolutionary Genetic Analysis (유전자 진화 분석을 위한 전유전체 가시화 Component Ware 개발)

  • Cho, Chi-Young;Park, Soo-Hyun;Kim, Dae-Soo;Ha, Hong-Seok;Ahn, Kung;Kim, Heui-Soo;Cho, Hwan-Gue
    • Proceedings of the Korean Information Science Society Conference
    • /
    • 2008.06c
    • /
    • pp.198-202
    • /
    • 2008
  • 많은 양의 유전자 정보가 유전공학의 발전과 Genome 프로젝트의 결과로 축적되고 있으며, 이러한 유전자 정보를 체계적으로 관리하고 가시화하기 위한 생물정보학 분야의 연구가 진행되고 있다. 기존의 많은 Genome Browser들이 완성된 형태의 툴로 서비스되고 있다. 이러한 툴들은 다목적의 많은 기능을 포함하고 있어 특정연구를 진행해야하는 연구자들은 너무 많은 정보로부터 원하는 것을 찾기 위해 시간과 노력이 필요하게 된다. 본 논문에서는 특정한 목적의 Gene 가시화 툴을 제작할 수 있는 Component Ware를 제안하고 이를 이용한 진화분석용 가시화 툴을 소개한다.

  • PDF

Integration of the PubAnnotation ecosystem in the development of a web-based search tool for alternative methods

  • Neves, Mariana
    • Genomics & Informatics
    • /
    • v.18 no.2
    • /
    • pp.18.1-18.5
    • /
    • 2020
  • Finding publications that propose alternative methods to animal experiments is an important but time-consuming task since researchers need to perform various queries to literature databases and screen many articles to assess two important aspects: the relevance of the article to the research question, and whether the article's proposed approach qualifies to being an alternative method. We are currently developing a Web application to support finding alternative methods to animal experiments. The current (under development) version of the application utilizes external tools and resources for document processing, and relies on the PubAnnotation ecosystem for annotation querying, annotation storage, dictionary-based tagging of cell lines, and annotation visualization. Currently, our two PubAnnotation repositories for discourse elements contain annotations for more than 110k PubMed documents. Further, we created an annotator for cell lines that contain more than 196k terms from Cellosaurus. Finally, we are experimenting with TextAE for annotation visualization and for user feedback.

DNA Sequence Visualization with k-convex Hull (k-convex hull을 이용한 DNA 염기 배열의 가시화)

  • Kim, Min Ah;Lee, Eun Jeong;Cho, Hwan Gyu
    • Journal of the Korea Computer Graphics Society
    • /
    • v.2 no.2
    • /
    • pp.61-68
    • /
    • 1996
  • In this paper we propose a new visualization technique to characterize qualitative information of a large DNA sequence. While a long DNA sequence has huge information, it is not easy to obtain genetic information from the DNA sequence. We transform DNA sequences into a polygon to compute their homology in image domain rather than text domain. Our program visualizes DNA sequences with colored random walk plots and simplify them k-convex hulls. A random walk plot represents DNA sequence as a curve in a plane. A k-convex hull simplifies a random work plot by removing some parts of its insignificant information. This technique gives a biologist an insight to detect and classify DNA sequences with easy. Experiments with real genome data proves our approach gives a good visual forms for long DNA sequences for homology analysis.

  • PDF

Level Scale Interface Design for Real-Time Visualizing Large-Scale Data (대용량 자료 실시간 시각화를 위한 레벨 수준 표현 인터페이스 설계)

  • Lee, Do-Hoon
    • Journal of the Korea Society of Computer and Information
    • /
    • v.13 no.2
    • /
    • pp.105-111
    • /
    • 2008
  • Various visualizing methods have been proposed according to the input and output types. To show complex and large-scale raw data and information. LOD and special region scale method have been used for them. In this paper, I propose level scale interface for dynamic and interactive controlling large scale data such as bio-data. The method has not only advantage of LOD and special region scale but also dynamic and real-time processing. In addition, the method supports elaborate control from large scale to small one for visualization on a region in detail. Proposed method was adopted for genome relationship visualization tool and showed reasonable control method.

  • PDF