• Title/Summary/Keyword: phenotype data

Search Result 216, Processing Time 0.029 seconds

MAP: Mutation Arranger for Defining Phenotype-Related Single-Nucleotide Variant

  • Baek, In-Pyo;Jeong, Yong-Bok;Jung, Seung-Hyun;Chung, Yeun-Jun
    • Genomics & Informatics
    • /
    • v.12 no.4
    • /
    • pp.289-292
    • /
    • 2014
  • Next-generation sequencing (NGS) is widely used to identify the causative mutations underlying diverse human diseases, including cancers, which can be useful for discovering the diagnostic and therapeutic targets. Currently, a number of single-nucleotide variant (SNV)-calling algorithms are available; however, there is no tool for visualizing the recurrent and phenotype-specific mutations for general researchers. In this study, in order to support defining the recurrent mutations or phenotype-specific mutations from NGS data of a group of cancers with diverse phenotypes, we aimed to develop a user-friendly tool, named mutation arranger for defining phenotype-related SNV (MAP). MAP is a user-friendly program with multiple functions that supports the determination of recurrent or phenotype-specific mutations and provides graphic illustration images to the users. Its operation environment, the Microsoft Windows environment, enables more researchers who cannot operate Linux to define clinically meaningful mutations with NGS data from cancer cohorts.

Implementing a Web-based Seed Phenotype Trait Visualization Support System (웹 기반 종자 표현체 특성 가시화 지원시스템 구현)

  • Yang, OhSeok;Choi, SangMin;Seo, DongWoo;Choi, SeungHo;Kim, YoungUk;Lee, ChangWoo;Lee, EunGyeong;Baek, JeongHo;Kim, KyungHwan;Lee, HongRo
    • Journal of Korea Society of Industrial Information Systems
    • /
    • v.25 no.5
    • /
    • pp.83-90
    • /
    • 2020
  • In this paper, a web-based seed phenotype visualization support system is proposed to extract and visualize data such as the surface color, length, area, perimeter and compactness of seed, which is phenotype information from the image of soybean/rice seeds. This system systematically stores data extracted from seeds in databases, and provides a web-based user interface that facilitates the analysis of data by researchers using data tables and charts. Conventional seed characteristic studies have been manually measured by humans, but the system developed in this paper allows researchers to simply upload seed images for analysis and obtain seed's numerical data after image processing. It is expected that the proposed system will be able to obtain time efficiency and remove spatial restriction, if it is used in seed characterization research, and it will be easy to analyze through systematic management of research results and visualization of the phenotype characteristics.

WTO, an ontology for wheat traits and phenotypes in scientific publications

  • Nedellec, Claire;Ibanescu, Liliana;Bossy, Robert;Sourdille, Pierre
    • Genomics & Informatics
    • /
    • v.18 no.2
    • /
    • pp.14.1-14.11
    • /
    • 2020
  • Phenotyping is a major issue for wheat agriculture to meet the challenges of adaptation of wheat varieties to climate change and chemical input reduction in crop. The need to improve the reuse of observations and experimental data has led to the creation of reference ontologies to standardize descriptions of phenotypes and to facilitate their comparison. The scientific literature is largely under-exploited, although extremely rich in phenotype descriptions associated with cultivars and genetic information. In this paper we propose the Wheat Trait Ontology (WTO) that is suitable for the extraction and management of scientific information from scientific papers, and its combination with data from genomic and experimental databases. We describe the principles of WTO construction and show examples of WTO use for the extraction and management of phenotype descriptions obtained from scientific documents.

Prediction and visualization of CYP2D6 genotype-based phenotype using clustering algorithms

  • Kim, Eun-Young;Shin, Sang-Goo;Shin, Jae-Gook
    • Translational and Clinical Pharmacology
    • /
    • v.25 no.3
    • /
    • pp.147-152
    • /
    • 2017
  • This study focused on the role of cytochrome P450 2D6 (CYP2D6) genotypes to predict phenotypes in the metabolism of dextromethorphan. CYP2D6 genotypes and metabolic ratios (MRs) of dextromethorphan were determined in 201 Koreans. Unsupervised clustering algorithms, hierarchical and k-means clustering analysis, and color visualizations of CYP2D6 activity were performed on a subset of 130 subjects. A total of 23 different genotypes were identified, five of which were observed in one subject. Phenotype classifications were based on the means, medians, and standard deviations of the log MR values for each genotype. Color visualization was used to display the mean and median of each genotype as different color intensities. Cutoff values were determined using receiver operating characteristic curves from the k-means analysis, and the data were validated in the remaining subset of 71 subjects. Using the two highest silhouette values, the selected numbers of clusters were three (the best) and four. The findings from the two clustering algorithms were similar to those of other studies, classifying $^*5/^*5$ as a lowest activity group and genotypes containing duplicated alleles (i.e., $CYP2D6^*1/^*2N$) as a highest activity group. The validation of the k-means clustering results with data from the 71 subjects revealed relatively high concordance rates: 92.8% and 73.9% in three and four clusters, respectively. Additionally, color visualization allowed for rapid interpretation of results. Although the clustering approach to predict CYP2D6 phenotype from CYP2D6 genotype is not fully complete, it provides general information about the genotype to phenotype relationship, including rare genotypes with only one subject.

Multi-block Analysis of Genomic Data Using Generalized Canonical Correlation Analysis

  • Jun, Inyoung;Choi, Wooree;Park, Mira
    • Genomics & Informatics
    • /
    • v.16 no.4
    • /
    • pp.33.1-33.9
    • /
    • 2018
  • Recently, there have been many studies in medicine related to genetic analysis. Many genetic studies have been performed to find genes associated with complex diseases. To find out how genes are related to disease, we need to understand not only the simple relationship of genotypes but also the way they are related to phenotype. Multi-block data, which is a summation form of variable sets, is used for enhancing the analysis of the relationships of different blocks. By identifying relationships through a multi-block data form, we can understand the association between the blocks in comprehending the correlation between them. Several statistical analysis methods have been developed to understand the relationship between multi-block data. In this paper, we will use generalized canonical correlation methodology to analyze multi-block data from the Korean Association Resource project, which has a combination of single nucleotide polymorphism blocks, phenotype blocks, and disease blocks.

Mouse phenogenomics, toolbox for functional annotation of human genome

  • Kim, Il-Yong;Shin, Jae-Hoon;Seong, Je-Kyung
    • BMB Reports
    • /
    • v.43 no.2
    • /
    • pp.79-90
    • /
    • 2010
  • Mouse models are crucial for the functional annotation of human genome. Gene modification techniques including gene targeting and gene trap in mouse have provided powerful tools in the form of genetically engineered mice (GEM) for understanding the molecular pathogenesis of human diseases. Several international consortium and programs are under way to deliver mutations in every gene in mouse genome. The information from studying these GEM can be shared through international collaboration. However, there are many limitations in utility because not all human genes are knocked out in mouse and they are not yet phenotypically characterized by standardized ways which is required for sharing and evaluating data from GEM. The recent improvement in mouse genetics has now moved the bottleneck in mouse functional genomics from the production of GEM to the systematic mouse phenotype analysis of GEM. Enhanced, reproducible and comprehensive mouse phenotype analysis has thus emerged as a prerequisite for effectively engaging the phenotyping bottleneck. In this review, current information on systematic mouse phenotype analysis and an issue-oriented perspective will be provided.

Diversity of Macrophomina phaseolina Based on Morphological and Genotypic Characteristics in Iran

  • Mahdizadeh, Valiollah;Safaie, Naser;Goltapeh, Ebrahim Mohammadi
    • The Plant Pathology Journal
    • /
    • v.27 no.2
    • /
    • pp.128-137
    • /
    • 2011
  • Fifty two Macrophomina phaseolina isolates were recovered from 24 host plant species through the 14 Iranian provinces. All isolates were confirmed to species using species-specific primers. The colony characteristics of each isolate were recorded, including chlorate phenotype, relative growth rate at $30^{\circ}C$ and $37^{\circ}C$, average size of microsclerotia, and time to microsclerotia formation. The feathery colony phenotype was the most common (63.7%) on the chlorate selective medium and represented the chlorate sensitive phenotype of the Iranian Macrophomina phaseolina population. Meantime, inter simple sequence repeats (ISSR) Markers were used to assess the genetic diversity of the fungus. Unweighted pair-group method using arithmetic means (UPGMA) clustering of data showed that isolates did not clearly differentiate to the specific group according to the host or geographical origins, however, usually the isolates from the same host or the same geographic origin tend to group nearly. Our results did not show a correlation between the genetic diversity based on the ISSR and phenotypic characteristics. Similar to the M. phaseolina populations in the other countries, the Iranian isolates were highly diverse based on the phenotypic and the genotypic characteristics investigated and needs more studies using neutral molecular tools to get a deeper insight into this complex species.

Simulation study on the estimation of multinomial proportions

  • Kim, Dae-Hak
    • Journal of the Korean Data and Information Science Society
    • /
    • v.23 no.2
    • /
    • pp.411-417
    • /
    • 2012
  • In this paper, we consider the estimation of multinomial proportions. Multinomial distribution is the most important multivaritate distribution. Estimation of multinomial parameters for multinomial distribution is widely applicable to many practical research areas including genetics. We investigated the properties of several frequency substitution estimates and derived the maximum likelihood estimate of multinomial proportions of Hardy Weinberg proportions. Phenotype and genotype frequencies of allele are used to the estimation of multinomial proportions. These estimates are then analyzed via numerical data. Small sample Monte Carlo simulation is conducted to compare considered estimates of multinomial proportions.

HisCoM-GGI: Software for Hierarchical Structural Component Analysis of Gene-Gene Interactions

  • Choi, Sungkyoung;Lee, Sungyoung;Park, Taesung
    • Genomics & Informatics
    • /
    • v.16 no.4
    • /
    • pp.38.1-38.3
    • /
    • 2018
  • Gene-gene interaction (GGI) analysis is known to play an important role in explaining missing heritability. Many previous studies have already proposed software to analyze GGI, but most methods focus on a binary phenotype in a case-control design. In this study, we developed "Hierarchical structural CoMponent analysis of Gene-Gene Interactions" (HisCoM-GGI) software for GGI analysis with a continuous phenotype. The HisCoM-GGI method considers hierarchical structural relationships between genes and single nucleotide polymorphisms (SNPs), enabling both gene-level and SNP-level interaction analysis in a single model. Furthermore, this software accepts various types of genomic data and supports data management and multithreading to improve the efficiency of genome-wide association study data analysis. We expect that HisCoM-GGI software will provide advanced accessibility to researchers in genetic interaction studies and a more effective way to understand biological mechanisms of complex diseases.

Detecting survival related gene sets in microarray analysis (마이크로어레이 자료에서 생존과 유의한 관련이 있는 유전자집단 검색)

  • Lee, Sun-Ho;Lee, Kwang-Hyun
    • Journal of the Korean Data and Information Science Society
    • /
    • v.23 no.1
    • /
    • pp.1-11
    • /
    • 2012
  • When the microarray experiment developed, main interest was limited to detect differentially expressed genes associated with a phenotype of interest. However, as human diseases are thought to occur through the interactions of multiple genes within a same functional category, the unit of analysis of the microarray experiment expanded to the set of genes. For the phenotype of censored survival time, Gene Set Enrichment Analysis(GSEA), Global test and Wald type test are widely used. In this paper, we modified the Wald type test by adopting normal score transformation of gene expression values and developed a parametric test which requires much less computation than others. The proposed method is compared with other methods using a real data set of ovarian cancer and a simulation data set.