• Title/Summary/Keyword: Graphical Data

Search Result 634, Processing Time 0.022 seconds

Comparison of model selection criteria in graphical LASSO (그래프 LASSO에서 모형선택기준의 비교)

  • Ahn, Hyeongseok;Park, Changyi
    • Journal of the Korean Data and Information Science Society
    • /
    • v.25 no.4
    • /
    • pp.881-891
    • /
    • 2014
  • Graphical models can be used as an intuitive tool for modeling a complex stochastic system with a large number of variables related each other because the conditional independence between random variables can be visualized as a network. Graphical least absolute shrinkage and selection operator (LASSO) is considered to be effective in avoiding overfitting in the estimation of Gaussian graphical models for high dimensional data. In this paper, we consider the model selection problem in graphical LASSO. Particularly, we compare various model selection criteria via simulations and analyze a real financial data set.

Graphical Models for DNA Microarray Data Mining

  • 양진산;장병탁
    • Proceedings of the Korean Society for Bioinformatics Conference
    • /
    • 2002.06a
    • /
    • pp.49-61
    • /
    • 2002
  • 현대적 실험방법 및 유전공학의 발전으로 최근 생물학적 자료는 비약적으로 늘어나고 있다. 이러한 자료의 기계학습을 이용한 분석방법은 많은 비용과 시간을 요구하는 전통적인 생물적 실험에 있어서 실험 시간을 단축시켜주고 실험비용을 줄여 주게 된다. 본 논문에서는 특별히 micro array data의 분석에 있어서 graphical model에 기반한 기계학습 방법들을 소개한다. 이중 GTM 은 특히 시각화 효과가 뛰어난 방법으로 Graphical model 에 기반한 GTM의 제반 특성을 소개하고 이를 yeast data의 분석에 적용시킨 결과를 자세히 알아보고자 한다. (**Presentation file을 수신 보관 중)

  • PDF

PROCESS ANALYSIS OF AUTOMOTIVE PARTS USING GRAPHICAL MODELLING

  • IRIKURA Norio;KUZUYA Kazuyoshi;NISHINA Ken
    • Proceedings of the Korean Society for Quality Management Conference
    • /
    • 1998.11a
    • /
    • pp.295-300
    • /
    • 1998
  • Recently graphical modelling is being studied as a useful process analysis tool for exploratory causal analysis. Graphical modelling is a presentation method that uses graphs to describe statistical models of the structures of multivariate data. This paper describes an application of this graphical modeling with two cases from the automotive parts industry. One case is the unbalance problem of the pulley, an automotive generator part. There is multivariate data of the product from each of the processes which are connected in the series. By means of exploratory causal analysis between the variables using graphical modeling, the key processes which causes the variation of the final characteristics and their mechanism of the causal relationship have become clear. Another case is, also, the unbalanced problem of automotive starter parts which consists of many parts and is manufactured by complex machinery and assembling process. By means of the similar technique, the key processes are obtained easily and the results are reasonable from technical knowledge.

  • PDF

An Analysis Method of Superlarge Manufacturing Process Data Using Data Cleaning and Graphical Analysis (데이터 정제와 그래프 분석을 이용한 대용량 공정데이터 분석 방법)

  • 박재홍;변재현
    • Journal of Korean Society for Quality Management
    • /
    • v.30 no.2
    • /
    • pp.72-85
    • /
    • 2002
  • Advances in computer and sensor technology have made it possible to obtain superlarge manufacturing process data in real time, letting us extract meaningful information from these superlarge data sets. We propose a systematic data analysis procedure which field engineers can apply easily to manufacture quality products. The procedure consists of data cleaning and data analysis stages. Data cleaning stage is to construct a database suitable for statistical analysis from the original superlarge manufacturing process data. In the data analysis stage, we suggest a graphical easy-to-implement approach to extract practical information from the cleaned database. This study will help manufacturing companies to achieve six sigma quality.

Introduction to S-PLUS and graphical comparison with SAS (S-PLUS의 소개 및 SAS 와의 그래픽 비교)

  • 김성수;한경수
    • The Korean Journal of Applied Statistics
    • /
    • v.6 no.1
    • /
    • pp.1-11
    • /
    • 1993
  • Statistical graphics have been important new tools for data analysis and many statistical softwares are exploiting graphical methods. Among these softwares available in personal computer at low cost, we intriduce S-PLUS(version 2.0). S-PLUS is an interactive graphical data analysis system and object-oriented programming language. SAS/GRAPH is another popular graphical system for displaying data in the form of color plots, charts, maps, and slides on screen and hardcopy devices. S-PLUS is compared to SAS/GRAPH(version 6.04) in viewpoints of statistical graphics.

  • PDF

Visualization of Path Expressions with Set Attributes and Methods in Graphical Object Query Languages (그래픽 객체 질의어에서 집합 속성과 메소드를 포함한 경로식의 시각화)

  • 조완섭
    • Journal of KIISE:Databases
    • /
    • v.30 no.2
    • /
    • pp.109-124
    • /
    • 2003
  • Although most commercial relational DBMSs Provide a graphical query language for the user friendly interfaces of the databases, few research has been done for graphical query languages in object databases. Expressing complex query conditions in a concise and intuitive way has been an important issue in the design of graphical query languages. Since the object data model and object query languages are more complex than those of the relational ones, the graphical object query language should have a concise and intuitive representation method. We propose a graphical object query language called GOQL (Graphical Object Query Language) for object databases. By employing simple graphical notations, advanced features of the object queries such as path expressions including set attributes, quantifiers, and/or methods can be represented in a simple graphical notation. GOQL has an excellent expressive power compared with previous graphical object query languages. We show that path expressions in XSQL(1,2) can be represented by the simple graphical notations in GOQL. We also propose an algorithm that translates a graphical query in GOQL into the textual object query with the same semantics. We finally describe implementation results of GOQL in the Internet environments.

A Study on Bias Effect on Model Selection Criteria in Graphical Lasso

  • Choi, Young-Geun;Jeong, Seyoung;Yu, Donghyeon
    • Quantitative Bio-Science
    • /
    • v.37 no.2
    • /
    • pp.133-141
    • /
    • 2018
  • Graphical lasso is one of the most popular methods to estimate a sparse precision matrix, which is an inverse of a covariance matrix. The objective function of graphical lasso imposes an ${\ell}_1$-penalty on the (vectorized) precision matrix, where a tuning parameter controls the strength of the penalization. The selection of the tuning parameter is practically and theoretically important since the performance of the estimation depends on an appropriate choice of tuning parameter. While information criteria (e.g. AIC, BIC, or extended BIC) have been widely used, they require an asymptotically unbiased estimator to select optimal tuning parameter. Thus, the biasedness of the ${\ell}_1$-regularized estimate in the graphical lasso may lead to a suboptimal tuning. In this paper, we propose a two-staged bias-correction procedure for the graphical lasso, where the first stage runs the usual graphical lasso and the second stage reruns the procedure with an additional constraint that zero estimates at the first stage remain zero. Our simulation and real data example show that the proposed bias correction improved on both edge recovery and estimation error compared to the single-staged graphical lasso.

Probabilistic Graphical Model for Transaction Data Analysis (트랜잭션 데이터 분석을 위한 확률 그래프 모형)

  • Ahn, Gil Seung;Hur, Sun
    • Journal of Korean Institute of Industrial Engineers
    • /
    • v.42 no.4
    • /
    • pp.249-255
    • /
    • 2016
  • Recently, transaction data is accumulated everywhere very rapidly. Association analysis methods are usually applied to analyze transaction data, but the methods have several problems. For example, these methods can only consider one-way relations among items and cannot reflect domain knowledge into analysis process. In order to overcome defect of association analysis methods, we suggest a transaction data analysis method based on probabilistic graphical model (PGM) in this study. The method we suggest has several advantages as compared with association analysis methods. For example, this method has a high flexibility, and can give a solution to various probability problems regarding the transaction data with relationships among items.

Graphical Methods for Evaluating the Degree of the Orthogonal Blocking

  • Jang, Dae-Heung
    • Journal of the Korean Data and Information Science Society
    • /
    • v.17 no.3
    • /
    • pp.669-675
    • /
    • 2006
  • When using response surface designs, the experimental trials should be carried out in blocks in case of heterogeneity of conditions. When we use nearly orthogonal blocking, we need evaluate the degree of orthogonal blocking. Graphical methods for evaluating the degree of orthogonal blocking are suggested.

  • PDF

Graphical exploratory data analysis for ball games in sports

  • Yi, Seongbaek;Jang, Dae-Heung
    • Journal of the Korean Data and Information Science Society
    • /
    • v.27 no.5
    • /
    • pp.1413-1421
    • /
    • 2016
  • In this paper graphical exploratory data analyses are proposed for ball games in sports. The plot of sequence of scoring points of each team can be used to see how the playing game has been processed until the end of each set or quarter. With the plot of sequential score differences through all the games we can see a dominance of each team and the times of score changes, i.e., turnovers. The ternary plots show the contours of scoring compositions for each player and enable us to compare the scoring patterns of each team if any. Using the score sequence plot we also can see the score pattern distribution of players. For demonstration we use the results of the gold medal match between Russia and Brazil for men's volleyball and between USA and Spain for men's basketball at the London 2012 Summer Olympics.