• 제목/요약/키워드: Bayesian Techniques

검색결과 167건 처리시간 0.025초

의학적 의사결정 지표의 고찰 및 해석에 기초한 품질통계기법의 적용 (Application of Quality Statistical Techniques Based on the Review and the Interpretation of Medical Decision Metrics)

  • 최성운
    • 대한안전경영과학회지
    • /
    • 제15권2호
    • /
    • pp.243-253
    • /
    • 2013
  • This research paper introduces the application and implementation of medical decision metrics that classifies medical decision-making into four different metrics using statistical diagnostic tools, such as confusion matrix, normal distribution, Bayesian prediction and Receiver Operating Curve(ROC). In this study, the metrics are developed based on cross-section study, cohort study and case-control study done by systematic literature review and reformulated the structure of type I error, type II error, confidence level and power of detection. The study proposed implementation strategies for 10 quality improvement activities via 14 medical decision metrics which consider specificity and sensitivity in terms of ${\alpha}$ and ${\beta}$. Examples of ROC implication are depicted in this paper with a useful guidelines to implement a continuous quality improvement, not only in a variable acceptance sampling in Quality Control(QC) but also in a supplier grading score chart in Supplier Chain Management(SCM) quality. This research paper is the first to apply and implement medical decision-making tools as quality improvement activities. These proposed models will help quality practitioners to enhance the process and product quality level.

Maximum a posteriori estimation based wind fragility analysis with application to existing linear or hysteretic shear frames

  • Wang, Vincent Z.;Ginger, John D.
    • Structural Engineering and Mechanics
    • /
    • 제50권5호
    • /
    • pp.653-664
    • /
    • 2014
  • Wind fragility analysis provides a quantitative instrument for delineating the safety performance of civil structures under hazardous wind loading conditions such as cyclones and tornados. It has attracted and would be expected to continue to attract intensive research spotlight particularly in the nowadays worldwide context of adapting to the changing climate. One of the challenges encumbering efficacious assessment of the safety performance of existing civil structures is the possible incompleteness of the structural appraisal data. Addressing the issue of the data missingness, the study presented in this paper forms a first attempt to investigate the feasibility of using the expectation-maximization (EM) algorithm and Bayesian techniques to predict the wind fragilities of existing civil structures. Numerical examples of typical linear or hysteretic shear frames are introduced with the wind loads derived from a widely used power spectral density function. Specifically, the application of the maximum a posteriori estimates of the distribution parameters for the story stiffness is examined, and a surrogate model is developed and applied to facilitate the nonlinear response computation when studying the fragilities of the hysteretic shear frame involved.

Towards Effective Analysis and Tracking of Mozilla and Eclipse Defects using Machine Learning Models based on Bugs Data

  • Hassan, Zohaib;Iqbal, Naeem;Zaman, Abnash
    • Soft Computing and Machine Intelligence
    • /
    • 제1권1호
    • /
    • pp.1-10
    • /
    • 2021
  • Analysis and Tracking of bug reports is a challenging field in software repositories mining. It is one of the fundamental ways to explores a large amount of data acquired from defect tracking systems to discover patterns and valuable knowledge about the process of bug triaging. Furthermore, bug data is publically accessible and available of the following systems, such as Bugzilla and JIRA. Moreover, with robust machine learning (ML) techniques, it is quite possible to process and analyze a massive amount of data for extracting underlying patterns, knowledge, and insights. Therefore, it is an interesting area to propose innovative and robust solutions to analyze and track bug reports originating from different open source projects, including Mozilla and Eclipse. This research study presents an ML-based classification model to analyze and track bug defects for enhancing software engineering management (SEM) processes. In this work, Artificial Neural Network (ANN) and Naive Bayesian (NB) classifiers are implemented using open-source bug datasets, such as Mozilla and Eclipse. Furthermore, different evaluation measures are employed to analyze and evaluate the experimental results. Moreover, a comparative analysis is given to compare the experimental results of ANN with NB. The experimental results indicate that the ANN achieved high accuracy compared to the NB. The proposed research study will enhance SEM processes and contribute to the body of knowledge of the data mining field.

The Impact of Institutional Quality on FDI Inflows: The Evidence from Capital Outflow of Asian Economies

  • LE, Anh Hoang;KIM, Taegi
    • The Journal of Asian Finance, Economics and Business
    • /
    • 제8권8호
    • /
    • pp.335-343
    • /
    • 2021
  • This paper investigates the effect of institutional quality on FDI inflows by using FDI outflows from Asian countries from 2009 to 2017. We used the FDI data from five major Asian economies, which are South Korea, China, Japan, Singapore, and Hong Kong. The gravity model was used to examine the effect of institutional quality on FDI flows. The regression model considers several independent variables, and we select the most appropriate variables by using the Bayesian Model Averaging (BMA) estimator. We have shown that foreign direct investment from Asian countries depends on the size of home and the partner countries, geographical distance, trade interaction between two countries, economic freedom, labor supply, tariff rate, and capacity of the government. The results of different estimation techniques emphasize that multinational enterprises prefer to invest in those countries which have a higher income, which shows the evidence for Lucas's paradox. The results also show that economic freedom and control of corruption have a positive impact on FDI inwards. The regression results show that better institutional quality in host countries encourages more FDIs from Asian economies. It suggests that the state should control corruption and create a free economic environment to attract FDIs.

Kennicutt-Schmidt law with H I velocity profile decomposition in NGC 6822

  • Park, Hye-Jin;Oh, Se-Heon;Wang, Jing;Zheng, Yun;Zhang, Hong-Xin;de Blok, W.J.G.
    • 천문학회보
    • /
    • 제46권1호
    • /
    • pp.32.3-33
    • /
    • 2021
  • We present H I gas kinematics and star formation activities of NGC 6822, a dwarf galaxy located in the Local Volume at a distance of ~ 490 kpc. We perform profile decomposition of the line-of-sight velocity profiles of the high-resolution (42.4" × 12" spatial; 1.6 km/s spectral) H I data cube taken with the Australia Telescope Compact Array (ATCA). For this, we use a new tool, the so-called BAYGAUD (BAYesian GAUssian Decompositor) which is based on Bayesian Markov Chain Monte Carlo (MCMC) techniques, allowing us to decompose a line-of-sight velocity profile into an optimal number of Gaussian components in a quantitative manner. We classify the decomposed H I gas components of NGC 6822 into bulk-narrow, bulk-broad, and non_bulk with respect to their velocity and velocity dispersion. We correlate their gas surface densities with the surface star formation rates derived using both GALEX far-ultraviolet and WISE 22 micron data to examine the impact of gas turbulence caused by stellar feedback on the Kennicutt-Schmidt (K-S) law. The bulk-narrow component that resides within r25 is likely to follow the linear extension of the Kennicutt-Schmidt (K-S) law for molecular hydrogen (H2) at the low gas surface density regime where H I is not saturated.

  • PDF

Predicting Stock Liquidity by Using Ensemble Data Mining Methods

  • Bae, Eun Chan;Lee, Kun Chang
    • 한국컴퓨터정보학회논문지
    • /
    • 제21권6호
    • /
    • pp.9-19
    • /
    • 2016
  • In finance literature, stock liquidity showing how stocks can be cashed out in the market has received rich attentions from both academicians and practitioners. The reasons are plenty. First, it is known that stock liquidity affects significantly asset pricing. Second, macroeconomic announcements influence liquidity in the stock market. Therefore, stock liquidity itself affects investors' decision and managers' decision as well. Though there exist a great deal of literature about stock liquidity in finance literature, it is quite clear that there are no studies attempting to investigate the stock liquidity issue as one of decision making problems. In finance literature, most of stock liquidity studies had dealt with limited views such as how much it influences stock price, which variables are associated with describing the stock liquidity significantly, etc. However, this paper posits that stock liquidity issue may become a serious decision-making problem, and then be handled by using data mining techniques to estimate its future extent with statistical validity. In this sense, we collected financial data set from a number of manufacturing companies listed in KRX (Korea Exchange) during the period of 2010 to 2013. The reason why we selected dataset from 2010 was to avoid the after-shocks of financial crisis that occurred in 2008. We used Fn-GuidPro system to gather total 5,700 financial data set. Stock liquidity measure was computed by the procedures proposed by Amihud (2002) which is known to show best metrics for showing relationship with daily return. We applied five data mining techniques (or classifiers) such as Bayesian network, support vector machine (SVM), decision tree, neural network, and ensemble method. Bayesian networks include GBN (General Bayesian Network), NBN (Naive BN), TAN (Tree Augmented NBN). Decision tree uses CART and C4.5. Regression result was used as a benchmarking performance. Ensemble method uses two types-integration of two classifiers, and three classifiers. Ensemble method is based on voting for the sake of integrating classifiers. Among the single classifiers, CART showed best performance with 48.2%, compared with 37.18% by regression. Among the ensemble methods, the result from integrating TAN, CART, and SVM was best with 49.25%. Through the additional analysis in individual industries, those relatively stabilized industries like electronic appliances, wholesale & retailing, woods, leather-bags-shoes showed better performance over 50%.

상선 운항 사고의 양적 위기평가기법 개발 (Development of Quantitative Risk Assessment Methodology for the Maritime Transportation Accident of Merchant Ship)

  • 임정빈
    • 한국항해항만학회지
    • /
    • 제33권1호
    • /
    • pp.9-19
    • /
    • 2009
  • 본 논문에서는 상선의 운항 사고에 관한 양적 위기평가에 관한 실험적인 접근방법들을 기술했다. 이 연구의 목적은 국제해사기구의 공식 안전성 평가(FSA)를 기반으로 운항 사고에 크게 기여하는 요소들을 분석하고, 양적 위기평가기법에 기반을 둔 운항 사고의 확률적인 위기수준을 평가한 후, 선박 안전을 저해할 수 있는 운항 사고 위기를 예측하는 것이다. 확률지수(PI)와 심각성지수(SI) 구성된 위기지수(RI)에 대한 운항 사고의 확률적인 위기수준은 베이지안 이론을 적용한 베이지안 네트워크를 기반으로 본 연구에서 제안한 운항사고 위기 모델을 이용해서 예측했다. 그리고 355건의 핵심 손상 사고기록으로 구성된 시나리오 그룹을 이용하여 제안한 모델의 적용 가능성을 평가하였다. 평가결과, 예측한 PI의 정답률 $r_{Acc}$은 82.8%로 나타났고, $S_p{\gg}1.0$$S_p{\ll}1.0$에 포함되는 PI 변수들의 민감도 초과비율은 10% 이내로 나타났으며, 예측한 SI의 평균 오차 $\bar{d_{SI}}$는 0.0195로 나타났고, 예측한 RI의 정답률은 91.8%로 나타났다. 이러한 결과는 제안한 모델과 방법이 실제 해상운송 현장에 적용 가능함을 나타낸다.

GC/MS 분석과 베이지안 분류 모형을 이용한 새 윤활유와 사용 엔진 오일의 동일성 추적과 분류 (Identification and classification of fresh lubricants and used engine oils by GC/MS and bayesian model)

  • 김남이;남금문;김유나;이동계;박세연;이경재;이재용
    • 분석과학
    • /
    • 제27권1호
    • /
    • pp.41-59
    • /
    • 2014
  • 국내 시판제품으로 서울시내에서 구입한 산업용 윤활유, 이륜구동 윤활유, 선박용 윤활유, 자동차용 윤활유(엔진오일, 수동 변속기 기어유, 자동변속기 오일) 등 80종(기유 4종 포함)의 새 윤활유들(80 classes)과 8종의 경유 차량과 16종의 휘발유 차량에 각각 3종씩의 경유와 휘발유 전용 엔진 오일로 교환하여 차량별 및 주행거리별로 각각 채취한 사용 엔진 오일 86종을 GC/MS로 분석한 TIC로 데이터베이스를 만들고, 새 윤활유와 사용 엔진오일들의 동일성 추적과 차량별 분류를 위하여 차원 축소와 베이지안 방식의 분류 모형을 개발하였다. 새 윤활유의 분류는 웨이블렛 적합방법과 주성분 분석방법으로 차원 축소하여 베이지안 방식의 분류 모형을 적용한 결과 각각 97.5%와 96.7%의 정분류율을 보여 차원 축소는 웨이블렛 적합방법이 더 좋은 결과를 나타냈다. 그리고 새 윤활유의 분류에서 선택된 웨이블렛 적합방법의 차원 축소와 베이지안 방식의 분류 모형에 의한 사용 엔진 오일의 차량별 분류(총 24 classes)는 86.4%의 정분류율을 보였고, 경유 차량인지 휘발유 차량인지를 구분하는 차량 연료 타입별 분류(총 2 classes)는 99.6%의 정분류율을 나타내었고, 사용 엔진 오일 브랜드별 분류(총 6 classes)는 97.3%의 정분류율을 나타내었다.

신경망과 운전자 알고리즘을 이용한 스팸 메일 필터링 기법에 구현과 성능평가 (Implementation and Experimental Results of Neural Network and Genetic Algorithm based Spam Filtering Technique)

  • 김범배;최형기
    • 정보처리학회논문지C
    • /
    • 제13C권2호
    • /
    • pp.259-266
    • /
    • 2006
  • 스팸 메일의 양의 급증함에 따라, 다양한 스팸 메일 필터링 기법이 제시되고 있다. 이런 필터링 기법 가운데, 학습 기반 필터링 기법은 현재 가장 보편화된 필터링 기법 가운데 하나이다. 본고에서는 신경망과, 유전자알고리즘, 카이제곱통계를 이용한 학습 기반 필터링 기법을 제시한다. 제안된 필터링 기법은 기존 필터링 기법의 문제를 해결하고, 스팸 메일 필터링에 높은 정확도를 제공할 수 있다 제안된 필터링 기법은 스팸메일 필터링 정확도와 정상 메일 필터링 정확도에서 각각 95.25%와 95.31%의 높은 정확도를 보인다. 이런 실험 결과는 기존의 규칙 기반 필터링 기법과 베이지안 필터링 기법에 비해 각각 7%, 12% 이상 높은 수치이다.

Learning Graphical Models for DNA Chip Data Mining

  • Zhang, Byoung-Tak
    • 한국생물정보학회:학술대회논문집
    • /
    • 한국생물정보시스템생물학회 2000년도 International Symposium on Bioinformatics
    • /
    • pp.59-60
    • /
    • 2000
  • The past few years have seen a dramatic increase in gene expression data on the basis of DNA microarrays or DNA chips. Going beyond a generic view on the genome, microarray data are able to distinguish between gene populations in different tissues of the same organism and in different states of cells belonging to the same tissue. This affords a cell-wide view of the metabolic and regulatory processes under different conditions, building an effective basis for new diagnoses and therapies of diseases. In this talk we present machine learning techniques for effective mining of DNA microarray data. A brief introduction to the research field of machine learning from the computer science and artificial intelligence point of view is followed by a review of recently-developed learning algorithms applied to the analysis of DNA chip gene expression data. Emphasis is put on graphical models, such as Bayesian networks, latent variable models, and generative topographic mapping. Finally, we report on our own results of applying these learning methods to two important problems: the identification of cell cycle-regulated genes and the discovery of cancer classes by gene expression monitoring. The data sets are provided by the competition CAMDA-2000, the Critical Assessment of Techniques for Microarray Data Mining.

  • PDF