• Title/Abstract/Keywords: mixed data set

A Study on Improving the Impeller Performance of a 2-Stage Mixed Flow Pump (STUDY ON THE HYDRAULIC DESIGN OF 2 STAGE MIXED FLOW PUMP)

  • 김영주;우남섭;권재기;정소걸;박의섭;배상은;박수한
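    • Korean Society for Computational Fluids Engineering: Conference Proceedings / Proceedings of the Korean Society for Computational Fluids Engineering 2011 Spring Conference / pp.556-560 / 2011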
  • The seawater lift pump system maintains the open canal level at its set point to provide suction flow for the circulating water pump. The objective of this paper is to design a 2-stage mixed flow pump (for seawater lifting) by inverse design and to evaluate the overall performance and local flow fields of the pump using a commercial CFD code. The rotating speed of the impeller is 1,750 rpm with a flow rate of 2,700 $m^3/h$. A finite volume method with a structured mesh and the Realizable $k$-$\varepsilon$ turbulence model is used to predict the turbulent flow in the pump impeller more accurately. Numerical results such as static head, brake horsepower, and efficiency of the mixed flow pump are compared with the reference data. In addition, calculations with periodic conditions were carried out to investigate how the pump performance characteristics change when the impeller geometry is modified.

Optimization-Based Pattern Generation for LAD

  • 장인용;류홍서
    • Journal of the Korea Society of Computer and Information / Vol. 11, No. 1 / pp.11-18 / 2006
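  • LAD (Logical Analysis of Data) is a data mining methodology based on Boolean logic. A key step in LAD-based data analysis is pattern generation, which uncovers the structural information hidden in a data set in the form of patterns. Existing pattern generation methods are based on enumeration, which made generating high-degree patterns practically impossible. This paper proposes an optimization-based pattern generation methodology and presents two models: a mixed integer linear model and an SCP (Set Covering Problem) model. Experiments on data sets widely used in machine learning demonstrate that the proposed method easily generates patterns that existing pattern generation methods cannot.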
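
As a minimal illustration of what a "pattern" means in this abstract, the sketch below checks whether a conjunction of Boolean literals is a pure positive pattern, i.e. it covers at least one positive observation and no negative one. This is the usual LAD definition, not the authors' optimization models; the names `Term`, `covers`, `is_positive_pattern`, and `degree` and the toy data are assumptions made for illustration.

```python
from typing import Dict, List

# A term is a conjunction of literals: feature index -> required Boolean value.
# (Hypothetical representation, chosen for this sketch only.)
Term = Dict[int, bool]


def covers(term: Term, observation: List[bool]) -> bool:
    """Return True if the observation satisfies every literal in the term."""
    return all(observation[i] == value for i, value in term.items())


def is_positive_pattern(term: Term,
                        positives: List[List[bool]],
                        negatives: List[List[bool]]) -> bool:
    """A pure positive pattern covers some positive and no negative observation."""
    covers_some_positive = any(covers(term, obs) for obs in positives)
    covers_no_negative = not any(covers(term, obs) for obs in negatives)
    return covers_some_positive and covers_no_negative


def degree(term: Term) -> int:
    """The degree of a pattern is the number of literals it contains."""
    return len(term)


# Toy binarized data set (made up for illustration).
positives = [[True, False, True], [True, True, True]]
negatives = [[False, False, True], [True, False, False]]

candidate = {0: True, 2: True}   # x0 AND x2
too_general = {0: True}          # x0 alone also covers a negative observation
```

Here `candidate` is a degree-2 positive pattern, while `too_general` is not a pattern because it covers the second negative observation. Enumerating every such term grows combinatorially with the degree, which is the limitation the abstract attributes to enumeration-based methods.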

An Application of ISODATA Method for Regional Lithological Mapping

  • 朴鍾南;徐延熙
    • Korean Journal of Remote Sensing / Vol. 5, No. 2 / pp.109-122 / 1989
  • The ISODATA method, one of the best-known square-error clustering methods, was applied to two Chungju multivariate data sets to evaluate its effectiveness for regional lithological mapping. One is an airborne radiometric data set and the other is a mixed data set of airborne radiometric and Landsat TM data. In both cases, classification of the Bulguksa granite and the Kyemyongsan biotite-quartz gneiss is the most successful. Hyangsanni dolomitic limestone and the neighboring Daehyangsan quartzite are also classified by their typically low radioactive intensities, although they are still confused with other units such as water-covered areas, nearby alluvium, and unaltered limestone areas. Topographically rugged valleys are also assigned to the same cluster. This could be due to unavoidable variations in flight height and attitude of the airborne system over such rugged terrain. The regional geological mapping of sedimentary rock units of the Ockchun System is in general confused, which might be due to similarities between different sediments. Considerable discrepancies in mapping some lithological boundaries might also be due to secondary effects such as contamination or smoothing in the digitizing process. Further study of the variable selection scheme is needed, since no method is yet claimed to be absolutely superior and the choice appears to be rather data dependent. The data preprocessing could also be studied to reduce the erratic effects mentioned above, and thus hopefully yield much better results in regional geological mapping.
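
For reference, the square-error criterion that this family of clustering methods minimizes can be written as follows. The notation ($K$ clusters $C_k$ with centroids $m_k$, observations $x_i$) is assumed here and is not taken from the paper; ISODATA additionally splits and merges clusters between iterations, which this formula does not capture.

$$
J = \sum_{k=1}^{K} \sum_{x_i \in C_k} \lVert x_i - m_k \rVert^2,
\qquad
m_k = \frac{1}{\lvert C_k \rvert} \sum_{x_i \in C_k} x_i
$$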

Derivation of a benchmark dose lower bound of lead for attention deficit hyperactivity disorder using a longitudinal data set

  • 이주형;김시연;하미나;권호장;김병수
    • The Korean Journal of Applied Statistics / Vol. 29, No. 7 / pp.1295-1309 / 2016
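  • The purpose of this study is to derive a benchmark dose lower bound (BMDL) of lead from the CHEER data, a longitudinal data set built by the Korean Ministry of Environment to assess the effects of the environment on children's health, and thereby to reproduce the results of Kim et al. (2014). We used the 2005 cohort of the CHEER data: the ADHD rating scales of the 2005 cohort were unified with a conversion formula based on penalized linear splines, and two linear mixed models reflecting the characteristics of longitudinal data were built. The BMDL of blood lead concentration was then derived from the fitted models. In the process, the regression to the mean of ADHD scores found by Kim et al. (2014) was reconfirmed, and a characteristic difference between the distributions of the 2005 and 2006 cohorts was found. Taking this difference into account, we obtained results consistent with Kim et al. (2014).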

Individual Tree Growth Models for Natural Mixed Forests in Changbai Mountains, Northeast China

  • Lu, Jun;Li, Fengri
    • Journal of Korean Society of Forest Science / Vol. 96, No. 2 / pp.160-169 / 2007
  • The data used to develop distance-independent individual tree models for natural mixed forests were collected from 712 remeasured permanent sample plots (25,526 trees) over a 10-year period from 1990 to 2000 in the Baihe Forest Bureau of the Changbai Mountains, northeast China. Based on an analysis of the relationship between the diameter increment of individual trees and tree size, competitive status, and site condition, diameter growth models for individual trees of 15 species growing in mixed-species uneven-aged forest stands were developed using stepwise regression; the models have a simple form, good predictive precision, and are easy to apply. The main variables influencing the diameter increment of individual trees were tree size and competition, whereas site conditions were not significantly related to diameter increment. The tree size variables (lnDBH and $DBH^2$) were the most significant and important predictors of diameter growth and appear in all 15 growth models. The diameter increment was directly proportional to tree diameter for each species. Among the competition factors in the growth models, the relative diameter (RD), canopy closure (P), and the ratio of the diameter of the subject tree to the maximum diameter (DDM) contributed to the diameter increment to a certain extent. Other measures of stand density, such as stand basal area (G) and stand density index (SDI), did not significantly influence diameter increment. Site factors such as site index, slope, and aspect were not important to diameter increment and were excluded from the final models. The total variance explained ($R^2$) by the final models of squared diameter increment for all 15 species ranged from 35% to 72%, and these results compared quite closely with those of Wykoff (1990) for mixed conifer stands. Validation measures for the diameter increment models developed in this study were evaluated using an independent data set. The results indicated that the estimated precision was greater than 94% in all cases and that the models were suitable for describing diameter increment.

Pagoda Data Management and Metadata Requirements for Libraries in Myanmar

  • Tin Tin Pipe;Kulthida Tuamsuk
    • Journal of Information Science Theory and Practice / Vol. 11, No. 3 / pp.79-91 / 2023
  • The storage of data documentation for Myanmar pagodas has various issues, and its retrieval method causes problems for users and libraries. This study utilized a mixed-methods approach, combining qualitative and quantitative methods to investigate pagoda data management in Myanmar libraries. The study aims to achieve the following objectives: to study the library collection management of pagodas in Myanmar, to investigate the management of pagoda data in Myanmar libraries, and to identify the pagoda data requirements for metadata development from the library professional perspective. The study findings revealed several challenges facing librarians and library users in accessing and managing Myanmar pagoda data, including limited stocks and retrieval tools, difficulty in accessing all available data online, and a lack of a centralized database or repository for storing and retrieving pagoda data. The study recommends the establishment of metadata criteria for managing a set of pagoda data and improving access to technology to address these challenges.

Consensus Clustering for Time Course Gene Expression Microarray Data

  • Kim, Seo-Young;Bae, Jong-Sung
    • Communications for Statistical Applications and Methods / Vol. 12, No. 2 / pp.335-348 / 2005
  • The rapid development of microarray technologies has enabled the simultaneous monitoring of expression levels of thousands of genes. Recently, time course gene expression data are often measured to study dynamic biological systems and gene regulatory networks. For such data, biologists attempt to group genes based on the temporal pattern of their expression levels. We apply the consensus clustering algorithm to time course gene expression data in order to infer statistically meaningful information from the measurements. We evaluate consensus clustering and existing clustering methods with various validation measures. In this paper, we consider hierarchical clustering and Diana as existing methods, and consensus clustering with hierarchical clustering, with Diana, and with mixed hierarchical and Diana methods, and evaluate their performance on a real microarray data set and two simulated data sets.
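
One common building block of consensus clustering is a co-association (consensus) matrix that records how often two items land in the same cluster across several base clusterings. The abstract does not say which construction the authors use, so the sketch below is an assumption; `consensus_matrix` and the toy labelings are hypothetical.

```python
from typing import List


def consensus_matrix(labelings: List[List[int]]) -> List[List[float]]:
    """Fraction of base clusterings in which items i and j share a cluster.

    Each element of `labelings` is one clustering of the same n items,
    given as a list of cluster labels (labels need not match across runs).
    """
    n_runs = len(labelings)
    n_items = len(labelings[0])
    counts = [[0] * n_items for _ in range(n_items)]
    for labels in labelings:
        for i in range(n_items):
            for j in range(n_items):
                if labels[i] == labels[j]:
                    counts[i][j] += 1
    return [[c / n_runs for c in row] for row in counts]


# Three hypothetical base clusterings of four genes.
runs = [
    [0, 0, 1, 1],
    [1, 1, 0, 0],
    [0, 1, 1, 1],
]
matrix = consensus_matrix(runs)
```

In this toy case genes 2 and 3 are always grouped together (entry 1.0), while genes 0 and 2 never are (entry 0.0). A final partition is typically obtained by clustering this matrix, for example with a hierarchical method.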

Supremacy of Realized Variance MIDAS Regression in Volatility Forecasting of Mutual Funds: Empirical Evidence From Malaysia

  • WAN, Cheong Kin;CHOO, Wei Chong;HO, Jen Sim;ZHANG, Yuruixian
    • The Journal of Asian Finance, Economics and Business / Vol. 9, No. 7 / pp.1-15 / 2022
  • Combining the strengths of Mixed Data Sampling (MIDAS) regression and realized variance measures, this paper pursues two objectives: (1) to evaluate the post-sample performance of the proposed weekly Realized Variance-MIDAS (RVar-MIDAS) model in one-week-ahead volatility forecasting against the established Generalized Autoregressive Conditional Heteroskedasticity (GARCH) model and the less explored but robust Smooth Transition Exponential Smoothing (STES) methods; and (2) to compare forecast error performance between realized variance and squared residuals as proxies for actual volatility. Data on seven private equity mutual fund indices (generated from 57 individual funds) from two different time periods (with and without a financial crisis) are applied to 21 models. The robustness of the post-sample volatility forecasts of all models is validated by the Model Confidence Set (MCS) procedure, which revealed that: (1) the weekly RVar-MIDAS model emerged as the best model, outperforming the robust DAILY-STES methods and the weekly DAILY-GARCH models, particularly during a volatile period; and (2) models that use realized variance in estimation and as a proxy for actual volatility outperformed those using squared residuals. This study contributes an empirical approach to one-week-ahead volatility forecasting of mutual fund returns, which is less explored in the literature on financial volatility forecasting than stock volatility.
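
The two ingredients named in this abstract can be sketched briefly: realized variance as the sum of squared higher-frequency returns within a period, and a MIDAS term as a weighted sum of lagged values. The abstract does not state the lag-weight function, so the exponential Almon weights below are an assumed, commonly used choice; all function names and numbers are hypothetical.

```python
import math
from typing import List


def realized_variance(returns: List[float]) -> float:
    """Sum of squared returns observed within one period (e.g. daily returns in a week)."""
    return sum(r * r for r in returns)


def exp_almon_weights(n_lags: int, theta1: float, theta2: float) -> List[float]:
    """Normalized exponential Almon lag weights (assumed weighting scheme)."""
    raw = [math.exp(theta1 * k + theta2 * k * k) for k in range(1, n_lags + 1)]
    total = sum(raw)
    return [w / total for w in raw]


def midas_term(lagged_values: List[float], weights: List[float]) -> float:
    """Weighted sum of lagged regressors, most recent lag first."""
    return sum(w * x for w, x in zip(weights, lagged_values))


# Hypothetical daily returns for one week.
week_returns = [0.01, -0.02, 0.0]
rv = realized_variance(week_returns)

# With theta1 = theta2 = 0 the weights reduce to a simple average.
flat = exp_almon_weights(4, 0.0, 0.0)
decaying = exp_almon_weights(5, 0.1, -0.05)
```

In a full MIDAS regression the weight parameters and the slope on `midas_term` would be estimated jointly, which this sketch does not attempt.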

MINIMIZATION OF PARENT ROLL TRIM LOSS FOR THE PAPER INDUSTRY

  • Bae, Hee-Man
    • Journal of the Korean Operations Research and Management Science Society / Vol. 3, No. 2 / pp.95-108 / 1978
  • This paper discusses an application of mathematical programming techniques in the paper industry in determining optimal parent roll widths. Parent rolls are made from the reels produced at wide paper machines by slitting them to more manageable widths. The problem is finding a set of the slitting patterns that will minimize the trim loss involved in the sheeting operation. Two programming models, one linear and one mixed integer linear, are presented in this paper. Also presented are the computational experience, the model sensitivity, and the comparison of the optimal solutions with the simulated operational data.
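
To make the notion of trim loss concrete, the sketch below computes the unused width left when a reel is slit according to one pattern. It is only an illustration of the quantity being minimized, not the paper's linear or mixed integer model; `trim_loss` and the widths are hypothetical.

```python
from typing import Dict


def trim_loss(reel_width: float, pattern: Dict[float, int]) -> float:
    """Unused width when a reel is slit by `pattern` (roll width -> number of rolls)."""
    used = sum(width * count for width, count in pattern.items())
    if used > reel_width:
        raise ValueError("pattern does not fit on the reel")
    return reel_width - used


# Hypothetical example: a 200-unit reel slit into two 60-unit and two 35-unit rolls.
loss = trim_loss(200.0, {60.0: 2, 35.0: 2})
```

An optimization model would choose which patterns to run, and how often, so that demand is met while the total of such losses is minimized.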

A Decision Tree Algorithm using Genetic Programming

  • Park, Chongsun;Ko, Young Kyong
    • Communications for Statistical Applications and Methods / Vol. 10, No. 3 / pp.845-857 / 2003
  • We explore the use of genetic programming to evolve decision trees directly for classification problems with both discrete and continuous predictors. We demonstrate that the hypotheses derived by standard algorithms can deviate substantially from the optimum. This deviation is partly due to their top-down style procedures. The performance of the system is measured on a set of real and simulated data sets and compared with the performance of well-known algorithms such as CHAID, CART, C5.0, and QUEST. The proposed algorithm seems to be effective in handling the problems caused by the top-down style procedures of existing algorithms.