Optimization Methodology Integrated Data Mining and Statistical Method

데이터 마이닝과 통계적 기법을 통합한 최적화 기법

  • Song, Suh-Ill (Dept. of Industrial & Management Systems Engineering, Dong-A University) ;
  • Shin, Sang-Mun (Department of Systems Management & Engineering, Inje University) ;
  • Jung, Hey-Jin (Dept. of Industrial & Management Systems Engineering, Dong-A University)
  • 송서일 (동아대학교 산업경영공학과) ;
  • 신상문 (인제대학교 시스템경영공학과) ;
  • 정혜진 (동아대학교 산업경영공학과)
  • Published : 2006.12.31

Abstract

These days manufacture technology and manufacture environment are changing rapidly. By development of computer and enlargement of technique, most of manufacture field are computerized. In order to win international competition, it is important for companies how fast get the useful information from vast data. Statistical process control(SPC) techniques have been used as a problem solution tool at manufacturing process until present. However, these statistical methods are not applied more extensively because it has much restrictions in realistic problems. These statistical techniques have lots of problems when much data and factors are analyzed. In this paper, we proposed more practical and efficient a new statistical design technique which integrated data mining (DM) and statistical methods as alternative of problems. First step is selecting significant factor using DM feature selection algorithm from data of manufacturing process including many factors. Second step is finding optimum of process after estimating response function through response surface methodology(RSM) that is a statistical techniques

Keywords

References

  1. 백동현, 한창희(2003), '데이터마이닝을 이용한 반도체 FAB 공정의 수율개선 및 예측',한국지능정보시스템학회논문지, 9권, 1호, pp. 157-177
  2. 이기훈(2000), '데이터 마이닝에서 로버스트 통계적 기법의 도입', 산경논총, 18집
  3. Alexander, H. and Daniel, A. K.(1999), 'Clustering Method for Large Data Sets', SIGMOD99
  4. Chang, X. F. and Xian, F. W.(2004), 'Data mining techniques applied to predictive modeling of the knurling process', IIE Transactions, Vol. 36, pp. 253-263 https://doi.org/10.1080/07408170490274214
  5. DuMonuchel, W.(1999), 'Bayesian Data Mining in Large Frequency Tables With an Application to the Spontaneous Reportign System', The American Statistician, Vol. 53, pp. 177-202 https://doi.org/10.2307/2686093
  6. Gardner, M. and Bieker, J.(2000), 'Data Mining Solves Though Semiconductor Manufacturing Problem. Conference on Knowledge Discovery in Data Proceedings of the sixth ACMSIGKDD international conference on Knowledge discovery and data mining', New York, pp. 376-383
  7. Hall, M. A.(1998), 'Correlation-based Feature Selection for Machine Learning', Waikato University, Department of Computer Science. Hamilton, New Zealand
  8. John, G. H. and Kohavi, R., and Pflager, P. (1994), 'Irrelevant Features and the Subset Selection Problem', In Machine Learning : Proceedings of the Eleventh International Conference, Morgan Kaufmann
  9. Kuralmani, V. and Xie, M.(2002), 'A conditional decision procedure for high yield process', IIE Transaction, Vol. 34, pp. 1021-1030
  10. Lin, D. K. J. and Tu, W.(1995), 'Dual response surface optimation', Journal of Quality Technology, Vol. 27, pp. 34-39 https://doi.org/10.1080/00224065.1995.11979556
  11. Montgomery, D. C.(2001), Introduction to Statistical Quality Control, 4th edn. John Wiley & Sons, New York
  12. Seifert, J. W.(2004), Data Mining : An Overview, CRS Report RL31798
  13. Fayyad, U. and piatetsky -Shapiro, G. and Smyth, P.(1996), 'The KDD Process for Extracting Useful Knowledge from Volumes of Data', Communication of the ACM, Vol. 39, No. 11, pp. 27-34
  14. Witten, I. W. H. and Frank, E., Data Mining : Practical Machines Learning Tools and Techniques. 2nd edn Morgan Kaufmann, San Francisco
  15. Yu, L. and Liu, H.(2003), 'Feature Selection for High-Dimensional Data: A Fast Correlation-Based Filter Solution', The Proceedings of the 20th International Conference on Machine Learning (ICML-03), Washington D. C., pp. 856-863