Applying Decision Tree Algorithms for Analyzing HS-VOSTS Questionnaire Results

  • Kang, Dae-Ki (Division of Computer & Information Engineering, Dongseo University)
  • Received : 2011.10.04
  • Accepted : 2012.07.17
  • Published : 2012.07.31

Abstract

Data mining and knowledge discovery techniques have shown to be effective in finding hidden underlying rules inside large database in an automated fashion. On the other hand, analyzing, assessing, and applying students' survey data are very important in science and engineering education because of various reasons such as quality improvement, engineering design process, innovative education, etc. Among those surveys, analyzing the students' views on science-technology-society can be helpful to engineering education. Because, although most researches on the philosophy of science have shown that science is one of the most difficult concepts to define precisely, it is still important to have an eye on science, pseudo-science, and scientific misconducts. In this paper, we report the experimental results of applying decision tree induction algorithms for analyzing the questionnaire results of high school students' views on science-technology-society (HS-VOSTS). Empirical results on various settings of decision tree induction on HS-VOSTS results from one South Korean university students indicate that decision tree induction algorithms can be successfully and effectively applied to automated knowledge discovery from students' survey data.

Keywords

References

  1. Jiawei Han and Micheline Kamber, Data Mining: Concepts and Techniques, 2nd ed., (2006).
  2. Ross Quinlan, C4.5: Programs for Machine Learning. Morgan Kaufmann Publishers, (1993).
  3. Ross Quinlan, "Improved use of continuous attributes in C4.5," Journal of Artificial Intelligence Research, 4: 77-90, (1996).
  4. Christopher M. Bishop Neural Networks for Pattern Recognition, Oxford: Oxford University Press, (1995).
  5. Richard O. Duda, Peter E. Hart and David G. Stork, Pattern classification (2nd edition), Wiley, (2001).
  6. Judea Pearl, Causality: Models, Reasoning, and Inference, Cambridge University Press, (2000).
  7. Judea Pearl, "Bayesian Networks: A Model of Self-Activated Memory for Evidential Reasoning," Proceedings of the 7th Conference of the Cognitive Science Society, University of California, Irvine, CA: 329-334, (1985).
  8. Corinna Cortes and Vladimir Vapnik, "Support-Vector Networks," Machine Learning, 20, (1995).12. Sipser, M (1996). Introduction to the Theory of Computation, PWS Pub. Co.; 1 edition.
  9. Bernhard E. Boser, Isabelle M. Guyon, and Vladimir N. Vapnik, "A training algorithm for optimal margin classifiers," The 5th Annual ACM Workshop on COLT: 144-152, Pittsburgh, PA, (1992).
  10. Barry Robson, O. K. Baek, and Sean Ekins, The engines of Hippocrates: From the Dawn of Medicine to Medical and Pharmaceutical Informatics. Hoboken, NJ: John Wiley & Sons, (2009)
  11. Srinivas Aluru, Handbook of Computational Molecular Biology. Chapman & Hall/Crc, (2006).
  12. J. S. Lyons, and A. M. Bayoumi, "CQI processes, results, and program improvements for engineering design," IEEE Transactions on Education, 43(2): 174-181, (2000). https://doi.org/10.1109/13.848070
  13. A. Ertas, and J. Jones, The Engineering Design Process. 2nd ed. New York, N.Y., John Wiley & Sons, Inc., (1996).
  14. Yong kil Lee and Kyung hee Kang, "Analyzing opinions which university students from engineering and social science department have about science-technology-society literacy", Journal of Engineering Education Research, 13, (4): 43-50, (2010).
  15. Larry Laudan, "The Demise of the Demarcation Problem". In Adolf Grunbaum, Robert Sonne Cohen, Larry Laudan. Physics, Philosophy, and Psychoanalysis: Essays in Honor of Adolf Grunbaum. Springer, 1983.
  16. Karl Popper, The logic of scientific discovery, New York: Basic Books, 1959.
  17. Thomas. S. Kuhn, The Structure of Scientific Revolutions, 2nd ed., Chicago: Univ. of Chicago Press, 1970.
  18. Paul Feyerabend, Against Method: Outline of an Anarchistic Theory of Knowledge, 1975.
  19. Jai-Hang Lim, Soon-Min Kang, Young-Tae Kong, Byung- Soon Choi, and Jeong-Hee Nam, "The Development of an Instrument to Assess High School Students' Views on Science-Technology-Society," Journal of the Korean Association for Science Education, 24(6): 1143-1157, (2004).
  20. D.-K. Kang and K. Sohn, "Learning decision trees with taxonomy of propositionalized attributes," Pattern Recognition, 42(1): 84-92, Jan. 2009, Elsevier B.V. https://doi.org/10.1016/j.patcog.2008.07.009
  21. D.-K. Kang and M.-J. Kim, "Propositionalized Attribute Taxonomies from Data for Data-Driven Construction of Concise Classifiers," Expert Systems With Applications, 38(10): 12739-12746, September 2011, Elsevier B.V. https://doi.org/10.1016/j.eswa.2011.04.062
  22. J. R. Quinlan, C4.5: Programs for Machine Learning. Morgan Kaufmann Publishers, 1993.
  23. Is See5/C5.0 Better Than C4.5?, Rulequest Research 2009. Online: http://www.rulequest.com/see5-comparison.html.
  24. L. Breiman, Classification and Regression Trees, Chapman & Hall, 1984.
  25. G. V. Kass, "An exploratory technique for investigating large quantities of categorical data," Applied Statistics 29(2): 119-127, 1980. https://doi.org/10.2307/2986296
  26. Y. J. Lee, "Factors in Information and Communication Ethics Behavior of College Students Majoring in Information and Communication Engineering," Journal of Engineering Education Research, 13(3): 68-77, (2010). https://doi.org/10.18108/jeer.2010.13.3.68
  27. Y. J. Lee, "A Study on the Information and Communication Ethics from the Survey of College Students," Journal of Engineering Education Research, 13(3): 96-103, (2010). https://doi.org/10.18108/jeer.2010.13.3.96
  28. W. S. Kim, "A Study on Factors of Education's Outcome using Decision Trees," Journal of Engineering Education Research, 13(4): 51-59, (2010). https://doi.org/10.18108/jeer.2010.13.4.51
  29. C. P. Snow, The Two Cultures, London: Cambridge University Press, 1959.
  30. R. P. Feynman, The Pleasure of Finding Things Out: The Best Short Works of Richard P. Feynman, Basic Books; 1 edition, 2000.
  31. R. Carnap, The Continuum of Inductive Methods. University of Chicago Press, 1952.
  32. C. Hempel, Philosophy of Natural Science, 1966.
  33. K. Popper, Conjectures and Refutations: The Growth of Scientific Knowledge, 1963.
  34. P. Feyerabend, Against Method: Outline of an Anarchistic Theory of Knowledge, 1975.
  35. I. Lakatos, The Methodology of Scientific Research Programmes: Philosophical Papers Volume 1. Cambridge: Cambridge University Press, 1978.
  36. T. Kuhn, The Structure of Scientific Revolutions. Chicago: University of Chicago Press, 1962.
  37. W. Bauchspies, J. Croissant, and S. Restivo, Science, Technology, and Society: A Sociological Approach, 2005.