E-mail Classification and Category Re-organization using Dynamic Category Hierarchy and PCA

  • Park, Sun (Graduate Education Center of Jeonbuk of Electronics and Information Technology-BK21, Chonbuk National University) ;
  • Kim, Chul-Won (Department of Computer Engineering, Chonbuk National University) ;
  • An, Dong-Un (Division of Electronic & Information Engineering, Chonbuk National University)
  • Published : 2009.09.30

Abstract

The amount of incoming e-mails is increasing rapidly due to the wide usage of Internet. We often group e-mails into categories for maintaining e-mail efficiently. However reading the email messages and classifying them is still tedious task. Moreover, the number of e-mails and manual classifying is increasing everyday. So, automatic e-mail classification is important techniques. In this paper, we propose a multi-way e-mail classification method that uses PCA for automatic category generation and dynamic category hierarchy for re-organizing e-mail categories. It classifies a huge amount of receiving e-mail messages automatically, efficiently, and accurately.

Keywords

References

  1. W.W. Cohen. Learning Rules that classify E-mail. In Proc. AAAI Spring Symposium in Information Access, 1999
  2. I. Androutsopoulos et al. An Evaluation of NaIve Bayesian Anti-Spam Filtering. In Proc. Workshop on Machine Learning in the New Information Age, 2000
  3. G. Sakkis et al. Stacking classifiers for anti-spam filtering of e-mail. In Proc. 6th Conf. On Empirical Methods III Natural Language Processing, 2001
  4. H. Drucker, D. Wu, and V. N. Vapnik, Support Vector Machines for Spam Categorization. IEEE Transactions on Neural network, 10(5), 1999 https://doi.org/10.1109/72.788645
  5. L. Kun-Lun, Li, Kai, H, Hou-Kuan, T. Sheng-Feng, Active Learning with Simplified SVMS for SP AM Categorization. In Proc. First Conf. On Machine Learning and Cybernetics, Beijing, 4-5, November, 2002 https://doi.org/10.1109/ICMLC.2002.1167390
  6. M. Woitaszek, M. shaaban. IdentifYing Junk Electronic Mail in Microsoft Outlook with a Support Vector Machine. In Proc. 2003 Symposium. On Application and the Internet. 2003
  7. K. Mock. Dynamic Email Organization via Relevance Categories. In Proceedings of the International Conference on Tools with Artificial Intelligence 1999. Chicago IL, Nov. 1999 https://doi.org/10.1109/TAI.1999.809830
  8. G. Manco, E. Masciari. A Framework for Adaptive Mail Classification. In Proceedings of the 14th IEEE International Conference on Tools with Artificial Intelligence. 2002 https://doi.org/10.1109/TAI.2002.1180829
  9. T. M. Alrashid, 1. A. Barker, B. S. Christian, S.C. Cox, M. W. Rabne, E. A. Slotta and L. R. Upthegrove. Safeguarding Copyrighted Contents, Digital Libraries and Intellectual Property Management, CWRU's Rights Management System. D-Lib Magazine, April 1998. http://www.dlib.org/dlib/april98/04barker.html
  10. Y. Ogawa, T. Morita, and K. Kobayashi. A fuzzy document retrieval system using the keyword connection matrix and a learning method. Fuzzy Sets and System, pp. 163-179, 1991 https://doi.org/10.1016/0165-0114(91)90210-H
  11. G. W. Furnas, S. Deerwester, S. T. Dumais, T. K. Landauer, R. A. Harshman, L. A. Streeter, and K. E. Lochbaum. Information retrieval using a singular value decomposition model of latent semantic structure. In Proceedings of the 11 th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, pp. 465-480, 1988 https://doi.org/10.1145/62437.62487
  12. R. B. Y and B. R. N. Modem Information Retrieval. Addison Wesley, 1999.
  13. C. Bumghi, L. Ju-Hong, P. Sun. Dynamic Construction of Category Hierarchy Using Fuzzy Relational Products. In proceedings of the 4th International Conference On Intelligent Data Engineering and Automated Learing. Hong Kong, China, pp.296-302, 2003
  14. S.S. Kang. Korean Information Retrieval and Morpheme analysis. HongReung Science Publishing Co., 2002
  15. D. Zhang, Y. Dong. Semantic, hierarchical, online clusteing of web search results. In proceedings Asia Pacific Web Conference (APWEB), Hangzhou, China, pp. 67-78, 2004
  16. DB. G. Choi, T. S. Park, J. II. Lee, S. Park. Web Search Model for Dynamic and Fuzzy Directory Search. LNCS 3878. pp. 406-409. 2006 https://doi.org/10.1007/11671299_42
  17. S. Park, S. H. Park, J. H. Lee, J. S. Lee. E-mail Classification Agent Using Category Generatoin and Daynamic Category Hierarchy. LNAI 3397. pp. 207-214. 2005