Building Hierarchical Knowledge Base of Research Interests and Learning Topics for Social Computing Support

Building a Hierarchical Knowledge Base of Research and Learning Topics for Social Computing (translated from the Korean title)

  • Received : 2012.10.22
  • Accepted : 2012.11.29
  • Published : 2012.12.28

Abstract

This paper consists of two parts. In the first part, we describe our work to build a hierarchical knowledge base of digital library patrons' research interests and learning topics in various scholarly areas by analyzing the well-classified Electronic Theses and Dissertations (ETDs) of the NDLTD Union Catalog. Journal articles from ACM Transactions and conference web sites in computing areas are also included in the analysis to refine the computing fields. This hierarchical knowledge base would be a useful tool for many social computing and information service applications, such as personalization, recommender systems, text mining, technology opportunity mining, and information visualization. In the second part, we compare four grouping algorithms to select the best one for our data mining research by testing each with the hierarchical knowledge base described in the first part. With these two studies, we intend to show that traditional verification methods for social community mining research, which rely on interviews and questionnaires and are expensive, slow, and privacy-threatening, can be replaced with systematic, consistent, fast, and privacy-protecting methods that use the proposed hierarchical knowledge base.

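This paper studies support for social computing through a knowledge base of research and learning topics, and consists of two sub-studies. The first builds a hierarchical knowledge base by analyzing the master's and doctoral theses (Electronic Theses and Dissertations: ETDs) of the NDLTD Union catalog, which are well classified by field, in order to extract the research and learning topics of digital library users across various academic disciplines. In addition to the theses, papers from ACM Transactions journals and the web sites of international computing conferences were also analyzed, to obtain a more fine-grained knowledge base for the computing field. The hierarchical knowledge base can be useful for social computing and for information services such as personalization, recommender systems, text mining, technology opportunity discovery, and information visualization. The second part presents a performance evaluation that uses the hierarchical knowledge base to find, among four user community mining algorithms, the grouping algorithm that performs best for our ongoing social computing research, namely a recommender system based on cohesion among members. Through this paper, we aim to show that the method using our proposed research and learning topic database can replace the existing interview- and questionnaire-based methods for user community mining, which are costly, slow, and carry a risk of privacy infringement, with a method that is automated, inexpensive, fast, free of privacy infringement risk, and consistent across repeated runs.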
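
As a rough illustration of what a hierarchical knowledge base of topics could look like, the sketch below maps each node of a category hierarchy to the terms found in the titles of documents classified under it. It is not the authors' method: the sample records, the stopword list, and the function names `build_knowledge_base` and `best_matching_node` are all hypothetical, and the abstract does not state how terms were extracted or matched.

```python
from collections import Counter, defaultdict

# Hypothetical sample records (category path, title); not data from the paper.
RECORDS = [
    (("Computing", "Information Retrieval"), "Implicit feedback for document ranking"),
    (("Computing", "Information Retrieval"), "Document clustering for retrieval"),
    (("Computing", "Digital Libraries"), "Metadata harvesting for digital libraries"),
    (("Biology", "Genetics"), "Gene expression in maize"),
]

# Assumed minimal stopword list, for illustration only.
STOPWORDS = {"for", "in", "of", "the", "and", "a"}


def build_knowledge_base(records):
    """Map every node of the category hierarchy to a Counter of title terms."""
    kb = defaultdict(Counter)
    for path, title in records:
        terms = [w for w in title.lower().split() if w not in STOPWORDS]
        # Credit the terms to the leaf category and to each of its ancestors.
        for depth in range(1, len(path) + 1):
            kb[path[:depth]].update(terms)
    return kb


def best_matching_node(kb, query_terms, depth):
    """Return the node at the given depth whose terms overlap most with the query."""
    scores = {
        node: sum(counts[t] for t in query_terms)
        for node, counts in kb.items()
        if len(node) == depth
    }
    return max(scores, key=scores.get)


kb = build_knowledge_base(RECORDS)
print(best_matching_node(kb, ["document", "retrieval"], depth=2))
```

Because each term is counted at every ancestor, the same lookup works at a coarse level (`depth=1`) or a fine one (`depth=2`), which is one simple way a hierarchy could let user interests be compared at different granularities.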
