DOI QR코드

DOI QR Code

Heterogeneous Web Information Integration System based on Entity Identification

  • Shin, Hyung-Wook (Development Inzisoft Co. LTD.) ;
  • Yang, Hyung-Jeong (Department of Computer Science, Chonnam National University) ;
  • Kim, Soo-Hyung (Department of Computer Science, Chonnam National University) ;
  • Lee, Guee-Sang (Department of Computer Science, Chonnam National University) ;
  • Kim, Kyoung-Yun (Department of Industrial and Manufacturing Engineering, Wayne State University) ;
  • Kim, Sun-Hee (Department of Computer Science, Carnegie Mellon University) ;
  • Ngoc, Do Luu (Department of Computer Science, Chonnam National University)
  • Received : 2012.09.17
  • Accepted : 2012.12.10
  • Published : 2012.12.28

Abstract

It is not easy for users to effectively have information that is semantically related but scattered on the Web. To obtain qualitatively improved information in web pages, it is necessary to integrate information that is heterogeneous but semantically related. In this study, we propose a method that provides XML-based metadata to users through integration of multiple heterogeneous Web pages. The metadata generated from the proposed system is obtained by integrating different heterogeneous information into a single page, using entity identification based on ontology. A wheelchair information integration system for disabled people is implemented to verify the efficiency of the proposed method. The implemented system provides an integrated web page from multiple web pages as a type of XML document.

Keywords

References

  1. T.B. Lee, J. Hendler, and O. Lassila, "The Semantic Web," Scientific American, May 2001.
  2. J. Martin, Strategic Data Planning Methodologies, Prentice-Hall, Inc., Englewood Cliffs, New Jersey, 1982, p.127.
  3. J.C. Song, D.I. Lee, and B.J. Moon, "Standardization of Semantic Web and Development Trends of Technical Factors," National IT Industry Promotion Agency, [IITA] Weekly Technological Trends, 2002.
  4. S.Y. Park, "Comparative Evaluation of Directory Services Provided by Major Korean Search Portals: In the Field of Computer and Internet," Korean Society for Library and Information Science, Journal of the Korean Library and Information Science Society, 2009, pp.215- 234. https://doi.org/10.4275/KSLIS.2009.43.1.215
  5. A. Gomez-Perez, M. Fernandez-Lopez, and O. Corcho, "Ontological Engineering: With Examples from the Areas of Knowledge Management," E-commerce and the Semantic Web. Springer, 2004.
  6. S.E. Shin and Y.H. Seo, "Semantic-based Query Generation For Information Retrieval," International Journal of Contents, vol.1, no.2, 2005, pp.39-43.
  7. http://www.Nate.com
  8. M. Y. Ra, "A Metadata-based Data Grid System for the Integration of Military Information," Korea Society of Computer Information Papers, vol.13, no.2, 2008, pp.95- 103.
  9. O. Corcho, P. Alper, P. Missier, S. Bechhofer, and C. Goble, "Grid metadata management: Requirements and architecture," Proc. 8th IEEE/ACM International Conference on Grid Computing, 2007, pp.97-104.
  10. A. A. Assali and H. Zanghi, "Automated Metadata Hierarchy Derivation," Information and Communication Technologies, vol.1, 2006, pp.505-510.
  11. D. K. Kim, K. J. Jeong, H.S. Shin, and S.T. Hwang, "An XML Schema-based Semantic Data Integration," Korean Institute of Information Scientists and Engineers Papers: System and Theory, vol.33, no.9, 2006, pp.563-573.
  12. http://developer.yahoo.com/searchmonkey/
  13. J.H. Kim, H.Y. Kwak, and H. Kwon, "RDB Schema Model of XML Document for Storage Capacity and Searching Efficiency," Journal of Korea Contents, vol.6, no.4, 2006, pp.19-28.
  14. RDF, http://www.w3.org/RDF
  15. W. H. Yu and H. J. Koh, "Design of a RDF Metadata System for the Searching of Application Programs," Journal of Korea Contents, vol. 5, no. 6, 2005, pp.1-9.
  16. OWL, http://www.w3.org/TR/owl-features
  17. D. Braverman, "Learning filters for optimum pattern recognition cognition Learning filters for optimum pattern recognition," Knowledge Acquisition, 1993, pp.280-285.
  18. http://www.cs.princeton.edu/-appel/modern/java/JLex/
  19. T. Gruber, "A translation approach to portable ontology specifications," Knowledge Acquisition, vol.5, 1993, pp.199-220. https://doi.org/10.1006/knac.1993.1008
  20. http://tartarus.org/-martin/PorterStemmer/
  21. J. H. Cho and H. G. Molina, "Parallel Crawlers," Technical Report, Stanford University, 2001.
  22. http://johannburkard.de/blog/programming/java/java-wildcard -string-matching.html
  23. Protege, http://protege.stanford.edu/
  24. J. H. Choi and Y. T. Park, "Ontology-based Automated Metadata Generation Considering Semantic Ambiguity," Journal of KISS-Software and Applications, vol.33, no.11, 2006, pp.986-998.
  25. S.S. Kim, S.H. Myaeng, and J.M. Yoo, "A Hybrid Information Retrieval Model Using Metadata and Text," Korean Institute of Information Scientists and Engineers Papers : Database, vol.34, no.3, 2007, pp.232-243.
  26. I.K. Lee, D.S. Hwang, S.T. Seo, and S.H. Kwon, "Ontology Integration based on Meta Ontology," Korean Institute of Intelligent Systems Papers, vol.34, no.3, 2007, pp.604-613.
  27. R. Guha and R. McCool, "Tap: Towards a Web of Data," http://tap.stanford.edu/.
  28. K.-Y. Kim, Y. S. Kim, M. R. Schmeler, "Remote Decision Support for Wheeled Mobility and Seating Devices," Expert Systems with Applications, Vol. 39, No. 8, 2012, pp. 7345-7354. https://doi.org/10.1016/j.eswa.2012.01.083