DOI QR코드

DOI QR Code

Research of Adaptive Transformation Method Based on Webpage Semantic Features for Small-Screen Terminals

  • Li, Hao (National Engineering Research Center for E-Learning, Huazhong Normal University) ;
  • Liu, Qingtang (Colleage of Information and Journalism Communication, Huazhong Normal University) ;
  • Hu, Min (National Engineering Research Center for E-Learning, Huazhong Normal University) ;
  • Zhu, Xiaoliang (National Engineering Research Center for E-Learning, Huazhong Normal University)
  • Received : 2012.12.07
  • Accepted : 2013.03.21
  • Published : 2013.10.31

Abstract

Small-screen mobile terminals have difficulty accessing existing Web resources designed for large-screen devices. This paper presents an adaptive transformation method based on webpage semantic features to solve this problem. According to the text density and link density features of the webpages, the webpages are divided into two types: index and content. Our method uses an index-based webpage transformation algorithm and a content-based webpage transformation algorithm. Experiment results demonstrate that our adaptive transformation method is not dependent on specific software and webpage templates, and it is capable of enhancing Web content adaptation on small-screen terminals.

Keywords

References

  1. H. Alam and F. Rahman, "Web Document Manipulation for Small Screen Devices: A Review," Proc. Int. Works. Web Document Anal., 2003, pp. 33-36.
  2. H. Lam and P. Baudisch, "Summary Thumbnails: Readable Overviews for Small Screen Web Browsers," Proc. CHI, Portland, OR, USA, Apr. 2005, pp. 681-690.
  3. S. Saha, M. Jamtgaard, and J. Villasenor, "Bringing the Wireless Internet to Mobile Devices," Computer, vol. 34, no. 6, 2001, pp. 54-58.
  4. S.J. Barnes and B. Corbitt, "Mobile Banking: Concept and Potential," Int. J. Mobile Commun., vol. 1, no. 3, 2003, pp. 273-288. https://doi.org/10.1504/IJMC.2003.003494
  5. D. Cai et al., "Vips: A Vision Based Page Segmentation Algorithm," Technical Report MSR-TR-2003-79, Microsoft Research, 2003.
  6. S. Baluja, "Browsing on Small Screens: Recasting Web-Page Segmentation into an Efficient Machine Learning Framework," Proc. 15th Int. Conf. World Wide Web, Edinburgh, Scotland, May 23-26, 2006, pp. 33-42.
  7. D. Chakrabarti, R. Kumar, and K. Punera, "A Graph-Theoretic Approach to Webpage Segmentation," Proc. 17th Int. Conf. World Wide Web, Beijing, China, Apr. 21-25, 2008, pp. 377-386.
  8. P. Xiang, X. Yang, and Y. Shi, "Web Page Segmentation Based on Gestalt Theory," Proc. IEEE Int. Conf. Multimedia Expo, 2007, pp. 2253-2256.
  9. S.J.H. Yang et al., "Applying Semantic Segment Detection to Enhance Web Page Presentation on the Mobile Internet," J. Inf. Sci. Eng., vol. 27, no. 2, 2011, pp. 697-713.
  10. J. Deng et al., "The Web Data Extracting and Application for Shop Online Based on Commodities Classified," Comput. Intell. Syst., 2011, vol. 234, pp. 189-197. https://doi.org/10.1007/978-3-642-24091-1_26
  11. C. Kohlschütter and W. Nejdl, "A Densitometric Approach to Web Page Segmentation," Proc. 17th ACM Conf. Inf. Knowl. Manag., Napa Valley, CA, USA, Oct. 26-30, 2008, pp. 1173-1182.
  12. R. Gyorodi et al., "Web Page Analysis Based on HTML DOM and Its Usage for Forum Statistics, Alerts and Geo Targeted Data Retrieval," WSEAS Trans. Comput., vol. 9, no. 8, 2010, pp. 822-831.
  13. K. Vieira et al., "A Fast and Robust Method for Web Page Template Detection and Removal," Proc. 15th ACM Int. Conf. Inf. Knowl. Manag., Nov. 06-11, 2006, pp. 258-267.
  14. D. Cai et al., "Extracting Content Structure for Web Pages Based on Visual Representation," Web Technol. Appl., vol. 2642, 2003, pp. 406-417. https://doi.org/10.1007/3-540-36901-5_42
  15. S. Gupta et al., "DOM-Based Content Extraction of HTML Documents," Proc. 12th Int. Conf. World Wide Web, May 20-24, 2003, pp. 207-214.