References
- 정영미. 2005. '정보검색연구'. 서울: 구미무역출판부
- 한광록, 선복근, 유형선. 2007. 웹 뉴스의 기사추출과 요약. '한국 컴퓨터정보학회 논문집', 12(5): 1-10
- Cadenhead, Tyrone, Jinlin Chen, and Terry Cook. 2008. 'Improving web information indexing and retrieval based on center block duplication detection.' International Journal of Innovative Computing and Applications, 1(3): 194-204 https://doi.org/10.1504/IJICA.2008.019687
- Debnath, Sandip, Prasenjit Mitra, and C. Lee Giles. 2005. 'Automatic extraction of informative blocks from webpages.' Proceedings of the 2005 ACM Symposium on Applied Computing, 1722-1726
- Etzioni, Oren. 1996. 'The world wide web: Quagmire or gold mine.' Communications of the ACM, 39(11): 65-68 https://doi.org/10.1145/240455.240473
- Gupta, S., K. Kaiser, D. Neistadt, and P. Grimm. 2003. 'DOM-based content extraction of HTML documents.' Proceedings of the 12th International Conference on World Wide Web, 249- 256
- Lin, Shian-Hua and Jan-Ming Ho. 2002. 'Discovering informative content blocks from web documents.' Proceedings of the Eighth ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, 588-593
- Reis, Davi Castro, Paulo Golgher, Altigran Silva, and Alberto Leaender. 2003. 'Automatic web news extraction using tree edit distance.' Proceedings of the 13th International Conference on World Wide Web, 502-511
- Sebastiani, Fabrizio. 2002. 'Machine learning in automated text categorization.' ACM Computing Surveys, 34(1): 1-47 https://doi.org/10.1145/505282.505283
- Song, Ruihua, Haifeng Liu, Ji-Rong Wen, and Wei-Ying Ma. 2004. 'Learning block importance models for web pages.' Proceedings of the 13th International Conference on World Wide Web, 203-111
- Vitali, Fabio, Angelo Di Iorio, and Elisa Ventura Campori. 2004. 'Rule-Based Structural Analysis of Web Pages.' Document Analysis Systems VI, 425-437 https://doi.org/10.1007/b100557
- Yi, Lan, Bing Liu, and Xiaoli Li. 2003. 'Eliminating noisy information in Web pages for data mining.' Proceedings of the Ninth ACM SIGKDD International Conference on Knowledge Discovery and data Mining, 296-305
- Yu, Shipeng, Deng Cai, Ji-Rong Wen, and Wei-Ying Ma. 2003. 'Improving pseudorelevance feedback in web information retrieval using web page segmentation.' Proceedings of the 12th International Conference on World Wide Web, 11-18
Cited by
- Text Extraction Algorithm using the HTML Logical Structure Analysis vol.16, pp.3, 2015, https://doi.org/10.9728/dcs.2015.16.3.445