References
- R. Mohemad, A.R. Hamdan, Z.A. Othman, and N.M.M. Noor, "Automatic Document Structure Analysis of Structured PDF Files," International Journal of New Computer Architectures and their Applications (IJNCAA), vol. 1, no. 2, pp. 404-411, August, 2011.
- J. Kim, D. X. Le, and G.R. Thoma, "Automated labeling in document images," in Proc. of SPIE Conference on Document Recognition and Retrieval VIII, vol. 4307, pp. 111-122, January, 2001.
- D. Niyogi and S.N. Srihari, "Knowledge-based derivation of document logical structure," in Proc. of International Conference on Document Analysis and Recognition, pp. 472-475, August 14 - 15, 1995.
- R. Rauf, M. Antkiewicz, and K. Czarnecki, "Logical structure extraction from software requirements documents," in Proc. of 19th IEEE International Requirements Engineering Conference, pp. 101-110, August 29, 2011.
- Kan, Min-Yen, Luong, Minh-Thang, "Logical Structure Recovery in Scholarly Articles with Rich Document Features," International Journal of Digital Library Systems, vol. 1, no. 4, pp. 1-23, October, 2010. https://doi.org/10.4018/jdls.2010100101
- S. Mao, Z. Xu, T. Tjahjadi, and G. R. Thoma, "Logical Entity Recognition in Multi-Style Document Page Images," in Proc. of 18th International Conference on Pattern Recognition, pp. 876-879, August 20-24, 2006.
- L. Breiman, "Random Forests," Machine Learning, vol. 45, no. 1, pp. 5-32, October, 2001. https://doi.org/10.1023/A:1010933404324
- C. Cortes and V. Vapnik, "Support-vector networks," Machine Learning, vol. 20, no. 3, pp. 273-297, July, 1995. https://doi.org/10.1007/BF00994018
- David W. Aha, Dennis F. Kibler, Marc K. Albert, "Instance-based learning algorithms," Machine Learning, vol. 6, pp. 37-66, January, 1991.
- S. Russell and P. Norvig, Artificial Intelligence: A Modern Approach, 2nd Edition, Prentice Hall, New Jersey, 2003.
- C. Bishop, Pattern recognition and machine learning, Springer, Berlin, 2006.
- Weka Home Page. (Available at http://www.cs.waikato.ac.nz/ml/weka/).
- Docx4j Enterprise Edition Homepage. (Available at http://www.docx4java.org/trac/docx4j).
- T. Mitchell. Machine Learning, The Mc-Graw-Hill, New York, 1997.
- W.B. Frakes and R. Baeza-Yates, Information Retrival : Data Structures and Algorithms, Prentice-Hall, New Jersey, 1992.
- J. D. Lafferty, A. McCallum, and F. C. N. Pereira, "Conditional Random Fields: Probabilistic Models for Segmenting and Labeling Sequence Data," in Proc. of the Eighteenth International Conference on Machine Learning, pp. 282-289, June 28 - July 1, 2001.
- I.H. Witten, E. Frank, and M.A. Hall, Data Mining: Practical Machine Learning Tools and Techniques, Third Edition, Morgan Kaufmann, Burlington, 2011.
- S. Klampfl, M. Granitzer, K. Jack and R. Kern, "Unsupervised document structure analysis of digital scientific articles," International Journal on Digital Libraries, vol. 14, Issue 3-4, pp. 83-99, August, 2014. https://doi.org/10.1007/s00799-014-0115-1
- S. Klampfl and R. Kern, "Machine Learning Techniques for Automatically Extracting Contextual Information from Scientific Publications," Semantic Web Evaluation Challenges - Second SemWebEval Challenge at ESWC 2015, pp. 105-116 , May 31 - June 4, 2015.
- S. Klampfl and R. Kern, "Reconstructing the logical structure of a Scientific Publication Using Machine Learning," in Proc. of Semantic Web Challenges - Third SemWebEval Challenge at ESWC 2016, pp. 255-268, May 29 - June 2, 2016.
- J. Lafferty , A. McCallum and F. Pereira, "Conditional random fields: Probabilistic models for segmenting and labeling sequence data," in Proc. of the Eighteenth International Conference on Machine Learning, pp. 282-289, June 28 - July 1, 2001.
- Ora Lassila, Ralph R. Swick, Resource Description Framework (RDF) Model and Syntax Specification. 1999. (Available at https://www.w3.org/TR/1999/REC-rdf-syntax-19990222/).
- L. Liu and M. Tamer Ozsu, Encyclopedia of Database Systems. Springer, Berlin, 2009.