References
- I. Councill, C. Giles, and M.-Y. Kan, "ParsCit: an Open-source CRF Reference String Parsing Package," Proceedings of the 6th International Conference on Language Resources and Evaluation(LREC), 2008.
- R. Kern, and S. Klampfl, "Extraction of References Using Layout and Formatting Information from Scientific Articles," D-Lib Magazine, Vol. 19, No. 9/10, September/October, 2013.
- D. Tkaczyk, "New Methods for Metadata Extraction from Scientific Literature," PhD Thesis, ICM, University of Warsaw, 2015.
- M. Korner, B. Ghavimi, P. Mayr, H. Hartmann, and S. Staab, "Evaluating Reference String Extraction Using Line-Based Conditional Random Fields: A Case Study with German Language Publications," M. Kirikova et al. (Eds.): ADBIS 2017, CCIS 767, pp. 137-145, 2017.
- J. Boyd, "Automatic Metadata Extraction The High Energy Physics Use Case," Master's Thesis, CERN-THESIS-2015-105, 2015.
- Pdfextract, https://www.crossref.org/labs/pdfextract/
- D. Tkaczyk, P. Szostek, M. Fedoryszak, P. Dendek, and L. Bolikowski, "CERMINE: Automatic Extraction of Structured Metadata from Scientific Literature," International Journal on Document Analysis and Recognition(IJDAR), Vol. 18, No. 4, pp. 317-335, December, 2015. https://doi.org/10.1007/s10032-015-0249-8
- P. Lopez, "GROBID: Combining Automatic Bibliographic Data Recognition and Term Extraction for Scholarship Publications," Proceedings of the 13th European Conference on Digital Libraries(ECDL), pp. 473-474, 2009.
- A. Bhardwaj, D. Mercier, A. Dengel, and S. Ahmed, "DeepBIBX: Deep Learning for Image Based Bibliographic Data Extraction," D. Liu et al. (Eds.): ICONIP 2017, Part II, LNCS 10635, pp. 286-293, 2017.
- J. Lafferty, A. McCallum, and F. Pereira, "Conditional Random Fields: Probabilistic Models for Segmenting and Labeling Sequence Data," Proceedings of the 18th International Conference on Machine Learning(ICML), pp. 282-289, 2001.
- S. Bird, R. Dale, B. Dorr, B. Gibson, M. Joseph, M.-Y. Kan, D. Lee, B. Powley, D. Radev, and Y. Tan, "The ACL Anthology Reference Corpus: A Reference Dataset for Bibliographic Research in Computational Linguistics," Proceedings of the 6th International Conference on Language Resources and Evaluation(LREC), 2008.
- S. Anzaroot, and A. McCallum, "A New Dataset for Fine-grained Citation Field Extraction," Proceedings of the ICML Workshop on Peer Reviewing and Publishing Models, 2013.
- US Census Bureau, "Frequently Occurring Surnames from the 2010 Census", https://www.census.gov/topics/population/genealogy/data/2010_surnames.html, 2010.
- Wikipedia: The Free Encyclopedia. Wikimedia Foundation, Inc. 22 July 2004. Web. 10 Aug. 2004.
- CRF++: Yet Another CRF toolkit, https://taku910.github.io/crfpp/
- DBLP, https://dblp.uni-trier.de/