Study on Measuring Geometrical Modification of Document Image in Scanning Process

스캐닝 과정에서 발생하는 전자문서의 기하학적 변형감지에 관한 연구

  • Oh, Dong-Yeol (Department of Computer Science, SoongSil University) ;
  • Oh, Hae-Seok (Department of Computer Science, Kyungwon University) ;
  • Rhew, Sung-Yul (Department of Computer Science, SoongSil University)
  • 오동열 (숭실대학교 IT대학 컴퓨터학과) ;
  • 오해석 (경원대학교 IT대학 컴퓨터 공학) ;
  • 류성열 (숭실대학교 IT대학 컴퓨터학과)
  • Published : 2009.08.31


Scanner which is a kind of optical devices is used to convert paper documents into document image files. The assessment of scanned document image is performed to check if there are any modification on document image files in scanning process. In assessment of scanned documents, user checks the degree of skew, noise, folded state and etc This paper proposed to how to measure geometrical modifications of document image in scanning process. In this study, we check the degree of modification in document image file by image processing and we compare the evaluation value which means the degree of modification in each items with OCR success ratio in a document image file. To analyse the correlation between OCR success ratio and the evaluation value which means the degree of modification in each items, we apply Pearson Correlation Coefficient and calculate weight value for each items to score total evaluation value of image modification degrees on a image file. The document image which has high rating score by proposed method also has high OCR success ratio.


Quality Assurance;Document Image Quality Check;Document Image Processing


  1. M. Cannon, P. Kelly, S. Sitharama Iyenger and Nathan Brener, "An automated system for numerically rating document image quality", Proceedings 1997 Symposium on Document Image Understanding Technology, pp. 161-167, 1997.
  2. Henry S. Baird, N, "Document Image Quality : Making Fine Discriminations" Document Analysis and Recognition ICDAR '99, pp. 459-462, 1999.
  3. Souza A, Cheriet M, Naio S, Suen C.Y "Automatic filter Selection Using Image Quality Assesment", Proceedings of the 7th international conference on document analysis and recognition, pp.508-512, 2003.
  4. Lu, S.J. and Chen, B.M. and Ko, C.C., "Perspective rectification of document images using fuzzy set and morphological operations", Image and Vision Computing Journal Vol. 23, pp. 541-553, 2005.
  5. Wei Dong, Qian Yu, Zhang C N, Hua Li, "Image Quality Assessment Using Rough Fuzzy Integrals", 27th International conference on Distributed Computing System Workshops, pp. 1-5, 2007.
  6. M. Sezgin, B. Sankur, "Survey over image thresholding techniques and quantitative performance evaluation", Journal of Electronic Image, Vol. 13 pp. 146-168, 2004.
  7. 조인성, 이지홍, 오상진, "사용자 중심의 유연한 실시간 머신비전 검사시스템 개발", 전자공학회논문지, Vol.45 pp. 42-50, 2008.
  8. Hoque, S. Sirlantzis, K. Fairhurst, M.C, "A new chain-code quantization approach enabling high performance handwriting recognition based on multiclassifier schemes", Document Analysis and Recognition, pp. 834-838, 2003
  9. 이규원, 우동민, "항공영상으로부터 에지 맴의 체인코드 추적에 의한 선소추출", 한국지능시스템학회논문지, pp. 709-713, 2005.
  10. Nandini N, Srikanta M. K, G. Hemantha, "Estimation of Skew Angle in Binary Document Images Using Hough Transform", Proceedings of world academy of science, engineering and technology vol 3, pp. 50-55, 2008.
  11. 김진흠, 김민호, "변수선택 편향이 없는 회귀나무를 만들기 위한 알고리즘", 한국 통계 학회, pp. 459-473, 2004.
  12. 박지선, 김택헌, 류영석, 양성봉 "추천 시스템을 위한 2 way 협동적 필터링 방법을 이용한 예측 알고리즘", 정보과학회논문지, Vol. 29, pp. 669-675, 2002.
  13. Maya R.Gupta, Nathaniel P, "OCR binarization and image pre-processing for searching historical documents", Pattern Recognition Volume 40, pp. 389-397, 2007.