DOI QR코드

DOI QR Code

TheReviser : A Gesture-based Editing System on a Digital Desk

TheReviser : 가상 데스크 상의 제스처 기반 문서 교정 시스템

  • 정기철 (숭실대학교 정보과학대학 미디어학부) ;
  • 강현 (한국전자통신연구원 디지털콘텐츠단)
  • Published : 2004.08.01

Abstract

TheReviser is a digital document revision application on a projection display, which allows us to interact a digital document with the same gestures used for paper documents revision. To enable these interactions, TheReviser should detect foreground objects such as hands or pens on a projection display, and should spot and recognize gesture commands from continuous movements of a user. To detect foreground objects from a complex background in various lighting conditions, we perform geometry and color calibration between a captured image and a frame buffer image. TheReviser uses an HMM-based gesture recognition method Experimental results show that the proposed application recognizes user's gestures on average 93.22% in test gesture sequences.

리바이저 시스템은 프로젝션 화면 상에서 종이 문서의 수정시 사용되는 교정 제스처와 동일한 제스처를 이용한 온라인 문서 교정 시스템이다. 이를 위해, 프로젝션 화면 상에서 손이나 문서와 같은 전경물체추출 기술과 연속 동작으로 부터의 제스처 인식 기술이 필요하다. 많은 조명 변화와 복잡한 배경 상에서 전경물체를 검출하기 위해서 기하보정과 색상보정을 수행하고, HMM 기반 제스처 인식기를 구현하였다. 실험 결과로부터 연속 제스처에서 93.22%의 인식률을 나타남을 볼 수 있다.

Keywords

References

  1. M. Ashdown, P. Robinson, 'The Escritoire : A Personal Projected Display,' Journal of WSCG, Vol.11, No.1, pp.33-40, 2003
  2. M. Black, A. Jepson, 'Recognition Temporal Trajectories using the Condensation Algorithm,' IEEE International Conference on Automatic Face and Gesture Recognition, Japan, pp.16-21, 1998 https://doi.org/10.1109/AFGR.1998.670919
  3. M. H. Coen, 'Design Principles for Intelligent Environments,' Fifteenth National Conference on Artificial Intelligence, (AAAI'98), Madison, WI, 1998
  4. J. Davis, M. Shah, 'Visual gesture recognition,' Vision Image and Signal Processing, Vol.141, No.2, pp.101-106, 1994 https://doi.org/10.1049/ip-vis:19941058
  5. Elliot, M. A. Hearst, 'A Comparison of the Affordances of a Digital Desk and Tablet for Architectural Image Tasks,' International Journal Human-Computer Studies, Vol.56, pp.173-197, 2002 https://doi.org/10.1006/ijhc.2001.0520
  6. R. Hartley, A. Zisserman, Multiple View Geometry in Computer Vision, Cambridge University Press, 2001
  7. http://my.netian.com/~kimbdo/cos/munsil/munsil-41.htm
  8. X. D. Huang, Y. Ariki, M. A. Jack, Hidden Markov Models for Speech Recognition, Edinburgh Univ. Press, 1990
  9. S. Iba, J. M. V. Weghe, C. J. J. Paredis, P. K. Khosla, 'An Architecture for Gesture-based Control of Mobile Robots,' IEEE/RSJ International Conference on Intelligent Robots and Systems, 2, pp.851-857, 1999 https://doi.org/10.1109/IROS.1999.812786
  10. B. Johanson, T. Winograd, A. Fox, 'Interactive Workspaces,' IEEE Computer, Vol.36, pp.99-101, 2003 https://doi.org/10.1109/MC.2003.1193235
  11. R. Kjeldsen, J. Kender, 'Visual Hand Gesture Recognition for Window System Control,' Proceedings on International Workshop on Automatic Face - and Gesture - Recognition (IWAFGR), pp.184-188, 1995
  12. R. Kjeldsen, C. Pinhanez, G. Pingali, J. Hartman, T. Levas, M. Podlaseck, 'Interacting with Steerable Projected Displays,' IEEE International Conference on Automatic Face and Gesture Recognition, pp.20-21, May, 2002
  13. D. C. Lay, Linear Algebra and Its Applications, Addision Wesley, 1994
  14. H. K. Lee, J. H. Kim, 'An HMM-based Threshold Model Approach for Gesture Recognition,' IEEE Transactions on Pattern Analysis and Machine Intelligence, Vol.21, No.10, pp.961-973, 1999 https://doi.org/10.1109/34.799904
  15. K. Oka, Y. Sato, H. Koike, 'Real-Time Fingertip Tracking and Gesture Recognition,' IEEE Computer Graphics and Applications, pp.64-71, 2002 https://doi.org/10.1109/MCG.2002.1046630
  16. V. I. Pavlovic, R. Sharma, T. S. Huang, 'Visual Interpretation of Hand Gestures for Human-Computer Interaction : a Review,' IEEE Transactions on Pattern Analysis and Machine Intelligence, Vol.19, No.7, pp.677-695, 1997 https://doi.org/10.1109/34.598226
  17. F. Quek, 'Toward a Vision-based Human Gesture Interface,' International Conference on Virtual Reality Software and Technology, pp.17-31, 1994
  18. J. Rekimoto, 'SmartSkin : An Infrastructure for Freehand Manipulation on Interactive Surfaces,' CHI 2002, April, 2002
  19. G. Sharma, M. J. Vrhel, H. J. Trussell, 'Color Imaging for Multimedia,' Proceedings of the IEEE, Vol.86, No.6, June, 1998 https://doi.org/10.1109/5.687831
  20. J. Q. Stafford Fraser, P. Robinson, 'BrightBoard : A Video-Augmented Environment,' CHI 1996, ACM, pp.134-141, 1996
  21. T. Starner, A. Pentland, 'Real-Time American Sign Language Recognition from Video Using Hidden Markov Models,' Technical Report TR-375, Media Lab, MIT, 1995
  22. E. H. Stupp, M. S. Brennesholtz, Projection Display, John Wiley & Son, 1999
  23. R. Sukthankar, R. G. Stockton, M. D. Mullin, 'Smarter Presentation : Exploiting Homography in Camera-Projector System,' International Conference on Computer Vision, pp.247-253, 2001 https://doi.org/10.1109/ICCV.2001.937525
  24. P. Wellner, 'The DigitalDesk Calculator : Tactile Manipulation on a Desk Top Display,' ACM Symposium on User Interface Software and Technology (UIST'91), pp.27-33, 1991
  25. P. Wellner, 'Self Caliberation for the DigitalDesk,' Euro-PARC Technical Report EPC-93-109, 1993
  26. H. Kang, H. J. Kim, 'Design of an Interface on PDA for Korean,' IEEE Transactions on Consumer Electronics, Vol.46, No.3, pp.834-838, Aug., 2000 https://doi.org/10.1109/30.883457
  27. Whitaker, J., Benson, B., Standard Handbook of Video and Television Engineering, McGraw-Hill, 2000
  28. http://ww.intel.com/research/mrl/research/opencv/