Corpus-Based Literary Analysis

  • Received: 2013.08.06
  • Reviewed: 2013.08.30
  • Published: 2013.09.28


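As corpus linguistics has rapidly gained ground as a research method in recent years, it has contributed to a deeper understanding of literary texts as well as of linguistic phenomena. Despite this rapid expansion, attempts to reinterpret classics and other literary works on the basis of literary text corpora remain very rare in Korean linguistics. This study therefore used WordSmith, a computer concordance program that serves as an analytical tool of corpus linguistics, to investigate the stylistic characteristics and main themes of literary works that exist as large electronic texts. In particular, the study focuses on keywords, which indicate the principal characteristics of a text, and re-examines Shakespeare's tragedy Romeo and Juliet through corpus linguistic analysis techniques. The study is considered to be of considerable academic significance, and related follow-up research is expected.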

Recently, corpus linguistic analyses have enabled researchers to examine meanings and structural features of data that cannot be detected intuitively. While the potential of corpus linguistic techniques has been established and demonstrated for non-literary data, corpus stylistic analyses of literature have rarely been performed. Specifically, this paper explores keywords and their role in text analysis, which is a primary part of corpus linguistic analysis. It focuses on the application of techniques from corpus linguistics and the interpretation of results, and addresses the question of what is to be gained from keyword analysis by scrutinizing keywords in Shakespeare's Romeo and Juliet.
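
The idea of a keyword, a word that is unusually frequent in a study text relative to a reference corpus, can be illustrated with a minimal sketch. The abstract does not state which statistic or cut-off was used, so the following assumes the log-likelihood measure, one statistic commonly used for keyness, with a hypothetical minimum frequency and a threshold of 3.84 (the chi-square critical value for p < 0.05 at one degree of freedom). The function names and the toy token lists are illustrative only and are not taken from the study or from WordSmith.

```python
import math
from collections import Counter


def log_likelihood(a, b, c, d):
    """Log-likelihood keyness score for one word.

    a: frequency in the study corpus, b: frequency in the reference corpus,
    c: total tokens in the study corpus, d: total tokens in the reference corpus.
    """
    # Expected frequencies if the word were equally likely in both corpora.
    e1 = c * (a + b) / (c + d)
    e2 = d * (a + b) / (c + d)
    ll = 0.0
    if a > 0:
        ll += a * math.log(a / e1)
    if b > 0:
        ll += b * math.log(b / e2)
    return 2 * ll


def keywords(study_tokens, reference_tokens, min_freq=2, threshold=3.84):
    """Return (word, score) pairs for positive keywords, highest score first.

    min_freq and threshold are assumed values, not settings from the paper.
    """
    study = Counter(study_tokens)
    ref = Counter(reference_tokens)
    c, d = sum(study.values()), sum(ref.values())
    results = []
    for word, a in study.items():
        if a < min_freq:
            continue
        b = ref.get(word, 0)
        # Keep only words that are relatively more frequent in the study corpus.
        if a / c <= b / d:
            continue
        score = log_likelihood(a, b, c, d)
        if score >= threshold:
            results.append((word, score))
    return sorted(results, key=lambda pair: -pair[1])


# Toy data standing in for a play and a reference corpus.
study = "love love love love night the the".split()
reference = "the the the the king king war".split()
result = keywords(study, reference)
```

In this toy case only "love" qualifies: "night" falls below the minimum frequency and "the" is relatively more frequent in the reference list. With real texts, the choice of reference corpus and threshold strongly affects which words surface as key.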


