DOI QR코드

DOI QR Code

Web Catchphrase Improve System Employing Onomatopoeia and Large-Scale N-gram Corpus

  • Yamane, Hiroaki (Department of Information and Computer Science, Keio University) ;
  • Hagiwara, Masafumi (Department of Information and Computer Science, Keio University)
  • 투고 : 2012.02.28
  • 심사 : 2012.03.19
  • 발행 : 2012.03.25

초록

In this paper, we propose a system which improves text catchphrases on the web using onomatopoeia and the Japanese Google N-grams. Onomatopoeia is regarded as a fundamental tool in daily communication for people. The proposed system inserts an onomatopoetic word into plain text catchphrases. Being based on a large catchphrase encyclopedia, the proposed system evaluates each catchphrase's candidates considering the words, structure and usage of onomatopoeia. That is, candidates are selected whether they contain onomatopoeia and they use specific catchphrase grammatical structures. Subjective experiments show that inserted onomatopoeia is effective for making attractive catchphrases.

키워드

참고문헌

  1. C. Kohli, L. Leuthesser, and R. Suri, "Got slogan? guidelines for creating effective slogans," Business Horizons, vol. 50, no. 5, pp. 415-422, 2007. https://doi.org/10.1016/j.bushor.2007.05.002
  2. H. Kitamura, R. Yamaji, and H. Tabuki, Adverting Catchphrase. Yuhikaku, 1981.
  3. Sloganizer.net, "Instant slogans with our slogan generator." http://www.sloganizer.net/en/.
  4. THE-PCMAN-WEBSITE, "Free slogan generator." http://www.thepcmanwebsite.com/media/free slogan generator/index.php.
  5. M. Banko, V. O. Mittal, and M. J. Witbrock, "Headline generation based on statistical translation," in Proceedings of the 38th Annual Meeting on Association for Computational Linguistics, ACL '00, pp. 318-325, 2000.
  6. "Online advertising: A 59 billioneuro market in 2012, up from 31 billion in 2008." http://www.internationaltelevision. org/archive/online-advertising-worldusa- europe 2005-2012.pdf.
  7. "Worldwide internet advertising spending to surpass $106 billion in 2011." http://www.marketingcharts.com/television/worldwideinternet- advertising-spending-to-surpass-106-billionin- 2011-5068/.
  8. C. Kit, "How does lexical acquisition begin? A cognitive perspective," Cognitive Science, vol. 1, no. 1, pp. 1 - 50, 2003.
  9. T. Hashimoto, N. Usui, M. Taira, I. Nose, T. Haji, and S. Kojima, "The neural mechanism associated with the processing of onomatopoeic sounds," NeuroImage, vol. 31, no. 4, pp. 1762 - 1770, 2006. https://doi.org/10.1016/j.neuroimage.2006.02.019
  10. T. Komatsu and H. Akiyama, "Expression system of onomatopoeias for assisting users' intuitive expressions," The Transactions of the Institute of Electronics, Information and Communication Engineers. A, vol. 92, no. 11, pp. 752-763, 2009.
  11. Y. Tomoto, T. Nakamura, M. Kanoh, and T. Komatsu, "Visualization of similarity relationships by onomatopoeia thesaurus map," in Fuzzy Systems (FUZZ), 2010 IEEE International Conference on, pp. 1-6, 2010.
  12. K. Komiya and Y. Kotani, "Classification of Japanese onomatopoeias using hierarchical clustering depending on contexts," in Computer Science and Software Engineering (JCSSE), 2011 Eighth International Joint Conference on, pp. 108-113, 2011.
  13. T. Kudo and H. Kazawa, "Web Japanese N-gram version 1." Gengo Shigen Kyokai, 2007.
  14. S. Yata, "Search system for giga-scale ngram corpus." http://code.google.com/p/ssgnc/.
  15. M. Ono, Japanese Onomatopoeia Dictionary: echoic and imitative words 4500. Shogakukan, 2007.
  16. Y. Kuno, Catalog and Flier Catchphrase Encyclopedia. PIE BOOKS, 2008.
  17. T. Kudo, K. Yamamoto, and Y. Matsumoto, "Applying conditional random fields to Japanese morphological analysis," in In Proc. of EMNLP, pp. 230-237, 2004.
  18. T. Kudo, "Mecab: Yet another partof- speech and morphological analyzer." http://mecab.sourceforge.net/.
  19. "Wikipedia Japanese archive." http://dumps.wikimedia.org/jawiki/.
  20. Kurohashi Laboratory: Graduate School of Informatics in Kyoto University, "KNB Corpus (Kyoto-University and NTT Blog Corpus)." http://nlp.kuee.kyoto-u.ac.jp/kuntt/, 2009.
  21. Uemura Laboratory: Faculty of Environmental Engineering in the University of Kitakyushu, "Hypermedia Corpus of Spoken Japanese." http://www.env.kitakyuu. ac.jp/corpus/texts/index.html, 1996.
  22. T. Sakaki, Y. Mtsuo, K. Uchiyama, and M. Ishizuka, "Construction of related terms thesauri from the web," Journal of natural language processing, vol. 14, no. 2, pp. 3-31, 2007. https://doi.org/10.5715/jnlp.14.2_3