• Title/Summary/Keyword: Sentence analysis

Search Result 493, Processing Time 0.019 seconds

Impact of Semantic Characteristics on Perceived Helpfulness of Online Reviews (온라인 상품평의 내용적 특성이 소비자의 인지된 유용성에 미치는 영향)

  • Park, Yoon-Joo;Kim, Kyoung-jae
    • Journal of Intelligence and Information Systems
    • /
    • v.23 no.3
    • /
    • pp.29-44
    • /
    • 2017
  • In Internet commerce, consumers are heavily influenced by product reviews written by other users who have already purchased the product. However, as the product reviews accumulate, it takes a lot of time and effort for consumers to individually check the massive number of product reviews. Moreover, product reviews that are written carelessly actually inconvenience consumers. Thus many online vendors provide mechanisms to identify reviews that customers perceive as most helpful (Cao et al. 2011; Mudambi and Schuff 2010). For example, some online retailers, such as Amazon.com and TripAdvisor, allow users to rate the helpfulness of each review, and use this feedback information to rank and re-order them. However, many reviews have only a few feedbacks or no feedback at all, thus making it hard to identify their helpfulness. Also, it takes time to accumulate feedbacks, thus the newly authored reviews do not have enough ones. For example, only 20% of the reviews in Amazon Review Dataset (Mcauley and Leskovec, 2013) have more than 5 reviews (Yan et al, 2014). The purpose of this study is to analyze the factors affecting the usefulness of online product reviews and to derive a forecasting model that selectively provides product reviews that can be helpful to consumers. In order to do this, we extracted the various linguistic, psychological, and perceptual elements included in product reviews by using text-mining techniques and identifying the determinants among these elements that affect the usability of product reviews. In particular, considering that the characteristics of the product reviews and determinants of usability for apparel products (which are experiential products) and electronic products (which are search goods) can differ, the characteristics of the product reviews were compared within each product group and the determinants were established for each. This study used 7,498 apparel product reviews and 106,962 electronic product reviews from Amazon.com. In order to understand a review text, we first extract linguistic and psychological characteristics from review texts such as a word count, the level of emotional tone and analytical thinking embedded in review text using widely adopted text analysis software LIWC (Linguistic Inquiry and Word Count). After then, we explore the descriptive statistics of review text for each category and statistically compare their differences using t-test. Lastly, we regression analysis using the data mining software RapidMiner to find out determinant factors. As a result of comparing and analyzing product review characteristics of electronic products and apparel products, it was found that reviewers used more words as well as longer sentences when writing product reviews for electronic products. As for the content characteristics of the product reviews, it was found that these reviews included many analytic words, carried more clout, and related to the cognitive processes (CogProc) more so than the apparel product reviews, in addition to including many words expressing negative emotions (NegEmo). On the other hand, the apparel product reviews included more personal, authentic, positive emotions (PosEmo) and perceptual processes (Percept) compared to the electronic product reviews. Next, we analyzed the determinants toward the usefulness of the product reviews between the two product groups. As a result, it was found that product reviews with high product ratings from reviewers in both product groups that were perceived as being useful contained a larger number of total words, many expressions involving perceptual processes, and fewer negative emotions. In addition, apparel product reviews with a large number of comparative expressions, a low expertise index, and concise content with fewer words in each sentence were perceived to be useful. In the case of electronic product reviews, those that were analytical with a high expertise index, along with containing many authentic expressions, cognitive processes, and positive emotions (PosEmo) were perceived to be useful. These findings are expected to help consumers effectively identify useful product reviews in the future.

Subject-Balanced Intelligent Text Summarization Scheme (주제 균형 지능형 텍스트 요약 기법)

  • Yun, Yeoil;Ko, Eunjung;Kim, Namgyu
    • Journal of Intelligence and Information Systems
    • /
    • v.25 no.2
    • /
    • pp.141-166
    • /
    • 2019
  • Recently, channels like social media and SNS create enormous amount of data. In all kinds of data, portions of unstructured data which represented as text data has increased geometrically. But there are some difficulties to check all text data, so it is important to access those data rapidly and grasp key points of text. Due to needs of efficient understanding, many studies about text summarization for handling and using tremendous amounts of text data have been proposed. Especially, a lot of summarization methods using machine learning and artificial intelligence algorithms have been proposed lately to generate summary objectively and effectively which called "automatic summarization". However almost text summarization methods proposed up to date construct summary focused on frequency of contents in original documents. Those summaries have a limitation for contain small-weight subjects that mentioned less in original text. If summaries include contents with only major subject, bias occurs and it causes loss of information so that it is hard to ascertain every subject documents have. To avoid those bias, it is possible to summarize in point of balance between topics document have so all subject in document can be ascertained, but still unbalance of distribution between those subjects remains. To retain balance of subjects in summary, it is necessary to consider proportion of every subject documents originally have and also allocate the portion of subjects equally so that even sentences of minor subjects can be included in summary sufficiently. In this study, we propose "subject-balanced" text summarization method that procure balance between all subjects and minimize omission of low-frequency subjects. For subject-balanced summary, we use two concept of summary evaluation metrics "completeness" and "succinctness". Completeness is the feature that summary should include contents of original documents fully and succinctness means summary has minimum duplication with contents in itself. Proposed method has 3-phases for summarization. First phase is constructing subject term dictionaries. Topic modeling is used for calculating topic-term weight which indicates degrees that each terms are related to each topic. From derived weight, it is possible to figure out highly related terms for every topic and subjects of documents can be found from various topic composed similar meaning terms. And then, few terms are selected which represent subject well. In this method, it is called "seed terms". However, those terms are too small to explain each subject enough, so sufficient similar terms with seed terms are needed for well-constructed subject dictionary. Word2Vec is used for word expansion, finds similar terms with seed terms. Word vectors are created after Word2Vec modeling, and from those vectors, similarity between all terms can be derived by using cosine-similarity. Higher cosine similarity between two terms calculated, higher relationship between two terms defined. So terms that have high similarity values with seed terms for each subjects are selected and filtering those expanded terms subject dictionary is finally constructed. Next phase is allocating subjects to every sentences which original documents have. To grasp contents of all sentences first, frequency analysis is conducted with specific terms that subject dictionaries compose. TF-IDF weight of each subjects are calculated after frequency analysis, and it is possible to figure out how much sentences are explaining about each subjects. However, TF-IDF weight has limitation that the weight can be increased infinitely, so by normalizing TF-IDF weights for every subject sentences have, all values are changed to 0 to 1 values. Then allocating subject for every sentences with maximum TF-IDF weight between all subjects, sentence group are constructed for each subjects finally. Last phase is summary generation parts. Sen2Vec is used to figure out similarity between subject-sentences, and similarity matrix can be formed. By repetitive sentences selecting, it is possible to generate summary that include contents of original documents fully and minimize duplication in summary itself. For evaluation of proposed method, 50,000 reviews of TripAdvisor are used for constructing subject dictionaries and 23,087 reviews are used for generating summary. Also comparison between proposed method summary and frequency-based summary is performed and as a result, it is verified that summary from proposed method can retain balance of all subject more which documents originally have.

Dedicatory Inscriptions on the Amitabha Buddha and Maitreya Bodhisattva Sculptures of Gamsansa Temple (감산사(甘山寺) 아미타불상(阿彌陁佛像)과 미륵보살상(彌勒菩薩像) 조상기(造像記)의 연구)

  • Nam, Dongsin
    • MISULJARYO - National Museum of Korea Art Journal
    • /
    • v.98
    • /
    • pp.22-53
    • /
    • 2020
  • This paper analyzes the contents, characteristics, and historical significance of the dedicatory inscriptions (josanggi) on the Amitabha Buddha and the Maitreya Bodhisattva statues of Gamsansa Temple, two masterpieces of Buddhist sculpture from the Unified Silla period. In the first section, I summarize research results from the past century (divided into four periods), before presenting a new perspective and methodology that questions the pre-existing notion that the Maitreya Bodhisattva has a higher rank than the Amitabha Buddha. In the second section, through my own analysis of the dedicatory inscriptions, arrangement, and overall appearance of the two images, I assert that the Amitabha Buddha sculpture actually held a higher rank and greater significance than the Maitreya Bodhisattva sculpture. In the third section, for the first time, I provide a new interpretation of two previously undeciphered characters from the inscriptions. In addition, by comparing the sentence structures from the respective inscriptions and revising the current understanding of the author (chanja) and calligrapher (seoja), I elucidate the possible meaning of some ambiguous phrases. Finally, in the fourth section, I reexamine the content of both inscriptions, differentiating between the parts relating to the patron (josangju), the dedication (josang), and the prayers of the patrons or donors (balwon). In particular, I argue that the phrase "for my deceased parents" is not merely a general axiom, but a specific reference. To summarize, the dedicatory inscriptions can be interpreted as follows: when Kim Jiseong's parents died, they were cremated and he scattered most of their remains by the East Sea. But years later, he regretted having no physical memorial of them to which to pay his respects. Thus, in his later years, he donated his estate on Gamsan as alms and led the construction of Gamsansa Temple. He then commissioned the production of the two stone sculptures of Amitabha Buddha and Maitreya Bodhisattva for the temple, asking that they be sculpted realistically to reflect the actual appearance of his parents. Finally, he enshrined the remains of his parents in the sculptures through the hole in the back of the head (jeonghyeol). The Maitreya Bodhisattva is a standing image with a nirmanakaya, or "transformation Buddha," on the crown. As various art historians have pointed out, this iconography is virtually unprecedented among Maitreya images in East Asian Buddhist sculpture, leading some to speculate that the standing image is actually the Avalokitesvara. However, anyone who reads the dedicatory inscription can have no doubt that this image is in fact the Maitreya. To ensure that the sculpture properly embodied his mother (who wished to be reborn in Tushita Heaven with Maitreya Bodhisattva), Kim Jiseong combined the iconography of the Maitreya and Avalokitesvara (the reincarnation of compassion). Hence, Kim Jiseong's deep love for his mother motivated him to modify the conventional iconography of the Maitreya and Avalokitesvara. A similar sentiment can be found in the sculpture of Amitabha Buddha. To this day, any visitor to the temple who first looks at the sculptures from the front before reading the text on the back will be deeply touched by the filial love of Kim Jiseong, who truly cherished the memory of his parents.