• 제목/요약/키워드: lexicon-based analysis

검색결과 56건 처리시간 0.028초

전문용어의 정의문 분석 (An analysis of terminological definitions)

  • 이해윤
    • 한국독어학회지:독어학
    • /
    • 제7집
    • /
    • pp.145-163
    • /
    • 2003
  • In this paper, we examined various definitions of terminological definition for the extraction of terminological information from corpora. After we reviewed researches at the lexicography and at the terminology, we introduced the qualia structure of Generative Lexicon (Pustejovsky 1995) for the purpose of analyzing terminological definitions. By means of the qualia structure, we analyzed the definitions which are presented at the terminological dictionaries. As a result, we confirmed that the terminological definitions can be discomposed into 4 subtypes of qualia structure. Based on this examination, we analyzed terminological definitions of articles at a newspaper and showed the usefulness of the qualia structure at the extraction of terminological definitions from the corpora.

  • PDF

Extracting and Clustering of Story Events from a Story Corpus

  • Yu, Hye-Yeon;Cheong, Yun-Gyung;Bae, Byung-Chull
    • KSII Transactions on Internet and Information Systems (TIIS)
    • /
    • 제15권10호
    • /
    • pp.3498-3512
    • /
    • 2021
  • This article describes how events that make up text stories can be represented and extracted. We also address the results from our simple experiment on extracting and clustering events in terms of emotions, under the assumption that different emotional events can be associated with the classified clusters. Each emotion cluster is based on Plutchik's eight basic emotion model, and the attributes of the NLTK-VADER are used for the classification criterion. While comparisons of the results with human raters show less accuracy for certain emotion types, emotion types such as joy and sadness show relatively high accuracy. The evaluation results with NRC Word Emotion Association Lexicon (aka EmoLex) show high accuracy values (more than 90% accuracy in anger, disgust, fear, and surprise), though precision and recall values are relatively low.

감정과 날씨 정보에 따른 의상 추천 시스템 (Clothing-Recommendation system based on emotion and weather information)

  • 일홈존;박두순
    • 한국정보처리학회:학술대회논문집
    • /
    • 한국정보처리학회 2021년도 추계학술발표대회
    • /
    • pp.528-531
    • /
    • 2021
  • Nowadays recommendation systems are so ubiquitous, where our many decisions are being done by the means of them. We can see recommendation systems in all areas of our daily life. Therefore the research of this sphere is still so active. So far many research papers were published for clothing recommendations as well. In this paper, we propose the clothing-recommendation system according to user emotion and weather information. We used social media to analyze users' 6 basic emotions according to Paul Eckman theory and match the colour of clothing. Moreover, getting weather information using visualcrossing.com API to predict the kind of clothing. For sentiment analysis, we used Emotion Lexicon that was created by using Mechanical Turk. And matching the emotion and colour was done by applying Hayashi's Quantification Method III.

AMR-CNN: Abstract Meaning Representation with Convolution Neural Network for Toxic Content Detection

  • Ermal Elbasani;Jeong-Dong Kim
    • Journal of Web Engineering
    • /
    • 제21권3호
    • /
    • pp.677-692
    • /
    • 2022
  • Recognizing the offensive, abusive, and profanity of multimedia content on the web has been a challenge to keep the web environment for user's freedom of speech. As profanity filtering function has been developed and applied in text, audio, and video context in platforms such as social media, entertainment, and education, the number of methods to trick the web-based application also has been increased and became a new issue to be solved. Compared to commonly developed toxic content detection systems that use lexicon and keyword-based detection, this work tries to embrace a different approach by the meaning of the sentence. Meaning representation is a way to grasp the meaning of linguistic input. This work proposed a data-driven approach utilizing Abstract meaning Representation to extract the meaning of the online text content into a convolutional neural network to detect level profanity. This work implements the proposed model in two kinds of datasets from the Offensive Language Identification Dataset and other datasets from the Offensive Hate dataset merged with the Twitter Sentiment Analysis dataset. The results indicate that the proposed model performs effectively, and can achieve a satisfactory accuracy in recognizing the level of online text content toxicity.

A Study on Efficient Market Hypothesis to Predict Exchange Rate Trends Using Sentiment Analysis of Twitter Data

  • Komariah, Kokoy Siti;Machbub, Carmadi;Prihatmanto, Ary S.;Sin, Bong-Kee
    • 한국멀티미디어학회논문지
    • /
    • 제19권7호
    • /
    • pp.1107-1115
    • /
    • 2016
  • Efficient Market Hypothesis (EMH), states that at any point in time in a liquid market security prices fully reflect all available information. This paper presents a study of proving the hypothesis through daily Twitter sentiments using the hybrid approach of the lexicon-based approach and the naïve Bayes classifier. In this research we analyze the currency exchange rate movement of Indonesia Rupiah vs US dollar as a way of testing the Efficient Market Hypothesis. In order to find a correlation between the prediction sentiments from Twitter data and the actual currency exchange rate trends we collect Twitter data every day and compute the overall sentiment to label them as positive or negative. Experimental results have shown 69% correct prediction of sentiment analysis and 65.7% correlation with positive sentiments. This implies that EMH is semi-strong Efficient Market Hypothesis, and that public information provide by Twitter sentiment correlate with changes in the exchange market trends.

Understanding the Sentiment on Gig Economy: Good or Bad?

  • NORAZMI, Fatin Aimi Naemah;MAZLAN, Nur Syazwani;SAID, Rusmawati;OK RAHMAT, Rahmita Wirza
    • The Journal of Asian Finance, Economics and Business
    • /
    • 제9권10호
    • /
    • pp.189-200
    • /
    • 2022
  • The gig economy offers many advantages, such as flexibility, variety, independence, and lower cost. However, there are also safety concerns, lack of regulations, uncertainty, and unsatisfactory services, causing people to voice their opinion on social media. This paper aims to explore the sentiments of consumers concerning gig economy services (Grab, Foodpanda and Airbnb) through the analysis of social media. First, Vader Lexicon was used to classify the comments into positive, negative, and neutral sentiments. Then, the comments were further classified into three machine learning algorithms: Support Vector Machine, Light Gradient Boosted Machine, and Logistic Regression. Results suggested that gig economy services in Malaysia received more positive sentiments (52%) than negative sentiments (19%) and neutral sentiments (29%). Based on the three algorithms used in this research, LGBM has been the best model with the highest accuracy of 85%, while SVM has 84% and LR 82%. The results of this study proved the power of text mining and sentiment analysis in extracting business value and providing insight to businesses. Additionally, it aids gig managers and service providers in understanding clients' sentiments about their goods and services and making necessary adjustments to optimize satisfaction.

정보 중립성 확보를 위한 인터넷 뉴스 댓글의 정치성향 분석 (Political Information Filtering on Online News Comment)

  • 최혜봉;김재홍;이지현;이민구
    • 문화기술의 융합
    • /
    • 제6권4호
    • /
    • pp.575-582
    • /
    • 2020
  • 본 연구는 인터넷 뉴스 댓글 빅데이터 분석을 통해 뉴스 댓글 사용자의 정치적 성향을 추정하는 방법을 제안한다. 인터넷 뉴스 댓글과 작성자의 정치 성향을 함께 제공하여 디지털 매체를 통한 정보 전달의 객관성과 중립성을 확보하고자 한다. 250만 건 이상의 인터넷 뉴스 댓글의 특성을 분석하고 사용자의 정치적 성향을 효과적으로 추정하기 위한 특징을 추출한다. 어휘사전 기반 알고리즘과 유사도 기반 알고리즘을 제안하고 실험을 통해 두 알고리즘을 비교하고 효과를 검증한다.

신뢰성있는 온라인 고객 리뷰 텍스트 마이닝 기반 식당 개별 음식 아이템 평가 (Rating Individual Food Items of Restaurant Menu based on Online Customer Reviews using Text Mining Technique)

  • 무자밀 후세인 사이드;정선태
    • 한국정보처리학회:학술대회논문집
    • /
    • 한국정보처리학회 2020년도 춘계학술발표대회
    • /
    • pp.389-392
    • /
    • 2020
  • The growth in social media, blogs and restaurant listing directories have led to increasing customer reviews about restaurants, their quality of food items and services available on the internet. These user reviews offer a massive amount of valuable information that can be used for various decision-making purposes. Currently, most food recommendation sites provide recommendation scores about restaurants rather than food items of the restaurant and the provided recommendation scores may be biased since they are calculated only from user reviews listed only in their sites. Usually, people wants a reliable recommendation about foods, not restaurant. In this paper, we present a reliable Korean food items rating method; we first extract food items by applying NER technique to restaurant reviews collected from many Korean restaurant recommendation web sites, blogs and web data. Then, we apply lexicon-based sentiment analysis on collected user reviews and predict people's opinions as sentiment polarity scores (+1 for positive; -1 for negative; 0 for neutral). Finally, by taking average of all calculated polarity scores about a food item, we obtain a rating to individual menu items of the restaurant. The proposed food item rating is more reliable since it does not depend on reviews of only one site.

Anatomy of Sentiment Analysis of Tweets Using Machine Learning Approach

  • Misbah Iram;Saif Ur Rehman;Shafaq Shahid;Sayeda Ambreen Mehmood
    • International Journal of Computer Science & Network Security
    • /
    • 제23권10호
    • /
    • pp.97-106
    • /
    • 2023
  • Sentiment analysis using social network platforms such as Twitter has achieved tremendous results. Twitter is an online social networking site that contains a rich amount of data. The platform is known as an information channel corresponding to different sites and categories. Tweets are most often publicly accessible with very few limitations and security options available. Twitter also has powerful tools to enhance the utility of Twitter and a powerful search system to make publicly accessible the recently posted tweets by keyword. As popular social media, Twitter has the potential for interconnectivity of information, reviews, updates, and all of which is important to engage the targeted population. In this work, numerous methods that perform a classification of tweet sentiment in Twitter is discussed. There has been a lot of work in the field of sentiment analysis of Twitter data. This study provides a comprehensive analysis of the most standard and widely applicable techniques for opinion mining that are based on machine learning and lexicon-based along with their metrics. The proposed work is helpful to analyze the information in the tweets where opinions are highly unstructured, heterogeneous, and polarized positive, negative or neutral. In order to validate the performance of the proposed framework, an extensive series of experiments has been performed on the real world twitter dataset that alter to show the effectiveness of the proposed framework. This research effort also highlighted the recent challenges in the field of sentiment analysis along with the future scope of the proposed work.

온톨로지 기반의 소프트웨어 설계에러검출방법 (A Method based on Ontology for detecting errors in the Software Design)

  • 서진원;김영태;공헌택;임재현;김치수
    • 한국산학기술학회논문지
    • /
    • 제10권10호
    • /
    • pp.2676-2683
    • /
    • 2009
  • 본 논문은 소프트웨어 설계 시 향상된 오류 검출방법을 통해서 소프트웨어 설계의 질을 향상시켜 그에 따른 소프트웨어 제품의 질을 향상시키는데 목적을 두고 있다. 본 논문에서 오류검출의 범위는 일관성결여 오류로 제한하여 일관성 오류에 관한 명세에 초점을 맞춘다. UML 표현의 문제점인 의미 일관성 표현의 한계를 극복하기 위해 ODES 모델을 제안하였으며 검증방법으로 일관성 검사 방법을 제안한다. UML 설계에서 확인된 의미적으로 중요한 특징이 ODES 모델로 구현되며, UML 모델을 ODES 모델로 변환과정에서의 일관성검사방법을 제시한다. ODES 모델로의 변환과정은 ODES 모델의 인스턴스를 생성하기 위한 알고리즘에서 복수의 사상테이블을 이용하는 소프트웨어 설계의 어휘분석과 의미분석을 포함한다.