• Title/Summary/Keyword: topic modelling

Search Result 55, Processing Time 0.031 seconds

Text Network Analysis of Korean Trade Stakeholder's Interactions - A Focus on the Trade Ministry and the Legislature (통상 이해관계자 간 상호작용 관련 텍스트 네트워크 분석(TNA) - 한국 통상부처와 입법부 관계를 중심으로)

  • Bomin Ko
    • Korea Trade Review
    • /
    • v.45 no.6
    • /
    • pp.23-43
    • /
    • 2020
  • This study aims at analyzing the interactions between two of the most significant trade stakeholders in Korea, the Trade Ministry and the Legislature, using text network analysis. Tackling seven Action and Plan Reports for Requests from Parliamentary Inspection released by the National Assembly, this paper conducts a topic modelling analysis, particularly focusing on the reports for the three trade-related institutes: the MOTIE headquarter, Korea Trade Insurance Corporation, Korea Trade and Investment Promotion Agency. According to the analysis, such traditional topics of the MOTIE as enterprise, industry, business, management, development were frequently appeared in the reports. Trade-related topics including export, trade, commerce, investment, overseas, domestic, dispute, cooperation, efficiency, negotiation, service, promotion were repeatedly shown. Lastly, a case study on 2019 Parliamentary Inspection Report showed specific trade-related topics and relevant contents that raised issues in that year. This analysis implies that the text data driven from the Parliamentary Inspection Reports between the MOTIE and the National Assembly, can be established as so called 'trade policy information system' which are valuable not only for the two but also the rest of the trade stakeholders in Korea.

An NLP-based Mixed-method Approach to Explore the Impact of Gratifications and Emotions on the Acceptance of Amazon Go

  • Arghya Ray;Subhadeep Jana;Nripendra P. Rana
    • Asia pacific journal of information systems
    • /
    • v.33 no.3
    • /
    • pp.541-572
    • /
    • 2023
  • Amazon Go is a cashierless convenience store concept, which is seen as a disruption in the grocery retail segment. Although Amazon Go has the ability to disrupt the retail segment, there are speculations on how Amazon Go will be perceived by users. Existing studies have not utilized user-generated content to understand the factors that affect customer behaviour in case of Amazon Go. Additionally, in case of phygital retail, studies have not attempted at understanding the effect of emotions and gratifications on user behaviour. To address the gap of exploring user perspectives based on their experience, we have examined the impact of gratifications and emotions on the acceptance of phygital retail using user-generated-content. A mixed-method approach has been utilized using only user-generated content. Utilizing topic-modelling based content analysis and emotion analysis on 30 articles related to Amazon Go, we found themes like, convenience, technology, experience, personalization, enjoyment and emotions like, bad, good, annoyance, success. In the empirical analysis, we have utilized 522 reviews about Amazon Go from the cognition and emotion theory stance, and found that hedonic gratifications have a positive impact on challenge emotions. We also found a significant impact of emotions on customer's favourite behaviour.

Current Status and Agenda for Regional Central Library Social Minority Service (국내 지역대표도서관 소수자서비스의 현황과 과제)

  • Chul Jung
    • Journal of Korean Library and Information Science Society
    • /
    • v.53 no.4
    • /
    • pp.233-266
    • /
    • 2022
  • The purpose of this study is to derive and propose agenda to improve the quality of minority services provided by regional cental libraries at the present time when information gap is deepening. First, text mining and topic modeling were conducted on 144 studies in the field of library and information science that dealt with minorities, and the discussions surrounding minorities in the domestic library world were examined in detail. Next, the current status of services for minorities in Regional central libraries were examined in detail, and tasks requiring discussion were sought in planning and operation of services for minorities in Regional central libraries. To this end, interviews were conducted with practitioners, in charge of services for minorities at Regional central libraries. Specifically, 1) awareness of minorities by practitioners, 2) current status of minority services, and 3) responsibility and role of Regional central libraries for planning and operating minority services and necessary support were analyzed. Based on the analysis results, the following tasks were derived. 1) Recategorization of minority groups, 2) Establishment of reference resource, 3) Reinforcement of education, and 4) Cooperation support between regional representative libraries and local public libraries were derived and suggested.

Analysis of articles on water quality accidents in the water distribution networks using big data topic modelling and sentiment analysis (빅데이터 토픽모델링과 감성분석을 활용한 물공급과정에서의 수질사고 기사 분석)

  • Hong, Sung-Jin;Yoo, Do-Guen
    • Journal of Korea Water Resources Association
    • /
    • v.55 no.spc1
    • /
    • pp.1235-1249
    • /
    • 2022
  • This study applied the web crawling technique for extracting big data news on water quality accidents in the water supply system and presented the algorithm in a procedural way to obtain accurate water quality accident news. In addition, in the case of a large-scale water quality accident, development patterns such as accident recognition, accident spread, accident response, and accident resolution appear according to the occurrence of an accident. That is, the analysis of the development of water quality accidents through key keywords and sentiment analysis for each stage was carried out in detail based on case studies, and the meanings were analyzed and derived. The proposed methodology was applied to the larval accident period of Incheon Metropolitan City in 2020 and analyzed. As a result, in a situation where the disclosure of information that directly affects consumers, such as water quality accidents, is restricted, the tone of news articles and media reports about water quality accidents with long-term damage in the event of an accident and the degree of consumer pride clearly change over time. could check This suggests the need to prepare consumer-centered policies to increase consumer positivity, although rapid restoration of facilities is very important for the development of water quality accidents from the supplier's point of view.

Prediction of Customer Satisfaction Using RFE-SHAP Feature Selection Method (RFE-SHAP을 활용한 온라인 리뷰를 통한 고객 만족도 예측)

  • Olga Chernyaeva;Taeho Hong
    • Journal of Intelligence and Information Systems
    • /
    • v.29 no.4
    • /
    • pp.325-345
    • /
    • 2023
  • In the rapidly evolving domain of e-commerce, our study presents a cohesive approach to enhance customer satisfaction prediction from online reviews, aligning methodological innovation with practical insights. We integrate the RFE-SHAP feature selection with LDA topic modeling to streamline predictive analytics in e-commerce. This integration facilitates the identification of key features-specifically, narrowing down from an initial set of 28 to an optimal subset of 14 features for the Random Forest algorithm. Our approach strategically mitigates the common issue of overfitting in models with an excess of features, leading to an improved accuracy rate of 84% in our Random Forest model. Central to our analysis is the understanding that certain aspects in review content, such as quality, fit, and durability, play a pivotal role in influencing customer satisfaction, especially in the clothing sector. We delve into explaining how each of these selected features impacts customer satisfaction, providing a comprehensive view of the elements most appreciated by customers. Our research makes significant contributions in two key areas. First, it enhances predictive modeling within the realm of e-commerce analytics by introducing a streamlined, feature-centric approach. This refinement in methodology not only bolsters the accuracy of customer satisfaction predictions but also sets a new standard for handling feature selection in predictive models. Second, the study provides actionable insights for e-commerce platforms, especially those in the clothing sector. By highlighting which aspects of customer reviews-like quality, fit, and durability-most influence satisfaction, we offer a strategic direction for businesses to tailor their products and services.

'Hot Search Keyword' Rank-Change Prediction (인기 검색어의 순위 변화 예측)

  • Kim, Dohyeong;Kang, Byeong Ho;Lee, Sungyoung
    • Journal of KIISE
    • /
    • v.44 no.8
    • /
    • pp.782-790
    • /
    • 2017
  • The service, 'Hot Search Keywords', provides a list of the most hot search terms of different web services such as Naver or Daum. The service, bases the changes in rank of a specific search keyword on changes in its users' interest. This paper introduces a temporal modelling framework for predicting the rank change of hot search keywords using past rank data and machine learning. Past rank data shows that more than 70% of hot search keywords tend to disappear and reappear later. The authors processed missing rank value, using deletion, dummy variables, mean substitution, and expectation maximization. It is however crucial to calculate the optimal window size of the past rank data. We proposed an optimal window size selection approach based on the minimum amount of time a topic within the same or a differing context disappeared. The experiments were conducted with four different machine-learning techniques using the Naver, Daum, and Nate 'Hot Search Keywords' datasets, which were collected for 2 years.

Active Noise Control in Finite Duct by the FIR Filter Modelling Considering the Stuructural Characteristics (구조적특성을 고려한 유한 덕트계의 FIR필터모델링에 의한 능동소음제어)

  • Lee, Tae-Yeon;Song, Won-Shik;Oh, Jae-Eung
    • The Journal of the Acoustical Society of Korea
    • /
    • v.11 no.2
    • /
    • pp.59-67
    • /
    • 1992
  • Recently, the problem which actively control the unwanted noise propagated from the technical structure by the generated secondary sound has become considerable topic from the environmental preservation point of view. In most of these studies, active noise control deals with a plane wave propagation at low frequency using adaptive filtering techniques. On the other hand, in real acoustic systems are mostly short due to the limitation of geometric configuration. In this case, the acoustic properties such as reflections and resonances inside the acoustic system should be considered. In this paper, the acoustic modeling method for short length duct was introduced using the transfer matrix method, and the active noise control problem was investigated with \implementation of FIR filter for the transfer function of control system derived from this modeling method. The identification methods for the acoustic model of actual control system was proposed by numerical computation technique based on the estimation of optimal FIR filter coefficients. The acceptable attenuation on the real acoustic system and stability of the controller are predicted in this computational simulation.

  • PDF

Efficient Data Management for Hull Condition Assessment

  • Jaramillo, David;Cabos, Christian;Renard, Philippe
    • International Journal of CAD/CAM
    • /
    • v.6 no.1
    • /
    • pp.9-17
    • /
    • 2006
  • Performing inspections for Hull Condition Monitoring and Assessment as stipulated in IACS unified requirements and IMO's Condition Assessment Scheme (CAS) IMO Resolution MEPC.94(46), 2001, Condition Assessment Scheme, IMO Resolution MEPC.111(50), 2003, Amendments to regulation 13G, addition of new regulation 13H involves a huge amount of measurement data to be collected, processed, analysed and maintained. Information to be recorded consists of thickness measurements and visual assessment of coating and cracks. The amount of data and increasing requirements with respect to condition assessment demand efficient computer support. Currently, due to the lack of standardization for this kind of data, the thickness measurements are recorded manually on ship drawings or tables. In this form, handling of the measurements is tedious and error-prone and assessment is difficult. Data reporting and analysis takes a long time, leading to some repairs being performed only at the next docking of the ship or making an additional docking necessary. The recently started ED funded project CAS addresses this topic and develops-as a first step-a data model for Hull Condition Monitoring and Assessment (HCMA) based on XML-technology. The model includes simple geometry representation to facilitate a graphically supported data collection as well as an easy visualisation of the measurement results. In order to ensure compatibility with the current way of working, the content of the data model is strictly confined to the requirements of the measurement process. Appropriate data interfaces to classification software will enable rapid assessment by the classification societies, thus improving the process in terms of time and cost savings. In particular, decision-making can be done while the ship is still in the dock for maintenance.

Study of Analysis for Autonomous Vehicle Collision Using Text Embedding (텍스트 임베딩을 이용한 자율주행자동차 교통사고 분석에 관한 연구)

  • Park, Sangmin;Lee, Hwanpil;So, Jaehyun(Jason);Yun, Ilsoo
    • The Journal of The Korea Institute of Intelligent Transport Systems
    • /
    • v.20 no.1
    • /
    • pp.160-173
    • /
    • 2021
  • Recently, research on the development of autonomous vehicles has increased worldwide. Moreover, a means to identify and analyze the characteristics of traffic accidents of autonomous vehicles is needed. Accordingly, traffic accident data of autonomous vehicles are being collected in California, USA. This research examined the characteristics of traffic accidents of autonomous vehicles. Primarily, traffic accident data for autonomous vehicles were analyzed, and the text data used text-embedding techniques to derive major keywords and four topics. The methodology of this study is expected to be used in the analysis of traffic accidents in autonomous vehicles.

Detection of Depression Trends in Literary Cyber Writers Using Sentiment Analysis and Machine Learning

  • Faiza Nasir;Haseeb Ahmad;CM Nadeem Faisal;Qaisar Abbas;Mubarak Albathan;Ayyaz Hussain
    • International Journal of Computer Science & Network Security
    • /
    • v.23 no.3
    • /
    • pp.67-80
    • /
    • 2023
  • Rice is an important food crop for most of the population in Nowadays, psychologists consider social media an important tool to examine mental disorders. Among these disorders, depression is one of the most common yet least cured disease Since abundant of writers having extensive followers express their feelings on social media and depression is significantly increasing, thus, exploring the literary text shared on social media may provide multidimensional features of depressive behaviors: (1) Background: Several studies observed that depressive data contains certain language styles and self-expressing pronouns, but current study provides the evidence that posts appearing with self-expressing pronouns and depressive language styles contain high emotional temperatures. Therefore, the main objective of this study is to examine the literary cyber writers' posts for discovering the symptomatic signs of depression. For this purpose, our research emphases on extracting the data from writers' public social media pages, blogs, and communities; (3) Results: To examine the emotional temperatures and sentences usage between depressive and not depressive groups, we employed the SentiStrength algorithm as a psycholinguistic method, TF-IDF and N-Gram for ranked phrases extraction, and Latent Dirichlet Allocation for topic modelling of the extracted phrases. The results unearth the strong connection between depression and negative emotional temperatures in writer's posts. Moreover, we used Naïve Bayes, Support Vector Machines, Random Forest, and Decision Tree algorithms to validate the classification of depressive and not depressive in terms of sentences, phrases and topics. The results reveal that comparing with others, Support Vectors Machines algorithm validates the classification while attaining highest 79% f-score; (4) Conclusions: Experimental results show that the proposed system outperformed for detection of depression trends in literary cyber writers using sentiment analysis.