• Title/Summary/Keyword: topic modelling

Search Result 55, Processing Time 0.021 seconds

Stock Forecasting Using Prophet vs. LSTM Model Applying Time-Series Prediction

  • Alshara, Mohammed Ali
    • International Journal of Computer Science & Network Security
    • /
    • v.22 no.2
    • /
    • pp.185-192
    • /
    • 2022
  • Forecasting and time series modelling plays a vital role in the data analysis process. Time Series is widely used in analytics & data science. Forecasting stock prices is a popular and important topic in financial and academic studies. A stock market is an unregulated place for forecasting due to the absence of essential rules for estimating or predicting a stock price in the stock market. Therefore, predicting stock prices is a time-series problem and challenging. Machine learning has many methods and applications instrumental in implementing stock price forecasting, such as technical analysis, fundamental analysis, time series analysis, statistical analysis. This paper will discuss implementing the stock price, forecasting, and research using prophet and LSTM models. This process and task are very complex and involve uncertainty. Although the stock price never is predicted due to its ambiguous field, this paper aims to apply the concept of forecasting and data analysis to predict stocks.

Predicting the Saudi Student Perception of Benefits of Online Classes during the Covid-19 Pandemic using Artificial Neural Network Modelling

  • Beyari, Hasan
    • International Journal of Computer Science & Network Security
    • /
    • v.22 no.2
    • /
    • pp.145-152
    • /
    • 2022
  • One of the impacts of Covid-19 on education systems has been the shift to online education. This shift has changed the way education is consumed and perceived by students. However, the exact nature of student perception about online education is not known. The aim of this study was to understand the perceptions of Saudi higher education students (e.g., post-school students) about online education during the Covid-19 pandemic. Various aspects of online education including benefits, features and cybersecurity were explored. The data collected were analysed using statistical techniques, especially artificial neural networks, to address the research aims. The key findings were that benefits of online education was perceived by students with positive experience or when ensured of safe use of online platforms without the fear cyber security breaches for which recruitment of a cyber security officer was an important predictor. The issue of whether perception of online education as a necessity only for Covid situation or a lasting option beyond the pandemic is a topic for future research.

토픽모델링을 활용한 4차 산업혁명의 주요 이슈 분석

  • Jeon, Jeong-Hwan;Seo, Yong-Yun
    • Proceedings of the Korea Technology Innovation Society Conference
    • /
    • 2017.11a
    • /
    • pp.1321-1321
    • /
    • 2017
  • Recently the attention to the 4th industrial revolution has been increasing. In the 4th industrial revolution era, the boundaries of physical space, digital space, and biological space are becoming blurred since the active convergence between various fields There are a lot of issues on the 4th industrial revolution such as artificial intelligence, internet of thing, big data, and cyber physical system. Accordingly, this study aims to analyse the main issues of the 4th industrial revolution. Data mining such as topic modelling method is used for the analysis. This study is expected to be helpful for the researcher and policy maker of the 4th industrial revolution.

  • PDF

A numerical comparative study on induced drainage modelling in 2D hydro-mechanical coupled analysis (이차원 수리-역학적 연계해석 시 유도배수 모델링 방법에 따른 수치해석적 비교연구)

  • You, Kwang-Ho
    • Journal of Korean Tunnelling and Underground Space Association
    • /
    • v.10 no.1
    • /
    • pp.91-104
    • /
    • 2008
  • In tunnels, safety factor concept has been suggested to estimate their stability quantitatively. It is merely limited in the framework of mechanical analysis. However safety factor concept has not been applied in hydro-mechanical coupled analyses due to their modelling complexity. Recently studies on this topic are being actively made. In this study, induced drainage modelling methods for hydro-mechanical coupled analyses are compared and analyzed to estimate safety factor of a subsea tunnel exactly. To this end, methods both controlling hydraulic characteristic of shotcrete and using a drainage well are considered. Sensitivity analysis were carried out on rock class, thickness of shotcrete, and hydraulic properties of rock mass. As the results of this study, it turned out that the induced drainage modelling using a drainage well would give more reliable results than that of controlling hydraulic characteristic of shotcrete in estimating tunnel stability in hydro-mechanical coupled analyses.

  • PDF

An Exploratory Analysis of Online Discussion of Library and Information Science Professionals in India using Text Mining

  • Garg, Mohit;Kanjilal, Uma
    • Journal of Information Science Theory and Practice
    • /
    • v.10 no.3
    • /
    • pp.40-56
    • /
    • 2022
  • This paper aims to implement a topic modeling technique for extracting the topics of online discussions among library professionals in India. Topic modeling is the established text mining technique popularly used for modeling text data from Twitter, Facebook, Yelp, and other social media platforms. The present study modeled the online discussions of Library and Information Science (LIS) professionals posted on Lis Links. The text data of these posts was extracted using a program written in R using the package "rvest." The data was pre-processed to remove blank posts, posts having text in non-English fonts, punctuation, URLs, emails, etc. Topic modeling with the Latent Dirichlet Allocation algorithm was applied to the pre-processed corpus to identify each topic associated with the posts. The frequency analysis of the occurrence of words in the text corpus was calculated. The results found that the most frequent words included: library, information, university, librarian, book, professional, science, research, paper, question, answer, and management. This shows that the LIS professionals actively discussed exams, research, and library operations on the forum of Lis Links. The study categorized the online discussions on Lis Links into ten topics, i.e. "LIS Recruitment," "LIS Issues," "Other Discussion," "LIS Education," "LIS Research," "LIS Exams," "General Information related to Library," "LIS Admission," "Library and Professional Activities," and "Information Communication Technology (ICT)." It was found that the majority of the posts belonged to "LIS Exam," followed by "Other Discussions" and "General Information related to the Library."

Research Trend Analysis of Digital Divide in South Korea (디지털 정보격차 관련 국내 연구 동향 분석)

  • Ko, Jeonghyeun;Kang, Woojin;Lee, Jongwook
    • Journal of Korean Library and Information Science Society
    • /
    • v.52 no.4
    • /
    • pp.179-203
    • /
    • 2021
  • This study aims to grasp the key issues and the direction for digital divide research in South Korea. Based on the 488 KCI journal articles published between 2003 and 2020, the authors analyzed the changes in the number of articles per year and the subject areas of journals. Furthermore, the topic modelling and keyword network anlaysis were applied to identify the subjects of research. The main findings can be summarized as follows: first, there was a stable trend for a while after the number of articles had increased by the year of 2007, and then there has been a sharp increase since 2019. Second, digital divide research has been conducted from diverse fields including social science, multidiscipline, and engineering. Third, the six subject areas were identified which are 'digital divide among regions', 'digital divide among people with disabilities', 'technical environment of digital divide', 'divide from information use and its consequence', 'legal and institutional environments of digital divide', and 'digital divide of the elderly'. Finally, it was shown that the areas of 'divide from information use and its consequence' and 'technical environment of digital divide' have attracted attention recently.

Spatial Distribution Patterns of Twitter Data with Topic Modeling (토픽 모델링을 이용한 트위터 데이터의 공간 분포 패턴 분석)

  • Woo, Hyun Jee;Kim, Young Hoon
    • Journal of the Korean association of regional geographers
    • /
    • v.23 no.2
    • /
    • pp.376-387
    • /
    • 2017
  • This paper attempts to analyze the geographical characters of Twitter data and presents analysis potentials for social network analysis in geography. First, this paper suggests a methodology for a topic modeling-based approach in order to identify the geographical characteristics of tweets, including an analysis flow of Twitter data sets, tweet data collection and conversion, textural pre-processing and structural analysis, topic discovery, and interpretation of tweets' topics. GPS coordinates referencing tweets(geotweets) were extracted among sampled Twitter data sets because it contains the tweet place where it was created. This paper identifies a correlated relationship between some specific topics and local places in Jeju. This correlation is closely associated with some place names and local sites in Jeju Island. We assume it is the intention of tweeters to record their tweet places and to share and retweet with other tweeters in some cases. A surface density map shows the hotspots of tweets, detecting around some specific places and sites such as Jeju airport, sightseeing sites, and local places in Jeju Island. The hotspots show similar patterns of the floating population of Jeju, especially the thirty-year age group. In addition, a topic modeling algorithm is applied for the geographical topic discovery and comparison of the spatial patterns of tweets. Finally, this empirical analysis presents that Twitter data, as social network data, provide geographical significance, with topic modeling approach being useful in analyzing the textural features reflecting the geographical characteristics in large data sets of tweets.

  • PDF

A Study on the Structural Modelling of National Maritime Power System (국가해양력시스템의 구조모델화에 관한 연구)

  • 임봉택;이철영
    • Proceedings of the Korean Institute of Navigation and Port Research Conference
    • /
    • 1999.10a
    • /
    • pp.153-161
    • /
    • 1999
  • For composing the structure model of national maritime power system by system structural modelling, in this study, the 50 basic factors are selected by survey of the extensive and thorough literatures on maritime, sea, maritime power and sea power. And the basic factors are classified into 36 component factors by cluster method. The 9 attributes are extracted by the application of the principle component analysis method, one of the factor analysis method in system engineering, to component factors. We defined the attributes composing the national maritime power system by integration the result of this study and existed our studies relate to this topic. Which are showed in table 8. and we showed the structure model of national maritime power system in figure 3. In table 8, the 9 attributes are as follows: the fundamental power of maritime, shipping and port power, naval power, fishing power, shipbuilding power, the power of ocean research and development, dependency on seaborne trade, the protection power of ocean environment and the will and inclination of government.

Grid Based Nonpoint Source Pollution Load Modelling

  • Niaraki, Abolghasem Sadeghi;Park, Jae-Min;Kim, Kye-Hyun;Lee, Chul-Yong
    • 한국공간정보시스템학회:학술대회논문집
    • /
    • 2007.06a
    • /
    • pp.246-251
    • /
    • 2007
  • The purpose of this study is to develop a grid based model for calculating the critical nonpoint source (NPS) pollution load (BOD, TN, TP) in Nak-dong area in South Korea. In the last two decades, NPS pollution has become a topic for research that resulted in the development of numerous modeling techniques. Watershed researchers need to be able to emphasis on the characterization of water quality, including NPS pollution loads estimates. Geographic Information System (GIS) has been designed for the assessment of NPS pollution in a watershed. It uses different data such as DEM, precipitation, stream network, discharge, and land use data sets and utilizes a grid representation of a watershed for the approximation of average annual pollution loads and concentrations. The difficulty in traditional NPS modeling is the problem of identifying sources and quantifying the loads. This research is intended to investigate the correlation of NPS pollution concentrations with land uses in a watershed by calculating Expected Mean Concentrations (EMC). This work was accomplished using a grid based modelling technique that encompasses three stages. The first step includes estimating runoff grid by means of the precipitation grid and runoff coefficient. The second step is deriving the gird based model for calculating NPS pollution loads. The last step is validating the gird based model with traditional pollution loads calculation by applying statistical t-test method. The results on real data, illustrate the merits of the grid based modelling approach. Therefore, this model investigates a method of estimating and simulating point loads along with the spatially distributed NPS pollution loads. The pollutant concentration from local runoff is supposed to be directly related to land use in the region and is not considered to vary from event to event or within areas of similar land uses. By consideration of this point, it is anticipated that a single mean estimated pollutant concentration is assigned to all land uses rather than taking into account unique concentrations for different soil types, crops, and so on.

  • PDF

Exploration of User Experience Research Method with Big Data Analysis : Focusing on the Online Review Analysis of Echo (빅데이터 분석을 활용한 사용자 경험 평가 방법론 탐색 : 아마존 에코에 대한 온라인 리뷰 분석을 중심으로)

  • Hwang, Hae Jeong;Shim, Hye Rin;Choi, Junho
    • The Journal of the Korea Contents Association
    • /
    • v.16 no.8
    • /
    • pp.517-528
    • /
    • 2016
  • This study attempted to explore and examine a new user experience (UX) research method for IoT products which are becoming widely used but lack practical user research. While user experience research has been traditionally opted for survey or observation methods, this paper utilized big data analysis method for user online reviews on an intelligent agent IoT product, Amazon's Echo. The results of topic modelling analysis extracted user experience elements such as features, conversational interaction, and updates. In addition, regression analysis showed that the topic of updates was the most influential determinant of user satisfaction. The main implication of this study is the new introduction of big data analysis method into the user experience research for the intelligent agent IoT products.