• 제목/요약/키워드: Data Models

검색결과 13,797건 처리시간 0.034초

FINANCIAL MODELS INDUCED FROM AUXILIARY INDICES AND TWITTER DATA

  • Oh, Jae-Pill
    • Korean Journal of Mathematics
    • /
    • 제22권3호
    • /
    • pp.529-552
    • /
    • 2014
  • As we know, some indices and data are strong influence to the price movement of some assets now, but not to another assets and in future. Thus we define some asset models for several time intervals; intraday, weekly, monthly, and yearly asset models. We define these asset models by using Brownian motion with volatility and Poisson process, and several deterministic functions(index function, twitter data function and big-jump simple function etc). In our asset models, these deterministic functions are the positive or negative levels of auxiliary indices, of analyzed data, and for imminent and extreme state(for example, financial shock or the highest popularity in the market). These functions determined by indices, twitter data and shocking news are a kind of one of speciality of our asset models. For reasonableness of our asset models, we introduce several real data, figurers and tables, and simulations. Perhaps from our asset models, for short-term or long-term investment, we can classify and reference many kinds of usual auxiliary indices, information and data.

Analysis of Quasi-Likelihood Models using SAS/IML

  • Ha, Il-Do
    • Journal of the Korean Data and Information Science Society
    • /
    • 제8권2호
    • /
    • pp.247-260
    • /
    • 1997
  • The quasi-likelihood models which greatly widened the scope of generalized linear models are widely used in data analysis where a likelihood is not available. Since a quasi-likelihood may not appear to be an ordinary likelihood for any known distribution in the natural exponential family, to fit the quasi-likelihood models the standard statistical packages such as GLIM, GENSTAT, S-PLUS and so on may not directly applied. SAS/IML is very useful for fitting of such models. In this paper, we present simple SAS/IML(version 6.11) program which helps to fit and analyze the quasi-likelihood models applied to the leaf-blotch data introduced by Wedderburn(1974), and the problem with deviance useful generally to model checking is pointed out, and then its solution method is mention through the data analysis based on this quasi-likelihood models checking.

  • PDF

Scalable Prediction Models for Airbnb Listing in Spark Big Data Cluster using GPU-accelerated RAPIDS

  • Muralidharan, Samyuktha;Yadav, Savita;Huh, Jungwoo;Lee, Sanghoon;Woo, Jongwook
    • Journal of information and communication convergence engineering
    • /
    • 제20권2호
    • /
    • pp.96-102
    • /
    • 2022
  • We aim to build predictive models for Airbnb's prices using a GPU-accelerated RAPIDS in a big data cluster. The Airbnb Listings datasets are used for the predictive analysis. Several machine-learning algorithms have been adopted to build models that predict the price of Airbnb listings. We compare the results of traditional and big data approaches to machine learning for price prediction and discuss the performance of the models. We built big data models using Databricks Spark Cluster, a distributed parallel computing system. Furthermore, we implemented models using multiple GPUs using RAPIDS in the spark cluster. The model was developed using the XGBoost algorithm, whereas other models were developed using traditional central processing unit (CPU)-based algorithms. This study compared all models in terms of accuracy metrics and computing time. We observed that the XGBoost model with RAPIDS using GPUs had the highest accuracy and computing time.

Comparative Analysis of Building Models to Develop a Generic Indoor Feature Model

  • Kim, Misun;Choi, Hyun-Sang;Lee, Jiyeong
    • 한국측량학회지
    • /
    • 제39권5호
    • /
    • pp.297-311
    • /
    • 2021
  • Around the world, there is an increasing interest in Digital Twin cities. Although geospatial data is critical for building a digital twin city, currently-established spatial data cannot be used directly for its implementation. Integration of geospatial data is vital in order to construct and simulate the virtual space. Existing studies for data integration have focused on data transformation. The conversion method is fundamental and convenient, but the information loss during this process remains a limitation. With this, standardization of the data model is an approach to solve the integration problem while hurdling conversion limitations. However, the standardization within indoor space data models is still insufficient compared to 3D building and city models. Therefore, in this study, we present a comparative analysis of data models commonly used in indoor space modeling as a basis for establishing a generic indoor space feature model. By comparing five models of IFC (Industry Foundation Classes), CityGML (City Geographic Markup Language), AIIM (ArcGIS Indoors Information Model), IMDF (Indoor Mapping Data Format), and OmniClass, we identify essential elements for modeling indoor space and the feature classes commonly included in the models. The proposed generic model can serve as a basis for developing further indoor feature models through specifying minimum required structure and feature classes.

비선형 성장곡선 모형의 분석 절차에 대한 연구 (A Study on the Analysis Procedures of Nonlinear Growth Curve Models)

  • 황정연
    • 품질경영학회지
    • /
    • 제25권1호
    • /
    • pp.44-55
    • /
    • 1997
  • In order to determine procedures for a, pp.opriate model selection of technological growth curves, numerous time series that were representative of growth behavior were collected according to data characteristics. Three different growth curve models were fitted onto data sets in an attempt to determine which growth curve models achieved the best forecasts for types of growth data. The analysis of the results gives rise to an a, pp.oach for selecting a, pp.opriate growth curve models for a given set of data, prior to fitting the models, based on the characteristics of the goodness of fit test.

  • PDF

Comparison of Three Binomial-related Models in the Estimation of Correlations

  • Moon, Myung-Sang
    • Communications for Statistical Applications and Methods
    • /
    • 제10권2호
    • /
    • pp.585-594
    • /
    • 2003
  • It has been generally recognized that conventional binomial or Poisson model provides poor fits to the actual correlated binary data due to the extra-binomial variation. A number of generalized statistical models have been proposed to account for this additional variation. Among them, beta-binomial, correlated-binomial, and modified-binomial models are binomial-related models which are frequently used in modeling the sum of n correlated binary data. In many situations, it is reasonable to assume that n correlated binary data are exchangeable, which is a special case of correlated binary data. The sum of n exchangeable correlated binary data is modeled relatively well when the above three binomial-related models are applied. But the estimation results of correlation coefficient turn to be quite different. Hence, it is important to identify which model provides better estimates of model parameters(success probability, correlation coefficient). For this purpose, a small-scale simulation study is performed to compare the behavior of above three models.

공공부문 다기관 통합전산센터 모형에 관한 연구 (A Study on Models of Data Consolidation Center for Multi-Organization in Public Sector)

  • 임성묵;이영재
    • 산업공학
    • /
    • 제18권4호
    • /
    • pp.418-430
    • /
    • 2005
  • We establish an efficient strategy for construction and operation of data consolidation center for multi-organization in public sector. First, we introduce important concepts on data consolidation center in public sector, and draw some success factors by analyzing several foreign and domestic cases. Second, we construct all the possible logical operational models of the center and investigate the properties and feasibility of the models. Third, we suggest a virtual operational environment for the two representative models selected by feasibility criteria among the possible logical models, and compare the two models in terms of operational cost. We also utilize AHP methodology to evaluate qualitative opinions on the two models from several experts in public information systems. As a result, we find the best alternative is the case in which all infrastructure and facilities for the center are provided by government, and common essential IT operations are integrated, associated data are consolidated and the whole operational work are outsourced to specialized IT operations service providers.

Quantitative Comparison of Probabilistic Multi-source Spatial Data Integration Models for Landslide Hazard Assessment

  • Park No-Wook;Chi Kwang-Hoon;Chung Chang-Jo F.;Kwon Byung-Doo
    • 대한원격탐사학회:학술대회논문집
    • /
    • 대한원격탐사학회 2004년도 Proceedings of ISRS 2004
    • /
    • pp.622-625
    • /
    • 2004
  • This paper presents multi-source spatial data integration models based on probability theory for landslide hazard assessment. Four probabilistic models such as empirical likelihood ratio estimation, logistic regression, generalized additive and predictive discriminant models are proposed and applied. The models proposed here are theoretically based on statistical relationships between landslide occurrences and input spatial data sets. Those models especially have the advantage of direct use of continuous data without any information loss. A case study from the Gangneung area, Korea was carried out to quantitatively assess those four models and to discuss operational issues.

  • PDF

Linked Data 기반의 메타데이타 모델을 활용한 소프트웨어 모델 통합 (Software Model Integration Using Metadata Model Based on Linked Data)

  • 김대환;정찬기
    • 한국IT서비스학회지
    • /
    • 제12권3호
    • /
    • pp.311-321
    • /
    • 2013
  • In the community of software engineering, diverse modeling languages are used for representing all relevant information in the form of models. Also many different models such as business model, business process model, product models, interface models etc. are generated through software life cycles. In this situation, models need to be integrated for enterprise integration and enhancement of software productivity. Researchers propose rebuilding models by a specific modeling language, using a intemediate modeling language and using common reference for model integration. However, in the current approach it requires a lot of cost and time to integrate models. Also it is difficult to identify common objects from several models and to update objects in the repository of common model objects. This paper proposes software model integration using metadata model based on Linked data. We verify the effectiveness of the proposed approach through a case study.

자료기반 물환경 모델의 현황 및 발전 방향 (Data-Driven Modeling of Freshwater Aquatic Systems: Status and Prospects)

  • 차윤경;신지훈;김영우
    • 한국물환경학회지
    • /
    • 제36권6호
    • /
    • pp.611-620
    • /
    • 2020
  • Although process-based models have been a preferred approach for modeling freshwater aquatic systems over extended time intervals, the increasing utility of data-driven models in a big data environment has made the data-driven models increasingly popular in recent decades. In this study, international peer-reviewed journals for the relevant fields were searched in the Web of Science Core Collection, and an extensive literature review, which included total 2,984 articles published during the last two decades (2000-2020), was performed. The review results indicated that the rate of increase in the number of published studies using data-driven models exceeded those using process-based models since 2010. The increase in the use of data-driven models was partly attributable to the increasing availability of data from new data sources, e.g., remotely sensed hyperspectral or multispectral data. Consistently throughout the past two decades, South Korea has been one of the top ten countries in which the greatest number of studies using the data-driven models were published. Among the major data-driven approaches, i.e., artificial neural network, decision tree, and Bayesian model, were illustrated with case studies. Based on the review, this study aimed to inform the current state of knowledge regarding the biogeochemical water quality and ecological models using data-driven approaches, and provide the remaining challenges and future prospects.