• Title/Summary/Keyword: 데이터마트 (data mart)

Implementation of the OLAP-based Subway Passenger Transit Pattern Analysis System (OLAP을 활용한 지하철 인구이동 맵 생성에 관한 연구)

  • Cho, Jae-Hee;Seo, Il-Jung
    • Information Systems Review / v.7 no.1 / pp.65-80 / 2005
  • The Seoul Metropolitan Subway Corporation (SMS) and the Seoul Metropolitan Rapid Transit Corporation (SMRT), which manage the city's eight subway lines, intend to overcome their operational inefficiencies. The authors emphasize that the two subway authorities need to analyze subway transit data before putting policies and plans into practice. This paper proposes a new, intuitive way of analyzing subway passenger transit patterns. To this end, the authors implemented a data mart by loading "Pass Card" log data into a multidimensional model. The passengers' transit patterns and the practical implications of the system are also examined.

Data Mart Delivery Architecture (데이터마트 전달구조)

  • Korea Database Promotion Center
    • Digital Contents / no.7 s.62 / pp.72-75 / 1998
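  • Today, EIS face the difficulty of having to provide end users, with limited resources and in a short time, with a commercial data mart delivery architecture that is both robust and flexible. Such a delivery architecture must support a wide range of users, from experts with considerable knowledge of SQL to end users in the pure sense who only want to run identically formatted reports or OLAP queries. In addition, the architecture must be flexible enough to support data delivery in any format, including ROLAP, DOLAP, Excel, and spreadsheets.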

A Study on the Adjustment of Posterior Probability for Oversampling when the Target is Rare (목표 범주가 희귀한 자료의 과대표본추출에 대한 연구)

  • Kim, U.N.;Lee, S.K.;Choi, J.H.
    • The Korean Journal of Applied Statistics / v.24 no.3 / pp.477-484 / 2011
  • When the event of a target variable is rare, a widespread strategy is to build the model on an over-sampled data set, that is, a sample that disproportionately over-represents the events. Predicted values from a model fitted on data over-sampled from the original data set are biased; however, they can easily be corrected to represent the population. In this study, we investigate the relationship between the proportion of rare events in a data mart and model performance, using real-world data from a Korean credit card company. We also apply two methods for adjusting the posterior probability for over-sampled data, the offset method and the weighted method. Finally, we compare the performance of the two methods on real data sets.
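
The correction mentioned in this abstract can be illustrated with a minimal sketch. It assumes the standard prior-correction formula for a classifier fitted on over-sampled data and an equivalent log-odds shift; the function names and example rates are hypothetical, and the paper's own offset and weighted methods may differ in detail.

```python
import math

def adjust_posterior(p_sample: float, pop_rate: float, sample_rate: float) -> float:
    """Rescale a probability predicted on over-sampled data to the population.

    pop_rate is the event rate in the population, sample_rate the event rate
    in the over-sampled training data (both are assumed to be known).
    """
    event = p_sample * pop_rate / sample_rate
    non_event = (1.0 - p_sample) * (1.0 - pop_rate) / (1.0 - sample_rate)
    return event / (event + non_event)

def adjust_with_offset(p_sample: float, pop_rate: float, sample_rate: float) -> float:
    """Same correction expressed as a shift of the log-odds by a constant."""
    offset = math.log(sample_rate * (1.0 - pop_rate) / ((1.0 - sample_rate) * pop_rate))
    logit = math.log(p_sample / (1.0 - p_sample)) - offset
    return 1.0 / (1.0 + math.exp(-logit))

# Hypothetical rates: 2% events in the population, 50% after over-sampling.
corrected = adjust_posterior(0.8, pop_rate=0.02, sample_rate=0.5)
shifted = adjust_with_offset(0.8, pop_rate=0.02, sample_rate=0.5)
```

Both functions should agree up to rounding, since the log-odds shift is the same correction written on the logit scale. A weighted approach would instead assign case weights when fitting the model and is not shown here.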

Analysis of Passenger Movement Patterns Using Subway OD Data (도시철도 출·도착데이터를 이용한 승객이동 패턴 분석)

  • Baik, Euiyoung;Cho, Jae Hee;Kim, Dong-Geon
    • Journal of the Korea Convergence Society / v.10 no.12 / pp.315-325 / 2019
  • The purpose of this study is to design and construct a data mart with which anyone can easily analyze subway origin-destination (OD) movement patterns. Subway OD data for 2017 were downloaded from the Seoul Open Data Plaza and used as the source data. A multidimensional model was designed, and Gaussian mixture cluster analysis and visual analysis using Tableau were performed. Interestingly, movement between the suburbs and Seoul accounts for 23% of total traffic. Passengers at Suwon Station travel to the suburbs much more than to Seoul, while passengers at Pangyo Station mostly travel to Seoul. The Gaussian mixture clustering found eight clusters of OD segments, each characterized by segment distance and passenger volume.
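
As a rough illustration of how an OD data mart of this kind might be organized, the sketch below builds a tiny star schema in an in-memory SQLite database and computes the share of traffic that crosses regions. The table names, column names, and sample rows are all hypothetical and are not taken from the paper.

```python
import sqlite3

def build_od_mart(rows):
    """Load (origin, origin_region, dest, dest_region, passengers) rows
    into one station dimension and one OD fact table."""
    con = sqlite3.connect(":memory:")
    con.executescript("""
        CREATE TABLE dim_station (
            station_id INTEGER PRIMARY KEY,
            name TEXT UNIQUE,
            region TEXT
        );
        CREATE TABLE fact_od (
            origin_id INTEGER REFERENCES dim_station(station_id),
            dest_id INTEGER REFERENCES dim_station(station_id),
            passengers INTEGER
        );
    """)
    for origin, o_region, dest, d_region, passengers in rows:
        # Register each station once in the dimension table.
        for name, region in ((origin, o_region), (dest, d_region)):
            con.execute(
                "INSERT OR IGNORE INTO dim_station (name, region) VALUES (?, ?)",
                (name, region),
            )
        # Store the fact row with surrogate keys looked up by station name.
        con.execute(
            """INSERT INTO fact_od
               SELECT a.station_id, b.station_id, ?
               FROM dim_station a, dim_station b
               WHERE a.name = ? AND b.name = ?""",
            (passengers, origin, dest),
        )
    return con

def cross_region_share(con):
    """Fraction of passengers whose origin and destination regions differ."""
    total, cross = con.execute("""
        SELECT SUM(f.passengers),
               SUM(CASE WHEN o.region <> d.region THEN f.passengers ELSE 0 END)
        FROM fact_od f
        JOIN dim_station o ON o.station_id = f.origin_id
        JOIN dim_station d ON d.station_id = f.dest_id
    """).fetchone()
    return cross / total

# Made-up sample rows for illustration only.
sample = [
    ("Suwon", "suburb", "Pangyo", "suburb", 70),
    ("Pangyo", "suburb", "Gangnam", "Seoul", 20),
    ("Gangnam", "Seoul", "Suwon", "suburb", 10),
]
share = cross_region_share(build_od_mart(sample))
```

Keeping region on the station dimension rather than on the fact table means the same fact rows can be re-aggregated by any other station attribute without reloading the data.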

System Implementation Plan for Applying Spatial Information to Road Occupation Permit Administrative Works (도로점용허가 행정업무에 공간정보 활용을 위한 시스템 구축방안)

  • Youn, Junhee;Kim, Changyoon
    • Journal of the Korea Academia-Industrial cooperation Society / v.16 no.6 / pp.4208-4215 / 2015
  • Most administrative work is carried out on the basis of addresses, so spatial information is an essential element of it. Many systems that apply spatial information to administrative work have been implemented so far. However, few approaches apply spatial information to road occupation permit work. In this paper, we introduce a system implementation plan for applying spatial information to road occupation permit administrative work. The plan comprises work analysis, extraction of application scenarios, extraction of system functions, and a datamart construction plan. First, work processes and activities are defined from an analysis of the work handbook, and the activities are presented in a diagram. Second, scenarios for applying spatial information to road occupation permit work are extracted. Third, we derive the system's service functions, which realize the work processes and application scenarios. Finally, a construction plan for the spatial information datamart is established. The proposed plan covers the application architecture and part of the data architecture. To implement the system, plans for the hardware and software architecture remain to be studied.

A study on the XML Implementation for Knowledge Management System (지식관리 시스템을 위한 XML구축 방안에 대한 연구)

  • 최우영;최성
    • Proceedings of the KAIS Fall Conference / 2002.05a / pp.248-251 / 2002
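  • With the growing recognition of the importance of knowledge, companies are attempting to accumulate knowledge by building information systems, and their members exchange information through core knowledge management systems such as data warehousing (DW), data marts, and enterprise resource planning (ERP). A knowledge management system (KMS) is the information technology through which corporate knowledge is created, shared, managed, and stored, and which organizes that knowledge more systematically and precisely so that users can use it more easily. XML is a specification that serves as a medium for standardized information exchange among applications, and it will provide a basis for sharing the structural framework of knowledge management. This study examines standard XML as a way to give knowledge systems the flexibility and extensibility needed to ensure interoperability among distributed information and systems.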

Buying Pattern Discovery Using Spatio-Temporal Data Mart and Visual Analysis (고객군의 지리적 패턴 발견을 위한 데이터마트 구현과 시각적 분석에 관한 연구)

  • Cho, Jae-Hee;Ha, Byung-Kook
    • Journal of Information Technology Services / v.9 no.1 / pp.127-139 / 2010
  • With the development of information technology and of businesses tied to customers' geographical location, the need to store and analyze geographical location data is increasing rapidly. Geographical location data have a spatio-temporal nature that distinguishes them from typical business data, so different methods of storage and analysis are required. This paper proposes a multi-dimensional data model and data visualization for analyzing geographical location data efficiently and effectively. Purchase order data from an online farm products brokerage business were used to build a prototype datamart. RFM scores are calculated to classify customers, and geocoding is applied to display information on maps, thereby enhancing data visualization.
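
To make the RFM (recency, frequency, monetary) scoring step concrete, here is a minimal sketch that assumes a simple rank-based binning; the paper does not state its scoring rule, and the function names, bin count, and sample orders below are hypothetical.

```python
from datetime import date

def rank_score(values, bins=5, reverse=False):
    """Score each value from 1 to bins by its rank.

    By default larger values score higher; with reverse=True smaller values
    score higher. Ties are broken by input order, so tied values may receive
    different scores.
    """
    order = sorted(range(len(values)), key=lambda i: values[i], reverse=reverse)
    scores = [0] * len(values)
    for rank, i in enumerate(order):
        scores[i] = 1 + rank * bins // len(values)
    return scores

def rfm_scores(orders, today, bins=5):
    """orders: iterable of (customer_id, order_date, amount).
    Returns {customer_id: (R, F, M)}."""
    summary = {}
    for customer, day, amount in orders:
        last, freq, money = summary.get(customer, (day, 0, 0.0))
        summary[customer] = (max(last, day), freq + 1, money + amount)
    customers = sorted(summary)
    recency = [(today - summary[c][0]).days for c in customers]
    frequency = [summary[c][1] for c in customers]
    monetary = [summary[c][2] for c in customers]
    r = rank_score(recency, bins, reverse=True)  # fewer days since last order scores higher
    f = rank_score(frequency, bins)
    m = rank_score(monetary, bins)
    return {c: (r[i], f[i], m[i]) for i, c in enumerate(customers)}

# Made-up purchase orders for illustration only.
orders = [
    ("a", date(2010, 1, 1), 10.0),
    ("a", date(2010, 3, 1), 20.0),
    ("b", date(2010, 2, 1), 5.0),
    ("c", date(2009, 12, 1), 100.0),
]
scores = rfm_scores(orders, today=date(2010, 3, 31), bins=3)
```

In a data mart setting, the three scores would typically be stored as attributes of the customer dimension so that customer groups can be sliced on a map alongside the geocoded locations.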

Comparison of Micro Mobility Patterns of Public Bicycles Before and After the Pandemic: A Case Study in Seoul (팬데믹 전후 공공자전거의 마이크로 모빌리티 패턴 비교: 서울시 사례 연구)

  • Jae-Hee Cho;Ga-Eun Baek;Il-Jung Seo
    • The Journal of Bigdata / v.7 no.2 / pp.235-244 / 2022
  • The rental history data of public bicycles in Seoul were analyzed to examine how a pandemic such as COVID-19 changed people's micro mobility. Data for 2019 and 2021 were compared as the periods before and after COVID-19. Data were collected from public data portal sites, and data marts were created for in-depth analysis. To compare the changes between the two periods, a riding direction type dimension and a rental station type dimension were added, and derived variables (rotation rate per unit, riding speed) were created. There is no significant difference in the average rental time before and after COVID-19, but the average rental distance and average usage speed decreased. Even in the mobility of Ttareungi, the slow rhythm of daily life can be seen. On weekdays, the usage rate was highest during commuting hours even before COVID-19, and it increased rapidly after COVID-19. This can be interpreted as people who are concerned about infection preferring Ttareungi to village buses as a means of micro mobility. The results of the data mart-based visualization and analysis proposed in this study can provide insight into public bicycle operation and policy development. Future studies need to combine SNS data such as Twitter and Instagram with public bicycle rental history data. Examining the behavior of bike users in various places is expected to increase the value of related research.

Design of Database Integration System and Query System based on Global View Generation Tool (전역 스키마 생성 도구를 이용한 데이터베이스 통합 및 질의 시스템)

  • Park, U-Chang
    • Journal of Internet Computing and Services / v.8 no.3 / pp.65-74 / 2007
  • Database integration is a common and growing challenge with the proliferation of database systems, data warehouses, data marts, and other OLAP systems in organizations. Although there are many methods of sharing data between databases, true interoperability calls for a database integration system that improves on the database federation architecture by allowing domain administrators to capture database semantics simply and efficiently. The semantic information is combined using a tool for producing a global view. Building the global view is the bottleneck in integration because few tools support its construction, and those tools often require sophisticated knowledge and experience to operate properly. The technique and tool presented here are simple and powerful enough to be used by all database administrators, yet expressive enough to support the majority of integration queries.

Analysis for Diagnosis of Patients with Cerebral Infarction by Sequence Modeling (순차규칙 모델링을 활용한 뇌경색증 환자 진단 분석)

  • Shin, A.M.;Park, H.J.;Lee, I.H.;Kim, Y.N.
    • Journal of rehabilitation welfare engineering & assistive technology / v.2 no.1 / pp.51-56 / 2009
  • This study analyzed the diagnoses of patients with cerebral infarction using sequence modeling, a data mining method, to identify the preceding diseases and complications of these patients. Records with the cerebral infarction diagnosis code I63 from 2000 to 2007 were extracted from the database of hospital A, and a data mart was constructed for analysis. A total of 2,267 patients were diagnosed with cerebral infarction, and 32,692 related diagnosis cases were extracted. Sequence modeling in Clementine 12.0 was used to analyze the patients' diagnoses, and 8 meaningful rules were found. The results could serve as basic data for developing secondary cerebral infarction prevention programs and for preventing complications of cerebral infarction.
