• Title/Summary/Keyword: sequential data

Search Result 1,105, Processing Time 0.029 seconds

Improving Flash Translation Layer for Hybrid Flash-Disk Storage through Sequential Pattern Mining based 2-Level Prefetching Technique (하이브리드 플래시-디스크 저장장치용 Flash Translation Layer의 성능 개선을 위한 순차패턴 마이닝 기반 2단계 프리패칭 기법)

  • Chang, Jae-Young;Yoon, Un-Keum;Kim, Han-Joon
    • The Journal of Society for e-Business Studies
    • /
    • v.15 no.4
    • /
    • pp.101-121
    • /
    • 2010
  • This paper presents an intelligent prefetching technique that significantly improves performance of hybrid fash-disk storage, a combination of flash memory and hard disk. Since flash memory embedded in a hybrid device is much faster than hard disk in terms of I/O operations, it can be utilized as a 'cache' space to improve system performance. The basic strategy for prefetching is to utilize sequential pattern mining, with which we can extract the access patterns of objects from historical access sequences. We use two techniques for enhancing the performance of hybrid storage with prefetching. One of them is to modify a FAST algorithm for mapping the flash memory. The other is to extend the unit of prefetching to a block level as well as a file level for effectively utilizing flash memory space. For evaluating the proposed technique, we perform the experiments using the synthetic data and real UCC data, and prove the usability of our technique.

A Structural Framework on Psychological Adaptation and Sequential Changes during the COVID-19 Pandemic (코로나19 팬데믹에 대한 심리적 적응과 연쇄적 변화의 구조 모형)

  • Ko, Dong-Woo;Seo, Hyun-Sook
    • Korean Journal of Culture and Social Issue
    • /
    • v.27 no.4
    • /
    • pp.351-389
    • /
    • 2021
  • This qualitative study aimed to develop a structural framework that explains the process of psychological adaptation and sequential changes being perceived by Korean people under the COVID-19 Pandemic past year. Setting a tentative analysis frame induced from antecedent literatures about psychological phenomena during the COVID-19 pandemic, the qualitative data were collected from 6 Korean adults by semi structured individual interviews. For the data, content analysis applied from the grounded theory were performed. As a result, the initial framework was extended and revised to describe the psychological phenomena under the pandemic. This paradigm structure includes the process of 'causal factors ⇒ psychological main phenomena ⇒ sequential results' being intervened by personal contextual situations and psychological characteristics, as moderators. The category of causal factors were the COVID-19 pandemic, relevant critical incidents, and social distancing policy. The main phenomena reflected either positive, negative, or complicated experiences. The sequential psychological results included transformation of cognitive system or behavior patterns. Various variables such as psychological sense of community and social responsibility, psychological capability for leisure, and positive psychological capital were found out as moderating factors. In discussion and conclusion, theoretical/practical implications of the results and direction to study in the future were suggested.

Multiple imputation and synthetic data (다중대체와 재현자료 작성)

  • Kim, Joungyoun;Park, Min-Jeong
    • The Korean Journal of Applied Statistics
    • /
    • v.32 no.1
    • /
    • pp.83-97
    • /
    • 2019
  • As society develops, the dissemination of microdata has increased to respond to diverse analytical needs of users. Analysis of microdata for policy making, academic purposes, etc. is highly desirable in terms of value creation. However, the provision of microdata, whose usefulness is guaranteed, has a risk of exposure of personal information. Several methods have been considered to ensure the protection of personal information while ensuring the usefulness of the data. One of these methods has been studied to generate and utilize synthetic data. This paper aims to understand the synthetic data by exploring methodologies and precautions related to synthetic data. To this end, we first explain muptiple imputation, Bayesian predictive model, and Bayesian bootstrap, which are basic foundations for synthetic data. And then, we link these concepts to the construction of fully/partially synthetic data. To understand the creation of synthetic data, we review a real longitudinal synthetic data example which is based on sequential regression multivariate imputation.

Feature Selection Algorithm for Intrusions Detection System using Sequential Forward Search and Random Forest Classifier

  • Lee, Jinlee;Park, Dooho;Lee, Changhoon
    • KSII Transactions on Internet and Information Systems (TIIS)
    • /
    • v.11 no.10
    • /
    • pp.5132-5148
    • /
    • 2017
  • Cyber attacks are evolving commensurate with recent developments in information security technology. Intrusion detection systems collect various types of data from computers and networks to detect security threats and analyze the attack information. The large amount of data examined make the large number of computations and low detection rates problematic. Feature selection is expected to improve the classification performance and provide faster and more cost-effective results. Despite the various feature selection studies conducted for intrusion detection systems, it is difficult to automate feature selection because it is based on the knowledge of security experts. This paper proposes a feature selection technique to overcome the performance problems of intrusion detection systems. Focusing on feature selection, the first phase of the proposed system aims at constructing a feature subset using a sequential forward floating search (SFFS) to downsize the dimension of the variables. The second phase constructs a classification model with the selected feature subset using a random forest classifier (RFC) and evaluates the classification accuracy. Experiments were conducted with the NSL-KDD dataset using SFFS-RF, and the results indicated that feature selection techniques are a necessary preprocessing step to improve the overall system performance in systems that handle large datasets. They also verified that SFFS-RF could be used for data classification. In conclusion, SFFS-RF could be the key to improving the classification model performance in machine learning.

Elastic modulus in large concrete structures by a sequential hypothesis testing procedure applied to impulse method data

  • Antonaci, Paola;Bocca, Pietro G.;Sellone, Fabrizio
    • Structural Engineering and Mechanics
    • /
    • v.26 no.5
    • /
    • pp.499-516
    • /
    • 2007
  • An experimental method denoted as Impulse Method is proposed as a cost-effective non-destructive technique for the on-site evaluation of concrete elastic modulus in existing structures: on the basis of Hertz's quasi-static theory of elastic impact and with the aid of a simple portable testing equipment, it makes it possible to collect series of local measurements of the elastic modulus in an easy way and in a very short time. A Hypothesis Testing procedure is developed in order to provide a statistical tool for processing the data collected by means of the Impulse Method and assessing the possible occurrence of significant variations in the elastic modulus without exceeding some prescribed error probabilities. It is based on a particular formulation of the renowned sequential probability ratio test and reveals to be optimal with respect to the error probabilities and the required number of observations, thus further improving the time-effectiveness of the Impulse Method. The results of an experimental investigation on different types of plain concrete prove the validity of the Impulse Method in estimating the unknown value of the elastic modulus and attest the effectiveness of the proposed Hypothesis Testing procedure in identifying significant variations in the elastic modulus.

Fast Scene Understanding in Urban Environments for an Autonomous Vehicle equipped with 2D Laser Scanners (무인 자동차의 2차원 레이저 거리 센서를 이용한 도시 환경에서의 빠른 주변 환경 인식 방법)

  • Ahn, Seung-Uk;Choe, Yun-Geun;Chung, Myung-Jin
    • The Journal of Korea Robotics Society
    • /
    • v.7 no.2
    • /
    • pp.92-100
    • /
    • 2012
  • A map of complex environment can be generated using a robot carrying sensors. However, representation of environments directly using the integration of sensor data tells only spatial existence. In order to execute high-level applications, robots need semantic knowledge of the environments. This research investigates the design of a system for recognizing objects in 3D point clouds of urban environments. The proposed system is decomposed into five steps: sequential LIDAR scan, point classification, ground detection and elimination, segmentation, and object classification. This method could classify the various objects in urban environment, such as cars, trees, buildings, posts, etc. The simple methods minimizing time-consuming process are developed to guarantee real-time performance and to perform data classification on-the-fly as data is being acquired. To evaluate performance of the proposed methods, computation time and recognition rate are analyzed. Experimental results demonstrate that the proposed algorithm has efficiency in fast understanding the semantic knowledge of a dynamic urban environment.

Estimation of Sea Surface Current Vector based on Satellite Ocean Color Image around the Korean Marginal Sea

  • Kim, Eung;Ro, Young-Jae;Ahn, Yu-Hwan
    • Proceedings of the KSRS Conference
    • /
    • v.2
    • /
    • pp.816-819
    • /
    • 2006
  • One of the most difficult parameters to measure in the sea is current speed and direction. Recently, efforts are being made to estimate the ocean current vectors by utilizing sequential satellite imageries. In this study, we attempted to estimated sea surface current vector (sscv) by using satellite ocean color imageries of SeaWifs around the Korean Peninsula. This ocean color image data has 1-day sampling interval and spatial resolution of 1x1 km. Maximum cross-correlation method is employed which is aimed to detect similar patterns between sequential images. The estimated current vectors are compared to the surface geostrophic current vectors obtained from altimeter of sea level height data. In utilizing the color imagery data, some limitations and drawbacks exist so that in warm water region where phytoplankton concentration is relatively lower than in cold water region, estimation of sscv is poor and unreliable. On the other hand, two current vector fields agree reasonably well in the Korean South Sea region where high concentration of chlorophyll-a and weak tide is observed. In the future, with ocean color images of shorter sampling interval by COMS satellite, the algorithm and methodology developed in the study would be useful in providing the information for the ocean current around Korean Peninsula.

  • PDF

Type I projection sum of squares by weighted least squares (가중최소제곱법에 의한 제1종 사영제곱합)

  • Choi, Jaesung
    • Journal of the Korean Data and Information Science Society
    • /
    • v.25 no.2
    • /
    • pp.423-429
    • /
    • 2014
  • This paper discusses a method for getting Type I sums of squares by projections under a two-way fixed-effects model when variances of errors are not equal. The method of weighted least squares is used to estimate the parameters of the assumed model. The model is fitted to the data in a sequential manner by using the model comparison technique. The vector space generated by the model matrix can be composed of orthogonal vector subspaces spanned by submatrices consisting of column vectors related to the parameters. It is discussed how to get the Type I sums of squares by using the projections into the orthogonal vector subspaces.

Markov Chain Monte Carlo simulation based Bayesian updating of model parameters and their uncertainties

  • Sengupta, Partha;Chakraborty, Subrata
    • Structural Engineering and Mechanics
    • /
    • v.81 no.1
    • /
    • pp.103-115
    • /
    • 2022
  • The prediction error variances for frequencies are usually considered as unknown in the Bayesian system identification process. However, the error variances for mode shapes are taken as known to reduce the dimension of an identification problem. The present study attempts to explore the effectiveness of Bayesian approach of model parameters updating using Markov Chain Monte Carlo (MCMC) technique considering the prediction error variances for both the frequencies and mode shapes. To remove the ergodicity of Markov Chain, the posterior distribution is obtained by Gaussian Random walk over the proposal distribution. The prior distributions of prediction error variances of modal evidences are implemented through inverse gamma distribution to assess the effectiveness of estimation of posterior values of model parameters. The issue of incomplete data that makes the problem ill-conditioned and the associated singularity problem is prudently dealt in by adopting a regularization technique. The proposed approach is demonstrated numerically by considering an eight-storey frame model with both complete and incomplete modal data sets. Further, to study the effectiveness of the proposed approach, a comparative study with regard to accuracy and computational efficacy of the proposed approach is made with the Sequential Monte Carlo approach of model parameter updating.

Observational Arc-Length Effect on Orbit Determination for Korea Pathfinder Lunar Orbiter in the Earth-Moon Transfer Phase Using a Sequential Estimation

  • Kim, Young-Rok;Song, Young-Joo
    • Journal of Astronomy and Space Sciences
    • /
    • v.36 no.4
    • /
    • pp.293-306
    • /
    • 2019
  • In this study, the observational arc-length effect on orbit determination (OD) for the Korea Pathfinder Lunar Orbiter (KPLO) in the Earth-Moon Transfer phase was investigated. For the OD, we employed a sequential estimation using the extended Kalman filter and a fixed-point smoother. The mission periods, comprised between the perigee maneuvers (PM) and the lunar orbit insertion (LOI) maneuver in a 3.5 phasing loop of the KPLO, was the primary target. The total period was divided into three phases: launch-PM1, PM1-PM3, and PM3-LOI. The Doppler and range data obtained from three tracking stations [included in the deep space network (DSN) and Korea Deep Space Antenna (KDSA)] were utilized for the OD. Six arc-length cases (24 hrs, 48 hrs, 60 hrs, 3 days, 4 days, and 5 days) were considered for the arc-length effect investigation. In order to evaluate the OD accuracy, we analyzed the position uncertainties, the precision of orbit overlaps, and the position differences between true and estimated trajectories. The maximum performance of 3-day OD approach was observed in the case of stable flight dynamics operations and robust navigation capability. This study provides a guideline for the flight dynamics operations of the KPLO in the trans-lunar phase.