• Title/Summary/Keyword: Synthetic environment data

An SVD-Based Approach for Generating High-Dimensional Data and Query Sets (SVD를 기반으로 한 고차원 데이터 및 질의 집합의 생성)

  • 김상욱
    • The Journal of Information Technology and Database / v.8 no.2 / pp.91-101 / 2001
  • Previous research on the performance evaluation of multidimensional indexes has typically used synthetic data sets distributed uniformly or normally over multidimensional space. However, recent research has shown that these kinds of data sets hardly reflect the characteristics of multimedia database applications. In this paper, we discuss issues in generating high-dimensional data and query sets that resolve this problem. We first identify the features of data and query sets that are appropriate for fairly evaluating the performance of multidimensional indexes, and then propose HDDQ_Gen (High-Dimensional Data and Query Generator), which satisfies these features. HDDQ_Gen supports the following features: (1) clustered distributions, (2) various object distributions in each cluster, (3) various cluster distributions, (4) various correlations among different dimensions, and (5) query distributions that depend on data distributions. Using these features, users can control the distribution characteristics of data and query sets. Our contribution is important in that HDDQ_Gen provides a benchmark environment for evaluating multidimensional indexes correctly.
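
The features listed in this abstract can be illustrated with a minimal sketch. It is not HDDQ_Gen itself: the function names, the decaying spreads, and the use of a random matrix's SVD to obtain rotated cluster axes are all assumptions made for illustration. It shows clustered data with correlated dimensions, and queries drawn so that they follow the data distribution.

```python
import numpy as np

def make_clustered_data(n_points, n_dims, n_clusters, seed=0):
    """Generate clustered points whose dimensions are correlated within each cluster."""
    rng = np.random.default_rng(seed)
    centers = rng.uniform(0.0, 1.0, size=(n_clusters, n_dims))
    labels = rng.integers(0, n_clusters, size=n_points)
    data = np.empty((n_points, n_dims))
    for c in range(n_clusters):
        idx = np.flatnonzero(labels == c)
        # Orthonormal axes taken from the SVD of a random matrix (illustrative choice).
        u, _, _ = np.linalg.svd(rng.normal(size=(n_dims, n_dims)))
        # Unequal spreads along the rotated axes make the original dimensions correlated.
        spreads = 0.05 * 0.5 ** np.arange(n_dims)
        local = rng.normal(size=(idx.size, n_dims)) * spreads
        data[idx] = centers[c] + local @ u.T
    return data, labels

def make_queries(data, n_queries, jitter=0.01, seed=1):
    """Draw query points near randomly chosen data points, so queries follow the data."""
    rng = np.random.default_rng(seed)
    picks = rng.integers(0, data.shape[0], size=n_queries)
    return data[picks] + rng.normal(scale=jitter, size=(n_queries, data.shape[1]))
```

For example, `make_clustered_data(500, 8, 4)` returns a 500 x 8 array with cluster labels, and `make_queries(data, 20)` returns 20 query points concentrated where the data are dense.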

A Compression Study on a Synthetic Talc (합성 활석에 대한 압축 연구)

  • Kim, Young-Ho;Kim, Soon-Oh
    • Journal of the Mineralogical Society of Korea / v.27 no.4 / pp.283-291 / 2014
  • Talc ($Mg_3Si_4O_{10}(OH)_2$), one of the sheet silicates, is soft and has been widely used in industry. A powdered talc specimen was synthesized at a pressure of 200 MPa and a temperature of $600^{\circ}C$ using an externally heated hydrothermal high-pressure apparatus. High-pressure angular dispersive X-ray diffraction (ADXRD) experiments were performed at the Pohang Light Source (PLS) using a symmetrical diamond anvil cell (SDAC). The specimen was compressed up to 11.06 GPa at room temperature. The synthetic talc shows no phase transition within the present pressure limit. Based on the ADXRD data, the bulk modulus of talc was calculated to be 72.4 GPa using the Birch-Murnaghan equation of state (EOS). This value is lower than that previously determined for natural talc.
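
As a rough illustration of the equation of state named in this abstract, the sketch below evaluates the standard third-order Birch-Murnaghan form. The function name is hypothetical, and the default pressure derivative `k0_prime = 4.0` (which reduces the expression to second order) is an assumption, since the abstract does not state the value used.

```python
def birch_murnaghan_pressure(v_ratio, k0, k0_prime=4.0):
    """Third-order Birch-Murnaghan pressure at volume ratio V/V0, in the units of k0."""
    x = v_ratio ** (-1.0 / 3.0)  # (V0/V)^(1/3)
    second_order = 1.5 * k0 * (x**7 - x**5)
    # The correction factor equals 1 when k0_prime == 4 (assumed default).
    correction = 1.0 + 0.75 * (k0_prime - 4.0) * (x**2 - 1.0)
    return second_order * correction

# Pressure (GPa) at 10% volume compression for the reported K0 of 72.4 GPa.
pressure_gpa = birch_murnaghan_pressure(0.9, 72.4)
```

Fitting measured volumes against this expression is how a bulk modulus is usually extracted from diffraction data; the fitting step itself is omitted here.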

Improved Method of License Plate Detection and Recognition using Synthetic Number Plate (인조 번호판을 이용한 자동차 번호인식 성능 향상 기법)

  • Chang, Il-Sik;Park, Gooman
    • Journal of Broadcast Engineering / v.26 no.4 / pp.453-462 / 2021
  • Car number recognition requires a large amount of license plate data, balanced across plate designs from the oldest to the latest. However, it is difficult to obtain real data covering this whole range. To solve this problem, studies on license plate recognition through deep learning create synthetic license plates. Since synthetic data differ from real data, various data augmentation techniques are used to reduce the gap. Existing data augmentation has simply used methods such as brightness changes, rotation, affine transformation, blur, and noise. In this paper, we apply a style transformation method that converts synthetic data into the style of real-world data, together with data augmentation methods. In addition, real license plate data are noisy when captured from a distance or in a dark environment, so recognizing characters directly from the input data gives a high chance of misrecognition. To improve character recognition, we applied the DeblurGANv2 method for image quality improvement, which increased the accuracy of license plate recognition. YOLO-V5 was used as the deep learning method for license plate detection and license plate number recognition. To determine the performance of the synthetic license plate data, we constructed a test set from license plates that we collected ourselves. License plate detection without style conversion recorded 0.614 mAP. With style transformation applied, license plate detection performance improved to 0.679 mAP. In addition, the successful detection rate was 0.872 without image enhancement and 0.915 after image enhancement, confirming that performance improved.

An Iterative Normalization Algorithm for cDNA Microarray Medical Data Analysis

  • Kim, Yoonhee;Park, Woong-Yang;Kim, Ho
    • Genomics & Informatics / v.2 no.2 / pp.92-98 / 2004
  • A cDNA microarray experiment is one of the most useful high-throughput experiments in medical informatics for monitoring gene expression levels. Statistical analysis of cDNA microarray medical data requires a normalization procedure to reduce the systematic errors that cannot be controlled by the experimental conditions. Among the variety of normalization methods, this paper suggests a more general and synthetic normalization algorithm with a control gene set, based on previous studies of normalization. Starting from the housekeeping genes, the iterative normalization method selects and includes a new control gene set from among all genes at every step of the normalization calculation. The objective of this iterative normalization was to maintain the pattern of the original data and to keep the gene expression levels stable. Spatial plots, M&A (ratio and average values of the intensity) plots, and box plots showed graphically that the mean across all genes converged to zero after applying our iterative normalization. The practicability of the algorithm was demonstrated by applying our method to data from a human photoaging study.

A Study on Insider Threat Dataset Sharing Using Blockchain (블록체인을 활용한 내부자 유출위협 데이터 공유 연구)

  • Wonseok Yoon;Hangbae Chang
    • Journal of Platform Technology / v.11 no.2 / pp.15-25 / 2023
  • This study analyzes the limitations of the insider threat datasets used in insider threat detection research and, to overcome them, compares public insider threat data with insider threat data collected by a security solution. On this basis, we design a data format suitable for insider threat detection and implement a system that can safely share insider threat information between different institutions and companies using blockchain technology. Currently, none of the insider threat datasets available to researchers were collected from actual events. Public datasets are virtual synthetic data randomly created for research, and models trained on them have many limitations in a real environment. To address these limitations, this study designed a private blockchain to secure information sharing between institutions of different affiliations, and derived a method to increase reliability and maintain information integrity and consistency through agreement and verification among participants. The proposed method is expected to collect data through an outflow threat collector and to gather quality datasets of actual threats, rather than synthetic data, through a blockchain-based sharing system, thereby solving the current outflow threat dataset problem and contributing to future insider threat detection models.

A Framework to Construct the Aviation Engagement Simulation Model based on the Synthetic Battlefield in the HLA/RTI System (HLA/RTI 시스템에서 합성전장환경 기반의 항공 교전 시뮬레이션 모델 구축 프레임워크)

  • Ham, Won K.;Yang, Karam;Choi, Jong-Yeob;Park, Sang C.
    • Journal of the Korea Society for Simulation / v.23 no.2 / pp.57-64 / 2014
  • This paper proposes a framework for constructing a synthetic-battlefield-based aviation engagement simulation model for a distributed system. The proposed framework places the synthetic battlefield in an HLA (High Level Architecture)/RTI (Run-Time Infrastructure) based distributed system to reflect environmental effects in the aviation engagement simulation model. In an aviation engagement, the environment affects weapon system functions such as detection and movement. Therefore, environmental effects must be reflected in the simulation. However, previous research is inadequate for the complex operations of weapon systems that an engagement simulation requires. Thus, constructing an engagement simulation system that reflects environmental effects based on environmental data is still difficult. The main objective of this paper is to propose a framework that resolves this difficulty and to construct an example system based on the proposed framework.

Wind Data Simulation Using Digital Generation of Non-Gaussian Turbulence Multiple Time Series with Specified Sample Cross Correlations (임의의 표본상호상관함수와 비정규확률분포를 갖는 다중 난류시계열의 디지털 합성방법을 이용한 풍속데이터 시뮬레이션)

  • Seong, Seung-Hak;Kim, Wook;Kim, Kyung-Chun;Boo, Jung-Sook
    • Journal of Korean Society for Atmospheric Environment / v.19 no.5 / pp.569-581 / 2003
  • A method of synthetic time series generation was developed and applied to the simulation of homogeneous turbulence in a periodic 3-D box and to the simulation of hourly wind data. The method can reproduce almost exactly the sample auto and cross correlations of multiple time series and can control their non-Gaussian distribution. Using the turbulence simulation, the influence of correlations, non-Gaussian distribution, and one-direction anisotropy on homogeneous structure was studied by investigating the spatial distribution of turbulence kinetic energy and enstrophy. Hourly wind data from Typhoon Robin were used to illustrate the capability of the method to simulate sample cross correlations of multiple time series. The simulated typhoon data show a similar shape of fluctuations and almost exactly the same sample auto and cross correlations as those of Robin.
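
The idea of matching sample cross correlations can be sketched with a standard whitening and coloring step. This is a swapped-in textbook technique, not the method of the paper: it handles only zero-lag correlations and Gaussian noise, and the function name is hypothetical.

```python
import numpy as np

def series_with_sample_corr(target_corr, n_steps, seed=0):
    """Return n_steps x k series whose zero-lag sample correlation equals target_corr."""
    rng = np.random.default_rng(seed)
    target = np.asarray(target_corr, dtype=float)
    k = target.shape[0]
    z = rng.normal(size=(n_steps, k))
    z -= z.mean(axis=0)  # zero sample mean
    # Whiten: force the sample covariance of the noise to be exactly the identity.
    l_sample = np.linalg.cholesky(z.T @ z / n_steps)
    white = z @ np.linalg.inv(l_sample).T
    # Color: impose the target correlation through its Cholesky factor.
    l_target = np.linalg.cholesky(target)
    return white @ l_target.T
```

Because the noise is whitened against its own sample covariance before coloring, the resulting sample correlation matches the target to floating-point precision rather than only in expectation.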

Application of Rainwater Harvesting System Reliability Model Based on Non-parametric Stochastic Daily Rainfall Generator to Haundae District of Busan (비모수적 추계학적 일 강우 발생기 기반의 빗물이용시설 신뢰도 평가모형의 부산광역시 해운대 신시가지 적용)

  • Choi, ChiHyun;Park, MooJong;Baek, ChunWoo;Kim, SangDan
    • Journal of Korean Society on Water Environment / v.27 no.5 / pp.634-645 / 2011
  • A newly developed rainwater harvesting (RWH) system reliability model is evaluated for the roof area of buildings in Haeundae District of Busan. The RWH system is used to supply water for toilet flushing, back garden irrigation, and air cooling. The model is portable because it is based on a non-parametric precipitation generation algorithm using a Markov chain. Precipitation occurrence is simulated using transition probabilities derived for each day of the year from the historical probability of wet and dry day state changes. Precipitation amounts are selected from a matrix of historical values within a moving 30-day window centered on the target day. The reliability of the RWH system is then determined over ranges of catchment area and tank volume using the synthetic precipitation data. The synthetic rainfall data reproduced the characteristics of precipitation in Busan well, and the computed reliabilities of the RWH system were high for each of the demands. Furthermore, for the study area using the RWH system, the reduction efficiencies for rooftop runoff inputs to the sewer system and for potable water demand were evaluated at 23% and 53%, respectively.
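
The generator described in this abstract can be sketched as follows. This is a minimal illustration under stated assumptions, not the authors' model: the function name, the wet-day threshold, and the input layout (one list of daily depths per historical year) are hypothetical, and the window of 15 days on either side of the target day approximates the 30-day window.

```python
import random

def simulate_year(history, wet_threshold=0.0, half_window=15, seed=0):
    """Simulate one year of daily rainfall from historical yearly records.

    history: list of years, each a list of daily rainfall depths of equal length.
    """
    rng = random.Random(seed)
    n_days = len(history[0])
    series = []
    wet_prev = False
    for day in range(n_days):
        # P(wet today | yesterday's state), estimated for this day of the year.
        # For day 0, index -1 wraps to the last day of the same record.
        matches = [yr[day] > wet_threshold for yr in history
                   if (yr[day - 1] > wet_threshold) == wet_prev]
        p_wet = sum(matches) / len(matches) if matches else 0.0
        wet_today = rng.random() < p_wet
        amount = 0.0
        if wet_today:
            # Resample an amount from historical wet days near the target day.
            pool = [yr[(day + k) % n_days]
                    for yr in history
                    for k in range(-half_window, half_window + 1)
                    if yr[(day + k) % n_days] > wet_threshold]
            amount = rng.choice(pool)
        series.append(amount)
        wet_prev = wet_today
    return series
```

Because amounts are resampled from observed values rather than drawn from a fitted distribution, the generator needs no distributional parameters, which is the sense in which the approach is non-parametric.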

Validation of Sea Surface Wind Estimated from KOMPSAT-5 Backscattering Coefficient Data (KOMPSAT-5 후방산란계수 자료로 산출된 해상풍 검증)

  • Jang, Jae-Cheol;Park, Kyung-Ae;Yang, Dochul
    • Korean Journal of Remote Sensing / v.34 no.6_3 / pp.1383-1398 / 2018
  • Sea surface wind is one of the most fundamental variables for understanding diverse marine phenomena. Although scatterometers have produced global wind field data since the early 1990s, the data have seen limited use in oceanic applications because of their low spatial resolution, especially in coastal regions. Synthetic Aperture Radar (SAR) is capable of producing high-resolution wind field data. KOMPSAT-5 is the first Korean satellite equipped with an X-band SAR instrument and is able to retrieve the sea surface wind. This study presents, for the first time, validation results for sea surface wind derived from KOMPSAT-5 backscattering coefficient data. We collected 18 KOMPSAT-5 ES mode data sets to produce a matchup database collocated with buoy stations. To calculate accurate wind speeds, we preprocessed the SAR data, including land masking, speckle noise reduction, and ship detection, and converted the in-situ wind to 10-m neutral wind as reference wind data using the Liu-Katsaros-Businger (LKB) model. The sea surface winds based on XMOD2 show root-mean-square errors of about $2.41-2.74\;m\;s^{-1}$, depending on the backscattering coefficient conversion equation. In-depth analyses of the wind speed errors derived from KOMPSAT-5 backscattering coefficient data reveal diverse potential error factors, such as image quality related to range ambiguity, the discrete and discontinuous distribution of incidence angle, changes in the marine atmospheric environment, and the impacts of atmospheric gravity waves, the ocean wave spectrum, and internal waves.
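
The error statistic quoted in this abstract can be computed from a matchup set as sketched below. The function name and the sample values in the usage note are illustrative assumptions, not data from the study.

```python
import math

def wind_errors(sar_winds, buoy_winds):
    """Return (bias, RMSE) of SAR-derived wind speeds against buoy reference winds."""
    if len(sar_winds) != len(buoy_winds) or not sar_winds:
        raise ValueError("inputs must be non-empty and of equal length")
    diffs = [s - b for s, b in zip(sar_winds, buoy_winds)]
    bias = sum(diffs) / len(diffs)
    rmse = math.sqrt(sum(d * d for d in diffs) / len(diffs))
    return bias, rmse
```

For instance, `wind_errors([5.0, 7.0], [4.0, 9.0])` compares two hypothetical matchups; the reference winds would first be converted to 10-m neutral winds, a step not shown here.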

Assessment of Stand-alone Utilization of Sentinel-1 SAR for High Resolution Soil Moisture Retrieval Using Machine Learning (기계학습 기반 고해상도 토양수분 복원을 위한 Sentinel-1 SAR의 자립형 활용성 평가)

  • Jeong, Jaehwan;Cho, Seongkeun;Jeon, Hyunho;Lee, Seulchan;Choi, Minha
    • Korean Journal of Remote Sensing / v.38 no.5_1 / pp.571-585 / 2022
  • As the threat of natural disasters such as droughts, floods, forest fires, and landslides increases due to climate change, social demand for high-resolution soil moisture retrieval, such as that based on Synthetic Aperture Radar (SAR), is also increasing. However, the domestic environment has a high proportion of mountainous topography, making it challenging to retrieve soil moisture from SAR data. This study evaluated the usability of Sentinel-1 SAR, combined with the Artificial Neural Network (ANN) technique, for retrieving soil moisture. The backscattering coefficient obtained from Sentinel-1 was confirmed to correlate significantly with soil moisture behavior, and stand-alone use that corrects vegetation effects without auxiliary data from other satellites or observatories was shown to be possible. However, there was a large difference in the characteristics of each site and topographic group. In particular, when models trained on mountainous sites and on flat land were cross-applied, the soil moisture could not be properly simulated. In addition, when the number of learning points was increased to solve this problem, the soil moisture retrieval model was smoothed. As a result, the overall correlation coefficient across all sites improved, but errors at individual sites gradually increased. Therefore, systematic research is needed before high-resolution SAR soil moisture data can be widely applied. The data are expected to be used effectively in various fields if the scope of learning sites and application targets is specifically limited.