• Title/Summary/Keyword: ensemble mean

Search Result 199, Processing Time 0.026 seconds

Measurement of Turbulence Properties at the Time of Flow Reversal Under High Wave Conditions in Hujeong Beach (후정해변 고파랑 조건하에서 파랑유속 방향전환점에서 발생하는 난류성분의 측정)

  • Chang, Yeon S.;Do, Jong Dae;Kim, Sun-Sin;Ahn, Kyungmo;Jin, Jae-Youll
    • Journal of Korean Society of Coastal and Ocean Engineers
    • /
    • v.29 no.4
    • /
    • pp.206-216
    • /
    • 2017
  • The temporal distribution of the turbulence kinetic energy (TKE) and the vertical component of Reynolds stresses ($-{\bar{u^{\prime}w^{\prime}}}$) was measured during one wave period under high wave energy conditions. The wave data were obtained at Hujeong Beach in the east coast of Korea at January 14~18 of 2017 when an extratropical cyclone was developed in the East Sea. Among the whole thousands of waves measured during the period, hundreds of regular waves that had with similar pattern were selected for the analysis in order to give three representing mean wave patterns using the ensemble average technique. The turbulence properties were then estimated based on the selected wave data. It is interesting to find out that $-{\bar{u^{\prime}w^{\prime}}}$ has one clear peak near the time of flow reversal while TKE has two peaks at the corresponding times of maximum cross-shore velocity magnitudes. The distinguished pattern of Reynolds stress indicates that vertical fluxes of such properties as suspended sediments may be enhanced at the time when the horizontal flow direction is reversed to disturb the flows, supporting the turbulence convection process proposed by Nielsen (1992). The characteristic patterns of turbulence properties are examined using the CADMAS-SURF Reynolds-Averaged Navier-Stokes (RANS) model. Although the model can reasonably simulate the distribution of TKE pattern, it fails to produce the $-{\bar{u^{\prime}w^{\prime}}}$ peak at the time of flow reversal, which indicates that the application of RANS model is limited in the prediction of some turbulence properties such as Reynolds stresses.

A Correction of East Asian Summer Precipitation Simulated by PNU/CME CGCM Using Multiple Linear Regression (다중 선형 회귀를 이용한 PNU/CME CGCM의 동아시아 여름철 강수예측 보정 연구)

  • Hwang, Yoon-Jeong;Ahn, Joong-Bae
    • Journal of the Korean earth science society
    • /
    • v.28 no.2
    • /
    • pp.214-226
    • /
    • 2007
  • Because precipitation is influenced by various atmospheric variables, it is highly nonlinear. Although precipitation predicted by a dynamic model can be corrected by using a nonlinear Artificial Neural Network, this approach has limits such as choices of the initial weight, local minima and the number of neurons, etc. In the present paper, we correct simulated precipitation by using a multiple linear regression (MLR) method, which is simple and widely used. First of all, Ensemble hindcast is conducted by the PNU/CME Coupled General Circulation Model (CGCM) (Park and Ahn, 2004) for the period from April to August in 1979-2005. MLR is applied to precipitation simulated by PNU/CME CGCM for the months of June (lead 2), July (lead 3), August (lead 4) and seasonal mean JJA (from June to August) of the Northeast Asian region including the Korean Peninsula $(110^{\circ}-145^{\circ}E,\;25-55^{\circ}N)$. We build the MLR model using a linear relationship between observed precipitation and the hindcasted results from the PNU/CME CGCM. The predictor variables selected from CGCM are precipitation, 500 hPa vertical velocity, 200 hPa divergence, surface air temperature and others. After performing a leave-oneout cross validation, the results are compared with the PNU/CME CGCM's. The results including Heidke skill scores demonstrate that the MLR corrected results have better forecasts than the direct CGCM result for rainfall.

Development of the Aircraft CO2 Measurement Data Assimilation System to Improve the Estimation of Surface CO2 Fluxes Using an Inverse Modeling System (인버스 모델링을 이용한 지표면 이산화탄소 플럭스 추정 향상을 위한 항공기 관측 이산화탄소 자료동화 체계 개발)

  • Kim, Hyunjung;Kim, Hyun Mee;Cho, Minkwang;Park, Jun;Kim, Dae-Hui
    • Atmosphere
    • /
    • v.28 no.2
    • /
    • pp.113-121
    • /
    • 2018
  • In order to monitor greenhouse gases including $CO_2$, various types of surface-, aircraft-, and satellite-based measurement projects have been conducted. These data help understand the variations of greenhouse gases and are used in atmospheric inverse modeling systems to simulate surface fluxes for greenhouse gases. CarbonTracker is a system for estimating surface $CO_2$ flux, using an atmospheric inverse modeling method, based on only surface observation data. Because of the insufficient surface observation data available for accurate estimation of the surface $CO_2$ flux, additional observations would be required. In this study, a system that assimilates aircraft $CO_2$ measurement data in CarbonTracker (CT2013B) is developed, and the estimated results from this data assimilation system are evaluated. The aircraft $CO_2$ measurement data used are obtained from the Comprehensive Observation Network for Trace gases by the Airliner (CONTRAIL) project. The developed system includes the preprocessor of the raw observation data, the observation operator, and the ensemble Kalman filter (EnKF) data assimilation process. After preprocessing the raw data, the modeled value corresponding spatially and temporally to each observation is calculated using the observation operator. These modeled values and observations are then averaged in space and time, and used in the EnKF data assimilation process. The modeled values are much closer to the observations and show smaller biases and root-mean-square errors, after the assimilation of the aircraft $CO_2$ measurement data. This system could also be used to assimilate other aircraft $CO_2$ measurement data in CarbonTracker.

Development of Stochastic Downscaling Method for Rainfall Data Using GCM (GCM Ensemble을 활용한 추계학적 강우자료 상세화 기법 개발)

  • Kim, Tae-Jeong;Kwon, Hyun-Han;Lee, Dong-Ryul;Yoon, Sun-Kwon
    • Journal of Korea Water Resources Association
    • /
    • v.47 no.9
    • /
    • pp.825-838
    • /
    • 2014
  • The stationary Markov chain model has been widely used as a daily rainfall simulation model. A main assumption of the stationary Markov model is that statistical characteristics do not change over time and do not have any trends. In other words, the stationary Markov chain model for daily rainfall simulation essentially can not incorporate any changes in mean or variance into the model. Here we develop a Non-stationary hidden Markov chain model (NHMM) based stochastic downscaling scheme for simulating the daily rainfall sequences, using general circulation models (GCMs) as inputs. It has been acknowledged that GCMs perform well with respect to annual and seasonal variation at large spatial scale and they stand as one of the primary sources for obtaining forecasts. The proposed model is applied to daily rainfall series at three stations in Nakdong watershed. The model showed a better performance in reproducing most of the statistics associated with daily and seasonal rainfall. In particular, the proposed model provided a significant improvement in reproducing the extremes. It was confirmed that the proposed model could be used as a downscaling model for the purpose of generating plausible daily rainfall scenarios if elaborate GCM forecasts can used as a predictor. Also, the proposed NHMM model can be applied to climate change studies if GCM based climate change scenarios are used as inputs.

Sound event detection model using self-training based on noisy student model (잡음 학생 모델 기반의 자가 학습을 활용한 음향 사건 검지)

  • Kim, Nam Kyun;Park, Chang-Soo;Kim, Hong Kook;Hur, Jin Ook;Lim, Jeong Eun
    • The Journal of the Acoustical Society of Korea
    • /
    • v.40 no.5
    • /
    • pp.479-487
    • /
    • 2021
  • In this paper, we propose an Sound Event Detection (SED) model using self-training based on a noisy student model. The proposed SED model consists of two stages. In the first stage, a mean-teacher model based on an Residual Convolutional Recurrent Neural Network (RCRNN) is constructed to provide target labels regarding weakly labeled or unlabeled data. In the second stage, a self-training-based noisy student model is constructed by applying different noise types. That is, feature noises, such as time-frequency shift, mixup, SpecAugment, and dropout-based model noise are used here. In addition, a semi-supervised loss function is applied to train the noisy student model, which acts as label noise injection. The performance of the proposed SED model is evaluated on the validation set of the Detection and Classification of Acoustic Scenes and Events (DCASE) 2020 Challenge Task 4. The experiments show that the single model and ensemble model of the proposed SED based on the noisy student model improve F1-score by 4.6 % and 3.4 % compared to the top-ranked model in DCASE 2020 challenge Task 4, respectively.

A Study on the Predictability of the Number of Days of Heat and Cold Damages by Growth Stages of Rice Using PNU CGCM-WRF Chain in South Korea (PNU CGCM-WRF Chain을 이용한 남한지역 벼의 생육단계별 고온해 및 저온해 발생일수에 대한 예측성 연구)

  • Kim, Young-Hyun;Choi, Myeong-Ju;Shim, Kyo-Moon;Hur, Jina;Jo, Sera;Ahn, Joong-Bae
    • Atmosphere
    • /
    • v.31 no.5
    • /
    • pp.577-592
    • /
    • 2021
  • This study evaluates the predictability of the number of days of heat and cold damages by growth stages of rice in South Korea using the hindcast data (1986~2020) produced by Pusan National University Coupled General Circulation Model-Weather Research and Forecasting (PNU CGCM-WRF) model chain. The predictability is accessed in terms of Root Mean Square Error (RMSE), Normalized Standardized Deviations (NSD), Hit Rate (HR) and Heidke Skill Score (HSS). For the purpose, the model predictability to produce the daily maximum and minimum temperatures, which are the variables used to define heat and cold damages for rice, are evaluated first. The result shows that most of the predictions starting the initial conditions from January to May (01RUN to 05RUN) have reasonable predictability, although it varies to some extent depending on the month at which integration starts. In particular, the ensemble average of 01RUN to 05RUN with equal weighting (ENS) has more reasonable predictability (RMSE is in the range of 1.2~2.6℃ and NSD is about 1.0) than individual RUNs. Accordingly, the regional patterns and characteristics of the predicted damages for rice due to excessive high- and low-temperatures are well captured by the model chain when compared with observation, particularly in regions where the damages occur frequently, in spite that hindcasted data somewhat overestimate the damages in terms of number of occurrence days. In ENS, the HR and HSS for heat (cold) damages in rice is in the ranges of 0.44~0.84 and 0.05~0.13 (0.58~0.81 and -0.01~0.10) by growth stage. Overall, it is concluded that the PNU CGCM-WRF chain of 01RUN~05RUN and ENS has reasonable capability to predict the heat and cold damages for rice in South Korea.

A Study on the Prediction of Disc Cutter Wear Using TBM Data and Machine Learning Algorithm (TBM 데이터와 머신러닝 기법을 이용한 디스크 커터마모 예측에 관한 연구)

  • Tae-Ho, Kang;Soon-Wook, Choi;Chulho, Lee;Soo-Ho, Chang
    • Tunnel and Underground Space
    • /
    • v.32 no.6
    • /
    • pp.502-517
    • /
    • 2022
  • As the use of TBM increases, research has recently increased to to analyze TBM data with machine learning techniques to predict the exchange cycle of disc cutters, and predict the advance rate of TBM. In this study, a regression prediction of disc cutte wear of slurry shield TBM site was made by combining machine learning based on the machine data and the geotechnical data obtained during the excavation. The data were divided into 7:3 for training and testing the prediction of disc cutter wear, and the hyper-parameters are optimized by cross-validated grid-search over a parameter grid. As a result, gradient boosting based on the ensemble model showed good performance with a determination coefficient of 0.852 and a root-mean-square-error of 3.111 and especially excellent results in fit times along with learning performance. Based on the results, it is judged that the suitability of the prediction model using data including mechanical data and geotechnical information is high. In addition, research is needed to increase the diversity of ground conditions and the amount of disc cutter data.

Predicting stock movements based on financial news with systematic group identification (시스템적인 군집 확인과 뉴스를 이용한 주가 예측)

  • Seong, NohYoon;Nam, Kihwan
    • Journal of Intelligence and Information Systems
    • /
    • v.25 no.3
    • /
    • pp.1-17
    • /
    • 2019
  • Because stock price forecasting is an important issue both academically and practically, research in stock price prediction has been actively conducted. The stock price forecasting research is classified into using structured data and using unstructured data. With structured data such as historical stock price and financial statements, past studies usually used technical analysis approach and fundamental analysis. In the big data era, the amount of information has rapidly increased, and the artificial intelligence methodology that can find meaning by quantifying string information, which is an unstructured data that takes up a large amount of information, has developed rapidly. With these developments, many attempts with unstructured data are being made to predict stock prices through online news by applying text mining to stock price forecasts. The stock price prediction methodology adopted in many papers is to forecast stock prices with the news of the target companies to be forecasted. However, according to previous research, not only news of a target company affects its stock price, but news of companies that are related to the company can also affect the stock price. However, finding a highly relevant company is not easy because of the market-wide impact and random signs. Thus, existing studies have found highly relevant companies based primarily on pre-determined international industry classification standards. However, according to recent research, global industry classification standard has different homogeneity within the sectors, and it leads to a limitation that forecasting stock prices by taking them all together without considering only relevant companies can adversely affect predictive performance. To overcome the limitation, we first used random matrix theory with text mining for stock prediction. Wherever the dimension of data is large, the classical limit theorems are no longer suitable, because the statistical efficiency will be reduced. Therefore, a simple correlation analysis in the financial market does not mean the true correlation. To solve the issue, we adopt random matrix theory, which is mainly used in econophysics, to remove market-wide effects and random signals and find a true correlation between companies. With the true correlation, we perform cluster analysis to find relevant companies. Also, based on the clustering analysis, we used multiple kernel learning algorithm, which is an ensemble of support vector machine to incorporate the effects of the target firm and its relevant firms simultaneously. Each kernel was assigned to predict stock prices with features of financial news of the target firm and its relevant firms. The results of this study are as follows. The results of this paper are as follows. (1) Following the existing research flow, we confirmed that it is an effective way to forecast stock prices using news from relevant companies. (2) When looking for a relevant company, looking for it in the wrong way can lower AI prediction performance. (3) The proposed approach with random matrix theory shows better performance than previous studies if cluster analysis is performed based on the true correlation by removing market-wide effects and random signals. The contribution of this study is as follows. First, this study shows that random matrix theory, which is used mainly in economic physics, can be combined with artificial intelligence to produce good methodologies. This suggests that it is important not only to develop AI algorithms but also to adopt physics theory. This extends the existing research that presented the methodology by integrating artificial intelligence with complex system theory through transfer entropy. Second, this study stressed that finding the right companies in the stock market is an important issue. This suggests that it is not only important to study artificial intelligence algorithms, but how to theoretically adjust the input values. Third, we confirmed that firms classified as Global Industrial Classification Standard (GICS) might have low relevance and suggested it is necessary to theoretically define the relevance rather than simply finding it in the GICS.

The KMA Global Seasonal forecasting system (GloSea6) - Part 2: Climatological Mean Bias Characteristics (기상청 기후예측시스템(GloSea6) - Part 2: 기후모의 평균 오차 특성 분석)

  • Hyun, Yu-Kyung;Lee, Johan;Shin, Beomcheol;Choi, Yuna;Kim, Ji-Yeong;Lee, Sang-Min;Ji, Hee-Sook;Boo, Kyung-On;Lim, Somin;Kim, Hyeri;Ryu, Young;Park, Yeon-Hee;Park, Hyeong-Sik;Choo, Sung-Ho;Hyun, Seung-Hwon;Hwang, Seung-On
    • Atmosphere
    • /
    • v.32 no.2
    • /
    • pp.87-101
    • /
    • 2022
  • In this paper, the performance improvement for the new KMA's Climate Prediction System (GloSea6), which has been built and tested in 2021, is presented by assessing the bias distribution of basic variables from 24 years of GloSea6 hindcasts. Along with the upgrade from GloSea5 to GloSea6, the performance of GloSea6 can be regarded as notable in many respects: improvements in (i) negative bias of geopotential height over the tropical and mid-latitude troposphere and over polar stratosphere in boreal summer; (ii) cold bias of tropospheric temperature; (iii) underestimation of mid-latitude jets; (iv) dry bias in the lower troposphere; (v) cold tongue bias in the equatorial SST and the warm bias of Southern Ocean, suggesting the potential of improvements to the major climate variability in GloSea6. The warm surface temperature in the northern hemisphere continent in summer is eliminated by using CDF-matched soil-moisture initials. However, the cold bias in high latitude snow-covered area in winter still needs to be improved in the future. The intensification of the westerly winds of the summer Asian monsoon and the weakening of the northwest Pacific high, which are considered to be major errors in the GloSea system, had not been significantly improved. However, both the use of increased number of ensembles and the initial conditions at the closest initial dates reveals possibility to improve these biases. It is also noted that the effect of ensemble expansion mainly contributes to the improvement of annual variability over high latitudes and polar regions.