• Title/Summary/Keyword: Under-Sampling


A sampling design for e-learning industry status survey on the business demand sector (이러닝수요부문 사업체실태조사를 위한 표본설계)

  • Kim, Hea-Jung;Kwak, Hwa-Ryun
    • Journal of the Korean Data and Information Science Society / v.24 no.4 / pp.701-712 / 2013
  • The e-learning industry status survey provides information about the actual conditions of supply and demand in the e-learning industry. NIPA (National IT Industry Promotion Agency) has published an annual report of the survey results since 2004. Following the 9th revision of the KSIC (Korean Standard Industrial Classification) in 2008, the sampling design for the survey needs refinement, especially for the business demand sector. Based on the 9th revision of the KSIC, this article constructs a stratification of the target population for the e-learning industry status survey on the business demand sector. Strata in the business population are defined by industry type and employment size. Under the stratified population, we design a sampling scheme using the power allocation method, which allows us to satisfy a target coefficient of variation for each industrial stratum. To secure accurate survey results under the proposed sampling design, we also address the calculation of design weights, the derivation of parameter estimators, and the formulas for their standard errors.
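As a rough illustration of the power allocation idea mentioned in the abstract above, the sketch below splits a total sample size across strata in proportion to CV_h * X_h^q. This is one common textbook form of power allocation, not the authors' actual design; the function name, the inputs, and the choice q = 0.5 are assumptions made for the example, and the target-CV check described in the abstract is not implemented.

```python
import math

def power_allocation(sizes, cvs, total_n, q=0.5):
    """Split total_n sample units across strata (illustrative sketch).

    sizes: size measure X_h of each stratum (e.g. number of businesses)
    cvs:   coefficient of variation of the survey variable in each stratum
    q:     power applied to the size measure; q = 0 ignores size,
           q = 1 weights fully by size
    """
    weights = [cv * (x ** q) for x, cv in zip(sizes, cvs)]
    total_w = sum(weights)
    raw = [total_n * w / total_w for w in weights]

    # Largest-remainder rounding so the integer allocation sums to total_n.
    alloc = [math.floor(r) for r in raw]
    leftover = total_n - sum(alloc)
    order = sorted(range(len(raw)), key=lambda i: raw[i] - alloc[i], reverse=True)
    for i in order[:leftover]:
        alloc[i] += 1
    return alloc

# Hypothetical strata: three industry groups with made-up sizes and CVs.
allocation = power_allocation([1000, 200, 50], [0.5, 0.8, 1.2], total_n=100)
```

In practice the allocation would also be capped at the stratum population size and checked against the target coefficient of variation per stratum.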

Phase Tracking for Orthogonal Frequency Division Multiplexing Systems (직교 주파수 분할 다중화 시스템을 위한 위상 오차 추적)

  • Jeon, Tae-Hyun
    • Journal of the Institute of Electronics Engineers of Korea TC / v.43 no.12 s.354 / pp.61-67 / 2006
  • This paper proposes an algorithm for tracking the residual phase errors caused by carrier frequency offset and sampling frequency offset in orthogonal frequency division multiplexing (OFDM) systems, which are suitable for high-data-rate wireless communications. In OFDM systems, mutually orthogonal subcarriers are modulated by digital data and transmitted simultaneously. The carrier frequency offset degrades signal-to-noise ratio (SNR) performance and causes interference between adjacent subcarriers. Errors in sampling timing, caused by the difference in sampling frequency between the transmitter and the receiver, also cause major performance degradation in OFDM systems. A residual error tracking and compensation mechanism is essential in an OFDM system because the carrier and sampling frequency offsets destroy orthogonality, resulting in system performance loss. This paper proposes a scheme in which the channel gain and the payload data information are reflected in the residual error tracking process, which reduces the estimation error and improves tracking performance over frequency-selective fading wireless channels.

Ultra-WideBand Channel Measurement with Compressive Sampling for Indoor Localization (실내 위치추정을 위한 Compressive Sampling적용 Ultra-WideBand 채널 측정기법)

  • Kim, Sujin;Myung, Jungho;Kang, Joonhyuk;Sung, Tae-Kyung;Lee, Kwang-Eog
    • The Journal of Korean Institute of Communications and Information Sciences / v.40 no.2 / pp.285-297 / 2015
  • In this paper, Ultra-WideBand (UWB) channel measurement and modeling based on compressive sampling (CS) are proposed. The sparsity of the channel impulse response (CIR) of the UWB signal in the frequency domain gives the proposed channel measurement low complexity and performance comparable to that of existing approaches, especially those used for indoor geo-localization. Furthermore, to improve performance under noisy conditions, the soft thresholding method is investigated for solving the optimization problem of CS signal recovery. Numerical results evaluate the proposed channel measurement and modeling on real measured data in terms of location estimation error, bandwidth, and compression ratio for indoor geo-localization using a UWB system.
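The soft thresholding step mentioned in the abstract above is a standard operator in sparse recovery: each coefficient is shrunk toward zero by a threshold and set to zero if its magnitude falls below it. The sketch below shows only that operator on real-valued coefficients; the function name and the threshold value are assumptions for illustration, and the paper's full recovery procedure is not reproduced here.

```python
def soft_threshold(values, tau):
    """Shrink each real coefficient toward zero by tau (illustrative sketch).

    Coefficients with magnitude <= tau become 0.0; the others keep their
    sign and lose tau in magnitude.
    """
    shrunk = []
    for v in values:
        magnitude = abs(v) - tau
        if magnitude <= 0:
            shrunk.append(0.0)
        else:
            shrunk.append(magnitude if v > 0 else -magnitude)
    return shrunk

# Small noisy entries are zeroed, large ones are kept but shrunk.
denoised = soft_threshold([3.0, -0.5, -2.0, 0.2], tau=1.0)
```

Iterative recovery schemes typically alternate a gradient step on the data-fit term with this shrinkage step; a measured channel response would usually be complex-valued, in which case the shrinkage is applied to the magnitude.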

Performance Evaluation of an Auto Sampling and Filtering Unit of Substrate Solution using a Diaphragm Pump (소형 판막 펌프를 이용한 기질용액 채취 및 여과 자동화 장치의 성능검증)

  • Song, D.B.;Jung, H.S.;Lee, S.K.;Jung, D.H.;Park, S.W.
    • Journal of Biosystems Engineering / v.32 no.4 / pp.263-268 / 2007
  • An automatic sampling and filtering unit was developed to automate the monitoring of a fermentation process, and its performance was evaluated. The unit was constructed with a glass filter, a diaphragm suction pump, and a flow direction change valve. To evaluate operating stability, the delivery volumes of the suction pump were measured under different experimental conditions: cellulose powder content, pore size of the glass filter, and suction head of the pump. The developed unit could deliver the sample solution under all experimental conditions except a filter pore size of $16\,{\mu}m$ with a suction head of 20 cm. At a suction head of 30 cm, the pump could not deliver the sample solution at all. The concentrations of the sample solutions converged to those of the standard glucose solution 8 minutes after the initial sampling time. The relative error in concentration between the sample and the standard solution was 3.8, 4.8, and 7.0% for cellulose powder contents of 1, 3, and 5%, respectively.

A Simulation-based Optimization for Scheduling in a Fab: Comparative Study on Different Sampling Methods (시뮬레이션 기반 반도체 포토공정 스케줄링을 위한 샘플링 대안 비교)

  • Hyunjung Yoon;Gwanguk Han;Bonggwon Kang;Soondo Hong
    • Journal of the Korea Society for Simulation / v.32 no.3 / pp.67-74 / 2023
  • A semiconductor fabrication facility (fab) is one of the most capital-intensive and large-scale manufacturing systems, operating under complex and uncertain constraints through hundreds of fabrication steps. To improve fab performance with intuitive scheduling, practitioners have used weighted-sum scheduling. Since the choice of weights significantly affects fab performance, practitioners often rely on simulation-based decision making to obtain optimal weights. However, a large-scale, high-fidelity simulation is generally too time-intensive to evaluate with an exhaustive search. In this study, we investigated three sampling methods for the optimization: optimal Latin hypercube sampling (OLHS), a genetic algorithm (GA), and decision-tree-based sequential search (DSS). Our simulation experiments demonstrate that (1) all three methods outperform greedy heuristics on the performance metrics, and (2) GA and DSS are promising tools for accelerating the decision-making process.
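To make the Latin hypercube idea in the abstract above concrete, the sketch below draws a basic Latin hypercube sample on the unit cube: each dimension is cut into as many equal bins as there are samples, and every bin is used exactly once per dimension. This is the plain variant, not the optimal (OLHS) one studied in the paper, and the function name and parameters are assumptions for illustration.

```python
import random

def latin_hypercube(n_samples, n_dims, rng=None):
    """Draw a basic Latin hypercube sample in [0, 1)^n_dims (sketch)."""
    rng = rng or random.Random()
    columns = []
    for _ in range(n_dims):
        # One random point inside each of the n_samples equal-width bins.
        column = [(i + rng.random()) / n_samples for i in range(n_samples)]
        # Shuffle so that bins are paired at random across dimensions.
        rng.shuffle(column)
        columns.append(column)
    # Transpose: one row per sample point.
    return [[columns[d][i] for d in range(n_dims)] for i in range(n_samples)]

# Five candidate weight vectors for a hypothetical three-term weighted sum.
candidates = latin_hypercube(5, 3, random.Random(0))
```

Each candidate row could then be rescaled to the admissible range of the scheduling weights and evaluated by the simulation; an optimal variant would additionally search over the bin pairings to improve space filling.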

Sampling Based Approach to Hierarchical Bayesian Estimation of Reliability Function

  • Younshik Chung
    • Communications for Statistical Applications and Methods / v.2 no.2 / pp.43-51 / 1995
  • For the stress-strength function, hierarchical Bayes estimation is considered under squared error loss and entropy loss. In particular, the desired marginal posterior densities are obtained via the Gibbs sampler, an iterative Monte Carlo method, and a Normal approximation (by the Delta method). A simulation is presented.


Improvement of Verification Method for Remedial Works through the Suggestion of Indicative Parameters and Sampling Method (정화 보조지표와 시료 채취 방법 제안을 통한 토양정화검증 제도 개선 연구)

  • Kwon, Ji Cheol;Lee, Goontaek;Kim, Tae Seung;Yoon, Jeong-Ki;Kim, Ji-in;Kim, Yonghoon;Kim, Joonyoung;Choi, Jeongmin
    • Journal of Soil and Groundwater Environment / v.21 no.6 / pp.179-191 / 2016
  • In addition to measuring the concentration of soil contaminants, this study proposes the new idea of indicative parameters to validate remedial works by monitoring changes in soil characteristics after clean-up technologies are applied. CFU (colony forming units), pH, and soil texture were recommended as indicative parameters for land farming. For soil washing, the water content and the particle size distribution of the sludge were recommended as indicative parameters. The sludge is produced by the particle separation process in soil washing and is usually treated as waste. Water content, organic matter content, CEC (cation exchange capacity), and CFU were recommended as indicative parameters for the low-temperature thermal desorption method. Besides the indicative parameters, sampling methods for stockpiles and the optimal minimum amount of composite soil sample were proposed. The sampling error rates of the regular grid, zigzag, four-bearing, and random grid methods were 17.3%, 17.6%, 17.2%, and 16.5%, respectively. The random grid method showed the smallest sampling error among the four methods, although the differences in sampling error were very small. Therefore, the random grid method was recommended as an appropriate sampling method for stockpiles. It was not possible to propose an optimal minimum amount of composite soil sample from the real analytical data because of the dynamic variation of $CV_{fund{\cdot}error}$. Instead, 355 g of soil was recommended as the optimal minimum amount of composite soil sample under the assumptions of ISO 10381-8.

Application of Random Over Sampling Examples(ROSE) for an Effective Bankruptcy Prediction Model (효과적인 기업부도 예측모형을 위한 ROSE 표본추출기법의 적용)

  • Ahn, Cheolhwi;Ahn, Hyunchul
    • The Journal of the Korea Contents Association / v.18 no.8 / pp.525-535 / 2018
  • If one class in a classification problem occurs far more frequently than the others, a data imbalance problem arises that distorts machine learning. Corporate bankruptcy prediction often suffers from data imbalance because the proportion of insolvent companies is generally very low, whereas the proportion of solvent companies is very high. To mitigate this problem, a proper sampling technique must be applied. Until now, oversampling techniques, which adjust the class distribution of a data set by sampling the minority class with replacement, have been widely used. However, they carry a risk of overfitting. Against this background, this study applies the ROSE (Random Over Sampling Examples) technique, proposed by Menardi and Torelli in 2014, to effective corporate bankruptcy prediction. The ROSE technique creates new learning samples by synthesizing them from the existing training samples, so it leads to better prediction accuracy of the classifiers while avoiding the risk of overfitting. Specifically, our study proposes combining the ROSE method with an SVM (support vector machine), which is known as the best binary classifier. We applied the proposed method to a real-world bankruptcy prediction case from a major Korean bank and compared its performance with that of other sampling techniques. Experimental results showed that ROSE improved the prediction accuracy of the SVM in bankruptcy prediction compared to the other techniques, with statistical significance. These results suggest that ROSE can be a good alternative for resolving data imbalance in social science prediction problems other than bankruptcy prediction.
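The idea of synthesizing new minority examples rather than duplicating existing ones can be sketched as a smoothed bootstrap: pick a minority sample at random and perturb it with kernel noise. The code below is a simplified sketch in the spirit of ROSE, not the published algorithm or the authors' implementation; the function name, the single scalar Gaussian bandwidth, and the toy data are assumptions for illustration.

```python
import random

def rose_like_oversample(minority, n_new, bandwidth, rng=None):
    """Generate n_new synthetic minority samples (illustrative sketch).

    minority:  list of feature vectors (lists of floats) from the rare class
    bandwidth: standard deviation of the Gaussian noise added per feature
               (assumed scalar here; a per-feature bandwidth is more usual)
    """
    rng = rng or random.Random()
    synthetic = []
    for _ in range(n_new):
        seed = rng.choice(minority)  # resample an existing example
        # Draw a new point from a Gaussian kernel centered on that example.
        synthetic.append([x + rng.gauss(0.0, bandwidth) for x in seed])
    return synthetic

# Hypothetical two-feature bankrupt-firm records, expanded to ten samples.
new_samples = rose_like_oversample([[1.0, 2.0], [3.0, 4.0]], 10, 0.1, random.Random(1))
```

Because every synthetic point differs slightly from its seed, a classifier sees a neighbourhood around each minority example instead of exact copies, which is the intuition behind the reduced overfitting risk described above.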

Criteria for calculation of CSO volume and frequency using rainfall-runoff model (우수유출 모형을 이용한 합류식하수관로시스템의 월류량, 월류빈도 산정 기준 결정 연구)

  • Lee, Gunyoung;Na, Yongun;Ryu, Jaena;Oh, Jeill
    • Journal of Korean Society of Water and Wastewater / v.27 no.3 / pp.313-324 / 2013
  • It is widely known that untreated Combined Sewer Overflows (CSOs) discharged directly into receiving waters have a negative impact. Recent concern about the CSO problem has led to the construction of several large-scale treatment facilities, but these facilities are normally designed under empirical design criteria. In this study, several criteria for defining CSOs (e.g., determination of effective rainfall, sampling time, and minimum duration of data used for rainfall-runoff simulation) were investigated. The study then suggests a standard methodology for CSO calculation to support formalized design criteria for CSO facilities. The criteria chosen for an effective rainfall event were a total rainfall depth of over 0.5 mm and a gap of at least 4 hours between two separate events. The antecedent dry weather period before a storm event that satisfies the effective rainfall criteria was over 3 days. A sampling time of 1 hour was suggested for the rainfall-runoff model simulation. Long-term simulation for calculating CSO volume and frequency should use at least the most recent 10 years of data. A management plan for CSOs should be established in phases, should reflect the site-specific conditions of different catchments, and should be examined against formalized criteria for defining CSOs.
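The effective rainfall criteria stated above (more than 0.5 mm total depth, at least 4 dry hours between events, 1-hour time step) can be sketched as a simple event-splitting routine over an hourly rainfall series. The function name, the input format, and the example series are assumptions for illustration; the 3-day antecedent dry weather condition is not implemented here.

```python
def split_rain_events(hourly_mm, min_gap_hours=4, min_depth_mm=0.5):
    """Split an hourly rainfall series into effective events (sketch).

    Returns a list of (start_hour, end_hour, total_depth_mm) tuples for
    events whose total depth exceeds min_depth_mm. Two wet periods belong
    to different events when at least min_gap_hours dry hours separate them.
    """
    events = []
    current = []   # (hour_index, depth) pairs of the event being built
    dry_run = 0    # consecutive dry hours since the last wet hour
    for hour, depth in enumerate(hourly_mm):
        if depth > 0:
            if current and dry_run >= min_gap_hours:
                events.append(current)
                current = []
            current.append((hour, depth))
            dry_run = 0
        else:
            dry_run += 1
    if current:
        events.append(current)

    effective = []
    for event in events:
        total = sum(d for _, d in event)
        if total > min_depth_mm:
            effective.append((event[0][0], event[-1][0], total))
    return effective

# Hypothetical 10-hour record: two events separated by a 4-hour dry gap.
found = split_rain_events([0, 1.0, 0.2, 0, 0, 0, 0, 0.3, 0, 2.0])
```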

Caffeine and Carbamazepine: Detection in Nakdong River Basin and Behavior under Drinking Water Treatment Processes (Caffeine과 Carbamazepine: 낙동강 수계에서의 검출 및 정수처리 공정에서의 거동)

  • Son, Hee-Jong;Yeom, Hoon-Sik;Jung, Jong-Moon;Jang, Seong-Ho;Kim, Han-Soo
    • Journal of Environmental Science International / v.21 no.7 / pp.837-843 / 2012
  • The aims of this study were to investigate the occurrence of caffeine and carbamazepine in the Nakdong River basin (8 mainstreams and 2 tributaries) and the behavior of caffeine and carbamazepine under drinking water treatment processes (conventional and advanced processes). The results showed that caffeine was detected at all sampling sites (5.4~558.5 ng/L), whereas carbamazepine was detected at five sampling sites (5.1~79.4 ng/L). The highest concentrations of caffeine and carbamazepine in the mainstream and tributaries of the Nakdong River were found at Goryeong and Jinchun-cheon, respectively. These pharmaceutical products were completely removed when subjected to conventional plus advanced drinking water treatment processes. The conventional processes of coagulation, sedimentation, and sand filtration were not effective for their removal, while the advanced processes of ozonation and biological activated carbon (BAC) filtration were effective. Of the two pharmaceuticals, carbamazepine was more susceptible to ozonation than caffeine.