• 제목/요약/키워드: a error model

Search Result 7,334, Processing Time 0.046 seconds

Prediction of Correct Answer Rate and Identification of Significant Factors for CSAT English Test Based on Data Mining Techniques (데이터마이닝 기법을 활용한 대학수학능력시험 영어영역 정답률 예측 및 주요 요인 분석)

  • Park, Hee Jin;Jang, Kyoung Ye;Lee, Youn Ho;Kim, Woo Je;Kang, Pil Sung
    • KIPS Transactions on Software and Data Engineering
    • /
    • v.4 no.11
    • /
    • pp.509-520
    • /
    • 2015
  • College Scholastic Ability Test(CSAT) is a primary test to evaluate the study achievement of high-school students and used by most universities for admission decision in South Korea. Because its level of difficulty is a significant issue to both students and universities, the government makes a huge effort to have a consistent difficulty level every year. However, the actual levels of difficulty have significantly fluctuated, which causes many problems with university admission. In this paper, we build two types of data-driven prediction models to predict correct answer rate and to identify significant factors for CSAT English test through accumulated test data of CSAT, unlike traditional methods depending on experts' judgments. Initially, we derive candidate question-specific factors that can influence the correct answer rate, such as the position, EBS-relation, readability, from the annual CSAT practices and CSAT for 10 years. In addition, we drive context-specific factors by employing topic modeling which identify the underlying topics over the text. Then, the correct answer rate is predicted by multiple linear regression and level of difficulty is predicted by classification tree. The experimental results show that 90% of accuracy can be achieved by the level of difficulty (difficult/easy) classification model, whereas the error rate for correct answer rate is below 16%. Points and problem category are found to be critical to predict the correct answer rate. In addition, the correct answer rate is also influenced by some of the topics discovered by topic modeling. Based on our study, it will be possible to predict the range of expected correct answer rate for both question-level and entire test-level, which will help CSAT examiners to control the level of difficulties.

A Proposal to Control System and the Problems of the Problems of the Report about Supply and Demand for Medical Technicians and Management Policy ("의료기사인력수급에 관한 보고서"의 문제점과 관리제도의 개선방안)

  • Kim, Sang-Hyun;Lim, Yongmoo
    • Journal of Korean Ophthalmic Optics Society
    • /
    • v.13 no.4
    • /
    • pp.25-30
    • /
    • 2008
  • Purpose: In this paper, we have analyzed the problems of the Oh's report which is used to the basic data for supply and demand of medical technicians and studied a proposal for improvement to control system and supply and demand of korean optometrists. Methods: We have analyzed errors of Oh's report including supply and demand for medical technicians and management policy, expecting number for future optician, inaccurate estimation by limited data (employment rate, retirement rate, mortality rate) and an incorrect method of measurement for future supply and demand. Results: Oh's report showed the 18% error for estimation of supply which exclude the irregular entrance students. The estimation of supply was calculated by graduation rate 62.6% (college and University of Technology are 78.9% and 85.98% respectively), employment rate 65.8% (the average employment between 2002 and 2007 is 73.96%) and retirement rate is 2.3% (the retirement of pharmacists is 1.3%) but it showed the significant differences to objective data. For estimate the suitable ratio of optometrists to the population, the ratio use of medical facilities by an age group was used, and suggested spectacle wearers 1,280 persons (populations 2,928 persons) per optometrist but the different from reference of Germany (4,706 persons), America (1,789 persons) and Korea (1,825 persons/an optometrist) are applied to estimation on supply. This report applied the low employment rate and argued that maintain the present situation, but claimed that utilize unemployment persons. The above result has induced double weighting effect on estimation of supply. Conclusions: To solve the related problems of supply and demand, we have to make a search for exact data and optimum application model, have to take an example of nation similar job category as Germany and the research result of the job satisfaction into consideration. After we get the integrated research result, we must carried out the policy with fairness and balance for the estimation of supply and demand. Therefore exact research is required prior to beginning policy establishment, government and related group have to make a clear long-term plan and permanent organization for medical technician to establish supply and demand of medical technician.

  • PDF

Long-term forecasting reference evapotranspiration using statistically predicted temperature information (통계적 기온예측정보를 활용한 기준증발산량 장기예측)

  • Kim, Chul-Gyum;Lee, Jeongwoo;Lee, Jeong Eun;Kim, Hyeonjun
    • Journal of Korea Water Resources Association
    • /
    • v.54 no.12
    • /
    • pp.1243-1254
    • /
    • 2021
  • For water resources operation or agricultural water management, it is important to accurately predict evapotranspiration for a long-term future over a seasonal or monthly basis. In this study, reference evapotranspiration forecast (up to 12 months in advance) was performed using statistically predicted monthly temperatures and temperature-based Hamon method for the Han River basin. First, the daily maximum and minimum temperature data for 15 meterological stations in the basin were derived by spatial-temporal downscaling the monthly temperature forecasts. The results of goodness-of-fit test for the downscaled temperature data at each site showed that the percent bias (PBIAS) ranged from 1.3 to 6.9%, the ratio of the root mean square error to the standard deviation of the observations (RSR) ranged from 0.22 to 0.27, the Nash-Sutcliffe efficiency (NSE) ranged from 0.93 to 0.95, and the Pearson correlation coefficient (r) ranged from 0.97 to 0.98 for the monthly average daily maximum temperature. And for the monthly average daily minimum temperature, PBIAS was 7.8 to 44.7%, RSR was 0.21 to 0.25, NSE was 0.94 to 0.96, and r was 0.98 to 0.99. The difference by site was not large, and the downscaled results were similar to the observations. In the results of comparing the forecasted reference evapotranspiration calculated using the downscaled data with the observed values for the entire region, PBIAS was 2.2 to 5.4%, RSR was 0.21 to 0.28, NSE was 0.92 to 0.96, and r was 0.96 to 0.98, indicating a very high fit. Due to the characteristics of the statistical models and uncertainty in the downscaling process, the predicted reference evapotranspiration may slightly deviate from the observed value in some periods when temperatures completely different from the past are observed. However, considering that it is a forecast result for the future period, it will be sufficiently useful as information for the evaluation or operation of water resources in the future.

Evaluation of Moisture and Feed Values for Winter Annual Forage Crops Using Near Infrared Reflectance Spectroscopy (근적외선분광법을 이용한 동계사료작물 풀 사료의 수분함량 및 사료가치 평가)

  • Kim, Ji Hea;Lee, Ki Won;Oh, Mirae;Choi, Ki Choon;Yang, Seung Hak;Kim, Won Ho;Park, Hyung Soo
    • Journal of The Korean Society of Grassland and Forage Science
    • /
    • v.39 no.2
    • /
    • pp.114-120
    • /
    • 2019
  • This study was carried out to explore the accuracy of near infrared spectroscopy(NIRS) for the prediction of moisture content and chemical parameters on winter annual forage crops. A population of 2454 winter annual forages representing a wide range in chemical parameters was used in this study. Samples of forage were scanned at 1nm intervals over the wavelength range 680-2500nm and the optical data was recorded as log 1/Reflectance(log 1/R), which scanned in intact fresh condition. The spectral data were regressed against a range of chemical parameters using partial least squares(PLS) multivariate analysis in conjunction with spectral math treatments to reduced the effect of extraneous noise. The optimum calibrations were selected based on the highest coefficients of determination in cross validation($R^2$) and the lowest standard error of cross-validation(SECV). The results of this study showed that NIRS calibration model to predict the moisture contents and chemical parameters had very high degree of accuracy except for barely. The $R^2$ and SECV for integrated winter annual forages calibration were 0.99(SECV 1.59%) for moisture, 0.89(SECV 1.15%) for acid detergent fiber, 0.86(SECV 1.43%) for neutral detergent fiber, 0.93(SECV 0.61%) for crude protein, 0.90(SECV 0.45%) for crude ash, and 0.82(SECV 3.76%) for relative feed value on a dry matter(%), respectively. Results of this experiment showed the possibility of NIRS method to predict the moisture and chemical composition of winter annual forage for routine analysis method to evaluate the feed value.

Estimation of High Resolution Sea Surface Salinity Using Multi Satellite Data and Machine Learning (다종 위성자료와 기계학습을 이용한 고해상도 표층 염분 추정)

  • Sung, Taejun;Sim, Seongmun;Jang, Eunna;Im, Jungho
    • Korean Journal of Remote Sensing
    • /
    • v.38 no.5_2
    • /
    • pp.747-763
    • /
    • 2022
  • Ocean salinity affects ocean circulation on a global scale and low salinity water around coastal areas often has an impact on aquaculture and fisheries. Microwave satellite sensors (e.g., Soil Moisture Active Passive [SMAP]) have provided sea surface salinity (SSS) based on the dielectric characteristics of water associated with SSS and sea surface temperature (SST). In this study, a Light Gradient Boosting Machine (LGBM)-based model for generating high resolution SSS from Geostationary Ocean Color Imager (GOCI) data was proposed, having machine learning-based improved SMAP SSS by Jang et al. (2022) as reference data (SMAP SSS (Jang)). Three schemes with different input variables were tested, and scheme 3 with all variables including Multi-scale Ultra-high Resolution SST yielded the best performance (coefficient of determination = 0.60, root mean square error = 0.91 psu). The proposed LGBM-based GOCI SSS had a similar spatiotemporal pattern with SMAP SSS (Jang), with much higher spatial resolution even in coastal areas, where SMAP SSS (Jang) was not available. In addition, when tested for the great flood occurred in Southern China in August 2020, GOCI SSS well simulated the spatial and temporal change of Changjiang Diluted Water. This research provided a potential that optical satellite data can be used to generate high resolution SSS associated with the improved microwave-based SSS especially in coastal areas.

An Adjustment of Cloud Factors for Continuity and Consistency of Insolation Estimations between GOES-9 and MTSAT-1R (GOES-9과 MTSAT-1R 위성 간의 일사량 산출의 연속성과 일관성 확보를 위한 구름 감쇠 계수의 조정)

  • Kim, In-Hwan;Han, Kyung-Soo;Yeom, Jong-Min
    • Korean Journal of Remote Sensing
    • /
    • v.28 no.1
    • /
    • pp.69-77
    • /
    • 2012
  • Surface insolation is one of the major indicators for climate research over the Earth system. For the climate research, long-term data and wide range of spatial coverage from the data observed by two or more of satellites of the same orbit are needed. It is important to improve the continuity and consistency of the derived products, such as surface insolation, from different satellites. In this study, surface insolations based on Geostationary Operational Environmental Satellite (GOES-9) and Multi-functional Transport Satellites (MTSAT-1R) were compared during overlap period using physical model of insolation to find ways to improve the consistency and continuity between two satellites through comparison of each channel data and ground observation data. The thermal infrared brightness temperature of two satellites show a relatively good agreement between two satellites : rootmean square error (RMSE)=5.595 Kelvin; Bias=2.065 Kelvin. Whereas, visible channels shown a quite different values, but it distributed similar tendency. And the surface insolations from two satellites are different from the ground observation data. To improve the quality of retrieved insolations, we have reproduced surface insolation of each satellite through adjustment of the Cloud Factor, and the Cloud Factor for GOES-9 satellite is modified based on the analysis result of difference channel data. As a result, the insolations estimated from GOES-9 for cloudy conditions show good agreement with MTSAT-1R and ground observation : RMSE=$83.439W\;m^{-2}$ Bias=$27.296W\;m^{-2}$. The result improved accuracy confirms that the modification of Cloud Factor for GOES-9 can improve the continuity and consistency of the insolations derived from two or more satellites.

Estimation of Surface Solar Radiation using Ground-based Remote Sensing Data on the Seoul Metropolitan Area (수도권지역의 지상기반 원격탐사자료를 이용한 지표면 태양에너지 산출)

  • Jee, Joon-Bum;Min, Jae-Sik;Lee, Hankyung;Chae, Jung-Hoon;Kim, Sangil
    • Journal of the Korean earth science society
    • /
    • v.39 no.3
    • /
    • pp.228-240
    • /
    • 2018
  • Solar energy is calculated using meteorological (14 station), ceilometer (2 station) and microwave radiometer (MWR, 7 station)) data observed from the Weather Information Service Engine (WISE) on the Seoul metropolitan area. The cloud optical thickness and the cloud fraction are calculated using the back-scattering coefficient (BSC) of the ceilometer and liquid water path of the MWR. The solar energy on the surface is calculated using solar radiation model with cloud fraction from the ceilometer and the MWR. The estimated solar energy is underestimated compared to observations both at Jungnang and Gwanghwamun stations. In linear regression analysis, the slope is less than 0.8 and the bias is negative which is less than $-20W/m^2$. The estimated solar energy using MWR is more improved (i.e., deterministic coefficient (average $R^2=0.8$) and Root Mean Square Error (average $RMSE=110W/m^2$)) than when using ceilometer. The monthly cloud fraction and solar energy calculated by ceilometer is greater than 0.09 and lower than $50W/m^2$ compared to MWR. While there is a difference depending on the locations, RMSE of estimated solar radiation is large over $50W/m^2$ in July and September compared to other months. As a result, the estimation of a daily accumulated solar radiation shows the highest correlation at Gwanghwamun ($R^2=0.80$, RMSE=2.87 MJ/day) station and the lowest correlation at Gooro ($R^2=0.63$, RMSE=4.77 MJ/day) station.

Thin Layer Drying and Quality Characteristics of Ainsliaea acerifolia Sch. Bip. Using Far Infrared Radiation (원적외선을 이용한 단풍취의 박층 건조 및 품질 특성)

  • Ning, Xiao Feng;Li, He;Kang, Tae Hwan;Lee, Jun Soo;Lee, Jeong Hyun;Ha, Chung Su
    • Journal of the Korean Society of Food Science and Nutrition
    • /
    • v.43 no.6
    • /
    • pp.884-892
    • /
    • 2014
  • The purpose of this study was to investigate the drying characteristics and drying models of Ainsliaea acerifolia Sch. Bip. using far-infrared thin layer drying. Far-infrared thin layer drying test on Ainsliaea acerifolia Sch. Bip. was conducted at two air velocities of 0.6 and 0.8 m/sec, as well as three drying temperatures of 40, 45, and $50^{\circ}C$ respectively. The drying models were estimated using coefficient of determination and root mean square error. Drying characteristics were analyzed based on factors such as drying rate, leaf color changes, antioxidant activity, and contents of polyphenolics and flavonoids. The results revealed that increases in drying temperature and air velocity caused a reduction in drying time. The Thompson model was considered suitable for thin layer drying using far-infrared radiation for Ainsliaea accerifolia Sch. Bip. Greenness and yellowness values decreased and lightness values increased after far-infrared thin layer drying, and the color difference (${\Delta}E$) values at $40^{\circ}C$ were higher than those at $45^{\circ}C$ and $50^{\circ}C$. The antioxidant properties of Ainsliaea acerifolia Sch. Bip. decreased under all far-infrared thin layer drying conditions, and the highest polyphenolic content (37.9 mg/g), flavonoid content (22.7 mg/g), DPPH radical scavenging activity (32.5), and ABTS radical scavenging activity (31.1) were observed at a drying temperature of $40^{\circ}C$ with an air velocity of 0.8 m/sec.

A Study on GPU-based Iterative ML-EM Reconstruction Algorithm for Emission Computed Tomographic Imaging Systems (방출단층촬영 시스템을 위한 GPU 기반 반복적 기댓값 최대화 재구성 알고리즘 연구)

  • Ha, Woo-Seok;Kim, Soo-Mee;Park, Min-Jae;Lee, Dong-Soo;Lee, Jae-Sung
    • Nuclear Medicine and Molecular Imaging
    • /
    • v.43 no.5
    • /
    • pp.459-467
    • /
    • 2009
  • Purpose: The maximum likelihood-expectation maximization (ML-EM) is the statistical reconstruction algorithm derived from probabilistic model of the emission and detection processes. Although the ML-EM has many advantages in accuracy and utility, the use of the ML-EM is limited due to the computational burden of iterating processing on a CPU (central processing unit). In this study, we developed a parallel computing technique on GPU (graphic processing unit) for ML-EM algorithm. Materials and Methods: Using Geforce 9800 GTX+ graphic card and CUDA (compute unified device architecture) the projection and backprojection in ML-EM algorithm were parallelized by NVIDIA's technology. The time delay on computations for projection, errors between measured and estimated data and backprojection in an iteration were measured. Total time included the latency in data transmission between RAM and GPU memory. Results: The total computation time of the CPU- and GPU-based ML-EM with 32 iterations were 3.83 and 0.26 see, respectively. In this case, the computing speed was improved about 15 times on GPU. When the number of iterations increased into 1024, the CPU- and GPU-based computing took totally 18 min and 8 see, respectively. The improvement was about 135 times and was caused by delay on CPU-based computing after certain iterations. On the other hand, the GPU-based computation provided very small variation on time delay per iteration due to use of shared memory. Conclusion: The GPU-based parallel computation for ML-EM improved significantly the computing speed and stability. The developed GPU-based ML-EM algorithm could be easily modified for some other imaging geometries.

A Study on the Impact of Human Factors for the Students Pilot's in ATO -With Respect to Korea Aviation Act and ICAO Human Factors Training Manual- (항공법규에 의거 지정된 조종사 양성 전문교육기관의 학생조종사에 대한 휴먼팩터 영향 연구)

  • Lee, Kang-Seok
    • The Korean Journal of Air & Space Law and Policy
    • /
    • v.26 no.2
    • /
    • pp.149-179
    • /
    • 2011
  • Statistics of aviation accident in Korea show that safety level of training flights is high. However, more than 80% of aviation accidents happen owing to human factors. And because most reasons of them are concerned with pilot error, it is very important for student pilots who will transport a lot of passengers to develop the knowledge of safety and abilities of risk management for preventing accidents. In this study, in order to investigate the Human Factors which affect safety in training student pilots for flight, verified the correlationbetween experiences of accident, the differences according to the experience level of training flight and the differences between college student pilots and ordinary student pilots on the basis of human factors that composes the SHELL models. For the study, Using SPSS 17.0, conducted Correlation Analysis, Analysis of Variance(ANOVA) and t-test. To sum up the result of this study, student pilot's ability and equipment in the cockpit are the important factors for safety when pilots are training flight. Also the analysis of the differences between human factors according to the characters of student pilots' groups shows that college student pilots are affected by immanent factors and organizational cultures. So far, there haven't been any accidents which is related with human casualties when training at the ATO(Approved Training Organization). But accidents can occur at any time and anywhere. Especially the human factors which comprises most of aviation accident have a wide reach and are impossible to be eliminated, therefore, it is best to minimize them. Because ATO is the starting point to lead the aviation industry of Korea, we will have to be aware of problems and improve education/training of human factors.

  • PDF