• Title/Summary/Keyword: coefficient of determination (R-square)

Search Result 172, Processing Time 0.031 seconds

Calibration of Portable Particulate Matter-Monitoring Device using Web Query and Machine Learning

  • Loh, Byoung Gook;Choi, Gi Heung
    • Safety and Health at Work
    • /
    • v.10 no.4
    • /
    • pp.452-460
    • /
    • 2019
  • Background: Monitoring and control of PM2.5 are being recognized as key to addressing health issues attributed to PM2.5. The availability of low-cost PM2.5 sensors has made it possible to introduce a number of portable PM2.5 monitors based on light scattering to the consumer market at an affordable price. The accuracy of light scattering-based PM2.5 monitors depends significantly on the method of calibration. A static calibration curve is the most popular calibration method for low-cost PM2.5 sensors, particularly because of its ease of application. The drawback of this approach, however, is its lack of accuracy. Methods: This study discusses the calibration of a low-cost PM2.5-monitoring device (PMD) to improve its accuracy and reliability for practical use. The proposed method is based on the construction of a PM2.5 sensor network using the Message Queuing Telemetry Transport (MQTT) protocol and web query of reference measurement data available at government-authorized PM monitoring stations (GAMS) in the Republic of Korea. Four machine learning (ML) algorithms (support vector machine, k-nearest neighbors, random forest, and extreme gradient boosting) were used as regression models to calibrate the PMD measurements of PM2.5. The performance of each ML algorithm was evaluated using stratified K-fold cross-validation, and a linear regression model was used as a reference. Results: Based on the performance of the ML algorithms used, regression of the PMD output against PM2.5 concentration data available from the GAMS through web query was effective. The extreme gradient boosting algorithm showed the best performance, with a mean coefficient of determination (R2) of 0.78 and a standard error of 5.0 ㎍/㎥, corresponding to an 8% increase in R2 and a 12% decrease in root mean square error in comparison with the linear regression model. A minimum calibration period of 100 hours was found to be required to calibrate the PMD to its full capacity. The proposed calibration method requires the PMD to be located in the vicinity of a GAMS. As the number of PMDs participating in the sensor network increases, however, calibrated PMDs can be used as reference devices for nearby PMDs that require calibration, forming a calibration chain through the MQTT protocol. Conclusions: Calibration of a low-cost PMD, based on the construction of a PM2.5 sensor network using the MQTT protocol and web query of reference measurement data available at a GAMS, significantly improves the accuracy and reliability of the PMD, thereby making practical use of the low-cost PMD possible.
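
The evaluation procedure described in the abstract above (a reference linear regression scored by K-fold cross-validation) can be sketched as follows. This is a minimal illustration, not the authors' code: it uses plain rather than stratified K-fold splitting, a one-variable least-squares fit instead of the four ML regressors, and made-up sensor/reference pairs. All function and variable names are hypothetical.

```python
from math import sqrt


def fit_line(x, y):
    """Ordinary least-squares fit y ~ a*x + b for a single predictor."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    sxx = sum((xi - mx) ** 2 for xi in x)
    sxy = sum((xi - mx) * (yi - my) for xi, yi in zip(x, y))
    a = sxy / sxx
    return a, my - a * mx


def r2_and_rmse(y_true, y_pred):
    """Coefficient of determination (1 - SS_res/SS_tot) and RMSE."""
    n = len(y_true)
    mean_y = sum(y_true) / n
    ss_res = sum((t - p) ** 2 for t, p in zip(y_true, y_pred))
    ss_tot = sum((t - mean_y) ** 2 for t in y_true)
    return 1.0 - ss_res / ss_tot, sqrt(ss_res / n)


def kfold_scores(x, y, k=5):
    """Mean R2 and mean RMSE over k folds; fold i holds out every k-th sample."""
    scores = []
    for i in range(k):
        test_idx = set(range(i, len(x), k))
        x_tr = [v for j, v in enumerate(x) if j not in test_idx]
        y_tr = [v for j, v in enumerate(y) if j not in test_idx]
        x_te = [x[j] for j in sorted(test_idx)]
        y_te = [y[j] for j in sorted(test_idx)]
        a, b = fit_line(x_tr, y_tr)
        scores.append(r2_and_rmse(y_te, [a * v + b for v in x_te]))
    mean_r2 = sum(s[0] for s in scores) / k
    mean_rmse = sum(s[1] for s in scores) / k
    return mean_r2, mean_rmse


# Hypothetical raw PMD readings and reference PM2.5 values (ug/m3).
pmd = [12, 18, 25, 31, 40, 46, 55, 62, 70, 81, 15, 28, 37, 49, 66]
ref = [10, 14, 21, 24, 33, 35, 45, 48, 57, 63, 12, 22, 30, 39, 52]
mean_r2, mean_rmse = kfold_scores(pmd, ref, k=5)
print(f"mean R2 = {mean_r2:.3f}, mean RMSE = {mean_rmse:.2f} ug/m3")
```

Swapping `fit_line` for another regressor while keeping the same folds is what makes the scores of different models comparable.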

Predictive Modeling for the Growth of Listeria monocytogenes as a Function of Temperature, NaCl, and pH

  • PARK SHIN YOUNG;CHOI JIN-WON;YEON JIHYE;LEE MIN JEONG;CHUNG DUCK HWA;KIM MIN-GON;LEE KYU-HO;KIM KEUN-SUNG;LEE DONG-HA;BAHK GYUNG-JIN;BAE DONG-HO;KIM KWANG-YUP;KIM CHEOL-HO
    • Journal of Microbiology and Biotechnology
    • /
    • v.15 no.6
    • /
    • pp.1323-1329
    • /
    • 2005
  • A mathematical model was developed for predicting the growth kinetics of Listeria monocytogenes in tryptic soy broth (TSB) as a function of the combined effects of temperature, pH, and NaCl. The TSB containing four different concentrations of NaCl (2, 4, 5, and $10\%$) was initially adjusted to six different pH levels (pH 5, 6, 7, 8, 9, and 10) and incubated at 4, 10, 25, or 37$^{\circ}$C. For all experimental conditions, the primary growth curves were well fitted ($r^{2}$=0.982 to 0.998) to a Gompertz equation to obtain the lag time (LT) and specific growth rate (SGR). Response surface models were identified as appropriate secondary models for LT and SGR on the basis of the coefficient of determination ($r^{2}$=0.907 for LT, 0.964 for SGR), mean square error (MSE=3.389 for LT, 0.018 for SGR), bias factor ($B_{f}$=0.706 for LT, 0.836 for SGR), and accuracy factor ($A_{f}$=1.567 for LT, 1.213 for SGR). Therefore, the developed secondary model provided reliable predictions of the combined effect of temperature, NaCl, and pH on both LT and SGR for L. monocytogenes in TSB.
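
The bias and accuracy factors quoted in the abstract above are not defined there. The sketch below assumes the definitions commonly used in predictive microbiology, based on the mean log10 ratio of predicted to observed values. The function names and sample values are hypothetical.

```python
from math import log10


def bias_factor(predicted, observed):
    """B_f = 10 ** mean(log10(predicted / observed)); 1.0 means no systematic bias."""
    logs = [log10(p / o) for p, o in zip(predicted, observed)]
    return 10 ** (sum(logs) / len(logs))


def accuracy_factor(predicted, observed):
    """A_f = 10 ** mean(|log10(predicted / observed)|); 1.0 means perfect agreement."""
    logs = [abs(log10(p / o)) for p, o in zip(predicted, observed)]
    return 10 ** (sum(logs) / len(logs))


# Hypothetical lag times (h): one under-prediction, one exact, one over-prediction.
predicted = [10.0, 20.0, 40.0]
observed = [20.0, 20.0, 20.0]
print(bias_factor(predicted, observed), accuracy_factor(predicted, observed))
```

Under these definitions the over- and under-predictions cancel in the bias factor but not in the accuracy factor, which is why both are usually reported together.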

Bioequivalence Test of Triflusal Capsules (트리플루살 캅셀의 생물학적 동등성 평가)

  • 박정숙;이미경;박경미;김진기;임수정;최성희;민경아;김종국
    • Biomolecules & Therapeutics
    • /
    • v.9 no.4
    • /
    • pp.291-297
    • /
    • 2001
  • The bioequivalence of two triflusal products was evaluated in 20 healthy volunteers following a single oral dose, according to the guidelines of the Korea Food and Drug Administration (KFDA). Trisal$^{R}$ capsule (Whanin Pharm. Corp., Korea) and Disgren$^{R}$ capsule (Myung-In Pharm. Corp., Korea) were used as the test product and reference product, respectively. Both products contain 300 mg of triflusal. One capsule of the test product or the reference product was orally administered to the volunteers in a randomized two-period crossover study (2$\times$2 Latin square method). Blood samples were taken at predetermined time intervals for 4 hours, and triflusal was determined using semi-microbore HPLC equipped with an automated column switching system. The HPLC analytical method was validated according to the Bioanalytical Method Validation guideline of the FDA prior to analyzing the plasma samples. The pharmacokinetic parameters (AUC$_{0-4h}$, $C_{max}$, and $T_{max}$) were calculated, and ANOVA was used for statistical analysis of the parameters. In the assay validation, the limit of quantification of triflusal in human plasma by the current assay procedure was 50 ng/ml using 500 $\mu$l of plasma. The accuracy of the assay ranged from 97.76% to 116.51%, while the intra-day and inter-day coefficients of variation over the same concentration range were less than 15%. The average drug concentrations at the designated time intervals and the calculated pharmacokinetic parameters were not significantly different between the two products (p>0.05). The differences in mean AUC$_{0\rightarrow 4hr}$, $C_{max}$, and $T_{max}$ between the two products (2.92, 4.39, and -2.44%, respectively) were less than 20%. The power (1-$\beta$) and treatment difference ($\Delta$) for AUC$_{0\rightarrow 4hr}$ and $C_{max}$ were more than 0.8 and less than 0.2, respectively. Although the power for $T_{max}$ was under 0.8, the $T_{max}$ values of the two products were not significantly different from each other (p>0.05). These results satisfied the criteria of the KFDA guideline for bioequivalence, indicating that the two triflusal products were bioequivalent.
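
The pharmacokinetic parameters named in the abstract above can be computed from a concentration-time profile as in the sketch below. The abstract does not state how AUC was obtained or how the percent difference was defined, so two assumptions are made here: AUC uses the linear trapezoidal rule, and the difference is taken relative to the reference mean. The profile values and function names are hypothetical.

```python
def pk_parameters(times, concs):
    """Return (AUC from first to last sample, Cmax, Tmax) for one profile.

    AUC is computed with the linear trapezoidal rule (an assumption).
    """
    auc = sum(
        (t1 - t0) * (c0 + c1) / 2.0
        for t0, t1, c0, c1 in zip(times, times[1:], concs, concs[1:])
    )
    cmax = max(concs)
    tmax = times[concs.index(cmax)]  # time of the first occurrence of Cmax
    return auc, cmax, tmax


def percent_difference(test_mean, reference_mean):
    """Difference of the test mean relative to the reference mean, in percent."""
    return 100.0 * (test_mean - reference_mean) / reference_mean


# Hypothetical profile: sampling times (h) and plasma concentrations.
times = [0, 0.5, 1, 2, 4]
concs = [0, 4, 6, 3, 1]
print(pk_parameters(times, concs))      # (12.0, 6, 1)
print(percent_difference(105.0, 100.0))  # 5.0
```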

A Mathematical Model for Estimating Proper Taxi Fleet Size : Focusing on Pyeong-Taek City Case Study (택시총량산정을 위한 수리모형의 개발 : 평택시를 중심으로)

  • Kim, Suk Hee;Choi, Keechoo;Choi, Doo Sun
    • KSCE Journal of Civil and Environmental Engineering Research
    • /
    • v.31 no.5D
    • /
    • pp.633-639
    • /
    • 2011
  • To estimate a proper taxi fleet size, daily archived tachograph data were analyzed for both corporate taxis and owner-driver taxis. A mathematical model to estimate a desirable number of taxis was developed using the city characteristics of Pyeong-taek. Because the adjusted coefficient of determination (adjusted R-squared) of the city characteristics model was 0.970, this model could be used to estimate the total number of taxis in the future for the city of Pyeong-taek. The model produced a proper fleet size for Pyeong-taek of 1,794 taxis by 2014, which is 214 more than in 2009. A service rate model, which considers operating conditions, was also used to analyze the total number of taxis; it gave a total of 1,224 taxis by 2014, which is 356 fewer than in 2009. In conclusion, it is desirable to use both the city characteristics model and the service rate model to estimate the total number of taxis. Averaging the two models produced a total supply plan for Pyeong-taek of 1,509 taxis by 2014, which is 71 fewer than in 2009.

A Development for Sea Surface Salinity Algorithm Using GOCI in the East China Sea (GOCI를 이용한 동중국해 표층 염분 산출 알고리즘 개발)

  • Kim, Dae-Won;Kim, So-Hyun;Jo, Young-Heon
    • Korean Journal of Remote Sensing
    • /
    • v.37 no.5_2
    • /
    • pp.1307-1315
    • /
    • 2021
  • The Changjiang Diluted Water (CDW) spreads over the East China Sea every summer and significantly affects sea surface salinity in the seas around Jeju Island and the southern coast of the Korean Peninsula. Sometimes its effect extends to the eastern coast of the Korean Peninsula through the Korea Strait. Specifically, the CDW has a significant impact on marine physics and ecology and causes damage to fisheries and aquaculture. However, because field surveys are limited, continuous observation of the CDW in the East China Sea is practically difficult. Many studies have been conducted using satellite measurements to monitor the CDW distribution in near-real time. In this study, an algorithm for estimating Sea Surface Salinity (SSS) in the East China Sea was developed using the Geostationary Ocean Color Imager (GOCI). The Multilayer Perceptron Neural Network (MPNN) method was employed to develop the algorithm, and Soil Moisture Active Passive (SMAP) SSS data were selected as the output. In the previous study, an algorithm for estimating SSS using GOCI was trained on 2016 observation data. In this study, the training data period was extended to 2015 through 2020 to improve the algorithm's performance. Validation against the National Institute of Fisheries Science (NIFS) serial oceanographic observation data from 2011 to 2019 shows a coefficient of determination (R2) of 0.61 and a Root Mean Square Error (RMSE) of 1.08 psu. This study was carried out to develop an algorithm for monitoring the surface salinity of the East China Sea using GOCI and is expected to contribute to the development of an algorithm for estimating SSS using GOCI-II.

Development of a Model for Calculating the Construction Duration of Urban Residential Housing Based on Multiple Regression Analysis (다중 회귀분석 기반 도시형 생활주택의 공사기간 산정 모델 개발)

  • Kim, Jun-Sang;Kim, Young Suk
    • Land and Housing Review
    • /
    • v.12 no.4
    • /
    • pp.93-101
    • /
    • 2021
  • As the number of small households (1 to 2 persons per household) in Korea gradually increases, so does the importance of housing supply policies for small households. In response to the increase in small households, the government has been continuously supplying urban housing for these households. Since housing for small households is a sales and rental business similar to apartments and general business facilities, it is important for the building owner to calculate the project's estimated construction duration during the planning stage. A review of the literature found a model for estimating the construction duration of large-scale buildings but not of small-scale buildings such as urban housing for small households. Therefore, this study aimed to develop and verify a model for estimating the construction duration of urban housing at the planning stage based on multiple regression analysis. The independent variables input into the estimation model were building site area, building gross floor area, number of below-ground floors, number of above-ground floors, number of buildings, and location. The adjusted coefficient of determination ($R_a^2$) of the model was 0.547. The developed model resulted in a Root Mean Square Error (RMSE) of 171.26 days and a Mean Absolute Percentage Error (MAPE) of 26.53%. The developed estimation model is expected to provide reliable construction duration calculations for small-scale urban residential buildings during the planning stage of a project.
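
The adjusted coefficient of determination reported in the abstract above penalizes $R^2$ for the number of predictors. A minimal sketch of the standard formula follows. The sample size and unadjusted $R^2$ in the example are hypothetical, since the abstract gives neither. The predictor count of six assumes each listed variable enters as a single term.

```python
def adjusted_r2(r2, n_samples, n_predictors):
    """Adjusted R2 = 1 - (1 - R2) * (n - 1) / (n - p - 1)."""
    return 1.0 - (1.0 - r2) * (n_samples - 1) / (n_samples - n_predictors - 1)


# Hypothetical: unadjusted R2 of 0.60 from 50 projects and 6 predictors.
print(round(adjusted_r2(0.60, n_samples=50, n_predictors=6), 3))  # 0.544
```

With few samples and many predictors, the adjusted value drops well below the unadjusted one, which is why it is the usual figure of merit for multiple regression.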

A Study on the Field Application of Nays2D Model for Evaluation of Riverfront Facility Flood Risk (친수시설 홍수위험도 평가를 위한 Nays2D 모형의 현장 적용에 관한 연구)

  • Ku, Young Hun;Song, Chang Geun;Park, Yong-Sung;Kim, Young Do
    • KSCE Journal of Civil and Environmental Engineering Research
    • /
    • v.35 no.3
    • /
    • pp.579-588
    • /
    • 2015
  • Recent climate change has resulted in increases in rainfall intensity and flood frequency as well as in the risk of flood damage due to typhoons during the summer season. Water-friendly facilities such as ecological parks and sports facilities have been established on river floodplains since the river improvement project was implemented, and increases in river flood levels due to typhoons can lead to direct flood damage to such facilities. To analyze the hydraulic influence of these water-friendly facilities on floodplains or to evaluate their stability, numerical analysis should be performed in advance. In addition, it is crucial to address the drying and wetting processes generated by water level fluctuations. This study uses the Nays2D model, which analyzes drying and wetting, to examine its applicability to simple terrain in which such fluctuations occur and to natural rivers in which drying occurs. The results of applying this model to sites of actual typhoon events are compared with values measured at water level observatories. This comparison gives a coefficient of determination ($R^2$), mean absolute error (MAE), and root-mean-square error (RMSE) of 0.988, 0.208, and 0.239, respectively, showing a statistically high correlation. In addition, the results are used to calculate flood risk indices for evaluating such risk for water-friendly facilities constructed on floodplains.

Bioequivalence of Two Clarithromycin Tablets (클래리스로마애신 정제의 생물학적 동등성 평가)

  • 김종국;이사원;최하곤;고종호;이미경;김인숙
    • Biomolecules & Therapeutics
    • /
    • v.6 no.2
    • /
    • pp.219-224
    • /
    • 1998
  • The bioequivalence of two clarithromycin products was evaluated in 16 normal male volunteers (age 23-28 yr, body weight 57.5-75.5 kg) following a single oral dose. The test product was ReYon Clarithromycin tablets (ReYon Pharm. Corp., Korea) and the reference product was Klaricid$^{R}$ tablets (Abbott Korea). Both products contain 250 mg of clarithromycin. One tablet of the test or the reference product was administered to the volunteers in a randomized two-period crossover study (2$\times$2 Latin square method). Clarithromycin was determined using a modified agar well diffusion bioassay. In the assay validation, quantification of clarithromycin in human serum by this technique was possible down to 0.03 $\mu$g/ml using 100 $\mu$l of serum. The coefficient of variation (C.V.) was less than 10%. The average drug concentrations at each sampling time and the calculated pharmacokinetic parameters were not significantly different between the two products (p>0.05): the area under the curve to the last sampling time (24 hr) (AUC$_{0-24hr}$) (8.10$\pm$1.26 vs 8.22$\pm$1.62 $\mu$g·hr/ml), AUC from time zero to infinity (AUC$_{0-\infty}$) (8.61$\pm$1.28 vs 8.84$\pm$1.71 $\mu$g·hr/ml), maximum plasma concentration ($C_{max}$) (0.87$\pm$0.22 vs 0.88$\pm$0.19 $\mu$g/ml), and time to maximum plasma concentration ($T_{max}$) (2.69$\pm$0.48 vs 2.56$\pm$0.51 hr). The differences in mean AUC$_{0-24hr}$, $C_{max}$, and $T_{max}$ between the two products (1.44, 1.39, and 4.65%, respectively) were less than 20%. The power (1-$\beta$) and treatment difference ($\Delta$) for AUC$_{0-24hr}$ and $C_{max}$ were more than 0.8 and less than 0.2, respectively. Although the power for $T_{max}$ was under 0.8, the $T_{max}$ values of the two products were not significantly different from each other (p>0.05). These results suggest that the bioavailability of ReYon Clarithromycin tablets is not significantly different from that of Klaricid$^{R}$ tablets. Therefore, the two products are bioequivalent based on the current results.

Development of Naïve-Bayes classification and multiple linear regression model to predict agricultural reservoir storage rate based on weather forecast data (기상예보자료 기반의 농업용저수지 저수율 전망을 위한 나이브 베이즈 분류 및 다중선형 회귀모형 개발)

  • Kim, Jin Uk;Jung, Chung Gil;Lee, Ji Wan;Kim, Seong Joon
    • Journal of Korea Water Resources Association
    • /
    • v.51 no.10
    • /
    • pp.839-852
    • /
    • 2018
  • The purpose of this study is to predict monthly agricultural reservoir storage by developing a weather data-based Multiple Linear Regression Model (MLRM) with precipitation, maximum temperature, minimum temperature, average temperature, and average wind speed as predictors. Using Naïve-Bayes classification, a total of 1,559 reservoirs nationwide were classified into 30 clusters based on geomorphological specifications (effective storage volume, irrigation area, watershed area, latitude, longitude, and frequency of drought). For each cluster, the monthly MLRM was derived using 13 years (2002~2014) of meteorological data from the KMA (Korea Meteorological Administration) and reservoir storage rate data from the KRC (Korea Rural Community). The MLRM for reservoir storage rate showed a coefficient of determination ($R^2$) of 0.76, a Nash-Sutcliffe efficiency (NSE) of 0.73, and a root mean square error (RMSE) of 8.33%. The MLRM was evaluated for 2 years (2015~2016) using 3-month weather forecast data from GloSea5 (GS5) by the KMA. The Reservoir Drought Index (RDI), which is expressed in terms of the present and normal-year reservoir storage rates, showed an average ROC (Receiver Operating Characteristics) hit rate of 0.80 using observed data and 0.73 using GS5 data in the MLRM. Using the results of this study, future reservoir storage rates can be predicted and used as decision-making data for a stable future agricultural water supply.
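
The abstract above reports both $R^2$ and NSE. Assuming that $R^2$ there is the squared Pearson correlation between observed and simulated values (the abstract does not say), the two can differ because NSE also penalizes systematic bias. The sketch below illustrates this with made-up storage rates. The function names are hypothetical.

```python
def nse(observed, simulated):
    """Nash-Sutcliffe efficiency: 1 - SS_res / SS_tot about the observed mean."""
    mean_obs = sum(observed) / len(observed)
    ss_res = sum((o - s) ** 2 for o, s in zip(observed, simulated))
    ss_tot = sum((o - mean_obs) ** 2 for o in observed)
    return 1.0 - ss_res / ss_tot


def pearson_r2(observed, simulated):
    """Squared Pearson correlation between observed and simulated values."""
    n = len(observed)
    mo, ms = sum(observed) / n, sum(simulated) / n
    sxy = sum((o - mo) * (s - ms) for o, s in zip(observed, simulated))
    sxx = sum((o - mo) ** 2 for o in observed)
    syy = sum((s - ms) ** 2 for s in simulated)
    return sxy * sxy / (sxx * syy)


# Hypothetical storage rates (%): the simulation is 5 points too high everywhere.
observed = [60.0, 70.0, 80.0, 90.0]
simulated = [65.0, 75.0, 85.0, 95.0]
print(pearson_r2(observed, simulated))  # perfectly correlated
print(nse(observed, simulated))         # lower, because of the constant bias
```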

Measurement of Surface Color and Fermentation Degree in Tea Products Using NIRS (근적외선 분광광도계를 이용한 차제품의 표면 색상 및 발효정도 측정)

  • Chun, Jong-Un
    • KOREAN JOURNAL OF CROP SCIENCE
    • /
    • v.54 no.1
    • /
    • pp.55-60
    • /
    • 2009
  • This study was conducted to measure tea surface colors using the visible bands ($400{\sim}700$ nm) with near-infrared spectroscopy (NIRS). The surface colors of 117 tea products were measured with a colorimeter. The $a^*/b^*$ (CIE color scale) or a/b (Hunter color scale) ratios in different tea products accounted for about 99.7% of the variation in fermentation degree (FD), indicating that the $a^*/b^*$ (a/b) ratio is a very useful trait for assessing fermentation degree. Tea powders were also scanned in the visible bands with NIRS. Calibration equations for surface colors and fermentation degree were developed using modified partial least-squares (MPLS) regression with internal cross-validation. The equations had low SECV (standard error of cross-validation) and high $R^2$ (coefficient of determination in calibration) values of $0.779{\sim}0.999$, indicating that the whole band ($400{\sim}2500\;nm$) with NIRS could be used to measure traits related to surface color, fermentation degree, and other chemical components in tea products rapidly, simultaneously, and with high precision and ease.