• Title/Summary/Keyword: Data Optimization

A Recidivism Prediction Model Based on XGBoost Considering Asymmetric Error Costs (비대칭 오류 비용을 고려한 XGBoost 기반 재범 예측 모델)

  • Won, Ha-Ram;Shim, Jae-Seung;Ahn, Hyunchul
    • Journal of Intelligence and Information Systems
    • /
    • v.25 no.1
    • /
    • pp.127-137
    • /
    • 2019
  • Recidivism prediction has been studied continuously since the early 1970s, and it has grown in importance as crimes committed by recidivists steadily increase. Research became especially active in the 1990s, after the US and Canada adopted the 'Recidivism Risk Assessment Report' as a decisive criterion in trials and parole screening; in the same period, empirical studies on 'Recidivism Factors' also began in Korea. Most recidivism prediction studies have so far focused on the factors of recidivism or on prediction accuracy. However, because recidivism prediction has an asymmetric error cost structure, it is also important to minimize the misclassification cost. In general, the cost of misclassifying a person who will not reoffend as a recidivist is lower than the cost of misclassifying a person who will reoffend as a non-recidivist: the former only adds monitoring costs, while the latter increases social and economic costs. In this paper, we therefore propose an XGBoost (eXtreme Gradient Boosting; XGB) based recidivism prediction model that considers asymmetric error costs. In the first step, XGB, which is recognized as a high-performance ensemble method in data mining, is applied, and its results are compared with those of other prediction models: LOGIT (logistic regression analysis), DT (decision trees), ANN (artificial neural networks), and SVM (support vector machines). In the next step, the classification threshold is optimized to minimize the total misclassification cost, defined as the weighted average of FNE (False Negative Error) and FPE (False Positive Error). To verify its usefulness, the model was applied to a real recidivism prediction dataset. The results confirmed that the XGB model not only showed better prediction accuracy than the other models but also reduced the misclassification cost most effectively.
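
The threshold step described in this abstract can be illustrated with a minimal sketch. It assumes predicted probabilities are already available from some classifier, and the weights `w_fn` and `w_fp`, the threshold grid, and the function names are illustrative assumptions, not values or code from the paper.

```python
import numpy as np


def total_cost(y_true, scores, threshold, w_fn=0.8, w_fp=0.2):
    """Weighted average of the false negative and false positive error rates.

    The weights are hypothetical; the abstract does not state the values used.
    """
    y_true = np.asarray(y_true).astype(bool)
    pred = np.asarray(scores, dtype=float) >= threshold
    # FNE: share of actual recidivists predicted as non-recidivists.
    fne = np.mean(~pred[y_true]) if y_true.any() else 0.0
    # FPE: share of actual non-recidivists predicted as recidivists.
    fpe = np.mean(pred[~y_true]) if (~y_true).any() else 0.0
    return w_fn * fne + w_fp * fpe


def best_threshold(y_true, scores, w_fn=0.8, w_fp=0.2):
    """Grid-search the threshold that minimizes the total misclassification cost."""
    grid = np.linspace(0.01, 0.99, 99)  # assumed search grid
    costs = [total_cost(y_true, scores, t, w_fn, w_fp) for t in grid]
    i = int(np.argmin(costs))
    return float(grid[i]), float(costs[i])
```

With a heavier weight on false negatives, the selected threshold tends to move below 0.5, so more people are flagged for monitoring in exchange for fewer missed recidivists.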

Kriging of Daily PM10 Concentration from the Air Korea Stations Nationwide and the Accuracy Assessment (베리오그램 최적화 기반의 정규크리깅을 이용한 전국 에어코리아 PM10 자료의 일평균 격자지도화 및 내삽정확도 검증)

  • Jeong, Yemin;Cho, Subin;Youn, Youjeong;Kim, Seoyeon;Kim, Geunah;Kang, Jonggu;Lee, Dalgeun;Chung, Euk;Lee, Yangwon
    • Korean Journal of Remote Sensing
    • /
    • v.37 no.3
    • /
    • pp.379-394
    • /
    • 2021
  • Air pollution data in South Korea have been provided in real time by Air Korea stations since 2005. Previous studies have shown the feasibility of gridding air pollution data, but they were confined to a few cities. This paper examines the creation of nationwide gridded maps of PM10 concentration from 333 Air Korea stations using variogram optimization and ordinary kriging. The accuracy of the spatial interpolation was evaluated under various sampling schemes to avoid an overly dense or overly sparse distribution of validation points. Using 114,745 matchups, a four-round blind test was conducted by extracting random validation points for each of the 365 days in 2019. The overall accuracy was consistently high, with an MAE of 5.697 μg/m³ and a CC of 0.947. Approximately 1,500 high-concentration PM10 cases also yielded an MAE of about 12 μg/m³ and a CC over 0.87, which suggests that the proposed method is effective and applicable to various situations. The gridded maps of daily PM10 concentration at a resolution of 0.05° also showed a reasonable spatial distribution and can be used as an input variable for gridded prediction of the next day's PM10 concentration.
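
As a rough illustration of ordinary kriging, the sketch below solves the kriging system for a single target location. The spherical variogram model and its fixed `nugget`, `sill`, and `rng` parameters are assumptions made for the example; the abstract does not say which variogram model was fitted, and the variogram optimization step is not reproduced here.

```python
import numpy as np


def spherical_variogram(h, nugget, sill, rng):
    """Spherical semivariogram; gamma(0) = 0 and gamma(h) = sill for h >= rng."""
    h = np.asarray(h, dtype=float)
    g = nugget + (sill - nugget) * (1.5 * h / rng - 0.5 * (h / rng) ** 3)
    g = np.where(h >= rng, sill, g)
    return np.where(h == 0, 0.0, g)


def ordinary_kriging(xy, z, xy0, nugget=0.0, sill=1.0, rng=1.0):
    """Predict the value at xy0 from station coordinates xy and observations z."""
    xy = np.asarray(xy, dtype=float)
    z = np.asarray(z, dtype=float)
    xy0 = np.asarray(xy0, dtype=float)
    n = len(z)
    # Pairwise station distances and the (n+1) x (n+1) kriging matrix.
    d = np.linalg.norm(xy[:, None, :] - xy[None, :, :], axis=-1)
    A = np.ones((n + 1, n + 1))
    A[:n, :n] = spherical_variogram(d, nugget, sill, rng)
    A[n, n] = 0.0  # Lagrange multiplier row/column enforces sum(weights) = 1
    b = np.ones(n + 1)
    b[:n] = spherical_variogram(np.linalg.norm(xy - xy0, axis=1), nugget, sill, rng)
    weights = np.linalg.solve(A, b)[:n]
    return float(weights @ z), weights
```

Calling `ordinary_kriging` once per grid cell would produce a gridded map; with a zero nugget the predictor reproduces the observed value at each station location.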

Applications of Fuzzy Theory on The Location Decision of Logistics Facilities (퍼지이론을 이용한 물류단지 입지 및 규모결정에 관한 연구)

  • 이승재;정창무;이헌주
    • Journal of Korean Society of Transportation
    • /
    • v.18 no.1
    • /
    • pp.75-85
    • /
    • 2000
  • In existing optimization models, crisp data have been used in the objective function and constraints to derive the optimal solution. In addition, the subjective environment has been excluded, because complex and uncertain circumstances were treated as probabilistic ambiguity. In other words, the optimal solutions of the existing models were regarded as fully satisfying the objective function when industrial engineering methods were applied to minimize the risk of decision-making. As a result, decision-makers facing location problems could not respond appropriately to variations in demand and other variables, and could not offer a wide range of choices because of insufficient information. This study therefore develops a model for the location and size decision problem of logistics facilities using fuzzy theory, with the aim of making the most reasonable decision from a subjective point of view under ambiguous circumstances, building on existing decision-making problems that optimize an objective function under strictly given constraints. First, a general mixed integer programming (MIP) model that decides location and size simultaneously was established based on the results of existing studies; then a fuzzy mixed integer programming (FMIP) model was developed using fuzzy theory. The general linear programming software LINDO 6.01 was used to simulate and evaluate the developed model with examples and to judge the appropriateness and adaptability of the FMIP model in the real world.

Direct Reconstruction of Displaced Subdivision Mesh from Unorganized 3D Points (연결정보가 없는 3차원 점으로부터 차이분할메쉬 직접 복원)

  • Jung, Won-Ki;Kim, Chang-Heon
    • Journal of KIISE:Computer Systems and Theory
    • /
    • v.29 no.6
    • /
    • pp.307-317
    • /
    • 2002
  • In this paper, we propose a new mesh reconstruction scheme that produces a displaced subdivision surface directly from unorganized points. The displaced subdivision surface is a new mesh representation that defines a detailed mesh with a displacement map over a smooth domain surface. However, the original displaced subdivision surface algorithm requires an explicit polygonal mesh, since it is not a mesh reconstruction algorithm but a mesh conversion (remeshing) algorithm. The main idea of our approach is to sample surface detail from unorganized points without any topological information. To do this, we predict a virtual triangular face from the unorganized points for each sampling ray cast from a parametric domain surface. Direct reconstruction of a displaced subdivision surface from unorganized points is valuable because its output has several important properties: it is a compact mesh representation, since most vertices can be represented by a single scalar value; its underlying structure is piecewise regular, so it can easily be transformed into a multiresolution mesh; and smoothness after mesh deformation is automatically preserved. We avoid time-consuming global energy optimization by employing input-data-dependent mesh smoothing, so a good-quality displaced subdivision surface can be obtained quickly.
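
The compactness property mentioned in this abstract (one scalar per vertex) can be sketched as follows. The example assumes the usual displaced subdivision formulation, in which each detailed vertex is the domain surface point offset along the domain surface normal by a scalar displacement; the function name and inputs are hypothetical, and the paper's sampling and virtual-face prediction steps are not shown.

```python
import numpy as np


def displace(domain_points, domain_normals, displacements):
    """Offset each domain surface point along its unit normal by a scalar.

    domain_points:  (n, 3) points on the smooth domain surface
    domain_normals: (n, 3) normals at those points (normalized here)
    displacements:  (n,) signed scalar displacement per point
    """
    s = np.asarray(domain_points, dtype=float)
    n = np.asarray(domain_normals, dtype=float)
    d = np.asarray(displacements, dtype=float)
    n = n / np.linalg.norm(n, axis=1, keepdims=True)
    return s + d[:, None] * n
```

Only the scalar array needs to be stored for the detail, because the points and normals can be recomputed from the domain surface.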

A comparative analysis of gas and liquid phase standard spiked solid sorbent tubes for the determination of volatile organic compounds in indoor air by TD-GC/MS (열탈착/저온농축-GC/MS에 의한 실내공기 중 휘발성 유기화합물 정량용 기체상 및 액체상 표준물질 첨가한 고체 흡착관의 비교 분석)

  • Lim, Hyun-Woo;Jung, Sung-Won;Kang, Chul-Ho;Park, Jin-Sook;Park, Byeong Moo;Choi, Yong-Wook
    • Analytical Science and Technology
    • /
    • v.26 no.4
    • /
    • pp.287-297
    • /
    • 2013
  • This study optimized an analytical method for the thermal desorption of seven VOCs (volatile organic compounds) by TD-GC/MS (thermal desorption-gas chromatograph-mass spectrometer) with solid sorbent tubes, and compared VOC determinations based on standard plots from sorbent tubes prepared with gas phase and with liquid phase standard materials. A paired t-test showed that the liquid phase standard sorbent tube method agreed with the gas phase standard sorbent tube method for six VOCs (benzene, toluene, ethylbenzene, and o-, m-, and p-xylene), but not for styrene, at the significance level $\alpha = 0.01$: the 15.6% difference in response factor between gas phase and liquid phase standard plotting for styrene showed that the two methods were significantly different at that level. Therefore, the liquid phase standard plotting method was employed to reduce erroneous data in the determination of styrene together with BTEX. Under the optimized analytical method using liquid phase standard sorbent tubes, recovery was within $100 \pm 5\%$ for the seven VOCs, reproducibility ranged from 0.3 to 7.7%, and the method detection limit (MDL) ranged from $0.01\ \mu g/m^3$ for o-xylene to $0.27\ \mu g/m^3$ for toluene. The optimized standard method was applied to determine VOCs in the indoor air of a dormitory, a one-bedroom apartment, and a new car.
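
For readers unfamiliar with the paired t-test used to compare the two calibration approaches, here is a minimal sketch of the test statistic. The function name is hypothetical and no data from the paper are used; the resulting statistic would be compared against the two-sided critical value of the t distribution for the chosen significance level.

```python
import math
from statistics import mean, stdev


def paired_t(a, b):
    """Return the paired t statistic and degrees of freedom for samples a and b.

    a and b are measurements of the same items by two methods, in the same order.
    Raises ZeroDivisionError if all paired differences are identical.
    """
    if len(a) != len(b) or len(a) < 2:
        raise ValueError("need two equally sized samples with at least 2 pairs")
    diffs = [x - y for x, y in zip(a, b)]
    # Standard error of the mean difference uses the sample standard deviation.
    se = stdev(diffs) / math.sqrt(len(diffs))
    return mean(diffs) / se, len(diffs) - 1
```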

A Study on Prospective Plan Comparison using DVH-index in Tomotherapy Planning (토모 테라피 치료 시 선량 체적 히스토그램 표지자를 이용한 치료계획 비교에 관한 연구)

  • Kim, Joo-Ho;Cho, Jeong-Hee;Lee, Sang-Kyoo;Jeon, Byeong-Chul;Yoon, Jong-Won;Kim, Dong-Wook
    • The Journal of Korean Society for Radiation Therapy
    • /
    • v.19 no.2
    • /
    • pp.113-122
    • /
    • 2007
  • Purpose: We propose a method that uses dose-volume histogram (DVH) indexes to compare prospective plan trials in tomotherapy planning optimization. Materials and Methods: For three patients, with targets in the cranial, thoracic, and abdominal regions, we acquired computed tomography images with a PQ 5000 in each case. We then delineated the target structures and normal organ contours with Pinnacle Ver 7.6c. After transferring the data to the tomotherapy planning system (Hi-Art system Ver 2.0), we optimized three plan trials for each case using different beam widths, pitches, and importance values. We analyzed the three plan trials in each region using the isodose distribution, dose-volume histogram, and dose statistics. We also evaluated the three plan trials with specialized DVH indexes: the dose homogeneity index in the target organ, the conformity index around the target structure, and the dose gradient index in non-target structures. Results: We compared the best plan trial selected using the isodose distribution, dose-volume histogram, and dose statistics with the one selected using the DVH indexes. Both approaches selected the same plan trial as the best in each case. Conclusion: In some cases, the DVH-index comparison of tomotherapy plan trials gave slightly different results depending on the specific goal of the plan. However, because the DVH indexes represent both the dose distribution in the target structure and the high-dose risk to normal tissue, they offer a reasonable method for comparing multiple plan trials before tomotherapy treatment.

Optimization of Pre-treatment of Tropical Crop Oil by Sulfuric Acid and Bio-diesel Production (황산을 이용한 열대작물 오일의 전처리 반응 최적화 및 바이오디젤 생산)

  • Kim, Deog-Keun;Choi, Jong-Doo;Park, Ji-Yeon;Lee, Jin-Suk;Park, Seung-Bin;Park, Soon-Chul
    • Korean Chemical Engineering Research
    • /
    • v.47 no.6
    • /
    • pp.762-767
    • /
    • 2009
  • In this study, the feasibility of using vegetable oil extracted from tropical crop seed as a biodiesel feedstock was investigated by producing biodiesel and analyzing its quality parameters as a transport fuel. To produce biodiesel efficiently, a two-step reaction process (pre-treatment and transesterification) was required because the tropical crop oil has a high content of free fatty acids. To determine a suitable acid catalyst for the pre-esterification, three acid catalysts were tested, and sulfuric acid was identified as the best. After constructing an experimental matrix based on RSM (response surface methodology) and analyzing the statistical data, the optimal pre-treatment conditions were determined to be 26.7% methanol and 0.982% sulfuric acid. Transesterification experiments on the pre-esterified oil, also based on RSM, identified 1.24% KOH catalyst and 22.76% methanol as the optimal transesterification conditions. However, this quantity of KOH was higher than the KOH concentration previously established by our team, so we carried out a supplemental experiment to determine the quantities of catalyst and methanol. As a result, the optimal transesterification conditions were determined to be 0.8% KOH and 16.13% methanol. After transesterification of the tropical crop oil, the produced biodiesel met the major quality standard specifications: 100.8% FAME, an acid value of 0.45 mgKOH/g, 0.00% water, 0.04% total glycerol, and a kinematic viscosity of $4.041\ mm^2/s$ (at $40^{\circ}C$).

Optimization for Solid Culture of Phellinus sp. by Response Surface Methodology (반응표면방법에 의한 Phellinus sp. 고체배양의 최적화)

  • Kang, Tae-Su;Kang, An-Seok;Sohn, Hyung-Rac;Kang, Mi-Sun;Lim, Yaung-Iee;Lee, Shin-Young;Jung, Sung-Mo
    • The Korean Journal of Mycology
    • /
    • v.26 no.2 s.85
    • /
    • pp.265-274
    • /
    • 1998
  • This study was carried out to obtain basic data for the artificial cultivation of Phellinus sp. The conditions for mycelial growth of an isolated Phellinus sp. on different sawdust substrates (Quercus aliena, Morus alba and Alnus japonica) were optimized by response surface methodology. The ratio of rice bran added to sawdust and the suitable moisture content for mycelial growth in all sawdust media were about 30% (w/w) and $65{\sim}70\%$ (w/v), respectively. The initial pH for mycelial growth on Quercus aliena and Morus alba was in the range of $pH\;5{\sim}6$, whereas for Alnus japonica it was pH 6. The optimum temperature for mycelial growth was about $25{\sim}30^{\circ}C$, depending on the kind of wood substrate. From the response surface analysis, the values of the independent variables at the stationary point for Quercus aliena were determined to be 31.01% (w/w) rice bran, pH 5.31, and 69.03% (w/v) moisture content, and the expected mycelial growth was about 8.32 cm. Both the ratio of rice bran added to sawdust $(X_1)$ and the moisture content $(X_3)$ affected mycelial growth. For Morus alba, the rice bran ratio, initial pH, and moisture content at the stationary point were 28.77% (w/w), 5.28, and 69.8% (w/v), respectively, and the expected mycelial growth was 7.60 cm. The stationary point for mycelial growth on Alnus japonica sawdust media was 28.74% (w/w) rice bran, pH 6.04, and 66.96% (w/v) moisture content, and the expected mycelial growth was about 5.38 cm. Based on these results, mycelial growth was correlated with the independent variables, and the effects of rice bran $(X_1)$ and initial pH $(X_2)$ on mycelial growth were greater than that of moisture content $(X_3)$. The sawdust media ranked in the order Quercus aliena > Morus alba > Alnus japonica for mycelial growth of Phellinus sp.
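
The stationary points reported in this abstract come from a fitted response surface. The sketch below shows one common way to obtain such a point, assuming a full second-order polynomial model fitted by least squares; the function names are hypothetical and the abstract does not describe the authors' actual design or software.

```python
from itertools import combinations

import numpy as np


def quadratic_design(X):
    """Design matrix: intercept, linear, squared, then pairwise interaction terms."""
    X = np.asarray(X, dtype=float)
    n, k = X.shape
    cols = [np.ones(n)]
    cols += [X[:, i] for i in range(k)]
    cols += [X[:, i] ** 2 for i in range(k)]
    cols += [X[:, i] * X[:, j] for i, j in combinations(range(k), 2)]
    return np.column_stack(cols)


def stationary_point(X, y):
    """Fit y = b0 + x'b + x'Bx and return (x_s, predicted response at x_s)."""
    X = np.asarray(X, dtype=float)
    k = X.shape[1]
    beta, *_ = np.linalg.lstsq(quadratic_design(X), np.asarray(y, dtype=float), rcond=None)
    b = beta[1:k + 1]
    B = np.diag(beta[k + 1:2 * k + 1])
    # Interaction coefficients are split evenly across the symmetric matrix B.
    for idx, (i, j) in enumerate(combinations(range(k), 2)):
        B[i, j] = B[j, i] = beta[2 * k + 1 + idx] / 2.0
    x_s = -0.5 * np.linalg.solve(B, b)
    y_s = beta[0] + 0.5 * b @ x_s
    return x_s, float(y_s)
```

Whether the stationary point is a maximum, a minimum, or a saddle depends on the eigenvalues of `B`; all negative eigenvalues indicate a maximum.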

A Study on the Production Structure and Biomass Productivity of Quercus variabilis Natural Forest (굴참나무천연림(天然林)의 생산구조(生産構造) 및 물질생산력(物質生産力)에 관(關)한 연구(硏究))

  • Kim, Si Kyung;Jeong, Jwa Yong
    • Journal of Korean Society of Forest Science
    • /
    • v.70 no.1
    • /
    • pp.91-102
    • /
    • 1985
  • Growth and biomass production of natural stands of Quercus variabilis in relation to tree density were studied to obtain basic guidelines for future tending operations. Two natural stands of Quercus variabilis in Sancheong, Kyongnam Province, located at 900 m (Stand A: 6,600 trees/ha, $15.84m^2/ha$, $\frac{19}{17-20}$) and 800 m (Stand B: 4,300 trees/ha, $16.65m^2/ha$, $\frac{20}{17-21}$) elevation, were selected for the comparative study, and the following results were obtained through a sample plot method. After the diameters of individual trees in the sample plots were measured, twelve average trees from each diameter class were felled to measure the dry weights $W_S$, $W_B$, $W_L$, and $W_{Ba}$, and standing biomass and biomass production rates were estimated by allometric regressions on $D^2H$. The vertical distribution of leaves along the stems indicated that photosynthesis took place from 2.2 m above the ground in Stand A and from 1.2 m in Stand B. Maximum photosynthesis was located 4.2 m and 6.2 m above the ground in Stand A and Stand B, respectively. Leaf area index was 4.25 ha/ha for Stand A and 3.89 ha/ha for Stand B. Above-ground standing biomass was 49.51 ton/ha for Stand A and 59.20 ton/ha for Stand B, and net annual production was 6.75 ton/ha/yr for Stand A and 8.99 ton/ha/yr for Stand B. The ratio of net annual production to standing biomass was 17.5% for Stand A and 16.7% for Stand B. Net assimilation rate was 2.75 kg/kg/yr for Stand A and 3.58 kg/kg/yr for Stand B. Stem wood production rate was 1.46 kg/kg/yr for Stand A and 2.09 kg/kg/yr for Stand B. Bark production rate was 0.60 kg/kg/yr for Stand A and 0.34 kg/kg/yr for Stand B. These data indicate that Stand B utilized growing space and site more efficiently than Stand A. It is concluded that the productivity of natural stands of Quercus variabilis can be enhanced through optimization of basal area and number of trees per hectare, and that sound management of natural oak stands should be based on systematic sampling of the area for periodic productivity estimation.
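
The allometric regressions on $D^2H$ mentioned in this abstract can be sketched as a log-log fit. The power-law form $W = a(D^2H)^b$ is assumed here as the usual allometric model; the abstract does not give the fitted equations, and the function names are hypothetical.

```python
import numpy as np


def fit_allometry(d2h, weight):
    """Fit W = a * (D^2 H)^b by linear regression of log W on log(D^2 H)."""
    slope, intercept = np.polyfit(np.log(d2h), np.log(weight), 1)
    return float(np.exp(intercept)), float(slope)


def predict_weight(d2h, a, b):
    """Predict component dry weight for trees with the given D^2 H values."""
    return a * np.asarray(d2h, dtype=float) ** b
```

Fitting one such equation per component (stem, branch, leaf, bark) on the felled sample trees, then summing the predictions over all measured trees in a plot, gives a stand-level biomass estimate.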

Optimization of Mycelial Growth of Entomogenous fungi of the Genus Cordyceps (동충하초속균의 균사생장최적화)

  • Hong, In-Pyo;Nam, Sung-Hee;Jung, I-Yeon;Sung, Gyoo-Byung;Nam, Hack-Woo;Kang, Seok-Woo;Hur, Hyeon;Lee, Min-Woong;Guo, Shun-Xing
    • Journal of Mushroom
    • /
    • v.2 no.3
    • /
    • pp.149-156
    • /
    • 2004
  • This study was carried out to obtain basic data on the physiological characteristics needed for artificial cultivation of the fruiting bodies of Cordyceps. Specimens of Cordyceps longissima, C. militaris and C. pruinosa were collected at Mt. Halla on Cheju island in July 2003. Among four culture media commonly used for mushroom culture, MCM medium was selected as the most favorable for the Cordyceps species tested. The initial pH of the solid medium for mycelial growth of Cordyceps was good in the range of pH 5.0~7.0, below pH 8.0. The mycelial growth of C. longissima was most favorable on culture media supplemented with glucose, a monosaccharide. For C. militaris, nine of the 11 carbon sources tested were more favorable to mycelial growth than the control. For C. longissima, six of the nine nitrogen sources tested were more favorable to mycelial growth than the control; growth was most favorable on culture media containing potassium nitrate, followed by ammonium citrate and sodium nitrate, after 4 weeks of incubation.
