• Title/Summary/Keyword: data-based model

Search Result 21,096, Processing Time 0.054 seconds

Cox proportional hazard model with L1 penalty

  • Hwang, Chang-Ha;Shim, Joo-Yong
    • Journal of the Korean Data and Information Science Society
    • /
    • v.22 no.3
    • /
    • pp.613-618
    • /
    • 2011
  • The proposed method is based on a penalized log partial likelihood of Cox proportional hazard model with L1-penalty. We use the iteratively reweighted least squares procedure to solve L1 penalized log partial likelihood function of Cox proportional hazard model. It provide the ecient computation including variable selection and leads to the generalized cross validation function for the model selection. Experimental results are then presented to indicate the performance of the proposed procedure.

Organizing Lidar Data Based on Octree Structure

  • Wang, Miao;Tseng, Yi-Hsing
    • Proceedings of the KSRS Conference
    • /
    • 2003.11a
    • /
    • pp.150-152
    • /
    • 2003
  • Laser scanned lidar data record 3D surface information in detail. Exploring valuable spatial information from lidar data is a prerequisite task for its applications, such as DEM generation and 3D building model reconstruction. However, the inherent spatial information is implicit in the abundant, densely and randomly distributed point cloud. This paper proposes a novel method to organize point cloud data, so that further analysis or feature extraction can proceed based on a well organized data model. The principle of the proposed algorithm is to segment point cloud into 3D planes. A split and merge segmentation based on the octree structure is developed for the implementation. Some practical airborne and ground lidar data are tested for demonstration and discussion. We expect this data organization could provide a stepping stone for extracting spatial information from lidar data.

  • PDF

Development of Optimal Control System for Air Separation Unit

  • Ji, Dae-Hyun;Lee, Sang-Moon;Kim, Sang-Un;Kim, Sun-Jang;Won, Sang-Chul
    • 제어로봇시스템학회:학술대회논문집
    • /
    • 2004.08a
    • /
    • pp.524-529
    • /
    • 2004
  • In this paper, We described the method which developed the optimal control system for air separation unit to change production rates frequently and rapidly. Control models of the process were developed from actual plant data using subspace identification method which is developed by many researchers in resent years. The model consist of a series connection of linear dynamic block and static nonlinear block (Wiener model). The model is controlled by model based predictive controller. In MPC the input is calculated by on-line optimization of a performance index based on predictions by the model, subject to possible constraints. To calculate the optimal the performance index, conditions are expressed by LMI(Linear Matrix Inequalities).In order to access at the Bailey DCS system, we applied the OPC server and developed the Client program. The OPC sever is a device which can access Bailey DCS system.The Client program is developed based on the Matlab language for easy calculation,data simulation and data logging. Using this program, we can apply the optimal input to the DCS system at real time.

  • PDF

Predictive of Osteoporosis by Tree-based Machine Learning Model in Post-menopause Woman (폐경 여성에서 트리기반 머신러닝 모델로부터 골다공증 예측)

  • Lee, In-Ja;Lee, Junho
    • Journal of radiological science and technology
    • /
    • v.43 no.6
    • /
    • pp.495-502
    • /
    • 2020
  • In this study, the prevalence of osteoporosis was predicted based on 10 independent variables such as age, weight, and alcohol consumption and 4 tree-based machine-learning models, and the performance of each model was compared. Also the model with the highest performance was used to check the performance by clearing the independent variable, and Area Under Curve(ACU) was utilized to evaluate the performance of the model. The ACU for each model was Decision tree 0.663, Random forest 0.704, GBM 0.702, and XGBoost 0.710 and the importance of the variable was shown in the order of age, weight, and family history. As a result of using XGBoost, the highest performance model and clearing independent variables, the ACU shows the best performance of 0.750 with 7 independent variables. This data suggests that this method be applied to predict osteoporosis, but also other various diseases. In addition, it is expected to be used as basic data for big data research in the health care field.

Density Adaptive Grid-based k-Nearest Neighbor Regression Model for Large Dataset (대용량 자료에 대한 밀도 적응 격자 기반의 k-NN 회귀 모형)

  • Liu, Yiqi;Uk, Jung
    • Journal of Korean Society for Quality Management
    • /
    • v.49 no.2
    • /
    • pp.201-211
    • /
    • 2021
  • Purpose: This paper proposes a density adaptive grid algorithm for the k-NN regression model to reduce the computation time for large datasets without significant prediction accuracy loss. Methods: The proposed method utilizes the concept of the grid with centroid to reduce the number of reference data points so that the required computation time is much reduced. Since the grid generation process in this paper is based on quantiles of original variables, the proposed method can fully reflect the density information of the original reference data set. Results: Using five real-life datasets, the proposed k-NN regression model is compared with the original k-NN regression model. The results show that the proposed density adaptive grid-based k-NN regression model is superior to the original k-NN regression in terms of data reduction ratio and time efficiency ratio, and provides a similar prediction error if the appropriate number of grids is selected. Conclusion: The proposed density adaptive grid algorithm for the k-NN regression model is a simple and effective model which can help avoid a large loss of prediction accuracy with faster execution speed and fewer memory requirements during the testing phase.

The Application of the GWLF model for Rural Small Watershed (농촌 소유역에 대한 GWLF 모형의 적용성 검토)

  • Hwang, Sye-Woon;Jang, Tae-Il;Park, Seung-Woo;Kang, Moon-Seong
    • Journal of The Korean Society of Agricultural Engineers
    • /
    • v.48 no.3
    • /
    • pp.23-34
    • /
    • 2006
  • This study reviews the applicability of the GWLF (Generalized Watershed Loading Function) model, which is based on a loading function that requires only a relatively small amount of data, in a small agricultural watershed. The hydrological data was collected from 1996 to 2004 for a study area based on the HP#6 upper stream reservoir small watershed area. This data was then used to calibrate and verify the model. A simulation based on the model yielded $R^2$ values of $0.47\sim0.89$. This is considered to have high applicability when compared to the simulation and the observed results, which yielded relatively high values of $R^2$ for SS (Suspended Solid), TN (Total Nitrogen), and TP (Total Phosphorus) of 0.58, 0.47 and 0.62, respectively. This study provides a useful approach fur researchers selecting appropriate models to use the insufficient measuring data for rural watersheds.

Store-Release based Distributed Hydrologic Model with GIS (GIS를 이용한 기저-유출 바탕의 수문모델)

  • Kang, Kwang-Min;Yoon, Se-Eui
    • Proceedings of the Korea Water Resources Association Conference
    • /
    • 2012.05a
    • /
    • pp.35-35
    • /
    • 2012
  • Most grid-based distributed hydrologic models are complex in terms of data requirements, parameter estimation and computational demand. To address these issues, a simple grid-based hydrologic model is developed in a geographic information system (GIS) environment using storage-release concept. The model is named GIS Storage Release Model (GIS-StoRM). The storage-release concept uses the travel time within each cell to compute howmuch water is stored or released to the watershed outlet at each time step. The travel time within each cell is computed by combining the kinematic wave equation with Manning's equation. The input to GIS-StoRM includes geospatial datasets such as radar rainfall data (NEXRAD), land use and digital elevation model (DEM). The structural framework for GIS-StoRM is developed by exploiting geographic features in GIS as hydrologic modeling objects, which store and process geospatial and temporal information for hydrologic modeling. Hydrologic modeling objects developed in this study handle time series, raster and vector data within GIS to: (i) exchange input-output between modeling objects, (ii) extract parameters from GIS data; and (iii) simulate hydrologic processes. Conceptual and structural framework of GIS StoRM including its application to Pleasant Creek watershed in Indiana will be presented.

  • PDF

Probability-Based Estimates of Basic Design Wind Speeds In Korea (확률에 기초한 한국의 기본 설계풍속 주정)

  • 조효남;백현식;차철준
    • Proceedings of the Computational Structural Engineering Institute Conference
    • /
    • 1988.10a
    • /
    • pp.7-12
    • /
    • 1988
  • This study presents rational methods for probability-based estimates of basic design wind speeds in Korea and develops a risk-bases nation-wide map of design wind speeds. The paper examines the fitting of the Type-I extreme model to maximum yearly non-typhoon wind data from long-term records based on the conventional method and to maximum monthly nod-typhoon wind data from short-term records following Grigorin's approach. The paper also reviews the applicability of the method using short records of about 5 years. The basic design wind speeds for typhoon and non-typhoon wind at a station are made to be obtained from a mixed model which is given as a product of typhoon and non-typhoon extreme wind distributions. A practical method which is based on the fitting of the Type I model to records or typhoon and non-typhoon mixed wind data at a station is also preposed in this study.

  • PDF

A Design of Intelligent Patient Monitoring System using Model Base (모델 베이스를 이용한 지능적 환자 감시 시스템의 설계)

  • Kim, Jung-Ook;Lee, Seok-Pil;Chi, Sung-Do;Park, Sang-Hui
    • Proceedings of the KOSOMBE Conference
    • /
    • v.1995 no.05
    • /
    • pp.155-159
    • /
    • 1995
  • A design method that can easily construct intelligent patient monitoring systems is proposed. To achieve the design method, the SES/MB concept and a discrete event-based logic control formalism based on a set theory is introduced. In this control paradigm the controller expects to receive confirming sensor responses to its control commands within definite time windows determined by DEVS model of the system under control. Because data to be used for rule-based symbolic reasoning are to be abstracted, several AI methods are applied the processes. These methods are applied to intelligent patient monitoring systems so that they facilitate transformation from low level raw data to high level linguistic data. Model-based system representations have advantages of reusability, extensibility, flexsibility, independent testability and encapsulation.

  • PDF

Development of a Personalized Music Recommendation System Using MBTI Personality Types and KNN Algorithm

  • Chun-Ok Jang
    • International Journal of Advanced Culture Technology
    • /
    • v.12 no.3
    • /
    • pp.427-433
    • /
    • 2024
  • This study aims to develop a personalized music digital therapeutic based on MBTI personality types and apply it to depression treatment. In the data collection stage, participants' MBTI personality types and music preferences were surveyed to build a database, which was then preprocessed as input data for the KNN model. The KNN model calculates the distance between personality types using Euclidean distance and recommends music suitable for the user's MBTI type based on the nearest K neighbors' data. The developed system was tested with new participants, and the system and algorithm were improved based on user feedback. In the final validation stage, the system's effectiveness in alleviating depression was evaluated. The results showed that the MBTI personality type-based music recommendation system provides a personalized music therapy experience, positively impacting emotional stability and stress reduction. This study suggests the potential of nonpharmacological treatments and demonstrates that a personalized treatment experience can offer more effective and safer methods for treating depression.