• Title/Summary/Keyword: data-based model

Search Result 21,105, Processing Time 0.05 seconds

Predicting Reports of Theft in Businesses via Machine Learning

  • JungIn, Seo;JeongHyeon, Chang
    • International Journal of Advanced Culture Technology
    • /
    • v.10 no.4
    • /
    • pp.499-510
    • /
    • 2022
  • This study examines the reporting factors of crime against business in Korea and proposes a corresponding predictive model using machine learning. While many previous studies focused on the individual factors of theft victims, there is a lack of evidence on the reporting factors of crime against a business that serves the public good as opposed to those that protect private property. Therefore, we proposed a crime prevention model for the willingness factor of theft reporting in businesses. This study used data collected through the 2015 Commercial Crime Damage Survey conducted by the Korea Institute for Criminal Policy. It analyzed data from 834 businesses that had experienced theft during a 2016 crime investigation. The data showed a problem with unbalanced classes. To solve this problem, we jointly applied the Synthetic Minority Over Sampling Technique and the Tomek link techniques to the training data. Two prediction models were implemented. One was a statistical model using logistic regression and elastic net. The other involved a support vector machine model, tree-based machine learning models (e.g., random forest, extreme gradient boosting), and a stacking model. As a result, the features of theft price, invasion, and remedy, which are known to have significant effects on reporting theft offences, can be predicted as determinants of such offences in companies. Finally, we verified and compared the proposed predictive models using several popular metrics. Based on our evaluation of the importance of the features used in each model, we suggest a more accurate criterion for predicting var.

Design and Implementation of Transfer Buffer Sharing Technique for Efficient Massive Data Transfer

  • Cho, Dae-Soo
    • Journal of information and communication convergence engineering
    • /
    • v.6 no.3
    • /
    • pp.327-330
    • /
    • 2008
  • It is required that a server which communicates with various client simultaneously should have an efficient data transfer model. In Windows$^{(R)}$ environment, the server was generally developed based on IOCP model. Developing the IOCP model, the server generally has one data transfer buffer per client. If the server divides a larger data than the transfer buffer into several fragments, there used to be a problem in sending it to a client, because there is a conflict in a data transfer buffer. That is, CPU requests one data-fragment transfer, then it will request the next data-fragment transfer successively before completing the previous request, owing to the property of overlapped IO model. In this paper, we proposed the transfer buffer sharing technique to solve the conflicting problem. The experimental result shows that the performance of data transfer was enhanced by 39% maximally.

Machine Learning Based Architecture and Urban Data Analysis - Construction of Floating Population Model Using Deep Learning - (머신러닝을 통한 건축 도시 데이터 분석의 기초적 연구 - 딥러닝을 이용한 유동인구 모델 구축 -)

  • Shin, Dong-Youn
    • Journal of KIBIM
    • /
    • v.9 no.1
    • /
    • pp.22-31
    • /
    • 2019
  • In this paper, we construct a prototype model for city data prediction by using time series data of floating population, and use machine learning to analyze urban data of complex structure. A correlation prediction model was constructed using three of the 10 data (total flow population, male flow population, and Monday flow population), and the result was compared with the actual data. The results of the accuracy were evaluated. The results of this study show that the predicted model of the floating population predicts the correlation between the predicted floating population and the current state of commerce. It is expected that it will help efficient and objective design in the planning stages of architecture, landscape, and urban areas such as tree environment design and layout of trails. Also, it is expected that the dynamic population prediction using multivariate time series data and collected location data will be able to perform integrated simulation with time series data of various fields.

A Study on the Data-based WBS Model for Train Control System to Improve a Maintenance work (열차제어시스템 유지관리 업무 개선을 위한 데이터 기반 WBS 모델 연구)

  • Jeon, Jo Won;Kim, Young Min;Park, Bum
    • Journal of the Korean Society of Systems Engineering
    • /
    • v.18 no.1
    • /
    • pp.99-104
    • /
    • 2022
  • In this paper, to increase the maintenance efficiency of the urban railway train control system and to build a standard data system, we collect as much as possible structured, unstructured, and semi-structured data, and collect data by sensing and monitoring the system status and system status and monitoring. pre-process function data(Identification, purification, integration, transformation) through effective data classification and maintenance activities business classification system was studied. The purpose of this is to define the data matrix model by considering the relationship with the data generated and managed in the O&M stage of the train control system operated by the urban railway together with the WBS model, and to reflect and utilize it in practice.

Demonstration of the Effectiveness of Monte Carlo-Based Data Sets with the Simplified Approach for Shielding Design of a Laboratory with the Therapeutic Level Proton Beam

  • Lai, Bo-Lun;Chang, Szu-Li;Sheu, Rong-Jiun
    • Journal of Radiation Protection and Research
    • /
    • v.47 no.1
    • /
    • pp.50-57
    • /
    • 2022
  • Background: There are several proton therapy facilities in operation or planned in Taiwan, and these facilities are anticipated to not only treat cancer but also provide beam services to the industry or academia. The simplified approach based on the Monte Carlo-based data sets (source terms and attenuation lengths) with the point-source line-of-sight approximation is friendly in the design stage of the proton therapy facilities because it is intuitive and easy to use. The purpose of this study is to expand the Monte Carlo-based data sets to allow the simplified approach to cover the application of proton beams more widely. Materials and Methods: In this work, the MCNP6 Monte Carlo code was used in three simulations to achieve the purpose, including the neutron yield calculation, Monte Carlo-based data sets generation, and dose assessment in simple cases to demonstrate the effectiveness of the generated data sets. Results and Discussion: The consistent comparison of the simplified approach and Monte Carlo simulation results show the effectiveness and advantage of applying the data set to a quick shielding design and conservative dose assessment for proton therapy facilities. Conclusion: This study has expanded the existing Monte Carlo-based data set to allow the simplified approach method to be used for dose assessment or shielding design for beam services in proton therapy facilities. It should be noted that the default model of the MCNP6 is no longer the Bertini model but the CEM (cascade-exciton model), therefore, the results of the simplified approach will be more conservative when it was used to do the double confirmation of the final shielding design.

Effect of Measuring Period on Predicting the Annual Heating Energy Consumption for Building (연간 건물난방 에너지사용량의 예측에 미치는 측정기간의 영향)

  • 조성환;태춘섭;김진호;방기영
    • Korean Journal of Air-Conditioning and Refrigeration Engineering
    • /
    • v.15 no.4
    • /
    • pp.287-293
    • /
    • 2003
  • This study examined the temperature-dependent regression model of energy consumption based on various measuring period. The methodology employed was to construct temperature-dependent linear regression model of daily energy consumption from one day to three months data-sets and to compare the annual heating energy consumption predicted by these models with actual annual heating energy consumption. Heating energy consumption from a building in Daejon was examined experimentally. From the results, predicted value based on one day experimental data can have error over 100%. But predicted value based on one week experimental data showed error over 30%. And predicted value based on over three months experimental data provides accurate prediction within 6% but it will be required very expensive.

Web-based GIS for Real Time Hydrologic Topographical Data Extraction for the Geum River Watershed in Korea (Web기반 GIS를 이용한 금강유역의 실시간 수문지형인자 추출)

  • Nam, Won-Ho;Choi, Jin-Yong;Jang, Min-Won;Engel, B.A.
    • Journal of The Korean Society of Agricultural Engineers
    • /
    • v.49 no.5
    • /
    • pp.81-90
    • /
    • 2007
  • Watershed topographical information is required in hydrologic analysis, supporting efficient hydrologic model operation and managing water resources. Watershed topographical data extraction systems based on desktop GIS are abundant these days placing burdens for spatial data processing on users. This paper describes development of a Web-based Geographic Information Systems that can delineate the Geum River sub-basins and extract watershed topographical data in real time. Through this system, users can obtain a watershed boundary by selecting outlet location and then extracting topographical data including watershed area, boundary length, average altitude, slope distribution about the elevation range with Web browsers. Moreover, the system provides watershed hydrological data including land use, soil types, soil drainage conditions, and NRCS(Natural Resources Conservation Service) curve number for hydrologic model operation through grid overlay technique. The system operability was evaluated with the hydrological data of WAMIS(Water Management Information System) with the government operation Web site as reference data.

A Service Model Development Plan for Countering Denial of Service Attacks based on Artificial Intelligence Technology (인공지능 기술기반의 서비스거부공격 대응 위한 서비스 모델 개발 방안)

  • Kim, Dong-Maeong;Jo, In-June
    • The Journal of the Korea Contents Association
    • /
    • v.21 no.2
    • /
    • pp.587-593
    • /
    • 2021
  • In this thesis, we will break away from the classic DDoS response system for large-scale denial-of-service attacks that develop day by day, and effectively endure intelligent denial-of-service attacks by utilizing artificial intelligence-based technology, one of the core technologies of the 4th revolution. A possible service model development plan was proposed. That is, a method to detect denial of service attacks and minimize damage through machine learning artificial intelligence learning targeting a large amount of data collected from multiple security devices and web servers was proposed. In particular, the development of a model for using artificial intelligence technology is to detect a Western service attack by focusing on the fact that when a service denial attack occurs while repeating a certain traffic change and transmitting data in a stable flow, a different pattern of data flow is shown. Artificial intelligence technology was used. When a denial of service attack occurs, a deviation between the probability-based actual traffic and the predicted value occurs, so it is possible to respond by judging as aggressiveness data. In this paper, a service denial attack detection model was explained by analyzing data based on logs generated from security equipment or servers.

Machine Learning-based hydrogen charging station energy demand prediction model (머신러닝 기반 수소 충전소 에너지 수요 예측 모델)

  • MinWoo Hwang;Yerim Ha;Sanguk Park
    • Journal of Internet Computing and Services
    • /
    • v.24 no.2
    • /
    • pp.47-56
    • /
    • 2023
  • Hydrogen energy is an eco-friendly energy that produces heat and electricity with high energy efficiency and does not emit harmful substances such as greenhouse gases and fine dust. In particular, smart hydrogen energy is an economical, sustainable, and safe future smart hydrogen energy service, which means a service that stably operates based on 'data' by digitally integrating hydrogen energy infrastructure. In this paper, in order to implement a data-based hydrogen charging station demand forecasting model, three hydrogen charging stations (Chuncheon, Sokcho, Pyeongchang) installed in Gangwon-do were selected, supply and demand data of hydrogen charging stations were secured, and 7 machine learning and deep learning algorithms were used. was selected to learn a model with a total of 27 types of input data (weather data + demand for hydrogen charging stations), and the model was evaluated with root mean square error (RMSE). Through this, this paper proposes a machine learning-based hydrogen charging station energy demand prediction model for optimal hydrogen energy supply and demand.

Efficiency Analysis of Specialists by Medical Specialty using Activity-Based Costing Data: Using the DEA-CCR model and SBM model (활동기준 원가 자료를 활용한 과별 전문의의 효율성 분석 : DEA-CCR 모형과 SBM 모형을 이용)

  • Do Won Kim;Tae Hyun Kim
    • Korea Journal of Hospital Management
    • /
    • v.28 no.2
    • /
    • pp.44-65
    • /
    • 2023
  • Purposes: As super-aging population and low fertility rates are threatening the sustainability of the National Health Insurance funds, enhancing the efficiency of hospital management is paramount. In the past, studies analyzing the efficiencies of hospitals primarily made inter-hospital comparisons, but it is important to assess hospitals' internal efficiency and develop improvement measures in order to attain practical improvements in hospital efficiencies. The purpose of this study is to analyze the efficiencies of specialists by medical specialty in a hospital in order to provide foundational data for efficient hospital management. Methodology/Approach: We used the activity-based costing (ABC) data and hospital statistical data from one tertiary hospital in Seoul to analyze the efficiency of specialists by medical specialty. Efficiency was analyzed and compared among specialists using the data envelopment analysis developed by Charnes, Cooper, and Rhodes (DEA-CCR) model and the slacks-based measure (SBM) models. The input variables were labor cost, material cost, and operational expenses, and the output variables were the number of outpatients, number of inpatients, outpatient revenue, and inpatient revenue. Findings: First, there was a marked deviation in efficiency across specialists. Second, there was a marked deviation in efficiency across medical specialties. Third, there was little difference in efficiency according to the specialist's sex, age, and job position. Fourth, the SBM model produced more conservative results and better explained efficiency parameters than the CCR model. Practical Implications: The efficiency of a specialist was more influenced by their medical specialty than their personal characteristics, namely sex, age, and job position. Therefore, Further research is needed to analyze the efficiencies of each subspecialty and identify factors that contribute to the variations in efficiencies across medical specialties, such as clinical practices and fee structures.

  • PDF