• Title/Summary/Keyword: location information

Algorithm Development for Extract O/D of Air Passenger via Mobile Telecommunication Bigdata (모바일 통신 빅데이터 기반 항공교통이용자 O/D 추출 알고리즘 연구)

  • Bumchul Cho;Kihun Kwon
    • The Journal of Bigdata / v.8 no.2 / pp.1-13 / 2023
  • Current analysis of air passengers relies mainly on statistical methods, which are limited in analyzing detailed aspects such as travel routes, the number of passengers by region, and airport access times. However, with the advancement of big data technology and the revision of the three data acts, big data-based transportation analysis has become more active. Mobile communication data, which can precisely track the location of mobile phone terminals, can serve as valuable data for transportation analysis. In this paper, we propose an air passenger Origin/Destination (O/D) extraction algorithm based on mobile communication data that overcomes the limitations of existing methods for analyzing air transportation users. The algorithm sets an airport signal detection zone at each airport and extracts air passengers based on their base station connection history within these zones. By analyzing the base station connection data along each passenger's origin-destination path, we estimate the entire travel route. For this paper, we extracted O/D information for both domestic and international air passengers at all domestic airports from January 2019 to December 2020. To compensate for errors caused by the market shares of mobile communication service providers, we applied an adjustment that corrects the travel volume to the level of the nationwide population. Furthermore, to verify the extracted data, a correlation analysis was performed between the O/D data for air traffic users derived from mobile communication data and aviation statistics data. The results show a difference in the total amount (4.1 for domestic and 4.6 for international), but the correlation is high at 0.99, so the extracted data are judged to be useful. The proposed algorithm enables a comprehensive and detailed analysis of air transportation users' travel behavior and regional/age group ratios, and it can be utilized in various fields such as formulating airport-related policies and conducting regional market analysis.
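The market-share adjustment mentioned in this abstract can be illustrated with a minimal sketch. The abstract does not give the actual correction formula, so the simple inverse-share scaling below is an assumption, and the function name, region names, and counts are hypothetical.

```python
def expand_to_population(observed_trips, market_share):
    """Scale trip counts seen by one carrier up to a population-level estimate.

    Assumption (not from the paper): the carrier's subscribers travel like the
    general population, so dividing by the market share is a fair correction.
    """
    if not 0 < market_share <= 1:
        raise ValueError("market_share must be in (0, 1]")
    return {od_pair: count / market_share for od_pair, count in observed_trips.items()}


# Hypothetical O/D counts observed through a carrier with a 25% market share.
observed = {("Seoul", "Jeju"): 300, ("Busan", "Jeju"): 120}
expanded = expand_to_population(observed, market_share=0.25)
```

A real correction would likely use shares that vary by region or age group; a single nationwide share is used here only to keep the idea visible.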

Evaluation of the Contribution of Inflow by Water Sources into Hydropower Dam in the Han River basin through Water Balance Analysis (물수지 분석을 통한 한강유역 발전용댐 유입량의 수원별 기여도 평가)

  • Choi, Sijung;Kang, Seongkyu;Noh, Huiseong;Ahn, Jeonghwan
    • KSCE Journal of Civil and Environmental Engineering Research / v.44 no.5 / pp.649-661 / 2024
  • Recently, the need for water supply from hydropower dams has been increasing due to drought. To supply water from a hydropower dam, the inflow into the dam must be evaluated first. Some of the hydropower dams in the Han River basin are located downstream of multipurpose dams, so it is important to analyze each hydropower dam's own inflow. The purpose of this study is to evaluate the contribution of each water source to the inflow of the hydropower dams located in the Han River basin. Water use-related data provided by various domestic institutions were investigated and collected, and a location-based water supply and demand network was constructed. Unlike the existing domestic water balance analysis method, the simulation took transmission losses into account. The applicability of the analysis method was confirmed by a goodness-of-fit evaluation (NSE 0.95~0.99 and correlation coefficient 0.98~0.99) comparing the simulated flow with the observed flow at the representative point. Based on the water intake method of the facilities and the release method for the remaining multipurpose dam water, a water balance analysis was performed for four assumed cases, and the contribution of each water source to the inflow of each hydropower dam was evaluated and presented. The research results are expected to provide useful information for evaluating the water supply capacity of hydropower dams in the future.
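The NSE quoted in this abstract is the Nash-Sutcliffe efficiency, a standard goodness-of-fit score for simulated versus observed flow. The sketch below uses its usual definition; the function name and the sample flow values are illustrative and are not taken from the study.

```python
def nash_sutcliffe_efficiency(observed, simulated):
    """Return NSE = 1 - sum((obs - sim)^2) / sum((obs - mean(obs))^2)."""
    if len(observed) != len(simulated) or not observed:
        raise ValueError("series must be non-empty and of equal length")
    mean_obs = sum(observed) / len(observed)
    residual = sum((o - s) ** 2 for o, s in zip(observed, simulated))
    variance = sum((o - mean_obs) ** 2 for o in observed)
    if variance == 0:
        raise ValueError("observed series has zero variance; NSE is undefined")
    return 1.0 - residual / variance


# Illustrative flows only: a perfect simulation and a constant-mean simulation.
flows = [1.0, 2.0, 3.0, 4.0]
nse_perfect = nash_sutcliffe_efficiency(flows, flows)
nse_mean = nash_sutcliffe_efficiency(flows, [2.5, 2.5, 2.5, 2.5])
```

An NSE of 1 means the simulation matches the observations exactly, and 0 means it does no better than predicting the observed mean, so the reported 0.95~0.99 indicates a close fit.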

Comparison of Association Rule Learning and Subgroup Discovery for Mining Traffic Accident Data (교통사고 데이터의 마이닝을 위한 연관규칙 학습기법과 서브그룹 발견기법의 비교)

  • Kim, Jeongmin;Ryu, Kwang Ryel
    • Journal of Intelligence and Information Systems / v.21 no.4 / pp.1-16 / 2015
  • Traffic accidents have been one of the major causes of death worldwide for the last several decades. According to statistics from the World Health Organization, approximately 1.24 million deaths occurred on the world's roads in 2010. To reduce future traffic accidents, multipronged approaches have been adopted, including traffic regulations, injury-reducing technologies, and driver training programs. Records on traffic accidents are generated and maintained for this purpose. To make these records meaningful and effective, it is necessary to analyze the relationships between traffic accidents and related factors such as vehicle design, road design, weather, and driver behavior. Insight derived from such analysis can be used in accident prevention. Traffic accident data mining is the activity of finding useful knowledge about these relationships that is not well known and that users may be interested in. Many studies on mining accident data have been reported over the past two decades. Most of them focused on predicting the risk of accidents from accident-related factors, using supervised learning methods such as decision trees, logistic regression, k-nearest neighbors, and neural networks. However, the prediction models derived by these algorithms are too complex for humans to understand, because the main purpose of these algorithms is prediction, not explanation of the data. Some studies use unsupervised clustering algorithms to divide the data into several groups, but the derived groups are still not easy for humans to understand, so additional analysis is necessary. Rule-based learning methods are adequate when we want to derive a comprehensible form of knowledge about the target domain. They derive a set of if-then rules that represent the relationship between the target feature and other features. 
Rules are fairly easy for humans to understand, so they can help provide insight and comprehensible results. Association rule learning methods and subgroup discovery methods are representative rule-based learning methods for descriptive tasks. These two kinds of algorithms have been used in a wide range of areas, including transaction analysis, accident data analysis, detection of statistically significant patient risk groups, and discovery of key persons in social communities. We use both the association rule learning method and the subgroup discovery method to discover useful patterns in a traffic accident dataset consisting of many features, including the driver's profile, the location of the accident, the type of accident, vehicle information, and violations of regulations. The association rule learning method, which is an unsupervised learning method, searches for frequent item sets in the data and translates them into rules. In contrast, the subgroup discovery method is a supervised learning method that discovers rules for user-specified concepts that satisfy a certain degree of generality and unusualness. Depending on which aspect of the data we focus on, we may combine multiple relevant features of interest into a synthetic target feature and give it to the rule learning algorithms. After a set of rules is derived, postprocessing steps are taken to make the rule set more compact and easier to understand by removing uninteresting or redundant rules. We conducted a set of experiments mining our traffic accident data in both unsupervised and supervised modes to compare these rule-based learning algorithms. The experiments reveal that association rule learning, in its pure unsupervised mode, can discover some hidden relationships among the features. 
Under the supervised learning setting with a combinatorial target feature, however, the subgroup discovery method finds good rules much more easily than the association rule learning method, which requires considerable effort to tune its parameters.
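The two rule families compared in this abstract are usually scored with different measures. The sketch below shows support and confidence, the standard measures for association rules, alongside weighted relative accuracy (WRAcc), a common quality measure in subgroup discovery that combines generality and unusualness. The abstract does not say which measures the authors used, so this choice is an assumption, and the records and feature names are made up for illustration.

```python
def support(records, items):
    """Fraction of records that contain every item in `items`."""
    items = set(items)
    return sum(items <= record for record in records) / len(records)


def confidence(records, antecedent, consequent):
    """Estimate P(consequent | antecedent) for the rule antecedent -> consequent."""
    base = support(records, antecedent)
    if base == 0:
        raise ValueError("antecedent never occurs in the records")
    return support(records, set(antecedent) | set(consequent)) / base


def wracc(records, condition, target):
    """Weighted relative accuracy: coverage * (P(target | cond) - P(target))."""
    coverage = support(records, condition)
    if coverage == 0:
        return 0.0
    return coverage * (confidence(records, condition, target) - support(records, target))


# Hypothetical accident records, each a set of feature values.
records = [
    {"rain", "night", "severe"},
    {"rain", "severe"},
    {"rain"},
    {"night"},
]
rule_support = support(records, {"rain", "severe"})
rule_confidence = confidence(records, {"rain"}, {"severe"})
rule_wracc = wracc(records, {"rain"}, {"severe"})
```

Association rule learning thresholds support and confidence over all item sets, whereas subgroup discovery ranks conditions for one chosen target; a positive WRAcc means the target is more frequent inside the subgroup than in the data overall.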

Variation of Hospital Costs and Product Heterogeneity

  • Shin, Young-Soo
    • Journal of Preventive Medicine and Public Health / v.11 no.1 / pp.123-127 / 1978
  • The major objective of this research is to identify those hospital characteristics that best explain cost variation among hospitals and to formulate linear models that can predict hospital costs. Specific emphasis is placed on hospital output, that is, the identification of diagnosis related patient groups (DRGs) which are medically meaningful and demonstrate similar patterns of hospital resource consumption. A casemix index is developed based on the DRGs identified. Considering the common problems encountered in previous hospital cost research, the following study requirements are established for fulfilling the objectives of this research: 1. Selection of hospitals that exercise similar medical and fiscal practices. 2. Identification of an appropriate data collection mechanism in which demographic and medical characteristics of individual patients as well as accurate and comparable cost information can be derived. 3. Development of a patient classification system in which all the patients treated in hospitals are able to be split into mutually exclusive categories with consistent and stable patterns of resource consumption. 4. Development of a cost finding mechanism through which patient groups' costs can be made comparable across hospitals. A data set of Medicare patients prepared by the Social Security Administration was selected for the study analysis. The data set contained 27,229 record abstracts of Medicare patients discharged from all but one short-term general hospital in Connecticut during the period from January 1, 1971, to December 31, 1972. Each record abstract contained demographic and diagnostic information, as well as charges for specific medical services received. The 'AUTOGRP System' was used to generate 198 DRGs in which the entire range of Medicare patients was split into mutually exclusive categories, each of which shows a consistent and stable pattern of resource consumption. 
The 'Departmental Method' was used to generate cost information for the groups of Medicare patients that would be comparable across hospitals. To fulfill the study objectives, an extensive analysis was conducted in the following areas: 1. Analysis of DRGs: in which the level of resource use of each DRG was determined, the length of stay or death rate of each DRG in relation to resource use was characterized, and underlying patterns of the relationships among DRG costs were explained. 2. Exploration of resource use profiles of hospitals: in which the magnitude of differences in the resource uses or death rates incurred in the treatment of Medicare patients among the study hospitals was explored. 3. Casemix analysis: in which four types of casemix-related indices were generated, and the significance of these indices in the explanation of hospital costs was examined. 4. Formulation of linear models to predict hospital costs of Medicare patients: in which nine independent variables (i.e., casemix index, hospital size, complexity of service, teaching activity, location, casemix-adjusted death rate index, occupancy rate, and casemix-adjusted length of stay index) were used as determining factors in hospital costs. Results from the study analysis indicated that: 1. The system of 198 DRGs for Medicare patient classification was demonstrated to be a strong tool not only for determining the pattern of hospital resource utilization of Medicare patients, but also for categorizing patients by their severity of illness. 2. The weighted mean total case cost (TOTC) of the study hospitals for Medicare patients during the study years was $1,127.02 with a standard deviation of $117.20. The hospital with the highest average TOTC ($1538.15) was 2.08 times more expensive than the hospital with the lowest average TOTC ($743.45). The weighted mean per diem total cost (DTOC) of the study hospitals for Medicare patients during the study years was $107.98 with a standard deviation of $15.18. 
The hospital with the highest average DTOC ($147.23) was 1.87 times more expensive than the hospital with the lowest average DTOC ($78.49). 3. The linear models for each of the six types of hospital costs were formulated using the casemix index and the eight other hospital variables as the determinants. These models explained variance to the extent of 68.7 percent of total case cost (TOTC), 63.5 percent of room and board cost (RMC), 66.2 percent of total ancillary service cost (TANC), 66.3 percent of per diem total cost (DTOC), 56.9 percent of per diem room and board cost (DRMC), and 65.5 percent of per diem ancillary service cost (DTANC). The casemix index alone explained approximately one half of interhospital cost variation: 59.1 percent for TOTC and 44.3 percent for DTOC. These results demonstrate that the casemix index is the most important determinant of interhospital cost variation. Future research and policy implications of the results of this study are envisioned in the following three areas: 1. Utilization of casemix related indices in the Medicare data systems. 2. Refinement of data for hospital cost evaluation. 3. Development of a system for reimbursement and cost control in hospitals.

Development of GIS based Water Quality Simulation System for Han River and Kyeonggi Bay Area (한강과 경기만 지역 GIS 기반 통합수질모의 시스템 개발)

  • Lee, Chol-Young;Kim, Kye-Hyun
    • Journal of Korea Spatial Information System Society / v.10 no.4 / pp.77-88 / 2008
  • There has been growing demand to manage the water quality of the west coastal region due to large-scale urbanization along the coastal zone, the possible application of TMDL (Total Maximum Daily Loadings) to the Han river, and disasters such as the oil spill incident in Taean, Chungnam. However, no system has been developed for such purposes. Against this background, demand has increased for GIS-based effective water quality management to monitor the water quality environment and propose the best management alternatives for the Han river and Kyeonggi bay. This study mainly focused on the development of an integrated water quality management system for the Han river basin and its estuary area connected to Kyeonggi bay, to support integrated water quality management and planning. Integration was based on GIS, by spatially linking water quality attributes with location information. A GIS DB was built to estimate the amount of generated and discharged water pollutants according to the TMDL technical guide, and it included the input data for two different water quality models (WASP7 for the Han river and EFDC for the coastal area) used to forecast water quality and to suggest BMPs (Best Management Practices). The BOD, TN, and TP results from WASP7 were used as the input to run EFDC. Based on the study results, some critical areas with relatively higher pollutant loadings were identified, and it was also found that the locations discharging pollutant loadings to the river and seasonal factors affected water quality. In addition, the relationship between the water quality of the river and that of its estuary area was quantitatively verified. The results showed that the GIS-based integrated system could be used as a tool for estimating the status quo of water quality and proposing economically effective BMPs to mitigate water pollution. Further studies are needed to improve the system's capabilities, for example by adding a decision-making function and cost-benefit analysis. 
Also, a concrete methodology for water quality management using the system needs to be developed.

Spatio-temporal Analysis of Population Distribution in Seoul via Integrating Transportation and Land Use Information, Based on Four-Dimensional Visualization Methods (교통과 토지이용 정보를 결합한 서울 인구분포의 시공간적 분석: 4차원 시각화 방법을 토대로)

  • Lee, Keumsook;Kim, Ho Sung
    • Journal of the Economic Geographical Society of Korea / v.21 no.1 / pp.20-33 / 2018
  • Population distribution in urban space varies with transportation flow, which changes with the time of day. Transportation flow is directly affected by the activities of urbanites and the distribution of related facilities, since the flow is the result of people moving to the points where the facilities associated with their activities are located. It is thus necessary to analyze the spatio-temporal characteristics of the urban population distribution by integrating the distribution of activity spaces related to the daily life of urbanites with the flow of transportation. The purpose of this study is to analyze the population distribution in urban space on daily and weekly time bases using the building database and T-card database of the city of Seoul, which are rich in information on land use and transportation flow. For a time-based analysis that is difficult to grasp with general statistical techniques, a four-dimensional visualization method combining time and space is devised using a Java program. Dynamic visualization in four-dimensional space and time allows intuitive analysis and makes it possible to understand the spatio-temporal characteristics of population distribution more effectively. For this purpose, buildings are classified by purpose into three activity groups (residential, working, and commercial), and the number of passengers traveling to and from each stop of the bus and subway networks in the T-card database for one week is calculated in one-minute increments. By visualizing these and integrating transportation and land use, we analyze the spatio-temporal characteristics of the population distribution in Seoul. As a result, it is found that the population distribution of Seoul displays distinct spatio-temporal characteristics according to land use. In particular, there is a clear difference in the population distribution pattern along the time axis according to the mix of working, commercial, and residential activities. 
The results of this study can be very useful for transportation and location planning of city facilities.

Multi-Component Relaxation Study of Human Brain Using Relaxographic Analysis (Relaxographic 분석법을 이용한 뇌의 다중 자기이완특성에 관한 연구)

  • Yongmin Chang;Bong Soo Han;Bong Seok Kang;Kyungnyeo Jeon;Kyungsoo Bae;Yong-Sun Kim;Duk-Sik Kang
    • Investigative Magnetic Resonance Imaging / v.6 no.2 / pp.120-128 / 2002
  • Purpose : To demonstrate that the relaxographic method provides additional information, such as the distribution of relaxation times and water content, which is potentially applicable to clinical medicine. Materials and Methods : First, a computer simulation was performed with generated relaxation data to verify the accuracy and reliability of the relaxographic method (CONTIN). Second, in order to see how well CONTIN quantifies and resolves two different ${T_1}$ environments, we used CONTIN to calculate the oil-to-water peak area ratios and identify the peak positions of the ${T_1}$-distribution curves of phantom solutions, which consisted of four centrifugal tubes (10 ml) filled with mixtures of 0, 10, 20, and 30% corn oil and distilled water. Finally, inversion recovery MR images of a volunteer were acquired for TI values ranging from 40 to 1160 msec with TR/TE=2200/20 msec. CONTIN analysis was performed on 3 different ROIs (GM, WM, CSF) to obtain the ${T_1}$-distribution curves, which gave the peak positions and peak areas for each ROI location. Results : The simulation results show that the errors in peak position were smaller for the higher peak (centered at ${T_1}=600$ msec) than for the lower peak (centered at ${T_1}=150$ msec) at all SNRs, but the errors in peak area were larger for the higher peak than for the lower peak. The CONTIN analysis of the measured relaxation data of the phantoms revealed two peaks, one between 20 and 60 msec and one between 500 and 700 msec. The analysis gives the peak area ratio as oil 10%: oil 20%: oil 30% = 1:1.3:1.9, which differs from the exact ratio, 1:2:3. For the human brain, only one component of the ${T_1}$-distribution was observed in ROI 3 (CSF), whereas two components were observed in ROI 1 (GM) and ROI 2 (WM). For the WM and CSF, there was good agreement between the observed ${T_1}$ relaxation times and the reported values. 
Conclusion : We demonstrated that the relaxographic method provides additional information, such as the distribution of relaxation times and water content, that is not available from routine relaxometry and ${T_1}/{T_2}$ mapping techniques. In addition, the additional information provided by relaxographic analysis may have clinical importance.

A Study of the Application of 'Digital Heritage ODA' - Focusing on the Myanmar cultural heritage management system - (디지털 문화유산 ODA 적용에 관한 시론적 연구 -미얀마 문화유산 관리시스템을 중심으로-)

  • Jeong, Seongmi
    • Korean Journal of Heritage: History & Science / v.53 no.4 / pp.198-215 / 2020
  • Official development assistance refers to assistance provided by governments and other public institutions in donor countries, aimed at promoting economic development and social welfare in developing countries. The purpose of this research is to examine the construction process of the "Myanmar Cultural Heritage Management System" that is underway as part of the ODA project to strengthen cultural and artistic capabilities and analyze the achievements and challenges of the Digital Cultural Heritage ODA. The digital cultural heritage management system is intended to achieve the permanent preservation and sustainable utilization of tangible and intangible cultural heritage materials. Cultural heritage can be stored in digital archives, newly approached using computer analysis technology, and information can be used in multiple dimensions. First, the Digital Cultural Heritage ODA was able to permanently preserve cultural heritage content that urgently needed digitalization by overcoming and documenting the "risk" associated with cultural heritage under threat of being extinguished, damaged, degraded, or distorted in Myanmar. Second, information on Myanmar's cultural heritage can be systematically managed and used in many ways through linkages between materials. Third, cultural maps can be implemented that are based on accurate geographical location information as to where cultural heritage is located or inherited. Various items of cultural heritage were collectively and intensively visualized to maximize utility and convenience for academic, policy, and practical purposes. Fourth, we were able to overcome the one-sided limitations of cultural ODA in relations between donor and recipient countries. Fifth, the capacity building program run by officials in charge of the beneficiary country, which could be the most important form of sustainable development in the cultural ODA, was operated together. 
Sixth, there is an implication that it is an ODA that can be relatively smooth and non-face-to-face in nature, without requiring the movement of manpower between countries during the current global pandemic. However, the following tasks remain to be solved through active discussion and deliberation in the future. First, the content of the data uploaded to the system should be verified. Second, to preserve digital cultural heritage, it must be protected from various threats. For example, it is necessary to train local experts to prepare for errors caused by computer viruses, stored data, or operating systems. Third, due to the nature of the rapidly changing environment of computer technology, measures should also be discussed to address the problems that tend to follow when new versions and programs are developed after the end of the ODA project, or when developers have not continued to manage their programs. Fourth, since the classification system criteria and decisions regarding whether the data will be disclosed or not are set according to Myanmar's political judgment, it is necessary to let the beneficiary country understand the ultimate purpose of the cultural ODA project.

A Study on Method of Citizen Science and Improvement of Performance as a Ecosystem Conservation and Management Tool of Wetland Protected Areas (Inland Wetland) - Focused on the Target of Conservation·Management·Utilization in Wetland Protected Area Conservation Plan - (내륙 습지보호지역의 생태계 보전·관리 도구로서 시민과학연구 방법론 및 성과 제고 방안 - 습지보호지역 보전계획의 보전·관리·이용 목표를 중심으로 -)

  • Inae Yeo;Changsu Lee;Ji Hyun Kang
    • Journal of Environmental Impact Assessment / v.32 no.6 / pp.450-462 / 2023
  • This study proposed a Citizen Science methodology as a tool for ecosystem conservation and management to achieve the Wetland Protected Area (WPA) Conservation Plan, and examined its applicability in 3 WPAs (Jangrok of Gwangju metropolitan city, Madongho of Goseong in South Gyeongsang Province, and Incheongang estuary of Gochang in North Jeolla Province). The methodology consists of a) identifying the main interests, stakeholders, and beneficiaries of the WPA and their information demands, based on the conservation, utilization, and management targets in the WPA Conservation Plan, b) conducting research activities to produce outcomes that address the stakeholders' demands, and c) returning the research outcomes to citizen scientists and disseminating them to society. Based on the proposed method and process, citizen scientists conducted ecosystem monitoring (plants including Invasive Alien Plants, terrestrial insects, traces of mammals, and discovery of unknown wetlands). As a result, citizen scientists contributed to collecting species information on 16 plants, 43 species of terrestrial insects, and 5 mammals including Lutra lutra (Endangered Species I) and Prionailurus bengalensis (Endangered Species II). The authors constructed a distribution map of Invasive Alien Plants, which included the location and density information registered by citizen scientists, and provided it to the Environment Agencies and local governments that manage the 3 WPAs to support data-based ecosystem policy. In further studies, for the proposed Citizen Science in WPAs to become established as a tool for ecosystem conservation and management, research data and outcomes from citizen science should be accumulated to meet policy demands, and the policy applicability and social and economic ripple effects should be carefully reviewed.

Animal Infectious Diseases Prevention through Big Data and Deep Learning (빅데이터와 딥러닝을 활용한 동물 감염병 확산 차단)

  • Kim, Sung Hyun;Choi, Joon Ki;Kim, Jae Seok;Jang, Ah Reum;Lee, Jae Ho;Cha, Kyung Jin;Lee, Sang Won
    • Journal of Intelligence and Information Systems / v.24 no.4 / pp.137-154 / 2018
  • Animal infectious diseases, such as avian influenza and foot-and-mouth disease, occur almost every year and cause huge economic and social damage to the country. To prevent this, the quarantine authorities have devoted various human and material resources, but the infectious diseases have continued to occur. Avian influenza is known to have first occurred in 1878, and it became a national issue due to its high lethality. Foot-and-mouth disease is considered the most critical animal infectious disease internationally. In a nation where this disease has not spread, foot-and-mouth disease is recognized as an economic or political disease because it restricts international trade by complicating the import of processed and non-processed livestock, and because quarantine is costly. In a society where the whole nation is connected as a single living zone, there is no way to fully prevent the spread of infectious disease. Hence, there is a need to be aware of the occurrence of the disease and to take action before it spreads. For both human and animal infectious diseases, once a case is confirmed, an epidemiological investigation of the confirmed case is carried out, and measures are taken to prevent the spread of the disease according to the investigation results. The foundation of epidemiological investigation is figuring out where one has been and whom he or she has met. From a data perspective, this can be defined as an action taken to predict the cause of the outbreak, the outbreak location, and future infection by collecting and analyzing geographic data and relation data. Recently, attempts have been made to develop prediction models of infectious disease using Big Data and deep learning technology, but there is little active research on model building and few case reports. 
KT and the Ministry of Science and ICT have been carrying out big data projects since 2014 as part of national R&D projects to analyze and predict the routes of livestock-related vehicles. To prevent animal infectious diseases, the researchers first developed a prediction model based on a regression analysis using vehicle movement data. After that, a more accurate prediction model was constructed using machine learning algorithms such as Logistic Regression, Lasso, Support Vector Machine, and Random Forest. In particular, the prediction model for 2017 added the risk of diffusion to the facilities, and the performance of the model was improved by considering the hyper-parameters of the modeling in various ways. The Confusion Matrix and ROC Curve show that the model constructed in 2017 is superior to the machine learning model. The difference between the 2016 model and the 2017 model is that the later model also uses information on visits to facilities such as feed factories and slaughterhouses, as well as information on poultry, which was previously limited to chickens and ducks but has now been expanded to geese and quail. In addition, in 2017 an explanation of the results was added to help the authorities make decisions and to establish a basis for persuading stakeholders. This study reports an animal infectious disease prevention system built on Big Data covering hazardous vehicle movement, farms, and the environment. The significance of this study is that it describes the evolution of a prediction model using Big Data that is used in the field, and the model is expected to become more complete if the type of virus is taken into consideration. This will contribute to data utilization and the development of analysis models in related fields. In addition, we expect that the system constructed in this study will enable more proactive and effective prevention.