• Title/Summary/Keyword: Public Big data

Search Result 709, Processing Time 0.031 seconds

A Study about Library-Related Open Data through Public Data Portals (공공데이터 포털을 통해 개방된 도서관 관련 데이터 분석)

  • Cho, Jane
    • Journal of the Korean BIBLIA Society for library and Information Science
    • /
    • v.29 no.2
    • /
    • pp.35-56
    • /
    • 2018
  • This study examines the current state of library related data opened through public data portals, and analyzes how much data is being utilized according to the type of releasing organization, and open level. In addition, we analyzed the subject cluster of data and the centrality of data by performing PathFinder Network analysis using keywords assigned to data by dividing the releasing subject into local government and national/public institutions. Based on this, the subject area of library - related data disclosed by local governments and national/public organizations is understood. And identify the main open body that should be opened first by linking with data utilization analysis result and then suggest implications for future improvement in connection with library big data business.

Design of Client-Server Model For Effective Processing and Utilization of Bigdata (빅데이터의 효과적인 처리 및 활용을 위한 클라이언트-서버 모델 설계)

  • Park, Dae Seo;Kim, Hwa Jong
    • Journal of Intelligence and Information Systems
    • /
    • v.22 no.4
    • /
    • pp.109-122
    • /
    • 2016
  • Recently, big data analysis has developed into a field of interest to individuals and non-experts as well as companies and professionals. Accordingly, it is utilized for marketing and social problem solving by analyzing the data currently opened or collected directly. In Korea, various companies and individuals are challenging big data analysis, but it is difficult from the initial stage of analysis due to limitation of big data disclosure and collection difficulties. Nowadays, the system improvement for big data activation and big data disclosure services are variously carried out in Korea and abroad, and services for opening public data such as domestic government 3.0 (data.go.kr) are mainly implemented. In addition to the efforts made by the government, services that share data held by corporations or individuals are running, but it is difficult to find useful data because of the lack of shared data. In addition, big data traffic problems can occur because it is necessary to download and examine the entire data in order to grasp the attributes and simple information about the shared data. Therefore, We need for a new system for big data processing and utilization. First, big data pre-analysis technology is needed as a way to solve big data sharing problem. Pre-analysis is a concept proposed in this paper in order to solve the problem of sharing big data, and it means to provide users with the results generated by pre-analyzing the data in advance. Through preliminary analysis, it is possible to improve the usability of big data by providing information that can grasp the properties and characteristics of big data when the data user searches for big data. In addition, by sharing the summary data or sample data generated through the pre-analysis, it is possible to solve the security problem that may occur when the original data is disclosed, thereby enabling the big data sharing between the data provider and the data user. Second, it is necessary to quickly generate appropriate preprocessing results according to the level of disclosure or network status of raw data and to provide the results to users through big data distribution processing using spark. Third, in order to solve the problem of big traffic, the system monitors the traffic of the network in real time. When preprocessing the data requested by the user, preprocessing to a size available in the current network and transmitting it to the user is required so that no big traffic occurs. In this paper, we present various data sizes according to the level of disclosure through pre - analysis. This method is expected to show a low traffic volume when compared with the conventional method of sharing only raw data in a large number of systems. In this paper, we describe how to solve problems that occur when big data is released and used, and to help facilitate sharing and analysis. The client-server model uses SPARK for fast analysis and processing of user requests. Server Agent and a Client Agent, each of which is deployed on the Server and Client side. The Server Agent is a necessary agent for the data provider and performs preliminary analysis of big data to generate Data Descriptor with information of Sample Data, Summary Data, and Raw Data. In addition, it performs fast and efficient big data preprocessing through big data distribution processing and continuously monitors network traffic. The Client Agent is an agent placed on the data user side. It can search the big data through the Data Descriptor which is the result of the pre-analysis and can quickly search the data. The desired data can be requested from the server to download the big data according to the level of disclosure. It separates the Server Agent and the client agent when the data provider publishes the data for data to be used by the user. In particular, we focus on the Big Data Sharing, Distributed Big Data Processing, Big Traffic problem, and construct the detailed module of the client - server model and present the design method of each module. The system designed on the basis of the proposed model, the user who acquires the data analyzes the data in the desired direction or preprocesses the new data. By analyzing the newly processed data through the server agent, the data user changes its role as the data provider. The data provider can also obtain useful statistical information from the Data Descriptor of the data it discloses and become a data user to perform new analysis using the sample data. In this way, raw data is processed and processed big data is utilized by the user, thereby forming a natural shared environment. The role of data provider and data user is not distinguished, and provides an ideal shared service that enables everyone to be a provider and a user. The client-server model solves the problem of sharing big data and provides a free sharing environment to securely big data disclosure and provides an ideal shared service to easily find big data.

Interactions of Behavioral Changes in Smoking, High-risk Drinking, and Weight Gain in a Population of 7.2 Million in Korea

  • Kim, Yeon-Yong;Kang, Hee-Jin;Ha, Seongjun;Park, Jong Heon
    • Journal of Preventive Medicine and Public Health
    • /
    • v.52 no.4
    • /
    • pp.234-241
    • /
    • 2019
  • Objectives: To identify simultaneous behavioral changes in alcohol consumption, smoking, and weight using a fixed-effect model and to characterize their associations with disease status. Methods: This study included 7 000 529 individuals who participated in the national biennial health-screening program every 2 years from 2009 to 2016 and were aged 40 or more. We reconstructed the data into an individual-level panel dataset with 4 waves. We used a fixed-effect model for smoking, heavy alcohol drinking, and overweight. The independent variables were sex, age, lifestyle factors, insurance contribution, employment status, and disease status. Results: Becoming a high-risk drinker and losing weight were associated with initiation or resumption of smoking. Initiation or resumption of smoking and weight gain were associated with non-high-risk drinkers becoming high-risk drinkers. Smoking cessation and becoming a high-risk drinker were associated with normal-weight participants becoming overweight. Participants with newly acquired diabetes mellitus, ischemic heart disease, stroke, and cancer tended to stop smoking, discontinue high-risk drinking, and return to a normal weight. Conclusions: These results obtained using a large-scale population-based database documented interactions among lifestyle factors over time.

Changes and Strategies of the Government Service Paradigm through Using Big Data -Focused on Disaster Safety Management in Seoul City- (빅데이터활용을 통한 정부서비스 패러다임의 변화와 전략 -서울시 재난안전관리를 중심으로-)

  • Kim, Young-mi
    • Journal of Digital Convergence
    • /
    • v.15 no.2
    • /
    • pp.59-65
    • /
    • 2017
  • The basic goal of urban safety is to support citizens' quality of life and city competitiveness, and its importance is increasing. Since the risk of disasters is growing, there is a growing demand from society for minimizing the damage by preventing and responding to them in advance. In case of urban governments, securing safety emerges as one of the most important policy tasks due to natural disasters such as heavy rain and heavy snow and human disasters such as various accidents. Recently, it is emphasized the necessity to increase the prevention effect through disaster analysis using Big Data. This study examined paradigm change of disaster safety management using big data centering on Seoul city. In particular, the study tried case analysis from the viewpoint of maximizing effective government services for disaster safety management, and sought the strategic meaning in connection with the ordinance.

A Study on the Online Perception of Chabak Using Big Data Analysis (빅데이터 분석을 통한 차박의 온라인 인식에 대한 연구)

  • Kim, Sae-Hoon;Lee, Hwan-Soo
    • The Journal of Society for e-Business Studies
    • /
    • v.26 no.2
    • /
    • pp.61-81
    • /
    • 2021
  • In the era of untact, the "Chabak" using cars as accommodation spaces is attracting attention as a new form of travel. Due to the advantages, including low costs, convenience, and safety, as well as the characteristics of the vehicle enabling independent travel, the demand for Chabak is continuously increasing. Despite the rapid growth of the market and related industries, little academic has investigated this trend. To establish itself as a new type of travel culture and to sustain the growth of related industries, it is essential to understand the public perception of Chabak. Therefore, based on the marketing mix theory and big data analysis, this study analyzes the public perception of Chabak. The results showed that Chabak has established itself as a consumer-led travel culture, contributing to the aftermarket growth of the automobile industry. Additionally, consumers were found to be increasingly inclined to enjoy travel economically and wisely, and actively share information through social media. This initial study on the new travel trend of Chabak is significant in that it employs big data analysis on a theoretical basis.

A Study on Enhancement Method of Public Perception about Geoscience using Big Data Analysis: Focusing on Media Article (지질자원기술 빅데이터 분석을 통한 국민 인식 제고 방안 연구 : 언론 기사 중심으로)

  • Kim, Chan Souk
    • Economic and Environmental Geology
    • /
    • v.55 no.3
    • /
    • pp.273-280
    • /
    • 2022
  • The purpose of this study is to explore the social perception on geoscience using a big data analysis and to propose a way to enhance people's perception on geoscience. For this, 5,044 media articles including geoscience produced by 54 media companies from January 1, 2010 to April 14, 2022. were analyzed. Big data analyses were applied. The results of analyses are as follows: Media articles consist of key words of research institute, some countries of America, China and Japan, City of Pohang, CEO of KIGAM. And geology, industry, development of mineral resources, environment, energy, nuclear power, and groundwater are highlighted as key words. Also, it is confirmed that topics related to geoscience such as expert, environment and research institute are not individually isolated, but interconnected and linked to topics in the center of future, industry, and global. Based on this result, ways to enhance people's perception on geoscience were discussed.

A Study on establishing countermeasures to security threats due to the introduction of information protection system. (정보보호시스템도입에 따른 보안위협요소 대응방안수립에 관한 연구)

  • Kyung, ji-hun;Jung, Sung-Jae;Bae, Yu-Mi;Sung, Kyung
    • Proceedings of the Korean Institute of Information and Commucation Sciences Conference
    • /
    • 2013.05a
    • /
    • pp.693-696
    • /
    • 2013
  • Information protection system (Information protection system)-based IT environment built popularity in public agencies and businesses take advantage of the resources for the integration of the information system one essential environment began to recognize, cloud systems (Cloud System), cloud security (Cloud Security), big data (Big Data), big data security (Big Data Security), industrial security (Security Industry), as well as the issue. Due to the influence of these information protection system (Information protection system) in response to my external security threats based on the analysis plan. In this paper, data protection systems (Information protection system), resulting in the introduction, there are a number of security threats and particularly industrial security aspects and internal and external security threats in response by lighting about aspects of the plan is based on knowledge.

  • PDF

Evaluation of Transit Transfer Pattern for the Mobility Handicapped Using Traffic Card Big Data: Focus on Transfer between Bus and Metro (교통카드데이터를 활용한 교통약자 대중교통 환승통행패턴 분석: 버스 지하철 간 환승을 중심으로)

  • Kwon, Min young;Kim, Young chan;Ku, Ji sun
    • The Journal of The Korea Institute of Intelligent Transport Systems
    • /
    • v.20 no.2
    • /
    • pp.58-71
    • /
    • 2021
  • The number of elderly people worldwide is rapidly increasing and the mobility handicapped suffering from inconvenient public transportation service is also increasing. In Korea and abroad, various policies are being implemented to provide high-quality transportation services for the mobility handicapped, and budget support and investment related to mobility facilities are being expanded. The mobility handicapped spends more time for transit transfer than normal users and their satisfaction with transit service is also lower. There exist transfer inconvenience points of the mobility handicapped due to various factors such as long transfer distances, absence of transportation facilities like elevators, escalators, etc. The purpose of this study is to find transfer inconvenience points for convenient transit transfer of the mobility handicapped using Smart card Big data. This study process traffic card transaction data and construct transfer travel data by user groups using smart card big data and analysis of the transfer characteristics for each user group ; normal, children, elderly, etc. Finally, find transfer inconveniences points by comparing transfer patterns between normal users and the mobility handicapped. This study is significant in that it can find transfer inconvenience points for convenient transit transfer of the mobility handicapped using Smart card Big data. In addition, it can be applicated of Smart card Big data for developing public transportation polices in the future. It is expected that the result of this study be used to improve the accessibility of transit transportation for mobility handicapped.

Estimation of ship operational efficiency from AIS data using big data technology

  • Kim, Seong-Hoon;Roh, Myung-Il;Oh, Min-Jae;Park, Sung-Woo;Kim, In-Il
    • International Journal of Naval Architecture and Ocean Engineering
    • /
    • v.12 no.1
    • /
    • pp.440-454
    • /
    • 2020
  • To prevent pollution from ships, the Energy Efficiency Design Index (EEDI) is a mandatory guideline for all new ships. The Ship Energy Efficiency Management Plan (SEEMP) has also been applied by MARPOL to all existing ships. SEEMP provides the Energy Efficiency Operational Indicator (EEOI) for monitoring the operational efficiency of a ship. By monitoring the EEOI, the shipowner or operator can establish strategic plans, such as routing, hull cleaning, decommissioning, new building, etc. The key parameter in calculating EEOI is Fuel Oil Consumption (FOC). It can be measured on board while a ship is operating. This means that only the shipowner or operator can calculate the EEOI of their own ships. If the EEOI can be calculated without the actual FOC, however, then the other stakeholders, such as the shipbuilding company and Class, or others who don't have the measured FOC, can check how efficiently their ships are operating compared to other ships. In this study, we propose a method to estimate the EEOI without requiring the actual FOC. The Automatic Identification System (AIS) data, ship static data, and environment data that can be publicly obtained are used to calculate the EEOI. Since the public data are of large capacity, big data technologies, specifically Hadoop and Spark, are used. We verify the proposed method using actual data, and the result shows that the proposed method can estimate EEOI from public data without actual FOC.

Level of Agreement and Factors Associated With Discrepancies Between Nationwide Medical History Questionnaires and Hospital Claims Data

  • Kim, Yeon-Yong;Park, Jong Heon;Kang, Hee-Jin;Lee, Eun Joo;Ha, Seongjun;Shin, Soon-Ae
    • Journal of Preventive Medicine and Public Health
    • /
    • v.50 no.5
    • /
    • pp.294-302
    • /
    • 2017
  • Objectives: The objectives of this study were to investigate the agreement between medical history questionnaire data and claims data and to identify the factors that were associated with discrepancies between these data types. Methods: Data from self-reported questionnaires that assessed an individual's history of hypertension, diabetes mellitus, dyslipidemia, stroke, heart disease, and pulmonary tuberculosis were collected from a general health screening database for 2014. Data for these diseases were collected from a healthcare utilization claims database between 2009 and 2014. Overall agreement, sensitivity, specificity, and kappa values were calculated. Multiple logistic regression analysis was performed to identify factors associated with discrepancies and was adjusted for age, gender, insurance type, insurance contribution, residential area, and comorbidities. Results: Agreement was highest between questionnaire data and claims data based on primary codes up to 1 year before the completion of self-reported questionnaires and was lowest for claims data based on primary and secondary codes up to 5 years before the completion of self-reported questionnaires. When comparing data based on primary codes up to 1 year before the completion of selfreported questionnaires, the overall agreement, sensitivity, specificity, and kappa values ranged from 93.2 to 98.8%, 26.2 to 84.3%, 95.7 to 99.6%, and 0.09 to 0.78, respectively. Agreement was excellent for hypertension and diabetes, fair to good for stroke and heart disease, and poor for pulmonary tuberculosis and dyslipidemia. Women, younger individuals, and employed individuals were most likely to under-report disease. Conclusions: Detailed patient characteristics that had an impact on information bias were identified through the differing levels of agreement.