• Title/Summary/Keyword: Public dataset

Search Result 254, Processing Time 0.029 seconds

The Role and Necessity of Public Health Services in a Remote Area

  • Lee-Seung KWON
    • Journal of Wellbeing Management and Applied Psychology
    • /
    • v.6 no.4
    • /
    • pp.63-68
    • /
    • 2023
  • Purpose: This study aims to investigate the national obligation of public health support for residents in medically vulnerable areas in Korea, and to propose a suitable model for public health institutions in this region. Research design, data, and methodology: A survey targeting residents was conducted from August 10 to August 17, 2021, with a sample size of 177 general citizens. The survey utilized a structured questionnaire administered online through Google, employing convenience random sampling. After an editing process to ensure data accuracy, the final dataset of 174 valid samples underwent encoding, coding, and cleaning using the IBM SPSS Statistics 22.0 program for analysis. Results: Health status revealed a moderate level, and 63.8% reported having chronic diseases, particularly prevalent among the elderly. External healthcare institutions were commonly utilized, with proximity and competence of doctors being primary reasons. Respondents expressed a need for improving the public health and medical system, emphasizing the establishment of a County Health Centre and expanding medical departments. Conclusions: In this region, the region's unique challenges, including education, employment, population decline, aging, and transportation, require multidimensional efforts and urgent intervention by public entities. Long-term strategies involve considering the establishment of a health and medical institute, adjusting health centre resources to local realities, and fostering a cooperative system for collaboration among residents and institutions.

A Design and Implementation of a DCAT-based Metadata Transformation Tool for Interoperability in Open Data Platforms (오픈데이터 플랫폼의 상호운용성을 위한 DCAT 기반 메타데이터 변환도구 설계 및 구현)

  • Park, Kyounghyun;Wonk, Hee Sun;Ryu, Keun Ho
    • Journal of Digital Contents Society
    • /
    • v.19 no.1
    • /
    • pp.59-65
    • /
    • 2018
  • As open data(public data) began to be recognized as a source of national economic development, many countries began to build public data portals and provide open data to the private sector. In accordance with this trend, open source communities have begun to develop open data platform such as CKAN and enable to share dataset among open data platforms by applying metadata standard technology. However, many governments and local governments are still making it difficult to share data between data portals because they build their own platforms. In this paper, we propose a DCAT-based metadata transformation tool to solve these problems, and show how to transform a dataset into DCAT.

Derivation of Driving Stability Indicators for Autonomous Vehicles Based on Analyzing Waymo Open Dataset (Waymo Open Dataset 기반 자율차의 주행행태분석을 통한 주행안정성 평가지표 도출)

  • Hoyoon Lee;Jeonghoon Jee;Cheol Oh;Hoseon Kim
    • The Journal of The Korea Institute of Intelligent Transport Systems
    • /
    • v.23 no.4
    • /
    • pp.94-109
    • /
    • 2024
  • As autonomous vehicles are allowed to drive on public roads, there is an increasing amount of on-road data available for research. It has therefore become possible to analyze impacts of autonomous vehicles on traffic safety using real-world data. It is necessary to use indicators that are well-representative of the driving behavior of autonomous vehicles to understand the implications of them on traffic safety. This study aims to derive indicators that effectively reflect the driving stability of autonomous vehicles by analyzing the driving behavior using the Waymo Open Dataset. Principal component analysis was adopted to derive indicators with high explanatory capability for the dataset. Driving stability indicators were separated into longitudinal and lateral ones. The road segments on the dataset were divided into four based on the characteristics of each, which were signalized and unsignalized intersections, tangent road section, and curved road section. The longitudinal driving stability was 35.48% higher in the curved road sections compared to the unsignalized intersections. With regard to the lateral driving stability, the driving stability was 76.08% higher in the signalized intersections than in the unsignalized intersections. The comparison between curved and tangent road segments showed that tangent roads are 146.87% higher regarding lateral driving stability. The results of this study are valuable for the further research to analyze the impact of autonomous vehicles on traffic safety using real-world data.

Image Clustering Using Machine Learning : Study of InceptionV3 with K-means Methods. (머신 러닝을 사용한 이미지 클러스터링: K-means 방법을 사용한 InceptionV3 연구)

  • Nindam, Somsauwt;Lee, Hyo Jong
    • Annual Conference of KIPS
    • /
    • 2021.11a
    • /
    • pp.681-684
    • /
    • 2021
  • In this paper, we study image clustering without labeling using machine learning techniques. We proposed an unsupervised machine learning technique to design an image clustering model that automatically categorizes images into groups. Our experiment focused on inception convolutional neural networks (inception V3) with k-mean methods to cluster images. For this, we collect the public datasets containing Food-K5, Flowers, Handwritten Digit, Cats-dogs, and our dataset Rice Germination, and the owner dataset Palm print. Our experiment can expand into three-part; First, format all the images to un-label and move to whole datasets. Second, load dataset into the inception V3 extraction image features and transferred to the k-mean cluster group hold on six classes. Lastly, evaluate modeling accuracy using the confusion matrix base on precision, recall, F1 to analyze. In this our methods, we can get the results as 1) Handwritten Digit (precision = 1.000, recall = 1.000, F1 = 1.00), 2) Food-K5 (precision = 0.975, recall = 0.945, F1 = 0.96), 3) Palm print (precision = 1.000, recall = 0.999, F1 = 1.00), 4) Cats-dogs (precision = 0.997, recall = 0.475, F1 = 0.64), 5) Flowers (precision = 0.610, recall = 0.982, F1 = 0.75), and our dataset 6) Rice Germination (precision = 0.997, recall = 0.943, F1 = 0.97). Our experiment showed that modeling could get an accuracy rate of 0.8908; the outcomes state that the proposed model is strongest enough to differentiate the different images and classify them into clusters.

Comparative analysis of Machine-Learning Based Models for Metal Surface Defect Detection (머신러닝 기반 금속외관 결함 검출 비교 분석)

  • Lee, Se-Hun;Kang, Seong-Hwan;Shin, Yo-Seob;Choi, Oh-Kyu;Kim, Sijong;Kang, Jae-Mo
    • Journal of the Korea Institute of Information and Communication Engineering
    • /
    • v.26 no.6
    • /
    • pp.834-841
    • /
    • 2022
  • Recently, applying artificial intelligence technologies in various fields of production has drawn an upsurge of research interest due to the increase for smart factory and artificial intelligence technologies. A great deal of effort is being made to introduce artificial intelligence algorithms into the defect detection task. Particularly, detection of defects on the surface of metal has a higher level of research interest compared to other materials (wood, plastics, fibers, etc.). In this paper, we compare and analyze the speed and performance of defect classification by combining machine learning techniques (Support Vector Machine, Softmax Regression, Decision Tree) with dimensionality reduction algorithms (Principal Component Analysis, AutoEncoders) and two convolutional neural networks (proposed method, ResNet). To validate and compare the performance and speed of the algorithms, we have adopted two datasets ((i) public dataset, (ii) actual dataset), and on the basis of the results, the most efficient algorithm is determined.

Training a semantic segmentation model for cracks in the concrete lining of tunnel (터널 콘크리트 라이닝 균열 분석을 위한 의미론적 분할 모델 학습)

  • Ham, Sangwoo;Bae, Soohyeon;Kim, Hwiyoung;Lee, Impyeong;Lee, Gyu-Phil;Kim, Donggyou
    • Journal of Korean Tunnelling and Underground Space Association
    • /
    • v.23 no.6
    • /
    • pp.549-558
    • /
    • 2021
  • In order to keep infrastructures such as tunnels and underground facilities safe, cracks of concrete lining in tunnel should be detected by regular inspections. Since regular inspections are accomplished through manual efforts using maintenance lift vehicles, it brings about traffic jam, exposes works to dangerous circumstances, and deteriorates consistency of crack inspection data. This study aims to provide methodology to automatically extract cracks from tunnel concrete lining images generated by the existing tunnel image acquisition system. Specifically, we train a deep learning based semantic segmentation model with open dataset, and evaluate its performance with the dataset from the existing tunnel image acquisition system. In particular, we compare the model performance in case of using all of a public dataset, subset of the public dataset which are related to tunnel surfaces, and the tunnel-related subset with negative examples. As a result, the model trained using the tunnel-related subset with negative examples reached the best performance. In the future, we expect that this research can be used for planning efficient model training strategy for crack detection.

Improvement of Face Recognition Algorithm for Residential Area Surveillance System Based on Graph Convolution Network (그래프 컨벌루션 네트워크 기반 주거지역 감시시스템의 얼굴인식 알고리즘 개선)

  • Tan Heyi;Byung-Won Min
    • Journal of Internet of Things and Convergence
    • /
    • v.10 no.2
    • /
    • pp.1-15
    • /
    • 2024
  • The construction of smart communities is a new method and important measure to ensure the security of residential areas. In order to solve the problem of low accuracy in face recognition caused by distorting facial features due to monitoring camera angles and other external factors, this paper proposes the following optimization strategies in designing a face recognition network: firstly, a global graph convolution module is designed to encode facial features as graph nodes, and a multi-scale feature enhancement residual module is designed to extract facial keypoint features in conjunction with the global graph convolution module. Secondly, after obtaining facial keypoints, they are constructed as a directed graph structure, and graph attention mechanisms are used to enhance the representation power of graph features. Finally, tensor computations are performed on the graph features of two faces, and the aggregated features are extracted and discriminated by a fully connected layer to determine whether the individuals' identities are the same. Through various experimental tests, the network designed in this paper achieves an AUC index of 85.65% for facial keypoint localization on the 300W public dataset and 88.92% on a self-built dataset. In terms of face recognition accuracy, the proposed network achieves an accuracy of 83.41% on the IBUG public dataset and 96.74% on a self-built dataset. Experimental results demonstrate that the network designed in this paper exhibits high detection and recognition accuracy for faces in surveillance videos.

Study on Image Use for Plant Disease Classification (작물의 병충해 분류를 위한 이미지 활용 방법 연구)

  • Jeong, Seong-Ho;Han, Jeong-Eun;Jeong, Seong-Kyun;Bong, Jae-Hwan
    • The Journal of the Korea institute of electronic communication sciences
    • /
    • v.17 no.2
    • /
    • pp.343-350
    • /
    • 2022
  • It is worth verifying the effectiveness of data integration between data with different features. This study investigated whether the data integration affects the accuracy of deep neural network (DNN), and which integration method shows the best improvement. This study used two different public datasets. One public dataset was taken in an actual farm in India. And another was taken in a laboratory environment in Korea. Leaf images were selected from two different public datasets to have five classes which includes normal and four different types of plant diseases. DNN used pre-trained VGG16 as a feature extractor and multi-layer perceptron as a classifier. Data were integrated into three different ways to be used for the training process. DNN was trained in a supervised manner via the integrated data. The trained DNN was evaluated by using a test dataset taken in an actual farm. DNN shows the best accuracy for the test dataset when DNN was first trained by images taken in the laboratory environment and then trained by images taken in the actual farm. The results show that data integration between plant images taken in a different environment helps improve the performance of deep neural networks. And the results also confirmed that independent use of plant images taken in different environments during the training process is more effective in improving the performance of DNN.

Possibility of the Use of Public Microarray Database for Identifying Significant Genes Associated with Oral Squamous Cell Carcinoma

  • Kim, Ki-Yeol;Cha, In-Ho
    • Genomics & Informatics
    • /
    • v.10 no.1
    • /
    • pp.23-32
    • /
    • 2012
  • There are lots of studies attempting to identify the expression changes in oral squamous cell carcinoma. Most studies include insufficient samples to apply statistical methods for detecting significant gene sets. This study combined two small microarray datasets from a public database and identified significant genes associated with the progress of oral squamous cell carcinoma. There were different expression scales between the two datasets, even though these datasets were generated under the same platforms - Affymetrix U133A gene chips. We discretized gene expressions of the two datasets by adjusting the differences between the datasets for detecting the more reliable information. From the combination of the two datasets, we detected 51 significant genes that were upregulated in oral squamous cell carcinoma. Most of them were published in previous studies as cancer-related genes. From these selected genes, significant genetic pathways associated with expression changes were identified. By combining several datasets from the public database, sufficient samples can be obtained for detecting reliable information. Most of the selected genes were known as cancer-related genes, including oral squamous cell carcinoma. Several unknown genes can be biologically evaluated in further studies.

Exploring the Impact of Environmental Factors on Fermentation Trends: A Google Trends Analysis from 2020 to 2024

  • Won JOO;Eun-Ah CHEON
    • Journal of Wellbeing Management and Applied Psychology
    • /
    • v.7 no.4
    • /
    • pp.51-64
    • /
    • 2024
  • Purpose: This study analyzes factors influencing public interest in fermentation using Google search trends. Specifically, it examines how key elements such as oxygen, temperature, time, and pH influence fermentation-0related searches from December 2020 to September 2024. Research design, data and methodology: Data from Google Trends was collected under the Beauty & Fitness category for the terms "Fermentation," "Oxygen," "Temperature," "Time," and "pH." Time series analysis was used to track trends over four years, and a correlation analysis was conducted to assess the relationships between these terms. A linear regression model was built to determine the influence of each factor on fermentation-related searches. The dataset was split into 80% training data and 20% testing data for model validation. Results: The correlation analysis indicated moderate positive relationships between fermentation-related searches and both time and pH, while oxygen had little to no correlation. The regression model showed that time and pH were the strongest influencers of fermentation interest, explaining 25% of the variance (R-squared = 0.25). Oxygen and temperature had minimal impact in predicting fermentation-related search interest. Conclusions: Time and pH are significant factors influencing public interest in fermentation-related topics, as shown by search trends. In contrast, oxygen and temperature, while important in the fermentation process itself, did not strongly affect public search behavior. These findings provide valuable insights for businesses and researchers looking to better understand consumer interest in fermentation products.