• Title/Summary/Keyword: Data linkage

Search Result 722, Processing Time 0.036 seconds

A study on the probabilistic record linkage and its application (확률적 자료연계의 이론과 적용에 관한 연구)

  • Choi, Yeonok;Lee, Sangin
    • The Korean Journal of Applied Statistics
    • /
    • v.34 no.5
    • /
    • pp.849-861
    • /
    • 2021
  • This paper aims to introduce the basic concept of probabilistic record linkage and its statistical framework, and describe the specific process and principle of performing it using a real example from Statistics Korea. First, we briefly describe the deterministic record linkage and compare it with probabilistic record linkage. We introduce the Fellegi-Sunter model framework for record linkage and the related paprameters: m-probability, u-probability, matched weight and decision rule. Finally, we show the detailed process of record linkage under Fellegi-Sunter model framework and evaluate the record linkage results, using sample data from the registered-based census and Population and Housing Census survey in Statistics Korea.

Construction of Genetic Linkage Map for Korean Soybean Genotypes using Molecular Markers

  • Jong Il Chung;Ye Jin Cho;Dae Jin Park;Sung Jin Han;Ju Ho Oh
    • KOREAN JOURNAL OF CROP SCIENCE
    • /
    • v.48 no.4
    • /
    • pp.297-302
    • /
    • 2003
  • Genetic linkage maps serve the plant geneticist in a number of ways, from marker assisted selection in plant improvement to map-based cloning in molecular genetic research. Genetic map based upon DNA polymorphism is a powerful tool for the study of qualitative and quantitative traits in crops. The objective of this study was to develop genetic linkage map of soybean using the population derived from the cross of Korean soybean cultivar 'Kwangkyo, and wild accession 'IT182305'. Total 1,000 Operon random primers for RAPD marker, 49 combinations of primer for AFLP marker, and 100 Satt primers for SSR marker were used to screen parental polymorphism. Total 341 markers (242 RAPD, 83 AFLP, and 16 SSR markers) was segregated in 85 $\textrm{F}_2$ population. Forty two markers that shown significantly distorted segregation ratio (1:2:1 for codominant or 3:1 for domimant marker) were not used in mapping procedure. A linkage map was constructed by applying the computer program MAPMAKER/EXP 3.0 to the 299 marker data with LOD 4.0 and maximum distance 50 cM. 176 markers were found to be genetically linked and formed 25 linkage groups. Linkage map spanned 2,292.7 cM across all 25 linkage groups. The average linkage distance between pair of markers among all linkage groups was 13.0 cM. The number of markers per linkage group ranged from 2 to 55. The longest linkage group 3 spanned 967.4 cM with 55 makers. This map requires further saturation with more markers and agronomically important traits will be joined over it.

A Study on the GIS Analysis Techniques for Finding an Catchment Area by Public Transport at Railway Stations Using Transport Cards Big Data (교통카드 빅 데이터를 활용한 철도역의 대중교통 연계영향권 설정을 위한 GIS 분석 기법 연구)

  • Jin, Sang Kyu;Kim, Hawng Bae
    • KSCE Journal of Civil and Environmental Engineering Research
    • /
    • v.36 no.6
    • /
    • pp.1093-1099
    • /
    • 2016
  • Currently, there are 499 metropolitan subway stations in Korea, but there are not many studies on the influence zone of linkage between railway station and public transport. Existing studies have been studied almost in terms of accessibility.. In addition, the existing research on the influence zone of linkage using survey data and statistics, there is a limit to the theoretical basis and analysis techniques. In this paper, we propose a new method to select on the influence zone of linkage, It is a GIS analysis technique using the spatial data of the railway station user as the large data of the traffic card. We applied the GIS analysis technique for select the influence zone of linkage based on the travel time of the network for each public transportation system. As a result, it was confirmed that the influence of the link of 15 minutes on the local bus, 20 minutes on the city bus and 25 minutes on the intercity bus were clearly distinguished according to the difference in network access time.

BIM data mapping based on M-BDL for BIM-BEMS connection (BIM-BEMS 연계를 위한 M-BDL 기반 BIM 데이터 맵핑)

  • Kang, Tae-Wook
    • Journal of the Korea Academia-Industrial cooperation Society
    • /
    • v.19 no.9
    • /
    • pp.348-354
    • /
    • 2018
  • This study proposes MF (Model Filter)-based M-BDL (MF-based BIM Data Linkage), which is a model filter-based data mapping method for BIM (Building Information Modeling)-BEMS linkage. Recently, BEMS (Building Energy Management System) is actively utilizing 3D spatial information. This allows the user to intuitively manage the facility energy linked to spatial information. To use BIM data in energy management systems, it is essential to link BEMS with BIM data only in terms of the user requirements. On the other hand, if the BIM is a rich dataset and is linked as it is, the user will need to manage the unnecessary information. By mapping only the data required for BEMS in heavy BIM data through M-BDL, the BIM data can be lightened and the amount of data required for maintenance can be reduced. This technology proposes a mapping method that can link the BIM data with the filtered BIM data.

The Analysis of Factors on the Service-Linkage of Long-term Care Workers for the Elderly (일부 노인 장기요양보호기관 종사자간의 서비스연계 조사)

  • You, Jae-Eung;Kim, Kyoung;Cha, Yong-Jun
    • The Journal of Korean Physical Therapy
    • /
    • v.24 no.1
    • /
    • pp.35-40
    • /
    • 2012
  • Purpose: This study was to analyze the factors that affect the service relationship of long term care workers for the elderly and to provide basic resource for the successful connection of long term care services. Methods: 259 subjects who were engaged in long term care units completed a self-administered questionnaire that measured the extent of service linkage among one another. The Cronbach's ${\alpha}$ score determined the internal consistency of the acquired data and the discriminated validity was estimated by Pearson's correlation coefficient. Multiple regression analysis was conducted to investigate the influence of the known factors on the service linkage. Results: Acceptance and participation negatively influenced on the service linkage. Reliance, comprehension, recognition on service, and frequent contact with others positively activated the service linkage of long term care workers. Conclusion: The establishments of systemic training courses providing education that emphasizes reliability and recognizes other services, including work environment to contact easily are needed to improve the service-linkage of long-term care workers for the elderly.

Predicting the Accuracy of Breeding Values Using High Density Genome Scans

  • Lee, Deuk-Hwan;Vasco, Daniel A.
    • Asian-Australasian Journal of Animal Sciences
    • /
    • v.24 no.2
    • /
    • pp.162-172
    • /
    • 2011
  • In this paper, simulation was used to determine accuracies of genomic breeding values for polygenic traits associated with many thousands of markers obtained from high density genome scans. The statistical approach was based upon stochastically simulating a pedigree with a specified base population and a specified set of population parameters including the effective and noneffective marker distances and generation time. For this population, marker and quantitative trait locus (QTL) genotypes were generated using either a single linkage group or multiple linkage group model. Single nucleotide polymorphism (SNP) was simulated for an entire bovine genome (except for the sex chromosome, n = 29) including linkage and recombination. Individuals drawn from the simulated population with specified marker and QTL genotypes were randomly mated to establish appropriate levels of linkage disequilibrium for ten generations. Phenotype and genomic SNP data sets were obtained from individuals starting after two generations. Genetic prediction was accomplished by statistically modeling the genomic relationship matrix and standard BLUP methods. The effect of the number of linkage groups was also investigated to determine its influence on the accuracy of breeding values for genomic selection. When using high density scan data (0.08 cM marker distance), accuracies of breeding values on juveniles were obtained of 0.60 and 0.82, for a low heritable trait (0.10) and high heritable trait (0.50), respectively, in the single linkage group model. Estimates of 0.38 and 0.60 were obtained for the same cases in the multiple linkage group models. Unexpectedly, use of BLUP regression methods across many chromosomes was found to give rise to reduced accuracy in breeding value determination. The reasons for this remain a target for further research, but the role of Mendelian sampling may play a fundamental role in producing this effect.

A Study on the Analysis of Identification System and the Linkage Method of Academic-information (학술정보의 식별체계 현황 분석 및 연계 방안 연구)

  • Gang, Ju-Yeon;Seol, Jae-Wook;Hwang, Hyekyong
    • Journal of Korean Library and Information Science Society
    • /
    • v.51 no.1
    • /
    • pp.115-143
    • /
    • 2020
  • With the era of the 4th Industrial Revolution, the number of data-centric integrated researches increases. The integrated researches make information identification and linkage more important, so it is necessary to seek a method to efficiently manage and share academic-information for supporting the researches. Therefore, this study aims to analyze identification system and linkable information types of 12 major academic search engines and bibliographic databases(ASEBDs) in Korea and abroad and to propose a method to identify and link academic-information. The analysis was conducted 2 times, and academic-information types, searchable fields, linkable information types, used identification system were investigated. As a result, the ASEBDs link directly or/and indirectly 3~4 information types based on their own identifiers with persistent identifiers. In addition, they identify academic-information semi-automatically based on machine learning methodology and collect and manage the related data. Finally, the method for academic-information linkage was proposed in terms of practice and society: linkage based on persistent identifiers and linkage based on collaborative network of institutions.

a Study on the Real-time Data Linkage of Field Control System for Distributed Control (분산제어를 위한 필드제어시스템의 실시간 데이터 연계)

  • Kim, S.G.;Song, S.I.;Oh, E.S.;Lee, S.W.;Gwak, K.Y.;Lee, E.W.;Park, T.R.
    • Proceedings of the KIEE Conference
    • /
    • 2003.07b
    • /
    • pp.777-779
    • /
    • 2003
  • This paper describes the real-time data linkage of the field control system for distributed control in nuclear power plant environment. The most important keys of digital control system in nuclear power plant are the reliability and stability of system, and real-time control ability. This Paper brought up the hardware construction using a new method about the design of each station located upon control transmission network to improve real-time ability of field control system, and measured the station binding time between devices connected to field control module. And it was confirmed performance improvement of overall system for real-time data linkage between control devices.

  • PDF

A Major DNA marker Mining of ILST035 microsatellite loci in Hanwoo Chromosome 6

  • Lee, Jea-Young;Yeo, Jung-Sou;Kim, Jae-Woo;Lee, Yong-Won
    • Journal of the Korean Data and Information Science Society
    • /
    • v.13 no.2
    • /
    • pp.97-104
    • /
    • 2002
  • K-Means modelling has been tried for finding major DNA marker of ILST035 microsatellite loci in Hanwoo Chromosome 6 linkage map. Major DNA markers are obtained from the ILST035 microsatellite through quantitative trait loci(QTL) and data mining modelling.

  • PDF

A Performance Improvement Study On Hierarchical Clustering (Centroid Linkage) Using A Priority Queue (Priority Queue 를 이용한 Hierarchical Clustering (Centroid Linkage) 성능 개선)

  • Jeon, Yongkweon;Yoon, Sungroh
    • Proceedings of the Korea Information Processing Society Conference
    • /
    • 2010.11a
    • /
    • pp.1837-1838
    • /
    • 2010
  • 기존 hierarchical clustering 은 Time complexity 와 space complexity 가 Large data set 을 clustering 하기에는 적당하지 못하며 이것을 일반 PC 의 메모리 내에서 해결하는데 어려움이 있다. 따라서 본 연구에서는 이러한 어려움을 극복하기 위해 기존 Hierarchical clustering 중 Centroid Linkage 에 새로운 Algorithm 을 제안하여 보다 적은 메모리를 사용하고 빠르게 처리하는 방법을 제안하고자 한다.