• 제목/요약/키워드: complete linkage algorithm

검색결과 7건 처리시간 0.023초

컴플리트 링키지 알고리즘을 이용한 교육시설물 BTL사업 유지관리번들 구성방안에 관한 연구 (A Study on Maintenance Bundle Alternatives of BTL Project for Educational Facilities Using Complete Linkage Algorithm)

  • 조창연;손재호
    • 교육시설
    • /
    • 제15권3호
    • /
    • pp.4-16
    • /
    • 2008
  • BTL(Build-Transfer-Lease) Project for Education Facilities is contracted as a package which consists of several education facilities and its maintenance period is 20 years. Thus, total cost variation largely depends on the accuracy of the maintenance cost forecasting in the early stage in the life cycle of the BTL Projects. This research develops a method using complete linkage algorithm and branch & bound algorithm to help in finding optimal bundling combination. The result of this research suggests more reasonable and effective forecasting method for the maintenance bundle in BTL projects.

최적화기법을 활용한 교육시설물 BTL 사업 운영관리비용 비용예측 시스템 개발 기초연구 (A Study on Development of Maintenance Cost Estimation System for BTL Project of Education Facilities Using Optimization Methodology)

  • 조창연;손재호;김재온
    • 한국건설관리학회논문집
    • /
    • 제10권1호
    • /
    • pp.45-57
    • /
    • 2009
  • 교육시설물 BTL 사업은 여러 개의 시설물을 1개의 번들로 계약하며, 계약기간을 20년으로 산정하여 그 비용을 지급한다. 따라서 BTL 프로젝트 초기단계에 운영관리비를 예측하는 정확도와 발주번들에 포함되는 시설물을 어떻게 편성하느냐에 따라 비용편차가 크게 발생하게 된다. 본 연구에서는 교육시설물 BTL 사업 발주 시 운영관리 최적 번들을 구성하기 위한 방법으로 컴플리트 링키지 알고리즘을 이용하는 비용 예측 시스템을 개발하고, 이를 통해 발주번들에 포함되는 시설물 변화에 따라 제한조건들이 변화하는 경우에 대해 사용자의 의사결정 시 판단근거자료로 활용 가능하게 하고자 한다. 본 연구에서 제안하는 시스템을 활용하면 보다 효율적이고 신속한 BTL 프로젝트의 운영비 예측이 가능할 것으로 사료된다.

지식 분류의 자동화를 위한 클러스터링 모형 연구 (Development of a Clustering Model for Automatic Knowledge Classification)

  • 정영미;이재윤
    • 정보관리학회지
    • /
    • 제18권2호
    • /
    • pp.203-230
    • /
    • 2001
  • 본 연구에서는 문헌을 기반으로 한 지식의 자동분류를 위해 최적의 클러스터링 모형을 제시하고자 하였다. 클러스터링 실험을 위해서 신문기사 실험집단과 학술논문 초록 실험집단을 구축하였고, 분류 성능 평가 척도인 WACS를 개발하였다. 분류자질로 사용한 용어의 집합은 다양한 자질 축소 기준을 적용하여 생성하였으며, 다양한 용어 가중치를 사용하였다. 유사계수 공식으로는 코사인 계수와 자카드 계수를 적용하였으며, 클러스터링 알고리즘으로는 비계층적 기법인 완전연결 기법과 계층적 기법인 K-means기법을 각각 사용하였다. 실험 결과 신문기사 원문 집단에서의 성능이 좋았으며, 완전연결 기법의 성능이 K-means 기법보다 높게 나타났다. 역문헌빈도의 적용은 완전연결 클러스터링에서는 긍정적인 효과가 나타났으나, K-means 클러스터링에서는 그렇지 못했다. 분류자질은 전체의 7.66%만 사용하였을 경우에도 성능 저하가 크지 않았으며, K-means 클러스터링에서는 오히려 성능 향상 효과가 있었다.

  • PDF

Algorithm Design to Judge Fake News based on Bigdata and Artificial Intelligence

  • Kang, Jangmook;Lee, Sangwon
    • International Journal of Internet, Broadcasting and Communication
    • /
    • 제11권2호
    • /
    • pp.50-58
    • /
    • 2019
  • The clear and specific objective of this study is to design a false news discriminator algorithm for news articles transmitted on a text-based basis and an architecture that builds it into a system (H/W configuration with Hadoop-based in-memory technology, Deep Learning S/W design for bigdata and SNS linkage). Based on learning data on actual news, the government will submit advanced "fake news" test data as a result and complete theoretical research based on it. The need for research proposed by this study is social cost paid by rumors (including malicious comments) and rumors (written false news) due to the flood of fake news, false reports, rumors and stabbings, among other social challenges. In addition, fake news can distort normal communication channels, undermine human mutual trust, and reduce social capital at the same time. The final purpose of the study is to upgrade the study to a topic that is difficult to distinguish between false and exaggerated, fake and hypocrisy, sincere and false, fraud and error, truth and false.

군집분석 비교 및 한우 관능평가데이터 군집화 (A Comparison of Cluster Analyses and Clustering of Sensory Data on Hanwoo Bulls)

  • 김재희;고윤실
    • 응용통계연구
    • /
    • 제22권4호
    • /
    • pp.745-758
    • /
    • 2009
  • 자발적인 군집을 유도하는 다변량 통계기법으로 널리 사용되는 군집분석은 데이터에 기반한 탐색적 방법으로 쓰이며 군집원칙에 따라 여러 가지 방법이 제안되어 왔다. 또한 군집화된 결과에 대하여 유효성을 측정하는 측도도 다양한방법이 개발되었다. 본 연구에서는 계층적 군집분석 방법으로 최장연결법과 Ward의 방법, 비계층적 군집분석 방법으로 K-평균법 그리고 확률분포정보를 활용한 모형기반 군집분석방법을 이용하여 모의실험으로 군집분석을 실시하고 군집유효성 측도로는 연결성, Dunn 지수, 실루엣을 구하여 각 군집방법에 대해 유효성을 비교한다. 또한, 한우 관능평가 데이터에 군집분석을 적용하여 최적의 군집 상황을 구하고자 한다.

의무기록의 다각적 활용을 통한 충실도 높은 병원 암등록 체계의 구축: 서울아산병원의 경험 (Construction and Validation of Hospital-Based Cancer Registry Using Various Health Records to Detect Patients with Newly Diagnosed Cancer: Experience at Asan Medical Center)

  • 김화정;조진희;유용만;이선혜;황경하;이무송
    • Journal of Preventive Medicine and Public Health
    • /
    • 제43권3호
    • /
    • pp.257-264
    • /
    • 2010
  • Objectives: An accurate estimation of cancer patients is the basis of epidemiological studies and health services. However in Korea, cancer patients visiting out-patient clinics are usually ruled out of such studies and so these studies are suspected of underestimating the cancer patient population. The purpose of this study is to construct a more complete, hospital-based cancer patient registry using multiple sources of medical information. Methods: We constructed a cancer patient detection algorithm using records from various sources that were obtained from both the in-patients and out-patients seen at Asan Medical Center (AMC) for any reason. The medical data from the potentially incident cancer patients was reviewed four months after first being detected by the algorithm to determine whether these patients actually did or did not have cancer. Results: Besides the traditional practice of reviewing the charts of in-patients upon their discharge, five more sources of information were added for this algorithm, i.e., pathology reports, the national severe disease registry, the reason for treatment, prescriptions of chemotherapeutic agents and radiation therapy reports. The constructed algorithm was observed to have a PPV of 87.04%. Compared to the results of traditional practice, 36.8% of registry failures were avoided using the AMC algorithm. Conclusions: To minimize loss in the cancer registry, various data sources should be utilized, and the AMC algorithm can be a successful model for this. Further research will be required in order to apply novel and innovative technology to the electronic medical records system in order to generate new signals from data that has not been previously used.

Imputation Accuracy from Low to Moderate Density Single Nucleotide Polymorphism Chips in a Thai Multibreed Dairy Cattle Population

  • Jattawa, Danai;Elzo, Mauricio A.;Koonawootrittriron, Skorn;Suwanasopee, Thanathip
    • Asian-Australasian Journal of Animal Sciences
    • /
    • 제29권4호
    • /
    • pp.464-470
    • /
    • 2016
  • The objective of this study was to investigate the accuracy of imputation from low density (LDC) to moderate density SNP chips (MDC) in a Thai Holstein-Other multibreed dairy cattle population. Dairy cattle with complete pedigree information (n = 1,244) from 145 dairy farms were genotyped with GeneSeek GGP20K (n = 570), GGP26K (n = 540) and GGP80K (n = 134) chips. After checking for single nucleotide polymorphism (SNP) quality, 17,779 SNP markers in common between the GGP20K, GGP26K, and GGP80K were used to represent MDC. Animals were divided into two groups, a reference group (n = 912) and a test group (n = 332). The SNP markers chosen for the test group were those located in positions corresponding to GeneSeek GGP9K (n = 7,652). The LDC to MDC genotype imputation was carried out using three different software packages, namely Beagle 3.3 (population-based algorithm), FImpute 2.2 (combined family- and population-based algorithms) and Findhap 4 (combined family- and population-based algorithms). Imputation accuracies within and across chromosomes were calculated as ratios of correctly imputed SNP markers to overall imputed SNP markers. Imputation accuracy for the three software packages ranged from 76.79% to 93.94%. FImpute had higher imputation accuracy (93.94%) than Findhap (84.64%) and Beagle (76.79%). Imputation accuracies were similar and consistent across chromosomes for FImpute, but not for Findhap and Beagle. Most chromosomes that showed either high (73%) or low (80%) imputation accuracies were the same chromosomes that had above and below average linkage disequilibrium (LD; defined here as the correlation between pairs of adjacent SNP within chromosomes less than or equal to 1 Mb apart). Results indicated that FImpute was more suitable than Findhap and Beagle for genotype imputation in this Thai multibreed population. Perhaps additional increments in imputation accuracy could be achieved by increasing the completeness of pedigree information.