• Title/Summary/Keyword: nearest neighbors


Classification of nuclear activity types for neighboring countries of South Korea using machine learning techniques with xenon isotopic activity ratios

  • Sang-Kyung Lee;Ser Gi Hong
    • Nuclear Engineering and Technology
    • /
    • Vol. 56, No. 4
    • /
    • pp.1372-1384
    • /
    • 2024
  • Discriminating the source of a xenon gas release can provide an important clue for detecting nuclear activities in neighboring countries. In this paper, three machine learning techniques, namely logistic regression, support vector machine (SVM), and k-nearest neighbors (KNN), were applied to develop predictive models that discriminate the source of a xenon release based on xenon isotopic activity ratio data generated for the probable sources using the depletion codes ORIGEN in SCALE 6.2 and Serpent. The sources considered for the neighboring countries of South Korea include PWRs, CANDUs, IRT-2000, the Yongbyun 5 MWe reactor, and nuclear tests with plutonium and uranium. The analysis showed that the overall prediction accuracies of the SVM and KNN models using six inputs all exceeded 90%. In particular, the SVM and KNN models that used six or three xenon isotopic activity ratios with three classification categories, namely reactor, plutonium bomb, and uranium bomb, had accuracies greater than 88%. The prediction performance demonstrates the applicability of machine learning algorithms to predicting nuclear threats using xenon isotopic activity ratios.

A Study on the Drug Classification Using Machine Learning Techniques

  • Anmol Kumar Singh;Ayush Kumar;Adya Singh;Akashika Anshum;Pradeep Kumar Mallick
    • Industry and Science (산업과 과학)
    • /
    • Vol. 3, No. 2
    • /
    • pp.8-16
    • /
    • 2024
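  • This paper presents a drug classification system that aims to predict the most suitable drug for a patient based on demographic and physiological characteristics. The dataset includes attributes such as age, sex, blood pressure (BP), cholesterol level, and sodium-to-potassium ratio (Na_to_K), which are used to determine the appropriate drug. The models used in this study are K-Nearest Neighbors (KNN), logistic regression, and Random Forest. GridSearchCV with 5-fold cross-validation was used to optimize the hyperparameters, and each model was trained and tested on the dataset. The performance of each model, with and without hyperparameter tuning, was evaluated using metrics such as accuracy, the confusion matrix, and the classification report. The accuracies of the models without GridSearchCV were 0.7, 0.875, and 0.975, whereas the accuracies of the models with GridSearchCV were 0.75, 1.0, and 0.975. GridSearchCV identified logistic regression as the most effective of the three models for drug classification, followed by K-Nearest Neighbors, and the Na_to_K ratio was found to be an important feature for predicting the outcome.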
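
The tuning step described above, a grid search scored by 5-fold cross-validation, can be illustrated with a minimal sketch. The abstract names scikit-learn's GridSearchCV; the code below is a hand-written, standard-library stand-in for the same idea, not the authors' code. The candidate grid, the interleaved fold assignment, the labels `drugA` and `drugB`, and the synthetic two-feature data are all assumptions made for illustration.

```python
import math
import random
from collections import Counter


def knn_predict(train_X, train_y, x, k):
    """Majority vote among the k training points closest to x (Euclidean)."""
    order = sorted(range(len(train_X)), key=lambda i: math.dist(train_X[i], x))
    votes = Counter(train_y[i] for i in order[:k])
    return votes.most_common(1)[0][0]


def cv_accuracy(X, y, k, n_folds=5):
    """Mean k-NN accuracy over n_folds interleaved folds (index mod n_folds)."""
    fold_scores = []
    for fold in range(n_folds):
        test_idx = [i for i in range(len(X)) if i % n_folds == fold]
        train_idx = [i for i in range(len(X)) if i % n_folds != fold]
        train_X = [X[i] for i in train_idx]
        train_y = [y[i] for i in train_idx]
        hits = sum(knn_predict(train_X, train_y, X[i], k) == y[i]
                   for i in test_idx)
        fold_scores.append(hits / len(test_idx))
    return sum(fold_scores) / n_folds


def grid_search_k(X, y, grid, n_folds=5):
    """Score every candidate k by cross-validation and return the best one."""
    scores = {k: cv_accuracy(X, y, k, n_folds) for k in grid}
    best_k = max(scores, key=scores.get)
    return best_k, scores


# Synthetic toy data (NOT the paper's dataset): two well-separated classes.
rng = random.Random(0)
X, y = [], []
for _ in range(40):
    X.append((rng.gauss(0, 1), rng.gauss(0, 1)))
    y.append("drugA")  # hypothetical label
    X.append((rng.gauss(4, 1), rng.gauss(4, 1)))
    y.append("drugB")  # hypothetical label

best_k, scores = grid_search_k(X, y, grid=[1, 3, 5, 7])
print("best k:", best_k, "cv accuracy:", scores[best_k])
```

Scoring each candidate on held-out folds rather than on the training data is what keeps the selected k from simply memorizing the training set.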

System Trading using Case-based Reasoning based on Absolute Similarity Threshold and Genetic Algorithm

  • 한현웅;안현철
    • Journal of the Korea Association of Information Systems: The Journal of Information Systems (정보시스템연구)
    • /
    • Vol. 26, No. 3
    • /
    • pp.63-90
    • /
    • 2017
  • Purpose: This study proposes a novel system trading model using case-based reasoning (CBR) based on an absolute similarity threshold. The proposed model is designed to optimize the absolute similarity threshold, feature selection, and instance selection of CBR by using a genetic algorithm (GA). With these mechanisms, it aims to yield higher returns from stock market trading. Design/Methodology/Approach: The proposed CBR model uses an absolute similarity threshold varying from 0 to 1, which serves as a criterion for selecting appropriate neighbors in the nearest neighbor (NN) algorithm. Since it determines the nearest neighbors on an absolute basis, it sometimes fails to select any appropriate neighbors. In system trading, such a failure is interpreted as a 'hold' signal. That is, the system trading model proposed in this study makes trading decisions such as 'buy' or 'sell' only if the model produces a clear signal for stock market prediction. Also, in order to improve the prediction accuracy and the rate of return, the proposed model adopts optimal feature selection and instance selection, which are known to be very effective in enhancing the performance of CBR. To validate the usefulness of the proposed model, we applied it to index trading of the KOSPI200 from 2009 to 2016. Findings: Experimental results showed that the proposed model with optimal feature or instance selection could yield higher returns than the benchmark as well as the various comparison models (including logistic regression, multiple discriminant analysis, artificial neural network, support vector machine, and traditional CBR). In particular, the proposed model with optimal instance selection showed the best rate of return among all the models. This implies that the application of CBR with the absolute similarity threshold as well as optimal instance selection may be effective in system trading from the perspective of returns.
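
The thresholding idea in this abstract can be sketched as follows. The abstract does not give the similarity function or the voting rule, so this sketch assumes features scaled to [0, 1], a similarity of 1 minus the Euclidean distance divided by its maximum, and a majority vote in which a tie also yields 'hold'. The function names and the three toy cases are hypothetical.

```python
import math
from collections import Counter


def similarity(a, b):
    """Similarity in [0, 1]: 1 minus Euclidean distance scaled by its maximum.

    Assumes every feature is already scaled to [0, 1], so the largest
    possible distance between two cases is sqrt(number of features).
    """
    return 1.0 - math.dist(a, b) / math.sqrt(len(a))


def trading_signal(cases, query, threshold):
    """Return 'buy', 'sell', or 'hold' for the query case.

    cases: list of (features, label) pairs with label 'buy' or 'sell'.
    Only cases whose similarity to the query is at least `threshold` vote.
    If no case qualifies, or the vote is tied, the signal is 'hold'.
    """
    votes = Counter(label for feats, label in cases
                    if similarity(feats, query) >= threshold)
    if not votes:
        return "hold"
    ranked = votes.most_common(2)
    if len(ranked) == 2 and ranked[0][1] == ranked[1][1]:
        return "hold"
    return ranked[0][0]


# Hypothetical toy case base, not market data.
cases = [
    ((0.10, 0.20), "buy"),
    ((0.15, 0.25), "buy"),
    ((0.90, 0.80), "sell"),
]
near = trading_signal(cases, (0.12, 0.22), threshold=0.9)  # close to two 'buy' cases
far = trading_signal(cases, (0.50, 0.50), threshold=0.9)   # no case is similar enough
print(near, far)
```

Unlike a conventional k-NN rule, which always returns the k closest cases however distant they are, the absolute threshold lets the model abstain when nothing in the case base resembles the query.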

A new clustering algorithm based on the connected region generation

  • Feng, Liuwei;Chang, Dongxia;Zhao, Yao
    • KSII Transactions on Internet and Information Systems (TIIS)
    • /
    • Vol. 12, No. 6
    • /
    • pp.2619-2643
    • /
    • 2018
  • In this paper, a new clustering algorithm based on the connected region generation (CRG-clustering) is proposed. It is an effective and robust approach to clustering on the basis of the connectivity of the points and their neighbors. In the new algorithm, a connected region generating (CRG) algorithm is developed to obtain the connected regions and an isolated point set. Each connected region corresponds to a homogeneous cluster and this ensures the separability of an arbitrary data set theoretically. Then, a region expansion strategy and a consensus criterion are used to deal with the points in the isolated point set. Experimental results on the synthetic datasets and the real world datasets show that the proposed algorithm has high performance and is insensitive to noise.

Location-awareness based Hybrid P2P System

  • 민수홍;조동섭
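    • The Korean Institute of Electrical Engineers: Conference Proceedings
    • /
    • The Korean Institute of Electrical Engineers, 2007 Symposium Proceedings, Information and Control Section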
    • /
    • pp.448-450
    • /
    • 2007
  • The peer-to-peer (P2P) system has emerged as a popular model for making further use of Internet information and resources, complementing the available client-server services. However, when peers choose logical neighbors at random, without any knowledge of the underlying physical topology, serious performance degradation can result. In this paper, we consider the network distance between peers so that peers can select the nearest neighbors when they exchange queries for sharing resources. To reduce unnecessary signaling traffic and query exchange delay, we propose a hybrid P2P system based on a location-aware topology. The system calculates a network distance that combines direct measurement, such as RTT (Round Trip Time), with the geographic location of peers obtained from their IP addresses.

Electronic Structures of half-metallic phase of ternary Fe_2TX (T = 3d transition metal and X = Al, Si)

  • Park, Jin-Ho;Kwon, Se-Kyun;Byung Il Min
    • The Korean Magnetics Society: Conference Abstracts
    • /
    • The Korean Magnetics Society, 2000 International Symposium on Magnetics, The 2000 Fall Conference
    • /
    • pp.584-584
    • /
    • 2000
  • Electronic structures of ordered Fe_3X (X = Al, Si), and their derivative ternary alloys of Fe_2TX (T = 3d transition metal) have been investigated by using the linearized muffin-tin orbital (LMTO) band method. The role of the coupling between substituted transition metal and its neighbors is investigated by calculating the magnetic moments and local density of states (LDOS). It is shown that it is essential to include the coupling beyond nearest neighbors in obtaining the magnetic moment of Fe alloy. The preferential sites of T impurities in Fe_3X are determined from the total energy calculations. The derivative ternary alloys of Fe_2TX have characteristic electronic structures of semi-metal for Fe_2VAl and (nearly) half-metal for Fe_2TAl (T = Cr, Mn) and Fe_2TSi (T = V, Cr, Mn).

Customer Relationship Management in Telecom Market using an Optimized Case-based Reasoning

  • 안현철;김경재
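    • Korean Institute of Intelligent Systems: Conference Proceedings
    • /
    • Korea Fuzzy Logic and Intelligent Systems Society, Proceedings of the 2006 Fall Conference, Vol. 16, No. 2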
    • /
    • pp.285-288
    • /
    • 2006
  • Most previous studies on improving the effectiveness of CBR have focused on the similarity function or on the optimization of case features and their weights. However, according to some prior research, finding the optimal k parameter for the k-nearest neighbor (k-NN) algorithm is also crucial for improving the performance of a CBR system. Nonetheless, there have been few attempts to optimize the number of neighbors, especially using artificial intelligence (AI) techniques. In this study, we introduce a genetic algorithm (GA) to optimize the number of neighbors to combine, as well as the weight of each feature. The new model is applied to the real-world case of a major telecommunication company in Korea in order to build a prediction model for the customer profitability level. Experimental results show that our GA-optimized CBR approach outperforms other AI techniques for this multiclass classification problem.

Determining the optimal number of cases to combine in a case-based reasoning system for eCRM

  • Hyunchul Ahn;Kim, Kyoung-jae;Ingoo Han
    • The Korea Academia-Industrial Cooperation Society: Conference Proceedings
    • /
    • The Korea Academia-Industrial Cooperation Society, 2003 Proceedings
    • /
    • pp.178-184
    • /
    • 2003
  • Case-based reasoning (CBR) often shows significant promise for improving the effectiveness of complex and unstructured decision making. Consequently, it has been applied to various problem-solving areas including manufacturing, finance, and marketing. However, designing appropriate case indexing and retrieval mechanisms to improve the performance of CBR is still a challenging issue. Most previous studies on improving the effectiveness of CBR have focused on the similarity function or on the optimization of case features and their weights. However, according to some prior research, finding the optimal k parameter for k-nearest neighbor (k-NN) is also crucial for improving the performance of a CBR system. Nonetheless, there have been few attempts to optimize the number of neighbors, especially using artificial intelligence (AI) techniques. In this study, we introduce a genetic algorithm (GA) to optimize the number of neighbors to combine. This study applies the new model to a real-world case provided by an online shopping mall in Korea. Experimental results show that the GA-optimized k-NN approach outperforms other AI techniques for purchasing behavior forecasting.
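
A GA search over the number of neighbors can be sketched as below. The abstract does not describe the chromosome encoding, the genetic operators, or the fitness function, so everything in this sketch is an assumption: k is encoded as a 4-bit string (k from 1 to 16), fitness is accuracy on a held-out validation set, and the GA uses truncation selection, elitism, one-point crossover, and bit-flip mutation. The data are synthetic, and the labels `low` and `high` are hypothetical.

```python
import math
import random
from collections import Counter


def knn_accuracy(train, valid, k):
    """Share of validation cases whose k-NN majority vote matches the label."""
    hits = 0
    for x, label in valid:
        nearest = sorted(train, key=lambda case: math.dist(case[0], x))[:k]
        vote = Counter(lbl for _, lbl in nearest).most_common(1)[0][0]
        hits += vote == label
    return hits / len(valid)


def decode(bits):
    """Map a list of bits to an integer k in 1..2**len(bits)."""
    return 1 + int("".join(str(b) for b in bits), 2)


def evolve_k(train, valid, n_bits=4, pop_size=8, generations=10, seed=0):
    """Search for a good k with a small GA (all settings are illustrative)."""
    rng = random.Random(seed)

    def fitness(bits):
        return knn_accuracy(train, valid, decode(bits))

    pop = [[rng.randint(0, 1) for _ in range(n_bits)] for _ in range(pop_size)]
    for _ in range(generations):
        ranked = sorted(pop, key=fitness, reverse=True)
        parents = ranked[: pop_size // 2]      # truncation selection
        children = [ranked[0][:]]              # elitism: carry over the best
        while len(children) < pop_size:
            a, b = rng.sample(parents, 2)
            cut = rng.randint(1, n_bits - 1)   # one-point crossover
            child = a[:cut] + b[cut:]
            # bit-flip mutation with probability 0.1 per bit
            child = [1 - bit if rng.random() < 0.1 else bit for bit in child]
            children.append(child)
        pop = children
    best = max(pop, key=fitness)
    return decode(best), fitness(best)


def make_cases(rng, n_per_class):
    """Synthetic two-class toy data (not from the paper)."""
    cases = []
    for _ in range(n_per_class):
        cases.append(((rng.gauss(0, 1), rng.gauss(0, 1)), "low"))
        cases.append(((rng.gauss(4, 1), rng.gauss(4, 1)), "high"))
    return cases


data_rng = random.Random(1)
train = make_cases(data_rng, 30)
valid = make_cases(data_rng, 15)
best_k, best_acc = evolve_k(train, valid)
print("best k:", best_k, "validation accuracy:", best_acc)
```

For a single integer parameter an exhaustive scan would do equally well; the GA formulation becomes useful when k is searched jointly with other choices, such as feature weights.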

Addressing the Item Cold-Start in Recommendation Using Similar Warm Items

  • 한정규;천세진
    • Journal of Korea Multimedia Society
    • /
    • Vol. 24, No. 12
    • /
    • pp.1673-1681
    • /
    • 2021
  • Item cold start is a well-studied problem in the research field of recommender systems. Still, many existing collaborative filters cannot recommend items accurately when only a few user-item interactions are available for newly introduced items (cold items). We propose an interaction feature prediction method to mitigate the item cold-start problem. The proposed method predicts the interaction features that collaborative filters can calculate for the cold items. For prediction, in addition to the content features of the cold items used by state-of-the-art methods, our method exploits the interaction features of the k-nearest content neighbors of the cold items. An attention network is adopted to extract appropriate information from the interaction features of the neighbors by examining the content feature similarity between the cold item and its neighbors. Our evaluation on a real dataset, CiteULike, shows that the proposed method outperforms state-of-the-art methods by 0.027 in the Recall@20 metric and by 0.023 in the NDCG@20 metric.
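
The two metrics reported above, Recall@20 and NDCG@20, can be illustrated with a short sketch. The abstract does not state the exact definitions used in the evaluation, so this sketch assumes the common binary-relevance forms: recall divides the hits in the top k by the total number of relevant items, and NDCG uses a log2 position discount. The item identifiers are hypothetical.

```python
import math


def recall_at_k(ranked_items, relevant, k):
    """Fraction of the relevant items that appear in the top k of the ranking."""
    if not relevant:
        return 0.0
    hits = sum(1 for item in ranked_items[:k] if item in relevant)
    return hits / len(relevant)


def ndcg_at_k(ranked_items, relevant, k):
    """Binary-relevance NDCG: DCG of the top k divided by the ideal DCG."""
    dcg = sum(1.0 / math.log2(pos + 2)
              for pos, item in enumerate(ranked_items[:k]) if item in relevant)
    ideal_hits = min(len(relevant), k)
    idcg = sum(1.0 / math.log2(pos + 2) for pos in range(ideal_hits))
    return dcg / idcg if idcg > 0 else 0.0


# Hypothetical ranking for one user; "z" is relevant but never recommended.
ranking = ["a", "b", "c", "d", "e"]
relevant = {"a", "d", "z"}
r = recall_at_k(ranking, relevant, k=3)  # only "a" is in the top 3, so 1/3
n = ndcg_at_k(ranking, relevant, k=3)
print(r, n)
```

Recall@k ignores where in the top k a hit appears, whereas NDCG@k rewards hits placed nearer the top, which is why the two are usually reported together.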

BIM Mesh Optimization Algorithm Using K-Nearest Neighbors for Augmented Reality Visualization

  • 빠 빠 윈 아웅;이동환;박주영;조민건;박승희
    • Journal of the Korean Society of Civil Engineers
    • /
    • Vol. 42, No. 2
    • /
    • pp.249-256
    • /
    • 2022
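  • Recently, various studies have been actively conducted to show that real-time visualization technology combining BIM (Building Information Modeling) and AR (Augmented Reality) helps improve decision-making and processing efficiency in construction management. However, when large BIM data are applied to AR, various problems such as data transmission issues, image breakup, and video stuttering occur, so visualization efficiency must be improved through mesh optimization of the three-dimensional (3D) model. Most existing mesh simplification methods cannot adequately handle the meshes of complex 3D models with many boundaries. This study therefore proposes a mesh simplification algorithm based on a k-nearest neighbors (KNN) classification framework to reconstruct BIM data for high-performance AR visualization. The proposed algorithm simplified the selected BIM model with Unity C# code based on the triangle-centroid concept, and the model's dataset was classified with KNN, which can define the distances between vertices. As a result, the numbers of mesh vertices and triangles in the simplified whole model and in each structure were reduced by about 56% and about 42%, respectively. Compared with the original model, the simplified model showed no loss of visual elements or information, and it is therefore expected to improve high-performance visualization when AR devices are used.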