• Title/Summary/Keyword: 평균 사용자 유사도 (average user similarity)


Clustering Normal User Behavior for Anomaly Intrusion Detection (비정상행위 탐지를 위한 사용자 정상행위 클러스터링 기법)

  • Oh, Sang-Hyun;Lee, Won-Suk
    • The KIPS Transactions:PartC
    • /
    • v.10C no.7
    • /
    • pp.857-866
    • /
    • 2003
  • For detecting an intrusion based on the anomaly of a user's activities, previous work has concentrated on statistical techniques for analyzing an audit data set. However, since these techniques mainly analyze the average behavior of a user's activities, some anomalies can be detected inaccurately. In this paper, a new clustering algorithm for modeling the normal pattern of a user's activities is proposed. Since clustering can identify an arbitrary number of dense ranges in an analysis domain, it can eliminate the inaccuracy caused by statistical analysis. Also, clustering can be used to model common knowledge occurring frequently in a set of transactions. Consequently, the common activities of a user can be found more accurately. The common knowledge is represented by the occurrence frequency of similar data objects by the unit of a transaction as well as the common repetitive ratio of similar data objects in each transaction. Furthermore, the proposed method also addresses how to maintain identified common knowledge as a concise profile. As a result, the profile can be used to detect any anomalous behavior in an online transaction.

Two-step Scheduling With Reduced Feedback Overhead in Multiuser Relay Systems (다중 사용자 릴레이 시스템에서 감소된 피드백 정보를 이용한 두 단계 스케줄링 기법)

  • Jang, Yong-Up;Shin, Won-Yong;Kim, A-Jung
    • The Journal of Korean Institute of Communications and Information Sciences
    • /
    • v.36 no.5A
    • /
    • pp.511-520
    • /
    • 2011
  • In this paper, we introduce a multiuser (MU) scheduling method for multiuser amplify-and-forward relay systems, which selects both the transmission mode, i.e., either one- or two-hop transmission, and the desired user via two steps. A closed-form expression for the average achievable rate of the proposed scheduling is derived under two transmission modes with MU scheduling, and its asymptotic solution is also analyzed in the limit of a large number of mobile stations. Based on the analysis, we perform our two-step scheduling algorithm: the transmission mode selection followed by the user selection, which needs only partial feedback of instantaneous signal-to-noise ratios (SNRs) to the base station. We also analyze the average SNR condition under which the MU diversity gain is fully exploited. In addition, we examine how to further reduce the amount of feedback under certain conditions. The proposed algorithm achieves a rate comparable to that of the optimal scheme using full feedback information, while its required feedback overhead is reduced to less than half of that of the optimal scheme.

Key Frame Detection and Multimedia Retrieval on MPEG Video (MPEG 비디오 스트림에서의 대표 프레임 추출 및 멀티미디어 검색 기법)

  • 김영호;강대성
    • Proceedings of the Korea Institute of Convergence Signal Processing
    • /
    • 2000.08a
    • /
    • pp.297-300
    • /
    • 2000
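  • In this paper, we analyze an MPEG video stream to extract the DCT DC coefficients, detect shots from the DC images composed of these coefficients using the proposed robust features, and, using the statistical properties of each feature, apply weights according to the characteristics of the stream to obtain the temporal variation of the resulting characterizing value. The local maxima and local minima of this variation represent, respectively, the most distinctive frames and the most average frames in the MPEG video stream. By locating the frames at these instants, key frames are extracted effectively and quickly. For the extracted key frames, the original images are reconstructed and a number of parameters are computed for indexing; the same parameters are then computed for the image queried by the user in order to retrieve the representative images most similar to the key frames.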
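The extraction step described in this abstract, picking frames at the local maxima and local minima of a per-frame value, can be sketched as follows. This is a minimal illustration only: the function name `local_extrema` and the use of a plain list of numbers are assumptions, and the abstract does not say how the characterizing value itself is weighted or computed.

```python
def local_extrema(values):
    """Return (maxima, minima): indices of strict local maxima and minima.

    `values` stands in for the temporal variation of the characterizing
    value, one number per frame (a hypothetical input for illustration).
    """
    maxima, minima = [], []
    for i in range(1, len(values) - 1):
        if values[i - 1] < values[i] > values[i + 1]:
            maxima.append(i)  # candidate "most distinctive" frame
        elif values[i - 1] > values[i] < values[i + 1]:
            minima.append(i)  # candidate "most average" frame
    return maxima, minima
```

Endpoints are never reported because they have only one neighbour; plateaus are ignored because the comparisons are strict.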


K-means clustering using a center of gravity for grid-based sample (그리드 기반 표본의 무게중심을 이용한 케이-평균군집화)

  • Lee, Sun-Myung;Park, Hee-Chang
    • Journal of the Korean Data and Information Science Society
    • /
    • v.21 no.1
    • /
    • pp.121-128
    • /
    • 2010
  • K-means clustering is an iterative algorithm in which items are moved among sets of clusters until the desired set is reached. K-means clustering has been widely used in many applications, such as market research, pattern analysis or recognition, and image processing. It can identify dense and sparse regions among data attributes or object attributes. However, the k-means algorithm requires many hours to obtain the k clusters we want, because it is a primitive, exploratory method. In this paper we propose a new method of k-means clustering that uses the center of gravity of a grid-based sample. It is faster than any traditional clustering method and maintains its accuracy.
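One way to read the idea in this abstract is to replace the raw points by one center of gravity per grid cell and then run k-means on those few summaries. The sketch below follows that reading for 2-D data; the abstract does not give the actual procedure, so the cell layout, the weighting by cell population, and the names `grid_centroids` and `weighted_kmeans` are all assumptions.

```python
import random
from collections import defaultdict


def grid_centroids(points, cell_size):
    """Bucket 2-D points into square cells; return (centroid, count) per non-empty cell."""
    cells = defaultdict(list)
    for x, y in points:
        cells[(int(x // cell_size), int(y // cell_size))].append((x, y))
    summary = []
    for members in cells.values():
        n = len(members)
        cx = sum(px for px, _ in members) / n
        cy = sum(py for _, py in members) / n
        summary.append(((cx, cy), n))
    return summary


def weighted_kmeans(summary, k, iterations=20, seed=0):
    """Lloyd-style k-means on cell centroids, each weighted by its point count."""
    rng = random.Random(seed)
    centers = [center for center, _ in rng.sample(summary, k)]
    for _ in range(iterations):
        totals = [[0.0, 0.0, 0] for _ in range(k)]
        for (x, y), weight in summary:
            nearest = min(
                range(k),
                key=lambda i: (x - centers[i][0]) ** 2 + (y - centers[i][1]) ** 2,
            )
            totals[nearest][0] += weight * x
            totals[nearest][1] += weight * y
            totals[nearest][2] += weight
        # Keep the old center when a cluster received no cells.
        centers = [
            (sx / w, sy / w) if w else centers[i]
            for i, (sx, sy, w) in enumerate(totals)
        ]
    return centers
```

Each iteration then costs time proportional to the number of non-empty cells rather than the number of points, which is where a speed-up would come from under these assumptions.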

The Educational Contents Recommendation System Design based on Collaborative Filtering Method (협업 여과 기반의 교육용 컨텐츠 추천 시스템 설계)

  • Lee, Yong-Jun;Lee, Se-Hoon;Wang, Chang-Jong
    • The Journal of Korean Association of Computer Education
    • /
    • v.6 no.2
    • /
    • pp.147-156
    • /
    • 2003
  • Collaborative filtering is a popular technology in electronic commerce that adapts the opinions of entire communities to provide interesting products or personalized resources and items. It has been applied to many kinds of electronic commerce domains because it has proven to be an accurate and reliable tool, but educational applications remain limited. We design a collaborative filtering recommendation system that uses users' ratings for educational content recommendation. We also propose a method of similarity compensation using users' information to improve recommendation accuracy. Experimental comparisons of mean absolute error (MAE) and receiver operating characteristic (ROC) values show that the proposed method is more efficient than the traditional collaborative filtering method.
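For orientation, a conventional user-based collaborative filtering baseline and the MAE metric named in this abstract can be sketched as below. This is not the paper's method: the similarity compensation step is not described in the abstract and is left out, Pearson correlation is assumed as the similarity measure, and the names `pearson`, `predict` and `mae` are illustrative.

```python
from math import sqrt


def pearson(a, b):
    """Pearson correlation over items both users rated; 0.0 when undefined."""
    common = [item for item in a if item in b]
    if len(common) < 2:
        return 0.0
    mean_a = sum(a[i] for i in common) / len(common)
    mean_b = sum(b[i] for i in common) / len(common)
    num = sum((a[i] - mean_a) * (b[i] - mean_b) for i in common)
    den = sqrt(sum((a[i] - mean_a) ** 2 for i in common)) * sqrt(
        sum((b[i] - mean_b) ** 2 for i in common)
    )
    return num / den if den else 0.0


def predict(ratings, user, item):
    """Mean-centered, similarity-weighted average of other users' ratings for `item`."""
    mean_user = sum(ratings[user].values()) / len(ratings[user])
    num = den = 0.0
    for other, other_ratings in ratings.items():
        if other == user or item not in other_ratings:
            continue
        sim = pearson(ratings[user], other_ratings)
        mean_other = sum(other_ratings.values()) / len(other_ratings)
        num += sim * (other_ratings[item] - mean_other)
        den += abs(sim)
    return mean_user + num / den if den else mean_user


def mae(pairs):
    """Mean absolute error over (predicted, actual) rating pairs."""
    return sum(abs(predicted - actual) for predicted, actual in pairs) / len(pairs)
```

Here `ratings` is assumed to be a dict of the form `{user: {item: rating}}`. A compensation scheme such as the one the abstract mentions would presumably adjust the value returned by `pearson` before it is used in `predict`.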


A Study on Research Paper Classification Using Keyword Clustering (키워드 군집화를 이용한 연구 논문 분류에 관한 연구)

  • Lee, Yun-Soo;Pheaktra, They;Lee, JongHyuk;Gil, Joon-Min
    • KIPS Transactions on Software and Data Engineering
    • /
    • v.7 no.12
    • /
    • pp.477-484
    • /
    • 2018
  • Due to the advancement of computer and information technologies, numerous papers have been published. As new research fields continue to be created, users have a lot of trouble finding and categorizing papers of interest. To alleviate this difficulty, this paper presents a method of grouping similar papers and clustering them. The presented method extracts primary keywords from the abstract of each paper by using TF-IDF. Based on the extracted TF-IDF values, our method uses the K-means clustering algorithm to cluster papers that have similar contents. To demonstrate the practicality of the proposed method, we use paper data from the FGCS journal as actual data. Based on these data, we derive the number of clusters using the Elbow scheme and show clustering performance using the Silhouette scheme.
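The keyword extraction step can be illustrated with a small TF-IDF sketch. The abstract does not say which TF-IDF variant is used, so relative term frequency with a natural-log inverse document frequency is assumed here, and `tfidf` and `top_keywords` are hypothetical names.

```python
from collections import Counter
from math import log


def tfidf(docs):
    """docs: list of token lists. Returns one {term: weight} dict per document."""
    n_docs = len(docs)
    doc_freq = Counter(term for doc in docs for term in set(doc))
    vectors = []
    for doc in docs:
        counts = Counter(doc)
        vectors.append(
            {
                term: (count / len(doc)) * log(n_docs / doc_freq[term])
                for term, count in counts.items()
            }
        )
    return vectors


def top_keywords(vector, k=3):
    """The k terms with the highest TF-IDF weight in one document."""
    return sorted(vector, key=vector.get, reverse=True)[:k]
```

A term that appears in every document gets weight zero under this variant, so it never ranks as a primary keyword; the resulting vectors are what a k-means step would then cluster.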

Research of Term-Weighting Method in an Usenet Information Retrieval System (유즈넷 정보검색시스템에서 단어 가중치 적용방법에 관한연구)

  • 최재덕;최진석;박민식
    • Proceedings of the Korean Information Science Society Conference
    • /
    • 1998.10b
    • /
    • pp.339-341
    • /
    • 1998
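  • Usenet, one of many means of exchanging information, holds a vast amount of information. Because users cannot easily find the information they need on Usenet, they recognize the need for information retrieval across entire newsgroups and article bodies. This paper aims to improve retrieval effectiveness by refining the term-weighting method used when an information retrieval system is extended to Usenet. Using category frequency (cf), a factor other than tf and idf that affects the importance of a term in information retrieval, we propose and verify a similarity computation method that adds inverted category frequency (icf) to the tf*idf method. The experimental results show that the average number of relevant documents within the top 30 results is 4.6% higher with the $tf \cdot \sqrt{idf^2 + icf^2}$ method than with the tf*idf method.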
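The combined weight named at the end of this abstract, tf multiplied by the square root of idf² + icf², is straightforward to compute once idf and icf are available. In the sketch below only the `weight` formula comes from the abstract; defining icf as a log ratio by analogy with idf is an assumption, as are the function names.

```python
from math import log, sqrt


def idf(n_docs, doc_freq):
    """Inverse document frequency (natural log; the log base is an assumption)."""
    return log(n_docs / doc_freq)


def icf(n_categories, category_freq):
    """Inverted category frequency, assumed here to mirror idf over categories
    (for Usenet, a category could be a newsgroup)."""
    return log(n_categories / category_freq)


def weight(tf, idf_value, icf_value):
    """Term weight tf * sqrt(idf^2 + icf^2), the combination given in the abstract."""
    return tf * sqrt(idf_value ** 2 + icf_value ** 2)
```

With `icf_value` set to 0 the weight reduces to plain tf*idf, so under this reading the icf term can only raise the weight of terms concentrated in few categories.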

Detection of inappropriate advertising content on SNS using k-means clustering technique (k-평균 군집화 기법을 활용한 SNS의 부적절한 광고성 콘텐츠 탐지)

  • Lee, Dong-Hwan;Lim, Heui-Seok
    • Proceedings of the Korea Information Processing Society Conference
    • /
    • 2021.11a
    • /
    • pp.570-573
    • /
    • 2021
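  • As the number of people using SNS grows, the data being generated has increased in both volume and variety. However, not all of it is useful: inappropriate content, such as negative, antisocial, or gambling-related content, coexists with it. The need to filter content appropriately for each user is therefore increasing. In this study, we collected the hashtags of content on the SNS Instagram and built a dataset from them. We then applied the k-means clustering technique to group content with similar characteristics, and computed the silhouette coefficient and keyword diversity for each cluster to judge the appropriateness of the content.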
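The silhouette coefficient mentioned in this abstract can be computed as in the sketch below, which returns the mean over all points. It assumes the hashtags have already been turned into numeric vectors and that Euclidean distance is used; neither detail is stated in the abstract, and the keyword diversity measure is not defined there, so it is not shown.

```python
from math import dist


def silhouette(points, labels):
    """Mean silhouette coefficient; needs at least two distinct labels."""
    clusters = {}
    for point, label in zip(points, labels):
        clusters.setdefault(label, []).append(point)
    scores = []
    for point, label in zip(points, labels):
        own = clusters[label]
        if len(own) == 1:
            scores.append(0.0)  # common convention for singleton clusters
            continue
        # a: mean distance to the other members of the point's own cluster
        a = sum(dist(point, q) for q in own) / (len(own) - 1)
        # b: smallest mean distance to the members of any other cluster
        b = min(
            sum(dist(point, q) for q in members) / len(members)
            for other, members in clusters.items()
            if other != label
        )
        scores.append((b - a) / max(a, b) if max(a, b) else 0.0)
    return sum(scores) / len(scores)
```

Values near 1 indicate compact, well-separated clusters, and negative values indicate points that sit closer to another cluster than to their own. Averaging the per-point scores within one cluster instead of over all points would give a per-cluster figure.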

Performance Evaluation of a Cellular OFDM System with Heterogeneous MIMO Users (이질적인 MIMO 사용자들을 가진 셀룰러 OFDM 시스템의 성능 분석)

  • Oh Joon;Hwang Hyeon chyeol;Lim Jong kyoung;Kim Duk kyung;Kwak Kyung sup
    • The Journal of Korean Institute of Communications and Information Sciences
    • /
    • v.30 no.4A
    • /
    • pp.296-303
    • /
    • 2005
  • In this paper, we evaluate system performance and propose a signal separation and detection technique for the case where a user with one antenna shares the co-channel with a user with two space-time coded antennas. The proposed technique can identify co-channel users by an interference cancellation method and detect the signals by a maximum likelihood method. Simulation results show the shortcoming of the minimum mean-squared error (MMSE) technique, which can be applied to two users with the same number of antennas but cannot be applied to heterogeneous MIMO users with different numbers of antennas. We also apply the proposed scheme to an OFDM system and evaluate the system performance. Through simulations, we find that the proposed system performs the same as the existing system for single-antenna users and improves performance for two-antenna MIMO users.

CMFston : Synthetic User Programming Benchmark Based on UNIX (CMFston:유닉스 기반의 합성 사용자 프로그래밍 벤치마크)

  • Lee, Si-Jin;Park, Sung-Uk;Kwon, Hyeog-In
    • The Transactions of the Korea Information Processing Society
    • /
    • v.3 no.5
    • /
    • pp.1215-1228
    • /
    • 1996
  • The purpose of a benchmark program is to measure the performance of a computer system. The performance of a computer system is determined by the execution time of user application programs. Thus, it is assumed that a benchmark program must have the same features as the user application programs to be tested. In this paper, we have designed and implemented CMFstone, which consists of Chaustone, Mchaustone and Fhsstone. After applying CMFstone to real situations, the comparison results show that CMFstone is similar to the geometric mean of other benchmarks' results. Thus, we have concluded that CMFstone is good enough to measure the performance of a computer system.
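The comparison against the geometric mean of other benchmarks' results can be illustrated as follows. The abstract reports no figures, so `relative_gap` is a hypothetical helper that only shows the arithmetic of such a comparison.

```python
from statistics import geometric_mean


def relative_gap(score, other_scores):
    """Relative difference between one benchmark score and the geometric
    mean of other benchmarks' scores (all scores must be positive)."""
    reference = geometric_mean(other_scores)
    return abs(score - reference) / reference
```

The geometric mean is the usual choice for summarizing normalized benchmark ratios, since it gives the same ranking whichever machine is used as the reference.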
