Search | Korea Science

Preprocessing Methods for Improving Prediction Accuracy in Recommender Systems (추천 시스템의 예측 정확도 향상을 위한 전처리 방법)

박석인;김택헌;류영석;양성봉
- Proceedings of the Korean Information Science Society Conference
- /
- 2002.04b
- /
- pp.247-249
- /
- 2002
협력적 여과(collaborative filtering) 방법을 사용하는 추천 시스템에서 예측 정확도를 높이는 방법들 중 하나는 군집화(clustering)방법이 있다. 군집화 방법은 선호도가 유사한 사용자들을 미리 같은 군집으로 만들고, 군집 내에 속한 사용자들을 이웃으로 선정하여 예측을 수행하기 때문에 군집화의 결과가 예측의 정확도에 직접적인 영향을 주게 된다. 본 연구에서는 군집화 결과의 향상을 위해 데이터를 전 처리하는 두 가지 방법과 군집화의 특성을 이용한 새로운 예측식을 제안하고, 기존 연구 방법과의 비교 실험을 통해 실험결과를 분석한다.
PDF

Crowd Analysis System Using Human Recognition and Clustering Techniques (사람인식 및 클러스터링 기법을 이용한 군집분석 시스템)

Tae-jeong Park;Ji-ho Park;Bo-yoon Seo;Jun-ha Shin;Kyung-hwan Choi;Hongseok Yoo
- Proceedings of the Korean Society of Computer Information Conference
- /
- 2023.07a
- /
- pp.485-487
- /
- 2023
최근 코로나 19 방역지침 해제로 인한 대면적인 활동이 많아지면서 사람에 대한 서비스 제공이 중요한 이슈가 되었다. 하지만 사람들이 밀집되어있는 곳에서는 서비스가 원할하게 이루어지지 않는 경우가 대부분이다. 본 논문에서는 객체인식 알고리즘 기술인 Yolo와 OpenCv를 통해 카메라로 영상 속의 사람들을 인식하여 군집화 기술인 K-means 클러스터링을 이용해서 사람에 대한 군집화를 진행후 우선순위를 선정하고 좌표를 지정하여서 로봇이 군집의 좌표로 이동하여서 사람들에게 직접 접근하여 서비스를 제공할 수 있도록 하였다.
PDF

Rule extraction from trained neural network using NofM algorithm with improved clustering step (개선된 군집화 단계의 NofM 알고리즘을 이용한 훈련된 신경망으로부터의 규칙추출)

Lee, Han-Yul;Ra, Jong-Hei;Kim, Moon-Hyun
- Proceedings of the Korea Information Processing Society Conference
- /
- 2001.10a
- /
- pp.581-584
- /
- 2001
신경망이 만들어내는 출력에 대한 정보는 수치적으로 분산되어 신경망에 저장되므로, 인간이 직접 해석하기가 힘들다. 본 논문에서는 LRE(link rule extraction)기법인 NofM 알고리즘의 6단계 중에서 초기 단계인 가중치 군집화 단계를 개선하여 추출되는 규칙들의 전제부에 들어가는 규칙 조건들의 수를 조절함으로써, 추출된 규칙이 입력 특성에 대한 정보를 과잉 일반화하거나, 과잉 구체화하는 것을 피할 수 있음을 실험을 통해 보였다. 일반적으로 NofM 알고리즘에서 가중치들을 군집화한 때는 Join 알고리즘을 사용하는데, 본 논문에서는 Join 알고리즘의 Join condition을 0.05부터 0.25까지 0.05씩 점진적으로 확대하여 클러스터링을 하여줌으로써 신경망의 출력에 중요한 역할을 하는 가중치들을 효과적으로 군집화함을 보였다.
PDF

Feature Points Clustering For Panorama Construction (파노라마 생성을 위한 특징점 클러스터링)

Kim, Tae-Woo
- Proceedings of the KAIS Fall Conference
- /
- 2007.11a
- /
- pp.209-210
- /
- 2007
불변 특징 기반의 파노라마 생성 방법은 직접 방법에 비해 비교적 처리 속도가 빠르다. 파노라마 생성 과정에서 특징점 추출과 특징 정합에 대부분의 시간이 소요된다. 본 논문에서는 파노라마 생성을 위한 특징점 클러스터링 방법을 제안한다. LoG 영상에서 특징점들을 추출한 후, 클러스터링을 통해 특징점들을 군집화한다. 군집도가 강한 특징점들은 그렇지 않은 특징점들보다 더 의미 있으므로, 파노라마 생성에서 군집도가 약한 군집을 배제함으로써 정확도가 높아지고 처리 시간이 빨라지는 장점이 있다. 실험에서 $320{\times}240$ 크기의 칼라 영상에 대해 제안한 방법의 처리 시간이 약2.0초로 클러스터링 처리를 하지 않는 방법에 비해 약 2배 빠른 결과를 보였다.
PDF

A new cluster validity index based on connectivity in self-organizing map (자기조직화지도에서 연결강도에 기반한 새로운 군집타당성지수)

Kim, Sangmin;Kim, Jaejik
- The Korean Journal of Applied Statistics
- /
- v.33 no.5
- /
- pp.591-601
- /
- 2020
The self-organizing map (SOM) is a unsupervised learning method projecting high-dimensional data into low-dimensional nodes. It can visualize data in 2 or 3 dimensional space using the nodes and it is available to explore characteristics of data through the nodes. To understand the structure of data, cluster analysis is often used for nodes obtained from SOM. In cluster analysis, the optimal number of clusters is one of important issues. To help to determine it, various cluster validity indexes have been developed and they can be applied to clustering outcomes for nodes from SOM. However, while SOM has an advantage in that it reflects the topological properties of original data in the low-dimensional space, these indexes do not consider it. Thus, we propose a new cluster validity index for SOM based on connectivity between nodes which considers topological properties of data. The performance of the proposed index is evaluated through simulations and it is compared with various existing cluster validity indexes.
https://doi.org/10.5351/KJAS.2020.33.5.591 인용 PDF KSCI

Ubiquitous Cyber Aquarium Using Schooling Animation (군집 애니메이션을 이용한 유비쿼터스 사이버 아쿠아리움)

Kim, Jong-Chan;Cho, Seung-Il;Kim, Eung-Kon
- Journal of Korea Multimedia Society
- /
- v.11 no.10
- /
- pp.1427-1435
- /
- 2008
It is worth it to properly present the behaviors of cyberspace and game characters. It is not so easy to manually supply the actions for all objects one by one, so we need to use crowd animation techniques in order to display automatically, realistically, and effectively. In this paper, we construct a ubiquitous cyber aquarium based on sensors, using the crowd behavior library, which is applied to programming crowd behavior in cyberspace and game applications. We use a sensor board to become aware of a person approaching the cyber aquarium, and according to the perceived distance, the objects in the cyber aquarium automatically react and behave accordingly.
PDF

Medical Document Clustering using the Growing Hierarchical SOM (신경망 GHSOM을 이용한 의료 문헌 정보의 군집화)

Heo, Jin-Seok;Kim, In-Cheol
- Proceedings of the Korea Information Processing Society Conference
- /
- 2002.04a
- /
- pp.519-522
- /
- 2002
일반적으로 PubMed와 같은 인터넷을 이용한 대규모 의료 문헌정보 검색시스템에서 포괄적인 주제어나 간결한 주제어를 이용한 검색을 시도할 경우, 종종 매우 다양한 세부주제의 문헌리스트들이 다량으로 검색된다. 이러한 경우 이용자는 실제로 본인이 원했던 세부주제에 부합되는 문헌들을 찾기 위해서는 검색결과로 주어진 긴 문헌리스트상의 문헌 하나하나에 대해 다시 문헌제목이나 혹은 요약 등의 내용을 직접 읽어보고 내용을 확인하여야 한다. 이러한 작업은 매우 번거럽고 시간과 노력을 많이 필요로 한다. 따라서 본 논문에서는 이러한 노력을 줄이기 위한 한 가지 방안으로, PubMed 시스템의 주제어 검색결과로 주어진 문헌들에 대해 내용의 유사성과 차별성에 따라 자동으로 몇 개의 그룹으로 나누어주는 군집화시스템 MedCluster의 설계와 구현에 대해 소개한다. MedCluster의 큰 특징은 기존의 문서 군집화 방법과는 다른 신경망 GHSOM을 이용한 군집화 방법을 사용하는 점이다. GHSOM은 미리 문서 그룹의 개수를 정해줄 필요가 없고 다양한 레벨의 문서 그룹들을 얻을 수 있는 계층적 군집화를 이루어낸다는 장점을 가지고 있다. 본 논문에서는 신경망 GHSOM의 구조와 특성에 대해 간략히 살펴보고, GHSOM을 채용한 의료문헌 군집화시스템 MedCluster의 설계와 구현에 대해 설명한다.
PDF

Sparse Document Data Clustering Using Factor Score and Self Organizing Maps (인자점수와 자기조직화지도를 이용한 희소한 문서데이터의 군집화)

Jun, Sung-Hae
- Journal of the Korean Institute of Intelligent Systems
- /
- v.22 no.2
- /
- pp.205-211
- /
- 2012
The retrieved documents have to be transformed into proper data structure for the clustering algorithms of statistics and machine learning. A popular data structure for document clustering is document-term matrix. This matrix has the occurred frequency value of a term in each document. There is a sparsity problem in this matrix because most frequencies of the matrix are 0 values. This problem affects the clustering performance. The sparseness of document-term matrix decreases the performance of clustering result. So, this research uses the factor score by factor analysis to solve the sparsity problem in document clustering. The document-term matrix is transformed to document-factor score matrix using factor scores in this paper. Also, the document-factor score matrix is used as input data for document clustering. To compare the clustering performances between document-term matrix and document-factor score matrix, this research applies two typed matrices to self organizing map (SOM) clustering.
https://doi.org/10.5391/JKIIS.2012.22.2.205 인용 PDF KSCI

Cluster analysis for highway speed according to patterns and effects (고속도로 구간별 통행속도의 패턴과 영향에 따른 군집분석)

Kim, Byungsoo;An, Soyoung;Son, Jungmin;Park, Hyemi
- The Korean Journal of Applied Statistics
- /
- v.29 no.5
- /
- pp.949-960
- /
- 2016
This paper uses all sections of highway data (VDS) for two years (Jan. 2014-Dec. 2015), with 15 minute units. The first purpose of this study is to find clusters with similar patterns that appear repeatedly with time variables of month, week and hour. The cluster analysis results indicate a variety of patterns of average traffic speeds by time variables depending on the clusters; subsequently, these can be utilized to model for the forecast of the speed at a specific time. The second purpose is to do cluster analysis for grouping sections by effect nets that are closely related to each other. For the similarity measure we use cross-correlation functions calculated after pre-whitening the speed of each section. The cluster analysis gets 19 clusters, and sections within a cluster are geographically close. These results are expected to help to forecast a real-time speed.
https://doi.org/10.5351/KJAS.2016.29.5.949 인용 PDF KSCI

Daily Behavior Pattern Extraction using Time-Series Behavioral Data of Dairy Cows and k-Means Clustering (행동 시계열 데이터와 k-평균 군집화를 통한 젖소의 일일 행동패턴 검출)

Lee, Seonghun;Park, Gicheol;Park, Jaehwa
- Journal of Software Assessment and Valuation
- /
- v.17 no.1
- /
- pp.83-92
- /
- 2021
There are continuous and tremendous attempts to apply various sensor systems and ICTs into the dairy science for data accumulation and improvement of dairy productivity. However, these only concerns the fields which directly affect to the dairy productivity such as the number of individuals and the milk production amount, while researches on the physiology aspects of dairy cows are not enough which are fundamentally involved in the dairy productivity. This paper proposes the basic approach for extraction of daily behavior pattern from hourly behavioral data of dairy cows to identify the health status and stress. Total four clusters were grouped by k-means clustering and the reasonability was proved by visualization of the data in each groups and the representatives of each groups. We hope that provided results should lead to the further researches on catching abnormalities and disease signs of dairy cows.
https://doi.org/10.29056/jsav.2021.06.11 인용

Search Result 171, Processing Time 0.026 seconds

이메일무단수집거부

이용약관

제 1 장 총칙

제 2 장 이용계약의 체결

제 3 장 계약 당사자의 의무

제 4 장 서비스의 이용

제 5 장 계약 해지 및 이용 제한

제 6 장 손해배상 및 기타사항

Detail Search

Image Search (β)