• Title/Summary/Keyword: 데이터 선별 (data selection)

A Study on improvement of performance of collaborative filtering recommendation system using social data (소셜 데이터를 이용한 협업필터링 추천 시스템 성능 개선 연구)

  • Joo, Jong-Min;Yang, Hyung-Jeong;Kim, Nam-Hun;Park, Sung-Hyun;Lee, Gun-Woo
    • Proceedings of the Korea Information Processing Society Conference / 2017.11a / pp.660-663 / 2017
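  • As diverse social network services have developed and many people take part in social media, vast amounts of information are being generated. Accordingly, research on selecting and processing the desired information is also active. Collaborative filtering is an algorithm that recommends personalized items to users based on such information. However, accurate recommendation requires a very large amount of information. Collaborative filtering also suffers from the cold-start problem, in which recommendations are poor in the early stage. To address these problems, this paper uses data from Twitter, one of the social network services, to improve the performance of a collaborative filtering recommendation system. Tweets related to a specific item are collected, their positive/negative sentiment is measured, and the result is applied as a weight to the collaborative filtering ratings. In experiments evaluated with RMSE, applying the measured positive/negative influence of social media improved performance by about 5.5% over the conventional collaborative filtering approach.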
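
The weighting idea in the abstract above can be sketched roughly as follows. This is a minimal illustration, not the authors' implementation: the abstract does not say how the positive/negative measurement is applied to the ratings, so the score formula, the multiplicative blend, the `alpha` parameter, the 1 to 5 rating scale, and all function names are assumptions.

```python
import math


def sentiment_score(n_pos, n_neg):
    """Map positive/negative tweet counts to a score in [-1, 1] (assumed formula)."""
    total = n_pos + n_neg
    if total == 0:
        return 0.0  # no tweets about the item: leave the rating unchanged
    return (n_pos - n_neg) / total


def weighted_rating(cf_rating, n_pos, n_neg, alpha=0.1, lo=1.0, hi=5.0):
    """Adjust a collaborative-filtering rating by tweet sentiment.

    alpha (hypothetical) controls how strongly sentiment shifts the rating;
    the result is clamped to the assumed rating scale [lo, hi].
    """
    adjusted = cf_rating * (1.0 + alpha * sentiment_score(n_pos, n_neg))
    return max(lo, min(hi, adjusted))


def rmse(predicted, actual):
    """Root mean squared error between two equal-length rating sequences."""
    if len(predicted) != len(actual) or len(predicted) == 0:
        raise ValueError("inputs must be non-empty and of equal length")
    squared = sum((p - a) ** 2 for p, a in zip(predicted, actual))
    return math.sqrt(squared / len(predicted))
```

Comparing `rmse` for the plain and the sentiment-weighted predictions on held-out ratings is one way to run the kind of evaluation the abstract describes.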

Analysis on the Relation between Induced Longitudinal Voltage and Induced Noise Voltage caused by Electrified Railway system (고속전철시설에 의한 전력유도현상의 종전압과 잡음전압의 관계 분석)

  • Cho, Mun-Hwan;Lee, Sang-Mu;Cho, Pyung-Dong
    • Proceedings of the Korean Institute of Information and Communication Sciences Conference / 2011.10a / pp.589-592 / 2011
  • Induced longitudinal voltage and induced noise voltage are used in the analysis of the power induction phenomenon, and it is well known that they are in a subordinate relationship. In actual field measurements, however, these voltages sometimes do not show an exact subordinate relationship, which causes confusion. We therefore analyzed the correlation between induced longitudinal voltage and induced noise voltage using data actually measured in the field in 30 urban areas and 30 rural areas.

Methodology for Implementation of the Portable Disease Diagnosis Platform based on Neural Network Using High Performance Computing (고성능 컴퓨팅을 활용한 뉴럴 네트워크 기반의 휴대용 질병 진단 플랫폼 구현 방법론)

  • Kim, Sang-man;Park, Ju-Sung
    • Journal of IKEEE / v.22 no.4 / pp.1093-1098 / 2018
  • In this paper, we propose a methodology for implementing a portable disease diagnosis platform using high performance computing. The methodology consists of gathering clinical data, developing the diagnosis and feature selection algorithms, and implementing the diagnosis platform. To verify the algorithms, we used clinical data obtained from 401 people (314 normal subjects and 87 liver cancer patients) with a microarray consisting of 1,146 aptamers. As a result, we could diagnose liver cancer with 97.5% accuracy using the 32 selected aptamers. Based on these results, we designed and implemented a portable disease diagnosis platform that takes 32 bio-signals as inputs.

Search for a user-centered system design and implementation (사용자 중심 검색 시스템 설계 및 구현)

  • Kim, A-Yong;Park, Man-Seub;Kim, Jong-Moon;Jeong, Dae-Jin;Jung, Hoe-kyung
    • Proceedings of the Korean Institute of Information and Communication Sciences Conference / 2014.05a / pp.619-621 / 2014
  • Alongside advances in information technology, the latest IT technologies have become a topical issue. Many Web users struggle to sift through search data to find the information they need. In this paper, we propose a user-centered search system. The system is designed and implemented using the Lucene-based Apache projects Nutch and Solr together with Hadoop's MapReduce and HDFS. It collects and indexes data according to the intentions of Web search users and is expected to be useful in the search field.

An Exploratory Analysis of Korean News Topics of Chinese Students in Pandemic (팬데믹 상황의 중국인 유학생 뉴스 토픽에 대한 탐색적 분석)

  • Choi, Sook;JIN, XIANMEI
    • The Journal of the Korea Contents Association / v.21 no.6 / pp.218-227 / 2021
  • The purpose of this study was to examine what kind of discourse about foreigners appeared in the media while hatred toward foreigners prevailed during the pandemic. News data related to Chinese international students (CIS) were collected for 2020, and 11 optimal topics were derived through LDA analysis. The topics were analyzed at an exploratory level, focusing on their relationship with major events of the year. News about CIS in 2020 was closely linked to reports on the COVID-19 situation. Reports tended to presuppose CIS as potential confirmed patients.

Development of Sensor Placement Optimization Algorithm for Smart Container Control (스마트 컨테이너 제어를 위한 센서 위치 최적화 알고리즘 개발)

  • Kim, Jeong-ho;Jeon, Byeong-jin;Park, Byeong-jun;Lee, Sang-jin;Im, Hyeon-seok;Kim, Hyung-hoon
    • Proceedings of the Korea Information Processing Society Conference / 2022.11a / pp.1047-1049 / 2022
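  • Controlling a smart container requires sensors inside the container, but cost and system load increase as the number of sensors grows. This study presents a methodology for determining optimal sensor positions for monitoring the container interior, using interior temperature data obtained through CFD (Computational Fluid Dynamics) and a sensor placement optimization algorithm. Commercial CFD software is used to extract interior temperature data under assumed conditions inside and outside the container, and on that basis the spaces that represent the interior state are identified. The detection capability of a sensor attached to the container's inner wall is expressed as formulas of detection distance and angle, and these formulas are combined to quantify that capability. Based on this value, the sensor positions that detect the selected spaces are optimized among evenly distributed candidate positions, laying the groundwork for efficient container control.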

A Study on the Model of Internet Public Library in Korea (IPL-Korea) (인터넷 공공도서관 구축 모형 연구)

  • 고영만;오삼균
    • Journal of the Korean Society for Information Management / v.16 no.4 / pp.109-123 / 1999
  • We are faced with a paradox in the age of information: finding quality information on the Internet becomes a more challenging task because of information overload. This paper describes the prototype for the “IPL-Korea” (Internet Public Library in Korea) project, which is an attempt to provide the public with quality information in the form of a metadata system. The system involves cataloging resources, i.e. websites, that are filtered by library and information science majors as well as information professionals. The user focus of this system is on children, youth, women, and seniors; various classification schemes and resource descriptions relevant to each user group are incorporated into the system to allow efficient browsing of the resources. A thesaurus for “IPL-Korea”, based on the ERIC thesaurus, is being constructed for easy manipulation of the breadth of searching. The “IPL-Korea” metadata system employs the entity-relationship model in the design of its conceptual schema. Metadata is stored in the Oracle database system, and Web interfaces to this database are provided through ASP, ColdFusion, and Java technology.

A Selective Video Data Deletion Algorithm to Free Up Storage Space in Video Proxy Server (비디오 프록시 서버에서의 저장 공간 확보를 위한 선택적 동영상 데이터 삭제 알고리즘)

  • Lee, Jun-Pyo;Park, Sung-Han
    • Journal of the Institute of Electronics Engineers of Korea CI / v.46 no.4 / pp.121-126 / 2009
  • A video proxy server located near clients can store frequently requested video data in its storage space in order to significantly reduce initial latency and network traffic. However, because storage space in a video proxy server is limited, an appropriate deletion algorithm is needed to remove old video data that has not been served for a long time. We therefore propose an efficient video data deletion algorithm for video proxy servers. The proposed algorithm removes the video with the lowest request probability based on user access patterns. Specifically, the videos stored in the video proxy server are arranged by request time, and the video with the oldest request time is selected. The selected video is partially removed to free up storage space in the video proxy server. Simulation results show that the proposed algorithm performs better than other algorithms in terms of block hit rate and number of block deletions.
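
The selection step described in this abstract (order videos by request time, pick the oldest, remove part of it) can be sketched as below. This is only an illustration under assumptions: the cache layout (a dict of per-video records), the block-count bookkeeping, and the name `free_blocks` are hypothetical, and the abstract does not state which part of a video is removed or how much.

```python
def free_blocks(cache, blocks_needed):
    """Free at least `blocks_needed` blocks, oldest-requested videos first.

    cache: dict mapping video_id -> {"last_request": float, "blocks": int}
           (hypothetical layout; "blocks" is the number of cached blocks).
    Returns a list of (video_id, blocks_removed) in removal order.
    The cache is updated in place; a video is only partially removed
    when that is enough to satisfy the request.
    """
    removed = []
    # Arrange videos by request time so the oldest request comes first.
    by_age = sorted(cache, key=lambda vid: cache[vid]["last_request"])
    for vid in by_age:
        if blocks_needed <= 0:
            break
        take = min(cache[vid]["blocks"], blocks_needed)
        if take == 0:
            continue  # nothing cached for this video
        cache[vid]["blocks"] -= take
        blocks_needed -= take
        removed.append((vid, take))
    return removed
```

For example, with three cached videos and a request for six blocks, the sketch drains the least recently requested video first and then takes only the remainder from the next oldest one.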

Design 5Q MPI Hardware Unit Supporting Standard Mode (표준 모드를 지원하는 5Q MPI 하드웨어 유닛 설계)

  • Park, Jae-Won;Chung, Won-Young;Lee, Seung-Woo;Lee, Yong-Surk
    • The Journal of Korean Institute of Communications and Information Sciences / v.37 no.1B / pp.59-66 / 2012
  • The use of MPSoCs has been increasing with the growing use of mobile devices and complex applications. To improve MPSoC performance, the number of processors has been increasing. Standard MPI is used to send data efficiently in a distributed memory architecture, which is advantageous for multiprocessor systems. In this paper, we propose a scalable distributed memory system with a low-cost hardware message passing interface (MPI). The proposed architecture improves the transfer rate by using buffered send for small packets. Three queues (Ready Queue, Request Queue, and Reservation Queue) work as in the previous architecture, and two queues (Small Ready Queue and Small Request Queue) are added to send small packets. When the critical point is set to 8 bytes, the proposed architecture achieves more than a twofold performance improvement for data below the critical point.
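
The queue split described in this abstract can be modeled in software roughly as follows. This is a sketch of the routing decision only, not the hardware design: the function name `select_queues` is hypothetical, and treating a packet of exactly the critical size as a normal packet is an assumption drawn from the phrase "below the critical point".

```python
CRITICAL_POINT_BYTES = 8  # critical point used in the abstract's evaluation


def select_queues(packet_size, critical_point=CRITICAL_POINT_BYTES):
    """Return the names of the queues that would handle a packet of this size.

    Packets strictly below the critical point use the two added small-packet
    queues (the buffered-send path); all others use the three original queues.
    """
    if packet_size < 0:
        raise ValueError("packet_size must be non-negative")
    if packet_size < critical_point:
        return ("Small Ready Queue", "Small Request Queue")
    return ("Ready Queue", "Request Queue", "Reservation Queue")
```

Keeping the threshold as a parameter reflects the abstract's wording that the critical point is something that is set, with 8 bytes being the value used for the reported result.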

Current Status Analysis of Business Units and Retention Period Estimation related to Administrative Information Systems of Public Institutions (공공기관 행정정보시스템 관련 단위과제 및 보존기간 책정 현황분석)

  • Yoon, Sung-Ho;Yu, Sin Seong;Choi, Kippeum;Oh, Hyo-Jung
    • Journal of the Korean BIBLIA Society for Library and Information Science / v.31 no.2 / pp.139-160 / 2020
  • Since the Public Records Management Act was enacted in 2007, the administrative information system has been included in the electronic records production system, and datasets have been subject to records management as a type of electronic record. With the recent revision of the enforcement decree, dataset records management has been put into effect. This study analyzes business units related to the administrative information systems of public institutions and examines the current status of retention period estimation. For this purpose, we collected 36 records classification systems from 49 public institutions among the agencies directly managed by the National Archives and the disaster management agencies. We identified 824 business units related to administrative information systems and divided them into large and small groups by type. We also compared the retention period estimates for the records. The problems and improvement plans identified in this study are expected to serve as basic data for preparing standards for administrative dataset management in the future.