Search | Korea Science

Building Data Warehouse System for Weblog Analysis (웹로그 분석을 위한 데이터 웨어하우스 시스템 구축)

Lee, Joo-Il;Baek, Kyung-Min;Shin, Joo-Hahn;Lee, Won-Suk
- 한국IT서비스학회:학술대회논문집
- /
- 2010.05a
- /
- pp.291-295
- /
- 2010
최근 급격한 하드웨어 기술과 데이터베이스 시스템의 발전은 우리 주변에서 발생하는 다양한 분야의 데이터를 자동으로 수집하는 것을 가능하게 하였다. 흔히 데이터 스트림(data stream)이라고 언급되는 끊임없이 생산되는 대용량의 데이터를 효율적으로 처리하여 유용한 정보를 얻어내는 기술은 이미 많은 응용 분야에서 광범위하게 연구되고 있다. 인터넷은 이러한 데이터 스트림을 양산해 내는 주요 원천 중의 하나이다. 인터넷 비즈니스의 활성화와 더불어 웹로그 데이터 스트림은 마케팅, 전략 수립, 고객관리 등 여러 부분에 광범위하게 활용되기 시작했으며, 보다 정확하고 효율적인 분석에 대한 요구사항도 점점 늘어나고 있다. 데이터 웨어하우스(Data Warehouse)는 수집된 데이터를 주제 기반으로 통합하여 시계열 형태로 적재하는 저장소서 유용한 분석이나 의사결정에 많이 사용되어 왔다. 데이터웨어하우스는 데이터를 요약하고 통합 및 정제하는 기능을 제공하여 대용량의 데이터 처리에 적합하고 데이터의 품질을 향상시키기 때문에 데이터 마이닝 분야에서 전처리 과정으로도 많이 이용되어 왔다. 본 논문에서는 웹로그 데이터 스트림에 대한 데이터 웨어하우스를 구축하여 보다 고품질의 유용한 정보를 효율적으로 얻어내는 시스템을 제안한다.
PDF

The Analysis Framework for User Behavior Model using Massive Transaction Log Data (대규모 로그를 사용한 유저 행동모델 분석 방법론)

Lee, Jongseo;Kim, Songkuk
- The Journal of Bigdata
- /
- v.1 no.2
- /
- pp.1-8
- /
- 2016
User activity log includes lots of hidden information, however it is not structured and too massive to process data, so there are lots of parts uncovered yet. Especially, it includes time series data. We can reveal lots of parts using it. But we cannot use log data directly to analyze users' behaviors. In order to analyze user activity model, it needs transformation process through extra framework. Due to these things, we need to figure out user activity model analysis framework first and access to data. In this paper, we suggest a novel framework model in order to analyze user activity model effectively. This model includes MapReduce process for analyzing massive data quickly in the distributed environment and data architecture design for analyzing user activity model. Also we explained data model in detail based on real online service log design. Through this process, we describe which analysis model is fit for specific data model. It raises understanding of processing massive log and designing analysis model.
PDF

Early Prediction Model of Student Performance Based on Deep Neural Network Using Massive LMS Log Data (대용량 LMS 로그 데이터를 이용한 심층신경망 기반 대학생 학업성취 조기예측 모델)

Moon, Kibum;Kim, Jinwon;Lee, Jinsook
- The Journal of the Korea Contents Association
- /
- v.21 no.10
- /
- pp.1-10
- /
- 2021
Log data accumulated in the Learning Management System (LMS) provide high-quality information for the learning process of students. Until now, various studies have been conducted to predict students' academic achievement using LMS log data. However, previous studies were based on relatively small sample sizes of students and courses, limiting the possibility of generalization. This study developed and validated a deep neural network model for the early prediction of academic achievement of college students using massive LMS log data. To this end, we used 78,466,385 cases of LMS log data and 165,846 cases of grade data. The proposed model predicted the excellent-grade students with a high level of accuracy from the beginning of the semester. Meanwhile, the prediction accuracy for the moderate and underachieving groups was relatively low, but the accuracy improved as the time points of the prediction were delayed. This study is meaningful in that we developed an early prediction model based on a deep neural network with sufficient accuracy for practical utilization by only using LMS log data.
https://doi.org/10.5392/JKCA.2021.21.10.001 인용 PDF KSCI HTML

Integrated Monitoring System using Log Data (로그 데이터를 이용한 통합모니터링 시스템)

Jeon, Byung-Jin;Yoon, Deok-Byeong;Shin, Seung-Soo
- Journal of Convergence for Information Technology
- /
- v.7 no.1
- /
- pp.35-42
- /
- 2017
In this paper, we propose to implement an integrated monitoring system using log data to reduce the load of analysis task of information security officer and to detect information leak in advance. To do this, we developed a transmission module between different model DBMS that transmits large amount of log data generated by the individual security system (MSSQL) to the integrated monitoring system (ORACLE), and the transmitted log data is digitized by individual and individual and researches about the continuous inspection and measures against malicious users when the information leakage symptom is detected by using the numerical data.
https://doi.org/10.22156/CS4SMB.2017.7.1.035 인용 PDF KSCI

Decision Tree Based Application Recommendation System (의사결정트리 기반 애플리케이션 추천 시스템)

Kim, Doo-Hyeong;Shin, Jae-Myong;Park, Sang-Won
- Proceedings of the Korean Information Science Society Conference
- /
- 2012.06d
- /
- pp.140-142
- /
- 2012
최근 상황인지에 관한 연구가 활발히 진행되고 있으며 스마트폰의 각종 센서를 통해 사용자의 컨텍스트 파악이 가능해졌다. 이에 따라서 스마트폰의 컨텍스트 파악을 통해서 사용자에게 각종 친화적 서비스 모델이 많이 생겨 나고 있다. 사용자의 경로 추론, 실내에서의 사용자의 위치파악, 사용자 위치기반 편의시설 추천 등이 그 예이며, 그 중 애플리케이션 추천은 대표적인 서비스라 할 수 있다. 애플리케이션 추천은 사용자의 컨텍스트에 따라서 애플리케이션 사용내역을 로그 데이터로 만들고, 로그 데이터를 기반으로 컨텍스트에 따라서 사용자의 애플리케이션 추천을 해주는 시스템이다. 여기서 로그 데이터를 가공하지 않고 통계를 통해 추천이 가능하지만, 로그 데이터를 사용하여 의사 결정 트리를 만들게 되면 보다 정확하고, 빠르게 추천이 가능하며 적은 로그 데이터로 더 많은 컨텍스트에 적용하여 추천 할 수 있다는 이점이 있다. 본 논문에서는 사용자의 컨텍스트 추출하고 이 데이터를 기반으로 의사결정트리를 만들어 앱을 추천하는 시스템을 제안한다. 이러한 컨텍스트 수집 방법과 추론모델을 이용한 애플리케이션 추천 시스템은 추후 사용자 친화적 서비스 연구에 많은 도움이 될 것이다.

Implementation of a remote log-data collecting system for the analysis on smartphone usage pattern (스마트폰 사용패턴 분석을 위한 원격 로그데이터 수집 시스템 구현)

Song, Hyun-Ji;Lee, Min-Kyung;Chung, Hee-Won;Yu, Seok-Jong
- Proceedings of the Korea Information Processing Society Conference
- /
- 2014.04a
- /
- pp.237-239
- /
- 2014
다수 사용자의 스마트폰 사용패턴을 협업적인 방법으로 분석할 경우 모바일 기기에 대한 선호도 분석, 과몰입 정도 판단 등 다양한 관련 연구에 활용될 수 있다. 본 연구는 스마트폰의 사용패턴 분석을 통한 사용자 맞춤형 서비스 개발을 위하여 로그데이터를 추출하여 서버에 저장하는 시스템을 설계하고 구현하는 것을 목표로 한다. 사용자의 스마트 폰 로그데이터를 수집하기 위하여 모바일앱을 개발하고 모바일앱을 통해서 추출된 로그데이터를 저장할 서버 DB 를 구축하고 유사성 분석을 위한 협업필터링 엔진을 개발하였다. 개발된 시스템의 성능 평가를 위하여 일부 사용자에 대한 사용패턴 데이터셋 구축 실험을 수행하였으며 후속 연구를 위한 실험 환경을 설계하였다.
https://doi.org/10.3745/PKIPS.y2014m04a.237 인용 PDF

Comparative Analysis of Security Schemes for Log System Providing Forward Security (전방 안전성이 보장되는 로그 시스템 보안기법 비교분석)

Kang, Seok-Gyu;Park, Chang-Seop
- Convergence Security Journal
- /
- v.15 no.7
- /
- pp.85-96
- /
- 2015
In IT system, logs are an indicator of the previous key events. Therefore, when a security problem occurs in the system, logs are used to find evidence and solution to the problem. So, it is important to ensure the integrity of the stored logs. Existing schemes have been proposed to detect tampering of the stored logs after the key has been exp osed. Existing schemes are designed separately in terms of log transmission and storage. We propose a new log sys tem for integrating log transmission with storage. In addition, we prove the security requirements of the proposed sc heme and computational efficiency with existing schemes.
PDF KSCI

A Lifelog Management System Based on the Relational Data Model and its Applications (관계 데이터 모델 기반 라이프로그 관리 시스템과 그 응용)

Song, In-Chul;Lee, Yu-Won;Kim, Hyeon-Gyu;Kim, Hang-Kyu;Haam, Deok-Min;Kim, Myoung-Ho
- Journal of KIISE:Computing Practices and Letters
- /
- v.15 no.9
- /
- pp.637-648
- /
- 2009
As the cost of disks decreases, PCs are soon expected to be equipped with a disk of 1TB or more. Assuming that a single person generates 1GB of data per month, 1TB is enough to store data for the entire lifetime of a person. This has lead to the growth of researches on lifelog management, which manages what people see and listen to in everyday life. Although many different lifelog management systems have been proposed, including those based on the relational data model, based on ontology, and based on file systems, they have all advantages and disadvantages: Those based on the relational data model provide good query processing performance but they do not support complex queries properly; Those based on ontology handle more complex queries but their performances are not satisfactory: Those based on file systems support only keyword queries. Moreover, these systems are lack of support for lifelog group management and do not provide a convenient user interface for modifying and adding tags (metadata) to lifelogs for effective lifelog search. To address these problems, we propose a lifelog management system based on the relational data model. The proposed system models lifelogs by using the relational data model and transforms queries on lifelogs into SQL statements, which results in good query processing performance. It also supports a simplified relationship query that finds a lifelog based on other lifelogs directly related to it, to overcome the disadvantage of not supporting complex queries properly. In addition, the proposed system supports for the management of lifelog groups by providing ways to create, edit, search, play, and share them. Finally, it is equipped with a tagging tool that helps the user to modify and add tags conveniently through the ion of various tags. This paper describes the design and implementation of the proposed system and its various applications.
PDF KSCI

Real time predictive analytic system design and implementation using Bigdata-log (빅데이터 로그를 이용한 실시간 예측분석시스템 설계 및 구현)

Lee, Sang-jun;Lee, Dong-hoon
- Journal of the Korea Institute of Information Security & Cryptology
- /
- v.25 no.6
- /
- pp.1399-1410
- /
- 2015
Gartner is requiring companies to considerably change their survival paradigms insisting that companies need to understand and provide again the upcoming era of data competition. With the revealing of successful business cases through statistic algorithm-based predictive analytics, also, the conversion into preemptive countermeasure through predictive analysis from follow-up action through data analysis in the past is becoming a necessity of leading enterprises. This trend is influencing security analysis and log analysis and in reality, the cases regarding the application of the big data analysis framework to large-scale log analysis and intelligent and long-term security analysis are being reported file by file. But all the functions and techniques required for a big data log analysis system cannot be accommodated in a Hadoop-based big data platform, so independent platform-based big data log analysis products are still being provided to the market. This paper aims to suggest a framework, which is equipped with a real-time and non-real-time predictive analysis engine for these independent big data log analysis systems and can cope with cyber attack preemptively.
https://doi.org/10.13089/JKIISC.2015.25.6.1399 인용 PDF KSCI HTML

Refining massive event logs to evaluate performance measures of the container terminal (컨테이너 터미널 성능평가를 위한 대용량 이벤트 로그 정제 방안 연구)

Park, Eun-Jung;Bae, Hyerim
- The Journal of Bigdata
- /
- v.4 no.1
- /
- pp.11-27
- /
- 2019
There is gradually being a decrease in earnings rate of the container terminals because of worsened business environment. To enhance global competitiveness of terminal, operators of the container terminal have been attempting to deal with problems of operations through analyzing overall the terminal operations. For improving operations of the container terminal, the operators try to efforts about analyzing and utilizing data from the database which collects and stores data generated during terminal operation in real time. In this paper, we have analyzed the characteristics of operating processes and defined the event log data to generate container processes and CKO processes using stored data in TOS (terminal operating system). And we have explained how imperfect event logs creating non-normal processes are refined effectively by analyzing the container and CKO processes. We also have proposed the framework to refine the event logs easily and fast. To validate the proposed framework we have implemented it using python2.7 and tested it using the data collected from real container terminal as input data. In consequence we could have verified that the non-normal processes in the terminal operations are greatly improved.
PDF

Search Result 886, Processing Time 0.033 seconds

이메일무단수집거부

이용약관

제 1 장 총칙

제 2 장 이용계약의 체결

제 3 장 계약 당사자의 의무

제 4 장 서비스의 이용

제 5 장 계약 해지 및 이용 제한

제 6 장 손해배상 및 기타사항

Detail Search

Image Search (β)