• 제목/요약/키워드: Distributed data collection

검색결과 237건 처리시간 0.025초

효과적인 웹 사용자의 패턴 분석을 위한 하둡 시스템의 웹 로그 분석 방안 (A Method for Analyzing Web Log of the Hadoop System for Analyzing a Effective Pattern of Web Users)

  • 이병주;권정숙;고기철;최용락
    • 한국IT서비스학회지
    • /
    • 제13권4호
    • /
    • pp.231-243
    • /
    • 2014
  • Of the various data that corporations can approach, web log data are important data that correspond to data analysis to implement customer relations management strategies. As the volume of approachable data has increased exponentially due to the Internet and popularization of smart phone, web log data have also increased a lot. As a result, it has become difficult to expand storage to process large amounts of web logs data flexibly and extremely hard to implement a system capable of categorizing, analyzing, and processing web log data accumulated over a long period of time. This study thus set out to apply Hadoop, a distributed processing system that had recently come into the spotlight for its capacity of processing large volumes of data, and propose an efficient analysis plan for large amounts of web log. The study checked the forms of web log by the effective web log collection methods and the web log levels by using Hadoop and proposed analysis techniques and Hadoop organization designs accordingly. The present study resolved the difficulty with processing large amounts of web log data and proposed the activity patterns of users through web log analysis, thus demonstrating its advantages as a new means of marketing.

Service Quality and Consumer Satisfaction: An Empirical Study in Indonesia

  • LUKMAN, Lukman;SUJIANTO, Agus Eko;WALUYO, Agus;YAHYA, Muchlis
    • The Journal of Asian Finance, Economics and Business
    • /
    • 제8권5호
    • /
    • pp.971-977
    • /
    • 2021
  • The purpose of this research paper is: (1) to describe the service quality index; (2) describe the data quality index; and (3) describe the anti-corruption index of BPS Trenggalek, Indonesia. The approach chosen is quantitative with the type of survey research. The primary data collection technique was mainly based on a questionnaire distributed to 40 respondents, namely BPS service users in 5 (five) categories: the private sector, the banking industry, academics, offices, or agencies in Trenggalek Regency and universities. The results showed that the quality of BPS services was good and the data quality index where the respondents were satisfied with the data presented by BPS. Meanwhile, testing the anti-corruption index shows that BPS Trenggalek is very anti-corruption in providing services to consumers. The findings of this study suggested that to improve service quality, it is necessary to pay attention to several aspects, including published service requirements, easy requirements to be fulfilled, published procedure information, clear service process flow, published service times, and costs/tariffs are communicated. This study suggests updating data, data relevance, data accessibility, and data completeness to improve data quality. Furthermore, to maintain the very anti-corruption predicate, this study suggests maintaining service by upholding the prevailing ethics and norms.

Consumer Dissatisfaction Factors and Purchase Behaviors of Backpacks

  • Kim, Mi-Sook;Kim, So-Young
    • The International Journal of Costume Culture
    • /
    • 제3권2호
    • /
    • pp.147-160
    • /
    • 2000
  • The purposes of the present study were to investigate the factors of consumer dissatisfaction with backpacks and to examine if the levels of dissatisfaction with the factors differ significantly among the groups determined by demographic characteristics and purchase behaviors. The differences in the purchase behaviors of backpacks were also tested among the groups determined by demographic characteristics. Data collection was consisted of two pilot tests and the final test. The questionnaire was distributed to 40 students of universities in Seoul from July 1 to 13 in 2000 and 351 were usable. Data were analyzed by factor analysis, t-test, χ²analysis, MANOVA, ANAOVA, Duncan's multiple range tests using SPSS PC/sup +/ Program. Five factors were formulated : durability, ease-of-care/color fastness, dimensional stability, wearability and design. Subjects were most dissatisfied with the dimensional stability of backpacks. Different demographic characteristics and purchase behaviors resulted in significantly different levels of dissatisfaction with selected factors. Significant differences were also found in selected selection criteria and purchase behaviors among groups determined by some demographic characteristics.

  • PDF

Antecedents to Entrepreneurship Behavior: Moderating Role of Social Support and Entrepreneurial Self Efficacy among Business Students

  • Ava Shrestha;Sateesh Kumar Ojha
    • Journal of Information Technology Applications and Management
    • /
    • 제30권3호
    • /
    • pp.15-35
    • /
    • 2023
  • Considerable agreement exists about the importance of promoting entrepreneurship in both developed and developing countries. In less developed countries, governments see entrepreneurship as a way to stimulate economic development and tackle serious economic and social challenges. So how can countries encourage young people to become entrepreneurs? Research confirms that intentions play an important role in the decision to start a new firm and many factors influence that intentions. The purpose of the study was to investigate the antecedents to entrepreneurship behavior with particular attention to moderating role of social support and entrepreneur self-efficacy. The study covered 116 business students of undergraduate and post graduate level studying under different universities in Kathmandu, Nepal. The questionnaire for data collection was distributed in college groups via WhatsApp and viber with the support and permission from the college administration. The study design used was correlational with a sampling procedure of convenience. The study only showed the impact of attitude to entrepreneurship behavior as well as moderating effect of social support was also observed.

Study on Stress and Quality of life of people with Moderate Disabilities

  • Young Ran Kim;JungHyun Kim
    • International Journal of Advanced Culture Technology
    • /
    • 제12권2호
    • /
    • pp.215-220
    • /
    • 2024
  • This study was conducted to explore the relationship between stress and the quality of life experienced by people with disabilities and to provide essential data for improving their quality of life and promoting their health. The data collection period was from November 2023 to January 2024. Fifty questionnaires were distributed, and 48 copies were used, excluding unfaithful responses. As a result of the study, since people with moderate disabilities show hesitancy in all of their daily life processes, not only physical rehabilitation but also psychological rehabilitation should be necessary so that they can accept their bodies that have turned into disabilities. In addition, programs for the acceptance of disabilities for people with moderate disabilities should be designed to recover more widely. In the future, education and psychological programs should be activated to overcome the symptoms of moderate disability and receive related information. Finally, disability awareness education should be further subdivided to bring about changes in the perception of disabilities.

Privacy-Preserving Aggregation of IoT Data with Distributed Differential Privacy

  • Lim, Jong-Hyun;Kim, Jong-Wook
    • 한국컴퓨터정보학회논문지
    • /
    • 제25권6호
    • /
    • pp.65-72
    • /
    • 2020
  • 오늘날 사물 인터넷은 우리에게 편의를 제공하기 위해 가정, 산업 현장 및 병원을 포함한 많은 장소에서 사용된다. 다양한 장치가 네트워크에 연결됨에 따라 많은 서비스들이 실시간 데이터 수집, 저장 및 분석을 통해 새로운 가치를 창출하고 있다. 이처럼 많은 분야에서 IoT 장치 내의 센서 및 통신 기능을 활용하는 서비스 및 애플리케이션을 개발하고 있다. 예시로 산업 분야에서 Samsung과 LG는 자사의 IoT 애플리케이션을 통해 가전과 IoT 기기를 연결하여 스마트 홈을 구축하는 서비스를 제공하며, 의료 및 건강 분야에서 Samsung과 Xioami와 같은 기업들은 피트니스 워치 및 앱을 통해 심전도를 확인하거나 운동량을 기록, 관리한다. 위 같은 사례에서 스마트 홈을 구축하는 서비스의 경우에 수집한 데이터를 통해 해당 가정의 생활 패턴이나 출퇴근 여부 등의 민감정보를 유출할 수 있다. 또한 의료 데이터로 사용하기 위해 측정한 데이터를 통해 개인 정보와 질병의 존재와 같은 민감정보를 유출할 수 있다. 따라서 이를 보호하기 위해 해당 논문이 제안하는 방법에 따라 데이터를 수집, 배포한다면 데이터를 제공하는 사용자의 개인 정보 보호에 위협을 막을 수 있다. 이를 해결하기 위해 최근에는 프라이버시 보호 데이터 처리에 차분 프라이버시(DP)가 채택되어왔다. 따라서 DP를 기반으로 스마트워치 플랫폼에서 건강 데이터를 안전하게 수집할 수 있는 방법을 제안하며, 이를 통해 위와 같이 다양한 분야에서 프라이버시를 보호하는 환경에서의 데이터 수집 및 배포를 가능케 할 수 있다.

도로 주행환경 분석을 위한 빅데이터 플랫폼 구축 정보기술 인프라 개발 (Development of Information Technology Infrastructures through Construction of Big Data Platform for Road Driving Environment Analysis)

  • 정인택;정규수
    • 한국산학기술학회논문지
    • /
    • 제19권3호
    • /
    • pp.669-678
    • /
    • 2018
  • 본 연구는 차량센싱데이터, 공공데이터 등 다종의 빅데이터를 활용하여 주행환경 분석 플랫폼 구축을 위한 정보기술 인프라를 개발하였다. 정보기술 인프라는 H/W 기술과 S/W 기술로 구분할 수 있다. 먼저, H/W 기술은 빅데이터 분산 처리를 위한 병렬처리 구조의 소형 플랫폼 서버를 개발하였다. 해당 서버는 1대의 마스터 노드와 9대의 슬래이브 노드로 구성하였으며, H/W 결함에 따른 데이터 유실을 막기 위하여 클러스터 기반 H/W 구성으로 설계하였다. 다음으로 S/W 기술은 빅데이터 수집 및 저장, 가공 및 분석, 정보시각화를 위한 각각의 프로그램을 개발하였다. 수집 S/W의 경우, 실시간 데이터는 카프카와 플럼으로 비실시간 데이터는 스쿱을 이용하여 수집 인터페이스를 개발하였다. 저장 S/W는 데이터의 활용 용도에 따라 하둡 분산파일시스템과 카산드라 DB로 구분하여 저장하는 인터페이스를 개발하였다. 가공 S/W는 그리드 인덱스 기법을 적용하여 수집데이터의 공간 단위 매칭과 시간간격 보간 및 집계를 위한 프로그램을 개발하였다. 분석 S/W는 개발 알고리즘의 탐재 및 평가, 장래 주행환경 예측모형 개발을 위하여 제플린 노트북 기반의 분석 도구를 개발하였다. 마지막으로 정보시각화 S/W는 다양한 주행환경 정보제공 및 시각화를 위하여 지오서버 기반의 웹 GIS 엔진 프로그램을 개발하였다. 성능평가는 개발서버의 메모리 용량과 코어개수에 따른 연산 테스트를 수행하였으며, 타 기관의 클라우드 컴퓨팅과도 연산성능을 비교하였다. 그 결과, 개발 서버에 대한 최적의 익스큐터 개수, 메모리 용량과 코어 개수를 도출하였으며, 개발 서버는 타 시스템 보다 연산성능이 우수한 것으로 나타났다.

PC 클러스터를 이용한 실시간 분산 웹 영상 내용기반 검색 시스템에 관한 연구 (A Study on the Real-time Distributed Content-based Web Image Retrieval System using PC Cluster)

  • 이은애;하석운
    • 한국멀티미디어학회논문지
    • /
    • 제4권6호
    • /
    • pp.534-542
    • /
    • 2001
  • 최근의 내용기반 영상 검객 시스템은 한정된 수의 영상을 저장해 놓은 단일의 서버를 이용하고 있다. 이로 인해 웹 상의 다양한 영상을 원하는 웹 사용자의 요구를 만족시키지 못하고 있다. 수많은 웹 영상을 대상으로 하는 내용기반 영상 검색 시스템은 무엇보다도 실시간에 기반을 두어야 한다. 이를 구현하기 위해서는 영상 수집과 특징 추출에 걸리는 많은 소모 시간 문제가 해결되어야 한다. 최근, 고속의 데이터 처리를 목적으로 부하분산 PC클러스터가 개발되고 있다. 본 논문에서는 많은 시간을 요하는 영상 수집과 특징 추출 작업을 부하분산 PC클러스터의 종속 컴퓨터들에 분배함으로써 전체 검색 시간을 감소시켰으며, 이를 통해 실시간 웹 영상 검색의 가능성을 발견할 수 있었다.

  • PDF

인터넷 환경에 기반한 환경정보시스템 아키텍쳐 설계 : 환경요인을 Database 구축과 이를 이용한 GIS 구축 (Design of Environmental Information Systems Architecture Based on the Internet : The Building of a Database for Environmental Factors and GIS)

  • 서의호;이대호;유성호
    • Asia pacific journal of information systems
    • /
    • 제8권2호
    • /
    • pp.1-18
    • /
    • 1998
  • As the management and preservation of the environment become an important social issue, information required to support environmental task is required. So, there is an increasing demand for environmental information and appropriate systems to manage it. The vast volume of environmental data is distributed in different knowledge domains and systems. Environmental data objects have the complex structure containing environmental quality data and attribute data. Environmental information systems must be able to address these properties. This research has aimed at constructing well-defined schema design of environmental data, and making system architecture that environmental data kept by authorities should be made available to the public user. There are 3 major components in environmental information systems architecture ; User interface, Catalog libraries, Communication Provider. Web browsers provide consistent and intuitive user interfaces on Internet. The communication provider is a collection of diverse CGI functions. The main roles of the CGIs are to build interfaces between the Web, databases. Catalog libraries is libraries of various matadata including administration matadata. Administration matadata support the environmental administration and the managerial aspects of environmental data rather than explain a database itself or its properties.

  • PDF

TLF: Two-level Filter for Querying Extreme Values in Sensor Networks

  • Meng, Min;Yang, Jie;Niu, Yu;Lee, Young-Koo;Jeong, Byeong-Soo;Lee, Sung-Young
    • 한국정보처리학회:학술대회논문집
    • /
    • 한국정보처리학회 2007년도 춘계학술발표대회
    • /
    • pp.870-872
    • /
    • 2007
  • Sensor networks have been widely applied for data collection. Due to the energy limitation of the sensor nodes and the most energy consuming data transmission, we should allocate as much work as possible to the sensors, such as data compression and aggregation, to reduce data transmission and save energy. Querying extreme values is a general query type in wireless sensor networks. In this paper, we propose a novel querying method called Two-Level Filter (TLF) for querying extreme values in wireless sensor networks. We first divide the whole sensor network into domains using the Distributed Data Aggregation Model (DDAM). The sensor nodes report their data to the cluster heads using push method. The advantages of two-level filter lie in two aspects. When querying extreme values, the number of pull operations has the lower boundary. And the query results are less affected by the topology changes of the wireless sensor network. Through this method, the sensors preprocess the data to share the burden of the base station and it combines push and pull to be more energy efficient.