Search | Korea Science

Scalable Big Data Pipeline for Video Stream Analytics Over Commodity Hardware

Ayub, Umer;Ahsan, Syed M.;Qureshi, Shavez M.
- KSII Transactions on Internet and Information Systems (TIIS)
- /
- v.16 no.4
- /
- pp.1146-1165
- /
- 2022
A huge amount of data in the form of videos and images is being produced owning to advancements in sensor technology. Use of low performance commodity hardware coupled with resource heavy image processing and analyzing approaches to infer and extract actionable insights from this data poses a bottleneck for timely decision making. Current approach of GPU assisted and cloud-based architecture video analysis techniques give significant performance gain, but its usage is constrained by financial considerations and extremely complex architecture level details. In this paper we propose a data pipeline system that uses open-source tools such as Apache Spark, Kafka and OpenCV running over commodity hardware for video stream processing and image processing in a distributed environment. Experimental results show that our proposed approach eliminates the need of GPU based hardware and cloud computing infrastructure to achieve efficient video steam processing for face detection with increased throughput, scalability and better performance.
https://doi.org/10.3837/tiis.2022.04.004 인용 PDF KSCI HTML

A Study on Computer Security and Controls (Computer Security에 관한 소고 - 사고범죄예방을 중심으로 -)

이종철
- Journal of Korean Society of Industrial and Systems Engineering
- /
- v.4 no.4
- /
- pp.25-34
- /
- 1981
Recently there has been a marked increase in concern for security in computerized operations. The purposes of computer security controls are to protect against the unauthorized access to and modification of data processing resources, unauthorised access to and modification of data files and software, and the misuse of authorized activities. The controls relate to the physical security of the data processing department and of the areas within the data processing department : to the security of the data files, programs, and system software : and to the human interaction with the data files, programs, and system software. The controls that will be discussed in this paper include : I. Risk on the computer use. II. Methods of risk counter measure. III. Role of system auditing.
PDF

LOSSLESS DATA COMPRESSION ON SAR DISPLAY IMAGES (SAR 디스플레이 영상을 위한 무손실 압축)

Lee, Tae-hee;Song, Woo-jin;Do, Dae-won;Kwon, Jun-chan;Yoon, Byung-woo
- Proceedings of the IEEK Conference
- /
- 2001.09a
- /
- pp.117-120
- /
- 2001
Synthetic aperture radar (SAR) is a promising active remote sensing technique to obtain large terrain information of the earth in all-weather conditions. SAR is useful in many applications, including terrain mapping and geographic information system (GIS), which use SAR display images. Usually, these applications need the enormous data storage because they deal with wide terrain images with high resolution. So, compression technique is a useful approach to deal with SAR display images with limited storage. Because there is some indispensable data loss through the conversion of a complex SAR image to a display image, some applications, which need high-resolution images, cannot tolerate more data loss during compression. Therefore, lossless compression is appropriate to these applications. In this paper, we propose a novel lossless compression technique for a SAR display image using one-step predictor and block arithmetic coding.
PDF

Design an Indexing Structure System Based on Apache Hadoop in Wireless Sensor Network

Keo, Kongkea;Chung, Yeongjee
- Proceedings of the Korea Information Processing Society Conference
- /
- 2013.05a
- /
- pp.45-48
- /
- 2013
In this paper, we proposed an Indexing Structure System (ISS) based on Apache Hadoop in Wireless Sensor Network (WSN). Nowadays sensors data continuously keep growing that need to control. Data constantly update in order to provide the newest information to users. While data keep growing, data retrieving and storing are face some challenges. So by using the ISS, we can maximize processing quality and minimize data retrieving time. In order to design ISS, Indexing Types have to be defined depend on each sensor type. After identifying, each sensor goes through the Indexing Structure Processing (ISP) in order to be indexed. After ISP, indexed data are streaming and storing in Hadoop Distributed File System (HDFS) across a number of separate machines. Indexed data are split and run by MapReduce tasks. Data are sorted and grouped depend on sensor data object categories. Thus, while users send the requests, all the queries will be filter from sensor data object and managing the task by MapReduce processing framework.
https://doi.org/10.3745/PKIPS.y2013m05a.45 인용 PDF

An Improved Indexing Method for Query Processing of Dataspaces (데이터스페이스의 질의 처리를 위한 향상된 인덱싱 기법)

Huang, Xuguang;Lee, Dong-Wook;Shin, Soong-Sun;Baek, Sung-Ha;Bae, Hae-Young
- Proceedings of the Korea Information Processing Society Conference
- /
- 2009.04a
- /
- pp.317-320
- /
- 2009
Dataspaces are the collections of heterogeneous and partially unstructured data. It is difficult for the users to explore the data from varies data sources using a single schema. And the queries supposed should be allowed to specify varying degrees of structure, spanning keyword queries to more structure-aware queries. Utilizing give the model of heterogeneous data and the definitions of two mainly types of query on dataspaces, in this paper we propose an improved method which can suppose the flexibly query more efficiently.
https://doi.org/10.3745/PKIPS.y2009m04a.317 인용 PDF

A Study on Distributed System Construction and Numerical Calculation Using Raspberry Pi

Ko, Young-ho;Heo, Gyu-Seong;Lee, Sang-Hyun
- International journal of advanced smart convergence
- /
- v.8 no.4
- /
- pp.194-199
- /
- 2019
As the performance of the system increases, more parallelized data is being processed than single processing of data. Today's cpu structure has been developed to leverage multicore, and hence data processing methods are being developed to enable parallel processing. In recent years desktop cpu has increased multicore, data is growing exponentially, and there is also a growing need for data processing as artificial intelligence develops. This neural network of artificial intelligence consists of a matrix, making it advantageous for parallel processing. This paper aims to speed up the processing of the system by using raspberrypi to implement the cluster building and parallel processing system against the backdrop of the foregoing discussion. Raspberrypi is a credit card-sized single computer made by the raspberrypi Foundation in England, developed for education in schools and developing countries. It is cheap and easy to get the information you need because many people use it. Distributed processing systems should be supported by programs that connected multiple computers in parallel and operate on a built-in system. RaspberryPi is connected to switchhub, each connected raspberrypi communicates using the internal network, and internally implements parallel processing using the Message Passing Interface (MPI). Parallel processing programs can be programmed in python and can also use C or Fortran. The system was tested for parallel processing as a result of multiplying the two-dimensional arrangement of 10000 size by 0.1. Tests have shown a reduction in computational time and that parallelism can be reduced to the maximum number of cores in the system. The systems in this paper are manufactured on a Linux-based single computer and are thought to require testing on systems in different environments.
https://doi.org/10.7236/IJASC.2019.8.4.194 인용 PDF KSCI

Spatial Clearinghouse Components for OpenGIS Data Providers

Oh, Byoung-Woo;Kim, Min-Soo;Lee, Jong-Hun
- Proceedings of the KSRS Conference
- /
- 1999.11a
- /
- pp.84-88
- /
- 1999
Recently, the necessity of accessing spatial data from remote computer via network has been increased as distributed spatial data have been increased due to their size and cost. Many methods have been used in recent years for transferring spatial data, such as socket, CORBA, HTTP, RPC, FTP, etc. In this paper, we propose spatial clearinghouse components to access distributed spatial data sources via CORBA and Internet. The spatial clearinghouse components are defined as OLE/COM components that enable users to access spatial data that meet their requests from remote computer. For reusability, we design the spatial clearinghouse with UML and implement it as a set of components. In order to enhance interoperability among different platforms in distributed computing environment, we adopt international standards and open architecture such as CORBA, HTTB, and OpenGIS Simple Features Specifications. There are two kinds of spatial clearinghouse: CORBA-based spatial clearinghouse and Internet-based spatial clearinghouse. The CORBA-based spatial clearinghouse supports COM-CORBA bridge to access spatial data from remote data providers that satisfy the OpenGIS Simple Features Specification for OLE/COM using COM and CORBA interfaces. The Internet-based spatial clearinghouse provides Web-service components to access spatial data from remote data providers using Web-browser.
PDF

Locality Aware Multi-Sensor Data Fusion Model for Smart Environments (장소인식멀티센서스마트 환경을위한 데이터 퓨전 모델)

Nawaz, Waqas;Fahim, Muhammad;Lee, Sung-Young;Lee, Young-Koo
- Proceedings of the Korea Information Processing Society Conference
- /
- 2011.04a
- /
- pp.78-80
- /
- 2011
In the area of data fusion, dealing with heterogeneous data sources, numerous models have been proposed in last three decades to facilitate different application domains i.e. Department of Defense (DoD), monitoring of complex machinery, medical diagnosis and smart buildings. All of these models shared the theme of multiple levels processing to get more reliable and accurate information. In this paper, we consider five most widely acceptable fusion models (Intelligence Cycle, Joint Directors of Laboratories, Boyd control, Waterfall, Omnibus) applied to different areas for data fusion. When they are exposed to a real scenario, where large dataset from heterogeneous sources is utilize for object monitoring, then it may leads us to non-efficient and unreliable information for decision making. The proposed variation works better in terms of time and accuracy due to prior data diminution.
https://doi.org/10.3745/PKIPS.y2011m04a.78 인용 PDF

Big IoT Healthcare Data Analytics Framework Based on Fog and Cloud Computing

Alshammari, Hamoud;El-Ghany, Sameh Abd;Shehab, Abdulaziz
- Journal of Information Processing Systems
- /
- v.16 no.6
- /
- pp.1238-1249
- /
- 2020
Throughout the world, aging populations and doctor shortages have helped drive the increasing demand for smart healthcare systems. Recently, these systems have benefited from the evolution of the Internet of Things (IoT), big data, and machine learning. However, these advances result in the generation of large amounts of data, making healthcare data analysis a major issue. These data have a number of complex properties such as high-dimensionality, irregularity, and sparsity, which makes efficient processing difficult to implement. These challenges are met by big data analytics. In this paper, we propose an innovative analytic framework for big healthcare data that are collected either from IoT wearable devices or from archived patient medical images. The proposed method would efficiently address the data heterogeneity problem using middleware between heterogeneous data sources and MapReduce Hadoop clusters. Furthermore, the proposed framework enables the use of both fog computing and cloud platforms to handle the problems faced through online and offline data processing, data storage, and data classification. Additionally, it guarantees robust and secure knowledge of patient medical data.
https://doi.org/10.3745/JIPS.04.0193 인용 PDF KSCI

A Review of Window Query Processing for Data Streams

Kim, Hyeon Gyu;Kim, Myoung Ho
- Journal of Computing Science and Engineering
- /
- v.7 no.4
- /
- pp.220-230
- /
- 2013
In recent years, progress in hardware technology has resulted in the possibility of monitoring many events in real time. The volume of incoming data may be so large, that monitoring all individual data might be intractable. Revisiting any particular record can also be impossible in this environment. Therefore, many database schemes, such as aggregation, join, frequent pattern mining, and indexing, become more challenging in this context. This paper surveys the previous efforts to resolve these issues in processing data streams. The emphasis is on specifying and processing sliding window queries, which are supported in many stream processing engines. We also review the related work on stream query processing, including synopsis structures, plan sharing, operator scheduling, load shedding, and disorder control.
https://doi.org/10.5626/JCSE.2013.7.4.220 인용 PDF KSCI KPUBS

Search Result 4,265, Processing Time 0.053 seconds

이메일무단수집거부

이용약관

제 1 장 총칙

제 2 장 이용계약의 체결

제 3 장 계약 당사자의 의무

제 4 장 서비스의 이용

제 5 장 계약 해지 및 이용 제한

제 6 장 손해배상 및 기타사항

Detail Search

Image Search (β)