• Title/Summary/Keyword: Embedded CPU

User-level scheduling for Handling Dynamic Workloads in Xen Hypervisor (Xen 하이퍼바이저 기반의 동적 워크로드 핸들링을 위한 사용자 수준의 스케쥴링)

  • Lee, Hyun-Ju;Ko, Young-Woong
    • Proceedings of the Korean Information Science Society Conference / 2011.06b / pp.381-384 / 2011
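  • Thanks to the availability of modern, highly efficient virtualization solutions, virtualization is drawing attention in fields such as servers and cloud computing. Xen is one of the most widely used virtualization technologies, but it shows some weaknesses in scheduling. The Xen hypervisor scheduler sits at the bottom of the stack of schedulers between user-level threads and the physical CPUs, which limits the information it can obtain about user-level workloads. This characteristic can lower overall system throughput and reduce resource utilization. This study proposes user-level scheduling that uses information about the dynamic workloads running in the guest operating system. To this end, we implement user-level scheduling with a new hypercall and a monitoring daemon. We build an environment with Linux as the guest operating system on a Xen hypervisor running the user-level scheduling, run various user-level workloads, and show that system throughput and CPU resource utilization can be increased.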

Optimization of Ship Management System (선박관리 시스템의 최적화)

  • Syan, Lim Chia;Park, Soo-Hong
    • The Journal of the Korea institute of electronic communication sciences / v.8 no.6 / pp.839-846 / 2013
  • In this paper, an effort has been made to design and develop an optimized programming model for a real-time ship management system. Replacing the conventional interrupt-driven programming model, an embedded real-time operating system (RTOS) has been implemented on the system, allowing processes to multitask and run virtually simultaneously. Data management algorithms are designed and developed in the RTOS to facilitate data distribution among tasks and to optimize CPU processing time through intelligent resource utilization. Finally, data loss in the system has been minimized by improving the data processing rate under the optimized programming model.

Desktop PC CPU Cooling System Design and Analysis

  • Choi, Jee-Hoon;Yoo, Jung-Hyun;Seo, Min-Whan;Kang, Shin-Jae;Kim, Chul-Ju
    • Proceedings of the SAREK Conference / 2008.11a / pp.601-604 / 2008
  • Desktop PC CPUs increasingly require thermal management as they meet extensive data and graphics processing requirements. Cooling systems that combine heat pipes embedded in a metal cooling plate with fins are therefore widely used in the desktop PC market. However, demands such as the confined space of desktop PCs, the higher heat density of CPUs, and acoustic noise continue to drive improvements in cooling systems. This paper presents the flow and thermal behavior of the cooling system using a computational fluid dynamics (CFD) code.

Cascade CNN with CPU-FPGA Architecture for Real-time Face Detection (실시간 얼굴 검출을 위한 Cascade CNN의 CPU-FPGA 구조 연구)

  • Nam, Kwang-Min;Jeong, Yong-Jin
    • Journal of IKEEE / v.21 no.4 / pp.388-396 / 2017
  • Face detection involves many variables, such as varied poses, illumination, and occlusions, so a high-performance detection system is required. Although CNNs are excellent at image classification, CNN operations require high-performance hardware resources, whereas small and mobile systems need low-cost, low-power environments. In this paper, a CPU-FPGA integrated system is therefore designed based on a 3-stage cascade CNN architecture using a small FPGA. An adaptive Region of Interest (ROI) is applied to reduce the number of CNN operations using face information from the previous frame. A Field Programmable Gate Array (FPGA) is used to accelerate the CNN computations. The accelerator reads multiple feature maps at once on the FPGA and performs Multiply-Accumulate (MAC) operations in parallel for the convolution operation. The system is implemented on an Altera Cyclone V FPGA in which an ARM Cortex-A9 and on-chip SRAM are embedded. The system runs at 30 FPS with HD-resolution input images. The CPU-FPGA integrated system showed 8.5 times the power efficiency of systems using a CPU only.

Design and Implementation of the Gateway for Remote Monitoring a Combine (콤바인 원격 모니터링을 위한 게이트웨이 설계 및 개발)

  • Moon, Y.K.;Song, Y.H.;Shin, K.Y.;Lee, S.S.;Choi, C.H.;Mun, J.H.
    • Journal of Biosystems Engineering / v.32 no.3 / pp.197-205 / 2007
  • The objective of this study was to design and implement a gateway for remotely monitoring a combine. Many researchers have designed and implemented troubleshooting systems for agricultural machines, but those systems either had no network or used a wired network. Monitored machines, however, operate outdoors, where each machine must be guaranteed mobility and stability. We therefore developed a gateway based on an embedded system that includes the XScale PXA255 processor and a wireless network device. We also built an embedded Linux kernel and several devices. We developed an embedded application for monitoring a combine; this application can also receive signals from other clients and send them to a server via wireless LAN. Finally, a performance evaluation that measured CPU share and memory size showed that the monitoring service can be provided stably.

A New File System for Multimedia Data Stream (멀티미디어 데이터 스트림을 위한 파일 시스템의 설계 및 구현)

  • Lee, Minsuk;Song, Jin-Seok
    • IEMEK Journal of Embedded Systems and Applications / v.1 no.2 / pp.90-103 / 2006
  • There are many file systems in various operating systems. They are usually designed for server environments, where the common cases are 'multiple active users' and 'a great many small files', and they assume a large main memory to be used as a buffer cache. The existing file systems are therefore not suitable for resource-constrained embedded systems that process multimedia data streams. In this study, we designed and implemented a new file system that efficiently stores and retrieves multimedia data streams. The proposed file system has a very simple disk layout, which guarantees quick disk initialization and file system recovery. We also introduced a new indexing scheme, called the time-based indexing scheme. With this scheme, the file system maintains the relation between time and location for all multimedia streams. The scheme is useful for searching and playing compressed multimedia streams because it locates the exact frame position for a given time, which reduces CPU processing and power consumption. The proposed file system and its APIs using the time-based indexing scheme were first implemented in a Linux environment, although the design is operating-system independent. In a performance evaluation on a real DVR system, which measured the execution time of multi-threaded reading and writing, the proposed file system was up to 38.7% faster than the EXT2 file system.
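
The time-based indexing idea in this abstract can be illustrated with a minimal sketch. It assumes the index is a list of (timestamp, byte offset) pairs kept in time order; the class name `TimeIndex` and its methods are hypothetical and are not the paper's API or on-disk layout. A binary search finds the last indexed frame at or before the requested time, so a player could seek directly instead of scanning the compressed stream.

```python
import bisect


class TimeIndex:
    """Hypothetical time-to-offset index for one multimedia stream."""

    def __init__(self):
        self._times = []    # frame timestamps in milliseconds, ascending
        self._offsets = []  # byte offset of the frame at the same position

    def append(self, time_ms, offset):
        # Assumption: frames are recorded in strictly increasing time order.
        if self._times and time_ms <= self._times[-1]:
            raise ValueError("timestamps must be strictly increasing")
        self._times.append(time_ms)
        self._offsets.append(offset)

    def locate(self, time_ms):
        """Return the offset of the last indexed frame at or before time_ms."""
        i = bisect.bisect_right(self._times, time_ms) - 1
        if i < 0:
            raise KeyError("requested time precedes the first frame")
        return self._offsets[i]
```

Each lookup costs O(log n) comparisons on the index and touches no stream data, which is one plausible reading of how such an index could reduce CPU work during seeking.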

Design and Implementation of The Priority based Round Robin Scheduling Operating System for Compact Size Embedded System (소규모 임베디드 시스템을 위한 우선 순위 기반 라운드 로빈 스케줄링 운영체제의 설계 및 구현)

  • 남상엽;이상원;박인정
    • Journal of the Institute of Electronics Engineers of Korea CI / v.40 no.4 / pp.222-231 / 2003
  • In this paper, an operating system using priority-based round-robin scheduling is designed and implemented. With this scheduler, real-time operation is possible because the high-priority task runs first while the other tasks run in parallel. Inter-task communication, device drivers, and an operating system suitable for compact embedded systems were also implemented. This operating system therefore enables efficient and rapid implementation of compact embedded system applications.
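
One common way to combine priorities with round-robin can be sketched as follows. This is an assumed design, not the scheduler from the paper: the class `PriorityRoundRobin` and its methods are hypothetical, priority 0 is taken to be the highest, and tasks are plain names rather than real task control blocks. The highest non-empty priority level always wins, and tasks within one level take turns.

```python
from collections import deque


class PriorityRoundRobin:
    """Hypothetical scheduler: strict priority across levels, round-robin within a level."""

    def __init__(self, levels):
        # One ready queue per priority level; index 0 is the highest priority.
        self._queues = [deque() for _ in range(levels)]

    def add(self, task, priority):
        self._queues[priority].append(task)

    def remove(self, task, priority):
        # Called when a task blocks or finishes.
        self._queues[priority].remove(task)

    def next_task(self):
        """Pick the task for the next time slice, or None if nothing is ready."""
        for queue in self._queues:
            if queue:
                task = queue.popleft()
                queue.append(task)  # move to the back so peers get the next slice
                return task
        return None
```

In this sketch a ready high-priority task is chosen on every slice until it is removed, after which lower-priority tasks share the CPU in turn.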

Query Optimization with Metadata Routing tables on Nano-Q+ Sensor Network with Heterogeneous Sensors (다중 이기종 센서를 보유한 Nano-Q+ 기반 센서네트워크에서 메타데이터 라우팅 테이블을 이용한 질의 최적화)

  • Hwang, Sang-Won;Nam, Young-Kwang;Lee, Kwang-Yong;Mah, Pyung-Soo
    • Proceedings of the Korean Information Science Society Conference / 2006.10d / pp.85-89 / 2006
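  • A sensor network should not be limited to a single type of sensor; it should connect several types of sensors in one network and be able to run queries over multiple sensor types at once. In a network with diverse sensors attached, however, a query for one type of sensor must also be executed on sensor nodes that do not have that sensor. This paper proposes a method for optimizing queries using routing tables in a sensor network with several types of sensors attached. The routing table of a sensor node contains information about the sensor nodes with which it must exchange query commands. When a node sends a command to its child nodes, it must send the command even if no node below it has the queried sensor type. By including information about the sensor types available at the descendant nodes, the method reduces the execution of unnecessary commands according to sensor type. The method was implemented on a Nano-Q+ system equipped with an ATmega128 CPU.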
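
The pruning described in this abstract can be sketched as below, assuming a tree-shaped network in which each node's routing table maps every child to the set of sensor types found anywhere in that child's subtree. The function names and the dictionary layout are hypothetical and do not reflect the Nano-Q+ implementation.

```python
def subtree_types(node, children, sensors):
    """Sensor types available at node or anywhere below it."""
    types = set(sensors.get(node, ()))
    for child in children.get(node, ()):
        types |= subtree_types(child, children, sensors)
    return types


def build_routing_tables(children, sensors):
    """For each parent node, map each child to the sensor types in its subtree."""
    return {
        node: {child: subtree_types(child, children, sensors) for child in kids}
        for node, kids in children.items()
    }


def route_query(node, sensor_type, children, tables, visited=None):
    """Return the nodes that receive the query, skipping subtrees without the type."""
    visited = [] if visited is None else visited
    visited.append(node)
    for child in children.get(node, ()):
        # Metadata check: forward only if the subtree holds the queried type.
        if sensor_type in tables[node][child]:
            route_query(child, sensor_type, children, tables, visited)
    return visited
```

With a sink whose two branches hold different sensor types, a query for one type is forwarded down only the branch that contains it; the other branch receives no command at all.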

GPU-Based ECC Decode Unit for Efficient Massive Data Reception Acceleration

  • Kwon, Jisu;Seok, Moon Gi;Park, Daejin
    • Journal of Information Processing Systems / v.16 no.6 / pp.1359-1371 / 2020
  • When transmitting and receiving a large amount of data, reliable data communication is crucial for normal operation of a device and for preventing abnormal operations caused by errors. Therefore, in this paper, it is assumed that an error correction code (ECC) that can detect and correct errors by itself is used in an environment where massive data is sequentially received. Because an embedded system has limited resources, such as a low-performance processor or a small memory, it requires efficient operation of applications. In this paper, we propose using an accelerated ECC-decoding technique with a graphics processing unit (GPU) built into the embedded system when receiving a large amount of data. In the matrix-vector multiplication that forms the Hamming code used as a function of the ECC operation, the matrix is expressed in compressed sparse row (CSR) format, and a sparse matrix-vector product is used. The multiplication operation is performed in the kernel of the GPU, and we also accelerate the Hamming code computation so that the ECC operation can be performed in parallel. The proposed technique is implemented with CUDA on a GPU-embedded target board, NVIDIA Jetson TX2, and its execution time is compared with that of the CPU.
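
The CSR-based matrix-vector product mentioned in this abstract can be illustrated with a small sequential sketch. It assumes a Hamming(7,4) code whose parity-check matrix has the binary representations of 1 to 7 as columns; the paper's actual code parameters and CUDA kernel are not given in the abstract, so the constants and function names here are hypothetical. Each output row is computed independently, which is the part a GPU implementation could assign to separate threads.

```python
# CSR form of an assumed Hamming(7,4) parity-check matrix H.
# H is binary, so only the row pointers and column indices are stored.
INDPTR = [0, 4, 8, 12]
INDICES = [0, 2, 4, 6,   # row 0: bit positions 1, 3, 5, 7
           1, 2, 5, 6,   # row 1: bit positions 2, 3, 6, 7
           3, 4, 5, 6]   # row 2: bit positions 4, 5, 6, 7


def syndrome(word):
    """Sparse matrix-vector product H * word over GF(2)."""
    result = []
    for row in range(len(INDPTR) - 1):
        start, end = INDPTR[row], INDPTR[row + 1]
        result.append(sum(word[INDICES[k]] for k in range(start, end)) % 2)
    return result


def correct(word):
    """Return a copy of word with a single-bit error fixed, if one is detected."""
    s = syndrome(word)
    position = s[0] + 2 * s[1] + 4 * s[2]  # 1-based error position, 0 means clean
    fixed = list(word)
    if position:
        fixed[position - 1] ^= 1
    return fixed
```

For example, `[1, 1, 1, 0, 0, 0, 0]` is a valid codeword under this matrix; flipping its fifth bit yields the syndrome `[1, 0, 1]`, which encodes position 5, and `correct` restores the original word.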

Light-weight Gender Classification and Age Estimation based on Ensemble Multi-tasking Deep Learning (앙상블 멀티태스킹 딥러닝 기반 경량 성별 분류 및 나이별 추정)

  • Huy Tran, Quoc Bao;Park, JongHyeon;Chung, SunTae
    • Journal of Korea Multimedia Society / v.25 no.1 / pp.39-51 / 2022
  • Image-based gender classification and age estimation of humans are classic problems in computer vision. Most research in this field focuses on only one task, either gender classification or age estimation, and most reported methods for each task focus on accuracy and are not computationally light. Thus, running both tasks simultaneously on low-cost mobile or embedded systems with limited CPU processing speed and memory capacity is practically prohibitive. In this paper, we propose a novel light-weight gender classification and age estimation method based on ensemble multitasking deep learning with a light-weight neural network architecture, which performs both gender classification and age estimation simultaneously and in real time even on embedded systems. Experiments on various well-known datasets show that the proposed method performs comparably to state-of-the-art gender classification and/or age estimation methods with respect to accuracy and runs fast enough (14 fps on average) on a Jetson Nano embedded board.