• Title/Summary/Keyword: and Parallel Processing

Fast Multi-Phase Packet Classification Architecture using Internal Buffer and Single Entry Caching (내부 버퍼와 단일 엔트리 캐슁을 이용한 다단계 패킷 분류 가속화 구조)

  • Kang, Dae-In;Park, Hyun-Tae;Kim, Hyun-Sik;Kang, Sung-Ho
    • Journal of the Institute of Electronics Engineers of Korea TC
    • /
    • v.44 no.9
    • /
    • pp.38-45
    • /
    • 2007
  • With the emergence of new applications, packet classification has become essential for supporting advanced internet applications such as network security and QoS provisioning. Because packet classification on multiple fields is a difficult and time-consuming problem, internet routers need to classify incoming packets into flows quickly. In this paper, we present a multi-phase packet classification architecture that uses an internal buffer for fast packet processing. By placing an internal buffer between the address-pair search phase and the search phases for the remaining fields, we can hide the latency that arises because the search times for the source and destination header fields differ. Moreover, we guarantee the improvement by using single entry caching. The proposed architecture is easy to adapt to different needs owing to its simplicity and generality.
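
The single entry caching idea mentioned in the abstract can be illustrated with a minimal sketch. This is not the authors' hardware design: the header layout, the `slow_classify` stand-in, and the class name are all hypothetical, and the sketch only shows the general principle that a repeated header can skip the full multi-phase search.

```python
from typing import Callable, Optional, Tuple

# Hypothetical header layout: (src, dst, src_port, dst_port, protocol).
Header = Tuple[str, str, int, int, str]


class SingleEntryCache:
    """Remembers only the most recent header and its classification result."""

    def __init__(self, classify: Callable[[Header], int]) -> None:
        self._classify = classify
        self._key: Optional[Header] = None
        self._value: Optional[int] = None
        self.hits = 0
        self.misses = 0

    def lookup(self, header: Header) -> int:
        # Hit: the previous packet had the same header, so reuse its result.
        if header == self._key:
            self.hits += 1
            return self._value
        # Miss: run the full (slow) classification and overwrite the entry.
        self.misses += 1
        self._value = self._classify(header)
        self._key = header
        return self._value


def slow_classify(header: Header) -> int:
    """Toy stand-in for the multi-phase search (assumption, not from the paper)."""
    _src, _dst, _sport, dport, proto = header
    return 1 if proto == "tcp" and dport == 80 else 0
```

Because packets of one flow often arrive back to back, even a one-entry cache can avoid a share of full searches; how large that share is depends on the traffic.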

Design of Intra Prediction Circuit for HEVC and H.264 Multi-decoder Supporting UHD Images (UHD 영상을 지원하는 HEVC 및 H.264 멀티 디코더 용 인트라 예측 회로 설계)

  • Yu, Sanghyun;Cho, Kyeongsoon
    • Journal of the Institute of Electronics and Information Engineers
    • /
    • v.53 no.12
    • /
    • pp.50-56
    • /
    • 2016
  • This paper proposes the architecture and design of an intra prediction circuit for a multi-decoder supporting UHD images. The proposed circuit supports not only the latest video compression standard, HEVC, but also H.264. In addition to the basic function of performing intra prediction, this circuit can perform the reference sample filter operation defined in the H.264 standard, and the smoothing and strong sample filter operations defined in the HEVC standard. We reduced the circuit size by sharing the circuit blocks for common operations and internal storage, and improved the circuit performance by parallel processing. The proposed circuit was described at RTL using Verilog HDL, and its functionality was verified using NC-Verilog of Cadence. The RTL circuit was synthesized using Design Compiler of Synopsys and a 130nm standard cell library. The synthesized gate-level circuit consists of 69,694 gates and processes 100 to 280 frames per second for 4K-UHD HEVC images at the maximum operation frequency of 157MHz.

The Cognitive Ecological Characteristics in Folded Space and Their Effects (폴드공간의 인지생태론적 특성과 그 효과)

  • Kim Joomi
    • Korean Institute of Interior Design Journal
    • /
    • v.14 no.3 s.50
    • /
    • pp.181-190
    • /
    • 2005
  • With the rapid growth of digital media in the 1990s, the state of indetermination found in digital processes has been emphasized in the fields of natural science and philosophy. Digitalized space design has developed dramatically and has raised heated debate and comment on 'folded space'. The purpose of this thesis is to explore how cognitive-ecological factors constitute fold structures in the space design of the late 20th century. The syntax of space structure and geometric composition were analyzed to define what types of cognitive-ecological factors are contrived in the processing of visual information. In particular, I put more theoretical emphasis on the characteristics that ensue in the process of structuring spaces than on any other subject. Through these analyses and discussions, I raised questions about what principles operate to create new space design that relies on non-linear structure and its formational process. I also observed what influences these structural principles of design could fundamentally bring to human beings. First, I proposed that we could overcome reductionist space design through a cognitive-ecological approach. Key concepts such as affordance, parallel processing, and redundancy were adopted as defining elements of non-linear structures. As a result of the analyses, I found that the cognitive-ecological approach could substitute for the reductionist space design of the past. I also found that these three variables are the ultimate ecological elements. In addition, as a methodological concept of fold structures, the form of 'topology' was highlighted because it could be a supporting idea for the cognitive-ecological factors. Second, I claimed that non-linear design is more experiential than rational linear design, and that it corresponds more efficiently to human beings than any other form. What is intended and implied in non-linear structure is also indicated.

GPGPU based Depth Image Enhancement Algorithm (GPGPU 기반의 깊이 영상 화질 개선 기법)

  • Han, Jae-Young;Ko, Jin-Woong;Yoo, Jisang
    • Journal of the Korea Institute of Information and Communication Engineering
    • /
    • v.17 no.12
    • /
    • pp.2927-2936
    • /
    • 2013
  • In this paper, we propose a noise reduction and hole removal algorithm to improve the quality of depth images used for creating 3D content. The proposed algorithm uses both the depth image and the corresponding color image. First, an intensity image is generated by converting the RGB color space into the HSI color space. Noise is then removed using the differences in distance and depth between reference and neighbor pixels in the depth image, together with the differences in intensity values from the color image. Next, the proposed hole filling method fills the detected holes using the differences in Euclidean distance and intensity values between reference and neighbor pixels in the color image. Finally, we apply a parallel GPGPU structure to the proposed algorithm to speed up its processing for real-time applications. The experimental results show that the proposed algorithm performs better than other conventional algorithms. In particular, it is more effective in reducing the edge blurring effect and in removing noise and holes.
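
The abstract does not give the exact weighting formula, so the following is only a sketch of one plausible reading: a joint bilateral-style filter in which each neighbor is weighted by its spatial distance, its depth difference, and its intensity difference from the reference pixel. The Gaussian weights, the function name, and all `sigma_*` parameters are assumptions, not the authors' method.

```python
import math
from typing import List

Image = List[List[float]]


def joint_filter(depth: Image, intensity: Image, radius: int = 1,
                 sigma_s: float = 1.0, sigma_d: float = 10.0,
                 sigma_i: float = 10.0) -> Image:
    """Smooth `depth`, guided by spatial, depth, and intensity similarity."""
    h, w = len(depth), len(depth[0])
    out = [row[:] for row in depth]
    for y in range(h):
        for x in range(w):
            total = 0.0
            norm = 0.0
            for ny in range(max(0, y - radius), min(h, y + radius + 1)):
                for nx in range(max(0, x - radius), min(w, x + radius + 1)):
                    ds2 = (ny - y) ** 2 + (nx - x) ** 2          # squared pixel distance
                    dd = depth[ny][nx] - depth[y][x]             # depth difference
                    di = intensity[ny][nx] - intensity[y][x]     # intensity difference
                    wgt = math.exp(-ds2 / (2 * sigma_s ** 2)
                                   - dd * dd / (2 * sigma_d ** 2)
                                   - di * di / (2 * sigma_i ** 2))
                    total += wgt * depth[ny][nx]
                    norm += wgt
            # norm >= 1 because the reference pixel itself has weight 1.
            out[y][x] = total / norm
    return out
```

Each output pixel depends only on the input images, never on other output pixels, which is the property that lets this kind of filter map one pixel to one GPU thread.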

Comparison of SPECT Images with $^{99m}Tc$ Collimators ($^{99m}Tc$용 콜리메타의 성능과 SPECT 화상의 영향)

  • Lee, Man-Koo;Lee, Jeong-Ok;Park, Soung-Ock
    • Journal of radiological science and technology
    • /
    • v.24 no.2
    • /
    • pp.35-40
    • /
    • 2001
  • The performance of SPECT imaging systems that use a rotating gamma camera is affected by the characteristics of the detector-collimator assembly, the data acquisition method, and the filter used in image reconstruction. The purpose of this study is to examine the image quality of SPECT with different types of low energy collimators. The SPECT imaging system in this study consists of a digital gamma camera system, GCA-901A (Toshiba), and a data processing unit, Scintipac-700 (Shimadzu). The four types of collimators compared are UHR (ultra high resolution), LEHR (low energy high resolution), LEGP (low energy general purpose), and I-123 PAR (parallel), with relative sensitivities of 0.27, 0.66, 1.00, and 2.06, respectively. For the same collimator, the spatial resolutions measured in the slice plane showed a slight difference in FWHM values (mean values of UHR, LEHR, LEGP, and I-123 PAR were 11.3 mm, 13.6 mm, 15.8 mm, and 20.4 mm, respectively) between the center and the circumference of the field of view in the radial direction, but a large difference in the tangential direction, with lower FWHM values (values of UHR, LEHR, LEGP, and I-123 PAR were 8.4 mm, 8.7 mm, 9.3 mm, and 10.8 mm at 12 cm from the center, respectively). In the comparison of SPECT images with the four types of collimators, the image qualities of the UHR, LEHR, and LEGP collimators showed only a slight difference; the I-123 PAR collimator was the exception. From the point of for, it is expected that the LEGP collimator would be suitable for SPECT imaging with $^{99m}Tc$.

Effect of Nutrition Permeability from Barley sprouts, Curcuma longa L., Dendropanax morbifera LEV., Phellinus linteus Using Cryogenic Grinding Technology (동결분쇄를 이용한 보리싹, 울금, 황칠, 상황버섯의 영양성분 증진 및 투과 효과)

  • Lee, Il-nam;Han, Ye-eun;Jeong, Ho-jun;Park, Haeun;Jung, Juyeong;Rhee, Jin-Kyu
    • Food Engineering Progress
    • /
    • v.21 no.4
    • /
    • pp.391-402
    • /
    • 2017
  • The purpose of this study was to improve the nutrition and the permeability of functional plants by using cryogenic grinding technology. Barley sprouts, Curcuma longa L., Dendropanax morbifera LEV., and Phellinus linteus were dried, ground, and extracted under different temperature conditions. The powder size of barley sprouts and Curcuma longa L. was about $50{\mu}m$, and that of Dendropanax morbifera LEV. and Phellinus linteus was about $20{\mu}m$. Cryogenically ground barley sprouts preserved 18.27-124.65% of nutrients such as protein, ash, carbohydrate, beta carotene, minerals, and vitamins. Cryogenically ground powder of Curcuma longa L. showed a high retention rate for lipid and carbohydrate. Permeability was measured by the Parallel Artificial Membrane Permeability Assay (PAMPA) to predict passive gastrointestinal absorption. The permeability of saponarin, a marker compound of barley sprouts, was 9.88 times higher in cryogenically ground powder than in ambient ground powder. Curcumin permeability was 3.1 times higher than in ambient ground powder. As a result, particle size, nutrition, protein digestion degree, and permeability demonstrated a positive relationship with decreasing grinding temperature for the powders. These results confirm that the cryogenic grinding method is well suited to increasing the functionality of plants, since it can minimize the heat generated during processing and effectively reduce the particle size.

A Scalable ECC Processor for Elliptic Curve based Public-Key Cryptosystem (타원곡선 기반 공개키 암호 시스템 구현을 위한 Scalable ECC 프로세서)

  • Choi, Jun-Baek;Shin, Kyung-Wook
    • Journal of the Korea Institute of Information and Communication Engineering
    • /
    • v.25 no.8
    • /
    • pp.1095-1102
    • /
    • 2021
  • An ECC architecture that offers high scalability and a flexible trade-off between performance and hardware complexity is proposed. For architectural scalability, a modular arithmetic unit based on a one-dimensional array of processing elements (PEs) that perform finite field operations on 32-bit words in parallel was implemented, and the number of PEs used can be set in the range of 1 to 8 for circuit synthesis. Scalable algorithms for word-based Montgomery multiplication and Montgomery inversion were adopted. The scalable ECC processor (sECCP), implemented using 180-nm CMOS technology, required 100 kGEs and 8.8 kbits of RAM when NPE=1, and 203 kGEs and 12.8 kbits of RAM when NPE=8. The performance of sECCP with NPE=1 and NPE=8 was analyzed to be 110 PSMs/sec and 610 PSMs/sec, respectively, on the P256R elliptic curve when operating at a 100 MHz clock.
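
Word-based Montgomery multiplication, which the abstract names, can be sketched in software as follows. This is a generic textbook word-serial variant, not the paper's PE-array datapath: it consumes one 32-bit word of `a` per iteration but keeps the accumulator as a Python big integer, and the function names are hypothetical. `pow(n, -1, m)` requires Python 3.8 or later.

```python
W = 32                       # word size in bits, matching the 32-bit words in the abstract
MASK = (1 << W) - 1


def mont_mul(a: int, b: int, n: int, s: int) -> int:
    """Return a * b * R^-1 mod n, with R = 2^(W*s), n odd, and a, b < n < R."""
    n_prime = (-pow(n, -1, 1 << W)) & MASK   # -n^-1 mod 2^W
    t = 0
    for i in range(s):
        a_i = (a >> (W * i)) & MASK          # i-th word of a
        t += a_i * b                         # partial product
        m = ((t & MASK) * n_prime) & MASK    # makes the low word of t + m*n zero
        t = (t + m * n) >> W                 # exact division by 2^W
    return t - n if t >= n else t            # single conditional subtraction


def to_mont(x: int, n: int, s: int) -> int:
    """Map x to Montgomery form x * R mod n."""
    return mont_mul(x, pow(2, 2 * W * s, n), n, s)


def from_mont(x: int, n: int, s: int) -> int:
    """Map a Montgomery-form value back to the ordinary representation."""
    return mont_mul(x, 1, n, s)
```

For a 256-bit modulus, `s = 8` words are needed. In a hardware array, the word-level multiplications inside each iteration are the natural candidates for distribution across PEs, although the abstract does not describe the exact scheduling.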

AMG-CG method for numerical analysis of high-rise structures on heterogeneous platforms with GPUs

  • Li, Zuohua;Shan, Qingfei;Ning, Jiafei;Li, Yu;Guo, Kaisheng;Teng, Jun
    • Computers and Concrete
    • /
    • v.29 no.2
    • /
    • pp.93-105
    • /
    • 2022
  • The degrees of freedom (DOFs) of high-rise structures increase rapidly due to the need for refined analysis, which makes computationally efficient numerical analysis of high-rise structures with the finite element method (FEM) challenging. This paper presents an efficient iterative method, a conjugate gradient method preconditioned by algebraic multigrid (AMG) with a Jacobi overrelaxation (JOR) smoother (AMG-CG), for solving large-scale structural system equations on heterogeneous platforms with graphics processing units (GPUs) as parallel accelerators. Furthermore, an AMG-CG FEM application framework was established for the numerical analysis of high-rise structures. For the proposed method, the coarsening method, the optimal relaxation coefficient of the JOR smoother, the number of smoothing sweeps, and the solution method for the coarsest grid of the AMG preconditioner were investigated via several numerical benchmarks of high-rise structures. The accuracy and efficiency of the proposed FEM application framework were compared against the mature software Abaqus, and speedups of up to 18.4x were obtained when using an NVIDIA K40C GPU hosted in a workstation. The results demonstrated that the proposed method could improve the computational efficiency of solving structural system equations, and that the AMG-CG FEM application framework was inherently suitable for numerical analysis of high-rise structures.
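
The Jacobi overrelaxation (JOR) smoother named in the abstract has a simple standard form, x ← x + ω D⁻¹(b − A x), where D is the diagonal of A and ω is the relaxation coefficient. The sketch below shows one sweep on a small dense matrix; it illustrates only the smoother, not the AMG hierarchy or the paper's GPU implementation, and the function name is hypothetical.

```python
from typing import List


def jor_sweep(A: List[List[float]], b: List[float],
              x: List[float], omega: float) -> List[float]:
    """One JOR sweep: x_new[i] = x[i] + omega * (b[i] - (A x)[i]) / A[i][i]."""
    n = len(b)
    new_x = x[:]
    for i in range(n):
        # Residual component uses only the OLD iterate x, so all i are independent.
        r_i = b[i] - sum(A[i][j] * x[j] for j in range(n))
        new_x[i] = x[i] + omega * r_i / A[i][i]
    return new_x
```

Because every component update reads only the previous iterate, the loop over `i` has no data dependencies, which is what makes Jacobi-type smoothers attractive on GPUs compared with Gauss-Seidel.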

Specification and Analysis of System Properties by using Petri nets (페트리 네트를 이용한 시스템 속성의 명세 및 분석)

  • Lee, Woo-Jin
    • The KIPS Transactions:PartD
    • /
    • v.11D no.1
    • /
    • pp.115-122
    • /
    • 2004
  • Software system modeling aims to find and solve a system's problems by describing and analyzing the system model in formal notations. Petri nets, as a graphical formalism, have been used to describe and analyze software systems such as parallel systems, real-time systems, and protocols. In the analysis of Petri nets, general system properties such as deadlock and liveness are analyzed by reachability analysis. On the other hand, specific properties such as functional requirements and constraints are checked by model checking. However, since these analysis methods are based on enumeration of all possible states, there may be a state explosion problem, in which the number of system states increases exponentially as the system grows. In this paper, we propose a new method for mechanically checking system properties while avoiding the state explosion problem. First, system properties are described as property nets; then the system model and the property net are composed and analyzed. In the compositional analysis, system parts irrelevant to the specific property are reduced to minimize the analysis domain of the system. It is then possible to check mechanically whether a specific property is satisfied.
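
The reachability analysis that the abstract describes as the baseline (and as the source of state explosion) can be sketched as an exhaustive breadth-first enumeration of markings. This is the conventional approach, not the paper's compositional method, and the small two-process, two-resource net below is a hypothetical example.

```python
from collections import deque
from typing import Dict, List, Set, Tuple

Marking = Tuple[int, ...]
Transition = Tuple[Dict[int, int], Dict[int, int]]   # (pre, post): place -> token count


def enabled(m: Marking, pre: Dict[int, int]) -> bool:
    return all(m[p] >= k for p, k in pre.items())


def fire(m: Marking, pre: Dict[int, int], post: Dict[int, int]) -> Marking:
    nxt = list(m)
    for p, k in pre.items():
        nxt[p] -= k
    for p, k in post.items():
        nxt[p] += k
    return tuple(nxt)


def find_deadlocks(initial: Marking, transitions: List[Transition],
                   limit: int = 10000) -> Tuple[Set[Marking], List[Marking]]:
    """Enumerate reachable markings; a marking with no enabled transition is a deadlock."""
    seen = {initial}
    queue = deque([initial])
    deadlocks = []
    while queue:
        m = queue.popleft()
        successors = [fire(m, pre, post) for pre, post in transitions if enabled(m, pre)]
        if not successors:
            deadlocks.append(m)
        for nxt in successors:
            if nxt not in seen:
                if len(seen) >= limit:           # guard against state explosion
                    raise RuntimeError("state limit exceeded")
                seen.add(nxt)
                queue.append(nxt)
    return seen, deadlocks


# Hypothetical net. Places: 0 P1 idle, 1 P1 holds A, 2 P2 idle, 3 P2 holds B,
# 4 resource A free, 5 resource B free.
INITIAL: Marking = (1, 0, 1, 0, 1, 1)
TRANSITIONS: List[Transition] = [
    ({0: 1, 4: 1}, {1: 1}),                  # P1 takes A
    ({1: 1, 5: 1}, {0: 1, 4: 1, 5: 1}),      # P1 takes B, works, releases both
    ({2: 1, 5: 1}, {3: 1}),                  # P2 takes B
    ({3: 1, 4: 1}, {2: 1, 4: 1, 5: 1}),      # P2 takes A, works, releases both
]
```

Calling `find_deadlocks(INITIAL, TRANSITIONS)` visits every reachable marking of this toy net. The `seen` set is exactly what grows exponentially for larger nets, which is the cost the reduction step in the abstract aims to cut.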

Design and Implementation of KDSM(KAIST Distributed Shared Memory) System (KDSM(KAIST Distributed Shared Memory) 시스템의 설계 및 구현)

  • Lee, Sang-Kwon;Yun, Hee-Chul;Lee, Joon-Won;Maeng, Seung-Ryoul
    • Journal of KIISE:Computer Systems and Theory
    • /
    • v.29 no.5
    • /
    • pp.257-264
    • /
    • 2002
  • In this paper, we give a detailed description of the KDSM (KAIST Distributed Shared Memory) system. KDSM is implemented as a user-level library running on Linux 2.2.13, and TCP/IP is used for communication. KDSM uses a page-based invalidation protocol and a multiple-writer protocol, and supports the HLRC (Home-based Lazy Release Consistency) memory consistency model. To evaluate the performance of KDSM, we executed 4 scientific applications and compared the results to JIAJIA. The results showed that the performance of KDSM is almost equal to that of JIAJIA for 2 applications and better than that of JIAJIA for the other 2 applications.
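
Multiple-writer protocols are commonly built on a twin-and-diff mechanism: a writer snapshots a page before its first write, later sends only the changed bytes to the page's home, and the home merges diffs from several writers. The abstract does not say how KDSM implements its multiple-writer protocol, so the sketch below is an assumption about the general technique, with hypothetical function names.

```python
from typing import List, Tuple

Diff = List[Tuple[int, int]]   # (byte offset, new value)


def make_twin(page: bytearray) -> bytes:
    """Snapshot a page before the first local write."""
    return bytes(page)


def compute_diff(twin: bytes, page: bytearray) -> Diff:
    """List the bytes that changed since the twin was taken."""
    return [(i, page[i]) for i in range(len(page)) if page[i] != twin[i]]


def apply_diff(home: bytearray, diff: Diff) -> None:
    """Merge one writer's changes into the home copy of the page."""
    for offset, value in diff:
        home[offset] = value
```

When two nodes write disjoint parts of the same page, both diffs can be applied at the home in either order and no update is lost, which avoids the false-sharing cost of single-writer protocols.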