• Title/Summary/Keyword: Parallel data processing

Search Result 751, Processing Time 0.035 seconds

Multithread video coding processor for the videophone (동영상 전화기용 다중 스레드 비디오 코딩 프로세서)

  • 김정민;홍석균;이일완;채수익
    • Journal of the Korean Institute of Telematics and Electronics A
    • /
    • v.33A no.5
    • /
    • pp.155-164
    • /
    • 1996
  • The architecture of a programmable video codec IC is described that employs multiple vector processors in a single chip. The vector processors operate in parallel and communicate with one another through on-chip shared memories. A single scalar control processor schedules each vector processor independently to achieve real-tiem video coding with special vector instructions. With programmable interconnection buses, the proposed architecture performs multi-processing of tasks and data in video coding. Therefore, it can provide good parallelism as well as good programmability. especially, it can operate multithread video coding, which processes several independent image sequences simultaneously. We explain its scheduling, multithred video coding, and vector processor architectures. We implemented a prototype video codec with a 0.8um CMOS cell-based technology for the multi-standard videophone. This codec can execute video encoding and decoding simultaneously for the QCIF image at a frame rate of 30Hz.

  • PDF

Laser Drilling of High-Density Through Glass Vias (TGVs) for 2.5D and 3D Packaging

  • Delmdahl, Ralph;Paetzel, Rainer
    • Journal of the Microelectronics and Packaging Society
    • /
    • v.21 no.2
    • /
    • pp.53-57
    • /
    • 2014
  • Thin glass (< 100 microns) is a promising material from which advanced interposers for high density electrical interconnects for 2.5D chip packaging can be produced. But thin glass is extremely brittle, so mechanical micromachining to create through glass vias (TGVs) is particularly challenging. In this article we show how laser processing using deep UV excimer lasers at a wavelength of 193 nm provides a viable solution capable of drilling dense patterns of TGVs with high hole counts. Based on mask illumination, this method supports parallel drilling of up over 1,000 through vias in 30 to $100{\mu}m$ thin glass sheets. (We also briefly discuss that ultrafast lasers are an excellent alternative for laser drilling of TGVs at lower pattern densities.) We present data showing that this process can deliver the requisite hole quality and can readily achieve future-proof TGV diameters as small $10{\mu}m$ together with a corresponding reduction in pitch size.

A New Synchronization Scheme for Parallel Processing of Loop with Constant and Variable Dependence Distance (불변 및 가변 종속거리를 갖는 루프의 병렬처리를 위한 새로운 동기화 기법)

  • 이광형;황종선;박두순
    • Journal of the Korean Institute of Telematics and Electronics B
    • /
    • v.32B no.5
    • /
    • pp.693-701
    • /
    • 1995
  • In most application programs, loops usually comprise most of the computation in a program and are the most important source of parallelism. When loops are executed on multiprocessors, the cross iteration data dependences need to be enforced by synchronization between processors. Existing synchronization schemes have been studied mainly on the loop with constant dependence distance. When these schemes are applied to the loop with variable dependence distance, there exists lots of overhead by the use of unnecessary synchronization variables and execution of unuseful synchronization instructions. Even though there exist various variable synchronization schemes, they have a lot of run-time overhead to compute synchronization information. In this paper, we present a new synchronization scheme, Synch-Free/Synch-Hold for managing synchronization efficiently on the loop with constant and variable dependence distance.

  • PDF

Design of 5" True Color FED Driving System (5″ FED True Color 구동시스템 설계)

  • Shin, Hong-Jae;Choi, Chang-Woon;Kim, Jin;Choi, Jeong-Og;Kwon, Oh-Kyong
    • Proceedings of the IEEK Conference
    • /
    • 2000.06e
    • /
    • pp.65-68
    • /
    • 2000
  • We design a new driving system of 5" true color FED using current controlled PWM method. Further more, we successfully developed a 5" FED panel, which resolution is 320$\times$240(Color). When we design a 5" FED driving circuit, FED tips are modeled as R-C for circuit simulator of FED driving circuit. In Video data processing, parallel R, G, B input signals is processed independently, so duty ratio increase and no noise, high quality performance is achieved in display of 5" FED. The luminance is about 100cd/$m^2$, the anode power consumption Is 2.1W and total power of the driving system is 21.54W

  • PDF

Construction of Highly Integrated PC Cluster based on Windows XP (높은 집적도를 가지는 Windows XP PC 클러스터 구축)

  • Lee S.-K.;Shin J.-R.;Choi J.-Y.
    • 한국전산유체공학회:학술대회논문집
    • /
    • 2005.04a
    • /
    • pp.41-46
    • /
    • 2005
  • A new PC cluster was designed and constructed based on Windows XP Operating system. Primary target of the present design was the high node density per rack by using the general PC parts those are cost-effective and readily available in the market. Other major design points were system cooling and the convenient maintenance using standard PC parts. Presently 24 nodes per rack seems to be optimum considering the specification of the network switching device, system cooling and power supply, but 40 nodes can be accommodated within a single rack at maximum. Windows XP was selected as a high-performance computing environment considering the cost and the convenience in acquisition, maintenance and education. Both fast-Ethernet and Gigabit Ethernet network connection were tested and compared with previous data, especially for Linux doter using Myrinet. The result shows that there is no significant difference between the operating systems and the Fast-Ethernet and/or Gigabit Ethernet are good solution for the high-performance PC cluster considering the cost and performance.

  • PDF

PREDICTION OF THE AERODYNAMIC CHARACTERISTICS OF AN ORBITAL BLOCK OF A LAUNCH VEHICLE IN THE RAREFIED FLOW REGIME USING DSMC APPROACH (DSMC 해석기법을 이용한 희박유동 환경에서의 발사체 Orbital Block 공력특성 예측)

  • Kim, Young-Hoon;Ok, Ho-Nan;Choi, Young-In;Kim, In-Sun
    • 한국전산유체공학회:학술대회논문집
    • /
    • 2007.04a
    • /
    • pp.79-82
    • /
    • 2007
  • The aerodynamic coefficients of Apollo capsule are calculated using a DSMC solver, SMILE, and the results agree very well with the data predicted by NASA. The aerodynamic characteristics of an orbital block which operates at high altitudes in the free molecule regime are also predicted. For the nominal flow conditions, the predicted aerodynamic force is very small since the dynamic pressure is extremely low. And the additional aerodynamic coefficients for the analysis of the attitude control are presented as the angle of attack and the side slip angle vary from $+45^{\circ}\;to\;-45^{\circ}$ of the nominal angle.

  • PDF

An Application-Level Fault Tolerant Linear System Solver Using an MPMD Type Asynchronous Iteration (MPMD 방식의 비동기 연산을 이용한 응용 수준의 무정지 선형 시스템의 해법)

  • Park, Pil-Seong
    • The KIPS Transactions:PartA
    • /
    • v.12A no.5 s.95
    • /
    • pp.421-426
    • /
    • 2005
  • In a large scale parallel computation, some processor or communication link failure results in a waste of huge amount of CPU hours. However, MPI in its current specification gives the user no possibility to handle such a problem. In this paper, we propose an application-level fault tolerant linear system solver by using an MPMD-type asynchronous iteration, purely on the basis of the MPI standard without using any non-standard fault-tolerant MPI library.

A Study on The Performance Analysis of Partition Multistage Interconnection Network (분할된 다단상호접속망의 성능 분석에 관한 연구)

  • 김영선;최진규
    • The Journal of Korean Institute of Communications and Information Sciences
    • /
    • v.14 no.6
    • /
    • pp.675-685
    • /
    • 1989
  • The interconnection network is an integral part of parallel processing system. The multistage interconnection networks(MINs) have been the objects of intense research in recent years. In this paper, simulation techniques for circuit switchign MIN are extended to allow the performance evaluation of partitioned ADM/IADM network. Based on simulation data, the relationship between the netwrok performance, the partitioning scheme employed, and the conflict resolution strategies used within the network is enumerated. It is shown that IADM network coupled with the use of the hold strategy produces the best network operation in terms of RST (Request Service Time).

  • PDF

Concurrent Hash Table Optimized for NUMA System (NUMA 시스템에 최적화된 병렬 해시 테이블)

  • Choi, JaeYong;Jung, NaiHoon
    • Journal of Korea Game Society
    • /
    • v.20 no.5
    • /
    • pp.89-98
    • /
    • 2020
  • In MMO game servers, NUMA (Non-Uniform Memory Access) architecture is generally used to achieve high performance. Furthermore, such servers normally use hash tables as internal data structure which have constant time complexity for insert, delete, and search operations. In this study, we proposed a concurrent hash table optimized for NUMA system to make MMO game servers improve their performance. We tested our hash table on 4 socket NUMA system, and the hash table shows at most 100% speedup over another high-performance hash table.

A Study on EMG Signal Processing Using Linear Prediction (선형예측을 이용한 EMG 신호처리에 관한 연구)

  • ;邊潤植;李建基
    • Journal of the Korean Institute of Telematics and Electronics
    • /
    • v.24 no.2
    • /
    • pp.280-291
    • /
    • 1987
  • In this paper, the linear autoregressive model of EMG signal for four basic arm functions was presented and parameters for each function were estimated. The signal identification was carried out using function discrimination algorithm. It was validated that EMG signal was a widesense stationary process and the linear autoregressive model of EMG signal was constructed through approximating it to Gaussian process. It was confined that Levinson-Durbin algoridthm is a more appropriate one than the recursive least square method for parameter estimation of the linear model. Optimal function discrimination was acquired when sampling frequency was 500Hz and two electrodes were attached to bicep and tricep muscle, respectively. Parameter values were independent of variance and the number of minimum data for function discrimination was 200. Bayesian discrimination method turned out to be a better one than parallel filtering method for functional discrimination recognition.

  • PDF