• Title/Summary/Keyword: Branch Prediction Accuracy

Search Result 55, Processing Time 0.018 seconds

Analysis on the Thermal Efficiency of Branch Prediction Techniques in 3D Multicore Processors (3차원 구조 멀티코어 프로세서의 분기 예측 기법에 관한 온도 효율성 분석)

  • Ahn, Jin-Woo;Choi, Hong-Jun;Kim, Jong-Myon;Kim, Cheol-Hong
    • The KIPS Transactions:PartA
    • /
    • v.19A no.2
    • /
    • pp.77-84
    • /
    • 2012
  • Speculative execution for improving instruction-level parallelism is widely used in high-performance processors. In the speculative execution technique, the most important factor is the accuracy of branch predictor. Unfortunately, complex branch predictors for improving the accuracy can cause serious thermal problems in 3D multicore processors. Thermal problems have negative impact on the processor performance. This paper analyzes two methods to solve the thermal problems in the branch predictor of 3D multi-core processors. First method is dynamic thermal management which turns off the execution of the branch predictor when the temperature of the branch predictor exceeds the threshold. Second method is thermal-aware branch predictor placement policy by considering each layer's temperature in 3D multi-core processors. According to our evaluation, the branch predictor placement policy shows that average temperature is $87.69^{\circ}C$, and average maximum temperature gradient is $11.17^{\circ}C$. And, dynamic thermal management shows that average temperature is $89.64^{\circ}C$ and average maximum temperature gradient is $17.62^{\circ}C$. Proposed branch predictor placement policy has superior thermal efficiency than the dynamic thermal management. In the perspective of performance, the proposed branch predictor placement policy degrades the performance by 3.61%, while the dynamic thermal management degrades the performance by 27.66%.

A Study on the Development of the Train Wind Rate Prediction Program in Tunnel of the Subway (지하철 터널내 열차풍 예측 프로그램 개발에 관한 연구)

  • Kim, J.R.;Choi, K.H.
    • Journal of Power System Engineering
    • /
    • v.3 no.1
    • /
    • pp.38-44
    • /
    • 1999
  • Subway is one of the most important transportation and its facilities are increased by the drift of population to cities in these days. But heat generation results from lighting, human and traffic increase in subway, half-closed space, gives uncomfortable sense to the subway passengers. Therefore, natural ventilation by piston effect is done to relieve uncomfortable sense. But train wind by piston effect gives uncomfortable sense to the subway passengers, too. So the numerical calculation of inflow and outflow amounts is important to predict thermal environment and reduce train wind. In case of actual survey of train wind in target station, the amount of train wind are about $3100m^3/train$ at the minimum, about $6000m^3/train$ at the maximum, about $4200m^3/train$ on average. When comparison between simulation for train wind prediction and actual survey for accuracy was done train wind prediction program showed similar results.

  • PDF

Prediction of maximum shear modulus (Gmax) of granular soil using empirical, neural network and adaptive neuro fuzzy inference system models

  • Hajian, Alireza;Bayat, Meysam
    • Geomechanics and Engineering
    • /
    • v.31 no.3
    • /
    • pp.291-304
    • /
    • 2022
  • Maximum shear modulus (Gmax or G0) is an important soil property useful for many engineering applications, such as the analysis of soil-structure interactions, soil stability, liquefaction evaluation, ground deformation and performance of seismic design. In the current study, bender element (BE) tests are used to evaluate the effect of the void ratio, effective confining pressure, grading characteristics (D50, Cu and Cc), anisotropic consolidation and initial fabric anisotropy produced during specimen preparation on the Gmax of sand-gravel mixtures. Based on the tests results, an empirical equation is proposed to predict Gmax in granular soils, evaluated by the experimental data. The artificial neural network (ANN) and Adaptive Neuro Fuzzy Inference System (ANFIS) models were also applied. Coefficient of determination (R2) and Root Mean Square Error (RMSE) between predicted and measured values of Gmax were calculated for the empirical equation, ANN and ANFIS. The results indicate that all methods accuracy is high; however, ANFIS achieves the highest accuracy amongst the presented methods.

Performance Improvement of Single Chip Multiprocessor using Concurrent Branch Execution (분기 동시 수행을 이용한 단일 칩 멀티프로세서의 성능 개선)

  • Lee, Seung-Ryul;Kim, Jun-Shik;Choi, Jae-Hyeok;Choi, Sang-Bang
    • Journal of the Institute of Electronics Engineers of Korea SD
    • /
    • v.44 no.2
    • /
    • pp.61-71
    • /
    • 2007
  • The instruction level parallelism, which has been used to improve the performance of processors, expose its limit. The change of a control flow by a branch miss prediction is one of the obstacles that restrict the instruction level parallelism. The single chip multiprocessors have been developed to utilize the thread level parallelism. However, we could not use the maximum performance of the single chip multiprocessor in case of executing the coded programs without considering the multi-thread. In order to overcome the two performance degradation factors, in this paper, we suggest the concurrent branch execution method that applies to the multi-path execution method at a single chip multiprocessor. We executes all two flows of the conditional branch using the idle core processor. Through this, we can improve the processor's efficiency with blocking the control flow termination by the branch instruction and reducing the idle time. We analyze the effects of concurrent branch execution proposed in this paper through the simulation. As a result of that, concurrent branch execution reduces about 20% of idle time and improves the maximum 10% of the branch prediction accuracy. We show that our scheme improves the overall performance of maximum 39% compared to the normal single chip multiprocessor and maximum 27% compared to the superscalar processor.

A Power-aware Branch Predictor for Embedded Processors (내장형 프로세서를 위한 저전력 분기 예측기 설계 기법)

  • Kim, Cheol-Hong;Song, Sung-Gun
    • The KIPS Transactions:PartA
    • /
    • v.14A no.6
    • /
    • pp.347-356
    • /
    • 2007
  • In designing a branch predictor, in addition to accuracy, microarchitects should consider power consumption, especially for embedded processors. This paper proposes a power-aware branch predictor, which is based on the gshare predictor, by accessing the BTB (Branch Target Buffer) only when the prediction from the PHT (Pattern History Table) is taken. To enable the selective access to the BTB, the PHT in the proposed branch predictor is accessed one cycle earlier than the traditional PHT to prevent the additional delay. As a side effect, two predictions from the PHT are obtained through one access to the PHT, which leads to more power savings. The proposed branch predictor reduces the power consumption, not requiring any additional storage arrays, not incurring additional delay (except just one MUX delay) and never harming accuracy. Simulation results show that the proposed predictor reduces the power consumption by $35{\sim}48%$ compared to the traditional predictor.

Application of support vector machine with firefly algorithm for investigation of the factors affecting the shear strength of angle shear connectors

  • Chahnasir, E. Sadeghipour;Zandi, Y.;Shariati, M.;Dehghani, E.;Toghroli, A.;Mohamad, E. Tonnizam;Shariati, A.;Safa, M.;Wakil, K.;Khorami, M.
    • Smart Structures and Systems
    • /
    • v.22 no.4
    • /
    • pp.413-424
    • /
    • 2018
  • The factors affecting the shear strength of the angle shear connectors in the steel-concrete composite beams can play an important role to estimate the efficacy of a composite beam. Therefore, the current study has aimed to verify the output of shear capacity of angle shear connector according to the input provided by Support Vector Machine (SVM) coupled with Firefly Algorithm (FFA). SVM parameters have been optimized through the use of FFA, while genetic programming (GP) and artificial neural networks (ANN) have been applied to estimate and predict the SVM-FFA models' results. Following these results, GP and ANN have been applied to develop the prediction accuracy and generalization capability of SVM-FFA. Therefore, SVM-FFA could be performed as a novel model with predictive strategy in the shear capacity estimation of angle shear connectors. According to the results, the Firefly algorithm has produced a generalized performance and be learnt faster than the conventional learning algorithms.

Analytical Models and their Performance Analysis of Superscalar Processors (수퍼스칼라 프로세서의 해석적 모델 및 성능 분석)

  • Kim, Hak-Jun;Kim, Seon-Mo;Choe, Sang-Bang
    • Journal of KIISE:Computer Systems and Theory
    • /
    • v.26 no.7
    • /
    • pp.847-862
    • /
    • 1999
  • 본 논문에서는 유한버퍼의(finite-buffered) 동기화된(synchronous) 큐잉모델(queueing model)을 이용하여 명령어들간의 병렬성, 분기명령의 빈도수, 분기예측(branch prediction)의 정확도, 캐쉬미스 등의 파라미터들을 고려하여 프로세서의 명령어 실행율을 예측하며 캐쉬의 성능과 파이프라인 성능간의 관계를 분석할 수 있는 새로운 해석적 모델을 제안하였다. 해석적 모델은 모델의 타당성을 검증하기 위해서 시뮬레이션을 수행하여 얻은 결과와 비교하였다. 해석적 모델과 시뮬레이션을 비교한 결과 대부분 10% 오차 내에서 일치하였다. 본 연구를 통하여 얻은 해석적 모델을 사용하면 시뮬레이션에서는 드러나지 않는 성능제약의 원인에 대한 명확한 규명이 가능하기 때문에 성능향상을 위한 설계자료를 얻을 수 있으며, 시스템 성능 밸런스를 위한 캐쉬와 비순차이슈 파이프라인 성능간의 관계에 대한 정확한 분석이 가능하다.Abstract This research presents a novel analytic model to predict the instruction execution rate of superscalar processors using the queuing model with finite-buffer size and synchronous operation mode. The proposed model is also able to analyze the performance relationship between cache and pipeline. The proposed model takes into account various kinds of architectural parameters such as instruction-level parallelism, branch probability, the accuracy of branch prediction, cache miss, and etc.. To prove the correctness of the model, we performed extensive simulations and compared the results with the analytic model. Simulation results showed that the proposed model can estimate the average execution rate accurately within 10% error compared to simulation results. The proposed model can explain the causes of performance bottleneck which cannot be uncovered by the simulation method only. The model is also able to show the effect of the cache miss on the performance of out-of-order issue superscalar processors, which can provide an valuable information in designing a balanced system.

Compressive strength prediction by ANN formulation approach for CFRP confined concrete cylinders

  • Fathi, Mojtaba;Jalal, Mostafa;Rostami, Soghra
    • Earthquakes and Structures
    • /
    • v.8 no.5
    • /
    • pp.1171-1190
    • /
    • 2015
  • Enhancement of strength and ductility is the main reason for the extensive use of FRP jackets to provide external confinement to reinforced concrete columns especially in seismic areas. Therefore, numerous researches have been carried out in order to provide a better description of the behavior of FRP-confined concrete for practical design purposes. This study presents a new approach to obtain strength enhancement of CFRP (carbon fiber reinforced polymer) confined concrete cylinders by applying artificial neural networks (ANNs). The proposed ANN model is based on experimental results collected from literature. It represents the ultimate strength of concrete cylinders after CFRP confinement which is also given in explicit form in terms of geometrical and mechanical parameters. The accuracy of the proposed ANN model is quite satisfactory when compared to experimental results. Moreover, the results of the proposed ANN model are compared with five important theoretical models proposed by researchers so far and considered to be in good agreement.

On DC-Side Impedance Frequency Characteristics Analysis and DC Voltage Ripple Prediction under Unbalanced Conditions for MMC-HVDC System Based on Maximum Modulation Index

  • Liu, Yiqi;Chen, Qichao;Li, Ningning;Xie, Bing;Wang, Jianze;Ji, Yanchao
    • Journal of Power Electronics
    • /
    • v.16 no.1
    • /
    • pp.319-328
    • /
    • 2016
  • In this study, we first briefly introduce the effect of circulating current control on the modulation signal of a modular multilevel converter (MMC). The maximum modulation index is also theoretically derived. According to the optimal modulation index analysis and the model in the continuous domain, different DC-side output impedance equivalent models of MMC with/without compensating component are derived. The DC-side impedance of MMC inverter station can be regarded as a series xR + yL + zC branch in both cases. The compensating component of the maximum modulation index is also related to the DC equivalent impedance with circulating current control. The frequency characteristic of impedance for MMC, which is observed from its DC side, is analyzed. Finally, this study investigates the prediction of the DC voltage ripple transfer between two-terminal MMC high-voltage direct current systems under unbalanced conditions. The rationality and accuracy of the impedance model are verified through MATLAB/Simulink simulations and experimental results.

Relationship between Plastochrone and Development Indices Estimated by a Nonparametric Rice Phenology Model

  • Lee, Byun-Woo;Nam, Taeg-Su;Yim, Young-Seon
    • KOREAN JOURNAL OF CROP SCIENCE
    • /
    • v.44 no.2
    • /
    • pp.149-153
    • /
    • 1999
  • Prediction of rice developmental stage is necessary for proper crop management and a prerequisite for growth simulation as well. The objectives of the present study were to find out the relationship between the plastochrone index(PI) and the developmental index(DVI) estimated by non-parametric phenology model which simulates the duration from seedling emergence(DVI=0) to heading(DVI=l) by employing daily mean air temperature and daylength as predictor variables, and to confirm the correspondency of developmental indice to panicle developmental stages based on this relationship. Four japonica rice cultivars, Kwanakbyeo, Sangpungbyeo, Dongjinbyeo, and Palgumbyeo which range from very early to very late in maturity, were grown by sowing directly in dry paddy field five times at an interval of two weeks. Data for seedling emergence, leaf appearance, differentiation stage of primary rachis branch and heading were collected. The non-parametric phenology model predicted well the duration from seedling emergence to heading with errors of less than three days in all sowings and cultivars. PI was calculated for every leaf appearance and related to the developmental index estimated for corresponding PI. The stepwise polynomial analysis produced highly significant square-rooted cubic or biquadratic equations depending on cultivars, and highly significant square-rooted biquadratic equation for pooled data across cultivars without any considerable reduction in accuracy compared to that for each cultivar. To confirm the applicability of this equation in predicting the panicle developmental stage, DVI at differentiation stage of primary rachis branch primordium was calculated by substituting PI with 82 corresponding to this stage, and the duration reaching this DVI from seedling emergence was estimated. The estimated duration revealed a good agreement with that observed in all sowings and cultivars. The deviations between the estimated and the observed were not greater than three days, and significant difference in accuracy was not found for predicting this developmental stage between those equations derived for each cultivar and for pooled data across all cultivars tested.

  • PDF