• Title/Summary/Keyword: Bi-prediction

Search Result 140, Processing Time 0.024 seconds

A Prediction Cost based Complexity Reduction Method for Bi-Prediction in High Efficiency Video Coding (HEVC) (HEVC의 양-예측을 위한 예측 비용 기반의 복잡도 감소 기법)

  • Kim, Jong-Ho;Lee, Ha-Hyun;Jun, Dong-San;Cho, Suk-Hee;Choi, Jin-Soo
    • Journal of Broadcast Engineering
    • /
    • v.17 no.5
    • /
    • pp.781-788
    • /
    • 2012
  • In HEVC, the fast search method is used for reducing the complexity of the motion prediction procedure. It is consisted of the sub-sampled SAD which reduce the complexity of Sum of Absolute Differences(SAD) calculation and the simplified bi-prediction method which reduce the iterations of the uni-prediction for the bi-prediction. The computational complexity is largely decreased by the fast search method but the coding gain is also decreased. In this paper, the simplified bi-prediction is extended to compensate the performance loss and the prediction cost based complexity reduction methods are also proposed to reduce the complexity burden by the extended bi-prediction method. A prediction cost based complexity reduction method is consisted of early termination method for the extended bi-prediction and the bi-prediction skipping method. Compare with HM 6.0 references S/W, the average 0.42% of BD-bitrate is decreased by both the extended bi-prediction method and the prediction cost based complexity reduction methods with negligible increasement of the complexity.

An SAD-Based Selective Bi-prediction Method for Fast Motion Estimation in High Efficiency Video Coding

  • Kim, Jongho;Jun, DongSan;Jeong, Seyoon;Cho, Sukhee;Choi, Jin Soo;Kim, Jinwoong;Ahn, Chieteuk
    • ETRI Journal
    • /
    • v.34 no.5
    • /
    • pp.753-758
    • /
    • 2012
  • As the next-generation video coding standard, High Efficiency Video Coding (HEVC) has adopted advanced coding tools despite the increase in computational complexity. In this paper, we propose a selective bi-prediction method to reduce the encoding complexity of HEVC. The proposed method evaluates the statistical property of the sum of absolute differences in the motion estimation process and determines whether bi-prediction is performed. A performance comparison of the complexity reduction is provided to show the effectiveness of the proposed method compared to the HEVC test model version 4.0. On average, 50% of the bi-prediction time can be reduced by the proposed method, while maintaining a negligible bit increment and a minimal loss of image quality.

Adaptive Combination of Intra/Inter Predictions in JM KTA Software (JM KTA 소프트웨어에서 인트라 및 인터 예측블록이 혼합된 코딩 방법)

  • Kim, Min-Jae;Seo, Chan-Won;Jang, Myung-Hun;Han, Jong-Ki
    • Journal of Broadcast Engineering
    • /
    • v.16 no.2
    • /
    • pp.190-206
    • /
    • 2011
  • We propose an adaptive combination scheme of intra and inter prediction modes, where uni-directional intra prediction, bi-directional intra prediction, and inter prediction method are adaptively selected in an EMB (extended macro block). For each EMB, after all inter blocks have been encoded and decoded, the reconstructed blocks are used as reference data for bi-directional intra prediction of other blocks. Whereas conventional intra coding scheme does not use the right and below side pixels of the current block as reference data, the proposed method uses those for bi-directional intra prediction mode. In this paper, we propose three advanced techniques; (a) filter design for bi-directional prediction, (b) adaptive coding order scheme which increases the chance to use the bi-directional intra prediction mode, (c) modification of syntax to represent coding order. The information for the coding order is informed to the decoder by using the modified syntax structure without adding any additional flag. The simulation results show that the proposed scheme reduces the BD-Rate by 0.5%, on average, compared to KTA.

A computational algorithm for F0 contour generation in Korean developed with prosodically labeled databases using K-ToBI system (K-ToBI 기호에 준한 F0 곡선 생성 알고리듬)

  • Lee YongJu;Lee Sook-hyang;Kim Jong-Jin;Go Hyeon-Ju;Kim Yeong-Il;Kim Sang-Hun;Lee Jeong-Cheol
    • MALSORI
    • /
    • no.35_36
    • /
    • pp.131-143
    • /
    • 1998
  • This study describes an algorithm for the F0 contour generation system for Korean sentences and its evaluation results. 400 K-ToBI labeled utterances were used which were read by one male and one female announcers. F0 contour generation system uses two classification trees for prediction of K-ToBI labels for input text and 11 regression trees for prediction of F0 values for the labels. Evaluation results of the system showed 77.2% prediction accuracy for prediction of IP boundaries and 72.0% prediction accuracy for AP boundaries. Information of voicing and duration of the segments was not changed for F0 contour generation and its evaluation. Evaluation results showed 23.5Hz RMS error and 0.55 correlation coefficient in F0 generation experiment using labelling information from the original speech data.

  • PDF

A Performance Improvement Method using Variable Break in Corpus Based Japanese Text-to-Speech System (가변 Break를 이용한 코퍼스 기반 일본어 음성 합성기의 성능 향상 방법)

  • Na, Deok-Su;Min, So-Yeon;Lee, Jong-Seok;Bae, Myung-Jin
    • The Journal of the Acoustical Society of Korea
    • /
    • v.28 no.2
    • /
    • pp.155-163
    • /
    • 2009
  • In text-to-speech systems, the conversion of text into prosodic parameters is necessarily composed of three steps. These are the placement of prosodic boundaries. the determination of segmental durations, and the specification of fundamental frequency contours. Prosodic boundaries. as the most important and basic parameter. affect the estimation of durations and fundamental frequency. Break prediction is an important step in text-to-speech systems as break indices (BIs) have a great influence on how to correctly represent prosodic phrase boundaries, However. an accurate prediction is difficult since BIs are often chosen according to the meaning of a sentence or the reading style of the speaker. In Japanese, the prediction of an accentual phrase boundary (APB) and major phrase boundary (MPB) is particularly difficult. Thus, this paper presents a method to complement the prediction errors of an APB and MPB. First, we define a subtle BI in which it is difficult to decide between an APB and MPB clearly as a variable break (VB), and an explicit BI as a fixed break (FB). The VB is chosen using the classification and regression tree, and multiple prosodic targets in relation to the pith and duration are then generated. Finally. unit-selection is conducted using multiple prosodic targets. In the MOS test result. the original speech scored a 4,99. while proposed method scored a 4.25 and conventional method scored a 4.01. The experimental results show that the proposed method improves the naturalness of synthesized speech.

Comparative Analysis of Prediction Performance of Aperiodic Time Series Data using LSTM and Bi-LSTM (LSTM과 Bi-LSTM을 사용한 비주기성 시계열 데이터 예측 성능 비교 분석)

  • Ju-Hyung Lee;Jun-Ki Hong
    • The Journal of Bigdata
    • /
    • v.7 no.2
    • /
    • pp.217-224
    • /
    • 2022
  • Since online shopping has become common, people can easily buy fashion goods anytime, anywhere. Therefore, consumers quickly respond to various environmental variables such as weather and sales prices. Therefore, utilizing big data for efficient inventory management has become very important in the fashion industry. In this paper, the changes in sales volume of fashion goods due to changes in temperature is analyzed via the proposed big data analysis algorithm by utilizing actual big data from Korean fashion company 'A'. According to the simulation results, it was confirmed that Bidirectional-LSTM(Bi-LSTM) compared to LSTM(Long Short-Term Memory) takes more simulation time about more than 50%, but the prediction accuracy of non-periodic time series data such as clothing product sales data is the same.

Prediction of small-scale leak flow rate in LOCA situations using bidirectional GRU

  • Hye Seon Jo;Sang Hyun Lee;Man Gyun Na
    • Nuclear Engineering and Technology
    • /
    • v.56 no.9
    • /
    • pp.3594-3601
    • /
    • 2024
  • It is difficult to detect a small-scale leakage in a nuclear power plant (NPP) quickly and take appropriate action. Delaying these procedures can have adverse effects on NPPs. In this paper, we propose leak flow rate prediction using the bidirectional gated recurrent unit (Bi-GRU) method to detect leakage quickly and accurately in small-scale leakage situations because large-scale leak rates are known to be predicted accurately. The data were acquired by simulating small loss-of-coolant accidents (LOCA) or small-scale leakage situations using the modular accident analysis program (MAAP) code. In addition, to improve prediction performance, data were collected by distinguishing the break sizes in more detail. In addition, the prediction accuracy was improved by performing both LOCA diagnosis and leak flow rate prediction in small LOCA situations. The prediction model developed using the Bi-GRU showed a superior prediction performance compared with other artificial intelligence methods. Accordingly, the accurate and effective prediction model for small-scale leakage situations proposed herein is expected to support operators in decision-making and taking actions.

Feature Selection with Ensemble Learning for Prostate Cancer Prediction from Gene Expression

  • Abass, Yusuf Aleshinloye;Adeshina, Steve A.
    • International Journal of Computer Science & Network Security
    • /
    • v.21 no.12spc
    • /
    • pp.526-538
    • /
    • 2021
  • Machine and deep learning-based models are emerging techniques that are being used to address prediction problems in biomedical data analysis. DNA sequence prediction is a critical problem that has attracted a great deal of attention in the biomedical domain. Machine and deep learning-based models have been shown to provide more accurate results when compared to conventional regression-based models. The prediction of the gene sequence that leads to cancerous diseases, such as prostate cancer, is crucial. Identifying the most important features in a gene sequence is a challenging task. Extracting the components of the gene sequence that can provide an insight into the types of mutation in the gene is of great importance as it will lead to effective drug design and the promotion of the new concept of personalised medicine. In this work, we extracted the exons in the prostate gene sequences that were used in the experiment. We built a Deep Neural Network (DNN) and Bi-directional Long-Short Term Memory (Bi-LSTM) model using a k-mer encoding for the DNA sequence and one-hot encoding for the class label. The models were evaluated using different classification metrics. Our experimental results show that DNN model prediction offers a training accuracy of 99 percent and validation accuracy of 96 percent. The bi-LSTM model also has a training accuracy of 95 percent and validation accuracy of 91 percent.

Coding Efficiency Improvement for Identical Motion Information of Bi-prediction Mode within the GPB Slcice of HEVC (HEVC의 GPB 슬라이스에서 양예측 모드의 동일 움직임 정보에 대한 성능 향상 방안)

  • Kim, Sang-Min;Kim, Kyung-Yong;Park, Gwang-Hoon;Kim, Hui-Yong;Lim, Sung-Chang;Lee, Jin-Ho
    • Journal of Broadcast Engineering
    • /
    • v.16 no.6
    • /
    • pp.1069-1072
    • /
    • 2011
  • This paper proposes the method which reduces complexity and improves coding efficiency by solving a problem of HEVC bi-prediction. In current HM 3.0, it is frequently occurred that L0 motion information and L1 motion information are identical in blocks which are bi-predicted. In this case, L1 motion vector is replaced by non-zero motion vector which belongs to first available neighbor block of current block. If they are still identical, prediction mode is replaced by uni-prediction. As an experimental result, in LD(Low-Delay) case, decoding time is reduced roughly 2%~5% and coding gain is roughly 0.3%~0.5% compared with the HM 3.0 anchor.

Application of Informer for time-series NO2 prediction

  • Hye Yeon Sin;Minchul Kang;Joonsung Kang
    • Journal of the Korea Society of Computer and Information
    • /
    • v.28 no.7
    • /
    • pp.11-18
    • /
    • 2023
  • In this paper, we evaluate deep learning time series forecasting models. Recent studies show that those models perform better than the traditional prediction model such as ARIMA. Among them, recurrent neural networks to store previous information in the hidden layer are one of the prediction models. In order to solve the gradient vanishing problem in the network, LSTM is used with small memory inside the recurrent neural network along with BI-LSTM in which the hidden layer is added in the reverse direction of the data flow. In this paper, we compared the performance of Informer by comparing with other models (LSTM, BI-LSTM, and Transformer) for real Nitrogen dioxide (NO2) data. In order to evaluate the accuracy of each method, mean square root error and mean absolute error between the real value and the predicted value were obtained. Consequently, Informer has improved prediction accuracy compared with other methods.