Search | Korea Science

Comparison of feature parameters for emotion recognition using speech signal (음성 신호를 사용한 감정인식의 특징 파라메터 비교)

김원구
- Journal of the Institute of Electronics Engineers of Korea SP, v.40 no.5, pp.371-377, 2003
This paper compares feature parameters for emotion recognition from speech signals. A corpus of emotional speech, recorded and then classified by emotion through subjective evaluation, was used to build statistical feature vectors (the average, standard deviation, and maximum of pitch and energy) and phonetic features such as MFCC parameters. To evaluate the feature parameters, a speaker- and context-independent emotion recognition system was constructed. In the experiments, pitch and energy parameters and their derivatives were used as prosodic information, and MFCC parameters and their derivatives were used as phonetic information. Results from a vector-quantization-based emotion recognition system showed that the system using MFCC parameters and their derivatives outperformed the one using pitch and energy parameters.
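The statistical prosodic features named in this abstract (average, standard deviation, and maximum of pitch and energy) can be illustrated with a minimal sketch. The function names, the use of the population standard deviation, and the sample values are assumptions made for illustration; the abstract does not specify how the authors computed or ordered the features.

```python
import statistics

def contour_stats(values):
    """Return (mean, standard deviation, maximum) of one per-frame contour.

    The population standard deviation is an assumption; the abstract
    does not say which estimator was used.
    """
    return (statistics.fmean(values), statistics.pstdev(values), max(values))

def prosodic_feature_vector(pitch_hz, energy):
    """Concatenate pitch and energy statistics into one 6-element tuple."""
    return contour_stats(pitch_hz) + contour_stats(energy)

# Hypothetical per-frame values for one utterance (illustrative only).
pitch_hz = [100.0, 120.0, 140.0]
energy = [0.2, 0.4, 0.6]
features = prosodic_feature_vector(pitch_hz, energy)
```

Each utterance is reduced to a fixed-length vector regardless of its duration, which is what makes such statistics convenient inputs for a classifier.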

Dynamic Control of DCT Coefficients for Image Quality Improvement (화질 개선을 위한 DCT 계수의 동적 제어)

Im, Yong-Soon;Lee, Keun-Young
- Journal of the Korean Institute of Telematics and Electronics S, v.36S no.7, pp.116-123, 1999
of a block if the method uses the quantization parameter depending on the bitrate control, and consequently it influences the image quality of the video. In this paper, we propose a new method with three steps: calculating an average value (Average of Sum, AS) of the pixels in each block of an image; obtaining the average of the differences between each pixel and the AS (Differential Averaging Block Pixels, DABP); and finally deriving improved coefficient values from the DABP and the DCT coefficients. Simulation results show that the proposed method can improve the quality of moving pictures.
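The first two steps of the method (AS and DABP) can be sketched as follows. This is a minimal sketch under stated assumptions: the differences are taken as absolute values (a signed average of deviations from the mean would always be zero), and the function names and the sample block are hypothetical. The third step, which combines the DABP with the DCT coefficients, is not specified in the abstract and is not shown.

```python
def block_as(block):
    """Average of Sum (AS): the mean pixel value of one block."""
    pixels = [p for row in block for p in row]
    return sum(pixels) / len(pixels)

def block_dabp(block):
    """Differential Averaging Block Pixels (DABP).

    Assumption: the mean of the absolute differences between each
    pixel and the block's AS.
    """
    avg = block_as(block)
    pixels = [p for row in block for p in row]
    return sum(abs(p - avg) for p in pixels) / len(pixels)

# Hypothetical 2x2 block (real codecs typically use 8x8 blocks).
block = [[10, 20], [30, 40]]
as_value = block_as(block)      # mean of 10, 20, 30, 40
dabp_value = block_dabp(block)  # mean of |10-25|, |20-25|, |30-25|, |40-25|
```

Under this reading, a flat block has a DABP of zero and a highly textured block has a large one, so the value acts as a simple activity measure for the block.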

Joint Spatial-Temporal Quality Improvement Scheme for H.264 Low Bit Rate Video Coding via Adaptive Frameskip

Cui, Ziguan;Gan, Zongliang;Zhu, Xiuchang
- KSII Transactions on Internet and Information Systems (TIIS), v.6 no.1, pp.426-445, 2012
Conventional rate control (RC) schemes for H.264 video coding usually regulate the output bit rate to match the channel bandwidth by adjusting the quantization parameter (QP) at a fixed full frame rate. Passive frame skipping to avoid buffer overflow usually occurs when scene changes or high motion exist in video sequences, especially at low bit rates, which degrades spatial-temporal quality and causes a jerky effect. In this paper, an active content-adaptive frame skipping scheme is proposed instead of passive methods. It skips subjectively trivial frames, identified by a structural similarity (SSIM) measurement between the original frame and the frame interpolated via a motion vector (MV) copy scheme. The bits saved from skipped frames are allocated to coded key frames to enhance their spatial quality, and the skipped frames are recovered at the decoder side from adjacent key frames using the MV copy scheme to maintain a constant frame rate. Experimental results show that the proposed active SSIM-based frameskip scheme achieves better and more consistent spatial-temporal quality in both the objective (PSNR) and subjective (SSIM) sense, with low complexity, compared to the classic fixed frame rate control method JVT-G012 and a prior objective-metric-based frameskip method.
https://doi.org/10.3837/tiis.2012.01.024

Implementation of Video Watermarking and Transcoding for High Compression and Copyright protection based on Directshow Environment (다이렉트쇼 환경 기반에서 고압축과 저작권 보호를 위한 비디오 트랜스 코딩과 워터마킹 구현)

Yong-Jae Jeong;Tae-Il Jung;Jong-Nam Kim;Kwang-Seok Moon
- Proceedings of the Korea Information Processing Society Conference, 2008.11a, pp.1500-1503, 2008
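With the advent of high-compression video processing techniques such as H.264, video transcoding from existing compression formats such as MPEG2 to H.264 is increasing, but the illegal distribution of high-compression video content, both online and offline, has become a problem. In this paper, we implement video transcoding and watermarking for high compression and copyright protection in a DirectShow environment. The proposed method uses DirectShow filters to transcode MPG and WMV video to H.264 and, at the same time, implements robust watermarking for copyright protection using the spatial-domain characteristics of the video. Experimental results confirmed that, when transcoding MPG and WMV to H.264 with the H.264 QP (quantization parameter) set to 15 and the inter-frame repetition set to 10 frames, the watermark inserted for copyright protection was detected at an average rate of 99%, and that the time delay caused by watermark insertion during transcoding was 5.7% of the total transcoding time. The proposed method could be used in digital TV broadcasting, IPTV, and DVD businesses that require transcoding software with a copyright insertion function.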
https://doi.org/10.3745/PKIPS.y2008m011a.1500

The Development of Dynamic Forecasting Model for Short Term Power Demand using Radial Basis Function Network (Radial Basis 함수를 이용한 동적 - 단기 전력수요예측 모형의 개발)

Min, Joon-Young;Cho, Hyung-Ki
- The Transactions of the Korea Information Processing Society, v.4 no.7, pp.1749-1758, 1997
This paper presents a dynamic forecasting model for short-term power demand based on the Radial Basis Function Network and Pal's GLVQ algorithm. Radial Basis Function methods are often compared with the backpropagation-trained feed-forward network, which is the most widely used neural network paradigm. The Radial Basis Function Network is a feed-forward neural network with a single hidden layer. Each node of the hidden layer has a parameter vector called a center, which is determined by a clustering algorithm. Classical approaches to clustering include those of Hartigan (K-means algorithm), Kohonen (Self-Organized Feature Maps: SOFM, and Learning Vector Quantization: LVQ model), and Carpenter and Grossberg (ART-2 model). In this model, the first step organizes the load patterns into two clusters using Pal's GLVQ clustering algorithm. GLVQ is used because it can classify the patterns better than other algorithms. The second step forecasts hourly load patterns with a radial basis function network constructed with two hidden nodes, which are determined from the GLVQ cluster centers found in the first step. The model was applied to forecast the hourly loads on Mar. 4th, Jun. 4th, Jul. 4th, Sep. 4th, and Nov. 4th, 1995, after training on data from Mar. 1st to 3rd, Jun. 1st to 3rd, Jul. 1st to 3rd, Sep. 1st to 3rd, and Nov. 1st to 3rd, 1995, respectively. In the experiments, the average absolute error of one-hour-ahead forecasts on actual utility data was 1.3795%.
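As a rough illustration of the second step, the forward pass of a radial basis function network with two hidden nodes can be sketched as below. The Gaussian basis function, the shared width, and all numeric values are assumptions; the abstract does not state the basis function or the trained parameters. The error measure is likewise assumed to be a mean absolute percentage error.

```python
import math

def rbf_forward(x, centers, width, weights, bias=0.0):
    """Output of an RBF network: a weighted sum of Gaussian hidden nodes.

    Each hidden node responds according to the distance between the
    input x and that node's center (assumed Gaussian response).
    """
    out = bias
    for center, weight in zip(centers, weights):
        dist_sq = sum((xi - ci) ** 2 for xi, ci in zip(x, center))
        out += weight * math.exp(-dist_sq / (2.0 * width ** 2))
    return out

def mean_abs_pct_error(actual, forecast):
    """Mean absolute percentage error, in percent."""
    errors = [abs(a - f) / abs(a) for a, f in zip(actual, forecast)]
    return 100.0 * sum(errors) / len(errors)

# Hypothetical centers (e.g. two cluster centers) and weights.
centers = [[0.0, 0.0], [3.0, 4.0]]
y = rbf_forward([0.0, 0.0], centers, width=1.0, weights=[2.0, 1.0])
mape = mean_abs_pct_error([100.0, 200.0], [110.0, 190.0])
```

An input that coincides with one center activates that node fully while the distant node contributes almost nothing, which is why the choice of centers from the clustering step matters.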

Acquisition Behavior of a Class of Digital Phase-Locked Loops (Digital Phase-Locked Loops의 위상 포착 관정에 관한 연구)

안종구;은종관
- Journal of the Korean Institute of Telematics and Electronics, v.19 no.5, pp.55-67, 1982
In this paper, new results on the acquisition behavior of a class of first- and second-order digital phase-locked loops (DPLL), originally proposed by Reddy and Gupta, are presented for the noise-free case. It has been found that the number of quantization levels L and the number of phase error states N play important roles in acquisition. For a given L-level quantizer, as N increases, the acquisition time increases and the lock range decreases, while the deviation of the steady-state phase error decreases. When L increases, the acquisition time decreases and the lock range increases, while the steady-state phase error is little affected. In addition, the effects of a loop filter on acquisition have been considered. A larger filter parameter value gives a shorter acquisition time and a larger lock range, but the deviation of the steady-state phase error increases. The analytical results have been verified by computer simulation.

A Video Traffic Model based on the Shifting-Level Process (Part I : Modeling and the Effects of SRD and LRD on Queueing Behavior) (Shifting-Level Process에 기반한 영상트래픽 모델 (1부: 모델링과 대기체계 영향 분석))

안희준;강상혁;김재균
- The Journal of Korean Institute of Communications and Information Sciences, v.24 no.10B, pp.1971-1978, 1999
In this paper, we study the effects of long-range dependence (LRD) in VBR video traffic on a queueing system. The paper consists of Parts I and II. In Part I, we present an LRD video traffic model based on the shifting-level (SL) process. We observe that the ACF of an empirical video trace is accurately captured by the shifting-level process with compound correlation (SLCC): an exponential function in the short range and a hyperbolic function in the long range. We present an accurate parameter matching algorithm for video traffic. In Part II, we offer a queueing analysis of SL/D/1/K called the ‘quantization reduction method’. Comparing the queueing performance of the DAR(1) model and the SLCC with that of a real video trace, we identify the effects of SRD and LRD in VBR video traffic on queueing performance. Simulation results show that Markovian models can estimate network performance fairly accurately under a moderate traffic load and buffer condition, whereas LRD may have a significant effect on queueing behavior under a heavy traffic load and large buffer condition.

Aggregated Encoder Control Exploiting Interlayer Statistical Characteristics for Advanced Terrestrial-DMB (지상파 DMB 고도화망에서 계층간 통계적 특성을 이용한 통합 부호기 제어)

Kim, Jin-Soo;Park, Jong-Kab;Seo, Kwang-Deok;Kim, Jae-Gon
- Journal of the Korea Institute of Information and Communication Engineering, v.13 no.8, pp.1513-1526, 2009
The SVC (Scalable Video Coding) scheme can be used effectively to reduce redundancy and improve coding efficiency, but it has very high computational complexity. This problem must be overcome to accelerate the standardization and commercialization of the Advanced Terrestrial-DMB service. To this end, we propose an efficient aggregated encoder control algorithm that performs better than the conventional control scheme. Computer simulation results show that the proposed scheme performs up to about 0.3 dB better than the conventional scheme. Additionally, based on this control scheme, we propose a fast mode decision method that constrains redundant coding modes using the statistical properties of the quantization parameter in the spatial scalable encoder. Computer simulations show that the proposed control schemes reduce the computational burden by up to 12% compared to the conventional scheme, while keeping the objective visual quality very high.
https://doi.org/10.6109/JKIICE.2009.13.8.1513

Realtime No-Reference Quality-Assessment Over Packet Video Networks (패킷 비디오 네트워크상의 실시간 무기준법 동영상 화질 평가방법)

Sung, Duk-Gu;Kim, Yo-Han;Hana, Jung-Hyun;Shin, Ji-Tae
- Journal of Broadcast Engineering, v.14 no.4, pp.387-396, 2009
No-reference video-quality assessment metrics fall into two kinds: those based on the decoded pixel domain and those based on the bitstream. Traditional full-/reduced-reference methods are difficult to deploy for realtime video transmission because of problems with additional data, complexity, and assessment accuracy. This paper presents a simple and highly accurate no-reference video-quality assessment method for realtime video transmission. The proposed method uses the quantization parameter, motion vectors, and transmission error information. To evaluate the proposed algorithm, we perform a subjective video-quality test with the ITU-T P.910 Absolute Category Rating (ACR) method and compare the algorithm's output with the subjective results. Experimental results show that the proposed quality metric has a high correlation (85%) with subjective quality assessment.
https://doi.org/10.5909/JBE.2009.14.4.387

Application and Verification of Time-Division Watermarking Algorithm in H.264 (시간 분할 워터마킹 알고리즘의 H.264 적용 및 검증)

Youn, Jin-Seon;Choi, Jun-Rim
- Journal of the Institute of Electronics Engineers of Korea SD, v.45 no.6, pp.68-73, 2008
In this paper, we propose a watermarking algorithm called TDWA (Time-Division Watermarking Algorithm) and apply it to the H.264 video coding standard, specifically to an H.264 baseline profile codec. The proposed algorithm inserts a watermark into the spatial domain of several frames, which makes it easy to insert strong and invisible watermarks into the original pictures. To verify the algorithm, we design a hardware core using Verilog-HDL and Excalibur for the JM 8.7 code with hardware and software co-simulation. The verification shows that the PSNR between watermarked and original pictures is more than 60 dB, and that more than 80% of the watermark is preserved after H.264/AVC encoding with a quantization parameter of 28 in the baseline profile.
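For context on the 60 dB figure, PSNR between an original and a watermarked picture is conventionally computed from the mean squared error. The sketch below assumes 8-bit samples (peak value 255) and flat lists of pixel values; the function name and sample data are illustrative and are not taken from the paper.

```python
import math

def psnr(original, distorted, peak=255.0):
    """Peak signal-to-noise ratio in dB between two equal-length pixel lists."""
    if not original or len(original) != len(distorted):
        raise ValueError("inputs must be non-empty and of equal length")
    mse = sum((a - b) ** 2 for a, b in zip(original, distorted)) / len(original)
    if mse == 0:
        return math.inf  # identical pictures
    return 10.0 * math.log10(peak ** 2 / mse)

# Hypothetical 4-pixel pictures: maximally different, then identical.
worst_case = psnr([0, 0, 0, 0], [255, 255, 255, 255])
identical = psnr([12, 34, 56, 78], [12, 34, 56, 78])
```

With a peak of 255, a PSNR of 60 dB corresponds to a mean squared error of about 0.065, that is, well under one gray level of average distortion per pixel.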

Search Result 145, Processing Time 0.026 seconds
