Search | Korea Science

Lightweight Quality Metric Based on No-Reference Bitstream for H.264/AVC Video

Kim, Yo-Han;Shin, Ji-Tae;Kim, Ho-Kyom
- KSII Transactions on Internet and Information Systems (TIIS)
- /
- v.6 no.5
- /
- pp.1388-1399
- /
- 2012
This paper proposes a quality metric based on a No-Reference Bitstream (NR-B) having least computational complexity for the assessment of the human-perceptual quality of H.264 encoded video. The proposed NR-B method performs a modeling of encoding distortion with three bit-stream information (i.e. frame-rate, motion-vector, and quantization-parameter) that can be directly extractable from the encoded bitstream and does not require additional complex processing of final pictures. From performance evaluation using 165 compressed video sequences, the experiment results show that the proposed metric has a higher correlation with subjective quality than is achieved with other comparable methods.
https://doi.org/10.3837/tiis.2012.05.008 인용 PDF KSCI

Brand Image: Analysis of Domestic Jeans Market through Benefit Segmentation and Perceptual Mapping(II) (혜택세분화와 인식도에 의한 진의류 브랜드 이미지 연구(II) -인식도에 의한 브랜드 이미지 분석-)

최일경;고애란
- Journal of the Korean Society of Clothing and Textiles
- /
- v.19 no.5
- /
- pp.699-712
- /
- 1995
The purpose of this study was 1) to identify the constructing factors of jeans brand image 2) to analyze the domestic jeans market using perceptual maps of three benefit segments based on stdy(I). The questionnaire consisted of brand preference, attribute of brand image and wearer image was selected from the previous studies or developed for this study. The subjects were 350 male and female university students who have purchased at least one of the nine jeans wear brand selected for the study. For statistical analysis, reliability test, factor analysis, MANOVA, and multiple regression were used. The results of this study were as follows: 1. Symbolism, quality, and economy were found out as constricting factors of brand image in the attribute dimensions, while innovative and active image were found out in the wearer image dimensions. 2. 9 Perceptual maps of attribute dimensions and 3 perceptual maps of wearer image dimensions were constructed and each ideal vector was drawn.
PDF

Fractal image compression with perceptual distortion measure (인지 왜곡 척도를 사용한 프랙탈 영상 압축)

문용호;박기웅;손경식;김윤수;김재호
- The Journal of Korean Institute of Communications and Information Sciences
- /
- v.21 no.3
- /
- pp.587-599
- /
- 1996
In general fractal imge compression, each range block is approximated by a contractive transform of the matching domain block under the mean squared error criterion. In this paper, a distortion measure reflecting the properties of human visual system is defined and applied to a fractal image compression. the perceptual distortion measure is obtained by multiplying the mean square error and the noise sensitivity modeled by using the background brightness and spatial masking. In order to compare the performance of the mean squared error and perceptual distortion measure, a simulation is carried out by using the 512*512 Lena and papper gray image. Compared to the results, 6%-10% compression ratio improvements under improvements under the same image quality are achieved in the perceptual distortion measure.
PDF

Characteristics of voice quality on clear versus casual speech in individuals with Parkinson's disease (명료발화와 보통발화에서 파킨슨병환자 음성의 켑스트럼 및 스펙트럼 분석)

Shin, Hee-Baek;Shim, Hee-Jeong;Jung, Hun;Ko, Do-Heung
- Phonetics and Speech Sciences
- /
- v.10 no.2
- /
- pp.77-84
- /
- 2018
The purpose of this study is to examine the acoustic characteristics of Parkinsonian speech, with respect to different utterance conditions, by employing acoustic/auditory-perceptual analysis. The subjects of the study were 15 patients (M=7, F=8) with Parkinson's disease who were asked to read out sentences under different utterance conditions (clear/casual). The sentences read out by each subject were recorded, and the recorded speech was subjected to cepstrum and spectrum analysis using Analysis of Dysphonia in Speech and Voice (ADSV). Additionally, auditory-perceptual evaluation of the recorded speech was conducted with respect to breathiness and loudness. Results indicate that in the case of clear speech, there was a statistically significant increase in the cepstral peak prominence (CPP), and a decrease in the L/H ratio SD (ratio of low to high frequency spectral energy SD) and CPP F0 SD values. In the auditory-perceptual evaluation, a decrease in breathiness and an increase in loudness were noted. Furthermore, CPP was found to be highly correlated to breathiness and loudness. This provides objective evidence of the immediate usefulness of clear speech intervention in improving the voice quality of Parkinsonian speech.
https://doi.org/10.13064/KSSS.2018.10.2.077 인용 PDF KSCI

Conversational Quality Measurement System for Mobile VoIP Speech Communication (모바일 VoIP 음성통신을 위한 대화음질 측정 시스템)

Cho, Jae-Man;Kim, Hyoung-Gook
- The Journal of The Korea Institute of Intelligent Transport Systems
- /
- v.10 no.4
- /
- pp.71-77
- /
- 2011
In this paper, we propose a conversational quality measurement (CQM) system for providing the objective QoS of high quality mobile VoIP voice telecommunication. For measuring the conversational quality, the VoIP telecommunication system is implemented in two smart phones connected with VoIP. The VoIP telecommunication system consists of echo cancellation, noise reduction, speech encoding/decoding, packet generation with RTP (Real-Time Protocol), jitter buffer control and POS (Play-out Schedule) with LC (loss Concealment). The CQM system is connected to a microphone and a speaker of each smart phone. The voice signal of each speaker is recorded and used to measure CE (Conversational Efficiency), CS (Conversational Symmetry), PESQ (Perceptual Evaluation of Speech Quality) and CE-CS-PESQ correlation. We prove the CQM system by measuring CE, CS and PESQ under various SNR, delay and loss due to IP network environment.
PDF KSCI

Performance comparison evaluation of speech enhancement using various loss functions (다양한 손실 함수를 이용한 음성 향상 성능 비교 평가)

Hwang, Seo-Rim;Byun, Joon;Park, Young-Cheol
- The Journal of the Acoustical Society of Korea
- /
- v.40 no.2
- /
- pp.176-182
- /
- 2021
This paper evaluates and compares the performance of the Deep Nerual Network (DNN)-based speech enhancement models according to various loss functions. We used a complex network that can consider the phase information of speech as a baseline model. As the loss function, we consider two types of basic loss functions; the Mean Squared Error (MSE) and the Scale-Invariant Source-to-Noise Ratio (SI-SNR), and two types of perceptual-based loss functions, including the Perceptual Metric for Speech Quality Evaluation (PMSQE) and the Log Mel Spectra (LMS). The performance comparison was performed through objective evaluation and listening tests with outputs obtained using various combinations of the loss functions. Test results show that when a perceptual-based loss function was combined with MSE or SI-SNR, the overall performance is improved, and the perceptual-based loss functions, even exhibiting lower objective scores showed better performance in the listening test.
https://doi.org/10.7776/ASK.2021.40.2.176 인용 PDF KSCI

S-JND based Perceptual Rate Control Algorithm of HEVC (S-JND 기반의 HEVC 주관적 율 제어 알고리즘)

Kim, JaeRyun;Sim, Donggyu
- Journal of Broadcast Engineering
- /
- v.22 no.3
- /
- pp.381-396
- /
- 2017
In this paper, the perceptual rate control algorithm is studied for HEVC (High Efficiency Video Coding) encoder with bit allocation based on perceived visual quality. This paper proposes perceptual rate control algorithm which could consider perceived quality for HEVC encoding method. The proposed rate control algorithm employs adaptive bit allocation for frame and CTU level using the perceived visual importance of each CTU. For performance evaluation of the proposed algorithm, the proposed algorithm was implemented on HM 16.9 and tested for sequences in Class B under the CTC (Common Test Condition) RA (Random Access) case. Experimental results show that the proposed method reduces the bitrate of 3.12%, and improves BD-PSNR of 0.08dB and bitrate accuracy of 0.07% on average. And also, we achieved MOS improvement of 0.16 with the proposed method, compared with the conventional method based on DSCQS (Double Stimulus Continuous Quality Scale).
https://doi.org/10.5909/JBE.2017.22.3.381 인용 PDF KSCI KPUBS

Sound Quality Evaluation of Vehicle Interior Noise Using Virtual Sound Quality Analysis (가상 음질 분석을 이용한 자동차 실내소음 음질 평가)

Kang, Sang-wook
- Transactions of the Korean Society for Noise and Vibration Engineering
- /
- v.27 no.1
- /
- pp.100-106
- /
- 2017
Sound quality engineering in automobile noise applications has become more and more important under the current quiet driving condition because various noise components masked under high noise level can be audible in quieter driving situation. Many researches have been carried out for subjective and objective assessments on automobile sounds and noises. In particular, the interior sound quality has been one of research fields that can give high-quality feature to automobile products. Although many works related to the interior sound quality have been progressed or completed in foreign countries, limited research results are presented in the country. In the study, subjective assessments are first performed with 20 subjects to select perceptual adjectives suitable to the assessment of car interior noises during acceleration. The selected perceptual adjectives are employed as the assessment scales to evaluate the acceleration noises in questionnaire procedures using 35 subjects, for which several noises are created through digital filtering of the acceleration noises measured. Mean values and standard deviations for subjective assessment scores obtained by the questionnaire procedures are calculated and their reliability are also verified. Finally, various statistical analyses such as the correlation analysis and the factor analysis are carried out to reveal the interrelationship between the assessment scales and the spectrum components of the acceleration noises.
https://doi.org/10.5050/KSNVE.2017.27.1.100 인용 PDF KSCI

A Perception-based Color Correction Method for Multi-view Images

Shao, Feng;Jiang, Gangyi;Yu, Mei;Peng, Zongju
- KSII Transactions on Internet and Information Systems (TIIS)
- /
- v.5 no.2
- /
- pp.390-407
- /
- 2011
Three-dimensional (3D) video technologies are becoming increasingly popular, as it can provide users with high quality and immersive experiences. However, color inconsistency between the camera views is an urgent problem to be solved in multi-view imaging. In this paper, a perception-based color correction method for multi-view images is proposed. In the proposed method, human visual sensitivity (VS) and visual attention (VA) models are incorporated into the correction process. Firstly, the VS property is used to reduce the computational complexity by removing these visual insensitive regions. Secondly, the VA property is used to improve the perceptual quality of local VA regions by performing VA-dependent color correction. Experimental results show that compared with other color correction methods, the proposed method can greatly promote the perceptual quality of local VA regions greatly and reduce the computational complexity, and obtain higher coding performance.
https://doi.org/10.3837/tiis.2011.02.009 인용 PDF KSCI

An Objective No-Reference Perceptual Quality Assessment Metric based on Temporal Complexity and Disparity for Stereoscopic Video

Ha, Kwangsung;Bae, Sung-Ho;Kim, Munchurl
- IEIE Transactions on Smart Processing and Computing
- /
- v.2 no.5
- /
- pp.255-265
- /
- 2013
3DTV is expected to be a promising next-generation broadcasting service. On the other hand, the visual discomfort/fatigue problems caused by viewing 3D videos have become an important issue. This paper proposes a perceptual quality assessment metric for a stereoscopic video (SV-PQAM). To model the SV-PQAM, this paper presents the following features: temporal variance, disparity variation in intra-frames, disparity variation in inter-frames and disparity distribution of frame boundary areas, which affect the human perception of depth and visual discomfort for stereoscopic views. The four features were combined into the SV-PQAM, which then becomes a no-reference stereoscopic video quality perception model, as an objective quality assessment metric. The proposed SV-PQAM does not require a depth map but instead uses the disparity information by a simple estimation. The model parameters were estimated based on linear regression from the mean score opinion values obtained from the subjective perception quality assessments. The experimental results showed that the proposed SV-PQAM exhibits high consistency with subjective perception quality assessment results in terms of the Pearson correlation coefficient value of 0.808, and the prediction performance exhibited good consistency with a zero outlier ratio value.
PDF

Search Result 344, Processing Time 0.041 seconds

이메일무단수집거부

이용약관

제 1 장 총칙

제 2 장 이용계약의 체결

제 3 장 계약 당사자의 의무

제 4 장 서비스의 이용

제 5 장 계약 해지 및 이용 제한

제 6 장 손해배상 및 기타사항

Detail Search

Image Search (β)