• 제목/요약/키워드: perceptual quality

검색결과 344건 처리시간 0.03초

인터넷 쇼핑몰 이미지 워터마킹을 위한 HVS 설계 방법 (HVS design for Internet Shopping-Mall Image Watermarking)

  • 서용석;김원겸;이선화;황치정
    • 대한전자공학회:학술대회논문집
    • /
    • 대한전자공학회 2006년도 하계종합학술대회
    • /
    • pp.443-444
    • /
    • 2006
  • In this paper, a spatial-based perceptual watermarking considering human visual system (HVS) that is proposed for small-size images such as internet shopping-mall image. In our method, a multi-bit data can be embedded in luminance component of color images still keeping the perceptual quality of image. Experimental results demonstrated that watermarks can be strongly embedded while preserving a good fidelity.

  • PDF

연결발화에서 마비말화자의 음질 특성 (Voice Quality of Dysarthric Speakers in Connected Speech)

  • 서인효;성철재
    • 말소리와 음성과학
    • /
    • 제5권4호
    • /
    • pp.33-41
    • /
    • 2013
  • This study investigated the perceptual and cepstral/spectral characteristics of phonation and their relationships in dysarthria in connected speech. Twenty-two participants were divided into two groups; the eleven dysarthric speakers were paired with matching age and gender healthy control participants. A perceptual evaluation was performed by three speech pathologists using the GRBAS scale to measure the cepstrual/spectral characteristics of phonation between the two groups' connected speech. Correlations showed dysarthric speakers scored significantly worse (with a higher rating) with severities in G (overall dysphonia grade), B (breathiness), and S (strain), while the smoothed prominence of the cepstral peak (CPPs) was significantly lower. The CPPs were significantly correlated with the perceptual ratings, including G, B, and S. The utility of CPPs is supported by its high relationship with perceptually rated dysphonia severity in dysarthric speakers. The receiver operating characteristic (ROC) analysis showed that the threshold of 5.08 dB for the CPPs achieved a good classification for dysarthria, with 63.6% sensitivity and the perfect specificity (100%). Those results indicate the CPPs reliably distinguished between healthy controls and dysarthric speakers. However, the CPP frequency (CPP F0) and low-high spectral ratio (L/H ratio) were not significantly different between the two groups.

저전송률 오디오 부호화에서 음성 신호의 성능 개선을 위한 마스킹 임계값 적응기법 향상 (Enhanced Adjustment Strategy of Masking Threshold for Speech Signals in Low Bit-Rate Audio Coding)

  • 이창헌;강홍구
    • 한국음향학회지
    • /
    • 제29권1호
    • /
    • pp.62-68
    • /
    • 2010
  • 본 논문에서는 기존 마스킹 임계값 적응 방식을 개선하여 저전송률 오디오 부호화에서 음성 신호에 대한 성능을 향상시킨다. 포먼트 영역 검색 이후, 각 포먼트 영역의 평균 에너지와 해당 서브밴드의 에너지 비율을 이용하여 마스킹 임계값을 변화시킨다. 상대적으로 에너지가 큰 밴드에 대해서는 더 많은 양자화 노이즈가 허용되는 반면, 청각적으로 민감한 스펙트럴 밸리에서는 비트 할당을 높여 양자화 에러를 좀 더 줄인다. 이는 음성 부호화에서 널리 사용되는 지각 가중(perceptual weighting) 개념을 반영한 것이다. 객관적 음질 평가 결과, 제안한 알고리즘이 기존 방식에 비해 음성 신호에 대한 성능을 향상시킨다는 것을 확인하였다.

성악 전공 학생의 가창 시 음성의 음향학적 매개 변수와 지각적 매개 변수사이의 상관 연구 (A Correlation Study between Acoustic and Perceptual Parameters of the Singing Voice in Singing Students)

  • 조성미;이상욱;정옥란
    • 대한음성학회:학술대회논문집
    • /
    • 대한음성학회 2004년도 춘계 학술대회 발표논문집
    • /
    • pp.219-222
    • /
    • 2004
  • The purpose of this study was to determine a correlation between acoustic and perceptual parameters of the singing voice in singing students and compare them with the results with previous studies, and a more sensitive parameters in analyzing professional vocal usage. This study measured acoustic and perceptual parameters in 41 singing students. Digital audio recordings were made in sung vowels acoustic analysis. Each sample was judged by 1 experienced singing teacher and 1 voice pathologist on two semantic bipolar 7-point scales (ringing-dull, rich-thin). The results showed that SPP1 (p<0.01), SPP2 (p<0.01), and P1(p<0.01) had significant correlations with ringing and richness quality.

  • PDF

Foveated Frequency Sensitivity의 구현 (Desgin of Foveated Frequency Sensitivity)

  • ;;김원하
    • 한국방송∙미디어공학회:학술대회논문집
    • /
    • 한국방송공학회 2014년도 추계학술대회
    • /
    • pp.248-251
    • /
    • 2014
  • We develop the signal processing method for implementing the human perceptual variant on frequency and space. The human visual perceptual sensitivity varies as frequency components and the human perceivable resolution diminishes as the distances further from the eye-focused point. For realizing the frequency sensitivity, we developed the signal direction adaptive multiband energy scaling method to weight the frequency components. The low-pass filtering is designed on the developed energy scaling method for diminishing perceivable resolutions as the deviated distance from the eye-focused point. The developed method not only enhances the frequency components of image signals at the eye-focused region but also smoothes non-perceivable detailed image signals at non-focused regions. The proposed method is verified by the subjective and objective evaluations that it can improve human perceptual visual quality.

  • PDF

Perceptual weighting on English lexical stress by Korean learners of English

  • Goun Lee
    • 말소리와 음성과학
    • /
    • 제14권4호
    • /
    • pp.19-24
    • /
    • 2022
  • This study examined which acoustic cue(s) that Korean learners of English give weight to in perceiving English lexical stress. We manipulated segmental and suprasegmental cues in 5 steps in the first and second syllables of an English stress minimal pair "object". A total of 27 subjects (14 native speakers of English and 13 Korean L2 learners) participated in the English stress judgment task. The results revealed that native Korean listeners used the F0 and intensity cues in identifying English stress and weighted vowel quality most strongly, as native English listeners did. These results indicate that Korean learners' experience with these cues in L1 prosody can help them attend to these cues in their L2 perception. However, L2 learners' perceptual attention is not entirely predicted by their linguistic experience with specific acoustic cues in their native language.

여성 노인 합창단원의 합창단 유형에 따른 청지각적 음성평가(GRBAS) 및 음성관련 삶의 질(K-VRQOL) 비교 (A comparison of the perceptual-auditory voice quality evaluation (GRBAS) and voice-related quality of life (K-VRQOL) according to choir type of elderly women choir members)

  • 이현정;강빈나;김수지
    • 말소리와 음성과학
    • /
    • 제12권2호
    • /
    • pp.51-61
    • /
    • 2020
  • 본 연구의 목적은 음성의 청지각적 평가도구(GRBAS)와 음성관련 삶의 질(K-VRQOL) 척도를 통해 합창활동에 참여하는 여성 노인의 음성 특성과 음성관련 삶의 질을 비교하는 것이다. 연구 대상은 서울 및 부산 소재의 합창단에서 활동 중인 만 60세 이상의 여성 노인으로 총 77명이었다. 합창단은 참여 유형에 따라 합창단(Regular choir)과 찬양단(Church choir) 두 개의 집단으로 분류하였다. 청지각적 음성평가는 /a/ 모음을 발성하는 음성을 듣고 전문가가 청지각적 평가(GRBAS) 척도를 사용하여 평정하였다. 연구 결과, 합창활동 참여 유형에 따라 집단 간 차이를 비교했을 때 찬양단에서 활동하는 여성 노인에 비해 합창단에서 활동하는 여성 노인의 경우 주관적 음성 인식 수준에서 대화 시 음성 사용 만족도가 높은 것으로 나타났다. 또한, 음성관련 삶의 질(K-VRQOL) 척도의 신체 기능 영역에 해당하는 문항에서 만족도가 높은 것으로 분석되었다. 본 연구는 합창활동이 노년기 음성기능의 개선뿐 아니라 음성사용의 주관적 인식 수준을 향상시키는데 긍정적인 결과를 기대할 수 있을 것이라는 점을 확인하였으며, 노인 음성개선을 위한 체계적인 음악 중재 프로그램의 필요성을 시사하고 있다.

A Model-Based Image Steganography Method Using Watson's Visual Model

  • Fakhredanesh, Mohammad;Safabakhsh, Reza;Rahmati, Mohammad
    • ETRI Journal
    • /
    • 제36권3호
    • /
    • pp.479-489
    • /
    • 2014
  • This paper presents a model-based image steganography method based on Watson's visual model. Model-based steganography assumes a model for cover image statistics. This approach, however, has some weaknesses, including perceptual detectability. We propose to use Watson's visual model to improve perceptual undetectability of model-based steganography. The proposed method prevents visually perceptible changes during embedding. First, the maximum acceptable change in each discrete cosine transform coefficient is extracted based on Watson's visual model. Then, a model is fitted to a low-precision histogram of such coefficients and the message bits are encoded to this model. Finally, the encoded message bits are embedded in those coefficients whose maximum possible changes are visually imperceptible. Experimental results show that changes resulting from the proposed method are perceptually undetectable, whereas model-based steganography retains perceptually detectable changes. This perceptual undetectability is achieved while the perceptual quality - based on the structural similarity measure - and the security - based on two steganalysis methods - do not show any significant changes.

A Study on the Evaluation Method of Perceptual Contrast with CIECAM02

  • Chong, Jong-Ho;Lee, Seung-Bae;Lee, Sang-Myung;Choi, Young-Chul;Bae, Jae-Woo;Kim, Hun-Soo;Chung, Ho-Kyoon
    • 한국정보디스플레이학회:학술대회논문집
    • /
    • 한국정보디스플레이학회 2007년도 7th International Meeting on Information Display 제7권2호
    • /
    • pp.1661-1663
    • /
    • 2007
  • The contrast of display is one of the important specifications. Even if the contrast indicates luminance range which is a capability of the display and is greater in lower luminance or higher luminance, we consider that the greater contrast gets not the better performance. It is not the same value in human visual system. In practice, it is difficult to achieve the full dynamic range seen by human beings using electronic equipment. Therefore, we consider ambient condition and human perception to calculate perceptual contrast using the CIECAM02. In this paper, we propose perceptual contrast that is calculated using the brightness of CIECAM02.

  • PDF

Visual-Attention-Aware Progressive RoI Trick Mode Streaming in Interactive Panoramic Video Service

  • Seok, Joo Myoung;Lee, Yonghun
    • ETRI Journal
    • /
    • 제36권2호
    • /
    • pp.253-263
    • /
    • 2014
  • In the near future, traditional narrow and fixed viewpoint video services will be replaced by high-quality panorama video services. This paper proposes a visual-attention-aware progressive region of interest (RoI) trick mode streaming service (VA-PRTS) that prioritizes video data to transmit according to the visual attention and transmits prioritized video data progressively. VA-PRTS enables the receiver to speed up the time to display without degrading the perceptual quality. For the proposed VA-PRTS, this paper defines a cutoff visual attention metric algorithm to determine the quality of the encoded video slice based on the capability of visual attention and the progressive streaming method based on the priority of RoI video data. Compared to conventional methods, VA-PRTS increases the bitrate saving by over 57% and decreases the interactive delay by over 66%, while maintaining a level of perceptual video quality. The experiment results show that the proposed VA-PRTS improves the quality of the viewer experience for interactive panoramic video streaming services. The development results show that the VA-PRTS has highly practical real-field feasibility.