• Title/Summary/Keyword: perceptual contrast


Robust Image Watermarking via Perceptual Structural Regularity-based JND Model

  • Wang, Chunxing;Xu, Meiling;Wan, Wenbo;Wang, Jian;Meng, Lili;Li, Jing;Sun, Jiande
    • KSII Transactions on Internet and Information Systems (TIIS) / v.13 no.2 / pp.1080-1099 / 2019
  • A better tradeoff between robustness and invisibility can be achieved by incorporating a just noticeable difference (JND) model into a quantization-based watermarking scheme. The JND model is usually used to describe the perceptual characteristics of the human visual system (HVS). According to cognitive science research, the HVS can adaptively extract the structural features of an image. However, the existing JND models used in watermarking schemes do not consider these structural features. Therefore, a novel JND model is proposed, which includes three aspects: the contrast sensitivity function, luminance adaptation, and contrast masking (CM). In this model, the CM effect is modeled by analyzing direction features and texture complexity, which agrees with human visual perception characteristics, and a new method of measuring edge intensity makes the model match well with the spread transform dither modulation (STDM) watermarking framework. Compared with other existing JND models, the proposed JND model based on structural regularity is more efficient and applicable in the STDM watermarking scheme. The experimental results show that the proposed scheme performs better than other watermarking schemes based on existing JND models.
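
For readers unfamiliar with STDM, the following is a minimal sketch of generic spread transform dither modulation, not the scheme proposed in the paper. It assumes a fixed quantization step `delta`, whereas a JND-based scheme would derive the step from the perceptual model. The function names (`stdm_embed`, `stdm_detect`) and the sample values are hypothetical.

```python
import math

def unit(v):
    """Scale v to unit Euclidean length."""
    norm = math.sqrt(sum(c * c for c in v))
    return [c / norm for c in v]

def quantize(value, delta, dither):
    """Return the nearest point of the lattice {k * delta + dither}."""
    return delta * round((value - dither) / delta) + dither

def stdm_embed(host, direction, bit, delta):
    """Embed one bit by quantizing the projection of host onto direction.

    Bit 0 uses the lattice with dither 0, bit 1 the lattice shifted by
    delta / 2. Only the component along the direction is changed.
    """
    u = unit(direction)
    proj = sum(h * c for h, c in zip(host, u))
    target = quantize(proj, delta, 0.0 if bit == 0 else delta / 2)
    return [h + (target - proj) * c for h, c in zip(host, u)]

def stdm_detect(received, direction, delta):
    """Decode the bit whose lattice lies closest to the projection."""
    u = unit(direction)
    proj = sum(r * c for r, c in zip(received, u))
    d0 = abs(proj - quantize(proj, delta, 0.0))
    d1 = abs(proj - quantize(proj, delta, delta / 2))
    return 0 if d0 <= d1 else 1

# Hypothetical host coefficients and spreading direction (assumptions).
host = [52.0, 55.0, 61.0, 66.0]
direction = [1.0, -1.0, 1.0, -1.0]
marked = stdm_embed(host, direction, 1, delta=8.0)
decoded = stdm_detect(marked, direction, delta=8.0)
```

A larger `delta` tolerates more noise along the projection direction but distorts the host more, which is the robustness versus invisibility tradeoff the abstract refers to.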

Executive function and Korean children's stop production

  • Eun Jong Kong;Hyunjung Lee;Jeffrey J. Holliday
    • Phonetics and Speech Sciences / v.15 no.3 / pp.45-52 / 2023
  • Previous studies have established a role for cognitive differences in explaining variability in speech processing across individuals. In the case of perceptual cue weighting in the context of a sound change, studies have produced conflicting results regarding the relationship between executive function and the use of redundant cues. The current study aimed to explore this relationship in acoustic cue weighting during speech production. Forty-one Korean-speaking children read a list of stop-initial words and completed two tests that assess executive function, i.e., Dimensional Change Card Sorting (DCCS) and digit n-back. Voice onset time (VOT) and fundamental frequency (F0) were measured in each word, and analyses were carried out to determine the extent to which children's executive function predicted their use of both informative and less informative cues to the three pairs comprising the Korean three-way stop laryngeal contrast. No evidence was found for a relationship between cognitive ability and acoustic cue weighting in production, which is at odds with previous, albeit conflicting, results for speech perception. While this result may be due to the lack of task demands in the production task used here, it nevertheless expands the empirical ground upon which future work in this area may proceed.

Morphological Categorization and its Role in Design Method

  • Kwun, Joon-Bum;Whang, Hee-Joon
    • Architectural research / v.13 no.4 / pp.11-18 / 2011
  • The first attempt in architectural design theory to consider the perceptual and metaphysical dimensions separately in a fully modern scientific manner, in contrast to the Renaissance idea, was made by Claude Perrault, who emphasized cognitive factors as an important scientific human issue in building design in his 1683 book "Ordonnance". Since the Design Method movement of the 1960s, many elaborate works have been introduced that attempt to reveal the mysterious design process through a set of rational approaches. These pioneering and challenging efforts to rationalize the design process have mostly relied on cultural issues, whether they take a qualitative or a quantitative stance. Today's computer-generated free-form architecture, however, seems unaware of those lessons learned from the past. Therefore, this study conducted extensive research exploring morphological building forms together with cultural issues, in order to supply the characteristics that are missing or lacking in today's trends in building design.

A DCT-Based Visually Adaptive Quantization (DCT 기반의 시각 적응적 양자화 방법에 관한 연구)

  • Park, Sung-Chan;Kim, Jung-Hyun;Lee, Guee-Sang
    • The Transactions of the Korean Institute of Electrical Engineers D / v.50 no.7 / pp.332-338 / 2001
  • A visually adaptive quantization method for DCT-based images, based on the Human Visual System (HVS), is proposed. This approach uses the spatial masking characteristic of the HVS to obtain a higher compression ratio with relatively small degradation in image quality. The HVS is relatively insensitive to edge areas, so high-complexity areas are quantized coarsely, in contrast to the fine quantization of low-complexity areas. The complexity of an area is estimated by the variance of the DCT coefficients of the image. Experimental results demonstrate the performance of the proposed method, and the resulting images show little difference from the original image in subjective perception.
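
The idea in this abstract can be illustrated with a rough sketch: estimate block complexity from the variance of the DCT coefficients and choose a coarser quantization step for complex blocks. The abstract does not give the actual rule, so the use of AC coefficients only, the threshold, the base step, and the coarsening factor below are all assumptions chosen for illustration.

```python
import math

N = 8  # block size; 8 x 8 is assumed here, as in common DCT codecs

def dct2(block):
    """Orthonormal 2-D DCT-II of an N x N block given as a list of rows."""
    def alpha(k):
        return math.sqrt(1.0 / N) if k == 0 else math.sqrt(2.0 / N)
    out = [[0.0] * N for _ in range(N)]
    for u in range(N):
        for v in range(N):
            s = 0.0
            for i in range(N):
                for j in range(N):
                    s += (block[i][j]
                          * math.cos((2 * i + 1) * u * math.pi / (2 * N))
                          * math.cos((2 * j + 1) * v * math.pi / (2 * N)))
            out[u][v] = alpha(u) * alpha(v) * s
    return out

def ac_variance(coeffs):
    """Variance of the AC coefficients (everything except the DC term)."""
    ac = [coeffs[u][v] for u in range(N) for v in range(N) if (u, v) != (0, 0)]
    mean = sum(ac) / len(ac)
    return sum((c - mean) ** 2 for c in ac) / len(ac)

def adaptive_step(coeffs, base_step=16.0, threshold=100.0, coarse_factor=2.0):
    """Pick a coarser step for high-complexity blocks (hypothetical rule)."""
    if ac_variance(coeffs) > threshold:
        return base_step * coarse_factor
    return base_step

def quantize(coeffs, step):
    """Uniform quantization of every coefficient with a single step size."""
    return [[round(c / step) for c in row] for row in coeffs]

# A flat block (low complexity) and a checkerboard block (high complexity).
flat = [[128.0] * N for _ in range(N)]
checker = [[255.0 if (i + j) % 2 else 0.0 for j in range(N)] for i in range(N)]
flat_step = adaptive_step(dct2(flat))
checker_step = adaptive_step(dct2(checker))
```

With these assumed parameters the flat block keeps the base step while the checkerboard block receives the doubled step, mirroring the coarse quantization of high-complexity areas described above.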


The Study of the Sensorineural Hearing Loss Compensation Algorithm using Psychoacoustics Model (심리음향모델을 적용한 난청 보정 알고리즘의 연구)

  • 노형철;김헌중;한헌수;차형태
    • Proceedings of the IEEK Conference / 2000.09a / pp.189-192 / 2000
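  • This paper proposes an algorithm that compensates for sensorineural hearing loss by applying a psychoacoustic model to the hearing loss, with the aim of providing a better hearing-aid environment for hearing-impaired people. The type of hearing loss addressed is sensorineural hearing loss, which arises from impairment of the sensory and neural systems extending from the inner ear to the central brain. In the frequency domain, the MTH (minimum hearing threshold) rises non-uniformly, which narrows the audible range; to solve this problem, a multiband compression algorithm is applied to each frequency band. In this case, however, the spectrum shape is distorted by the different audible ranges of the individual frequency bands, and the resulting spectral contrast reduction and altered masking characteristics limit speech discrimination. This is caused by masking from neighboring frequency components, so hearing aids should be developed through analysis of the signal in the perceptual domain experienced by the hearing-impaired listener and through psychoacoustic model parameters; this paper applies such an algorithm.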
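
As a rough illustration of per-band compression, the sketch below maps each band's input level from a normal audible range into a narrower impaired range. It is not the paper's algorithm: the linear mapping in dB, the function names, and the threshold values are assumptions, and the psychoacoustic masking analysis described in the abstract is not modeled.

```python
def compress_level(level_db, normal_threshold_db, impaired_threshold_db, ceiling_db):
    """Map a level from [normal threshold, ceiling] into [impaired threshold, ceiling].

    The mapping is linear in dB, so the narrowed range is used in
    proportion to the original one. Inputs outside the normal range
    are clipped to it first.
    """
    ratio = (ceiling_db - impaired_threshold_db) / (ceiling_db - normal_threshold_db)
    clipped = min(max(level_db, normal_threshold_db), ceiling_db)
    return impaired_threshold_db + (clipped - normal_threshold_db) * ratio

def multiband_compress(levels_db, bands):
    """Apply compress_level to each band with its own thresholds.

    bands is a list of (normal_threshold_db, impaired_threshold_db,
    ceiling_db) tuples, one per frequency band.
    """
    return [compress_level(level, *band) for level, band in zip(levels_db, bands)]

# Hypothetical thresholds: the second band has the larger hearing loss.
bands = [(0.0, 20.0, 100.0), (0.0, 40.0, 100.0)]
output_levels = multiband_compress([50.0, 50.0], bands)
```

Because each band is compressed by a different ratio, equal input levels come out at different output levels, which is one way to see how the spectrum shape is distorted and the spectral contrast reduced, as the abstract notes.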


Depth sensitivity of stereoscopic displays

  • Choi, Byeong-Hwa;Choi, Dong-Wook;Lee, Ja-Eun;Lee, Seung-Bae;Kim, Sung-Chul
    • Journal of Information Display / v.13 no.1 / pp.43-49 / 2012
  • Depth sensitivity is considered one of the most influential factors in 3D displays. In this paper, the perceptual 3D depth was quantitatively measured to compare the depth difference among display devices. No difference was found in the typical display performance among the devices, but the subjective evaluation of depth sensitivity, in which the disparity was varied, showed that the organic light emitting diode (OLED) display had the highest performance, mainly due to its almost 0% crosstalk, one of the features of OLED. Crosstalk is a form of image superposition that greatly affects depth sensitivity. The experimental results showed that the quantitative depth sensitivity varies due to geometric factors such as disparity, viewing distance, and subjective sensitivity, depending on the display image characteristics, such as crosstalk and contrast.

Discrimination of Synthesized English Vowels by American and Korean Listeners

  • Yang, Byung-Gon
    • Speech Sciences / v.13 no.1 / pp.7-27 / 2006
  • This study explored the discrimination of synthesized English vowel pairs by twenty-seven American and Korean, male and female listeners. The average formant values of nine monophthongs produced by ten American English male speakers were employed to synthesize the vowels. Subjects were then explicitly instructed to respond to AX discrimination tasks in which the standard vowel was followed by another one with an increment or decrement of the original formant values. The highest and lowest formant values of the same vowel quality were collected and compared to examine patterns of vowel discrimination. Results showed that the American and Korean groups discriminated the vowel pairs almost identically, and their center formant frequency values of the high and low boundaries fell almost exactly on those of the standards. In addition, the acceptable range of the same vowel quality was similar among the language and gender groups. The acceptable thresholds of each vowel formed an oval, maintaining perceptual contrast from adjacent vowels. The results suggested that nonnative speakers with high English proficiency could match native speakers' performance in discriminating vowel pairs with a shorter inter-stimulus interval. Pedagogical implications of these findings are discussed.


Effects of attention on the perception of L2 phonetic contrast

  • Lee, Hyunjung
    • Phonetics and Speech Sciences / v.6 no.4 / pp.47-52 / 2014
  • This study investigated how the degree of attention modulates English learners' perception of Korean stop contrasts. The contributions of VOT and F0 in perceiving Korean stops were examined while availability of attentional resources was manipulated using a dual-task paradigm. Results demonstrated the attentional modulation in the use of VOT, but not in F0: under less attention, the contribution of VOT to the perception of aspirated stops decreased, whereas that of lenis stops increased, which suggests more native-like performance. This implies that the role of attention in perceiving non-native contrasts might differ depending on how equivalent the acoustic and perceptual cues are between L1 and target L2 contrasts.

The acoustic realization of the Korean sibilant fricative contrast in Seoul and Daegu

  • Holliday, Jeffrey J.
    • Phonetics and Speech Sciences / v.4 no.1 / pp.67-74 / 2012
  • The neutralization of /$s^h$/ and /$s^*$/ in Gyeongsang dialects is a culturally salient stereotype that has received relatively little attention in the phonetic literature. The current study is a more extensive acoustic comparison of the sibilant fricative productions of Seoul and Gyeongsang dialect speakers. The data presented here suggest that, at least for young Seoul and Daegu speakers, there are few inter-dialectal differences in sibilant fricative production. These conclusions are supported by the output of mixed effects logistic regression models that used aspiration duration, spectral mean of the frication noise, and H1-H2 of the following vowel to predict fricative type in each dialect. The clearest dialect difference was that Daegu speakers' /$s^h$/ and /$s^*$/ productions had overall shorter aspiration durations than those of Seoul speakers, suggesting the opposite of the traditional "/$s^*$/ produced as [$s^h$]" stereotype of Gyeongsang dialects. Further work is needed to investigate whether /$s^h$/-/$s^*$/ neutralization in Daegu is perceptual rather than acoustic in nature.

Consonantal and Vocalic Effects in Korean Stop Identification

  • Kim, Mi-Ryoung
    • Speech Sciences / v.8 no.1 / pp.93-111 / 2001
  • This study investigates the contribution of vocalic information following the release of an initial stop to the identification of the three-way stop contrast (aspirated, lax, and tense) in Korean. Recent studies showed that there is a strong interaction between consonant types and tone. The findings raise questions concerning Korean listeners' use of tonal (or vocalic F0) variation in differentiating initial tense, lax, and aspirated stops. These issues are addressed in the present study using a cross-splicing methodology. The overall results show that low vocalic F0 provided the most salient information for lax stops; tense and aspirated stop identification depended on a combination of VOT, F0, and H1-H2 characteristics. The perceptual dominance of F0 over VOT for lax stops is consistent with the size of the F0 difference in utterance-initial position, as well as its prominent role in Korean intonational phonology.
