• Title/Summary/Keyword: 지각 최적화

Search Result 22, Processing Time 0.02 seconds

Design of Audio Watermarks by Noise Shaping (잡음 형상화에 의한 오디오 워터마크 설계)

  • Lee, Jin-Geol
    • Journal of Korea Multimedia Society
    • /
    • v.8 no.11
    • /
    • pp.1432-1438
    • /
    • 2005
  • A psychoacoustic model based noise shaping method is proposed. The method shapes the noise in the frequency domain such that its presence with a host signal will not be perceptually noticeable. The derivation of imperceptible noise levels from the masking thresholds of the signal involves deconvolution associated with the spreading function in the psychoacoustic model. It has been known as an ill-conditioned Problem. In this paper, a constrained optimization is applied such that the noise excitation level conforms to the masking thresholds of the signal. Thus, the noises embedded in the signal will not be perceived by human ear, and its performance is demonstrated experimentally.

  • PDF

A Study of Optimum Time-Spread Echo Audio Watermarking via Listening Test (청취실험에 의한 에코확산 오디오 워터마킹방법의 최적화에 관한 검토)

  • Ko Byeong-Seob
    • Proceedings of the Acoustical Society of Korea Conference
    • /
    • autumn
    • /
    • pp.545-546
    • /
    • 2004
  • 서브밴드 분리에 의한 에코확산 오디오 워터마킹법은 호스트 신호를 특정 주파수 대역으로 분리하고, MPEG 심리음향 모델을 이용하여 각 대역별로 삽입되는 워터마크의 파워를 파라미터 설정 함수에 의하여 설정한다. 여기서, 본 방법의 강인성과 비지각성을 좌우하는 것은 파라미터 설정 함수가 된다. 따라서, 본 연구에서는 최대의 강인성과 최소의 음질 열화를 구현하기 위하여 청취실험을 실시하여 최적의 파라미터 설정 함수 설정방법에 대한 검토를 수행하였다.

  • PDF

Performance comparison evaluation of speech enhancement using various loss functions (다양한 손실 함수를 이용한 음성 향상 성능 비교 평가)

  • Hwang, Seo-Rim;Byun, Joon;Park, Young-Cheol
    • The Journal of the Acoustical Society of Korea
    • /
    • v.40 no.2
    • /
    • pp.176-182
    • /
    • 2021
  • This paper evaluates and compares the performance of the Deep Nerual Network (DNN)-based speech enhancement models according to various loss functions. We used a complex network that can consider the phase information of speech as a baseline model. As the loss function, we consider two types of basic loss functions; the Mean Squared Error (MSE) and the Scale-Invariant Source-to-Noise Ratio (SI-SNR), and two types of perceptual-based loss functions, including the Perceptual Metric for Speech Quality Evaluation (PMSQE) and the Log Mel Spectra (LMS). The performance comparison was performed through objective evaluation and listening tests with outputs obtained using various combinations of the loss functions. Test results show that when a perceptual-based loss function was combined with MSE or SI-SNR, the overall performance is improved, and the perceptual-based loss functions, even exhibiting lower objective scores showed better performance in the listening test.

A neural network model for recognizing facial expressions based on perceptual hierarchy of facial feature points (얼굴 특징점의 지각적 위계구조에 기초한 표정인식 신경망 모형)

  • 반세범;정찬섭
    • Korean Journal of Cognitive Science
    • /
    • v.12 no.1_2
    • /
    • pp.77-89
    • /
    • 2001
  • Applying perceptual hierarchy of facial feature points, a neural network model for recognizing facial expressions was designed. Input data were convolution values of 150 facial expression pictures by Gabor-filters of 5 different sizes and 8 different orientations for each of 39 mesh points defined by MPEG-4 SNHC (Synthetic/Natural Hybrid Coding). A set of multiple regression analyses was performed with the rating value of the affective states for each facial expression and the Gabor-filtered values of 39 feature points. The results show that the pleasure-displeasure dimension of affective states is mainly related to the feature points around the mouth and the eyebrows, while a arousal-sleep dimension is closely related to the feature points around eyes. For the filter sizes. the affective states were found to be mostly related to the low spatial frequency. and for the filter orientations. the oblique orientations. An optimized neural network model was designed on the basis of these results by reducing original 1560(39x5x8) input elements to 400(25x2x8) The optimized model could predict human affective rating values. up to the correlation value of 0.886 for the pleasure-displeasure, and 0.631 for the arousal-sleep. Mapping the results of the optimized model to the six basic emotional categories (happy, sad, fear, angry, surprised, disgusted) fit 74% of human responses. Results of this study imply that, using human principles of recognizing facial expressions, a system for recognizing facial expressions can be optimized even with a a relatively little amount of information.

  • PDF

Threshold Selection Method for Capacity Optimization of the Digital Watermark Insertion (디지털 워터마크의 삽입용량 최적화를 위한 임계값 선택방법)

  • Lee, Kang-Seung;Park, Ki-Bum
    • Journal of the Institute of Convergence Signal Processing
    • /
    • v.10 no.1
    • /
    • pp.49-59
    • /
    • 2009
  • In this paper a watermarking algorithm is proposed to optimize the capacity of the digital watermark insertion in an experimental threshold using the characteristics of human visual system(HVS), adaptive scale factors, and weight functions based on discrete wavelet transform. After the original image is decomposed by a 3-level discrete wavelet transform, the watermarks for capacity optimization are inserted into all subbands except the baseband, by applying the important coefficients from the experimental threshold in the wavelet region. The adaptive scale factors and weight functions based on HVS are considered for the capacity optimization of the digital watermark insertion in order to enhance the robustness and invisibility. The watermarks are consisted of gaussian random sequences and detected by correlation. The experimental results showed that this algorithm can preserve a fine image quality against various attacks such as the JPEG lossy compression, noise addition, cropping, blurring, sharpening, linear and non-linear filtering, etc.

  • PDF

The Effect of Intrinsic Attributes of IoT Product on Brand Image and Customer Loyalty (IoT제품의 내재적 속성이 브랜드 이미지와 고객 충성도에 미치는 영향)

  • Peng, Tian;Chen, Xing
    • Journal of Digital Convergence
    • /
    • v.20 no.5
    • /
    • pp.61-68
    • /
    • 2022
  • This study examines the impact of consumers' subjective perception on the quality loyalty of the Internet of things, explores academic innovation schemes, and puts forward business strategies for on-site optimization of the industry. The research methods use online survey methods for Chinese consumers who have used or bought Xiaomi IoT products. The results show that the inherent attributes of Internet of things products have an impact on brand image, and brand image has an impact on customer loyalty. Brand image shows complete media effect in the relationship between hyper-connectivity and customer loyalty of Internet of things products. For the enterprises that develop the Internet of things, obtaining the perceived brand image has a very important strategic significance in expanding the loyal customer base.

Optimization of Multi-time Scale Loss Function Suitable for DNN-based Audio Coder (심층신경망 기반 오디오 부호화기를 위한 Multi-time Scale 손실함수의 최적화)

  • Shin, Seung-Min;Byun, Joon;Park, Young-Cheol;Beack, Seung-kwon;Sung, Jong-mo
    • Proceedings of the Korean Society of Broadcast Engineers Conference
    • /
    • 2022.06a
    • /
    • pp.1315-1317
    • /
    • 2022
  • 최근, 심층신경망 기반 오디오 부호화기가 활발히 연구되고 있다. 심층신경망 기반 오디오 부호화기는 기존의 전통적인 오디오 부호화기보다 구조적으로 간단하지만, 네트워크의 복잡도를 증가시키지 않고 인지적 성능향상을 기대하는 것은 어렵다. 이 문제를 해결하기 위하여 인간의 청각적 특성을 활용한 심리음향모델 기반 손실함수를 사용한 기법들이 소개되었다. 심리음향 모델 기반 손실함수를 사용한 오디오 부호화기는 양자화 잡음을 잘 제어하였지만, 여전히 지각적인 향상이 필요하다. 본 논문에서는 심층신경망 기반 오디오 부호화기를 위한 Multi-time Scale 손실함수의 지역 손실함수 윈도우 크기의 최적화 제안한다. Multi-time Scale 손실함수의 지역 손실함수 계산을 위한 윈도우 크기를 조절하며, 이를 통하여 오디오 부호화에 적합한 윈도우 사이즈를 결정한다. 실험을 통해 얻은 최적의 Multi-time Scale 손실함수를 사용하여 네트워크를 훈련하였고, 주관적 평가를 통해 기존의 심리음향모델 기반 손실함수보다 좋은 음성 품질을 보여주는 것을 확인하였다.

  • PDF

Spherical Slepian Harmonic Expression of the Crustal Magnetic Vector and Its Gradient Components (구면 스레피안 함수로 표현된 지각 자기이상값과 구배 성분)

  • Kim, Hyung Rae
    • Economic and Environmental Geology
    • /
    • v.49 no.4
    • /
    • pp.269-280
    • /
    • 2016
  • I presented three vector crustal magnetic anomaly components and six gradients by using spherical Slepian functions over the cap area of $20^{\circ}$ of radius centered on the South Pole. The Swarm mission, launched by European Space Agency(ESA) in November of 2013, was planned to put three satellites into the low-Earth orbits, two in parallel in East-West direction and one in cross-over of the higher altitude. This orbit configuration will make the gradient measurements possible in North-South direction, vertical direction, as well as E-W direction. The gravity satellites, such as GRACE and GOCE, have already implemented their gradient measurements for recovering the accurate gravity of the Earth and its temporal variation due to mass changes on the subsurface. However, the magnetic gradients have little been applied since Swarm launched. A localized magnetic modeling method is useful in taking an account for a region where data availability was limited or of interest was special. In particular, computation to get the localized solutions is much more efficient and it has an advantage of presenting high frequency anomaly features with numbers of solutions fewer than the global ones. Besides, these localized basis functions that were done by a linear transformation of the spherical harmonic functions, are orthogonal so that they can be used for power spectrum analysis by transforming the global spherical harmonic coefficients. I anticipate in scientific and technical progress in the localized modeling with the gradient measurements from Swarm and here will do discussion on the results of the localized solution to represent the three vector and six gradient anomalies over the Antarctic area from the synthetic data derived from a global solution of the spherical harmonics for the crustal magnetic anomalies of Swarm measurements.

Color reproduction using color appearance model in LCD projection systems (표색계를 이용한 액정 프로젝션 시스템의 색재현)

  • 김지홍
    • Korean Journal of Optics and Photonics
    • /
    • v.9 no.6
    • /
    • pp.373-379
    • /
    • 1998
  • A new method is proposed for the design of the dichroic mirrors in 3-LCD projection systems for color separation/composition. Rather than simply basing the color performance cirterion on luminance or chromatic saturation only, the optimum design parameters can be found by maximizing the volume of the perceived color gamut in RLAB color space and related color appearance model and used the linearly approximated spectrum of dichroic mirrors for simplicity and vector space description. By this method, we found optimal half-power wavelengths in dichroic mirrors which maximized our performance criterion.

  • PDF

Factors Affecting Attitude to Use Devices in Watching Video through Smart Devices (스마트기기를 통한 동영상 시청 환경에서 기기 이용 태도에 영향을 미치는 요인)

  • Song, Jaemin;Kim, Dongyeon
    • The Journal of the Korea Contents Association
    • /
    • v.20 no.5
    • /
    • pp.46-57
    • /
    • 2020
  • The dissemination of smart devices has made a lot of changes in overall social activities. In particular, people use various types of smart devices in their spare time, such as watching video clips, but there is a lack of research on external factors influencing the attitude toward using such devices. Therefore, in this study, the effects of video viewing environmental factors (e.g. screen size and video length) and personal factors (e.g. gender and need for entertainment) on perceived ease of use, perceived usefulness, and attitude to use devices based on technology acceptance model. As a result of analyzing 660 users having different smart devices, the attitude to use smart devices is more positive as the screen size increases, but there is no difference according to gender. In addition, while the length of video clips does not affect the attitude to use, the need for entertainment positively affects the attitude to use. Based on the results of this study, we expect that it can be used for optimized customer marketing and management strategy that integrates product development and video content production in consideration of factors such as video viewing environmental factors and personal factors.