• Title/Summary/Keyword: Visual-Perceptual

Search Result 244, Processing Time 0.026 seconds

A Multi-category Task for Bitrate Interval Prediction with the Target Perceptual Quality

  • Yang, Zhenwei;Shen, Liquan
    • KSII Transactions on Internet and Information Systems (TIIS)
    • /
    • v.15 no.12
    • /
    • pp.4476-4491
    • /
    • 2021
  • Video service providers tend to face user network problems in the process of transmitting video streams. They strive to provide user with superior video quality in a limited bitrate environment. It is necessary to accurately determine the target bitrate range of the video under different quality requirements. Recently, several schemes have been proposed to meet this requirement. However, they do not take the impact of visual influence into account. In this paper, we propose a new multi-category model to accurately predict the target bitrate range with target visual quality by machine learning. Firstly, a dataset is constructed to generate multi-category models by machine learning. The quality score ladders and the corresponding bitrate-interval categories are defined in the dataset. Secondly, several types of spatial-temporal features related to VMAF evaluation metrics and visual factors are extracted and processed statistically for classification. Finally, bitrate prediction models trained on the dataset by RandomForest classifier can be used to accurately predict the target bitrate of the input videos with target video quality. The classification prediction accuracy of the model reaches 0.705 and the encoded video which is compressed by the bitrate predicted by the model can achieve the target perceptual quality.

Effect of Visual Perception by Vision Therapy for Improvement of Visual Function (시각기능 개선을 위한 시기능훈련이 시지각에 미치는 영향)

  • Lee, Seung Wook;Lee, Hyun Mee
    • Journal of Korean Ophthalmic Optics Society
    • /
    • v.20 no.4
    • /
    • pp.491-499
    • /
    • 2015
  • Purpose: This study was to examine how decline of visual function affects visual perception by assessing visual perception after improving visual function through visual training, and observing the change in the cognitive ability of visual perception. Methods: This study analyzes the visual perceptual evaluation (TVPS_R) of 23 children below age 13($8.75{\pm}1.66$) who have visual abnormalities, and improves visual function after conducting vision training (vision therapy) of the children. Results: Convergence increased from average $3.39{\pm}2.52{\Delta}$ (prism) to $13.87{\pm}6.04{\Delta}$ in the measurement of long-distance disparate points, and from average $5.48{\pm}3.42{\Delta}$ to $18.43{\pm}7.58{\Delta}$ in the measurement of short-distance disparate points. Short-distance diplopia points increased from $25.87{\pm}7.33cm$ to $7.48{\pm}2.87cm$, and as for accommodative insufficiency, short-distance blur points increased from $19.57{\pm}7.16cm$ to $7.09{\pm}1.88cm$. In the visual perceptual evaluation performed before and after improving visual function, 6 items except visual memory showed statistically significant improvement. By order of significant improvement, response gap was highest with $17.74{\pm}16.94$(p=0.000) in visual closure, followed by $15.65{\pm}17.11$(p=0.000) in visual sequential-memory, $13.65{\pm}16.63$(p=0.001) in visual figure-ground, $12.74{\pm}18.41$(p=0.003) in visual form-constancy, $6.48{\pm}10.07$ (p=0.005) in visual discrimination, and $4.17{\pm}9.33$(p=0.043) in visual spatial-relationship. In the visual perception quotient that added up these scores, the response gap was $15.22{\pm}8.66$(p=0.000), showing a more significant result. Conclusions: Vision training enables efficient visual processing and improves visual perceptual ability. It was confirmed that improvement of visual function through visual training not only improves abnormal visual function but also affects visual perception of children such as learning, perception and recognition.

The Comparative Analysis of Visual Perceptual Function and Impulse on Players Chagi in Taekwondo Events (태권도 종목별 선수들의 차기에 대한 시지각기능 및 충격량 비교 분석)

  • Lee, Young-Rim;Ha, Chul-Soo
    • Korean Journal of Applied Biomechanics
    • /
    • v.20 no.2
    • /
    • pp.205-212
    • /
    • 2010
  • The purpose of this study was to compare the efficiency of visual perception and impulse according to the three types of Taekwondo players to be able to supply an efficient training method, for this a total of 12 representative Taekwondo players of the Korean National team, 4 poomsae players, 4 kyokpa players and 4 kyorugi players weighting between 68 to 74 kg, and the results from the motion analysis system, eye tracker and Electronic hogu are as follows. For the visual perceptual function, the total body reaction time was slowest for the kyokpa group, and for the visible reaction and vision fixation time was longest of the poomsae group, while the performance movement was fastest for the kyorugi group. As for description of the two kicking motions dollyo chagi and dolgae chagi the longer visual fixation helps the accuracy of the kick. In conclusion, as there was a difference between the groups, this information could help to train the visual perception of players according to what event they are participating in.

A Study on the Elevation of Korean Traditional Architecture in Visual Perception (시화학 측면에서의 한국전통건축 입면구성에 관한 연구)

  • 장석하
    • Journal of the Korean housing association
    • /
    • v.8 no.3
    • /
    • pp.17-27
    • /
    • 1997
  • This thesis is concerned with a study of spatial characteristics of Korean traditional architecture in visual perception. This study, therefore has been made of principles of visual perception. range of visual perception in architectural environment, spatial characteristics of Korean traditional architecture, and example case studies are exhibited. The architectural compositional principles of parts selected in process of study could be selected to facilitate comparison with the perceptual psychology. The result of this study can be used to construct Korean architectural plans. elevations. form and spatial and order pertinent to human understandings and existances on the priority of wholeness and the relationship of parts to visual perception.

  • PDF

Visuality and Hapticity in Acupoints: A Study on Benshu Chapter in Huangdi Neijng Lingshu (경혈의 시각성과 촉각성: 『영추·본수』의 한 연구)

  • Song, Seok Mo
    • Korean Journal of Acupuncture
    • /
    • v.38 no.4
    • /
    • pp.290-307
    • /
    • 2021
  • Objectives : Perceptual experiences have a causal relationship with reality. If there exists something corresponding to acupoints, there should be perceptual experiences for that something. The purpose of this study is to identify and to analyze the perceptual experiences for acupoints within 『LingShu·BenShu』. Methods : First, we briefly propose a perceptual anatomy in order to describe the perceived human body parts, and their perceived directions and places. Second, we analyze the ways of identifying acupoints in the original text of 『LingShu·BenShu』. Results : From 『LingShu·BenShu』, the procedures of identifying total 64 acupoints were recognized. It was clarified that they are by way of visual and haptic explorations in body regions and partial regions. Conclusions : Perceptual explorations for acupoints follow three major principles: of gradual narrowing down, of determination of direction or place, of relative distance. At the final stages, categories of form and location are encountered by observers. The forms have either concavities or convexities. They are determinate indicators of where acupoints are, while the locations are indetermanate. Haptic forms of acupoints are newly discovered from textual analysis with perceptual anatomy. These properties will shed new light both on study of acupoints and on study of meridians.

Fractal image compression with perceptual distortion measure (인지 왜곡 척도를 사용한 프랙탈 영상 압축)

  • 문용호;박기웅;손경식;김윤수;김재호
    • The Journal of Korean Institute of Communications and Information Sciences
    • /
    • v.21 no.3
    • /
    • pp.587-599
    • /
    • 1996
  • In general fractal imge compression, each range block is approximated by a contractive transform of the matching domain block under the mean squared error criterion. In this paper, a distortion measure reflecting the properties of human visual system is defined and applied to a fractal image compression. the perceptual distortion measure is obtained by multiplying the mean square error and the noise sensitivity modeled by using the background brightness and spatial masking. In order to compare the performance of the mean squared error and perceptual distortion measure, a simulation is carried out by using the 512*512 Lena and papper gray image. Compared to the results, 6%-10% compression ratio improvements under improvements under the same image quality are achieved in the perceptual distortion measure.

  • PDF

Reversible Multipurpose Watermarking Algorithm Using ResNet and Perceptual Hashing

  • Mingfang Jiang;Hengfu Yang
    • Journal of Information Processing Systems
    • /
    • v.19 no.6
    • /
    • pp.756-766
    • /
    • 2023
  • To effectively track the illegal use of digital images and maintain the security of digital image communication on the Internet, this paper proposes a reversible multipurpose image watermarking algorithm based on a deep residual network (ResNet) and perceptual hashing (also called MWR). The algorithm first combines perceptual image hashing to generate a digital fingerprint that depends on the user's identity information and image characteristics. Then it embeds the removable visible watermark and digital fingerprint in two different regions of the orthogonal separation of the image. The embedding strength of the digital fingerprint is computed using ResNet. Because of the embedding of the removable visible watermark, the conflict between the copyright notice and the user's browsing is balanced. Moreover, image authentication and traitor tracking are realized through digital fingerprint insertion. The experiments show that the scheme has good visual transparency and watermark visibility. The use of chaotic mapping in the visible watermark insertion process enhances the security of the multipurpose watermark scheme, and unauthorized users without correct keys cannot effectively remove the visible watermark.

Video Coding Method Using Visual Perception Model based on Motion Analysis (움직임 분석 기반의 시각인지 모델을 이용한 비디오 코딩 방법)

  • Oh, Hyung-Suk;Kim, Won-Ha
    • Journal of Broadcast Engineering
    • /
    • v.17 no.2
    • /
    • pp.223-236
    • /
    • 2012
  • We develop a video processing method that allows the more advanced human perception oriented video coding. The proposed method necessarily reflects all influences by the rate-distortion based optimization and the human visual perception that is affected by the visual saliency, the limited space-time resolution and the regional moving history. For reflecting the human perceptual effects, we devise an online moving pattern classifier using the Hedge algorithm. Then, we embed the existing visual saliency into the proposed moving patterns so as to establish a human visual perception model. In order to realize the proposed human visual perception model, we extend the conventional foveation filtering method. Compared to the conventional foveation filter only smoothing less stimulus video signals, the developed foveation filter can locally smooth and enhance signals according to the human visual perception without causing any artifacts. Due to signal enhancement, the developed foveation filter more efficiently transfers the bandwidth saved at smoothed signals to the enhanced signals. Performance evaluation verifies that the proposed video processing method satisfies the overall video quality, while improving the perceptual quality by 12%~44%.

Linear Sub-band Decomposition based Pre-processing Algorithm for Perceptual Video Coding (지각적 동영상 부호화를 위한 선형 부 대역 분해 기반 전처리 기법)

  • Choi, Kwang Yeon;Song, Byung Cheol
    • Journal of the Institute of Electronics and Information Engineers
    • /
    • v.54 no.1
    • /
    • pp.80-87
    • /
    • 2017
  • This paper proposes a pre-processing algorithm to improve perceptual video coding efficiency which decomposes an input frame via a sub-band decomposition, and suppresses only high frequency band(s) having low visual sensitivity. First, we decompose the input frame into several frequency subbands by a linear sub-band decomposition. Next, high frequency subband(s) which is rarely recognized by human visual system (HVS) is suppressed by applying relatively small gain(s). Finally, the high frequency suppressed frame is compressed by a specific video encoder. We can find from the experimental results that if comparing before-use and after-use of the proposed pre-processing prior to the encoder, no visual difference is shown. Also, the proposed algorithm achieves bit-saving of 13.12% on average in a H.264 video encoder.

Digital Cage Watermarking using Human Visual System and Discrete Cosine Transform (인지 시각시스템 및 이산코사인변환을 이용한 디지털 이미지 워터마킹)

  • 변성철;김종남;안병하
    • Journal of KIISE:Information Networking
    • /
    • v.30 no.1
    • /
    • pp.17-23
    • /
    • 2003
  • In this Paper. we Propose a digital watermarking scheme for digital images based on a perceptual model, the frequency masking, texture making, and luminance masking Properties of the human visual system(HVS), which have been developed in the context of image compression. We embed two types of watermark, one is pseudo random(PN) sequences, the other is a logo image. To embed the watermarks, original images are decomposed into $8\times8$ blocks, and the discrete cosine transform(DCT) is carried out for each block. Watermarks are casted in the low frequency components of DCT coefficients. The perceptual model adjusts adaptively scaling factors embedding watermarks according to the local image properties. Experimental results show that the proposed scheme presents better results than that of non-perceptual watermarking methods for image qualify without loss of robustness.