Search | Korea Science

Perceptual Photo Enhancement with Generative Adversarial Networks (GAN 신경망을 통한 자각적 사진 향상)

Que, Yue;Lee, Hyo Jong
- Proceedings of the Korea Information Processing Society Conference
- /
- 2019.05a
- /
- pp.522-524
- /
- 2019
In spite of a rapid development in the quality of built-in mobile cameras, their some physical restrictions hinder them to achieve the satisfactory results of digital single lens reflex (DSLR) cameras. In this work we propose an end-to-end deep learning method to translate ordinary images by mobile cameras into DSLR-quality photos. The method is based on the framework of generative adversarial networks (GANs) with several improvements. First, we combined the U-Net with DenseNet and connected dense block (DB) in terms of U-Net. The Dense U-Net acts as the generator in our GAN model. Then, we improved the perceptual loss by using the VGG features and pixel-wise content, which could provide stronger supervision for contrast enhancement and texture recovery.
https://doi.org/10.3745/PKIPS.y2019m05a.522 인용 PDF

The Aesthetic Evaluative Response of Eating and Drinking Space Design -Focused on the Relationships between Aesthetic Variables and Preference by Perceptual-Cognitive and Affective Judgment- (식음 공간 디자인의 심미적 평가 반응 -지각적.감정적 판단에 따른 미적 변수와 선호도의 관계를 중심으로-)

Choi, Eun-Hee;Kwon, Young-Gull
- Archives of design research
- /
- v.20 no.1 s.69
- /
- pp.21-32
- /
- 2007
To quantitatively measure or evaluate aesthetic factors is not easy in comparison with physical, functional, behavioral or economic factors. Yet aesthetic factors essentially play an important role in design modeling process. Despite its importance, research on aesthetic assessment or the interaction of aesthetic influential elements is insufficient. Therefore, this study is intended to find the relationships between visual preference and aesthetic variables of perceptual-cognitive dimension and affective dimension in commercial space design. According to the result of this substantiation research, aesthetic variables that give a positive effect on the preference of commercial space design are unity, order, and clarity in perceptual-cognitive dimension and 'pleasant', 'relaxing' in affective dimension. On the other side, aesthetic variables that give a negative effect on the preference are contrast, complexity, and ambiguity that is a contrary concept of clarity in perceptual-cognitive dimension and 'exciting', 'arousing' in affective dimension.
PDF

Perception of Japanese word-initial stops by native listeners (모어청자에 의한 일본어 어두 폐쇄음의 지각)

Byun, Hi-Gyung
- Phonetics and Speech Sciences
- /
- v.13 no.3
- /
- pp.53-64
- /
- 2021
It is known that the voicing contrast for Japanese word-initial stops is primarily realized as differences in the voice onset time (VOT). However, recent studies have reported that voiced stops are more often produced with a positive VOT than with a negative VOT among the younger generation nationwide. It is also known that post-stop F0 is associated with the stop contrast, but the degree of F0 use differs from region to region. This study explores whether the difference in post-stop F0 functions as a perceptual cue to the stop contrast along with VOT. Fifty-five college students who are native listeners from four different regions participated in two or three perception tests. The results show that VOT is a primary cue to the voiced-voiceless distinction of word-initial stops, but that the effect of post-stop F0 on the stop contrast is marginal. The post-stop F0 is involved in perception only when VOT is ambiguous, such that a sound with high F0 is more often perceived as a voiceless stop, but not vice versa. The results of this study indicate that the acoustic parameters associated with the stop contrast are not the same in production and perception, and suggest that other factors such as context, which is not an acoustic characteristic, may also be involved in the stop contrast.
https://doi.org/10.13064/KSSS.2021.13.3.053 인용 PDF KSCI

Speech processing strategy and executive function: Korean children's stop perception

Kong, Eun Jong;Yoo, Jeewon
- Phonetics and Speech Sciences
- /
- v.9 no.3
- /
- pp.57-65
- /
- 2017
The current study explored how Korean-speaking children processed the multiple acoustic cues (VOT and f0) for the stop laryngeal contrast (/t'/, /t/, and /$t^h$/) and examined whether individual perceptual strategies could be related to a general cognitive ability performing executive functions (EF). 15 children (aged from 7 to 8) participated in the speech perception task identifying the three Korean laryngeal stops (3AFC) on listening to the auditory stimuli of C-/a/ with synthetically varying VOT and f0. They completed a series of EF tasks to measure working memory, inhibition, and cognitive shifting ability. The findings showed that children used the two cues in a highly correlated manner. While children utilized VOT consistently for the three laryngeal categories, their use of f0 was either reduced or enhanced depending on the phonetic categories. Importantly, the children's processing strategies of a f0 suppression for a tense-aspirated contrast were meaningfully associated with children's better cognitive abilities such as working memory, inhibition, and attentional shifting. As a preliminary experimental investigation, the current research demonstrated that listeners with inefficient processing strategies were poor at the EF skills, suggesting that cognitive skills might be responsible for developmental variations of processing sub-phonemic information for the linguistic contrast.
https://doi.org/10.13064/KSSS.2017.9.3.057 인용 PDF KSCI

Lexical Encoding of L2 Suprasegmentals: Evidence from Korean Learners' Acquisition of Japanese Vowel Length Distinctions

Han, Jeong-Im
- Phonetics and Speech Sciences
- /
- v.1 no.4
- /
- pp.17-27
- /
- 2009
Despite many studies on the production and perception of L2 phonemes, studies on how such phonemes are encoded lexically remain scarce. The aim of this study is to examine whether L2 learners have a perceptual problem with L2 suprasegmentals which are not present in their L1, or if they are able to perceive but not able to encode them in their lexicon. Specifically, Korean learners were tested to see if they could discriminate the vowel length differences in Japanese at the psychoacoustic level through a simple AX discrimination task. Then, a speeded lexical decision task with high phonetic variability was conducted to see whether they could use such contrasts lexically. The results showed that Korean learners of Japanese have no difficulties in discriminating Japanese vowel length contrast, but they are unable to encode such contrast in their phonological representation, even with long L2 exposure.
PDF

A Novel Method to Evaluate the Emotional Image Quality with CIECAM02

Chong, Jong-Ho;Lee, Seung-Bae;Park, Hye-Ryoung;Kim, Sang-Ho;Bae, Jae-Woo;Kim, Hye-Dong;Kim, Hun-Soo
- 한국정보디스플레이학회:학술대회논문집
- /
- 2008.10a
- /
- pp.47-50
- /
- 2008
We propose a new method evaluating the image quality of display devices using the CIECAM02 that is the recently developed CIE color appearance model and provides an extension of the previously recommended CIE color spaces. We develop the evaluation method that quantifies the color reproduction capability, emotional gray scale (gradation), and visual perception contrast (perceptual contrast range) based on the gamut in this model.
PDF

Brightness Function on TV Viewing Condition (TV 시청 조건에서의 Brightness Function)

최성호;김희철;장수욱;김은수;한찬호;송규익
- Proceedings of the IEEK Conference
- /
- 2003.07e
- /
- pp.2403-2406
- /
- 2003
When viewing images, the relative luminance of the surround has a profound impact on the apparent contrast of the image. The dark surround causes the image elements to appear lighter than those viewed in an illuminated surround. For this reason, it is worthwhile to briefly review the general results of brightness sealing under a various viewing condition. Two of the most often cited parers on the topic of brightness scaling are Stevens-stevens and Bartleson-Breneman's function. There are, however, significant differences between the perceptual functions for simple-field and complex-field viewing. In this paper, we research the relationship between Steven's power law and Bartleson-Breneman's function. We present an appropriate brightness perception function due to TV system viewing conditions. Highlight luminance peak and absolute brightness threshold value in various adaptation levels are obtained from the proposed brightness function . Also, the luminance value of black level to produce the same contrast ratio with variety of display highlight luminance peak is obtained from the proposed brightness function.
PDF

Perception of Korean stops with a three-way laryngeal contrast

Kong, Eun-Jong
- Phonetics and Speech Sciences
- /
- v.4 no.1
- /
- pp.13-20
- /
- 2012
A lax stop in Korean, one of the three laryngeal contrastive stops, has undergone a sound change in terms of its acoustic properties. Prior production studies described this recent lax stop as being differentiated from tense and aspirated stops primarily by fundamental frequencies (f0). And, the acoustic property of voice onset time (VOT) further separates tense stops from lax and aspirated stops. The current research explores how these two major acoustic parameters of f0 and VOT cue the three stop categories in Korean adult listeners' perception. Thirty-one native speakers of Korean participated in two experimental tasks: categorization judgment and within-category goodness ratings. Two sets of audio stimuli were prepared by synthesizing English and Korean male speakers' CV productions. The findings showed that while f0 cues listeners to lax stops as production patterns would predict, VOT were closely related to listeners' categorization and goodness ratings of lax stops. This suggests that accurate characterizations of the recent lax stop category need to be based on Korean speakers' perceptual behavior as well as production patterns.
https://doi.org/10.13064/KSSS.2012.4.1.013 인용 PDF

Stereoscopic Perception Improvement Using Color and Depth Transformation (컬러 및 깊이 데이터 변환을 이용하는 입체감 향상)

Gil, Jong-In;Jang, Seung-Eun;Seo, Joo-Ha;Kim, Man-Bae
- Journal of Broadcast Engineering
- /
- v.16 no.4
- /
- pp.584-595
- /
- 2011
Recently, RGB images and depth maps have been supplied to academic fields. The depth maps are utilized to the generation of stereoscopic images in the diverse formats according to the users' preference. A variety of methods that use depth maps have been introduced so far. One of applications is a medical field. In this area, the improvement of the perceptual quality of 2D medical images has gained much interest. In this paper, we propose a novel scheme that expands the conventional method to 3D stereoscopic image, thereby achieving the perceptual depth quality improvement as well as 3D stereoscopic perception enhancement at the same time. For this, contrast transformation as well as depth darkening are proposed and their performance is validated through the subjective test. Subjective experiments peformed for stereoscopic enhancement as well as visual fatigue validate that the proposed method achieves better 3D perception than the usage of the original stereoscopic image and suggests the limitation in terms of the visual fatigue.
https://doi.org/10.5909/JEB.2011.16.4.584 인용 PDF KSCI

An acoustic and perceptual investigation of the vowel length contrast in Korean

Lee, Goun;Shin, Dong-Jin
- Phonetics and Speech Sciences
- /
- v.8 no.1
- /
- pp.37-44
- /
- 2016
The goal of the current study is to investigate how the sound change is reflected in production or in perception, and what the effect of lexical frequency is on the loss of sound contrasts. Specifically, the current study examined whether the vowel length contrasts are retained in Korean speakers' productions, and whether Korean listeners can distinguish vowel length minimal pairs in their perception. Two production experiments and two perception experiments investigated this. For production tests, twelve Korean native speakers in their 20s and 40s completed a read-aloud task as well as a map-task. The results showed that, regardless of their age group, all Korean speakers produced vowel length contrasts with a small but significant differences in the read-aloud test. Interestingly, the difference between long and short vowels has disappeared in the map task, indicating that the speech mode affects producing vowel length contrasts. For perception tests, thirty-three Korean listeners completed a discrimination and a forced-choice identification test. The results showed that Korean listeners still have a perceptual sensitivity to distinguish lexical meaning of the vowel length minimal pair. We also found that the identification accuracy was affected by the word frequency, showing a higher identification accuracy in high- and mid- frequency words than low frequency words. Taken together, the current study demonstrated that the speech mode (read-aloud vs. spontaneous) affects the production of the sound undergoing a language change; and word frequency affects the sound change in speech perception.
https://doi.org/10.13064/KSSS.2016.8.1.037 인용 PDF KSCI

Search Result 73, Processing Time 0.025 seconds

이메일무단수집거부

이용약관

제 1 장 총칙

제 2 장 이용계약의 체결

제 3 장 계약 당사자의 의무

제 4 장 서비스의 이용

제 5 장 계약 해지 및 이용 제한

제 6 장 손해배상 및 기타사항

Detail Search

Image Search (β)