• Title/Summary/Keyword: recognition cue

Search Result 21, Processing Time 0.021 seconds

Robust Facial Expression Recognition using PCA Representation (PCA 표상을 이용한 강인한 얼굴 표정 인식)

  • Shin Young-Suk
    • Korean Journal of Cognitive Science
    • /
    • v.16 no.4
    • /
    • pp.323-331
    • /
    • 2005
  • This paper proposes an improved system for recognizing facial expressions in various internal states that is illumination-invariant and without detectable rue such as a neutral expression. As a preprocessing to extract the facial expression information, a whitening step was applied. The whitening step indicates that the mean of the images is set to zero and the variances are equalized as unit variances, which reduces murk of the variability due to lightening. After the whitening step, we used the facial expression information based on principal component analysis(PCA) representation excluded the first 1 principle component. Therefore, it is possible to extract the features in the lariat expression images without detectable cue of neutral expression from the experimental results, we ran also implement the various and natural facial expression recognition because we perform the facial expression recognition based on dimension model of internal states on the images selected randomly in the various facial expression images corresponding to 83 internal emotional states.

  • PDF

Acoustic parameters for induced emotion categorizing and dimensional approach (자연스러운 정서 반응의 범주 및 차원 분류에 적합한 음성 파라미터)

  • Park, Ji-Eun;Park, Jeong-Sik;Sohn, Jin-Hun
    • Science of Emotion and Sensibility
    • /
    • v.16 no.1
    • /
    • pp.117-124
    • /
    • 2013
  • This study examined that how precisely MFCC, LPC, energy, and pitch related parameters of the speech data, which have been used mainly for voice recognition system could predict the vocal emotion categories as well as dimensions of vocal emotion. 110 college students participated in this experiment. For more realistic emotional response, we used well defined emotion-inducing stimuli. This study analyzed the relationship between the parameters of MFCC, LPC, energy, and pitch of the speech data and four emotional dimensions (valence, arousal, intensity, and potency). Because dimensional approach is more useful for realistic emotion classification. It results in the best vocal cue parameters for predicting each of dimensions by stepwise multiple regression analysis. Emotion categorizing accuracy analyzed by LDA is 62.7%, and four dimension regression models are statistically significant, p<.001. Consequently, this result showed the possibility that the parameters could also be applied to spontaneous vocal emotion recognition.

  • PDF

Robust 3D Hand Tracking based on a Coupled Particle Filter (결합된 파티클 필터에 기반한 강인한 3차원 손 추적)

  • Ahn, Woo-Seok;Suk, Heung-Il;Lee, Seong-Whan
    • Journal of KIISE:Software and Applications
    • /
    • v.37 no.1
    • /
    • pp.80-84
    • /
    • 2010
  • Tracking hands is an essential technique for hand gesture recognition which is an efficient way in Human Computer Interaction (HCI). Recently, many researchers have focused on hands tracking using a 3D hand model and showed robust tracking results compared to using 2D hand models. In this paper, we propose a novel 3D hand tracking method based on a coupled particle filter. This provides robust and fast tracking results by estimating each part of global hand poses and local finger motions separately and then utilizing the estimated results as a prior for each other. Furthermore, in order to improve the robustness, we apply a multi-cue based method by integrating a color-based area matching method and an edge-based distance matching method. In our experiments, the proposed method showed robust tracking results for complex hand motions in a cluttered background.

Prosodic Break Index Estimation using LDA and Tri-tone Model (LDA와 tri-tone 모델을 이용한 운율경계강도 예측)

  • 강평수;엄기완;김진영
    • The Journal of the Acoustical Society of Korea
    • /
    • v.18 no.7
    • /
    • pp.17-22
    • /
    • 1999
  • In this paper we propose a new mixed method of LDA and tri-tone model to predict Korean prosodic break indices(PBI) for a given utterance. PBI can be used as an important cue of syntactic discontinuity in continuous speech recognition(CSR). The model consists of three steps. At the first step, PBI was predicted with the information of syllable and pause duration through the linear discriminant analysis (LDA) method. At the second step, syllable tone information was used to estimate PBI. In this step we used vector quantization (VQ) for coding the syllable tones and PBI is estimated by tri-tone model. In the last step, two PBI predictors were integrated by a weight factor. The proposed method was tested on 200 literal style spoken sentences. The experimental results showed 72% accuracy.

  • PDF

ACOUSTIC FEATURES DIFFERENTIATING KOREAN MEDIAL LAX AND TENSE STOPS

  • Shin, Ji-Hye
    • Proceedings of the KSPS conference
    • /
    • 1996.10a
    • /
    • pp.53-69
    • /
    • 1996
  • Much research has been done on the rues differentiating the three Korean stops in word initial position. This paper focuses on a more neglected area: the acoustic cues differentiating the medial tense and lax unaspirated stops. Eight adult Korean native speakers, four males and four females, pronounced sixteen minimal pairs containing the two series of medial stops with different preceding vowel qualities. The average duration of vowels before lax stops is 31 msec longer than before their tense counterparts (70 msec for lax vs 39 msec for tense). In addition, the average duration of the stop closure of tense stops is 135 msec longer than that of lax stops (69 msec for lax vs 204msec for tense). THESE DURATIONAL DIFFERENCES ARE 50 LARGE THAT THEY MAY BE PHONOLOGICALLY DETERMINED, NOT PHONETICALLY. Moreover, vowel duration varies with the speaker's sex. Female speakers have 5 msec shorter vowel duration before both stops. The quality of voicing, tense or lax, is also a cue to these two stop types, as it is in initial position, but the relative duration of the stops appears to be much more important cues. The duration of stops changes the stop perception while that of preceding vowel does not. The consequences of these results for the phonological description of Korean as well as the synthesis and automatic recognition of Korean will be discussed.

  • PDF

Speech sound and personality impression (말소리와 성격 이미지)

  • Lee, Eunyung;Yuh, Heaok
    • Phonetics and Speech Sciences
    • /
    • v.9 no.4
    • /
    • pp.59-67
    • /
    • 2017
  • Regardless of their intention, listeners tend to assess speakers' personalities based on the sounds of the speech they hear. Assessment criteria, however, have not been fully investigated to indicate whether there is any relationship between the acoustic cue of produced speech sounds and perceived personality impression. If properly investigated, the potential relationship between these two will provide crucial insights on the aspects of human communications and further on human-computer interaction. Since human communications have distinctive characteristics of simultaneity and complexity, this investigation would be the identification of minimum essential factors among the sounds of speech and perceived personality impression. The purpose of this study, therefore, is to identify significant associations between the speech sounds and perceived personality impression of speaker by the listeners. Twenty eight subjects participated in the experiment and eight acoustic parameters were extracted by using Praat from the recorded sounds of the speech. The subjects also completed the Neo-five Factor Inventory test so that their personality traits could be measured. The results of the experiment show that four major factors(duration average, pitch difference value, pitch average and intensity average) play crucial roles in defining the significant relationship.

Conveying Emotions Through CMC: A Comparative Study of Memoji, Emoji, and Human Face

  • Eojin Kim;Yunsun Alice Hong;Kwanghee Han
    • Science of Emotion and Sensibility
    • /
    • v.26 no.4
    • /
    • pp.93-102
    • /
    • 2023
  • Emojis and avatars are widely used in online communications, but their emotional conveyance lacks research. This study aims to contribute to the field of emotional expression in computer-mediated communication (CMC) by exploring the effectiveness of emotion recognition, the intensity of perceived emotions, and the perceived preferences for emojis and avatars as emotional expression tools. The following were used as stimuli: 12 photographs from the Yonsei-Face database, 12 Memojis that reflected the photographs, and 6 iOS emojis. The results of this study indicate that emojis outperformed other forms of emotional expression in terms of conveying emotions, intensity, and preference. Indeed, the study findings confirm that emojis remain the dominant form of emotional signals in CMC. In contrast, the study revealed that Memojis were inadequate as an expressive emotional cue. Participants did not perceive Memojis to effectively convey emotions compared with other forms of expression, such as emojis or real human faces. This suggests room for improvement in the design and implementation of Memojis to enhance their effectiveness in accurately conveying intended emotions. Addressing the limitations of Memojis and exploring ways to optimize their emotional expressiveness necessitate further research and development in avatar design.

A Study on the Effects of Quality Evaluation Cues on Private Brands Purchasing Behavior (유통업체 상표의 구매행동에 관한 실증적 연구)

  • Kim, Yong-Mahn;Kang, Seok-Jeong;Byeon, Choong-Kyu
    • Journal of Global Scholars of Marketing Science
    • /
    • v.7
    • /
    • pp.353-374
    • /
    • 2001
  • Price and brand are two major attributes of products that consumer purchases. Price is important because it is often a measure of worth and quality. Some consumers purchase only well-known national brands. However, By reason of the price competition on account of new business condition and depressions, and consumers practical and rational purchasing tendency, consumers tend to purchase private brands(PB hereafter) because as consumers they expect that producers have reasonable and acceptable quality. Accordingly, The study, with intrinsic cue, extrinsic cue, familiarity anything like these cues from the study of Richardson et aI(1994, 1996), intends to present current topics we guide in retailer's promotion strategy for PB. As for investigating how quality evaluation has on effect on the private brands purchasing behavior of discount store grocery items. This study establishes a hypotheses on the basis of the quality evaluation cues of PB and literature review for purchasing behavior and collects materials for consumers about 196, and also analyzes them using a variety of SPSS/PC+package program. Therefore, the findings of this study provide the following managerial implications. 1) Retailer will successful in increasing private brand market share through dramatic improvement in package design, labeling, advertising, and branding strategies. 2) Planned Purchasers have high intention to repurchase PB because they buy them reasonably in accordance with the estimate therefore, they might have word-of-mouth effect for the evaluation of quality and recognition. They need to acknowledge benefits for PB purchases to maintain purchase like that. 3) The main consumers are housewives in their thirties and forties and they something reasonably because they have a lot of family and retailer will work out.

  • PDF

A Study on Public Needs for Privately Owned Public Space (실내공적공간의 공공성에 관한 연구)

  • Yun, Ji-Hye;Kim, Jung-Gon
    • Korean Institute of Interior Design Journal
    • /
    • v.15 no.5 s.58
    • /
    • pp.157-166
    • /
    • 2006
  • Recently, it appears several counterproposals about desirable figures of urban architecture. All of them proposes 'publicity' with cohernt tendency. The reason why it concentrates quantitative expansion of city without united design by urban plannar is that neglect quality values of city. As a solution of poor environment, there cue out the various efforts, about problem of each building, problem of city space, problem of laws and so forth. The reason why necessity of public space was embossed in that architecture extend the activity of citizen and make up the city space. But, each building pursues the private interest, so it is difficult to secure a public space with a high hand. Thus, architecture law has been revised in 1991 and bring the system of open space to match up the publicity and the private interest. Actually, western country brought it and obtained excellent results. While quantity of open space have increased since 1991, a lot of problems revealed in real usage and quality. By means of problem's solution, this study focus on the diversion of recognition for necessity of various open space. In result, on the occasion of approach and openess, except for several building, most glass a facade and the pedestrian can approach easily. Moreover, office buildings near the subway station connected with their low floor. So, the office buildings give openess to pedestrian and a people can approach easily to the buildings. On the occasion of amenity, most have bank and lobby on the first floor and have facilities on the underground floor. It leave open. But the reason why they have bank and lobby is that the space is dry and boring(without elements of nature and rest space). Hence, to make a space full of vitality, it have to plan various design elements and facilities. First of all, plan of indoor public space have to make up facility for the public interest. This study is basic investigation for necessity of indoor public space and through the survey of office buildings, it analyze the character of plan and find out the method of publicity's realization.

Aerial Video Summarization Approach based on Sensor Operation Mode for Real-time Context Recognition (실시간 상황 인식을 위한 센서 운용 모드 기반 항공 영상 요약 기법)

  • Lee, Jun-Pyo
    • Journal of the Korea Society of Computer and Information
    • /
    • v.20 no.6
    • /
    • pp.87-97
    • /
    • 2015
  • An Aerial video summarization is not only the key to effective browsing video within a limited time, but also an embedded cue to efficiently congregative situation awareness acquired by unmanned aerial vehicle. Different with previous works, we utilize sensor operation mode of unmanned aerial vehicle, which is global, local, and focused surveillance mode in order for accurately summarizing the aerial video considering flight and surveillance/reconnaissance environments. In focused mode, we propose the moving-react tracking method which utilizes the partitioning motion vector and spatiotemporal saliency map to detect and track the interest moving object continuously. In our simulation result, the key frames are correctly detected for aerial video summarization according to the sensor operation mode of aerial vehicle and finally, we verify the efficiency of video summarization using the proposed mothed.