• Title/Summary/Keyword: Visual Word Recognition


Visual and Phonological Neighborhood Effects in Computational Visual Word Recognition Model (계산주의적 시각단어재인 모델에서의 시각이웃과 음운이웃 효과)

  • Lim, Heui-Seok;Park, Ki-Nam;Nam, Ki-Chun
    • Journal of the Korea Academia-Industrial cooperation Society
    • /
    • v.8 no.4
    • /
    • pp.803-809
    • /
    • 2007
  • This study proposes a computational model for investigating the roles of phonological and orthographic information in visual word recognition, one stage of language information processing, and the representational form of the mental lexicon. The model is a feed-forward network consisting of an input layer that takes two Korean syllables as its input, a hidden layer, and an output layer that represents meaning. The model reproduced the phonological and orthographic neighborhood effects observed in Korean word recognition and provided evidence suggesting that the mental lexicon is represented as phonological information during Korean word recognition.

  • PDF

A Salient Based Bag of Visual Word Model (SBBoVW): Improvements toward Difficult Object Recognition and Object Location in Image Retrieval

  • Mansourian, Leila;Abdullah, Muhamad Taufik;Abdullah, Lilli Nurliyana;Azman, Azreen;Mustaffa, Mas Rina
    • KSII Transactions on Internet and Information Systems (TIIS)
    • /
    • v.10 no.2
    • /
    • pp.769-786
    • /
    • 2016
  • Object recognition and object location have long drawn much interest, and various computational models have recently been designed. A major issue in this domain is the lack of an appropriate model for extracting the important part of a picture and estimating the object's place in the same environment, which has led to low accuracy. To solve this problem, a new Salient Based Bag of Visual Word (SBBoVW) model for object recognition and object location estimation is presented. The contributions of the present study are twofold. The first is a new approach, the SBBoVW model, for recognizing difficult objects on which previous methods had low accuracy. This method extracts SIFT features from the original picture and from its salient parts and fuses them to generate better codebooks using the bag of visual word method. The second is a new algorithm that automatically finds the object's place based on the saliency map. Performance evaluation on several data sets shows that the new approach outperforms other state-of-the-art methods.

Subword-based Lip Reading Using State-tied HMM (상태공유 HMM을 이용한 서브워드 단위 기반 립리딩)

  • Kim, Jin-Young;Shin, Do-Sung
    • Speech Sciences
    • /
    • v.8 no.3
    • /
    • pp.123-132
    • /
    • 2001
  • In recent years, research on HCI technology has been very active, and speech recognition is one of its typical methods. Recognition performance, however, deteriorates as surrounding noise increases. To solve this problem, multimodal HCI is being actively studied. This paper describes automated lipreading for bimodal speech recognition based on image and speech information. It employs an audio-visual DB containing 1,074 words from 70 voices, the tri-viseme as the recognition unit, and a state-tied HMM as the recognition model. Automated recognition performance is evaluated for vocabularies of 22 to 1,000 words, with word recognition of 60.5% achieved by the 22-word recognizer.

  • PDF

The Neighborhood Effect in Korean Visual Word Recognition (한국어 시각단어재인에서 나타나는 이웃효과)

  • Kwon, You-An;Cho, Hyae-Suk;Kim, Choong-Myung;Nam, Ki-Chun
    • MALSORI
    • /
    • no.60
    • /
    • pp.29-45
    • /
    • 2006
  • We investigated whether the first syllable plays an important role in lexical access in Korean visual word recognition. To do so, one lexical decision task (LDT) experiment and two form-primed LDT experiments examined the nature of the syllabic neighborhood effect. In Experiment 1, syllabic neighborhood density and syllabic neighborhood frequency were manipulated. The results showed that lexical decision latencies were influenced only by syllabic neighborhood frequency. The purpose of Experiment 2 was to confirm the results of Experiment 1 with a form-primed LDT. Lexical decision latency was slower in the form-related condition than in the form-unrelated condition. The effect of syllabic neighborhood density was significant only in the form-related condition. This means that the first syllable plays an important role in the sub-lexical process. In Experiment 3, we conducted another form-primed LDT, manipulating the number of syllabic neighbors of words with higher-frequency neighbors. The interaction of syllabic neighborhood density and form relation was significant. This result confirmed that, at the lexical level, words with higher-frequency neighbors are more inhibited by neighbors sharing the first syllable than words with no higher-frequency neighbors. These findings suggest that the first syllable is the unit of the neighborhood and that the syllable is the unit of sub-lexical representation in Korean.

  • PDF
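The two syllabic neighborhood measures named in the abstract above can be illustrated with a minimal sketch. It assumes that a word's syllabic neighbors are the other words sharing its first syllable, that density is the count of such neighbors, and that the frequency measure asks whether any neighbor is more frequent than the word itself. The lexicon and its frequency counts are hypothetical, invented for illustration, and are not the study's materials.

```python
from collections import defaultdict


def syllabic_neighborhood(lexicon):
    """Map each word to (density, has_higher_frequency_neighbor).

    lexicon: dict of word -> frequency count.
    Assumption: words are precomposed (NFC) Hangul strings, so word[0]
    is the whole first syllable block.
    """
    by_first_syllable = defaultdict(list)
    for word in lexicon:
        by_first_syllable[word[0]].append(word)

    stats = {}
    for word, freq in lexicon.items():
        neighbors = [w for w in by_first_syllable[word[0]] if w != word]
        density = len(neighbors)
        has_higher = any(lexicon[w] > freq for w in neighbors)
        stats[word] = (density, has_higher)
    return stats


# Hypothetical toy lexicon with made-up frequency counts.
toy_lexicon = {"가방": 120, "가수": 300, "가족": 900,
               "나무": 450, "나비": 80, "도시": 200}
stats = syllabic_neighborhood(toy_lexicon)
for word, (density, has_higher) in stats.items():
    print(word, density, has_higher)
```

Under these assumptions, "가방" has two syllabic neighbors and at least one of them is more frequent, while "도시" has none.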

The Cerebral Activation of the Emotional and Linguistic Attributes during Visual Word Recognition: fMRI Study (시각 단어 재인동안 정서적 속성과 언어적 속성에 의해 활성화되는 대뇌 영역 : fMRI 연구)

  • Park, Chang-Su;Han, Jong-Hye;Choi, Moon-Gee;Nam, Ki-Chun
    • Proceedings of the Korean Society for Cognitive Science Conference
    • /
    • 2006.06a
    • /
    • pp.53-58
    • /
    • 2006
  • We examined the cerebral activation of emotional and linguistic attributes during visual word recognition. This research investigated the affective priming effect while preserving the behavioral paradigm. We used the primed-evaluation task, in which participants classify the target as positive or negative, and manipulated the emotional attributes through the emotional relations of the prime-target word pairs (PP, PN, NP, NN). ROI analyses for semantic processing and emotional processing were performed. The results showed that the semantic processing areas, including the IPL, SMG, and aSTS, were activated differently according to the experimental condition. Activation of the IPL increased only in the NN condition, whereas activation of the SMG decreased only in the PP condition. Furthermore, the activation of the emotional processing areas, including the mPFC and ACC, differed according to the emotional relations of the word pairs. Similar to the SMG, the BOLD signal of the mPFC decreased only in the PP condition, whereas activation of the ACC increased only in the NN condition. These results appear to show interactive cerebral activations for processing the emotional and linguistic attributes of a word during visual word recognition.

  • PDF

The Neighborhood Effects in Korean Word Recognition Using Computation Model (계산주의적 모델을 이용한 한국어 시각단어 재인에서 나타나는 이웃효과)

  • Park, Ki-Nam;Kwon, You-An;Lim, Heui-Seok;Nam, Ki-Chun
    • Proceedings of the KSPS conference
    • /
    • 2007.05a
    • /
    • pp.295-297
    • /
    • 2007
  • This study proposes a computational model for investigating the roles of phonological and orthographic information in visual word recognition, one stage of language information processing, and the representational form of the mental lexicon. The model reproduced the phonological and orthographic neighborhood effects observed in Korean word recognition and provided evidence suggesting that the mental lexicon is represented as phonological information during Korean word recognition.

  • PDF

The Effect of Spatial Attention in Hangul Word Recognition: Depending on Visual Factors (한글 단어 재인에서 시각적 요인에 따른 공간주의의 영향)

  • Ko Eun Lee;Hye-Won Lee
    • Korean Journal of Cognitive Science
    • /
    • v.34 no.1
    • /
    • pp.1-20
    • /
    • 2023
  • In this study, we examined the effects of spatial attention in Hangul word recognition depending on visual factors. The visual complexity of words (Experiment 1) and contrast (Experiment 2) were manipulated to examine whether the effect of spatial attention differs depending on visual quality. Participants responded to words with and without codas in Experiment 1 and to words in high-contrast and low-contrast conditions in Experiment 2. The effects of spatial attention were measured as cuing effects, calculated as the difference in performance between the condition where spatial cues were given at the target location (valid trials) and the condition where they were not (invalid trials). The cuing effects were similar across levels of word complexity, indicating that the effects of spatial attention did not differ across the visual complexity conditions. The cuing effects were greater in the low-contrast condition than in the high-contrast condition. The greater effect of spatial attention under low contrast was explained as a mechanism of signal enhancement.
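The cuing-effect calculation described in the abstract above can be sketched as follows. This is a minimal illustration, assuming that performance is measured as mean reaction time and that the effect is computed as invalid minus valid. The trial records, field names, and reaction times are hypothetical and are not data from the study.

```python
from statistics import mean


def cuing_effect(trials, condition):
    """Mean RT on invalid-cue trials minus mean RT on valid-cue trials."""
    valid = [t["rt"] for t in trials
             if t["condition"] == condition and t["cue"] == "valid"]
    invalid = [t["rt"] for t in trials
               if t["condition"] == condition and t["cue"] == "invalid"]
    return mean(invalid) - mean(valid)


# Hypothetical reaction times in milliseconds, invented for illustration.
trials = [
    {"condition": "high_contrast", "cue": "valid", "rt": 500},
    {"condition": "high_contrast", "cue": "valid", "rt": 520},
    {"condition": "high_contrast", "cue": "invalid", "rt": 530},
    {"condition": "high_contrast", "cue": "invalid", "rt": 550},
    {"condition": "low_contrast", "cue": "valid", "rt": 600},
    {"condition": "low_contrast", "cue": "valid", "rt": 620},
    {"condition": "low_contrast", "cue": "invalid", "rt": 680},
    {"condition": "low_contrast", "cue": "invalid", "rt": 700},
]

high = cuing_effect(trials, "high_contrast")
low = cuing_effect(trials, "low_contrast")
print(high, low)
```

A larger value for one condition than another would correspond to a greater effect of spatial attention in that condition, as the abstract reports for low contrast.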

Variables affecting Korean word recognition: focusing on syllable shape (한글 단어 재인에 영향을 미치는 변인: 음절 형태를 중심으로)

  • Min, Suyoung;Lee, Chang H.
    • Korean Journal of Cognitive Science
    • /
    • v.29 no.4
    • /
    • pp.193-220
    • /
    • 2018
  • Recent studies have demonstrated that word frequency, word length, neighborhood, and word shape may play a role in visual word recognition. Shape information may affect word processing in different ways because the Korean writing system works differently from that of English. The purpose of this study was to apply the Gestalt continuity principle to the Korean alphabetic script (Hangul), to investigate the processing unit of Hangul, and to verify whether syllable shape affects word recognition in Hangul. In Experiment 1, three-syllable words were used and two variables were manipulated: 1) syllable type (horizontal syllable shape, e.g., "가"; vertical syllable shape, e.g., "고") and 2) presentation direction (horizontal, vertical). Whereas "가" meets the criteria of the Gestalt continuity principle, "고" does not. In lexical decision times, the horizontal syllable shape type showed significantly better performance than the vertical syllable shape type, regardless of presentation direction. In Experiment 2, syllable type (horizontal syllable shape, vertical syllable shape) and the visual relationship between prime and target (identical, similar, different) were manipulated using masked priming. Performance differed significantly across the visual relationships between prime and target, and thus the effect of syllable shape was verified.

The Effect of the Orthographic and Phonological Priming in Korean Visual Word Recognition (한국어 시각 단어재인과정에서 음운정보와 표기정보의 역할)

  • Tae, Jini;Lee, ChangHwan;Lee, Yoonhyoung
    • Korean Journal of Cognitive Science
    • /
    • v.26 no.1
    • /
    • pp.1-26
    • /
    • 2015
  • The purpose of this study was to examine whether phonological information or orthographic information plays the major role in visual word recognition. To do so, we used a non-word lexical decision task (LDT) in Experiment 1 and masked priming tasks in Experiments 2 and 3. The results of Experiment 1 showed that reaction times and error rates were affected by the orthographic characteristics of the non-word stimuli: the orthographically similar non-word condition showed longer reaction times and higher error rates than the control condition. In Experiments 2 and 3, participants performed masked priming lexical decision tasks in two SOA conditions (60 ms, 150 ms). The results of both experiments showed that orthographically identical first-syllable priming facilitated lexical decisions on the target words, whereas neither pseudo-homophone priming nor phonologically identical first-syllable priming did. The dual route hypothesis (Coltheart et al., 2001), which assumes that orthographic information rather than phonological information is the major source for visual word recognition processes, fits well with the results of the current study.

A Novel Integration Scheme for Audio Visual Speech Recognition

  • Pham, Than Trung;Kim, Jin-Young;Na, Seung-You
    • The Journal of the Acoustical Society of Korea
    • /
    • v.28 no.8
    • /
    • pp.832-842
    • /
    • 2009
  • Automatic speech recognition (ASR) has been successfully applied to many real human computer interaction (HCI) applications; however, its performance tends to decrease significantly in noisy environments. Audio visual speech recognition (AVSR), which uses an acoustic signal and lip motion, has recently attracted more attention because of its robustness to noise. In this paper, we describe a novel integration scheme for AVSR based on a late integration approach. First, we introduce a robust reliability measurement for the audio and visual modalities using model-based information and signal-based information. The model-based sources measure the confusability of the vocabulary, while the signal is used to estimate the noise level. Second, the output probabilities of the audio and visual speech recognizers are each normalized before the final integration step, which uses the normalized output space and the estimated weights. We evaluate the performance of the proposed method on a Korean isolated word recognition system. The experimental results demonstrate the effectiveness and feasibility of the proposed system compared to conventional systems.
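The late integration step outlined in the abstract above can be illustrated with a generic sketch. It is not the paper's scheme: it assumes each recognizer emits one log-likelihood per vocabulary word, normalizes each stream into log-posteriors, and combines them with a single audio weight that is supplied by hand rather than estimated from reliability measures. All function names and scores are hypothetical.

```python
import math


def log_normalize(scores):
    """Turn raw log-likelihoods into log-posteriors that sum to 1."""
    peak = max(scores)
    log_z = peak + math.log(sum(math.exp(s - peak) for s in scores))
    return [s - log_z for s in scores]


def late_fusion(audio_scores, visual_scores, audio_weight):
    """Weighted sum of normalized stream scores; returns (best_index, fused)."""
    audio = log_normalize(audio_scores)
    visual = log_normalize(visual_scores)
    fused = [audio_weight * a + (1.0 - audio_weight) * v
             for a, v in zip(audio, visual)]
    best = max(range(len(fused)), key=fused.__getitem__)
    return best, fused


# Hypothetical log-likelihoods for a three-word vocabulary.
audio_scores = [-10.0, -9.5, -12.0]   # audio slightly prefers word 1
visual_scores = [-4.0, -7.0, -6.0]    # visual clearly prefers word 0

best_clean, _ = late_fusion(audio_scores, visual_scores, audio_weight=0.9)
best_noisy, _ = late_fusion(audio_scores, visual_scores, audio_weight=0.3)
print(best_clean, best_noisy)
```

Lowering the audio weight, as one might when the estimated noise level is high, shifts the decision toward the visual stream's preferred word.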