• 제목/요약/키워드: linguistic feature

검색결과 61건 처리시간 0.022초

보편 양화사 (Universal Quantifier)에 대한 아동들의 해석 양상 (Universal Quantification by Children)

  • 강혜경
    • 한국언어정보학회지:언어와정보
    • /
    • 제5권2호
    • /
    • pp.39-55
    • /
    • 2001
  • This paper investigates the idiosyncratic understanding of universal quantifiers such as every, each or all by young children at the ages of 4 to 7, and argues that the phenomenon is explicable in terms of the maturation of both the cognitive system and the linguistic system. Evidence for this dual explanation comes from the fact that the visual input, a picture, plays a key role in determining the children's conceptual representation, suggesting the need for the central integration of visual and linguistic elements; and from the fact that a quantifier in the linguistic input has an intrinsic property, i.e. a <+focus> feature. I have tried to explain the nature of the cognitive factors in terms of the function of the central system, suggesting a modified form of Smith & Tsimpli's (1995) yersion of Fodor's (1983) modularity hypothesis. The categorial status of the quantifier in the children's interpretation is considered, focusing on the movement of that quantifier out of its own extended projection to FP. It is claimed that children initially treat quantifiers as modifiers, rather than functional heads, and that the phenomenon of quantifier spreading by children can be attributed to delay in the development of the relevant functional category, i.e., DP (or QP), in language acquisition.

  • PDF

음성인식을 위한 주파수 부대역별 효과적인 특징추출 (Effective Feature Extraction in the Individual frequency Sub-bands for Speech Recognition)

  • 지상문
    • 한국정보통신학회논문지
    • /
    • 제7권4호
    • /
    • pp.598-603
    • /
    • 2003
  • 본 논문에서는 주파수 부대역마다 최적의 특징추출을 위해서, 음성인식률을 기준으로 최적의 방법을 선택한다. 다중대역 음성인식 접근을 사용하여 각기 다른 주파수 영역에서 특징벡터를 독립적으로 추출함으로써 부대역별로 다른 특징추출 방법을 적용할 수 있었다. 저주파 대역의 음성은 비교적 스펙트럼의 구조가 명확하므로 전극모델을 사용하는 것이 효과적이었고, 고주파 대역에서는 비모수적인 변환방법인 이산 코사인 변환을 사용한 켑스트럼이 효과적이었다. 부대역별로 효과적인 특징추출 방법을 사용함으로써, 각 주파수 부대역에 포함된 음성인식을 위한 언어정보를 보다 효과적으로 추출할 수 있었다. 음성인식 실험결과, 제안한 방법은 전대역 특징추출보다 우수한 성능을 나타내었다.

Comparing Machine Learning Classifiers for Movie WOM Opinion Mining

  • Kim, Yoosin;Kwon, Do Young;Jeong, Seung Ryul
    • KSII Transactions on Internet and Information Systems (TIIS)
    • /
    • 제9권8호
    • /
    • pp.3169-3181
    • /
    • 2015
  • Nowadays, online word-of-mouth has become a powerful influencer to marketing and sales in business. Opinion mining and sentiment analysis is frequently adopted at market research and business analytics field for analyzing word-of-mouth content. However, there still remain several challengeable areas for 1) sentiment analysis aiming for Korean word-of-mouth content in film market, 2) availability of machine learning models only using linguistic features, 3) effect of the size of the feature set. This study took a sample of 10,000 movie reviews which had posted extremely negative/positive rating in a movie portal site, and conducted sentiment analysis with four machine learning algorithms: naïve Bayesian, decision tree, neural network, and support vector machines. We found neural network and support vector machine produced better accuracy than naïve Bayesian and decision tree on every size of the feature set. Besides, the performance of them was boosting with increasing of the feature set size.

한중 자동 문서분류를 위한 최적 자질어 비교 (Comparison Between Optimal Features of Korean and Chinese for Text Classification)

  • 임미영;강신재
    • 한국지능시스템학회논문지
    • /
    • 제25권4호
    • /
    • pp.386-391
    • /
    • 2015
  • 본 논문에서는 한국어와 중국어의 언어학적인 특징을 고려하여 문서 자동분류 시스템의 성능을 높일 수 있는 최적의 자질어 단위를 제안한다. 언어 종속적 단위인 형태소 자질어와 언어 독립적 단위인 n-gram 자질어 그리고 이들을 조합한 복합 자질어 집합을 대상으로 각 언어의 인터넷 신문기사를 SVM으로 분류하는 실험을 수행하였다. 실험 결과, 한국어 문서분류에서는 bi-gram이 F1-measure 87.07%로 가장 좋은 분류 성능을 보였고, 중국어 문서분류에서는 'uni-gram 명사 동사 형용사 사자성어'의 복합 자질어 집합이 F1-measure 82.79%로 가장 좋은 성능을 보였다.

보편 양화사 (UNIVERSAL QUANTIFIER)에 대한 아동들의 해석 양상

  • 강혜경
    • 한국언어정보학회:학술대회논문집
    • /
    • 한국언어정보학회 2001년도 학술대회 논문집
    • /
    • pp.237-257
    • /
    • 2001
  • This paper investigates the idiosyncratic understanding of universal quantifiers such as every, each or all by young children at the ages of 4 to 7, and argues that the phenomenon is explicable in terms of the maturation of both the cognitive system and the linguistic system. Evidence for this dual explanation comes from the fact that the visual input, a picture, plays a key role in determining the children’s conceptual representation, suggesting the need for the central integration of visual and linguistic elements; and from the fact that a quantifier in the linguistic input has an intrinsic property, i.e. a <+focus> feature. I have tried to explain the nature of the cognitive factors in terms of the function of the central system, suggesting a modified form of Smith & Tsimpli’s (1995) version of Fodor’s (1983) modularity hypothesis. Conceptual representations of two kinds are in competition with each other and they are integrated into a neutral LOT (Language of Thought) representation at some point . In the process of this integration, the representations from the visual input predominate over those from the auditory input, though the quantize. (treated as new information provided by the latter) is salient in the final representations. When visual conceptual representations predominate over purely linguistic ones, quantifier spreading errors occur. By contrast, when the relevant grammatical knowledge has developed sufficiently to counteract the conceptual representations, this peculiar behaviour by children should disappear. It is argued that children have to learn two kinds of grammatical fact with regard to universal quantification: (i) they have to learn the status of the quantifier as a functional head of DP so that it has to be positioned inside DP; and (ii) they have to learn the Left-Branch Condition which specifies that movement of an element in the left-branch position is possible only by pied-piping the entire phrase.

  • PDF

몽고복식어휘의 비교 연구 (A Comparative Study on the Mongolic Costume Terminologies)

  • 김진구
    • 복식문화연구
    • /
    • 제2권2호
    • /
    • pp.297-306
    • /
    • 1994
  • This study is concerned with a comparative linguistic analysis of the Mongolian costume terminologies. It was found that the Mongolian costume terminologies were related to the costume terminologies of English, Indian, Chinese, Arabic Uigur, Tibetan, Persian, Turkish, Hebrew, Assyrian. The influence of the Chinese costume terminologies was fond in the terminologies of the fabrics and the precious stones of the Mongolians. Also, the influence of the Indian costume terminologies were shown in the Mongolian costume terminologies. A characteristic feature in the Mogolian costume terminologies of the fabrics and the precious stones showed the elements of the foreign terminologies. This factor amy be due to the fact that originally the Mongolians ere the nomades.

  • PDF

Multimodal Context Embedding for Scene Graph Generation

  • Jung, Gayoung;Kim, Incheol
    • Journal of Information Processing Systems
    • /
    • 제16권6호
    • /
    • pp.1250-1260
    • /
    • 2020
  • This study proposes a novel deep neural network model that can accurately detect objects and their relationships in an image and represent them as a scene graph. The proposed model utilizes several multimodal features, including linguistic features and visual context features, to accurately detect objects and relationships. In addition, in the proposed model, context features are embedded using graph neural networks to depict the dependencies between two related objects in the context feature vector. This study demonstrates the effectiveness of the proposed model through comparative experiments using the Visual Genome benchmark dataset.

Modality-Based Sentence-Final Intonation Prediction for Korean Conversational-Style Text-to-Speech Systems

  • Oh, Seung-Shin;Kim, Sang-Hun
    • ETRI Journal
    • /
    • 제28권6호
    • /
    • pp.807-810
    • /
    • 2006
  • This letter presents a prediction model for sentence-final intonations for Korean conversational-style text-to-speech systems in which we introduce the linguistic feature of 'modality' as a new parameter. Based on their function and meaning, we classify tonal forms in speech data into tone types meaningful for speech synthesis and use the result of this classification to build our prediction model using a tree structured classification algorithm. In order to show that modality is more effective for the prediction model than features such as sentence type or speech act, an experiment is performed on a test set of 970 utterances with a training set of 3,883 utterances. The results show that modality makes a higher contribution to the determination of sentence-final intonation than sentence type or speech act, and that prediction accuracy improves up to 25% when the feature of modality is introduced.

  • PDF

조선시대 여성저고리의 그리드체계 (The Grid System of Women's Jeogori in Joseon Dynasty)

  • 한은혜
    • 복식
    • /
    • 제62권6호
    • /
    • pp.200-217
    • /
    • 2012
  • The purpose of this research is to examine the specificity of grids to define the characteristics of clothes styles in the Joseon Dynasty period. The significance of examining of the specificity of grids is to find out arbitrary types of the features of grids involved in structuring the Jeogori in the Joseon Dynasty period one by one. The Visual Linguistic Theory was introduced as a methodological tool to exquisitely analyze the characteristics of grids in deep structures of Jeogori in the Joseon Dynasty period. This theory strives to examine sample distribution, the distribution of samples by quality and the distribution of the types of ploidy features. Through the examination, the results are as follows. The grid systems of the Jeogori consisted of diverse proportion systems reaching 86 cases, that is, sequence systems composed of multi-functional, multi-combined bodies. Most ornamental grids had feature angles distributed in a range of $2-20^{\circ}$ that showed a common preference for low sloped diagonal lines or small curvature. Although the preference for certain feature angles were prominent, the feature angles that were used were generally distributed evenly among diverse feature angles to show the characteristics of separation. Therefore, Jeogori makers in the Joseon Dynasty period can be considered as having experimented with many proportion systems to show their aesthetics. In conclusion, based on the results of the examination of feature distributions and related methods to allocate ploidy features, O-type accounted for 66% and thus it was identified that the Jeogori was characterized by O-type. Therefore, it was identified that the characteristic of the Jeogori in the Joseon Dynasty period consisted of O-type fractal structures which are formative structures unique to our nation.

An EMG Study of the Tense-lax Distinction Theory

  • Kim, Dae-Won
    • 음성과학
    • /
    • 제1권
    • /
    • pp.7-26
    • /
    • 1997
  • An electromyographic device was used to investigate the relationship between a linguistic hypothesis of tense-lax distinction and muscular activity. Muscle action potentials of the orbicularis oris muscle and the depressor anguli oris muscle were obtained from four subjects using CVCVCV and CVCVC words in English and VCV and CVC words in Korean. Findings: The hypothesis that the speaker may select at least one of muscles involved in the articulation of a phoneme so that the selected muscle could be activated for tense-lax distinction, and either a timing variable or an amplitude variablethe and/or both from the selected muscle distinguish(es) /p/ from /b/ in English and /$p^{h},\;p^{l}$/ from /p/ in Korean, with the English /p/ and the Korean /$p^{h},\;p^{l}$/ being tense, and the Korean unaspirated /p/ and the English /b/ lax, has been verified, except for the case with subject 2 in stressed syllables in English. (2) Thus, the linguistic hypothesis of tense-lax distinction was strongly supported by the muscular activities during the Korean bilabial stops, with /$p^{h}\;and\;p^{l}$/ being tense and /p/ lax. (3) Considering the intermuscle compensation and the interspeaker variabilities in the choice of a muscle or muscles, in English the usability of the feature 'tensity' appeared to be positive rather than negative although further investigations with more subjects remain to take on the muscles associated with the onset/offset of the labial closure, including the respiratory muscles related with the aspiration. The phoneme-sensitive EMG manifestations of stress and possible reasons for the interspeaker variabilities are discussed.

  • PDF