• Title/Summary/Keyword: linguistic contexts

Automatic Transcription of the Union Symbols in Korean Texts (한국어 텍스트에 사용된 이음표의 자동 전사)

  • 윤애선;권혁철
    • Language and Information
    • /
    • v.7 no.1
    • /
    • pp.23-40
    • /
    • 2003
  • In this paper, we propose Auto-TUS, an automatic transcription module for three union symbols, the hyphen, dash and tilde (‘-’, ‘―’, ‘∼’), that uses their linguistic contexts. Few previous studies have discussed the ambiguities that arise in transcribing these symbols into Korean alphabetic letters. We classify six different reading formulae of the union symbols, analyze the left and right contexts of the symbols, and investigate selection rules and distributions between the symbols and their contexts. Based on these linguistic features, we suggest 86 stereotyped patterns, 78 rules and 8 heuristics that determine the type of reading formula in Auto-TUS. The module works in three steps. The pilot test was conducted with three test suites, which contain 418, 987 and 1,014 clusters of words containing a union symbol, respectively. Encouraging accuracies of 97.36%, 98.48% and 96.55% were obtained for the three test suites. Our next steps are to develop a guessing routine for unknown contexts of the union symbols using statistical information; to refine the module that detects proper nouns and terminology; and to apply Auto-TUS on a larger scale.
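As a rough illustration of choosing a reading from the left and right context of a symbol, the sketch below tries fixed patterns first, then context rules, then a fallback heuristic. The stage order, the reading labels (`RANGE`, `SILENT`, `MINUS`), the function name `classify`, and every pattern and rule are hypothetical stand-ins; the abstract does not list the actual patterns, rules, heuristics, or reading formulae of Auto-TUS. The ASCII `~` and `-` stand in for the symbols in the text.

```python
import re

# Hypothetical reading labels (the paper's six reading formulae are not given in the abstract).
RANGE, SILENT, MINUS = "RANGE", "SILENT", "MINUS"

# Stage 1 (assumed): fixed surface patterns, e.g. phone-number-like strings
# in which the hyphen is not read aloud.
PATTERNS = [(re.compile(r"^\d{2,4}-\d{3,4}-\d{4}$"), SILENT)]


def classify(left: str, symbol: str, right: str) -> str:
    """Pick a reading label for `symbol` from its left and right context."""
    token = f"{left}{symbol}{right}"

    # Stage 1: stereotyped patterns over the whole token.
    for pattern, reading in PATTERNS:
        if pattern.match(token):
            return reading

    # Stage 2: context rules on the characters adjacent to the symbol.
    if symbol == "~" and left[-1:].isdigit() and right[:1].isdigit():
        return RANGE  # numeral ~ numeral reads as a range
    if symbol == "-" and left == "" and right[:1].isdigit():
        return MINUS  # leading hyphen before a numeral reads as a minus sign

    # Stage 3: fallback heuristic when nothing else applies.
    return SILENT


if __name__ == "__main__":
    for args in [("3", "~", "5"), ("", "-", "7"), ("010", "-", "1234-5678")]:
        print(args, "->", classify(*args))
```

Ordering the stages from most specific to most general keeps the fallback from masking a more precise pattern or rule.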

A Simultaneous Design of TSK - Linguistic Fuzzy Models with Uncertain Fuzzy Output

  • Kwak, Keun-Chang;Kim, Dong-Hwa
    • Institute of Control, Robotics and Systems: Conference Proceedings (제어로봇시스템학회:학술대회논문집)
    • /
    • 2005.06a
    • /
    • pp.427-432
    • /
    • 2005
  • This paper is concerned with the simultaneous design of TSK (Takagi-Sugeno-Kang)-linguistic fuzzy models with uncertain model output and a computationally efficient representation. For this purpose, we use the fundamental idea of linguistic models introduced by Pedrycz and develop a comprehensive design framework for them. The design process consists of several main phases: (a) automatic generation of the linguistic contexts from a probabilistic distribution using the CDF (conditional density function) and PDF (probability density function); (b) context-based fuzzy clustering that preserves homogeneity, based on the concept of fuzzy granulation; (c) addition of a bias term to compensate for bias error; and (d) combination of TSK and linguistic contexts in the consequent part. Finally, we compare the performance of the enhanced models with that of other fuzzy models on automobile MPG prediction data and on a coagulant dosing process in a water purification plant.

Automatic Transcription of Three Ambiguous Symbols Used with Arabic Numerals: Period, Colon and Slash. (아라비안 숫자를 동반한 중의적 기호의 자동전사: 온점, 쌍점, 빗금을 중심으로)

  • 윤애선;정영임;권혁철
    • Language and Information
    • /
    • v.8 no.1
    • /
    • pp.117-136
    • /
    • 2004
  • In this paper, we propose Auto-TSS, an automatic transcription module for three ambiguous symbols, the period (.), colon (:) and slash (/), that uses their linguistic contexts. Few previous studies have discussed the ambiguities that arise in reading these symbols into Korean alphabetic letters, a problem that must be addressed to improve current Korean TTS (Text-To-Speech) systems. We classify 9 different reading formulae of the three symbols, analyze their left and right contexts, and investigate selection rules and distributions between the symbols and their contexts. Based on these linguistic features, we investigate 30 stereotyped patterns, 53 rules and 5 heuristics that determine the type of reading formula in Auto-TSS. The module works in 4 steps. The pilot test was conducted with three test suites, which contain 6,979, 3,491 and 2,450 morpheme clusters, respectively, each cluster containing at least one of the three ambiguous symbols and Arabic numeral(s). Encouraging accuracies of 94.3%, 93.0% and 94.2% were obtained for the test suites. Our next steps are to develop a guessing routine for unknown contexts of the union symbols using statistical information; to refine the module that detects proper nouns and terminology; and to apply Auto-TSS on a larger scale.

A Design of an Improved Linguistic Model based on Information Granules (정보 입자에 근거한 개선된 언어적인 모델의 설계)

  • Han, Yun-Hee;Kwak, Keun-Chang
    • Journal of the Institute of Electronics Engineers of Korea CI
    • /
    • v.47 no.3
    • /
    • pp.76-82
    • /
    • 2010
  • In this paper, we develop a Linguistic Model (LM) based on information granules as a systematic approach to generating fuzzy if-then rules from given input-output data. The LM introduced by Pedrycz is built through fuzzy information granulation obtained from Context-based Fuzzy Clustering (CFC). This clustering estimates clusters while preserving the homogeneity of the clustered patterns associated with the input and output data. Although the effectiveness of the LM has been demonstrated in previous work, its performance leaves room for improvement. We therefore focus on the automatic generation of linguistic contexts, the addition of a bias term, and a transformed form of the consequent parameter to improve both the approximation and the generalization capability of the conventional LM. The experimental results show that the improved LM outperforms the LM and conventional approaches on automobile MPG (miles per gallon) prediction and Boston housing data.
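A minimal sketch of the membership update commonly used in context-based (conditional) fuzzy clustering may help here. It assumes the standard conditional fuzzy c-means form, in which the memberships of a pattern across clusters sum to its context membership `f_k` rather than to 1. The function name, argument names, and the default fuzzifier `m = 2` are illustrative assumptions, not details taken from the abstract.

```python
from typing import List, Sequence


def conditional_memberships(distances: Sequence[float], f_k: float, m: float = 2.0) -> List[float]:
    """Memberships of one pattern in each cluster, constrained to sum to f_k.

    distances: distance from the pattern to each cluster prototype (input space).
    f_k: membership of the pattern's output value in the current linguistic context.
    m: fuzzification coefficient (> 1); 2.0 is an assumed default.
    """
    exponent = 2.0 / (m - 1.0)

    # If the pattern coincides with one or more prototypes, split f_k among them
    # to avoid dividing by zero.
    zero = [i for i, d in enumerate(distances) if d == 0.0]
    if zero:
        share = f_k / len(zero)
        return [share if i in zero else 0.0 for i in range(len(distances))]

    return [
        f_k / sum((d_i / d_j) ** exponent for d_j in distances)
        for d_i in distances
    ]


if __name__ == "__main__":
    u = conditional_memberships([1.0, 2.0], f_k=0.8)
    print(u, sum(u))  # the memberships sum to f_k
```

Because the memberships are scaled by `f_k`, patterns whose output barely belongs to a context contribute little to the clusters formed in that context, which is how the clustering keeps input and output data associated.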

Logic-based Fuzzy Neural Networks based on Fuzzy Granulation

  • Kwak, Keun-Chang;Kim, Dong-Hwa
    • Institute of Control, Robotics and Systems: Conference Proceedings (제어로봇시스템학회:학술대회논문집)
    • /
    • 2005.06a
    • /
    • pp.1510-1515
    • /
    • 2005
  • This paper is concerned with Logic-based Fuzzy Neural Networks (LFNN) built with the aid of fuzzy granulation. As the underlying design tools guiding the development of the proposed LFNN, we concentrate on context-based fuzzy clustering, which builds information granules in the form of linguistic contexts, and on the OR fuzzy neuron, a logic-driven processing unit that realizes the composition operations of T-norm and S-norm. The design process comprises several main phases: (a) defining context fuzzy sets in the output space, (b) completing context-based fuzzy clustering in each context, (c) aggregating OR fuzzy neurons into linguistic models, and (d) optimizing the connections linking information granules and fuzzy neurons in the input and output spaces. The model is tested on a two-dimensional nonlinear function. The results show that the proposed model yields better performance than the conventional linguistic model and other approaches.
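The OR fuzzy neuron mentioned above can be sketched as follows: each input is combined with its connection weight by a T-norm, and the results are aggregated by an S-norm. The abstract does not say which norms are used, so the choice of minimum (T-norm) and maximum (S-norm) here is an assumption, as are the function names.

```python
from typing import Sequence


def t_norm(a: float, b: float) -> float:
    """T-norm (fuzzy AND); minimum is an assumed choice."""
    return min(a, b)


def s_norm(a: float, b: float) -> float:
    """S-norm (fuzzy OR); maximum is an assumed choice."""
    return max(a, b)


def or_neuron(x: Sequence[float], w: Sequence[float]) -> float:
    """OR fuzzy neuron: y = S_i (x_i T w_i), with inputs and weights in [0, 1]."""
    if len(x) != len(w):
        raise ValueError("x and w must have the same length")
    y = 0.0  # 0 is the neutral element of an S-norm
    for x_i, w_i in zip(x, w):
        y = s_norm(y, t_norm(x_i, w_i))
    return y


if __name__ == "__main__":
    # max(min(0.2, 1.0), min(0.9, 0.5)) = max(0.2, 0.5) = 0.5
    print(or_neuron([0.2, 0.9], [1.0, 0.5]))
```

A weight near 0 suppresses its input, while a weight near 1 lets the input pass through unchanged, so the weights act as the connections that a later optimization phase could tune.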

An Acoustic Study of the Perceptual Significance of F2 Transition of /w/ in English and Korean

  • Kang, Hyun-Sook
    • Speech Sciences
    • /
    • v.13 no.4
    • /
    • pp.7-21
    • /
    • 2006
  • The intent of the present study is to investigate the acoustic properties of Korean /w/ in various phonological contexts, compare them with those of English /w/, and attempt to explain why English /w/'s are perceived differently by Korean speakers depending on the phonological contexts. Experiments 1 and 2 present the acoustic measure of F2 of Korean /w/ in various linguistic positions and show that unlike English /w/, Korean /w/ shows quite a strong coarticulation with the following vowel. Based on these experiments, Experiment 3 investigates why English /w/ is adapted differently into Korean. Specifically, it discusses why English /wain/ is adapted as /wain/ whereas English /twin/ is adapted into Korean as $/t^{h}iwin/$ with an extra vowel. This study argues that the different perception of English /w/ by Korean and English speakers is due to the different F2 transitional pattern of /w/ in Korean and English in various phonological contexts. It also argues that the F2 transitional pattern is an important factor in the perception of /w/.

Linguistic and Educational Factors Affecting TOEFL Scores: Focusing on Three OECD Countries in EFL contexts

  • Lee, Young-Hwa;Kim, Seon-Jae
    • International Journal of Contents
    • /
    • v.6 no.2
    • /
    • pp.33-40
    • /
    • 2010
  • This study investigates the linguistic and educational factors affecting TOEFL scores, focusing on three OECD countries: Korea, Japan, and Finland. The data come from document analysis of curricula, websites, and literature. The findings reveal that the number of Korean test-takers and their TOEFL scores increased gradually year by year. Finnish test-takers consistently achieved very high scores, and Japanese examinees showed the lowest scores. Korean, Japanese, and Finnish are all distant from English and receive little historical support from the Indo-European family tree. In Finland, however, Swedish, which is an Indo-European language, is still used as an official language alongside Finnish. Korea and Finland introduce English education from Year 3 of primary school, whereas English is not at present an official subject in primary school in Japan. Finnish students are also taught a foreign language in addition to English from primary school. These factors seem to account for the high TOEFL scores of Finnish test-takers. This study concludes that social context, which includes the linguistic and educational environments, is the main factor affecting TOEFL scores.

Durational Interaction of Stops and Vowels in English and Korean Child-Directed Speech

  • Choi, Han-Sook
    • Phonetics and Speech Sciences
    • /
    • v.4 no.2
    • /
    • pp.61-70
    • /
    • 2012
  • The current study observes the durational interaction of tautosyllabic consonants and vowels in the word-initial position of English and Korean child-directed speech (CDS). The effect of phonological laryngeal contrasts in stops on the following vowel duration, and the effect of the intrinsic vowel duration on the release duration of preceding stops, in addition to the acoustic realization of the contrastive segments, are explored in different prosodic contexts (phrase-initial/medial, focal accented/non-focused) in the marked speech style of CDS. A trade-off relationship between Voice Onset Time (VOT), as consonant release duration, and voicing phonation time, as vowel duration, reported for adult-to-adult speech, and patterns of durational variability are investigated in the CDS of two languages with different linguistic rhythms, under systematically controlled prosodic contexts. Speech data were collected from four native English-speaking mothers and four native Korean-speaking mothers talking to their infants at the one-word stage. In addition to the acoustic measurements, the transformed delta measure is employed as a variability index of individual tokens. Results confirm the durational correlation between prevocalic consonants and following vowels. The interaction is revealed in a compensatory pattern, such as longer VOTs followed by shorter vowel durations, in both languages. An asymmetry is found in CV interaction in that the effect of the consonant on vowel duration is greater than the VOT differences induced by the vowel. Prosodic effects are found such that the acoustic difference between the contrastive segments is enhanced under focal accent, supporting the paradigmatic strengthening effect. Positional variation, however, does not show any systematic effects on the variations of the measured acoustic quantities. Overall vowel duration and syllable duration are longer in English tokens but involve less variability across the prosodic variations. The constancy of syllable duration, therefore, is not found to be more strongly sustained in Korean CDS. The stylistic variation is discussed in relation to the listener under linguistic development in CDS.

Pragmatics and Translation in the Use of English Words in Banner Advertising on Portal Sites

  • Ban, Hyun;Noh, Bo Kyung
    • International Journal of Advanced Culture Technology
    • /
    • v.9 no.3
    • /
    • pp.259-264
    • /
    • 2021
  • In modern society, online communication plays a vital role in the social interaction of communities. It is common for online users to see display advertisements while surfing the Net. Specifically, most web banners displayed on portal sites consist of words, phrases, and sentences. Considering that the primary purpose of advertising is persuasion, advertisements such as web banners are an exemplary case of the interaction among pragmatics, translation and advertising, because the linguistic expressions employed in the banners represent their pragmatic use, leading to persuasion and functioning as a communicative tool for smooth communication between source text producers (advertisers) and the target audience (online users). This can be part of the so-called translation process. In particular, we can easily witness the use of English words in web banners. Thus, this paper looks at web banners displayed on four major portal sites (Naver, Daum, Nate, and Zum), giving special attention to the content of the web banners as well as the use of English words. As a result, we found that the frequencies of English words on each portal site were higher when the advertised products targeted young online users, whereas the frequencies were lower when the target users were older. The finding supports the pragmatic perspective that linguistic expressions are understood in social contexts and shows the so-called translation process, which involves a shift from the semantic meaning of words to their pragmatic use. Finally, we conclude that the interaction is possible when we have a framework in which translation, pragmatics, and advertising are all communicative components of social interaction within social contexts.

Linguistic Turn and the Education of Liberal Arts (언어적 전환과 인문교육)

  • Paik, Tohyung
    • The Journal of the Convergence on Culture Technology
    • /
    • v.6 no.1
    • /
    • pp.279-288
    • /
    • 2020
  • In this paper, I criticize the bias that the liberal arts are not practical. First, I introduce holistic models of knowledge and constructivism in education. Second, I suggest the 'linguistic turn' as another noticeable phenomenon. Language is an element of culture, but a privileged one, because culture in itself is linguistic. So culture is a sort of text. The liberal arts or humanities, which deal with texts, contexts and languages, can therefore play an important role in reading, understanding and constructing a world. So the rediscovery of the traditional model of liberal arts education, such as the processes of graduate studies in the liberal arts or humanities, is relevant to a new 'practical' model of reading and leading recent social transformations. An important point is academic contribution. The current situation, called the 'Knowledge-Based Society', also supports my point in another way. Rapidly changing knowledge requires an education that builds problem-solving ability by unifying and reconstructing knowledge, supported by constructivism in education, rather than a cramming system of education.