• Title/Summary/Keyword: linguistic model

Performance Comparison of Korean Dialect Classification Models Based on Acoustic Features

  • Kim, Young Kook;Kim, Myung Ho
    • Journal of the Korea Society of Computer and Information / v.26 no.10 / pp.37-43 / 2021
  • Using the acoustic features of speech, important social and linguistic information about the speaker can be obtained, and one of the key features is the dialect. A speaker's use of a dialect is a major barrier to interaction with a computer. Dialects can be distinguished at various levels such as phonemes, syllables, words, phrases, and sentences, but it is difficult to distinguish dialects by identifying them one by one. Therefore, in this paper, we propose a lightweight Korean dialect classification model using only MFCC among the features of speech data. We study the optimal method to utilize MFCC features through Korean conversational voice data, and compare the classification performance of five Korean dialects in Gyeonggi/Seoul, Gangwon, Chungcheong, Jeolla, and Gyeongsang in eight machine learning and deep learning classification models. The performance of most classification models was improved by normalizing the MFCC, and the accuracy was improved by 1.07% and F1-score by 2.04% compared to the best performance of the classification model before normalizing the MFCC.
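
The normalization step mentioned in the abstract above can be illustrated with a minimal sketch. The abstract does not say which normalization was applied, so the per-coefficient z-scoring, the function name `normalize_mfcc`, and the toy input values below are all assumptions made for illustration.

```python
from statistics import mean, pstdev

def normalize_mfcc(frames):
    """Z-score each MFCC coefficient across frames (an assumed normalization)."""
    columns = list(zip(*frames))  # one tuple of values per coefficient
    mus = [mean(c) for c in columns]
    # Fall back to 1.0 when a coefficient is constant, to avoid dividing by zero.
    sigmas = [pstdev(c) or 1.0 for c in columns]
    return [[(x - m) / s for x, m, s in zip(frame, mus, sigmas)]
            for frame in frames]

# Toy input: 3 frames x 2 coefficients (hypothetical values, not real MFCCs).
frames = [[1.0, 10.0], [2.0, 10.0], [3.0, 10.0]]
normalized = normalize_mfcc(frames)
```

After this step each coefficient has zero mean across frames, which is one common way to put features on a comparable scale before feeding them to a classifier.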

Language Variation and World Englishes (언어변이와 세계영어들)

  • Kim, Yangsoon
    • The Journal of the Convergence on Culture Technology / v.7 no.1 / pp.234-239 / 2021
  • The purpose of this paper is to find out the nature of language variation by exploring the ways of the progress of the language variation that produces all English-lects, i.e., the World Englishes. The study of language variation in linguistics is a hybrid enterprise, so the study of World Englishes has led to the recognition of a highly diverse set of all English-lects, encompassing regional dialects, sociolects, ethnolects and (post-)colonial dialects of World Englishes. In this paper, we propose a hybrid language variation model with three interacting factors of social distancing, on/off-contact, and linguistic diversity to examine the characteristics of language variation. In the context of World Englishes, the social distance is typically low in terms of their local location (country/speech) for local purposes. The social distance also varies based on online/offline communication modes and other social factors like gender, age and ethnic groups, resulting in all English-lects. To clarify the nature of World Englishes, the core Englishes, BrE, AmE and CanE are discussed here.

Sentence Filtering Dataset Construction Method about Web Corpus (웹 말뭉치에 대한 문장 필터링 데이터 셋 구축 방법)

  • Nam, Chung-Hyeon;Jang, Kyung-Sik
    • Journal of the Korea Institute of Information and Communication Engineering / v.25 no.11 / pp.1505-1511 / 2021
  • Pretrained models that perform well on various natural language processing tasks have the advantage of learning the linguistic patterns of sentences from a large corpus during training, allowing each token in the input sentence to be represented with an appropriate feature vector. One way to construct the corpus required for training a pretrained model is to collect it with a web crawler. However, sentences on the web follow various patterns and may contain unnecessary words in part or all of the sentence. In this paper, we propose a dataset construction method for filtering sentences containing unnecessary words, using neural network models, for corpora collected from the web. As a result, we construct a dataset containing a total of 2,330 sentences. We also evaluated the performance of neural network models on the constructed dataset, and the BERT model showed the highest performance with an accuracy of 93.75%.

Speech Animation Synthesis based on a Korean Co-articulation Model (한국어 동시조음 모델에 기반한 스피치 애니메이션 생성)

  • Jang, Minjung;Jung, Sunjin;Noh, Junyong
    • Journal of the Korea Computer Graphics Society / v.26 no.3 / pp.49-59 / 2020
  • In this paper, we propose a speech animation synthesis method specialized for Korean, based on a rule-based co-articulation model. Speech animation has been widely used in the cultural industry, such as movies, animations, and games that require natural and realistic motion. However, because techniques for audio-driven speech animation have been developed mainly for English, the animation results for domestic content are often visually very unnatural. For example, a voice actor's dubbing is played with no mouth motion at all or, at best, with an unsynchronized loop of simple mouth shapes. Although there are language-independent speech animation models, they are not specialized for Korean and have yet to reach the quality needed for domestic content production. Therefore, we propose a natural speech animation synthesis method, driven by input audio and text, that reflects the linguistic characteristics of Korean. Reflecting the fact that vowels mostly determine the mouth shape in Korean, we define a co-articulation model that separates the lips and the tongue, which solves the earlier problems of lip distortion and the occasional loss of some phoneme characteristics. Our model also reflects differences in prosodic features for improved dynamics in speech animation. Through user studies, we verify that the proposed model can synthesize natural speech animation.

COVID-19-related Korean Fake News Detection Using Occurrence Frequencies of Parts of Speech (품사별 출현 빈도를 활용한 코로나19 관련 한국어 가짜뉴스 탐지)

  • Jihyeok Kim;Hyunchul Ahn
    • Journal of Intelligence and Information Systems / v.29 no.2 / pp.267-283 / 2023
  • The COVID-19 pandemic, which began in December 2019 and continues to this day, has left the public needing information to help them cope with the pandemic. However, COVID-19-related fake news on social media seriously threatens the public's health. In particular, if fake news related to COVID-19 is massively spread with similar content, the time required for verification to determine whether it is genuine or fake will be prolonged, posing a severe threat to our society. In response, academics have been actively researching intelligent models that can quickly detect COVID-19-related fake news. Still, the data used in most of the existing studies are in English, and studies on Korean fake news detection are scarce. In this study, we collect data on COVID-19-related fake news written in Korean that is spread on social media and propose an intelligent fake news detection model using it. The proposed model utilizes the frequency information of parts of speech, one of the linguistic characteristics, to improve the prediction performance of the fake news detection model based on Doc2Vec, a document embedding technique mainly used in prior studies. The empirical analysis shows that the proposed model can more accurately identify Korean COVID-19-related fake news by increasing the recall and F1 score compared to the comparison model.
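
One way to picture the feature design described in the abstract above is to append part-of-speech frequencies to a document embedding. The sketch below is only an assumption about how such features might be combined: the tag set `POS_TAGS`, the helper names, the use of relative frequencies, and the concatenation itself are hypothetical, and the input is assumed to be already tagged by some Korean morphological analyzer. The two-element document vector stands in for a Doc2Vec embedding.

```python
from collections import Counter

# Hypothetical tag set; a real Korean tagger would use its own, larger inventory.
POS_TAGS = ["NOUN", "VERB", "ADJ", "ADV"]

def pos_frequency_features(tagged_tokens):
    """Relative frequency of each tag in POS_TAGS for one document."""
    counts = Counter(tag for _, tag in tagged_tokens)
    total = sum(counts[t] for t in POS_TAGS) or 1  # avoid division by zero
    return [counts[t] / total for t in POS_TAGS]

def combine(doc_vector, tagged_tokens):
    """Concatenate a document embedding with POS frequency features."""
    return list(doc_vector) + pos_frequency_features(tagged_tokens)

# Toy document: (token, tag) pairs and a stand-in 2-dimensional embedding.
tagged = [("vaccine", "NOUN"), ("spreads", "VERB"), ("fast", "ADV"), ("virus", "NOUN")]
features = combine([0.1, -0.2], tagged)
```

The combined vector could then be passed to any standard classifier; which classifier the study used is not stated in the abstract.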

Analysis of High School Students' Conceptual Change in Model-Based Instruction for Blood Circulation (혈액 순환 모형 기반 수업에서 고등학생들의 개념 변화 분석)

  • Kim, Mi-Young;Kim, Heui-Baik
    • Journal of The Korean Association For Science Education / v.27 no.5 / pp.379-393 / 2007
  • The purpose of this article is to analyze the conceptual change of nine 11th graders after model-based instruction on blood circulation, using a multidimensional framework, and to draw implications for teaching strategies that improve conceptual understanding. The model-based instruction consisted of 4 periods: (1) an introduction to induce students' interest using an episode from the history of science on blood circulation, (2) a vivisectional experiment on rats, (3) visual-linguistic model instruction using a videotape of a heartbeat, and (4) a modeling activity on the path of blood flow. Based on the data from the pre-test, post-test, and interviews, we classified students' models of the path of blood flow, and investigated their ontological features and the conceptual status of blood circulation. Most students could describe the path of blood flow and the changes of substances in blood precisely after the instruction. However, the modeling activity was not sufficient to improve students' understanding of the mechanisms of blood distribution throughout various organs and of the material exchange between blood and tissues. From the interviews with the 9 students, we obtained informative results about which conceptual status elements helped, hindered, or were not used in students' understanding. It was also found that the conceptual status of students depended on the ontological categories into which their conceptions of blood circulation fell. The results of this study can help in designing effective teaching strategies for understanding concepts in the equilibrium category.

Risk Analysis for the Rotorcraft Landing System Using Comparative Models Based on Fuzzy (퍼지 기반 다양한 모델을 이용한 회전익 항공기 착륙장치의 위험 우선순위 평가)

  • Na, Seong Hyeon;Lee, Gwang Eun;Koo, Jeong Mo
    • Journal of the Korean Society of Safety / v.36 no.2 / pp.49-57 / 2021
  • In the case of military supplies, any potential failures and their causes must be considered. This study aims to examine the failure modes of a rotorcraft landing system to identify the priority items. Failure mode and effects analysis (FMEA) is applied to the rotorcraft landing system. In general, FMEA is used to evaluate reliability in engineering fields. Three elements, specifically severity, occurrence, and detectability, are used to evaluate the failure modes. The risk priority number (RPN) is obtained by multiplying the scores or risk levels for severity, occurrence, and detectability. In this study, different weights of the three elements are considered for the RPN assessment to implement the FMEA. Furthermore, the FMEA is implemented using a fuzzy rule base, a similarity aggregation model (SAM), and a grey theory model (GTM) to perform a comparative analysis. The same input data are used for all models to enable a fair comparison. The FMEA is applied to military supplies by considering methodological issues. In general, fuzzy theory is based on a hypothesis regarding the likelihood of the conversion of a crisp value to a fuzzy input. Fuzzy FMEA is the basic method to obtain the fuzzy RPN. The three elements of the FMEA are expressed as five linguistic terms. The membership functions, as triangular fuzzy sets, are the simplest models defined by the three elements. In addition, a fuzzy set is described using a membership function mapping the elements to the interval between 0 and 1. The fuzzy rule base is designed to identify the failure modes according to expert knowledge. The IF-THEN criterion of the fuzzy rule base is formulated to convert a fuzzy input into a fuzzy output. The total number of rules in the fuzzy rule base is 125. The SAM expresses the judgment corresponding to the individual experiences of the experts performing FMEA as weights.
Implementing the SAM is of significance when operating fuzzy sets regarding the expert opinion and can confirm the concurrence of expert opinion. The GTM can perform defuzzification to obtain a crisp value from a fuzzy membership function and determine the priorities by considering the degree of relation and the form of a matrix and weights for the severity, occurrence, and detectability. The proposed models prioritize the failure modes of the rotorcraft landing system. The conventional FMEA and fuzzy rule base can set the same priorities. SAM and GTM can set different priorities with objectivity through weight setting.
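
A few of the quantities named in the abstract above can be sketched in code: the conventional RPN as a product of three scores, a triangular membership function mapping a value into the interval from 0 to 1, and a rule count of 125, which is what results if each of the three elements takes one of five linguistic terms. This is a minimal illustration under assumptions, not the study's implementation: the term labels, the function names, and the example scores are hypothetical, and the weighting, SAM, and GTM steps are not modeled.

```python
from itertools import product

def rpn(severity, occurrence, detectability):
    """Conventional (unweighted) risk priority number."""
    return severity * occurrence * detectability

def triangular(x, a, b, c):
    """Membership degree of x in a triangular fuzzy set with feet a, c and peak b."""
    if x <= a or x >= c:
        return 0.0
    if x <= b:
        return (x - a) / (b - a)
    return (c - x) / (c - b)

# Five linguistic terms per element (labels are hypothetical).
TERMS = ["very low", "low", "medium", "high", "very high"]
# One rule antecedent per combination of terms for the three elements.
rule_antecedents = list(product(TERMS, repeat=3))

print(rpn(7, 4, 3))            # 84
print(len(rule_antecedents))   # 125
```

For example, `triangular(4.0, 3.0, 5.0, 7.0)` evaluates to 0.5, halfway up the rising edge of a set that peaks at 5.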

The Interaction Effect of Foreign Model Attractiveness and Foreign Language Usage (외국인 모델의 매력도와 외국어 사용의 상호작용 효과)

  • Lee, Ji-Hyun;Lee, Dong-Il
    • Journal of Global Scholars of Marketing Science / v.17 no.3 / pp.61-81 / 2007
  • Recently, the use of foreign models and foreign language in advertising has become a general trend in Korea, even though its effect is not well understood. Most previous research shows rather the opposite effect, claiming that marketing communication is more effective when there is higher congruity between the marketing communication and the consumer's cultural values. However, the introduction of global culture through the expansion of new media such as the Internet and cable television means that congruity is no longer necessarily the best marketing strategy. In addition, highly attractive models are commonly used in advertising to increase its effect. However, recent studies show that targeted female audiences tend to compare themselves to highly attractive models and experience negative sentiment. Bower (2001) demonstrated the difference between 'comparers' and 'noncomparers' when women face highly attractive models. The results show that a comparer, who intends to compare a highly attractive model (HAM) with herself, shows significantly negative effects on model expertise, product argument, product evaluation, and buying intention. Therefore, HAM is not always a good choice, and model attractiveness plays a role in the processing of other cues or in changing the advertising effect that results from processing other cues. The purpose of this study is to investigate the effect of foreign language use on the audience's advertising response with regard to model attractiveness. For the empirical study, virtual advertisements using foreign models (HAM, NAM) and brand names and slogans (Korean, English) were used as stimuli. The numbers of respondents for each stimulus were 75 ('HAM-Korean'), 75 ('NAM-Korean'), 66 ('HAM-English'), and 66 ('NAM-English'), respectively. To establish the effect of marketing communication, attitude toward media (AM), attitude toward product (AP), targetedness (TD), overall quality (OQ), and purchase intention (PI) were measured on a 7-point Likert scale.
The manipulation was verified by checking the difference between the HAM attractiveness assessment (m=3.27) and the NAM attractiveness assessment (m=5.12). The mean difference was statistically significant (p<.05). As a result, all consequences changed significantly with model attractiveness, and overall quality evaluation (OQ) changed significantly with language. The interaction effect of model attractiveness and language was significant for attitude toward the product (AP) and purchase intention (PI). To analyze the difference, the mean values and standard deviations of the consequences were compared. The result was more positive for all consequences when model attractiveness was high. For the language effect, the assessment of OQ was more positive when English was used. Considering model attractiveness and language simultaneously, HAM-Korean was more positive for AP and PI, and NAM-English was more positive for AP and PI. In other words, the interaction effect between model attractiveness and language was confirmed. As mentioned above, the use of foreign models and foreign language in advertising has been explained by the cultural match-up hypothesis (Leclerc et al. 1994), which posits a culture-of-origin effect. In other words, using the language of the foreign model's own culture in advertising can produce a positive assessment of OQ. But this effect was moderated by model attractiveness. When model attractiveness was low, the use of English made PI high because of the effect of foreign language, which supported the cultural match-up hypothesis. When model attractiveness was high, the use of Korean made AP and PI high because the effect of foreign language was diluted. It is generally held that visual cues are processed before linguistic cues (Holbrook and Moore, 1981; Sholl et al., 1995).
Therefore, when consumers were faced with HAM, so much perception was already consumed in processing visual cues that their native language, Korean, became strongly and positively connected with the advertising concept. By contrast, when consumers were faced with NAM, less perception was consumed than with HAM, so English carried a cultural halo effect that had a more positive influence. Therefore, when foreign models are employed in advertising, the language must be carefully selected according to the level of model attractiveness.

Validity and Reliability of the Korean Version of the Menopause-Specific Quality of Life (한국어판 폐경 특이형 삶의 질 측정도구의 신뢰도와 타당도 검증)

  • Park, Jin-Hee;Bae, Sun Hyoung;Jung, Young-Mi
    • Journal of Korean Academy of Nursing / v.50 no.3 / pp.487-500 / 2020
  • Purpose: This study aimed to evaluate the validity and reliability of the Korean version of the Menopause-Specific Quality of Life (MENQOL). Methods: The MENQOL was translated into Korean according to the algorithm of the linguistic validation process. A total of 308 menopausal women were recruited and assessed using the Korean version of MENQOL (MENQOL-K), the World Health Organization Quality of Life Brief Version (WHOQOL-BREF), and the Center for Epidemiological Studies Depression Scale (CES-D-K). To estimate reliability, internal consistency reliability coefficients were calculated. Validity was evaluated through criterion validity and construct validity with confirmatory factor analyses using SPSS 23.0 and AMOS 25.0 software. Results: In item analyses, the "increased facial hair" symptom was excluded because of its low contribution to the MENQOL-K. The confirmatory factor analysis supported good fit and reliable scores for the MENQOL-K model, and the four-factor structure was validated (χ²=553.28, p<.001, NC=1.84, RMSEA=.05, AGFI=.85, AIC=765.28). The MENQOL-K consists of 28 items in 4 domains, including vasomotor (3 items), psychosocial (7 items), physical (15 items), and sexual subscales (3 items). There was acceptable criterion validity, with a moderately significant correlation between MENQOL-K and WHOQOL-BREF. The Cronbach's α for the 4 subscales ranged from .80 to .93. Conclusion: The MENQOL-K is a valid and reliable scale to measure condition-specific quality of life for perimenopausal and postmenopausal women. It can be used to assess the impact of menopausal symptoms on the quality of life of Korean women in clinical trials.

An Alignment Model for Extracting English-Korean Translations of Term Constituents (영-한 조어단위 대역쌍 추출을 위한 조어단위 정렬 모델)

  • Oh Jong-Hoon;Huang Jin-Xia;Choi Key-Sun
    • Journal of KIISE:Software and Applications / v.32 no.4 / pp.300-311 / 2005
  • Terms are the linguistic realization of technical concepts. Term constituents are important elements used for representing a concept. Since many new terms are created by modifying or combining existing constituents, analyzing term constituents is important for understanding the concept of a term. In other words, term constituents offer clues for understanding the concept of terms. However, there are several difficulties in matching concept units to term constituents, such as mismatches between a term constituent and a concept unit, homonymy of term constituents, and synonymy of term constituents. To solve them, it is necessary to recognize the concept units of term constituents. In this paper, we define an English term constituent as the concept unit and use an alignment algorithm between English-Korean term constituents in order to recognize concept units of term constituents. With our alignment algorithm, we recognize Korean term constituents corresponding to an English term constituent with about 93% precision.