• 제목/요약/키워드: Cross Language

검색결과 453건 처리시간 0.024초

Zero-Shot 기반 기계번역 품질 예측 연구 (Study on Zero-shot based Quality Estimation)

  • 어수경;박찬준;서재형;문현석;임희석
    • 한국융합학회논문지
    • /
    • 제12권11호
    • /
    • pp.35-43
    • /
    • 2021
  • 최근 다언어모델(Cross-lingual language model)을 활용하여 한 번도 보지 못한 특정 언어의 하위 태스크를 수행하는 제로샷 교차언어 전이(Zero-shot cross-lingual transfer)에 대한 관심이 증가하고 있다. 본 논문은 기계번역 품질 예측(Quality Estimation, QE)을 학습하기 위한 데이터 구축적 측면에서의 한계점을 지적하고, 데이터를 구축하기 어려운 상황에서도 QE를 수행할 수 있도록 제로샷 교차언어 전이를 수행한다. QE에서 제로샷을 다룬 연구는 드물며, 본 논문에서는 교차언어모델을 활용하여 영어-독일어 QE 데이터에 대해 미세조정을 실시한 후 다른 언어쌍으로의 제로샷 전이를 진행했고 이 과정에서 다양한 다언어모델을 활용하여 비교 연구를 수행했다. 또한 다양한 자원 크기로 구성된 언어쌍에 대해 제로샷 실험을 진행하고 실험 결과에 대해 언어별 언어학적 특성 관점으로의 분석을 수행하였다. 실험결과 multilingual BART와 multillingual BERT에서 가장 높은 성능을 보였으며, 특정 언어쌍에 대해 QE 학습을 전혀 진행하지 않은 상황에서도 QE를 수행할 수 있도록 유도하였다.

중국인 한국어 학습자의 중간언어 연구 - 평균발화길이(MLU)와 어휘적 특성을 중심으로 (A Research on the Interlanguage of Chinese Speaking Korean Language Learners: Focusing on MLU and Characteristics Found in Vocabulary Usage)

  • 김선정;김목아
    • 비교문화연구
    • /
    • 제22권
    • /
    • pp.303-327
    • /
    • 2011
  • This study aims to uncover the learner's language proficiency shown in the writing data of Chinese elementary/intermediate level learners. Language proficiency of the learners acquired by error analysis provides only partial information, and thus this study analyses the interlanguage of Korean learners in terms of 'Mean Length of Utterance, MLU' to discover the overall aspect of learner's language proficiency more symmetrically. The analysis of vocabulary area is to be enforced after generally studying the learner's language development aspect in accordance with MLU-m(orpheme) and MLU-(w)ord found in compositions by Chinese speaking Korean language learners. In terms of MLU, it has been slightly increased as the level of proficiency between elementary level and intermediate level learners; however, the morpheme seemed to be difficult to use, since the difference between Chinese learners and Korean university students has been notably shown. Vocabulary diversity, using aspect for each word class, and using aspect of the predicate are studied for vocabulary area; more various and numerous vocabulary tend to be used as the level of proficiency increases. In terms of predicate use, Chinese learners use less numerous vocabulary types.

드라마 「신조협려(神雕俠侶)」를 활용한 대학 중국어 교육 (Teaching Chinese through Drama to University Students for Language Skills)

  • 최태훈
    • 비교문화연구
    • /
    • 제31권
    • /
    • pp.415-438
    • /
    • 2013
  • This paper explores how to teach Chinese, using multi-media resources such as Chinese dramas and focusing on one of Jin Yong's dramas, The Return of the Condor Heroes. The purpose of this study is to develop teaching methodologies for university students learning Chinese through drama to integrate language skills: enhancing communicative competence and understanding Chinese cultures. First, the overview of previous studies provides several cases of foreign language education using drama. Teaching Chinese through drama can be an integrative education because students can develop their communicative competence as well as understand the cultures of the target language. In other words, the contexts of drama may offer rich sources of the history of China, Han Chinese ethnocentrism, and knowledge of Chinese literature as well as geography. Second, this study applies the principles of Tomlinson (2010) for materials development in language teaching into the case of Chinese drama. It concentrates on Jin Yong's The Return of the Condor Heroes that the author has used in the Chinese language courses for three years. It examines the characteristics of the drama for developing effective ways of teaching and learning Chinese language and culture. Furthermore, it discusses the impact of using drama on changes in students' pervasive perceptions about unnecessity of Chinese classical literature. Third, this paper presents some sample lessons which may help teachers to develop understanding of how to organize lessons through drama. Finally, it illustrates university students' opinions about using drama to learn Chinese.

기계번역을 이용한 교차언어 문서 범주화의 분류 성능 분석 (Classification Performance Analysis of Cross-Language Text Categorization using Machine Translation)

  • 이용구
    • 한국문헌정보학회지
    • /
    • 제43권1호
    • /
    • pp.313-332
    • /
    • 2009
  • 교차언어 문서 범주화(CLTC)는 다른 언어로 된 학습집단을 이용하여 문헌을 자동 분류할 수 있다. 이 연구는 KTSET으로부터 CLTC에 적합한 실험문헌집단을 추출하고, 기계 번역기를 이용하여 가능한 여러 CLTC 방법의 분류 성능을 비교하였다. 분류기는 SVM 분류기를 이용하였다. 실험 결과, CLTC 중에 다국어 학습방법이 가장 좋은 분류 성능을 보였으며, 학습집단 번역방법, 검증집단 번역방법 순으로 분류 성능이 낮아졌다. 하지만 학습집단 번역방법이 기계번역 측면에서 효율적이며, 일반적인 환경에 쉽게 적용할 수 있고, 비교적 분류 성능이 좋아 CLTC 방법 중에서 가장 높은 이용 가능성을 보였다. 한편 CLTC에서 기계번역을 이용하였을 때 번역과정에서 발생하는 자질축소나 주제적 특성이 없는 자질로의 번역으로 인해 성능 저하를 가져왔다.

교차언어 문서검색에서 다국어 온톨로지에 기반한 한영 질의어 변환 (Korean-to-English Query Translation based on Multilingual Ontology in Cross-Language Text Retrieval)

  • 천정훈;최기선
    • 한국정보과학회 언어공학연구회:학술대회논문집(한글 및 한국어 정보처리)
    • /
    • 한국정보과학회언어공학연구회 1999년도 제11회 한글 및 한국어 정보처리 학술대회
    • /
    • pp.43-49
    • /
    • 1999
  • 본 논문에서는 교차언어 문서검색(CLTR: Cross-Language Text Retrieval)에서의 한-영 질의어 변환을 다룬다. 질의어 변환시 영어 대역어 획득과정에서는 다음 두 가지를 고려한다. 첫째, 한국어 질의어를 구성하는 단어가 한가지 개념을 기호화하지만 이에 대응되는 영어 대역어들이 하나 이상인 경우이다. 둘째, 질의어 구성 단어가 둘 이상의 개념들을 기호화하는 다의성을 지닌 경우이다. 전자의 경우는 영어 대역어들이 모두 동일한 개념, 또는 유사한 개념을 나타내므로 그대로 검색에 이용한다 해도 검색 성능을 크게 좌우하지 않지만, 후자의 경우는 모든 개념을 다 검색에 이용하게 되면 정확률(precision)이 크게 떨어지게 된다. 이에 본 연구에서는 개념 선택단계와 선택된 개념의 영어 대역어들에 가중치를 주는 가중치 부가단계로 나누어 질의어 변환을 수행한다. 본 논문의 질의어 변환에서 영어 대역어는 대역사전 대신 다국어 온톨로지인 KAIST 분류어휘표와 한영 음차복원 모듈을 통해 얻어진다.

  • PDF

Culture in language: comparing cultures through words in South Africa

  • Montevecchi, Michela
    • 비교문화연구
    • /
    • 제24권
    • /
    • pp.120-131
    • /
    • 2011
  • South Africa is a multiracial country where different cultures and languages coexist. Culture can be conveyed through language. Language conditioning is also social conditioning, and through words we make sense of our own and others' experience. In this paper I investigate the meaning of two culturally significant words: (English) peace and (African) ubuntu. Data findings will show how L2 speakers of English, when asked to define peace, promptly operate a process of transfer of the meaning from their mother-tongue Xhosa equivalent - uxolo - to its English equivalent. Ubuntu, an African word which encompasses traditional African values, has no counterpart in English. I will also argue how, in the ongoing process of globalisation, English is playing a predominant role in promoting cultural homogenization.

문화에 대한 태도와 학습동기가 사관생도의 영어 성취도에 미치는 영향 (The Effects of Attitudes Toward Culture and Motivation on Military Cadets' English Achievement)

  • 정한기
    • 비교문화연구
    • /
    • 제19권
    • /
    • pp.313-338
    • /
    • 2010
  • Attitudes toward target language culture and motivation have been known as important factors in second or foreign language learning. In this study, cadets' attitudes toward the target language community and culture were investigated to find out any relationships with English achievement. Integrative or instrumental motivation in foreign military context was investigated to determine which motivation was more positively related to English proficiency. Cadets' responses were shown highly positive and internally consistent in most cases. Military cadets' attitudes toward American culture were statistically significant and could predict their English achievement. Even though integrative motivation was statistically significant, correlation with English achievement was low. However, instrumental motivation was not significant. This result implies that attitudes toward culture is an important factor in foreign language learning and integrative-instrumental motivation dichotomy might not be enough to explain specific context like cadets' English learning situation at Korea Army Academy at Youngcheon.

외래어의 표준 발음과 어문 규범 (On the Regulation for Pronunciation of Loanwords in Korean)

  • 이은경
    • 비교문화연구
    • /
    • 제38권
    • /
    • pp.405-431
    • /
    • 2015
  • The purpose of this paper is to investigate how to decide pronunciation of loanwords in Korean language. There has not been a regulation for pronunciation of loanwords in Korean language. Even the dictionary published by the government does not provide any information about the pronunciation of loanwords. In this paper, some actual solutions are suggested for the pronunciation of loanwords. Korean language has Regulations of Standard Korean, Korean Orthography, Regulations on Hangeul Transcriptions on Loanwords and Pronunciation Methods of Standard Korean. These language standards could help to decide pronunciation of loanwords. Some pronunciations which could not be regulated by them must be presented in the standard pronunciation dictionary. For example, glottalization rule of 's' in many loanwords could be presented in the description of each loanword in the dictionary. However the pronunciation of loanwords must be similar to the spelling. If various pronunciations are allowed to one spelling, then people will be so confused by the discrepancy between pronunciation and spelling of loanwords.

표준화된 간호용어체계를 이용한 암환자 간호기록의 분석 (Analysis of nursing records of cancer patients with standardized nursing language systems)

  • 이미순;이병숙
    • 간호행정학회지
    • /
    • 제10권2호
    • /
    • pp.243-254
    • /
    • 2004
  • Purpose: The purpose of this study was cross-mapping unique nursing statements which were identified in the nursing records of patients with six most common cancers in Korea with the standardized nursing languages of NANDA, NIC, NOC and ICNP. Method: The subjects were 72 nursing records which covered 1,502 admission days from August 1, 2003 to June 30, 2003. They were the records of the patients of six most common cancers who were treated at the six 3rd level general hospitals in Busan and Daegu. The unique nursing statements were identified by dividing the statements from the nursing records into the single statements according to their meanings. For cross-mapping, identified unique nursing statements were classified as 'Data(D)' for the subjective, objective data of the patients and the other data such as treatment, admission, discharge, and residence of patient, 'Problem(P)' for nursing problem or diagnosis defined by the nurse's decision, 'Intervention(I)' for nursing intervention for problem solving, and 'Outcome(O)' for patient reaction and results of the provided nursing interventions. Unique nursing statements classified to D, P, I, O were cross-napped by using Microsoft Excel 2000. The statements of D were cross-mapped with ICNP Nursing phenomena, P with NANDA nursing diagnosis and ICNP nursing phenomena, I with NIC and ICNP nursing intervention, and O with NOC and ICNP nursing phenomena Result: The results of this study were as follows. 1. Number of unique nursing statements were 506 in the records of lung cancer patients (18.12%), 480 in stomach cancer(17.19%), 458 in liver cancer(16.40%), 456 in colon cancer (16.33), 457 in breast cancer (16.36%) and 436 in cervix cancer (15.60%). 2. The range of percentage of cross-mapped unique nursing statements with the standardized nursing languages were as follows: P with NANDA nursing diagnosis $87.50{\sim}100%$, I with NIC $59.72{\sim}74.43$, O with NOC $61.05{\sim}72.64%$, and D, P, I and O with ICNP $60.92{\sim}69.95%$. 3. Number of the standardized nursing languages identified in this study were 21(12.66%) from 155 NANDA nursing diagnosis, 76(15.64%) from 486 NIC Nursing interventions, 54(17.47%) from 260 NOC nursing outcomes, and 343(13.03%) from ICNP 2,634. Conclusions: By the results of this study, NANDA, NIC, NOC and ICNP were found that they can be used as the language systems for nursing record and nursing information system for cancer patients. But, further study on the unique nursing statements which were not cross-mapped with the standardized nursing language systems will be necessary.

  • PDF

POS System Integrated with Cross-Platform for Supervision of Restaurant's

  • Alisha Farman;Hira Farman;Saad Ahmed;Anees Ahmed
    • International Journal of Computer Science & Network Security
    • /
    • 제24권8호
    • /
    • pp.205-213
    • /
    • 2024
  • As the Restaurant industry is growing rapidly. The demand for an effortless POS (Point Of Sale) system which can make management easy is increasing. So, the purpose of this study is to digitalise the growing industry of restaurants and its consumers by utilizing cross-platform development. Crossplatform development frameworks provide great opportunities to solve the issues of handling ubiquitous devices with minimum efforts to reduce the cost and increase the stability, accessibility of the end consumers. By availing those opportunities, an Integrated POS system with cross platform is proposed. This integrated cross-platform POS system is originally designed for a single restaurant managed by its own private cloud server. This research solves the 2 major problems. One of them is the accessibility of the system on modern devices without even writing platform-specific code with the help of cross-platform development. This included web, mobile, desktops & tablets at the same time with the same codebase. Second one is handling data consistency with ubiquitous devices with the help of cloud infrastructure to make data safe and consistent more than ever. In the Development of this system Dart will be used as the primary programming language for cross-platform development. On the Cloud server system apache will be used as the web server and PHP as server side language. System will be using MySQL as the database server.