• Title/Summary/Keyword: Word translation

Search Result 146, Processing Time 0.021 seconds

Opinions on the Turks' Turkic Translation Activities in the Period of Taspar Qagan

  • YILDIRIM, KURSAT
    • Acta Via Serica
    • /
    • v.3 no.2
    • /
    • pp.151-160
    • /
    • 2018
  • There is a variety of opinions about the first translation activities within the Turkic Empire. It is widely believed that some Buddhist sutras were translated into the Turkic language in the period of Taspar Qagan (572-581). This theory is based on certain arguments: Some Turks practiced Buddhism, Buddhist monks translated sutras in the center of the Turkic Empire, Taspar brought sutras from China and had them translated, and the monarch of Northern Qi had a sutra translated and sent to Taspar. However, in my opinion, these arguments lack credibility. This article, which is based on primary Chinese sources, will question the likelihood of such translation activities having occurred. Some Chinese records for these claims exist: Da Tang Nei Dian Lu (大唐內典錄) and Xu Gao Seng Chuan (續高僧傳) by the Buddhist monk Jinagupta and the records of Hui Lin in Sui Shu (隋書) and Wen Xian Tong Kao (文獻通考). These are known as "primary sources." Secondary sources, namely contemporary history and language studies, such as those in books and articles, must be based on primary sources. It can be seen that claims relating to the first Turkic translation activities at the time of Taspar are mainly derived from secondary sources, and that the arguments in these secondary sources vary. Sometimes researchers make suppositions on the existence of information that is not referred to in primary sources. However, this is not normal practice. If a researcher relies on unknowns for the evidence of information existing, it can cause false information, ideas and anachronisms to be created. It is important that primary sources, such as the Chinese sources mentioned above, be translated correctly in language and history studies. If only a word is mistranslated, very different results may occur. Mistranslating or misinterpreting a primary source allows conclusions to be reached that are not supported by dissemination of information from primary sources. This can mislead experts and result in information that is not correct being considered as being true. As well as helping to prevent such misinterpretations occurring, another aim of this paper is to question the interpretations of the first Turkic translations in contemporary studies on history and language. The origin of such assessments will be explored and the validity of that information will be examined.

Development of Japanese to Korean Machine Translation System ATOM Using Personal Computer I - Dictionary Construction and Morphological Analysis - (PC를 이용한 일$\cdot$한 번역 시스템 ATOM의 개발에 관한 연구 ( I ) - 구문해석과 생성과 사전 구성과 형태소 해석을 중심으로 -)

  • Kim, Young-Sum;Kim, Han-Woo;Choi, Byung-Uk
    • Journal of the Korean Institute of Telematics and Electronics
    • /
    • v.25 no.10
    • /
    • pp.1183-1192
    • /
    • 1988
  • In this paper, we describe heuristic information-added morphological dictionary and connection table, and automatic MUNJEUL separation process on the basis of least cost method for efficient morphological analysis. It is simplified the composition of connection and inflective word information by mutually interconnect conjugation table with connection tables. As a result, the applicability of system is increased. Translation dictionary consists of analysis and generation part and, increase the applicability by describing frequently using termination phrase which is extracted statistically as idiom and the procedure directly on the dictionary for the efficiency of analysis process and more natural generation of translation sentence.

  • PDF

A Study on the Natural Language Generation by Machine Translation (영한 기계번역의 자연어 생성 연구)

  • Hong Sung-Ryong
    • Journal of Digital Contents Society
    • /
    • v.6 no.1
    • /
    • pp.89-94
    • /
    • 2005
  • In machine translation the goal of natural language generation is to produce an target sentence transmitting the meaning of source sentence by using an parsing tree of source sentence and target expressions. It provides generator with linguistic structures, word mapping, part-of-speech, lexical information. The purpose of this study is to research the Korean Characteristics which could be used for the establishment of an algorism in speech recognition and composite sound. This is a part of realization for the plan of automatic machine translation. The stage of MT is divided into the level of morphemic, semantic analysis and syntactic construction.

  • PDF

Study on the grammatical characteristics and fallacy of translation in the sentences of Donguibogam by Heo Jun - Focused on Tangaekpean(湯液篇) in Donguibogam "東醫寶鑑" - ("동의보감(東醫寶鑑)"에 쓰여진 허준(許浚) 문장(文章)의 문법적(文法的) 특성(特性)과 번역서(飜譯書)의 오류(誤謬) - "탕액편(湯液篇)"을 중심(中心)으로 -)

  • Kim, Yong-Han;Kim, Eun-Ha
    • Journal of Korean Medical classics
    • /
    • v.24 no.6
    • /
    • pp.111-124
    • /
    • 2011
  • The objectives of this study are to look into the grammatical characteristics and find misinterpretations on the translation books. 1. Sentences characteristics 1) Lots of ellipses of grammatical parts can be found such as conjunction, postposition, particle, Coverb, and focus on the parts which has practical meaning such as noun, pronoun, verb, adjective in the sentences. 2) Some predicates are skipped in the later phrases which has contradictive concepts against them of former phrases. 3) Pure Korean word order is exposed especially in complement. 2. Translation fallacy 1) There is fallacy in the sentences omitted paratactic conjunction as follows (1) mistranslation based on the wrong concept of the context between equal relation and subordinate relation. (2) failure on setting up the period, (3) misunderstanding equal relation as cause relation. 2) Some singular phrases, which are condition relation, were analyzed as plural phrases in the sentences omitted connection conjunction. 3) Ellipses of postposition obstruct understanding the difference between modifier and modificand in some sentences. 4) Some cause relation phrases were translated as equality relation due to lack of recognition of ellipsis of coverbs.

Study of Contents Localization Case on the Game 'Paper, Please': Based on the Korean and North Korean Translations (게임 'Paper, Please'의 번역을 통한 콘텐츠 현지화 사례 연구: 한국어와 문화어 번역의 차이를 중심으로)

  • Won, Ho-Hyeuk;Gu, Bon-Hyeok;Kim, Hyoung-Youb
    • Journal of Korea Game Society
    • /
    • v.19 no.2
    • /
    • pp.145-160
    • /
    • 2019
  • In this research, we attempt to suggest the differences between Korean translation and the North Korean translation of the game 'Paper, Please'; moreover, we will consider about the effect of language and image on localization through this. North Korean language and cultural contents in 'Paper, Please' are evaluated well by many people that they show real life of North Korea even though there are some errors like loanword translations and using anachronic symbol, 'Kaksital' as secret organization. Through the research, we could know that people could concentrate on cultural contents by images and motives without critical errors so have fun.

A Hybrid Sentence Alignment Method for Building a Korean-English Parallel Corpus (한영 병렬 코퍼스 구축을 위한 하이브리드 기반 문장 자동 정렬 방법)

  • Park, Jung-Yeul;Cha, Jeong-Won
    • MALSORI
    • /
    • v.68
    • /
    • pp.95-114
    • /
    • 2008
  • The recent growing popularity of statistical methods in machine translation requires much more large parallel corpora. A Korean-English parallel corpus, however, is not yet enoughly available, little research on this subject is being conducted. In this paper we present a hybrid method of aligning sentences for Korean-English parallel corpora. We use bilingual news wire web pages, reading comprehension materials for English learners, computer-related technical documents and help files of localized software for building a Korean-English parallel corpus. Our hybrid method combines sentence-length based and word-correspondence based methods. We show the results of experimentation and evaluate them. Alignment results from using a full translation model are very encouraging, especially when we apply alignment results to an SMT system: 0.66% for BLEU score and 9.94% for NIST score improvement compared to the previous method.

  • PDF

Modifiers and Compound Sentences Processing of a Korean-Japanese Machine Translation System (한국어-일본어 기계번역 시스템의 수식어 처리와 중문처리)

  • Joo, I.S.;Paik, M.H.;Jin, J.H.;Lim, S.T.;Lim, I.C.
    • Proceedings of the KIEE Conference
    • /
    • 1987.07b
    • /
    • pp.1046-1049
    • /
    • 1987
  • This paper proposes a Korean-Japanese Machine Translation System that processes unregistered words, modifiers and compound sentences. In mophological analysis, the unregistered words are processed by using unregistered word processing algorithm. The modifiers are processed by consulting noun-attributes and grammar rules. The compound sentence processing algorithm recognizes whether the sentence that includes commas is compound sentence or not. This system performs on IBM-PC/AT DOS using Prolog-1.

  • PDF

An Approach to Semantic Mapping using Product Ontology for CPC Environment (CPC 환경을 위한 Product 온톨로지 기반 의미 공유 접근법)

  • Kim K.-Y.;Suh H.-W.
    • Korean Journal of Computational Design and Engineering
    • /
    • v.9 no.3
    • /
    • pp.192-202
    • /
    • 2004
  • This paper introduces an approach to semantic mapping using Product ontology for CPC environment. In CPC environment, it is necessary that the participants in a product life cycle should share the same understanding about the semantic of product terms. For example, they should know that although 'COMPONENT' and 'ITEM' are different word-expressions, they could have the same meaning. In order to handle such terms in the information system, it is desirable that the system automatically recognizes that the terms have the same semantics. Serving this purpose, we described an ontology design methodology using first order logic, knowledge interchange format, and knowledge engineering process. In our approach, we investigated domain knowledge of the Bill Of Material, and then designed Product ontology of it. Based on the ontology, we described syntactic translation, semantic translation, and semantic mapping procedure with an example.

A Study of Korean Semantic Role Labeling using Word Sense (의미 정보를 이용한 한국어 의미역 인식 연구)

  • Lim, Soojong;Kim, Hyunki
    • Annual Conference on Human and Language Technology
    • /
    • 2015.10a
    • /
    • pp.18-22
    • /
    • 2015
  • 기계학습 기반의 의미역 인식에서 주로 어휘, 구문 정보가 자질로 주로 쓰이지만, 의미 정보를 분석하는 의미역 인식은 단어의 의미 정보 또한 매우 주요한 정보이다. 그러나, 기존 연구에서는 의미 정보를 활용할 수 있는 방법이 제한되어 있기 때문에, 소수의 연구만 진행되었다. 본 논문에서는 동형이의어 수준의 의미 애매성 해소 기술, 고유 명사에 대한 개체명 인식 기술, 의미 정보에 기반한 필터링, 유의어 사전을 이용한 클러스터 및 기존 프레임 정보를 확장하는 방법을 제안한다. 제안하는 방법은 기존 연구 대비 뉴스 도메인인 Korean Propbank는 3.14, 위키피디아 문서 기반의 WiseQA 평가셋인 GS 3.0에서는 6.57의 성능 향상을 보였다.

  • PDF

Improving The Performance of Triple Generation Based on Distant Supervision By Using Semantic Similarity (의미 유사도를 활용한 Distant Supervision 기반의 트리플 생성 성능 향상)

  • Yoon, Hee-Geun;Choi, Su Jeong;Park, Seong-Bae
    • Journal of KIISE
    • /
    • v.43 no.6
    • /
    • pp.653-661
    • /
    • 2016
  • The existing pattern-based triple generation systems based on distant supervision could be flawed by assumption of distant supervision. For resolving flaw from an excessive assumption, statistics information has been commonly used for measuring confidence of patterns in previous studies. In this study, we proposed a more accurate confidence measure based on semantic similarity between patterns and properties. Unsupervised learning method, word embedding and WordNet-based similarity measures were adopted for learning meaning of words and measuring semantic similarity. For resolving language discordance between patterns and properties, we adopted CCA for aligning bilingual word embedding models and a translation-based approach for a WordNet-based measure. The results of our experiments indicated that the accuracy of triples that are filtered by the semantic similarity-based confidence measure was 16% higher than that of the statistics-based approach. These results suggested that semantic similarity-based confidence measure is more effective than statistics-based approach for generating high quality triples.