• Title/Summary/Keyword: Lexicography

Search Result 7, Processing Time 0.019 seconds

Types of Lexicographical Information Needs and their Relevance for Information Science

  • Bergenholtz, Henning;Agerbo, Heidi
    • Journal of Information Science Theory and Practice
    • /
    • v.5 no.3
    • /
    • pp.15-30
    • /
    • 2017
  • In some situations, you need information in order to solve a problem that has occurred. In information science, user needs are often described through very specific examples rather than through a classification of situation types in which information needs occur. Furthermore, information science often describes general human needs, typically with a reference to Maslow's classification of needs (1954), instead of actual information needs. Lexicography has also focused on information needs, but has developed a more abstract classification of types of information needs, though (until more recent research into lexicographical functions) with a particular interest in linguistic uncertainties and the lack of knowledge and skills in relation to one or several languages. In this article, we suggest a classification of information needs in which a tripartition has been made according to the different types of situations: communicative needs, cognitive needs, and operative needs. This is a classification that is relevant and useful in general in our modern information society and therefore also relevant for information science, including lexicography.

An Algorithm for Predicting the Relationship between Lemmas and Corpus Size

  • Yang, Dan-Hee;Gomez, Pascual Cantos;Song, Man-Suk
    • ETRI Journal
    • /
    • v.22 no.2
    • /
    • pp.20-31
    • /
    • 2000
  • Much research on natural language processing (NLP), computational linguistics and lexicography has relied and depended on linguistic corpora. In recent years, many organizations around the world have been constructing their own large corporal to achieve corpus representativeness and/or linguistic comprehensiveness. However, there is no reliable guideline as to how large machine readable corpus resources should be compiled to develop practical NLP software and/or complete dictionaries for humans and computational use. In order to shed some new light on this issue, we shall reveal the flaws of several previous researches aiming to predict corpus size, especially those using pure regression or curve-fitting methods. To overcome these flaws, we shall contrive a new mathematical tool: a piecewise curve-fitting algorithm, and next, suggest how to determine the tolerance error of the algorithm for good prediction, using a specific corpus. Finally, we shall illustrate experimentally that the algorithm presented is valid, accurate and very reliable. We are confident that this study can contribute to solving some inherent problems of corpus linguistics, such as corpus predictability, compiling methodology, corpus representativeness and linguistic comprehensiveness.

  • PDF

Corpus Linguistics as Necessary Concept for Korean Lexicography (뭉치 언어학 : 사전 편찬의 필수적 개념)

  • Lee, Sang-Sup
    • Annual Conference on Human and Language Technology
    • /
    • 1989.10a
    • /
    • pp.73-76
    • /
    • 1989
  • 기존 한국어 사전들은 자연 언어로서의 한국어에 대한 실질적 조사 연구에 근거하고 있지 않다는 점에서 치명적 결함을 안고 있다. 최근 유럽에서 개발 응용되고 있는 ${\ulcorner}$뭉치 언어학${\lrcorner}$(corpus linguistics) 은 컴퓨터의 급격한 발전에 힘입어 대규모 용량의 자연언어 자료를 다각적으로 처리할 수 있는 방법을 고안할 수 있게 해주고 있다. 예컨대 영국 버밍엄 대학의 COBUILD 계획은 전혀 새로운 개념의 영어 사전을 편찬하는 데에 성공했다. 한국어 사전의 편찬도 뭉치 언어학적 방법의 도입으로 가능할 것으로 믿어, 필자가 작성한 작은 ${\ulcorner}$뭉치${\lrcorner}$로부터의 실례를 제시한다.

  • PDF

Discriminative Models for Automatic Acquisition of Translation Equivalences

  • Zhang, Chun-Xiang;Li, Sheng;Zhao, Tie-Jun
    • International Journal of Control, Automation, and Systems
    • /
    • v.5 no.1
    • /
    • pp.99-103
    • /
    • 2007
  • Translation equivalence is very important for bilingual lexicography, machine translation system and cross-lingual information retrieval. Extraction of equivalences from bilingual sentence pairs belongs to data mining problem. In this paper, discriminative learning methods are employed to filter translation equivalences. Discriminative features including translation literality, phrase alignment probability, and phrase length ratio are used to evaluate equivalences. 1000 equivalences randomly selected are filtered and then evaluated. Experimental results indicate that its precision is 87.8% and recall is 89.8% for support vector machine.

Parallels between Korean Verbs and Nouns in Subcategorization (한국어 동사와 명사사이의 하위범주화에 있어서의 평행성)

  • 노용균
    • Language and Information
    • /
    • v.1
    • /
    • pp.27-65
    • /
    • 1997
  • Nouns in the Korean language are subcategorized for various frames(called SUBCAT lists) in much the same way as verbs are. Assuming a monostratal grammar and building on analyses of various 'little elements' as clitics, such as the ones given by No(1991), Chae(1995,1996), and Oh(1991), I delineate the ranges of SUBCAT lists for the Korean verbs and nouns and show that the two word-classes have heavily overlapping frames. Twenty five SUBCAT lists are identified for verbs, and twenty four for nouns, of which twenty three find associated lexical items in both. By the way of justification, I offer analyses of noun--verb collocations in terms of the new five-valued syntactic feature COLLOC along with SUBCAT, which subsume 'light verb' constructions. It is hoped that this work will have given clear syntactic underpinnings to those who are concerned with practical lexicography.

  • PDF

A Study on the Microstructure of Terminological Dictionaries (전문용어사전의 미시구조에 관한 연구)

  • 김성진;정동열
    • Journal of the Korean Society for Library and Information Science
    • /
    • v.35 no.1
    • /
    • pp.143-162
    • /
    • 2001
  • As technical intellectual work now has become prolific and various, rapid evolution of technical terms needs to collect and manage them systematically and efficiently. The purpose of this study is to suggest the microstructure for systematic definitions of definiendum in terminological dictionaries. This study reveals specific properties or differences of terminography on the basis of terminology and lexicography, analyzes problems of microstructure in a current terminological dictionary, and, based upon the theoretical and practical analysis, attempts to derive structural and methodical suggestions for microstructure of terminological dictionaries.

  • PDF

An analysis of terminological definitions (전문용어의 정의문 분석)

  • Lee Hae-Yun
    • Koreanishche Zeitschrift fur Deutsche Sprachwissenschaft
    • /
    • v.7
    • /
    • pp.145-163
    • /
    • 2003
  • In this paper, we examined various definitions of terminological definition for the extraction of terminological information from corpora. After we reviewed researches at the lexicography and at the terminology, we introduced the qualia structure of Generative Lexicon (Pustejovsky 1995) for the purpose of analyzing terminological definitions. By means of the qualia structure, we analyzed the definitions which are presented at the terminological dictionaries. As a result, we confirmed that the terminological definitions can be discomposed into 4 subtypes of qualia structure. Based on this examination, we analyzed terminological definitions of articles at a newspaper and showed the usefulness of the qualia structure at the extraction of terminological definitions from the corpora.

  • PDF