• Title/Summary/Keyword: 21st century sejong project

The 21st Century Sejong Project Special Corpus Construction (1998~2007) (21세기 세종 계획 특수자료 구축 분과의 성과 (1998~2007))

  • Seo, Sang-Kyu
    • Annual Conference on Human and Language Technology / 2007.10a / pp.317-322 / 2007
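  • This presentation introduces the achievements of the past ten years of the Special Corpus Construction Division, which operated as part of the <21st Century Sejong Project> (supported by the Ministry of Culture and Tourism and the National Institute of Korean Language, 1998~2007). The Special Corpus Construction Division is responsible for building special corpora such as spoken, parallel, and historical corpora, as well as North Korean and overseas corpora. We describe the overview of the special corpus construction subdivision, the composition of its tasks, the corpus construction results of each sub-task, and the value and characteristics of each corpus.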

A Study of the Research Direction and Trend in the Use of Corpus - Focusing on the Case of Japan - (말뭉치 구축·활용의 흐름과 현재의 동향 - 일본의 사례를 중심으로 -)

  • 윤영민
    • Language Facts and Perspectives / v.45 / pp.35-59 / 2018
  • This paper surveys the present state, concrete contents, and current trends of corpus construction in Japan, as a contribution toward an effective scheme for corpus construction and utilization. In Japan, although dependence on Google is notable, there has been a steady effort to develop high-quality corpora and development tools. On the other hand, Japanese corpora range from those created by individual researchers for their own purposes to those created mainly by universities, research institutes, and national policy institutions, so it is difficult to grasp clearly where they are located and what information they contain. In this survey, the corpora could be classified by field and type into "media corpus", "literary, magazine, web and balanced corpus", "spoken language corpus", "learner corpus", "historical material corpus", and so on. In addition, few dedicated tools had been developed for efficient corpus use and secondary processing, such as "example search", "morphological analysis", and "machine translation". The current trend in Japanese corpus construction is that national policy and research institutes such as NINJAL, JPO, NICT, and ALAGIN, together with companies such as RAKUTEN, are accelerating the preparation of Seed data that can be used in linguistic research and in various fields of the fourth industry.

Current Status of Bioinformatics on Bio-databases and its Tools (바이오데이터베이스와 도구를 활용한 바이오인포매틱스의 동향)

  • Im, Dal-Hyuk;Jeon, Sue-Kyoung;Park, Wan-Kyu;Lee, Young-Joo
    • Journal of Pharmaceutical Investigation / v.34 no.1 / pp.73-79 / 2004
  • The union of information technology and biology presents great possibilities for both applications of bio-information and the development of science and technology. Also, with the advent and growth of bioinformatics, meaningful analysis of bio-information is bringing innovation to the bio-market. Hence, bioinformatics is the most important aspect of establishing a science- and technology-oriented society in the 21st century. This article surveys current trends in bioinformatics. The technological development of bioinformatics that accompanies the rapid growth of the bio-industry means that, using bioinformatics, a biologist can process and store enormous amounts of data, such as data from the current Human Genome Project and future data in the field of biology. We have mainly looked at the trends in bio-information, the databases and mining tools that are generally used, and strategies and directions for the future.

Aspects of Language Use in Newspaper Articles: A Corpus Linguistic Perspective (신문 기사의 언어 사용 양상: 코퍼스언어학적 접근)

  • Song, Kyung-Hwa;Kang, Beom-Mo
    • Korean Journal of Cognitive Science / v.17 no.4 / pp.255-269 / 2006
  • The purpose of this study is to analyze newspaper articles from a corpus-linguistic point of view. We used a large corpus of newspaper articles drawn from the <21st Century Sejong Project> and counted occurrences of certain expressions. A newspaper article is divided into the headline, the lead, and the body. We tried to figure out how to measure the characteristics of indication and compression that are typical of headlines. Then, we focused on the differences between the headline and the lead. Finally, we analyzed the sentence structure and measured the ratio of the frequency of common nouns in the body. This study verifies the existing stylistic theories of newspapers and shows new aspects of language use in newspaper articles. Texts like newspaper articles are the results of human language processing, and they in turn affect the development of the cognitive ability of language.
