• Title/Summary/Keyword: punctuation

Search Result 38, Processing Time 0.025 seconds

Rich Transcription Generation Using Automatic Insertion of Punctuation Marks (자동 구두점 삽입을 이용한 Rich Transcription 생성)

  • Kim, Ji-Hwan
    • MALSORI
    • /
    • no.61
    • /
    • pp.87-100
    • /
    • 2007
  • A punctuation generation system which combines prosodic information with acoustic and language model information is presented. Experiments have been conducted first for the reference text transcriptions. In these experiments, prosodic information was shown to be more useful than language model information. When these information sources are combined, an F-measure of up to 0.7830 was obtained for adding punctuation to a reference transcription. This method of punctuation generation can also be applied to the 1-best output of a speech recogniser. The 1-best output is first time aligned. Based on the time alignment information, prosodic features are generated. As in the approach applied in the punctuation generation for reference transcriptions, the best sequence of punctuation marks for this 1-best output is found using the prosodic feature model and an language model trained on texts which contain punctuation marks.

  • PDF

An Analysis of Korean and American Presidential Addresses: Focusing on Punctuation and Transition

  • Jun, Ki-Suk;Jung, Kyu-Tae
    • English Language & Literature Teaching
    • /
    • v.17 no.2
    • /
    • pp.1-18
    • /
    • 2011
  • The object of this study is to show some features of English, focused on such mechanics as punctuation and transition, in Korean presidential addresses transcribed in English which are different from those of the United States. Towards that end, the presidential addresses of the United States and Korea from January, 2010 to June, 2010 are collected, made into corpora, and analyzed. Through analyzing the corpora, this paper is to address the following research questions: (1) What features can be regarded as different in terms of punctuation and transition? (2) If there are any differences between the corpora, are they significant enough to pose any problems for Korean and American English users to communicate with each other? (3) If so, what can be done to solve the problems in regard to pedagogical implications? Overall, as for punctuation, both Presidents' addresses share a lot in common, even with some idiosyncratic variations though. However, there are some noticeable differences in transitional devices. It is not clear whether those should be taken as a sign of personal preference, though. Transitional markers are meant to be part of wording in writing. (196 words).

  • PDF

Development of a korean Text Recognition System (한글 문서 인식 시스템 개발 연구)

  • 고견;이일병
    • Korean Journal of Cognitive Science
    • /
    • v.1 no.1
    • /
    • pp.77-102
    • /
    • 1989
  • This paper reports on the development of a recognition system for Korean character,numbers and punctuation marks by syntactic approach after extracting a character or punctuation mark from a page of text.First,using the projection profile(Masudaet.al.1985,Pavlidin 1981)method, we segment a page into different regions of column or row major and then extracts lines of characters from it.Considering the height,width and connectivity of character block,we proceed to extract syllables from the extracted lines.Basically we distinguish syables into six types of formal pattern(남궁재찬 1982,이주근등 1981)following the research of lee and others,and the punctuation marks and numbers into two kinds of formal patterns,and discriminate the surface structure of the extracted syllables.By Index-Removal algorithm,we subdivide them into 44 kinds of basic korean subpattern and special characters (numbers,punctuation marks)and recognize them by syntactic method(이주근등 1981.)

On the Donginjimun-ouchil, the Remnant Book (Kwean 7~9) of Incunabulum published in the period of koryo. (여각본 "동인지문오칠" 잔본(권7~권9)에 대하여)

  • Shin Seung-Woon
    • Journal of the Korean Society for Library and Information Science
    • /
    • v.20
    • /
    • pp.473-491
    • /
    • 1991
  • Summarizing the conclusion of this article is following this: 1. Donginjimun-ouchil(동인지문오칠) published at the close of Koryo, is not only the oldest anthology but also the only one of the same kinds that we have in present. 2. Donginjimun-ouchil is consist of 9 Kweons. We can know the fact through comparing samhansiguegam(삼한시구감), becouse it seems to summerize Donginjimun-ouchil. 3. Donginjimun-ouchil is different from other books and espically has a speial features which in eluding profils about the characters. 4. With additional punctuation marks in profile and critiism marks, we can know the rule of punctuation mark and at the same time it can give many assistances to the study of poetics.

  • PDF

Rule-based Named Entity (NE) Recognition from Speech (음성 자료에 대한 규칙 기반 Named Entity 인식)

  • Kim Ji-Hwan
    • MALSORI
    • /
    • no.58
    • /
    • pp.45-66
    • /
    • 2006
  • In this paper, a rule-based (transformation-based) NE recognition system is proposed. This system uses Brill's rule inference approach. The performance of the rule-based system and IdentiFinder, one of most successful stochastic systems, are compared. In the baseline case (no punctuation and no capitalisation), both systems show almost equal performance. They also have similar performance in the case of additional information such as punctuation, capitalisation and name lists. The performances of both systems degrade linearly with the number of speech recognition errors, and their rates of degradation are almost equal. These results show that automatic rule inference is a viable alternative to the HMM-based approach to NE recognition, but it retains the advantages of a rule-based approach.

  • PDF

Automated Classification of Sentential Types in Korean with Morphological Analysis (형태소 분석을 통한 한국어 문장 유형 자동 분류)

  • Chung, Jin-Woo;Park, Jong-C.
    • Language and Information
    • /
    • v.13 no.2
    • /
    • pp.59-97
    • /
    • 2009
  • The type of a given sentence indicates the speaker's attitude towards the listener and is usually determined by its final endings and punctuation marks. However, some 6na1 endings are used in several types of sentences, which means that we cannot identify the sentential type by considering only the final endings and punctuation marks. In this paper, we propose methods of finding some other linguistic clues for indentifying the sentential type with a morphological analysis. We also propose to use these methods to implement a system that automatically classifies sentences in Korean according to their sentential types.

  • PDF

A Fundamental Study for the Phenomenology of Communication (커뮤니케이션 현상학에 관한 기초 연구)

  • Lee, Bum-Soo
    • Korean journal of communication and information
    • /
    • v.71
    • /
    • pp.250-273
    • /
    • 2015
  • The phenomenology of communication represents a starting point in the union of consciousness and experience, analogue and digit, expression and perception, person and lived-world, rhetoric and ethic, constitute human communication. A phenomenological definition of communication requires that analysis proceed through a phenomenological description, reduction, interpretation. This analysis thus far has taken up the general issue of fact versus value, consisting in a subdivision into intention and punctuation on the factual side, and convention and legitimation on the value side. It has treated of the relationships among intention and metaphysics, punctuation and epistemology, convention and logic, and legitimation and axiology.

  • PDF

Character Segmentation in Chinese Handwritten Text Based on Gap and Character Construction Estimation

  • Zhang, Cheng Dong;Lee, Guee-Sang
    • International Journal of Contents
    • /
    • v.8 no.1
    • /
    • pp.39-46
    • /
    • 2012
  • Character segmentation is a preprocessing step in many offline handwriting recognition systems. In this paper, Chinese characters are categorized into seven different structures. In each structure, the character size with the range of variations is estimated considering typical handwritten samples. The component removal and merge criteria are presented to remove punctuation symbols or to merge small components which are part of a character. Finally, the criteria for segmenting the adjacent characters concerning each other or overlapped are proposed.

A Study on Hangeul Orthography Guidelines for Foreigners (외국인을 위한 한글맞춤법 시안 연구)

  • Han, Jae young
    • Journal of Korean language education
    • /
    • v.28 no.4
    • /
    • pp.273-296
    • /
    • 2017
  • This study focuses on a review of Hangeul orthography guidelines in Korean language regulations. It is indispensable to revise the guidelines thoroughly because it has been more than 80 years since a unified plan of Korean orthography was established in 1933, which the current orthography is based on. Also, it has been approximately 30 years since 1989, when the current guidelines were issued and promulgated. The viewpoint towards this review reflects the requirements by education fields of Korean as a foreign language and modern Korean users. Hangeul orthography consists of six clauses, along with an appendix regarding punctuation marks: 1) general rules, 2) consonants and vowels, 3) related to sounds, 4) about forms, 5) spacing between words, and 6) miscellaneous. This paper examined individual clauses and specific usages of the clauses, in terms of Korean as a foreign language. Based on the review, this paper suggests the following tasks in order to establish a draft of Hangeul orthography for foreigners. A. Among the individual clauses, some clauses that embody vocabulary education aspects should be addressed in a Korean dictionary, and deleted in Hangeul orthography guidelines. B. The clauses of Hangeul orthography guidelines should be edited for revision and substitution where necessary. C. The usage of individual clauses should be replaced with more appropriate examples aligned with everyday conversation. D. In order to establish 'Hangeul orthography for foreigners', linguists should continuously review several chapters and the appendix of Hangeul orthography, such as components about forms, spacing between words, miscellaneous, and punctuation marks. The purpose of this review is to pursue the simplicity of Hangeul orthography guidelines and the practicality in terms of reflecting more realistic examples. This review contributes to facilitate Korean language usage not only for non-native learners, but also native users.

A Study on the Special Characters as UX/UI Icon Design Elements (UX/UI 아이콘 디자인 요소로서 특수 문자 체계 연구)

  • Song, Jae-yeon
    • Journal of Digital Convergence
    • /
    • v.19 no.5
    • /
    • pp.397-405
    • /
    • 2021
  • The purpose of this study is to organize the system of special characters as UX/UI icon design elements, thus laying the groundwork for improvement direction for unclear use regulations. This study examines the theoretical background of UX/UI design and special characters and discovers UX/UI design and special characters' relations and assignments. Besides, the case study summarizes the system of special characters being utilized in the company's UX/UI icon design guidelines to produce the study results. As a result of the analysis, the special character types being utilized in UX/UI were graphic characters, mathematical symbols, punctuation marks, and parentheses. And the special characters commonly used in analysis cases, iOS, Android, and Windows, are ▶, ♥, ★, ○, ⊙, +, ×, ⋯. So this study organizes the common characters to standardize them. Hopefully, this study contributes to increasing the interest in the study of 'special characters' in the UX/UI design field and helps establish a framework for future industrial standards.