• Title/Summary/Keyword: Consonants

Search Result 457, Processing Time 0.029 seconds

Hangul Bitmap Data Compression Embedded in TrueType Font (트루타입 폰트에 내장된 한글 비트맵 데이타의 압축)

  • Han Joo-Hyun;Jeong Geun-Ho;Choi Jae-Young
    • Journal of KIISE:Software and Applications
    • /
    • v.33 no.6
    • /
    • pp.580-587
    • /
    • 2006
  • As PDA, IMT-2000, and e-Book are developed and popular in these days, the number of users who use these products has been increasing. However, available memory size of these machines is still smaller than that of desktop PCs. In these products, TrueType fonts have been increased in demand because the number of users who want to use good quality fonts has increased, and TrueType fonts are of great use in Windows CE products. However, TrueType fonts take a large portion of available device memory, considering the small memory sizes of mobile devices. Therefore, it is required to reduce the size of TrueType fonts. In this paper, two-phase compression techniques are presented for the purpose of reducing the sire of hangul bitmap data embedded in TrueType fonts. In the first step, each character in bitmap is divided into initial consonant, medial vowel, and final consonant, respectively, then the character is recomposed into the composite bitmap. In the second phase, if any two consonants or vowels are determined to be the same, one of them is removed. The TrueType embedded bitmaps in Hangeul Wanseong (pre-composed) and Hangul Johab (pre-combined) are used in compression. By using our compression techniques, the compression rates of embedded bitmap data for TrueType fonts can be reduced around 35% in Wanseong font, and 7% in Johab font. Consequently, the compression rate of total TrueType Wanseong font is about 9.26%.

Effect of Percentage of Correct Consonants and Nasalance Score on the Speech Intelligibility and Acceptability in Adults with Dysarthria (마비말장애 성인의 자음정확도와 비음치가 말명료도 및 말용인도에 미치는 영향)

  • Jang, Seon Jeong;Choi, Hyun Joo
    • 재활복지
    • /
    • v.20 no.3
    • /
    • pp.67-82
    • /
    • 2016
  • The purpose of this study was to investigate relation and effect of PCC(Percentage of Correct Consonant) and nasalance score on the speech intelligibility and acceptability in adults with dysarthria by reading task of standardized passage. Ten adults with dysarthria and sixteen normal adults were participated in this study. PCC and nasalance score were measured through reading task of standardized passage. And, speech intelligibility and acceptability were examined using visual analogue criteria. The result of the study was as follows. First, the nasalance score of adults with dysarthria group is significantly higher than normal adults group in reading sample by standardized passage. Second, the PCC, speech intelligibility and acceptability shows significant correlation. However, the nasalance score doesn't show significant corelation with speech intelligibility and acceptability. These results suggest that PCC is closely related to speech intelligibility and speech acceptability, but nasalance score is not related to speech intelligibility and speech acceptability.

Perception of lenis and aspirated stops in Seoul Korean by younger and older male and female listeners (한국어 서울 방언의 평음과 격음 변별 지각에서 연령과 성별에 따른 차이)

  • Kim, Jeahong;Kim, Soan;Ahn, Joohee;Nam, Kichun;Choi, Jiyoun
    • Phonetics and Speech Sciences
    • /
    • v.12 no.4
    • /
    • pp.1-8
    • /
    • 2020
  • Traditionally it has been understood that the aspirated and lenis stops in Seoul Korean are distinguished primarily by voice onset time (VOT) and secondarily by other cues such as the fundamental frequency (F0) of the following vowel. However, recent studies on stop production have shown that the aspirated and lenis stops are currently merging in VOT and that they are now differentiated primarily by F0. In the present study, we examined whether the currently reported change in the production domain would be also found in the perception domain. To this end, an auditory identification task was conducted using speech materials of varying VOT and F0 values with young and older male and female Seoul listeners. Results revealed that all listener groups used both VOT and F0 to distinguish the lenis vs. aspirated stops but they used the F0 cue more reliably than the VOT cue in discriminating the stop contrast. The effects of gender and age were found only in the VOT cue (i.e., not in the F0 cue), with the greatest VOT cue weight in older males and the smallest in young females, which is in line with recent production studies.

Automatic severity classification of dysarthria using voice quality, prosody, and pronunciation features (음질, 운율, 발음 특징을 이용한 마비말장애 중증도 자동 분류)

  • Yeo, Eun Jung;Kim, Sunhee;Chung, Minhwa
    • Phonetics and Speech Sciences
    • /
    • v.13 no.2
    • /
    • pp.57-66
    • /
    • 2021
  • This study focuses on the issue of automatic severity classification of dysarthric speakers based on speech intelligibility. Speech intelligibility is a complex measure that is affected by the features of multiple speech dimensions. However, most previous studies are restricted to using features from a single speech dimension. To effectively capture the characteristics of the speech disorder, we extracted features of multiple speech dimensions: voice quality, prosody, and pronunciation. Voice quality consists of jitter, shimmer, Harmonic to Noise Ratio (HNR), number of voice breaks, and degree of voice breaks. Prosody includes speech rate (total duration, speech duration, speaking rate, articulation rate), pitch (F0 mean/std/min/max/med/25quartile/75 quartile), and rhythm (%V, deltas, Varcos, rPVIs, nPVIs). Pronunciation contains Percentage of Correct Phonemes (Percentage of Correct Consonants/Vowels/Total phonemes) and degree of vowel distortion (Vowel Space Area, Formant Centralized Ratio, Vowel Articulatory Index, F2-Ratio). Experiments were conducted using various feature combinations. The experimental results indicate that using features from all three speech dimensions gives the best result, with a 80.15 F1-score, compared to using features from just one or two speech dimensions. The result implies voice quality, prosody, and pronunciation features should all be considered in automatic severity classification of dysarthria.

Acoustic analysis of Korean affricates produced by dysarthric speakers with cerebral palsy (뇌성마비 마비말장애 성인의 파찰음 실현 양상 분석)

  • Mun, Jihyun;Kim, Sunhee;Chung, Minhwa
    • Phonetics and Speech Sciences
    • /
    • v.13 no.2
    • /
    • pp.45-55
    • /
    • 2021
  • This study aims to analyze the acoustic characteristics of Korean affricates produced by dysarthric speakers with cerebral palsy. Korean fricatives and affricates are the consonants that are prone to errors in dysarthric speech, but previous studies have focused only on fricatives. For this study, three affricates /tɕ, tɕh, ͈tɕ/ appearing at word initial and intervocalic positions produced by six mild-moderate male speakers of spastic dysarthria are selected from a QOLT database constructed in 2014. The parameters representing the acoustic characteristics of Korean affricates were extracted by using Praat: frication duration, closure duration, center of gravity, variance, skewness, kurtosis, and central moment. The results are as follows: 1) frication duration of the intervocalic affricates produced by dysarthric speakers was significantly longer than that of the non-disordered speakers; 2) the closure duration of dysarthric speakers was significantly longer; 3) in the case of the center of gravity, there was no significant difference between the two groups; 4) the skewness of the dysarthric speakers was significantly larger; and 5) the central moment of dysarthric speakers was significantly larger. This study investigated the characteristics of the affricates produced by dysarthric speakers and differences with non-disordered speakers.

Familiarity and Preference on Korean Typefaces by Serif and Square-Frame (한글 글꼴의 세리프 및 네모틀 여부에 따른 친숙성과 선호도)

  • Lee, Haeun;Hyun, Joo-Seok
    • Science of Emotion and Sensibility
    • /
    • v.24 no.4
    • /
    • pp.29-38
    • /
    • 2021
  • Korean typefaces are characterized on two axes: a font is either serifed or non-serifed, and it is either square-frame or non-squared. A serifed font entails small strokes that are regularly attached to the ends of larger strokes. Conversely, fonts without these marks are termed sans-serif. One of the exclusive features of Korean typeface of the square-frame type is that in such fonts, vowels and consonants often with their final vowels, are harmonically placed within the boundaries of the virtual square. We hypothesize that serifed and squared-frame typefaces are more popular and preferred owing to their widespread use throughout history. A survey incorporating Korean pangrams written with serif, sans-serif, squared, and non-squared typefaces was designed to test the present hypothesis. We found that people typically preferred and were more familiar with squared typefaces compared to non-squared typefaces. However, no difference was observed between serifed and san-serif typefaces. Furthermore, a positive correlation was found between familiarity and preference ratings only where the typefaces had squared and serifed features. The results revealed that Korean typefaces with the squared feature were more well-known and, therefore, more preferred to the typefaces without it. The results further indicated that Korean typefaces with the squared feature can be recommended for people's familiarity to it and the comfort it provides, and their emotional relevance and sensibility enhanced if serifs are added.

Preprocessing Technique for Malicious Comments Detection Considering the Form of Comments Used in the Online Community (온라인 커뮤니티에서 사용되는 댓글의 형태를 고려한 악플 탐지를 위한 전처리 기법)

  • Kim Hae Soo;Kim Mi Hui
    • KIPS Transactions on Computer and Communication Systems
    • /
    • v.12 no.3
    • /
    • pp.103-110
    • /
    • 2023
  • With the spread of the Internet, anonymous communities emerged along with the activation of communities for communication between people, and many users are doing harm to others, such as posting aggressive posts and leaving comments using anonymity. In the past, administrators directly checked posts and comments, then deleted and blocked them, but as the number of community users increased, they reached a level that managers could not continue to monitor. Initially, word filtering techniques were used to prevent malicious writing from being posted in a form that could not post or comment if a specific word was included, but they avoided filtering in a bypassed form, such as using similar words. As a way to solve this problem, deep learning was used to monitor posts posted by users in real-time, but recently, the community uses words that can only be understood by the community or from a human perspective, not from a general Korean word. There are various types and forms of characters, making it difficult to learn everything in the artificial intelligence model. Therefore, in this paper, we proposes a preprocessing technique in which each character of a sentence is imaged using a CNN model that learns the consonants, vowel and spacing images of Korean word and converts characters that can only be understood from a human perspective into characters predicted by the CNN model. As a result of the experiment, it was confirmed that the performance of the LSTM, BiLSTM and CNN-BiLSTM models increased by 3.2%, 3.3%, and 4.88%, respectively, through the proposed preprocessing technique.

The Effect of Retrieval Difficulty and Association Strength on Memory Inhibition (자극의 인출난이도와 연합강도가 기억억제에 미치는 효과)

  • Yoonjae Jung
    • Korean Journal of Cognitive Science
    • /
    • v.34 no.1
    • /
    • pp.21-38
    • /
    • 2023
  • The present study was designed to investigate the effect of the difficulty level of retrieval practice and the association strength of categories and stimuli within categories on memory inhibition. Most of the studies have investigated whether inhibition was occurred by manipulating the degree of association strength, emotion value or physical characteristics of non-retrieval practice words within the retrieval practice category. Therefore, it was necessary to study how inhibition occurs according to the degree of difficulty of retrieval stimuli during retrieval practice. The difficulty of retrieval was manipulated into three levels: difficult condition, normal condition, and easy condition through the degree of presentation of consonants and vowels of words during retrieval learning. Additionally, the strength of association between categories and words within categories was manipulated. In previous studies, retrieval-induced forgetting occurred under conditions where the association strength between categories and words within the categories was strong. On the other hand, retrieval-induced forgetting did not occur under conditions where the association strength between categories and words within the categories was weak. The present study, if the inhibition process differs according to the difficulty of retrieval, the possibility of different results from previous studies was explored according to the difference in the strength of association with the category. As a result of the study, in the condition of strong association strength, retrieval-induced forgetting was observed under normal and difficult retrieval difficulty conditions. Whereas retrieval-induced forgetting was not observed under conditions of easy retrieval difficulty condition. In the condition of weak association strength, retrieval-induced forgetting tended to occur under difficult retrieval difficulty conditions. Whereas retrieval-induced forgetting was not observed under conditions of normal and easy retrieval difficulty condition. These results suggest that memory inhibition may appear differently depending on the difficulty of retrieval.

A Comparison Study on the Speech Signal Parameters for Chinese Leaners' Korean Pronunciation Errors - Focused on Korean /ㄹ/ Sound (중국인 학습자의 한국어 발음 오류에 대한 음성 신호 파라미터들의 비교 연구 - 한국어의 /ㄹ/ 발음을 중심으로)

  • Lee, Kang-Hee;You, Kwang-Bock;Lim, Ha-Young
    • Asia-pacific Journal of Multimedia Services Convergent with Art, Humanities, and Sociology
    • /
    • v.7 no.6
    • /
    • pp.239-246
    • /
    • 2017
  • This paper compares the speech signal parameters between Korean and Chinese for Korean pronunciation /ㄹ/, which is caused many errors by Chinese leaners. Allophones of /ㄹ/ in Korean is divided into lateral group and tap group. It has been investigated the reasons for these errors by studying the similarity and the differences between Korean /ㄹ/ pronunciation and its corresponding Chinese pronunciation. In this paper, for the purpose of comparison the speech signal parameters such as energy, waveform in time domain, spectrogram in frequency domain, pitch based on ACF, Formant frequencies are used. From the phonological perspective the speech signal parameters such as signal energy, a waveform in the time domain, a spectrogram in the frequency domain, the pitch (F0) based on autocorrelation function (ACF), Formant frequencies (f1, f2, f3, and f4) are measured and compared. The data, which are composed of the group of Korean words by through a philological investigation, are used and simulated in this paper. According to the simulation results of the energy and spectrogram, there are meaningful differences between Korean native speakers and Chinese leaners for Korean /ㄹ/ pronunciation. The simulation results also show some differences even other parameters. It could be expected that Chinese learners are able to reduce the errors considerably by exploiting the parameters used in this paper.

Study on the Neural Network for Handwritten Hangul Syllabic Character Recognition (수정된 Neocognitron을 사용한 필기체 한글인식)

  • 김은진;백종현
    • Korean Journal of Cognitive Science
    • /
    • v.3 no.1
    • /
    • pp.61-78
    • /
    • 1991
  • This paper descibes the study of application of a modified Neocognitron model with backward path for the recognition of Hangul(Korean) syllabic characters. In this original report, Fukushima demonstrated that Neocognitron can recognize hand written numerical characters of $19{\times}19$ size. This version accepts $61{\times}61$ images of handwritten Hangul syllabic characters or a part thereof with a mouse or with a scanner. It consists of an input layer and 3 pairs of Uc layers. The last Uc layer of this version, recognition layer, consists of 24 planes of $5{\times}5$ cells which tell us the identity of a grapheme receiving attention at one time and its relative position in the input layer respectively. It has been trained 10 simple vowel graphemes and 14 simple consonant graphemes and their spatial features. Some patterns which are not easily trained have been trained more extrensively. The trained nerwork which can classify indivisual graphemes with possible deformation, noise, size variance, transformation or retation wre then used to recongnize Korean syllabic characters using its selective attention mechanism for image segmentation task within a syllabic characters. On initial sample tests on input characters our model could recognize correctly up to 79%of the various test patterns of handwritten Korean syllabic charactes. The results of this study indeed show Neocognitron as a powerful model to reconginze deformed handwritten charavters with big size characters set via segmenting its input images as recognizable parts. The same approach may be applied to the recogition of chinese characters, which are much complex both in its structures and its graphemes. But processing time appears to be the bottleneck before it can be implemented. Special hardware such as neural chip appear to be an essestial prerquisite for the practical use of the model. Further work is required before enabling the model to recognize Korean syllabic characters consisting of complex vowels and complex consonants. Correct recognition of the neighboring area between two simple graphemes would become more critical for this task.