• Title/Abstract/Keywords: StopWords

A Study on Electrical Characteristics of Field Stop IGBT with Separated Gate Structure

  • 조형성;이장현;리긍연;강이구
    • 한국전기전자재료학회논문지 / Vol. 36, No. 6 / pp. 609-613 / 2023
  • In this paper, a 1,200 V Si-based IGBT for use in electric vehicles and new energy industries was designed. The proposed structure, a field stop IGBT with a separated gate, was designed with trench depth and split gate width as variables, and its electrical characteristics were compared with those of a general trench structure. In the trench depth experiment, the breakdown voltage was highest at 6 µm and the on-state voltage drop was lowest at 3.5 µm. In the split gate width experiment, the breakdown voltage decreased and the on-state voltage drop increased as the width increased, so it is preferable not to change the width of the separated gate. In addition, the experiments show no difference in on-state voltage drop between a general field stop structure and one with a separated gate structure. In other words, it is determined that adding a dummy gate with a separated gate structure to the active cell could significantly improve the on-state voltage drop characteristics, while the present experiments confirmed that the on-state voltage drop does not change and that the structure has excellent breakdown voltage characteristics.

Automatic Keyword Extraction System for Korean Documents Information Retrieval

  • 예용희
    • 정보관리연구 / Vol. 23, No. 1 / pp. 39-62 / 1992
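  • Based on an analysis of actual data, this study selects about 60 postpositional particles and about 320 stopwords that occur frequently but are unnecessary for retrieval, classifies them into four types by applying left and right truncation, and presents a method for constructing the particle and stopword tables. The study develops a system in which, once a word is extracted from a Korean document, its particle is truncated efficiently, a word written in Chinese characters is converted into Hangul, and keywords are then selected through a two-stage stopword removal process. The keywords extracted by the system showed a 92.2% agreement rate with index terms extracted by information specialists. For compound words of 4 to 6 characters, the splitting method presented in this study made it possible to add about twice as many new words, 58.8% of which were suitable as keywords.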
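
The pipeline described in this abstract (particle truncation followed by stopword removal) can be illustrated with a minimal sketch. The particle and stopword tables below are tiny hypothetical stand-ins, since the abstract does not list the roughly 60 particles and 320 stopwords, and the choice to check stopwords once before and once after truncation is an assumption about what the two stages might be. Chinese character conversion and compound splitting are not covered.

```python
# Hypothetical miniature tables; the actual tables are not given in the abstract.
PARTICLES = ("에서", "으로", "은", "는", "이", "가", "을", "를", "의", "에")
STOPWORDS = {"연구", "방법"}


def strip_particle(word):
    """Right-truncate the longest matching particle, keeping at least one character."""
    for particle in sorted(PARTICLES, key=len, reverse=True):
        if word.endswith(particle) and len(word) > len(particle):
            return word[: -len(particle)]
    return word


def extract_keywords(text):
    """Split on whitespace, drop stopwords, strip particles, drop stopwords again."""
    keywords = []
    for token in text.split():
        if token in STOPWORDS:      # assumed stage 1: check the raw token
            continue
        stem = strip_particle(token)
        if stem in STOPWORDS:       # assumed stage 2: check after truncation
            continue
        keywords.append(stem)
    return keywords


print(extract_keywords("도서관에서 색인어를 연구의 방법"))
```

Plain suffix matching of this kind will also cut nouns that merely end in a particle-like syllable, which is presumably one reason a curated table and a typed classification are needed in practice.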

A Comparative Study on Requirements Analysis Techniques using Natural Language Processing and Machine Learning

  • Cho, Byung-Sun;Lee, Seok-Won
    • 한국컴퓨터정보학회논문지 / Vol. 25, No. 7 / pp. 27-37 / 2020
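  • The purpose of this study is to classify requirements through a data-driven approach, using as data the requirements collected from software requirements specifications across various domains. Based on the characteristics and information of existing requirements, the requirements are classified into functional and non-functional requirements using various natural language processing based preprocessing methods and machine learning models, and the results of each combination are presented. Among the preprocessing methods, computing word weights from term frequency and inverse document frequency produced better results than methods such as stemming and stopword removal, which reduce the number and variety of tokens to make sparse data denser. This indicates that weights computed over all words are a positive factor for machine learning, whereas stopword removal, which deletes words that carry no meaning in a sentence, turned out to be a negative factor.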
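
To make the contrast in this abstract concrete, the following is a minimal sketch of term frequency and inverse document frequency weighting in plain Python. It is not the authors' pipeline: the whitespace tokenizer, the unsmoothed formula `tf * log(N / df)`, and the two example requirement sentences are all assumptions chosen for illustration.

```python
import math
from collections import Counter


def tf_idf(docs):
    """Return one {term: weight} dict per document.

    Uses raw term frequency and idf = log(N / df); smoothed variants exist.
    """
    tokenized = [doc.lower().split() for doc in docs]
    n_docs = len(tokenized)
    # Document frequency: in how many documents each term appears.
    df = Counter(term for tokens in tokenized for term in set(tokens))
    weights = []
    for tokens in tokenized:
        tf = Counter(tokens)
        weights.append({t: tf[t] * math.log(n_docs / df[t]) for t in tf})
    return weights


# Hypothetical requirement sentences.
docs = [
    "the system shall encrypt data",
    "the system shall respond quickly",
]
for row in tf_idf(docs):
    print(row)
```

In this form a word that occurs in every document receives weight zero, so frequent function words are down-weighted while staying in the vocabulary, unlike explicit stopword removal, which deletes them before any weighting.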

Compensation in VC and Word

  • Yun, Il-Sung
    • 말소리와 음성과학 / Vol. 2, No. 3 / pp. 81-89 / 2010
  • Korean and three other languages (English, Arabic, and Japanese) were compared with regard to the compensatory movements in a VC (Vowel and Consonant) sequence and word. For this, Korean data were collected from an experiment and the other languages' data from the literature. All the test words of the languages had the same syllabic structure, i.e., /CVCV(r)/, where C was an oral stop and intervocalic consonants were either bilabial or alveolar stops. The present study found that (1) Korean is most striking in the durational variations of segments (vowel and the following hetero-syllabic consonant); (2) unlike the three languages that show a constant sum of VC, Korean yields a three-way distinction in the length of VC according to the type (lax unaspirated vs. tense unaspirated vs. tense aspirated) of the following stop consonant; (3) a durational constancy is maintained up to the word level in the three languages, but Korean word duration varies as a function of the feature tenseness of the intervocalic consonants; (4) consonant duration is proven to differentiate Korean the most from the other languages. It is suggested that the durational difference between a lax consonant and its tense cognate(s) and the degree of compensation between V and C are determined by the phonology in each language.

A Stemming Algorithm for a Korean Language Free-Text Retrieval System

  • 이효숙
    • 정보관리학회지 / Vol. 14, No. 2 / pp. 213-234 / 1997
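  • In this study, a stemming algorithm for a natural language retrieval system was designed and implemented. The algorithm proceeds iteratively through three steps: removal of stopwords using a stopword dictionary; processing of basic endings by applying rule table 1; and application of rule table 2 to the word segments left over from the previous step, which leads to extended stemming and a rewriting routine. When tested on a collection of Korean documents for performance evaluation, the algorithm showed a compression rate of 21.4% and an error rate of 15.9%.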
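
The three steps named in this abstract can be sketched as below. Every table entry is hypothetical, because the abstract does not reproduce the stopword dictionary or either rule table, and the sketch makes a single pass per word segment rather than the iterative procedure the abstract mentions.

```python
# All three tables are hypothetical illustrations, not the paper's tables.
STOPWORDS = {"그리고", "또한"}
BASIC_ENDINGS = ("하였다", "한다", "이다")            # stand-in for rule table 1
EXTENDED_RULES = (("들", ""), ("로운", "롭다"))       # stand-in for rule table 2: (suffix, rewrite)


def stem(segment):
    """Return the stemmed segment, or None if the segment is a stopword."""
    # Step 1: stopword dictionary lookup.
    if segment in STOPWORDS:
        return None
    # Step 2: strip one basic ending, if any matches.
    for ending in BASIC_ENDINGS:
        if segment.endswith(ending) and len(segment) > len(ending):
            segment = segment[: -len(ending)]
            break
    # Step 3: extended stemming, where a matched suffix may be rewritten.
    for suffix, rewrite in EXTENDED_RULES:
        if segment.endswith(suffix) and len(segment) > len(suffix):
            segment = segment[: -len(suffix)] + rewrite
            break
    return segment


for word in ("그리고", "구현하였다", "문헌들", "새로운"):
    print(word, "->", stem(word))
```

Representing rule table 2 as suffix and rewrite pairs is one simple way to combine truncation with a rewriting routine; the paper's actual rule format may differ.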

Voice Onset Time of Korean Stops as a Function of Speaking Rate

  • 오은진
    • 말소리와 음성과학 / Vol. 1, No. 3 / pp. 39-48 / 2009
  • Previous studies on the effects of speaking rate on voice onset time (VOT) of stops in English, French, Icelandic, and Thai indicate that speaking rate asymmetrically affects VOT values. That is, pre-voiced and long-lag stops vary due to the rate factor more than short-lag stops do. One suggested explanation for this asymmetry is that it is due to the necessity of maintaining phonetic contrasts among the stop categories. Since pre-voiced and long-lag stops represent the ends of the VOT scale, they encompass broad swathes of that range and consequently allow for large variations. On the other hand, the VOT variations of short-lag stops may result in overlap with the VOTs of long-lag stops. This study aimed to explore the effects of speaking rate on the VOTs of Korean stops and see whether Korean fortis and lenis stops are limited in the degrees of variation as a function of rates due to the existence of stops with larger VOT values, lenis and aspirated stops respectively. Conversely, aspirated stops were expected to show more variation since there are no other categories with longer VOTs. Fortis, lenis, and aspirated stops in /CVn/ words (C = bilabial or velar stop, V = /i/ or /a/) were examined in isolation, and at normal and fast rates in a carrier sentence. Speaking rates were controlled by alternating words or sentences on a computer screen at intervals of two seconds for the isolation- and normal-rate conditions and one second for the fast-rate condition. This study found that while the VOTs of fortis stops did not change significantly, those of lenis and aspirated stops showed considerable changes as a function of speaking rates. Also, overlap between lenis and aspirated stops occurred considerably at all speaking rates. These phenomena were interpreted to relate to the fact that VOT contrasts between lenis and aspirated stops in Korean are currently being collapsed. 
Large variations of lenis stops as a function of rates seem to occur due to a weak motivation to limit the degree of variations for the purpose of maintaining phonetic contrasts. The significant overlap between lenis and aspirated stops at all rates was interpreted to occur because the VOT merger between the two categories became considerably fixed. Also the percentage of correctly-classified VOTs by optimal-boundary values between lenis and aspirated stops turned out to be lower than in previously-studied languages. This was interpreted to be further evidence that VOTs are losing their role in contrasting the two stop categories in Korean.

A Study on Unstructured Text Data Post-processing Methodology using Stopword Thesaurus

  • 이원조
    • 문화기술의 융합 / Vol. 9, No. 6 / pp. 935-940 / 2023
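  • Most text data collected by web scraping for artificial intelligence and big data analysis are large and unstructured, so a refinement process is required before analysis. The data become analyzable structured data by passing through a heuristic pre-processing refinement stage and a machine post-processing refinement stage. In the machine post-processing stage, words are extracted with a Korean dictionary and a stopword dictionary for the frequency analysis that underlies word cloud analysis. This study proposes a methodology for applying a "user-defined stopword thesaurus" to efficiently remove the stopwords that survive this process. Through a case study using the word cloud technique in R, in which the proposed "user-defined stopword thesaurus" is used to address the problems of the existing "stopword dictionary" method, the study compares and verifies the advantages and disadvantages of the proposed refinement method and suggests the usefulness of the methodology in practical applications.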

An Experimental Study of Korean Dialectal Speech

  • 김현기;최영숙;김덕수
    • 음성과학 / Vol. 13, No. 3 / pp. 49-65 / 2006
  • Recently, several theories on digital speech signal processing have drastically expanded the communication boundary between human beings and machines. The aim of this study is to collect dialectal speech in Korea on a large scale and to establish a digital speech database in order to support further research on Korean dialects and the creation of a value-added network. 528 informants across the country participated in this study. Acoustic characteristics of vowels and consonants were analyzed using the power spectrum and spectrogram of CSL. Test words were presented on picture cards and letter cards that contained each vowel and each consonant in the initial position of words. Formants were plotted on a vowel chart, and transitions of diphthongs were compared according to dialectal speech. Spectral times, VOT, VD, and TD were measured on a spectrogram for stop consonants, and fricative frequency, intensity, and lateral formants (LF1, LF2, LF3) for fricative consonants. Nasal formants (NF1, NF2, NF3) were analyzed for different nasalities of nasal consonants. The acoustic characteristics of dialectal speech showed that young generation speakers did not distinguish between close-mid /e/ and open-mid $/\epsilon/$. The diphthongs /we/ and /wj/ were realized as simple vowels or diphthongs depending on the dialect. The sibilant /s/ showed aspiration preceding the fricative noise. Lateral /l/ was realized as the variant /r/ in Kyungsang dialectal speech. The duration of nasal consonants in Chungchong dialectal speech was the longest among the dialects.

Text Network Analysis of Newspaper Articles on Life-sustaining Treatments

  • 박은준;안대웅;박찬숙
    • 지역사회간호학회지 / Vol. 29, No. 2 / pp. 244-256 / 2018
  • Purpose: This study tried to understand discourses of life-sustaining treatments in general daily and healthcare newspapers. Methods: A text-network analysis was conducted using the NetMiner program. Firstly, 572 articles from 11 daily newspapers and 258 articles from 8 healthcare newspapers were collected, which were published from August 2013 to October 2016. Secondly, keywords (semantic morphemes) were extracted from the articles and rearranged by removing stop-words, refining similar words, excluding non-relevant words, and defining meaningful phrases. Finally, co-occurrence matrices of the keywords with a frequency of 30 times or higher were developed, and statistical measures (indices of degree and betweenness centrality, ego-networks, and clustering) were obtained. Results: In the general daily and healthcare newspapers, the top eight core keywords were common: "patients," "death," "LST (life-sustaining treatments)," "hospice palliative care," "hospitals," "family," "opinion," and "withdrawal." There were also common subtopics shared by the general daily and healthcare newspapers: withdrawal of LST, hospice palliative care, National Bioethics Review Committee, and self-determination and proxy decision of patients and family. Additionally, the general daily newspapers included diverse social interests or events like well-dying, euthanasia, and the death of farmer Baek Nam-ki, whereas the healthcare newspapers discussed problems of the relevant laws, and insufficient infrastructure and low reimbursement for hospice-palliative care. Conclusion: The discourse that withdrawal of futile LST should be allowed according to the patient's will was consistent in the newspapers. Given that newspaper articles influence knowledge and attitudes of the public, RNs are recommended to participate actively in public communication on LST.
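
The co-occurrence step in the Methods can be illustrated with a small sketch. It does not use NetMiner: treating each article as the co-occurrence unit, the function names, the `min_freq` parameter, and the toy keyword lists are all assumptions made for illustration, and only unweighted degree is computed (betweenness centrality, ego-networks, and clustering are left out).

```python
from collections import Counter
from itertools import combinations


def cooccurrence(docs, min_freq=2):
    """Count keyword pairs that share a document, keeping only keywords
    whose total frequency across all documents is at least min_freq."""
    freq = Counter(word for doc in docs for word in doc)
    kept = {word for word, count in freq.items() if count >= min_freq}
    pairs = Counter()
    for doc in docs:
        present = sorted(set(doc) & kept)   # sort so each pair has one canonical order
        pairs.update(combinations(present, 2))
    return pairs


def degree(pairs):
    """Unweighted degree: number of distinct co-occurring neighbours per keyword."""
    deg = Counter()
    for first, second in pairs:
        deg[first] += 1
        deg[second] += 1
    return deg


# Toy keyword lists standing in for articles after stop-word removal.
docs = [
    ["patients", "death", "family"],
    ["patients", "death"],
    ["patients", "hospice"],
]
pairs = cooccurrence(docs, min_freq=2)
print(pairs)
print(degree(pairs))
```

With the study's threshold, `min_freq` would be set to 30; the small default here only keeps the toy example non-empty.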

Acoustic Evidence for the Development of Aspiration Feature in Putonghua Stops

  • Han, Ji-Yeon
    • 음성과학 / Vol. 12, No. 3 / pp. 201-209 / 2005
  • This study investigated developmental temporal features in Putonghua-speaking children. A total of 212 children between the ages of 2;6 and 6;5 participated in Shanghai. Speech materials were constructed according to the aspiration feature in stop sounds of Putonghua. Six words were selected for this study, and voice onset time (VOT) was measured. Non-parametric procedures were employed for all the analyses. The VOT values across bilabial, alveolar, and velar stops differed significantly between aspirated and unaspirated stops for each age group. The effect of age is significant for unaspirated stops. It is clear that each of the Putonghua stops showed a decreasing mean and standard deviation. The overshoot phenomenon of VOT was apparent from the age of 2;6-2;11 to 4;6-4;11. There was high variability in the production of lag time for aspirated stops.
