• 제목/요약/키워드: corpora

검색결과 251건 처리시간 0.024초

Phonological processes of consonants from orthographic to pronounced words in the Seoul Corpus

  • Yang, Byunggon
    • 말소리와 음성과학
    • /
    • 제12권2호
    • /
    • pp.1-7
    • /
    • 2020
  • This paper investigates the phonological processes of consonants in pronounced words in the Seoul Corpus, and compares the frequency distribution of these processes to provide a clearer understanding of conversational Korean to linguists and teachers. To this end, both orthographic and pronounced words were extracted from the transcribed label scripts of the Seoul Corpus. Next, the phonological processes of consonants in the orthographic and pronounced forms were tabulated separately after syllabifying the onsets and codas, and major consonantal processes were examined. First, the results showed that the majority of the orthographic consonants' sounds were pronounced the same way as their pronounced forms. Second, more than three quarters of the onsets were pronounced as the same forms, while approximately half of the codas were pronounced as variants. Third, the majority of different onset and coda symbols were primarily caused by deletions and insertions. Finally, the five phonological process types accounted for only 12.4% of the total possible procedures. Based on these results, this paper concludes that an analysis of phonological processes in spontaneous speech corpora can improve the practical understanding of spoken Korean. Future studies ought to compare the current phonological process data with those of other languages to establish universal patterns in phonological processes.

확률적 문법규칙에 기반한 국어사전의 뜻풀이말 구문분석기 (A Parser of Definitions in Korean Dictionary based on Probabilistic Grammar Rules)

  • 이수광;옥철영
    • 한국정보과학회논문지:소프트웨어및응용
    • /
    • 제28권5호
    • /
    • pp.448-448
    • /
    • 2001
  • The definitions in Korean dictionary not only describe meanings of title, but also include various semantic information such as hypernymy/hyponymy, meronymy/holonymy, polysemy, homonymy, synonymy, antonymy, and semantic features. This paper purposes to implement a parser as the basic tool to acquire automatically the semantic information from the definitions in Korean dictionary. For this purpose, first we constructed the part-of-speech tagged corpus and the tree tagged corpus from the definitions in Korean dictionary. And then we automatically extracted from the corpora the frequency of words which are ambiguous in part-of-speech tag and the grammar rules and their probability based on the statistical method. The parser is a kind of the probabilistic chart parser that uses the extracted data. The frequency of words which are ambiguous in part-of-speech tag and the grammar rules and their probability resolve the noun phrase's structural ambiguity during parsing. The parser uses a grammar factoring, Best-First search, and Viterbi search In order to reduce the number of nodes during parsing and to increase the performance. We experiment with grammar rule's probability, left-to-right parsing, and left-first search. By the experiments, when the parser uses grammar rule's probability and left-first search simultaneously, the result of parsing is most accurate and the recall is 51.74% and the precision is 87.47% on raw corpus.

새로운 반합성 Rifamucin 유도체 KTC-1의 랫트 수태능력 시험 (Fertility Study of KTC-1, a New Semisynthetic Rifamycin Derivative, in Rats.)

  • 김종춘;정문구;노정구
    • Toxicological Research
    • /
    • 제12권1호
    • /
    • pp.93-99
    • /
    • 1996
  • The effect of KTC-1, a new semisynthetic rifamycin antituberculous drug, on general toxicity, reproductive capability and fetal development was investigated in Sprague-Dawley rats. Male rats were administered KTC-1 with mashed feed from 63 days before mating to the end of mating period, and female rats were given from 14 days before mating to day 7 of gestation at dose levels of 0, 375, 750, and 1,500 ppm. The females were sacrificed on day 21 of gestation for examination of their fetuses. At 1,500 ppm, a reduction in body weight gain and testis atrophy were observed in male rats. Histological examination revealed testicular atrophy, absence or decrease of germinal cells, and vacuolization of Sertoli cells in testis. A reduction in body weight gain, a decrease in food consumption were found in female rats. In addition, decreases in the number of corpora lutea, iraplantations, and the litter size of live fetuses were seen. Mating, fertility, and pregnancy performances were also affected. There were no external abnormalities observed by examination of fetuses. At 750 ppm, a reduction in the body weight gain of male and female rats and decreases in the number of implantations and litter size were found. At 375 ppm, no treatment-related effects were observed. The results suggest that the no-effect dose levels (NOELs) of KTC-1 are 375 ppm for males and females on general toxicity, 750 ppm for males and females on reproductive capability, and 375 ppm for fetuses on embryonic development.

  • PDF

A Corpus-Based Study of the Use of HEART and HEAD in English

  • Oh, Sang-suk
    • 한국언어정보학회지:언어와정보
    • /
    • 제18권2호
    • /
    • pp.81-102
    • /
    • 2014
  • The purpose of this paper is to provide corpus-based quantitative analyses of HEART and HEAD in order to examine their actual usage status and to consider some cognitive linguistic aspects associated with their use. The two corpora COCA and COHA are used for analysis in this study. The analysis of COCA corpus reveals that the total frequency of HEAD is much higher than that of HEART, and that the figurative use of HEART (60%) is two times higher than its literal use (32%); by contrast, the figurative use of HEAD (41%) is a bit higher than its literal use (38%). Among all four genres, both lexemes occur most frequently in fictions and then in magazines. Over the past two centuries, the use of HEART has been steadily decreasing; by contrast, that the use of HEAD has been steadily increasing. It is assumed that the decreasing use of HEART has partially to do with the decrease in its figurative use and that the increasing use of HEAD is attributable to its diverse meanings, the increase of its lexical use, and the partial increase in its figurative use. The analysis of the collocation of verbs and adjectives preceding HEART and HEAD, as well the modifying and predicating forms of HEART and HEAD also provides some relevant information of the usage of the two lexemes. This paper showcases that the quantitative information helps understanding not only of the actual usage of the two lexemes but also of the cognitive forces working behind it.

  • PDF

자기조직화 지도를 이용한 이중언어사전 자동 구축 (Bilingual Lexicon Extraction Using Self-Organizing Maps)

  • 서형원;천민아;김재훈
    • 한국정보처리학회:학술대회논문집
    • /
    • 한국정보처리학회 2015년도 춘계학술발표대회
    • /
    • pp.802-805
    • /
    • 2015
  • 본 논문은 인공신경망(artificial neural network)의 한 종류인 자기조직화 지도(self-organizing map)를 이용하여 비교말뭉치(comparable corpora)로부터 이중언어사전(bilingual lexicon)을 자동으로 구축하는 방법에 대하여 기술한다. 일반적으로 우리가 대상으로 하는 언어 쌍마다 말뭉치 혹은 초기사전과 같은 언어 자원을 수집하고 그것을 필요에 맞게 가공하는 것은 매우 어려운 일이다. 이런 관점에서 볼 때, 비지도학습(unsupervised learning) 방법 중 하나인 자기조직화 지도를 이용하여 사전을 구축하면 다른 방법에 비해 적은 노력으로도 더 높은 성능을 얻을 수 있다. 본 논문에서는 한국어와 불어에 대하여 실험을 하였고, 그 결과 적은 양의 초기사전으로도 주목할 만한 정확도를 얻을 수 있었다. 향후 연구로는 학습 파라미터에 대해 좀 더 다양한 실험을 하고, 다른 언어 쌍으로의 적용 및 기존의 평가사전을 확장하여 더 많은 경우에 대해 실험하는 것을 들 수 있다.

Estradiol Valerate로 유발된 다낭성 난소 증후군의 한약치료에 대한 국내 실험연구 고찰 (Review of Experimental Studies on Estradiol Valerate-induced Polycystic Ovarian Syndrome in Herbal Medicine)

  • 안뜰에봄;김동철
    • 대한한방부인과학회지
    • /
    • 제27권4호
    • /
    • pp.25-42
    • /
    • 2014
  • Objectives: To observe the effects of Herb medication on Polycystic Ovary (PCO) in rats. Methods: Riss (Research Information Sharing Service) was searched for Oriental medicine about PCO on 2000~2014. So, Experimental studies using PCO model induced by Estradiol Valerate (EV) was selected to analyze how effects they were. Consequently, 10 studies published on 2000~2014 were selected to analyze about weight of model and both ovaries, the number aries, the number of follicle and corpus of follicle and corpus luteum, the level of serum Androstenedione (ADD) and total estrogen, the level of Nerve Growth Factor (NGF) and Corticotropin Releasing Factor (CRF), the crossover rate and the number of implantation. Results: Among them, there was significantly difference between treatment group and control group in weight of both ovaries, the number of follicle and corpora lutea, the crossover rate and the number of implantation. Conclusions: The effects of Herb medication was observed on making weight of ovaries with PCO to promote. The study about effect of Herb medication on NGF and CRF with PCOS had to be conducted. The further research have to focus on treatment of subfertility and infertility, that is field showing the advantage of Oriental medicine.

Fillers in the Hong Kong Corpus of Spoken English (HKCSE)

  • Seto, Andy
    • 아시아태평양코퍼스연구
    • /
    • 제2권1호
    • /
    • pp.13-22
    • /
    • 2021
  • The present study employed an analytical framework that is characterised by a synthesis of quantitative and qualitative analyses with a specially designed computer software SpeechActConc to examine speech acts in business communication. The naturally occurring data from the audio recordings and the prosodic transcriptions of the business sub-corpora of the HKCSE (prosodic) are manually annotated with a speech act taxonomy for finding out the frequency of fillers, the co-occurring patterns of fillers with other speech acts, and the linguistic realisations of fillers. The discoursal function of fillers to sustain the discourse or to hold the floor has diverse linguistic realisations, ranging from a sound (e.g. 'uhuh') and a word (e.g. 'well') to sounds (e.g. 'um er') and words, namely phrase ('sort of') and clause (e.g. 'you know'). Some are even combinations of sound(s) and word(s) (e.g. 'and um', 'yes er um', 'sort of erm'). Among the top five frequent linguistic realisations of fillers, 'er' and 'um' are the most common ones found in all the six genres with relatively higher percentages of occurrence. The remaining more frequent realisations consist of clause ('you know'), word ('yeah') and sound ('erm'). These common forms are syntactically simpler than the less frequent realisations found in the genres. The co-occurring patterns of fillers and other speech acts are diverse. The more common co-occurring speech acts with fillers include informing and answering. The findings show that fillers are not only frequently used by speakers in spontaneous conversation but also mostly represented in sounds or non-linguistic realisations.

A Corpus Analysis of Temporal Adverbs and Verb Tenses Cooccurrence in Spanish, English, and Chinese

  • Cheng, An Chung;Lu, Hui-Chuan
    • 아시아태평양코퍼스연구
    • /
    • 제3권2호
    • /
    • pp.1-16
    • /
    • 2022
  • This study investigates the cooccurrence between temporal adverbs and grammatical tenses in Spanish and contrasts temporal specifications across Spanish, English, and Chinese. Based on a monolingual Spanish corpus and a trilingual parallel corpus, the study identified the top ten frequent single-word temporal adverbs collocating with grammatical tenses in Spanish. It also contrasted the cooccurrence of temporal adverbs and verb tenses in three languages. The results show that aun 'still', hoy 'today', and ahora 'now' collocate with the present tense at more than 80%. Ayer 'yesterday' and finalmente 'finally' cooccurring with the simple past tense are at 84% and 69%, respectively. Then, mientras 'meanwhile' collocates with the past imperfect at 55%, the highest of all. Mañana 'tomorrow' cooccurs with the future and present tenses at 34%. Other adverbs, ya 'already', siempre 'always', and nuevamete 'again', do not present a strong cooccurrence tendency with a tense overall. The contrastive analysis of the trilingual parallel corpus shows a comprehensive view of temporal specifications in the three languages. However, no clear one-to-one mapping pattern of the cooccurrence across the three languages can be concluded, which provides helpful insights for second language instruction with natural language data rather than intuition. Future research with larger corpora is needed.

Effects of Maternal Hypothyroidism on the Pubertal Development in Female Rat Offspring

  • Park, Jin-Soo;Lee, Sung-Ho
    • 한국발생생물학회지:발생과생식
    • /
    • 제25권2호
    • /
    • pp.83-91
    • /
    • 2021
  • The present study was performed to investigate the effect of maternal hypothyroidism and puberty onset in female rat pups. To do this, we employed propylthiouracil (PTU) to prepare a hypothyroid rat model. Pregnant rats were treated with PTU (0.025%) in drinking water from gestational day 14 to postnatal day 21 of offspring. Comparison of general indices such as body and tissue weights and puberty indices such as vaginal opening (VO) and tissue histology between control and PTU-treated rats were conducted. There was no significant difference in the date of VO between control and PTU group. The body weights of the PTU group were significantly lower, only 36.8% of the control group (p<0.001). Although the absolute thyroid weight was not changed by PTU treatment, the relative weight increased significantly about 2.8 times (p<0.001), indicating that hypothyroidism was successfully induced. On the other hand, the absolute weights of the ovary and uterus were markedly decreased by PTU administration (p<0.001), and the relative weight was not significantly changed. The ovarian histology of PTU group revealed the advanced state of differentiation (i.e., presence of corpora lutea). Inversely, the uterine histology of PTU group showed underdeveloped structures compared those in control group. Taken together, the present study demonstrates that our maternal hypothyroidism model resulted in minimal effect on pubertal development symbolized by VO despite of huge retardation in somatic growth. More sophisticatedly designed hypothyroidism model will be helpful to achieve a better understanding of pubertal development and related disorders.

홍화자(紅花子)와 조경종옥탕(調經種玉湯)이 임신 흰쥐에 미치는 영향 (Effect of Carthami Semen and Jogyeongjongok-Tang On Pregnant Rats)

  • 김대준;서부일
    • 대한본초학회지
    • /
    • 제28권3호
    • /
    • pp.33-38
    • /
    • 2013
  • Objectives : The present study has been undertaken to investigate the effects of Carthami Semen and Jogyeongjongok-Tang on pregnant rats. Method : In this experiment, the pregnant rats were administered by water extracts of Carthami Semen and Jogyeongjongok-Tang. The levels of weights, ALT, AST, ALP, BUN, creatinine, progesterone, Na and K in serum and reproductive indices of the rats were measured after treatment. Results : The levels of body weight gains were not significantly changed in comparison with Control group in Carthami Semen group and Jogyeongjongok-Tang group. In the levels of reproductive indices of the rats, the number of corpora lutea, implantation and viable fetuses, post-implantation loss, fetal weight and placental wight were not significantly changed in comparison with Control group in Carthami Semen group and Jogyeongjongok-Tang group. But pre-implantation loss was significantly increased in comparison with Control group in Carthami Semen group. The levels of BUN, creatine, ALT, AST and ALP were not significantly changed in comparison with Control group in Carthami Semen group and Jogyeongjongok-Tang group. The level of progesterone was not significantly changed in comparison with Control group in Carthami Semen group and Jogyeongjongok-Tang group. The levels of Na and K were not significantly changed in comparison with Control group in Carthami Semen group and Jogyeongjongok-Tang group. Conclusion : Reviewing these experimetal results, it appeared that Carthami Semen and Jogyeongjongok-Tang had not toxicity on pregnant rats.