• Title/Summary/Keyword: Corpus-based

Search Result 571, Processing Time 0.026 seconds

A Study on the Integration of Information Extraction Technology for Detecting Scientific Core Entities based on Large Resources (대용량 자원 기반 과학기술 핵심개체 탐지를 위한 정보추출기술 통합에 관한 연구)

  • Choi, Yun-Soo;Cheong, Chang-Hoo;Choi, Sung-Pil;You, Beom-Jong;Kim, Jae-Hoon
    • Journal of Information Management
    • /
    • v.40 no.4
    • /
    • pp.1-22
    • /
    • 2009
  • Large-scaled information extraction plays an important role in advanced information retrieval as well as question answering and summarization. Information extraction can be defined as a process of converting unstructured documents into formalized, tabular information, which consists of named-entity recognition, terminology extraction, coreference resolution and relation extraction. Since all the elementary technologies have been studied independently so far, it is not trivial to integrate all the necessary processes of information extraction due to the diversity of their input/output formation approaches and operating environments. As a result, it is difficult to handle scientific documents to extract both named-entities and technical terms at once. In this study, we define scientific as a set of 10 types of named entities and technical terminologies in a biomedical domain. in order to automatically extract these entities from scientific documents at once, we develop a framework for scientific core entity extraction which embraces all the pivotal language processors, named-entity recognizer, co-reference resolver and terminology extractor. Each module of the integrated system has been evaluated with various corpus as well as KEEC 2009. The system will be utilized for various information service areas such as information retrieval, question-answering(Q&A), document indexing, dictionary construction, and so on.

Beneficial effect of Combination with Korean Red Ginseng, Gastrodia Rhizoma and Polygoni Multiflori on Cholesterol and Erectile Dysfunction in Hyperlipidemia rats (홍삼, 천마, 적하수오 병용투여에 의한 고지혈증 랫드에서의 콜레스테롤 및 발기부전 개선효과)

  • Lee, Yun Jung;Kho, Min Chul;Tan, Rui;Lee, Jae Yun;Hwang, Jin Seok;Cha, Jeong Dan;Choi, Kyung Min;Kang, Dae Gill
    • The Korea Journal of Herbology
    • /
    • v.30 no.6
    • /
    • pp.69-75
    • /
    • 2015
  • Objectives : This study was designed to investigate effects of the combination with Korean Red Ginseng (Panax ginseng C.A. Meyer), Gastrodia Rhizoma (Gastrodia elata Blume) and Polygoni Multiflori Radix (Polygonum multiflorum Thunberg) on metabolic disorders including cholesterol and erectile dysfunction in hyperlipidemia rats.Methods : Animals were divided into six groups; Control with normal diet, high fat/cholesterol-diet (HFCD), fluvastatin, Korean Red Ginseng treated (KRG), and the combination treated (Korean Red Ginseng, Gastrodia Rhizoma and Polygoni Multiflori Radix; 1:1:1 for KGP1 and 2:1:1 for KGP2). The experimental groups initially received HFCD for 10 weeks and then treated orally with fluvastatin, KRG, KGP1 and KGP2 during the final 6 weeks. Erectile function was determined by the measurements of intracavernosal pressure (ICP) and maximal arterial pressure (MAP) after electrical stimulation of the cavernosal nerve.Results : KGP2 decreased the level of total cholesterol and LDL cholesterol in the sera of HFCD rats without no changes of body weights. KRG, KGP1 and KGP2 decreased the level of C-reactive protein (CRP) levels except of fluvastatin, synthetic HMG-CoA reductase inhibitor. KRG, KGP1 and KGP2 significantly increased the ICP, ICP/MAP ratio, area under the curve (AUC) compared with those of normal rat. Morphometric analyses showed that KRG, KGP1 and KGP2 increased the volume of smooth muscle and the regular arrangement of collagen fibers in corpus cavernosum of HFCD rats. The penile expression of eNOS was increased by KRG, KGP1 and KGP2.Conclusions : Based on these results, we suggest that the combination with Korean Red Ginseng, Gastrodia Rhizoma and Polygoni Multiflori may improve hyperlipidemia through regulating the lipid profiles and erectile dysfunction in rats.

Behavioral Changes of Rats following Cingulate or Other Cortical Damages (대상회전 기타 피질이 손상된 흰쥐들의 행동 변화)

  • Kim, Chung-Chin;Kim, Jong-Kyu;Kim, Myung-Suk
    • The Korean Journal of Physiology
    • /
    • v.2 no.2
    • /
    • pp.83-92
    • /
    • 1968
  • A study was planned to evaluate the effects of removal of the cingulate cortex upon the occurrence of any behavior commonly displayed by the rat, and to compare the effects of cingulectomy with those of removal of the parietal, parieto-occipital, or occipital regions. The subjects were 54 male albino rats (Holtzman strain, body weight $200{\sim}330\;gm$) including 14 rats in which the cingulate gyri between splenium and genu of the corpus callosum were bilaterally ablated by suction (cingulate group), 9 animals which had their parietal cortices (chiefly area 7) partially removed (parietal group), 9 rats whose parietal and occipital regions (chiefly areae 7 & 17), 13 animals in which the occipital cortices (chiefly area 17) were removed bilaterally (occipital group), and 9 normal rats (normal control group). Eighteen observation cages, each of which housed a subject and was provided with food and water ad lib., were arranged in 6 rows on a rack and the behavior of each subject was scanned by an observer at a distance of 1.5 m from the rack. The observer scanned the first and second rows 6 times in 1 min, then proceeded to the 3rd and 4th rows, scanning for another 1 min, and finally to the 5th and 6th rows. The speed of scanning was such that behavioral observations of all of the 18 rats were completed in 3 min, each subject receiving 6 observations. The scanning was repeated every 3 min for 18 min, which constituted one observation session and was followed by a 72 minutes' recess. The whole procedure was repeated through 24 hours so that a total of 576 behavioral observations were made on each subject in 16 observation sessions. Behaviors checked were sleeping, lying, lying and sniffing, standing, standing and sniffing, exploring, eating, drinking, grooming (included were washing, licking, and scratching), and others. Results obtained were as follows: 1. The cingulate group ate significantly more often than the normal control, the parietal, and the parieto-occipital groups. 2. Exploration was significantly less frequent in the cingulate group than in the normal control, the parietal, and the occipital groups. There was, in the case of the cingulate group, a significant negative correlation between the occurrence of eating and the exploratory activity. 3. The general activity, as judged from the value obtained by adding the occurrence of exploration, eating, drinking, grooming, and standing and sniffing, was significantly increased in the cingulate group compared with those of any other groups including the normal control. 4. Though statistically insignificant, the cingulate group slept least often among all the animal groups tested. 5. The parieto-occipital group tended to groom less, and the parietal group to eat less often than the normal control group did, but the difference was not significant. There were no significant differences among all the groups except the cingulate group as regards other behaviors analyzed. Based on the above results, it was inferred that the cingulate cortex exerts an inhibitory influence upon the occurrence of eating and general activity, while it tends to facilitate the occurrence of sleep.

  • PDF

Superovulation Response after Follicular Wave Synchronization with Follicular Aspiration by Ultrasonography in HanWoo I. Effect of Follicular Aspiration on Ovarian Response Following Superovulation (과배란 처치시 우세난포 조절에 의한 한우 수정란 생산성 향상에 관한 연구 I. 우세난포 처리에 따른 난소반응)

  • 이병천;이동원;신수정;박종임;황우석
    • Journal of Embryo Transfer
    • /
    • v.14 no.3
    • /
    • pp.203-210
    • /
    • 1999
  • In this stuyd, the effect of the dominant follicle aspiration for the superovulatory response in HanWoo was investigated. The criterion for the presence or absence of a dominant follicle based on their morphological examination. The dominant follicle was aspirated 48hr before the onset of superovulation treatment by 6.5MHz convex probe connected with a carrier and superovulation induced by FSH (Super-Ov Tyrer, Texas, U.S.A) adminstered twic a day s.c. over 4 day in a decreasing regimen. From 13 HanWoo scanned daily to determine the presence and growth of the dominant follicle, its an average diameter of 15.4mm was measured and an average diameter of corpora lutea was 18.7mm on day of follicular aspiration. In the experiment, a follicular remove by ultrasound-guided aspiration, the ovarian response was significantly enhanced when animals were superovulated in the aspiation of a dominant follicle compare with animals superovulated non-aspiration of a dominat follicle. In the aspiration of a dominant follicle donors yieleded more corpora lutea(14.4$\pm$4.7 vs 8.6$\pm$3.4) and transferable embryos(8.9$\pm$4.2 vs 5.4$\pm$2.7) than control. In cows in which the dominant follicle had been aspirated under sonographical control 2 days before superovuation, the number of corpus lutea and transferable embryos were significantly enhanced compared with animals superovulated in the presence of a dominant follicle (14.4$\pm$4.7 vs 6.9$\pm$2.7, ; 8.9$\pm$4.2 vs 3.3$\pm$1.6). After 7 days of artificial insemination, the embryos at 7 days were cllected by uterine flushing after dominant follicle insemination, the embryos at 7 days were collected by uterine flushing after dominant follicle aspiration and superovulation treatment, and evaluated their quality by morphological criteria. Sixteen embryos with excellent and good grade were transferred into 8 recipient cows. Six pregnancies were identified at 60 and 120 days of gestation by rectal palpations. In conclusion, the present study showed that 1) the presence or absence of a dominant follicle signficicnatly affects superovulatory responses, and 2) ultrasound-guided follicular aspiration of the dominant follicle and superovuation treatment provides an accurate and procedure to increase ovarian responses in HanWoo.

  • PDF

Effect of Different Feeding Ratios of Whole Crop Barley Silage on the Embryo Production in Hanwoo Donors

  • Son, Dong-Soo;Choe, Chang-Yong;Cho, Sang-Rae;Kim, Nam-Tae;Kim, Hyun-Jong;Yeon, Seong-Heum;Ryu, Il-Sun;Son, Jun-Kyu;Choi, Sun-Ho;Kim, Ill-Hwa
    • Journal of Embryo Transfer
    • /
    • v.24 no.4
    • /
    • pp.265-269
    • /
    • 2009
  • The purpose of this study was to determine the effect of different feeding ratios of whole crop barley silage on the embryo production in Hanwoo donors. All donors were basically fed 2.5 kg concentrate daily. Donors were divided into three groups according to the different feeding of forage; hay 70% and rice straw 30% (control, n = 21), whole crop barley silage 80% and rice straw 20% (T1, n = 25), and whole crop barley silage 60% and rice straw 40% (T2, n = 23) fed based on TDN 6.70/ BW 500 kg. All Hanwoo donors received a CIDR together with injections of 1 mg estradiol benzoate and 50 mg progesterone ($P_4$, Day 0). Four days later, they were superovulated with 28 mg FSH twice daily IM in decreasing doses over 4 days. Then donors received 2 doses of $PGF_2{\alpha}$ (25 and 15 mg) with the 5th and 6th injections of FSH on Day 6. CIDR were withdrawn at the 6th FSH injection and the donors received $100\;{\mu}g$ GnRH 36 h after the second $PGF_2{\alpha}$ injection. The donors were artificially inseminated twice, at 8 and 24 h after GnRH, and embryos were recovered 7 or 8 days after the 1st insemination. The flush rate of the donors following positive superovulation responses did not differ among groups (76.2~96.0%, p>0.05). The number of corpus luteum (CL) at embryo recovery also did not differ among groups (10.6~14.0, p>0.05). Furthermore, the mean numbers of total ova (9.4, 10.5 and 12.0) and transferable embryos (5.3, 12.0 and 6.5) did not significantly differ among the control, T1 and T2 groups, respectively (p>0.05). However, mean concentrations of serum $P_4$ of the T1 (64.2 ng/ml) and T2 groups (55.7 ng/ml) were higher than that of control group (43.3 ng/ml, p<0.01), while serum cholesterol concentrations in the control (105.8 mg/dl) and T2 groups ($96.9\;{\pm}\;mg/dl$) were significantly lower than in the T1 group (121.1 mg/dl, p<0.05). Conclusively, whole crop barley silage can be fed a good substitute for hay forage for Hanwoo donors. Furthermore the ratios of whole crop barley silage 60% and rice straw 40% might be more worthful for embryo production.

Automatic Word Spacing of the Korean Sentences by Using End-to-End Deep Neural Network (종단 간 심층 신경망을 이용한 한국어 문장 자동 띄어쓰기)

  • Lee, Hyun Young;Kang, Seung Shik
    • KIPS Transactions on Software and Data Engineering
    • /
    • v.8 no.11
    • /
    • pp.441-448
    • /
    • 2019
  • Previous researches on automatic spacing of Korean sentences has been researched to correct spacing errors by using n-gram based statistical techniques or morpheme analyzer to insert blanks in the word boundary. In this paper, we propose an end-to-end automatic word spacing by using deep neural network. Automatic word spacing problem could be defined as a tag classification problem in unit of syllable other than word. For contextual representation between syllables, Bi-LSTM encodes the dependency relationship between syllables into a fixed-length vector of continuous vector space using forward and backward LSTM cell. In order to conduct automatic word spacing of Korean sentences, after a fixed-length contextual vector by Bi-LSTM is classified into auto-spacing tag(B or I), the blank is inserted in the front of B tag. For tag classification method, we compose three types of classification neural networks. One is feedforward neural network, another is neural network language model and the other is linear-chain CRF. To compare our models, we measure the performance of automatic word spacing depending on the three of classification networks. linear-chain CRF of them used as classification neural network shows better performance than other models. We used KCC150 corpus as a training and testing data.

Classification of nasal places of articulation based on the spectra of adjacent vowels (모음 스펙트럼에 기반한 전후 비자음 조음위치 판별)

  • Jihyeon Yun;Cheoljae Seong
    • Phonetics and Speech Sciences
    • /
    • v.15 no.1
    • /
    • pp.25-34
    • /
    • 2023
  • This study examined the utility of the acoustic features of vowels as cues for the place of articulation of Korean nasal consonants. In the acoustic analysis, spectral and temporal parameters were measured at the 25%, 50%, and 75% time points in the vowels neighboring nasal consonants in samples extracted from a spontaneous Korean speech corpus. Using these measurements, linear discriminant analyses were performed and classification accuracies for the nasal place of articulation were estimated. The analyses were applied separately for vowels following and preceding a nasal consonant to compare the effects of progressive and regressive coarticulation in terms of place of articulation. The classification accuracies ranged between approximately 50% and 60%, implying that acoustic measurements of vowel intervals alone are not sufficient to predict or classify the place of articulation of adjacent nasal consonants. However, given that these results were obtained for measurements at the temporal midpoint of vowels, where they are expected to be the least influenced by coarticulation, the present results also suggest the potential of utilizing acoustic measurements of vowels to improve the recognition accuracy of nasal place. Moreover, the classification accuracy for nasal place was higher for vowels preceding the nasal sounds, suggesting the possibility of higher anticipatory coarticulation reflecting the nasal place.

Building robust Korean speech recognition model by fine-tuning large pretrained model (대형 사전훈련 모델의 파인튜닝을 통한 강건한 한국어 음성인식 모델 구축)

  • Changhan Oh;Cheongbin Kim;Kiyoung Park
    • Phonetics and Speech Sciences
    • /
    • v.15 no.3
    • /
    • pp.75-82
    • /
    • 2023
  • Automatic speech recognition (ASR) has been revolutionized with deep learning-based approaches, among which self-supervised learning methods have proven to be particularly effective. In this study, we aim to enhance the performance of OpenAI's Whisper model, a multilingual ASR system on the Korean language. Whisper was pretrained on a large corpus (around 680,000 hours) of web speech data and has demonstrated strong recognition performance for major languages. However, it faces challenges in recognizing languages such as Korean, which is not major language while training. We address this issue by fine-tuning the Whisper model with an additional dataset comprising about 1,000 hours of Korean speech. We also compare its performance against a Transformer model that was trained from scratch using the same dataset. Our results indicate that fine-tuning the Whisper model significantly improved its Korean speech recognition capabilities in terms of character error rate (CER). Specifically, the performance improved with increasing model size. However, the Whisper model's performance on English deteriorated post fine-tuning, emphasizing the need for further research to develop robust multilingual models. Our study demonstrates the potential of utilizing a fine-tuned Whisper model for Korean ASR applications. Future work will focus on multilingual recognition and optimization for real-time inference.

Analysis on the English Translation of The First Chosen Educational Ordinance, Manual of Education of Koreans (1913), and Manual of Education in Chosen 1920 (1920) Using Text Mining Analytics (텍스트 마이닝(Text mining) 기법을 활용한 『제1차조선교육령』과 『조선교육요람』(1913, 1920)의영어번역본 분석)

  • Jinyoung Tak;Eunjoo Kwak;Silo Chin;Minjoo Shon;Dongmie Kim
    • The Journal of the Convergence on Culture Technology
    • /
    • v.9 no.6
    • /
    • pp.309-317
    • /
    • 2023
  • The purpose of this paper is to investigate how Japan tried to dominate Chosen through educational policies by analyzing three official English texts published by the Japanese Government-General of Korea: the First Chosen Educational Ordinance declared in 1911, the Manual of Education of Koreans(1913), and the Manual of Education in Chosen 1920(1920). In order to pursue this purpose, the present study carried a corpus-based diachronic analysis, rather then a qualitative analysis. Facilitating text analytics such as Word Cloud and CONCOR, this paper derived the following results: First, the first Chosen Educational Ordinance(1911) includes overall educational regulations, curriculum, and operations of schools. Second, the Manual of Education of Koreans(1913) contains the educational medium and contents on how to educate. Finally, it can be proposed that the Manual of Education in Chosen 1920(1920) contains specific implementation of education and the subject of education.

Automatic Recognition of Pitch Accent Using Distributed Time-Delay Recursive Neural Network (분산 시간지연 회귀신경망을 이용한 피치 악센트 자동 인식)

  • Kim Sung-Suk
    • The Journal of the Acoustical Society of Korea
    • /
    • v.25 no.6
    • /
    • pp.277-281
    • /
    • 2006
  • This paper presents a method for the automatic recognition of pitch accents over syllables. The method that we propose is based on the time-delay recursive neural network (TDRNN). which is a neural network classifier with two different representation of dynamic context: the delayed input nodes allow the representation of an explicit trajectory F0(t) along time. while the recursive nodes provide long-term context information that reflects the characteristics of pitch accentuation in spoken English. We apply the TDRNN to pitch accent recognition in two forms: in the normal TDRNN. all of the prosodic features (pitch. energy, duration) are used as an entire set in a single TDRNN. while in the distributed TDRNN. the network consists of several TDRNNs each taking a single prosodic feature as the input. The final output of the distributed TDRNN is weighted sum of the output of individual TDRNN. We used the Boston Radio News Corpus (BRNC) for the experiments on the speaker-independent pitch accent recognition. π 1e experimental results show that the distributed TDRNN exhibits an average recognition accuracy of 83.64% over both pitch events and non-events.