• Title/Summary/Keyword: Text-search

Search Result 553, Processing Time 0.029 seconds

Analysis of the World Wide Web Contents in Korea for the Climate Change Education (기후 변화 교육을 위한 국내 웹 자료 분석)

  • Choi, Hyeh-Sook;Kim, Yong-Pyo
    • Hwankyungkyoyuk
    • /
    • v.23 no.3
    • /
    • pp.1-16
    • /
    • 2010
  • Global climate change becomes one of the most serious environmental problems over the world. There is growing recognition thai climate change education, especially for children is important. However, there have been few programmes, curricula, teachers' training chances, and teaching-learning materials for climate change education so far. Therefore, we analyse the world wide web(web) contents in Korea which are available for climate change education, providing fundamental data in developing educational contents for climate change, as well as helping users to search appropriate contents for climate change education. Subjects for this study are 10 web sites of public institutions related to climate change in Korea. The web contents are evaluated in terms of diversity, accuracy, authenticity and the ease of use. The key finding in this study is that the majority of the contents are focused on how to respond to the problem, especially mitigation and also we find that most of the web sites provide text-types of lesson plan and video-types. Consequently, it would be necessary to develop various web contents for climate change education in both quality and quantity aspects.

  • PDF

The Future Past of Humanities Research: Musing Methodology in the Digital Convergence Era

  • Kim, Jiyun
    • International journal of advanced smart convergence
    • /
    • v.9 no.3
    • /
    • pp.161-168
    • /
    • 2020
  • Over the last half-century, computer science has revolutionarily changed the landscape of humanities research. This digital shift in research methodology has reached from the brainstorming process to preserving, constructing, collecting, visualizing, and even analyzing materials. Such transformation has brought about the birth of the new field of study: Digital Humanities (DH). DH undeniably has saved much of the physical chores and provided a new angle to interpret the text, thereby making its meteoric rise as a promising future of the humanities. Based on such innovation, electronic circuitry can seem to replace the imagination that detects relationships and significances of research data with ever-improving interfaces. However, despite hitherto technological development, the thousands-year-old essence of traditional liberal arts-human creativity-remains the heart of humanities research and always will. This paper starts by proving this proposition in the way of comparing the old and new liberal arts research methods, focusing on literary studies. Meanwhile, it thoroughly investigates how digitalized bibliographies, search engines, databases, and digital projects provide the most useful data preservation and virtual experience of browsing in the library, along with their limitations due to the intrinsic quality of humanities research data. Also, it probes the differences between traditional and digital data analysis in current methods of literary studies, ultimately presenting the ideal direction for humanities development in the era of digital convergence.

A Method of Intonation Modeling for Corpus-Based Korean Speech Synthesizer (코퍼스 기반 한국어 합성기의 억양 구현 방안)

  • Kim, Jin-Young;Park, Sang-Eon;Eom, Ki-Wan;Choi, Seung-Ho
    • Speech Sciences
    • /
    • v.7 no.2
    • /
    • pp.193-208
    • /
    • 2000
  • This paper describes a multi-step method of intonation modeling for corpus-based Korean speech synthesizer. We selected 1833 sentences considering various syntactic structures and built a corresponding speech corpus uttered by a female announcer. We detected the pitch using laryngograph signals and manually marked the prosodic boundaries on recorded speech, and carried out the tagging of part-of-speech and syntactic analysis on the text. The detected pitch was separated into 3 frequency bands of low, mid, high frequency components which correspond to the baseline, the word tone, and the syllable tone. We predicted them using the CART method and the Viterbi search algorithm with a word-tone-dictionary. In the collected spoken sentences, 1500 sentences were trained and 333 sentences were tested. In the layer of word tone modeling, we compared two methods. One is to predict the word tone corresponding to the mid-frequency components directly and the other is to predict it by multiplying the ratio of the word tone to the baseline by the baseline. The former method resulted in a mean error of 12.37 Hz and the latter in one of 12.41 Hz, similar to each other. In the layer of syllable tone modeling, it resulted in a mean error rate less than 8.3% comparing with the mean pitch, 193.56 Hz of the announcer, so its performance was relatively good.

  • PDF

Development of Classification Model for Healthcare Contents on the Online Community (온라인 커뮤니티에서의 건강 관련 콘텐츠 분류 모형 개발)

  • Kim, Tae-Yun;Kim, Yoo-Sin;Choi, Sang-Hyun;Kim, Do-Hun;Chang, You-Jin
    • The Journal of Information Systems
    • /
    • v.26 no.4
    • /
    • pp.285-301
    • /
    • 2017
  • Purpose In this paper we verified the reliabilities of healthcare-related information provided by various users on the site of Naver Jisikin, a Korean typical search platform. Based on Q&A contents we validated answers' reliabilities to the asked questions about a lung cancer with the help of professors at a medical school. Design/methodology/approach The content analysis includes that the types of questions are classified into symptom/diagnosis, therapy, prognosis, after-management and so on. The answers contains advice, advertisement, oriental medicine, and religion as well as the above 5 question categories. The validation results of medical evidence about each answer show that only 49% among all answers have medical grounds. Findings We classified the medical grounded answers into three levels; high, medium and low. Among all answers we need to find out the answers including advertisement because the answers can be harmful to patients. We found the method to select the answers containing advertisement contents with the help of text mining research. The selection model presents high performance as 84% classification accuracy.

A comparative study of filter methods based on information entropy

  • Kim, Jung-Tae;Kum, Ho-Yeun;Kim, Jae-Hwan
    • Journal of Advanced Marine Engineering and Technology
    • /
    • v.40 no.5
    • /
    • pp.437-446
    • /
    • 2016
  • Feature selection has become an essential technique to reduce the dimensionality of data sets. Many features are frequently irrelevant or redundant for the classification tasks. The purpose of feature selection is to select relevant features and remove irrelevant and redundant features. Applications of the feature selection range from text processing, face recognition, bioinformatics, speaker verification, and medical diagnosis to financial domains. In this study, we focus on filter methods based on information entropy : IG (Information Gain), FCBF (Fast Correlation Based Filter), and mRMR (minimum Redundancy Maximum Relevance). FCBF has the advantage of reducing computational burden by eliminating the redundant features that satisfy the condition of approximate Markov blanket. However, FCBF considers only the relevance between the feature and the class in order to select the best features, thus failing to take into consideration the interaction between features. In this paper, we propose an improved FCBF to overcome this shortcoming. We also perform a comparative study to evaluate the performance of the proposed method.

A Study on the Ontology Query Module based on Natural Language (자연어 기반 온톨로지 질의 모듈 연구)

  • Kim, Won-Pil;Kong, Hyun-Jang
    • Journal of the Institute of Electronics Engineers of Korea CI
    • /
    • v.47 no.1
    • /
    • pp.146-151
    • /
    • 2010
  • For an application of ontology, query processing is mandatory field for efficient information search in the ontology. Other query processing systems tend to analyze only facts and to simply provide structural information for users. In fact, the systems do not have big difference with database systems or text based information processing systems. Therefore, in this research, the method which can provide the inferred information based on axioms is suggested in order to maximize reusability of ontology.

Analysis of Phonological Reduction in Conversational Japanese (현대일본어의 회화문에 나타난 축약형의 음운론적 분석)

  • Choi Young-sook;Sato Shigeru;Pahk Hy-tay
    • Proceedings of the KSPS conference
    • /
    • 1996.10a
    • /
    • pp.198-206
    • /
    • 1996
  • Using eighteen text materials from various goners of present-day Japanese, we collected phonologically reduced forms frequently observed in conversational Japanese, and classified them in search of unified explanation of phonological reduction phenomena. We found 7,516 cases of reduced forms which we divided into 43 categories according to the types of phonological changes they have undergone. The general tendencies ale that deletion and fusion of a phoneme or an entire syllable takes place frequently, resulting in the decrease in the number of syllable. Typical examples frequently observed throughout the materials are : $~/noda/{\rightarrow}~/nda/,{\;}-/teiru/{\rightarrow}~/teru/,{\;}~/dewa/{\rightarrow}~/zja/,{\;}~/tesimau/{\rightarrow}~/cjau/$. From morphosyntactic point of view phonological reduction often occurs at the NP and VP morpheme boundaries. The following findings are drawn from phonological observations of reduction. (1) Vowels are more easily deleted than consonants. (2) Bilabials(/m/, /b/, and /w/ are the most likely candidates for deletion. (3) In a concatenation of vowels, closed vowels are absorbed into open vowels, or two adjacent vowels come to create another vowel, in which case reconstruction of the original sequence is not always predictable. (4) Alveolars are palatalized under the influence of front vowels. (5) Regressive assimilation takes place in a syllable starting with ill, changing the entire syllable into phonological choked sound or a syllabic nasal, depending on the voicing of following phoneme.

  • PDF

An Investigation of Elementary School Teachers학 Conceptions on Buoyancy (부력 개념에 관한 초등학교 교사들의 이해도 조사)

  • 이형철;이순자
    • Journal of Korean Elementary Science Education
    • /
    • v.19 no.1
    • /
    • pp.145-156
    • /
    • 2000
  • Elementary school teachers' understandings about buoyancy were investigated through the questionnaire method. The questionnaire was composed of 4 questions on hydraulic pressure and 8 questions on buoyancy. The questions on buoyancy asked about the correlation of buoyancy with following basic concepts, density of liquid, volume of submerged object and so forth. 295 teachers on the 22 elementary schools in Busan, Yangsan and Gimhae were selected through random sampling method. The results of this study were summarized as follows: On the correlation of the magnitude and direction of hydraulic pressure with the depth of water, a large portion of the respondents had a scientific conception. But on the correlation of hydraulic pressure with density, the relatively small portion of them appeared to have a scientific conception. The respondents, on the whole, had a scientific conception about the correlation of buoyancy with density of liquid. But they seemed to have naive conceptions about the correlation of buoyancy with the volume of a submerged object and with the depth of water, the amount of water in container and the reduced amount of water by the object from container. We found that the respondents were context dependent and tended to search for solutions for the questions of buoyancy using the concept of pressure in the water. From above results, we suggested that in the would-be revised elementary science text book, the contents of pressure in the water should be introduced after the concept of weight in the water was gained.

  • PDF

Analysis of Naver CAPTCHA with Effective Segmentation (효과적인 글자 분리 방법을 사용한 네이버 캡차 공격)

  • Nyang, Dae Hun;Choi, Yong Heon;Hong, Seok Jun;Lee, Kyunghee
    • Journal of the Korea Institute of Information Security & Cryptology
    • /
    • v.23 no.5
    • /
    • pp.909-917
    • /
    • 2013
  • CAPTCHA is an automated test to tell apart computers from human mainly for web services, and it has been evolved since the most naive form in which users are requested to input simple strings has been introduced. Though many types of CAPTCHAs have been proposed, text-based CAPTCHAs have been widely prevailed for user convenience. In this paper, we introduce new segmentation schemes and show an attack method to break the CAPTCHA of Naver that occupies more than 70% of the market share in search engine. The experimental results show that 938 trials out of 1000 have successfully analyzed, which implies that we cannot use the CAPTCHA anymore.

Distribution of Medicinal Plants included in the Korean Pharmacopoeia at Cheongoksan Bonghwagun in Korea (봉화군 청옥산에 분포하는 대한민국약전 수재 약용식물의 분포 특성)

  • Song, Hong Seon;Gim, Mung Hea;Lee, Geo Lyong;Kim, Seong Min
    • Korean Journal of Medicinal Crop Science
    • /
    • v.21 no.4
    • /
    • pp.268-275
    • /
    • 2013
  • This text was analyzed and investigated the distribution of medicinal plants in Cheongoksan Bonghwagun Korea, in order to search the medicinal resources that are used in modern medicine. Medicinal plants of the Korean Pharmacopoeia (10th edition) distributed in Cheongoksan Bonghwagun were consisted of 93 taxa ; 82 species, 10 varieties, 1 forma of 79 genus, 50 families. In medicinal plants of the Korean Pharmacopoeia, rate of native species and exotic species was 89.2% (83 taxa) and 10.8% (10 taxa) respectively. Family classification was the most of compositae of 8 taxa, and life form classification was most of herb of hemicryptophyte species. The classification by using parts were 34 taxa of root use and the classification of efficacy utilization was 24 taxa of Cheongyeolyak (heat-clearing drug) use.