• Title/Abstract/Keywords: Contents Collection

Search results: 778 items (processing time: 0.027 seconds)

Document Clustering with Relational Graph Of Common Phrase and Suffix Tree Document Model

  • Cho, Yoon-Ho;Lee, Sang-Keun
    • The Journal of the Korea Contents Association
    • /
    • Vol. 9, No. 2
    • /
    • pp.142-151
    • /
    • 2009
  • A previous document clustering method, NSTC, measures the similarity between pairs of documents using TF-IDF during web document clustering. In this paper, we propose a new similarity measure that uses a relational graph of common phrases instead of TF-IDF. The method weights common phrases using a relational graph that represents the relationships among the common phrases in the document collection. Experimental results indicate that the proposed method clusters document collections more effectively than NSTC.
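The TF-IDF baseline mentioned in this abstract can be illustrated with a minimal sketch. It shows generic TF-IDF weighting with cosine similarity over word tokens, not NSTC itself (which works on suffix-tree phrases) and not the proposed relational-graph measure. The function names `tfidf_vectors` and `cosine` and the sample documents are hypothetical, chosen only for illustration.

```python
import math
from collections import Counter

def tfidf_vectors(docs):
    """Return one {term: weight} dict per tokenized document."""
    n = len(docs)
    # Document frequency: number of documents that contain each term.
    df = Counter(term for doc in docs for term in set(doc))
    vectors = []
    for doc in docs:
        tf = Counter(doc)
        # Weight = relative term frequency * log inverse document frequency.
        vectors.append({t: (c / len(doc)) * math.log(n / df[t]) for t, c in tf.items()})
    return vectors

def cosine(u, v):
    """Cosine similarity between two sparse vectors stored as dicts."""
    dot = sum(w * v.get(t, 0.0) for t, w in u.items())
    norm_u = math.sqrt(sum(w * w for w in u.values()))
    norm_v = math.sqrt(sum(w * w for w in v.values()))
    if norm_u == 0.0 or norm_v == 0.0:
        return 0.0
    return dot / (norm_u * norm_v)

# Hypothetical toy documents, already tokenized into words.
docs = [
    "suffix tree document clustering".split(),
    "suffix tree phrase clustering".split(),
    "coal fly ash collection".split(),
]
vecs = tfidf_vectors(docs)
sim_related = cosine(vecs[0], vecs[1])    # documents sharing three terms
sim_unrelated = cosine(vecs[0], vecs[2])  # documents sharing no terms
```

In this toy setup the two documents that share terms score higher than the pair with no overlap, which is the behaviour a TF-IDF-based similarity is meant to capture.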

Comparison of Three Different Methods for Campylobacter Isolation from Porcine Intestines

  • Shin, Eun-Ju;Lee, Yeon-Hee
    • Journal of Microbiology and Biotechnology
    • /
    • Vol. 19, No. 7
    • /
    • pp.647-650
    • /
    • 2009
  • Using 200 porcine colon tissues, the efficiencies of three methods for isolating Campylobacter from porcine intestines were compared: Method 1, direct streaking of colon mucosa; Method 2, direct inoculation of intestinal contents with a swab; Method 3, inoculation of pre-enriched medium. A total of 460 Campylobacter isolates were obtained from 178 samples (89%) by direct streaking of colon mucosa, 142 samples (71%) by direct streaking of a swab, and 94 samples (47%) by pre-enrichment of intestinal contents in Preston broth. Direct streaking of colon mucosa was superior to the other two isolation methods in terms of both speed and efficiency. When isolates were identified with various biochemical tests and PCRs specific to 16S rRNA, mapA, and ceuE, C. coli was the predominant species (87%) in the porcine samples, whereas the rest of the isolates were identified as C. lanienae.

Electrostatic Precipitation Characteristics of Coal Combustion Boiler

  • Lee, Tae-Sik;Bun, Cha-Seok;Kim, Gyeong-Seok;Nam, Chang-U;Lee, Gyu-Cheol
    • The Transactions of the Korean Institute of Electrical Engineers C
    • /
    • Vol. 48, No. 6
    • /
    • pp.475-482
    • /
    • 1999
  • The electrostatic precipitation characteristics of two kinds of fly ash, one derived from a fluidized bed combustor (FBC) and the other from a pulverized coal (PC) fired furnace, have been studied in a pilot plant. Experiments were carried out to enhance the collection efficiency while changing the operating conditions for each of the two kinds of coal ash. Collection efficiency was shown to be affected by many factors, such as the shape of the ash particles, dust content, humidity, and temperature. Experimental results showed that the collection efficiency of the FBC ash was higher than that of the PC fly ash despite the small size of the FBC ash. Applying the experimental results to the collection efficiency equations showed that the modified Deutsch equation agreed well with the experimental results when the modification parameter k was set to 0.6 for the fluidized bed fly ash and to 0.43 for the pulverized coal fly ash.
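The abstract does not state the equations themselves. As a hedged reference, the forms below are the ones commonly cited in the electrostatic precipitator literature: the classical Deutsch equation and a modified form with exponent k (often attributed to Matts and Öhnfeldt). That the paper uses exactly this modified form is an assumption. The symbols are also assumptions: η is the collection efficiency, w the effective migration velocity, A the collecting plate area, and Q the gas flow rate.

```latex
% Classical Deutsch equation (assumed notation)
\eta = 1 - \exp\!\left(-\frac{wA}{Q}\right)

% Modified Deutsch equation with modification parameter k (assumed form)
\eta = 1 - \exp\!\left[-\left(\frac{w_k A}{Q}\right)^{k}\right]
```

Under this assumed form, the reported values would correspond to k = 0.6 for the fluidized bed fly ash and k = 0.43 for the pulverized coal fly ash; setting k = 1 recovers the classical equation.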

Design and Implementation of Tor Traffic Collection System Using Multiple Virtual Machines

  • Choi, Hyun-Jae;Kim, Hyun-Soo;Shin, Dong-Myung
    • Journal of Software Assessment and Valuation
    • /
    • Vol. 15, No. 1
    • /
    • pp.1-9
    • /
    • 2019
  • We aim to collect and analyze traffic efficiently in order to detect copyright infringement in which content is shared illegally on the Tor network. We have designed and implemented a Tor traffic collection system using multiple virtual machines. We use a number of virtual machines and mini PCs as clients that connect to the Tor network, and we automate both the collection and refinement processes on the traffic collection server through script-based test client software. With this system, only the necessary field data from the Tor network are stored in the database, and a Tor traffic recognition rate of 95% or higher is achieved.

Developing a Test Collection for Korean Text Categorization

  • Ra, Dong-Yul;Kim, Yunsik;Shin, Hyun-Joo;Lee, Kyu-Hee;Kim, Tae-Kyu;Kang, Hyun-Kyu;Choe, Ho-Seop;Yoon, Hwa-Mook
    • Proceedings of the Korea Contents Association Conference
    • /
    • Proceedings of the Korea Contents Association 2007 Fall Conference
    • /
    • pp.435-439
    • /
    • 2007
  • Document categorization systems are important in the internet age, in which a huge number of documents are created and need to be handled. For this reason, a lot of research has been done in this field. Supervised learning is widely used to develop such systems, and this approach requires a test collection as a prerequisite. For English, several test collections are available, and they provide a lot of help for developing systems and doing research. For Korean, however, no public test collection has been reported or made available. To improve the situation for Korean, we are constructing a Korean test collection. This paper describes the approaches being used and the current stage of the collection.

Development of a Framework for Semi-automatic Building Test Collection Specialized in Evaluating Relation Extraction between Technical Terminologies

  • Jeong, Chang-Hoo;Choi, Sung-Pil;Lee, Min-Ho;Choi, Yun-Soo
    • The Journal of the Korea Contents Association
    • /
    • Vol. 10, No. 2
    • /
    • pp.481-489
    • /
    • 2010
  • Due to increasing attention to relation extraction systems, the construction of test collections for assessing their performance has emerged as an important task. In this paper, we propose a semi-automatic framework capable of constructing test collections for relation extraction on a large scale. Based on this framework, we develop a test collection that can assess the performance of various approaches to extracting relations between technical terminologies in scientific literature. This framework can minimize the cost of constructing this kind of collection and reduce the intrinsic fluctuations that may come from the differing characteristics of collection developers. Furthermore, by using the proposed framework to control the selection of seed documents and terminologies, we can construct balanced and objective collections.

A File Clustering Algorithm for Wear-leveling

  • Lee, Taehwa;Cha, Jaehyuk
    • Journal of Digital Contents Society
    • /
    • Vol. 14, No. 1
    • /
    • pp.51-57
    • /
    • 2013
  • Storage devices based on flash memory have many attractive features, such as high performance, low power consumption, shock resistance, and low weight, so they are replacing HDDs to a certain extent. A flash-memory-based storage device has an FTL (Flash Translation Layer), which emulates a block storage device such as an HDD. Garbage collection, one of the major functions of the FTL, strongly affects the performance and lifetime of the device. However, there is no de facto standard for garbage collection algorithms. To solve this problem, we propose the File Clustering Algorithm. The File Clustering Algorithm takes into account that pages from the same file tend to be updated at the same time, so these pages are clustered into the same block. For this mechanism, we propose a Page Allocation Policy in the FTL and use MIN-MAX GAP to guarantee wear leveling. To verify the algorithm, we use the TPC Benchmark. The performance evaluation reveals that the proposed algorithm achieves results comparable with the existing algorithms (No wear leveling, Hot/Cold) and shows approximately 690% improvement in terms of wear leveling.

Design of Coordinator Based on Android for Data Collection in Body Sensor Network

  • Min, Seongwon;Lee, Jong-Yong;Jung, Kye-Dong
    • International Journal of Advanced Culture Technology
    • /
    • Vol. 5, No. 2
    • /
    • pp.98-105
    • /
    • 2017
  • Smartphones are a fast-growing segment of the IT market and are the most influential devices in our daily lives. With their excellent processing power and wireless communication technology, smartphones are being studied for use in body sensor networks. In this paper, we propose a coordinator design, based on an Android smartphone, that provides data collection, classification, and display for multiple sensor nodes. The coordinator collects data from sensor nodes that measure biological patterns, using wireless communication technologies such as Bluetooth and NFC. The coordinator constructs a network using a multiple-level scheduling algorithm for efficient data collection from multiple sensor nodes. Also, to support different protocols between heterogeneous sensors, a data sheet recording wireless communication protocol information is used. Arduino was used to test the performance of the designed coordinator in a multiple-sensor-node environment.

A Study Comparing the Han Period Bamboo Slats of the Beijing University Collection with the Laoguanshan Collection

  • Kim, Beomsu;Kim, Kiwang
    • Journal of Korean Medical classics
    • /
    • Vol. 36, No. 1
    • /
    • pp.33-43
    • /
    • 2023
  • Objectives: Overlapping contents have been identified between two recently discovered sets of Han period bamboo slats, the so-called "Beidahanjian" and the "Liushibingfang". This study aims to present new knowledge that can be inferred from the concordance of these two texts. Methods: The most recent original texts of the medical part of the Beidahanjian and the medical texts excavated from Laoguanshan, in addition to the Liushibingfang, were compared with each other to determine identical parts. The meaning of these concordances was explored. Results: Identical sentences in two verses in the Beidahanjian and the Laoguanshan texts were identified. Conclusions: The Beidahanjian is a credible Western Han period text, of which the medical bamboo slats are likely to comprise an independent text that combines ancient folk prescriptions with those of doctors.

Development and Evaluation of Core Collection Using Qualitative and Quantitative Trait Descriptor in Sesame (Sesamum indicum L.) Germplasm

  • Park, Jong-Hyun;Suresh, Sundan;Raveendar, Sebastin;Baek, Hyung-Jin;Kim, Chung-Kon;Lee, Sokyoung;Cho, Gyu-Taek;Ma, Kyung-Ho;Lee, Chul-Won;Chung, Jong-Wook
    • KOREAN JOURNAL OF CROP SCIENCE
    • /
    • Vol. 60, No. 1
    • /
    • pp.75-84
    • /
    • 2015
  • Sesame (Sesamum indicum L.) is one of the most important oilseed crops, with high oil content and rich nutrient value. The development of a core collection could facilitate access to sesame genetic resources for use in crop improvement programs and simplify genebank management. The present study was initiated to develop and evaluate a core collection of sesame based on 5 qualitative and 10 quantitative trait descriptors on 2,751 sesame accessions. The accessions came from different countries of origin. About 10.1 percent of the accessions were selected using the PowerCore program to constitute a core collection of 278 accessions. Mean comparisons using the t-test, Nei's diversity index of 10 morphological descriptors, and correlation coefficients among traits indicated that the genetic variation for these traits in the entire collection has been preserved in the core collection. The results of this study will provide useful information for future germplasm conservation and improvement programs in sesame.