• 제목/요약/키워드: Data enrichment

검색결과 291건 처리시간 0.027초

마이크로어레이 자료에서 생존과 유의한 관련이 있는 유전자집단 검색 (Detecting survival related gene sets in microarray analysis)

  • 이선호;이광현
    • Journal of the Korean Data and Information Science Society
    • /
    • 제23권1호
    • /
    • pp.1-11
    • /
    • 2012
  • 환자의 생존시간과 함께 유전자 마이크로어레이 자료가 주어진 경우 생존에 유의한 영향을 미치는 대사경로를 찾는 방법을 연구하였다. 기존의 방법인 유전자 집합 농축도 분석, 글로벌 검정과 왈드 형태 검정을 비교 분석하였고, 치환을 통하여 p값을 구하는 단점을 개선한 수정된 왈드 형태 검정을 제안하였다. 모의실험과 실제자료 분석을 이용하여 새로운 방법의 적용 가능성을 보였다.

A Universal Analysis Pipeline for Hybrid Capture-Based Targeted Sequencing Data with Unique Molecular Indexes

  • Kim, Min-Jung;Kim, Si-Cho;Kim, Young-Joon
    • Genomics & Informatics
    • /
    • 제16권4호
    • /
    • pp.29.1-29.5
    • /
    • 2018
  • Hybrid capture-based targeted sequencing is being used increasingly for genomic variant profiling in tumor patients. Unique molecular index (UMI) technology has recently been developed and helps to increase the accuracy of variant calling by minimizing polymerase chain reaction biases and sequencing errors. However, UMI-adopted targeted sequencing data analysis is slightly different from the methods for other types of omics data, and its pipeline for variant calling is still being optimized in various study groups for their own purposes. Due to this provincial usage of tools, our group built an analysis pipeline for global application to many studies of targeted sequencing generated with different methods. First, we generated hybrid capture-based data using genomic DNA extracted from tumor tissues of colorectal cancer patients. Sequencing libraries were prepared and pooled together, and an 8-plexed capture library was processed to the enrichment step before 150-bp paired-end sequencing with Illumina HiSeq series. For the analysis, we evaluated several published tools. We focused mainly on the compatibility of the input and output of each tool. Finally, our laboratory built an analysis pipeline specialized for UMI-adopted data. Through this pipeline, we were able to estimate even on-target rates and filtered consensus reads for more accurate variant calling. These results suggest the potential of our analysis pipeline in the precise examination of the quality and efficiency of conducted experiments.

Knowledge Representation Using Fuzzy Ontologies: A Survey

  • V.Manikandabalaji;R.Sivakumar
    • International Journal of Computer Science & Network Security
    • /
    • 제23권12호
    • /
    • pp.199-203
    • /
    • 2023
  • In recent decades, the growth of communication technology has resulted in an explosion of data-related information. Ontology perception is being used as a growing requirement to integrate data and unique functionalities. Ontologies are not only critical for transforming the traditional web into the semantic web but also for the development of intelligent applications that use semantic enrichment and machine learning to transform data into smart data. To address these unclear facts, several researchers have been focused on expanding ontologies and semantic web technologies. Due to the lack of clear-cut limitations, ontologies would not suffice to deliver uncertain information among domain ideas, conceptual formalism supplied by traditional. To deal with this ambiguity, it is suggested that fuzzy ontologies should be used. It employs Ontology to introduce fuzzy logical policies for ambiguous area concepts such as darkness, heat, thickness, creaminess, and so on in a device-readable and compatible format. This survey efforts to provide a brief and conveniently understandable study of the research directions taken in the domain of ontology to deal with fuzzy information; reconcile various definitions observed in scientific literature, and identify some of the domain's future research-challenging scenarios. This work is hoping that this evaluation can be treasured by fuzzy ontology scholars. This paper concludes by the way of reviewing present research and stating research gaps for buddy researchers.

일본군 '위안부' 지식그래프: 파편화된 디지털 기록의 연결 (A Knowledge Graph on Japanese "Comfort Women": Interlinking Fragmented Digital Archival Resources)

  • 박하람;김학래
    • 한국기록관리학회지
    • /
    • 제21권3호
    • /
    • pp.61-78
    • /
    • 2021
  • 일본군 '위안부'에 대한 기록은 민간 기관에서 개별적으로 관리하고 있다. 일부 기록은 디지털 아카이브로 구축되어 온라인으로 접근할 수 있다. 그러나, 디지털 아카이브의 기록은 기관에 따라 메타데이터의 구성과 표현 방식이 다르다. 한편, 기록 사이의 관계를 정의할 수 있는 체계가 미흡하기 때문에, 현재 구축된 일본군 '위안부' 기록은 서로 연결되지 않고 파편적인 형식으로 남아있다. 본 연구는 일본군 '위안부' 디지털 기록을 연계하기 위한 지식 모델을 제안하고, 분산화된 디지털 아카이브의 기록을 통합하여 일본군 '위안부' 지식그래프를 구축한다. 일본군 '위안부' 디지털 아카이브의 메타데이터를 분석하여 공통 요소를 도출하고, 표준 어휘를 적용하여 디지털 기록의 다양한 개체와 개체 사이의 관계를 의미적으로 표현한다. 특히, 흩어져 있는 기록을 연계하고 검색하기 위해 수집한 데이터의 정제가 이루어지고, 외부데이터를 활용하여 기록의 맥락 정보를 강화하고 있다. 구축된 지식그래프의 검증은 분산된 기록의 탐색 여부를 측정하는 질의를 통해 수행된다. 검증 결과, 지식그래프는 흩어져 있는 기록을 연계하여 검색할 수 있고, 외부데이터로부터의 강화로 기록의 맥락 정보를 풍부하게 제공하며, 의미 기반의 검색을 통해 사용자의 의도에 맞춘 정확한 검색이 가능하다.

Development of Enrichment Semi-nested PCR for Clostridium botulinum types A, B, E, and F and Its Application to Korean Environmental Samples

  • Shin, Na-Ri;Yoon, So-Yeon;Shin, Ji-Hun;Kim, Yun Jeong;Rhie, Gi-eun;Kim, Bong Su;Seong, Won Keun;Oh, Hee-Bok
    • Molecules and Cells
    • /
    • 제24권3호
    • /
    • pp.329-337
    • /
    • 2007
  • An enrichment semi-nested PCR procedure was developed for detection of Clostridium botulinum types A, B, E, and F. It was applied to sediment samples to examine the prevalence of C. botulinum in the Korean environment. The first pair of primers for the semi-nested PCR was designed using a region shared by the types A, B, E, and F neurotoxin gene sequences, and the second round employed four nested primers complementary to the BoNT/A, /B, /E, and /F encoding genes for simultaneous detection of the four serotypes. Positive results were obtained from the PCR analysis of five of 44 sediments (11%) collected from Yeong-am Lake in Korea; all were identified as deriving from type B neurotoxin (bontb) genes. Two of the C. botulinum type B organisms were isolated, and their bontb genes sequenced. The deduced amino acid sequences of BoNT/B showed 99.5 and 99.8% identity with the amino acid sequence of accession no. AB084152. Our data suggest that semi-nested PCR is a useful tool for detecting C. botulinum in sediments, and renders it practicable to conduct environmental surveys.

한국 주변 해역 표층해수중 삼중수소 농도 (Tritium Concentrations in Surface Seawater around Korean Peninsula)

  • 김창규;조용우;김계훈
    • Journal of Radiation Protection and Research
    • /
    • 제21권2호
    • /
    • pp.107-115
    • /
    • 1996
  • 전해농축법을 이용하여 한국 주변 해역 해수중 저준위 삼중수소를 측정하였다. 동 서 남해안에서 채취된 표충해수중 삼중수소를 측정한 결과, 삼중수소 농도는 $0.12 BqL^{-1}$에서 $1.50 BqL^{-1}$ 범위내의 값을 나타내었으며 그 평균값은 $0.60{\pm}0.353qL^{-1}$였다. 해역별 평균 삼중수소 농도는 동해안에서 $0.54{\pm}0.30 BqL^{-1}$ 남해안에서 $0.48{\pm}0.35 BqL^{-1}$. 서해안에서 $0.77{\pm}0.32 BqL^{-1}$의 농도분포를 각각 나타내었으며, 해역별로 큰 차이를 나타내지 않았다. 또한, 시료채취지점이 매우 좁은 위도범위내에 한정되어 있어 위도에 따른 삼중수소 농도의 체계적인 변화는 관찰되지 않았다. 본 연구에서 얻어진 표층해수 중 삼중수소 농도 준위는 일본 근해에서 조사된 결과와 매우 유사하였으며, 태평양지역에서의 삼중수소 농도에 비해서는 약간 높은 값을 나타내었다.

  • PDF

중성자 방사화분석법과 Gent SFU 샘플러를 이용한 도시의 농촌지역의 대기분지($PM_{10}$)관측 연구 (Study on Airborne Particulate Matter ($PM_{10}$) Monitoring in Urban and Rural Area by Using Gent SFU Sampler and Instrumental Neutron Activation Analysis)

  • 정용삼;문종화;김선하;박광원;강상훈
    • 한국대기환경학회지
    • /
    • 제16권5호
    • /
    • pp.453-467
    • /
    • 2000
  • The aim of this research is to collect and characterize fine particles (FPM:$\leq$2.5${\mu}{\textrm}{m}$) and coarse particles (CPM: 2.5~10${\mu}{\textrm}{m}$) using a low volume air sampler provided by the IAEA, at urban (Taejon) and rural area(Wonju) for a period of about two years(April 1996 to May 1998) and to promote a use of nuclear analytical techniques for air pollution studies. For the collection of airborne particulate matter (PM(sub)10), the Gent stacked filter unit sampler and polycarbonate membrane filters were employed. The concentration of trace elements in collected APM samples were determined byu instrumental Neutron Activation Analysis. For validation of the analytical data, internal quality control were implemented by using both the comparison of the analytical results of standard reference materials(NIST SRM 1648) and interlaboratory comparison for proficiency test (NAT-3). The standard uncertainty was less than 15% and Z-score of two samples were within $\pm$1. The monitoring of (PM(sub)10) mass concentration and elemental concentrations were carried out weekly. The average mass concentration of (PM(sub)10) in urban and rural areas were 59.2$\pm$36.5$\mu\textrm{g}$/㎥ and 41.4$\pm$23.7$\mu\textrm{g}$/㎥, respectively. To investigate the emission source, the enrichment factors were calculated for the fine and coarse particle fractions at two sites, respectively and these values were classified for anthropogenic and soil origin elements.

  • PDF

3-Stage DRUM 샘플러를 이용한 광주 도심지역의 봄철과 여름철 PM2.5 원소적 조성 비교 (Elemental Composition of PM2.5 Particulate with a 3-Stage DRUM Sampler during Spring and Summer Seasons in Urban Area of Gwangju, Korea)

  • 류성윤;김영준
    • 한국대기환경학회지
    • /
    • 제21권6호
    • /
    • pp.699-708
    • /
    • 2005
  • To characterize the elemental composition of fine particles in urban area, $PM_{2.5}$ was collected by a 3-stage DRUM impactor at Gwangju during spring and summer. Time and size resolved concentrations for 19 trace elements were obtained by synchrotron X-Ray fluorescence analysis. Trace elements in summer were distributed in smaller size range compared to those in spring. Almost trace element concentrations in fine particles were highly increased during the Asian dust. In spring, soil elements such as Si, K, Ca, Ti and Fe had low enrichment factors indicating the dominant influence of soil dust. However, all elements had high enrichment factors in summer implying that these elements could be emitted from the anthropogenic sources. Factor analysis was conducted with the elemental composition data in order to identify anthropogenic sources of aerosols in urban area during spring and summer. Fine particles in spring have several sources such as soil dust originating from China continental region, coal and oil combustion, biomass burning, sea salt, ferrous and nonferrous metal sources. On the other hand, fine particles in summer were influenced by road dust, gasoline vehicle as well as coal and oil combustion, sea salt, ferrous and nonferrous metal sources.

마이크로어레이 자료분석에서 모수적 방법을 이용한 유전자군의 유의성 검정 (Developing a Parametric Method for Testing the Significance of Gene Sets in Microarray Data Analysis)

  • 이선호;이승규;이광현
    • Communications for Statistical Applications and Methods
    • /
    • 제16권3호
    • /
    • pp.397-408
    • /
    • 2009
  • 마이크로어레이 기술은 수만 개 유전자의 발현 패턴을 동시에 관찰하는 것을 가능하게 하였고, 이들을 하나씩 검정하여 찾아낸 특이발현 현상을 보이는 유전자를 중심으로 질병의 진단, 치료법 정립과 신약 개발을 위한 기본 정보를 확립하였다. 그러나 개별 유전자분석의 여러 문제점이 발견되면서 유전자들을 생물학적 대사경로나 염색체 위치가 같은 것끼리 묶은 집단을 분석하여 질병의 발생이나 생존에 영향을 미치는 집단을 찾는 방법이 제시되었다. 이러한 유전자 집단의 유의성에 대한 연구는 2002년에 MIT에서 비롯되어 GSEA, SAM-GS와 중심극한 정리의 개념을 이용한 모수적 방법인 PAGE 등이 사용되고 있다. 본 논문에서는 이들 통계량의 구조적 한계를 극복하고 계산이 간단한 새로운 모수적 방법을 제안하고 자료 분석을 통하여 효율성을 보였다.

자연발효 과정에서 인디고에 환원력을 지닌 미생물 커뮤니티 분석과 농화배양 (Analysis and Enrichment of Microbial Community Showing Reducing Ability toward indigo in the Natural Fermentation of Indigo-Plant)

  • 최은실;이은빈;최형안;손경희;김근중;신윤숙
    • KSBB Journal
    • /
    • 제28권5호
    • /
    • pp.295-302
    • /
    • 2013
  • Indigo is utilized in various industries including textile dyeing, cosmetics, printing and medicinal products and its reduced form, leuco-indigo, is mainly used in these process. Chemical reducing agent (sodium dithionite, sodium sulfide, etc.) is preferred to use for the formation of leucoindigo in industry. In traditional indigo fermentation process, microorganisms can participate in the reduction of indigo and thus it has been known to reduce environmental pollution and noxious byproducts. However, in fermentation method using microorganisms it is difficult to standardize large scale production process due to low yield and reproducibility. In this study, we attempted to develop the indigo reduction process using microbial flora which was isolated from naturally fermented indigo vat or deduced by metagenomic approach. From the results of library analyses of PCR-amplified 16S rRNA genes from the traditional indigo fermentation vat sample (metagenome), it was confirmed that Alkalibacteriums (71%) was distinctly dominant in population. Some strains were identified after confirming that they become pure culture in nutrient media modified slightly. Four strains were separated in this process and each strain showed obvious reducing ability toward indigo in dyeing test. It is expected that the analyzed results will provide important data for standardizing the natural fermentation of indigo and investigating the mechanism of indigo reduction.