• Title/Abstract/Keywords: Data annotation

Search results: 258 items (processing time: 0.025 s)

전북 서해안권 국가지질공원 지질명소 안내 표지판에 사용된 용어 분석 (An Analyses of the Terms used in the Information Boards of Geosites at Jeonbuk West Coast National Geopark)

  • 신영준;조규성
    • 한국지구과학회지 / Vol. 41, No. 1 / pp. 40-47 / 2020
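  • This study analyzed the terms used on the information boards at geosites in the Jeonbuk West Coast National Geopark. Only the nouns were extracted from the text of the boards and classified into eight types according to whether they are listed in the Standard Korean Language Dictionary, the Earth science glossary of academic terms, and the editorial reference materials for developing textbooks under the 2015 revised national curriculum. Of the extracted terms, 71 (10.8%) fell under [Type 8], meaning they are listed in none of these references. Most terms of this type are compounds of the form [noun]+[noun] or derived words of the form [noun]+[affix], and their meanings were judged to be difficult to interpret clearly. In addition, 256 terms (46%) were identified as technical terms used in specialized fields. Therefore, when information boards for national geoparks are produced, technical terms should be explained in plain language wherever possible, and where a technical term is used it should be annotated with a supplementary explanation, so that the general public and students can read and understand the boards more easily and the boards achieve a sufficient educational effect.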
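The eight types in the abstract above are consistent with the eight possible yes/no combinations of being listed in three reference works. A minimal sketch of such a classification follows; the function name and the numbering of Types 1 to 7 are assumptions made for illustration (the abstract only states that Type 8 means "listed nowhere"), and the word lists are toy data.

```python
def classify_term(term, dictionary, glossary, curriculum):
    """Return a type number 1-8 from membership in three reference lists.

    Assumed numbering: Type 1 = listed in all three references,
    Type 8 = listed in none (only Type 8 is stated in the abstract).
    """
    flags = (term in dictionary, term in glossary, term in curriculum)
    # Encode the three yes/no answers as a 3-bit number: all listed -> 7, none -> 0.
    code = (flags[0] << 2) | (flags[1] << 1) | int(flags[2])
    return 8 - code


# Toy reference lists (hypothetical, not taken from the study).
dictionary = {"rock", "coast"}
glossary = {"rock", "tuff"}
curriculum = {"rock"}

for word in ("rock", "tuff", "sea-stack-zone"):
    print(word, classify_term(word, dictionary, glossary, curriculum))
```

Encoding the three membership checks as bits keeps the mapping exhaustive: every term falls into exactly one of the eight types.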

Whole Genome Resequencing of Heugu (Korean Black Cattle) for the Genome-Wide SNP Discovery

  • Choi, Jung-Woo;Chung, Won-Hyong;Lee, Kyung-Tai;Choi, Jae-Won;Jung, Kyoung-Sub;Cho, Yongmin;Kim, Namshin;Kim, Tae-Hun
    • 한국축산식품학회지 / Vol. 33, No. 6 / pp. 715-722 / 2013
  • Heugu (Korean Black Cattle) is one of the indigenous cattle breeds in Korea; however, there has been a severe lack of genomic studies on the breed. In this study, we report the first whole genome resequencing of Heugu at high sequence coverage using the Illumina HiSeq 2000 platform. More than 153.6 giga base pairs of sequence were obtained, of which 97% of the reads were mapped to the bovine reference sequence assembly (UMD 3.1). The number of non-redundantly mapped sequence reads corresponds to approximately 28.9-fold coverage across the genome. From these data, we identified a total of over six million single nucleotide polymorphisms (SNPs), of which 29.4% were found to be novel using the single nucleotide polymorphism database build 137. Extensive annotation was performed on all the detected SNPs, showing that most SNPs were located in intergenic regions (70.7%), which corresponds well with previous studies. Of the total SNPs, we identified substantial numbers of non-synonymous SNPs (13,979) in 5,999 genes, which could potentially affect meat quality traits in cattle. These results provide genome-wide SNPs that can serve as useful genetic tools and as candidates in searches for phenotype-altering DNA differences implicated in meat quality traits in cattle. The importance of this study is further underscored by its being the first whole genome sequencing of this valuable local genetic resource, which can be used in further genomic comparison studies with diverse cattle breeds.

Identification of Recently Selected Mutations Driven by Artificial Selection in Hanwoo (Korean Cattle)

  • Lim, Dajeong;Gondro, Cedric;Park, Hye Sun;Cho, Yong Min;Chai, Han Ha;Seong, Hwan Hoo;Yang, Bo Suk;Hong, Seong Koo;Chang, Won Kyung;Lee, Seung Hwan
    • Asian-Australasian Journal of Animal Sciences / Vol. 26, No. 5 / pp. 603-608 / 2013
  • Hanwoo have been subjected over the last seventy years to intensive artificial selection with the aim of improving meat production traits such as marbling and carcass weight. In this study, we performed a signature of selection analysis using whole genome SNP data to identify regions under recent positive selection driven by a long-term artificial selection process, that is, a breeding program. In order to investigate homozygous regions across the genome, we estimated iES (integrated Extended Haplotype Homozygosity SNP) for each SNP. As a result, we identified two highly homozygous regions that appear to be under strong and/or recent positive selection. Five genes (DPH5, OLFM3, S1PR1, LRRN1 and CRBN) were included in this region. To go further in the interpretation of the observed signatures of selection, we subsequently concentrated on the annotation of differentiated genes defined according to the iES value of SNPs localized close to or within them. We also described the detection of adaptive evolution at the molecular level for the genes of interest. This analysis also led to the identification of OLFM3 as having a strong signal of selection in the bovine lineage. The results of this study indicate that artificial selection, which might have targeted most of these genes, was mainly oriented towards improvement of meat production.

Protein-protein Interaction Network Analyses for Elucidating the Roles of LOXL2-delta72 in Esophageal Squamous Cell Carcinoma

  • Wu, Bing-Li;Zou, Hai-Ying;Lv, Guo-Qing;Du, Ze-Peng;Wu, Jian-Yi;Zhang, Pi-Xian;Xu, Li-Yan;Li, En-Min
    • Asian Pacific Journal of Cancer Prevention / Vol. 15, No. 5 / pp. 2345-2351 / 2014
  • Lysyl oxidase-like 2 (LOXL2), a member of the lysyl oxidase (LOX) family, is a copper-dependent enzyme that catalyzes oxidative deamination of lysine residues on protein substrates. LOXL2 was found to be overexpressed in esophageal squamous cell carcinoma (ESCC) in our previous research. We later identified a LOXL2 splicing variant, LOXL2-delta72, and we overexpressed LOXL2-delta72 and its wild-type counterpart in ESCC cells, followed by microarray analyses. First, the differentially expressed genes (DEGs) of LOXL2 and LOXL2-delta72 compared to empty plasmid were applied to generate protein-protein interaction (PPI) sub-networks. Comparison of these two sub-networks showed hundreds of different proteins. To reveal the potential specific roles of LOXL2-delta72 compared to its wild type, the DEGs of LOXL2-delta72 vs LOXL2 were also applied to construct a PPI sub-network, which was annotated by Gene Ontology. The functional annotation map indicated that the third PPI sub-network involved hundreds of GO terms, such as "cell cycle arrest", "G1/S transition of mitotic cell cycle", "interphase", "cell-matrix adhesion" and "cell-substrate adhesion", as well as significant "immunity" related terms, such as "innate immune response", "regulation of defense response" and "Toll signaling pathway". These results provide important clues for experimental identification of the specific biological roles and molecular mechanisms of LOXL2-delta72. This study also provided a workflow to test the different roles of a splicing variant with high-throughput data.

Genetic and biochemical evidence for redundant pathways leading to mycosporine-like amino acid biosynthesis in the cyanobacterium Sphaerospermopsis torques-reginae ITEP-024

  • Geraldes, Vanessa;de Medeiros, Livia Soman;Lima, Stella T.;Alvarenga, Danillo Oliveira;Gacesa, Ranko;Long, Paul F.;Fiore, Marli Fatima;Pinto, Ernani
    • ALGAE / Vol. 35, No. 2 / pp. 177-187 / 2020
  • Cyanobacteria have been widely reported to produce a variety of UV-absorbing mycosporine-like amino acids (MAAs). Herein, we report production of the unusual MAA mycosporine-glycine-alanine (MGA) in the cyanobacterium Sphaerospermopsis torques-reginae ITEP-024 using a newly developed UHPLC-DAD-MS/HRMS (ultra-high performance liquid chromatography-diode array detection-high resolution tandem mass spectrometry) method. MGA had previously been identified in a red alga, but S. torques-reginae strain ITEP-024 is the first cyanobacterium to be reported as an MGA producer. The chemical structure of MGA is fully elucidated from one-dimensional and two-dimensional nuclear magnetic resonance and HRMS data analyses. MAAs are unusually produced constitutively in S. torques-reginae ITEP-024, and this production was further enhanced following UV irradiation. It has been proposed that MAA biosynthesis proceeds in cyanobacteria from the pentose phosphate pathway intermediate sedoheptulose 7-phosphate. Annotation of a gene cluster encoded in the genome sequence of S. torques-reginae ITEP-024 supports the view that these gene products could catalyse the biosynthesis of MAAs. However, addition of glyphosate to cultures of S. torques-reginae ITEP-024 abolished constitutive and ultraviolet radiation-induced production of MGA, shinorine and porphyra-334. This finding supports involvement of the shikimic acid pathway in the biosynthesis of MAAs by this species.

Quantitative Proteogenomics and the Reconstruction of the Metabolic Pathway in Lactobacillus mucosae LM1

  • Pajarillo, Edward Alain B.;Kim, Sang Hoon;Lee, Ji-Yoon;Valeriano, Valerie Diane V.;Kang, Dae-Kyung
    • 한국축산식품학회지 / Vol. 35, No. 5 / pp. 692-702 / 2015
  • Lactobacillus mucosae is a natural resident of the gastrointestinal tract of humans and animals and a potential probiotic bacterium. To understand the global protein expression profile and metabolic features of L. mucosae LM1 in the early stationary phase, the Q Exactive™ Hybrid Quadrupole-Orbitrap Mass Spectrometer was used. Characterization of the intracellular proteome identified 842 proteins, accounting for approximately 35% of the 2,404 protein-coding sequences in the complete genome of L. mucosae LM1. Proteome quantification using Q Exactive™ Orbitrap MS detected 19 highly abundant proteins (> 1.0% of the intracellular proteome), including CysK (cysteine synthase, 5.41%) and EF-Tu (elongation factor Tu, 4.91%), which are involved in cell survival against environmental stresses. Metabolic pathway annotation of the LM1 proteome using the Kyoto Encyclopedia of Genes and Genomes (KEGG) database showed that half of the expressed proteins are important for basic metabolic and biosynthetic processes, and the other half might be structurally important or involved in basic cellular processes. In addition, glycogen biosynthesis was activated in the early stationary phase, which is important for energy storage and maintenance. The proteogenomic data presented in this study provide a suitable reference for understanding the protein expression pattern of lactobacilli under standard conditions.

도로 설계를 위한 지형정보 해석에 있어서 SQL의 응용 (The Application of SQL in Terrain Information Analysis for Route Design)

  • 강준묵;윤희천;이형석;이성순
    • 대한공간정보학회지 / Vol. 3, No. 2 / pp. 29-42 / 1995
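  • Preliminary road design is still carried out manually on planar topographic maps, which consumes considerable time and labor and raises many efficiency problems. With the recent growth of interest in terrain information processing using GSIS, research on efficient road design methods based on three-dimensional digital terrain information is being actively pursued. To interpret terrain information for road design more efficiently, this study applies SQL to the construction and analysis of a database, thereby presenting objective and comprehensive supporting data and showing that three-dimensional terrain information analysis is feasible. A 1:5,000-scale topographic map of the study area was built into a three-dimensional base map, and terrain information was obtained from various thematic maps, including contour, land use, road network, and drainage maps. Terrain information was assembled by linking the graphic information of the completed topographic map with the attribute information in the database, and SQL was applied in route planning. In addition, design data for the preliminary routes, such as longitudinal and cross-sectional profiles and earthwork volumes, were computed more quickly and efficiently, and the road geometry and natural landscape after design were rendered as a DTM for visual assessment, so the approach can be applied as an efficient method for road design.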
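The abstract above describes applying SQL to attribute data linked to map features during route planning. A minimal sketch of that kind of query follows, using Python's built-in sqlite3 module. The table name `terrain_cells`, its columns, the slope threshold, and the excluded land-use classes are all assumptions made for illustration; the abstract does not describe the authors' schema or queries.

```python
import sqlite3


def candidate_cells(rows, max_slope_pct=10.0, excluded_use=("water", "urban")):
    """Return ids of terrain cells that pass a simple route-planning filter.

    rows: iterable of (cell_id, elevation_m, slope_pct, land_use) tuples.
    The schema and the filter criteria are hypothetical.
    """
    conn = sqlite3.connect(":memory:")
    conn.execute(
        "CREATE TABLE terrain_cells ("
        "cell_id INTEGER PRIMARY KEY, elevation_m REAL, "
        "slope_pct REAL, land_use TEXT)"
    )
    conn.executemany("INSERT INTO terrain_cells VALUES (?, ?, ?, ?)", rows)
    # One placeholder per excluded land-use class keeps the query parameterized.
    placeholders = ",".join("?" for _ in excluded_use)
    query = (
        "SELECT cell_id FROM terrain_cells "
        f"WHERE slope_pct <= ? AND land_use NOT IN ({placeholders}) "
        "ORDER BY cell_id"
    )
    result = [r[0] for r in conn.execute(query, (max_slope_pct, *excluded_use))]
    conn.close()
    return result


sample = [
    (1, 120.0, 4.5, "forest"),
    (2, 95.0, 12.0, "farmland"),
    (3, 80.0, 3.0, "water"),
    (4, 101.0, 8.0, "farmland"),
]
print(candidate_cells(sample))  # [1, 4]
```

Keeping the attribute filter in SQL means the selection criteria can be changed without touching the map geometry, which is the kind of separation between graphic and attribute information the abstract refers to.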

메타게놈 서열에 존재하는 보존적인 전사와 번역 인자를 이용한 ORF 예측 (Prediction of ORFs in Metagenome by Using Cis-acting Transcriptional and Translational Factors)

  • 정대은;김근중
    • KSBB Journal / Vol. 25, No. 5 / pp. 490-496 / 2010
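  • Roughly $5 \times 10^{30}$ microbial cells exist on Earth, and they contain 350-550 Pg (1 Pg = $10^{15}$ g) of carbon, 85-130 Pg of nitrogen, and 9-14 Pg of phosphorus, a larger quantity of these elements than any other group of organisms on the planet. The relationships between these microbes and the other organisms and inorganic matter that make up ecosystems are also steadily being uncovered. Because the basic goal of such studies is to identify the factors that matter for these interactions (typically genes), searching for and confirming true ORFs on the chromosome is the most important basic tool. However, for metagenomes composed of diverse microbes, the fraction that can be found using existing information cannot be estimated accurately, which causes many difficulties. Searching data with such ill-defined boundaries requires more information (more gene sequences for training or for defining the search space), and additional search methods and techniques need to be developed. As an alternative, a search method based on the conservation of transcriptional and translational factors present in microbial intergenic sequences could have a wide range of application, depending on how it is refined. Even at its current level it has sufficient value in combined searches, that is, when used together with existing methods or as a complement to them. This assessment was confirmed by results ranging from no agreement at all with conventional ORF-centered discovery to more than 90% agreement. The many cases of disagreement arise because they include new ORFs that are not found by BLAST searches.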
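The idea in the abstract above, using a conserved cis-acting signal in the intergenic sequence as evidence for a nearby ORF, can be illustrated with a minimal sketch. This is not the authors' method: the abstract does not say which factors or parameters were used, so the Shine-Dalgarno-like motif, the upstream window size, the minimum length, and the function name below are all assumptions.

```python
import re

STOP_CODONS = {"TAA", "TAG", "TGA"}


def find_orfs_with_rbs(seq, rbs_pattern="AGGAGG|GGAGG|AGGAG", window=20, min_codons=3):
    """Return (start, end) pairs for ORFs whose ATG has an RBS-like motif upstream.

    Only the forward strand is scanned. `end` is exclusive and includes the
    stop codon. `min_codons` counts the start codon but not the stop codon.
    The motif and thresholds are hypothetical defaults.
    """
    seq = seq.upper()
    hits = []
    for match in re.finditer("ATG", seq):
        start = match.start()
        # Look for the conserved translational signal just upstream of the start codon.
        upstream = seq[max(0, start - window):start]
        if not re.search(rbs_pattern, upstream):
            continue
        # Walk codon by codon in frame until the first stop codon.
        for i in range(start, len(seq) - 2, 3):
            if seq[i:i + 3] in STOP_CODONS:
                if (i - start) // 3 >= min_codons:
                    hits.append((start, i + 3))
                break
    return hits


example = "CCAGGAGGTTTACCATGAAACCCGGGTTTTAAGG"
print(find_orfs_with_rbs(example))  # [(14, 32)]
```

A filter like this could be combined with a conventional ORF caller, in line with the abstract's remark that the approach is useful alongside existing methods.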

시맨틱 갭을 줄이기 위한 딥러닝과 행위 온톨로지의 결합 기반 이미지 검색 (Image retrieval based on a combination of deep learning and behavior ontology for reducing semantic gap)

  • 이승;정혜욱
    • 예술인문사회 융합 멀티미디어 논문지 / Vol. 9, No. 11 / pp. 1133-1144 / 2019
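  • With the recent advance of smart devices, the volume of image data on the Internet is growing rapidly, and various methods for effective image retrieval are being studied. Existing image retrieval methods simply detect the objects present in an image and search on the label information of each object, so a semantic gap arises between the image the user wants and the images returned by the search. To reduce this semantic gap, this paper connects a deep-learning-based multi-object classification module with a module that classifies human behavior, and combines these modules with a behavior ontology. That is, we propose an image retrieval system that takes the relationships among objects into account, based on a combination of deep learning and a behavior ontology. To account for the dynamic behavior contained in images, we analyzed the results of experiments using Walking and Running data. The proposed method can be extended to research on automatic annotation generation for images, which could improve the accuracy of image retrieval results.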

조선총독부의 '조선도서 및 고문서'의 수집·분류 활동 (A study on collecting and classifying the Chosen literatures and archives of Chosen General Government)

  • 이승일
    • 기록학연구 / No. 4 / pp. 93-130 / 2001
  • The Chosen General Government began collecting and managing the archives of the Chosen Dynasty because it needed them to pursue its colonial policies actively. These efforts eventually deepened the regime's understanding of Korean culture and contributed, to an extent, to its rule over Korea. Several aspects reflect this. In 1910 the Chosen General Government took over and began to arrange and classify the huge volume of archives that had been held by the royal family. During this period, it collected and arranged the literature that it had taken over from the earlier Korean government. In 1913, the Chosen General Government greatly increased the variety and volume of the archives that it intended to collect. It started by collecting archives limited to the literature that had existed in the civil sector before 1894. It was thus in 1913 that the Chosen General Government revealed its intention to collect and classify both royal and civil archives. With the work of collecting, classifying, and annotating archives, the Chosen General Government commenced the compilation of Chosensa (Korean History). These efforts aimed at the cultural assimilation and education of the Korean people, and in this process the importance of the Chosen Dynasty's archives was reconfirmed. One representative case was a change of terminology. With the compilation efforts in full swing from 1915, the Chosen General Government began to use the term 'Saryo' (historical records) repeatedly in connection with Chosen's literature and archives. The term 'Saryo' had previously been used in Japanese literature, and it is deemed to have been used from that time as a general term for the archives of the Chosen Dynasty. This signifies that the Chosen General Government began to bring its own historical point of view to bear on the archives of Chosen.
As it broadened its understanding of Korea through the annotation of old literature and the compilation of Chosen History, it set about in earnest the work of culturally assimilating the Korean people, with the aim of tightening its rule over Korea. The archives of Chosen were likewise crucial basic data for understanding Korea and its people, and the Chosen General Government is deemed to have used the archives as a means to rule and assimilate the Korean people.