• Title/Summary/Keyword: classification schemes

Search Result 231, Processing Time 0.022 seconds

Taxonomy of tribe Neillieae (Rosaceae): Physocarpus (나도국수나무족의 분류: 산국수나무속)

  • Oh, Sang-Hun
    • Korean Journal of Plant Taxonomy
    • /
    • v.45 no.4
    • /
    • pp.332-352
    • /
    • 2015
  • The tribe Neillieae, a small group of about 18 species in the Rosaceae, comprises three taxonomically difficult genera, Neillia, Physocarpus, and Stephanandra. The tribe, characterized by lobed leaves with persistent or deciduous stipules and ovoid, shiny seeds with copious endosperm, is strongly supported as a monophyletic group by a variety of lines of molecular evidence. Due to the high amount of morphological variation across the three genera and the species in tribe Neillieae, conflicting classification schemes and numerous species have been proposed over the past three centuries. However, no comprehensive systematic study of the group, including all species across their geographic ranges, has ever been undertaken. As part of a taxonomic revision of tribe Neillieae, a revision of Physocarpus based on the morphological examination of herbarium specimens, including types, and field observation is presented. Artificial keys, comprehensive nomenclatural treatments, descriptions, distribution maps, and lists of specimens examined are provided. Six species in Physocarpus are recognized. A lectotype is here designated for the following species: Opulaster pubescens, Opulaster ramaleyi, Spiraea opulifolia var. parvifolia, Spiraea opulifolia var. tomentella, Physocarpus michiganensis, and Physocarpus missouriensis.

A Study on Management of Personal Archives : How to Make My Archive (개인기록 관리 방안 연구 '나의 아카이브(My Archive)' 만들기)

  • Choe, Yu Ri;Yim, Jin Hee
    • The Korean Journal of Archival Studies
    • /
    • no.47
    • /
    • pp.5-49
    • /
    • 2016
  • Compared with public archives, personal archives are likely to disappear if creators don't preserve and manage them. So personal archives must be managed by oneself. But it's difficult to manage their archives systematically for people who don't have the expertise in archival science. Besides, there are not enough available informations. So this thesis suggests how to manage personal archives by two steps. First step is figuring out one's own archives through analyzing one's life by top-down approach and organizing them into collection. Second step is conducting archival appraisal by three steps and establishing classification schemes, describing them. Especially, this study adduce description elements using ISAD(G) for personal archives. this study also recommends using blogs on portal to manage one's archives easily. But they don't have the audit train and exporting function. So this thesis emphasizes the necessity of 'customized archive blogs'. At conclusion, this study highlights the necessity of developing education programs and manuals for people who are trying to manage one's own archives.

Multi-Vector Document Embedding Using Semantic Decomposition of Complex Documents (복합 문서의 의미적 분해를 통한 다중 벡터 문서 임베딩 방법론)

  • Park, Jongin;Kim, Namgyu
    • Journal of Intelligence and Information Systems
    • /
    • v.25 no.3
    • /
    • pp.19-41
    • /
    • 2019
  • According to the rapidly increasing demand for text data analysis, research and investment in text mining are being actively conducted not only in academia but also in various industries. Text mining is generally conducted in two steps. In the first step, the text of the collected document is tokenized and structured to convert the original document into a computer-readable form. In the second step, tasks such as document classification, clustering, and topic modeling are conducted according to the purpose of analysis. Until recently, text mining-related studies have been focused on the application of the second steps, such as document classification, clustering, and topic modeling. However, with the discovery that the text structuring process substantially influences the quality of the analysis results, various embedding methods have actively been studied to improve the quality of analysis results by preserving the meaning of words and documents in the process of representing text data as vectors. Unlike structured data, which can be directly applied to a variety of operations and traditional analysis techniques, Unstructured text should be preceded by a structuring task that transforms the original document into a form that the computer can understand before analysis. It is called "Embedding" that arbitrary objects are mapped to a specific dimension space while maintaining algebraic properties for structuring the text data. Recently, attempts have been made to embed not only words but also sentences, paragraphs, and entire documents in various aspects. Particularly, with the demand for analysis of document embedding increases rapidly, many algorithms have been developed to support it. Among them, doc2Vec which extends word2Vec and embeds each document into one vector is most widely used. However, the traditional document embedding method represented by doc2Vec generates a vector for each document using the whole corpus included in the document. This causes a limit that the document vector is affected by not only core words but also miscellaneous words. Additionally, the traditional document embedding schemes usually map each document into a single corresponding vector. Therefore, it is difficult to represent a complex document with multiple subjects into a single vector accurately using the traditional approach. In this paper, we propose a new multi-vector document embedding method to overcome these limitations of the traditional document embedding methods. This study targets documents that explicitly separate body content and keywords. In the case of a document without keywords, this method can be applied after extract keywords through various analysis methods. However, since this is not the core subject of the proposed method, we introduce the process of applying the proposed method to documents that predefine keywords in the text. The proposed method consists of (1) Parsing, (2) Word Embedding, (3) Keyword Vector Extraction, (4) Keyword Clustering, and (5) Multiple-Vector Generation. The specific process is as follows. all text in a document is tokenized and each token is represented as a vector having N-dimensional real value through word embedding. After that, to overcome the limitations of the traditional document embedding method that is affected by not only the core word but also the miscellaneous words, vectors corresponding to the keywords of each document are extracted and make up sets of keyword vector for each document. Next, clustering is conducted on a set of keywords for each document to identify multiple subjects included in the document. Finally, a Multi-vector is generated from vectors of keywords constituting each cluster. The experiments for 3.147 academic papers revealed that the single vector-based traditional approach cannot properly map complex documents because of interference among subjects in each vector. With the proposed multi-vector based method, we ascertained that complex documents can be vectorized more accurately by eliminating the interference among subjects.

An Implementation of OTB Extension to Produce TOA and TOC Reflectance of LANDSAT-8 OLI Images and Its Product Verification Using RadCalNet RVUS Data (Landsat-8 OLI 영상정보의 대기 및 지표반사도 산출을 위한 OTB Extension 구현과 RadCalNet RVUS 자료를 이용한 성과검증)

  • Kim, Kwangseob;Lee, Kiwon
    • Korean Journal of Remote Sensing
    • /
    • v.37 no.3
    • /
    • pp.449-461
    • /
    • 2021
  • Analysis Ready Data (ARD) for optical satellite images represents a pre-processed product by applying spectral characteristics and viewing parameters for each sensor. The atmospheric correction is one of the fundamental and complicated topics, which helps to produce Top-of-Atmosphere (TOA) and Top-of-Canopy (TOC) reflectance from multi-spectral image sets. Most remote sensing software provides algorithms or processing schemes dedicated to those corrections of the Landsat-8 OLI sensors. Furthermore, Google Earth Engine (GEE), provides direct access to Landsat reflectance products, USGS-based ARD (USGS-ARD), on the cloud environment. We implemented the Orfeo ToolBox (OTB) atmospheric correction extension, an open-source remote sensing software for manipulating and analyzing high-resolution satellite images. This is the first tool because OTB has not provided calibration modules for any Landsat sensors. Using this extension software, we conducted the absolute atmospheric correction on the Landsat-8 OLI images of Railroad Valley, United States (RVUS) to validate their reflectance products using reflectance data sets of RVUS in the RadCalNet portal. The results showed that the reflectance products using the OTB extension for Landsat revealed a difference by less than 5% compared to RadCalNet RVUS data. In addition, we performed a comparative analysis with reflectance products obtained from other open-source tools such as a QGIS semi-automatic classification plugin and SAGA, besides USGS-ARD products. The reflectance products by the OTB extension showed a high consistency to those of USGS-ARD within the acceptable level in the measurement data range of the RadCalNet RVUS, compared to those of the other two open-source tools. In this study, the verification of the atmospheric calibration processor in OTB extension was carried out, and it proved the application possibility for other satellite sensors in the Compact Advanced Satellite (CAS)-500 or new optical satellites.

Function of the Korean String Indexing System for the Subject Catalog (주제목록을 위한 한국용어열색인 시스템의 기능)

  • Yoon Kooho
    • Journal of the Korean Society for Library and Information Science
    • /
    • v.15
    • /
    • pp.225-266
    • /
    • 1988
  • Various theories and techniques for the subject catalog have been developed since Charles Ammi Cutter first tried to formulate rules for the construction of subject headings in 1876. However, they do not seem to be appropriate to Korean language because the syntax and semantics of Korean language are different from those of English and other European languages. This study therefore attempts to develop a new Korean subject indexing system, namely Korean String Indexing System(KOSIS), in order to increase the use of subject catalogs. For this purpose, advantages and disadvantages between the classed subject catalog nd the alphabetical subject catalog, which are typical subject ca-alogs in libraries, are investigated, and most of remarkable subject indexing systems, in particular the PRECIS developed by the British National Bibliography, are reviewed and analysed. KOSIS is a string indexing based on purely the syntax and semantics of Korean language, even though considerable principles of PRECIS are applied to it. The outlines of KOSIS are as follows: 1) KOSIS is based on the fundamentals of natural language and an ingenious conjunction of human indexing skills and computer capabilities. 2) KOSIS is. 3 string indexing based on the 'principle of context-dependency.' A string of terms organized accoding to his principle shows remarkable affinity with certain patterns of words in ordinary discourse. From that point onward, natural language rather than classificatory terms become the basic model for indexing schemes. 3) KOSIS uses 24 role operators. One or more operators should be allocated to the index string, which is organized manually by the indexer's intellectual work, in order to establish the most explicit syntactic relationship of index terms. 4) Traditionally, a single -line entry format is used in which a subject heading or index entry is presented as a single sequence of words, consisting of the entry terms, plus, in some cases, an extra qualifying term or phrase. But KOSIS employs a two-line entry format which contains three basic positions for the production of index entries. The 'lead' serves as the user's access point, the 'display' contains those terms which are themselves context dependent on the lead, 'qualifier' sets the lead term into its wider context. 5) Each of the KOSIS entries is co-extensive with the initial subject statement prepared by the indexer, since it displays all the subject specificities. Compound terms are always presented in their natural language order. Inverted headings are not produced in KOSIS. Consequently, the precision ratio of information retrieval can be increased. 6) KOSIS uses 5 relational codes for the system of references among semantically related terms. Semantically related terms are handled by a different set of routines, leading to the production of 'See' and 'See also' references. 7) KOSIS was riginally developed for a classified catalog system which requires a subject index, that is an index -which 'trans-lates' subject index, that is, an index which 'translates' subjects expressed in natural language into the appropriate classification numbers. However, KOSIS can also be us d for a dictionary catalog system. Accordingly, KOSIS strings can be manipulated to produce either appropriate subject indexes for a classified catalog system, or acceptable subject headings for a dictionary catalog system. 8) KOSIS is able to maintain a constistency of index entries and cross references by means of a routine identification of the established index strings and reference system. For this purpose, an individual Subject Indicator Number and Reference Indicator Number is allocated to each new index strings and new index terms, respectively. can produce all the index entries, cross references, and authority cards by means of either manual or mechanical methods. Thus, detailed algorithms for the machine-production of various outputs are provided for the institutions which can use computer facilities.

  • PDF

Net Primary Production Changes over Korea and Climate Factors (위성영상으로 분석한 장기간 남한지역 순 일차생산량 변화: 기후인자의 영향)

  • Hong, Ji-Youn;Shim, Chang-Sub;Lee, Moung-Jin;Baek, Gyoung-Hye;Song, Won-Kyong;Jeon, Seong-Woo;Park, Yong-Ha
    • Korean Journal of Remote Sensing
    • /
    • v.27 no.4
    • /
    • pp.467-480
    • /
    • 2011
  • Spatial and temporal variabilities of NPP(Net Primary Production) retrieved from two satellite instruments, AVHRR(Advanced Very High Resolution Radiometer, 1981-2000) and MODIS(MODerate-resolution Imaging Spectroradiometer, 2000-2006), were investigated. The range of mean NPP from A VHRR and MODIS were estimated to be 894-1068 $g{\cdot}C/m^2$/yr and 610-694.90 $g{\cdot}C/m^2$/yr, respectively. The discrepancy of NPP between the two instruments is about 325 $g{\cdot}C/m^2$/yr, and MODIS product is generally closer to the ground measurement than AVHRR despite the limitation in direct comparison such as spatial resolution and vegetation classification. The higher NPP values over South Korea are related to the regions with higher biomass (e.g., mountains) and higher annual temperature. The interannual NPP trends from the two satellite products were computed, and both mean annual trends show continuous NPP increase; 2.14 $g{\cdot}C/m^2$/yr from AVHRR(1981-2000) and 6.08 $g{\cdot}C/m^2$/yr from MODIS (2000-2006) over South Korea. Specifically, the higher increasing trends over the Southwestern region are likely due to the increasing productivity of crop fields from sufficient irrigation and fertilizer use. The retrieved NPP shows a closer relationship between monthly temperature and precipitation, which results in maximum correlation during summer monsoons. The difference in the detection wavelength and model schemes during the retrieval can make a significant difference in the satellite products, and a better accuracy in the meterological and land use data and modeling applications will be necessary to improve the satellite-based NPP data.

Ecoclimatic Map over North-East Asia Using SPOT/VEGETATION 10-day Synthesis Data (SPOT/VEGETATION NDVI 자료를 이용한 동북아시아의 생태기후지도)

  • Park Youn-Young;Han Kyung-Soo
    • Korean Journal of Agricultural and Forest Meteorology
    • /
    • v.8 no.2
    • /
    • pp.86-96
    • /
    • 2006
  • Ecoclimap-1, a new complete surface parameter global database at a 1-km resolution, was previously presented. It is intended to be used to initialize the soil-vegetation- atmosphere transfer schemes in meteorological and climate models. Surface parameters in the Ecoclimap-1 database are provided in the form of a per-class value by an ecoclimatic base map from a simple merging of land cover and climate maps. The principal objective of this ecoclimatic map is to consider intra-class variability of life cycle that the usual land cover map cannot describe. Although the ecoclimatic map considering land cover and climate is used, the intra-class variability was still too high inside some classes. In this study, a new strategy is defined; the idea is to use the information contained in S10 NDVI SPOT/VEGETATION profiles to split a land cover into more homogeneous sub-classes. This utilizes an intra-class unsupervised sub-clustering methodology instead of simple merging. This study was performed to provide a new ecolimatic map over Northeast Asia in the framework of Ecoclimap-2 global database construction for surface parameters. We used the University of Maryland's 1km Global Land Cover Database (UMD) and a climate map to determine the initial number of clusters for intra-class sub-clustering. An unsupervised classification process using six years of NDVI profiles allows the discrimination of different behavior for each land cover class. We checked the spatial coherence of the classes and, if necessary, carried out an aggregation step of the clusters having a similar NDVI time series profile. From the mapping system, 29 ecosystems resulted for the study area. In terms of climate-related studies, this new ecosystem map may be useful as a base map to construct an Ecoclimap-2 database and to improve the surface climatology quality in the climate model.

A Study on History and Archetype Technology of Goli-su in Korea (한국 고리수의 역사와 원형기술의 복원 연구)

  • Kim, Young-ran
    • Korean Journal of Heritage: History & Science
    • /
    • v.46 no.2
    • /
    • pp.4-25
    • /
    • 2013
  • Goli-su is the innovative special kind of the embroidery technique, which combines twining and interlacing skill with metal technology and makes the loops woven to each other with a strand. The loops floating on the space of the ground look like floating veins of sculpture and give people the feeling of the openwork. This kind of characteristic has some similarities with the lacework craft of Western Europe in texture and technique style, but it has its own features different from that of Western Europe. It mainly represents the splendid gloss with metallic materials in the Embroidered cloth, such as gold foil or wire. In the 10th century, early days of Goryo, we can see the basic Goli-su structure form of its initial period in the boy motif embroidery purse unearthed from the first level of Octagonal Nine-storied Pagoda of Woljeong-sa. In the Middle period of Joseon, there are several pieces of Goli-su embroidered relic called "Battle Flag of Goryo", which was taken by the Japanese in 1592 and is now in the Japanese temple. This piece is now converted into altar-table covers. In 18~19th century, two pairs of embroidered pillows in Joseon palace were kept intact, whose time and source are very accurate. The frame of the pillows was embroidered with Goli-su veins, and some gold foil papers were inserted into the inside. The triangle motif with silk was embroidered on the pillow. The stitch in the Needle-Looped embroidery is divided into three kinds according to comprehensive classification: 1. Goli-su ; 2. Goli-Kamgi-su ; 3. Goli-Saegim-su. From the 10th century newly establishing stage to the 13th century, Goli-su has appeared variational stitches and employed 2~3 dimensional color schemes gradually. According to the research of this thesis, we can still see this stitch in the embroidery pillow, which proves that Goli-suwas still kept in Korea in the 19th century. And in terms of the research achievement of this thesis, Archetype technology of Goli-su was restored. Han Sang-soo, Important Intangible Cultural Heritage No. 80 and Master of Embroidery already recreated the Korean relics of Goli-su in Joseon Dynasty. The Needle-Looped embriodery is the overall technological result of ancestral outstanding Metal craft, Twining and Interlacing craft, and Embroidery art. We should inherit, create, and seek the new direction in modern multi-dimensional and international industry societyon the basis of these research results. We can inherit the long history of embroidering, weaving, fiber processing, and expand the applications of other craft industries, and develop new advanced additional values of new dress material, fashion technology, ornament craft and artistic design. Thus, other crafts assist each other and broaden the expressive field to pursue more diversified formative beauty and beautify our life abundantly together.

A Study on the Structural Reinforcement of the Modified Caisson Floating Dock (개조된 케이슨 플로팅 도크의 구조 보강에 대한 연구)

  • Kim, Hong-Jo;Seo, Kwang-Cheol;Park, Joo-Shin
    • Journal of the Korean Society of Marine Environment & Safety
    • /
    • v.27 no.1
    • /
    • pp.172-178
    • /
    • 2021
  • In the ship repair market, interest in maintenance and repair is steadily increasing due to the reinforcement of prevention of environmental pollution caused by ships and the reinforcement of safety standards for ship structures. By reflecting this effect, the number of requests for repairs by foreign shipping companies increases to repair shipbuilders in the Southwest Sea. However, because most of the repair shipbuilders in the southwestern area are small and medium-sized companies, it is difficult to lead to the integrated synergy effect of the repair shipbuilding companies. Moreover, the infrastructure is not integrated; hence, using the infrastructure jointly is a challenge, which acts as an obstacle to the activation of the repair shipbuilding industry. Floating docks are indispensable to operating the repair shipbuilding business; in addition, most of them are operated through renovation/repair after importing aging caisson docks from overseas. However, their service life is more than 30 years; additionally, there is no structure inspection standard. Therefore, it is vulnerable to the safety field. In this study, the finite element analysis program of ANSYS was used to evaluate the structural safety of the modified caisson dock and obtain additional structural reinforcement schemes to solve the derived problems. For the floating docks, there are classification regulations; however, concerning structural strength, the regulations are insufficient, and the applicability is inferior. These insufficient evaluation areas were supplemented through a detailed structural FE-analysis. The reinforcement plan was decided by reinforcing the pontoon deck and reinforcement of the side tank, considering the characteristics of the repair shipyard condition. The final plan was selected to reinforce the side wing tank through the structural analysis of the decision; in addition, the actual structure was fabricated to reflect the reinforcement plan. Our results can be used as reference data for improving the structural strength of similar facilities; we believe that the optimal solution can be found quickly if this method is used during renovation/repair.

Term Mapping Methodology between Everyday Words and Legal Terms for Law Information Search System (법령정보 검색을 위한 생활용어와 법률용어 간의 대응관계 탐색 방법론)

  • Kim, Ji Hyun;Lee, Jong-Seo;Lee, Myungjin;Kim, Wooju;Hong, June Seok
    • Journal of Intelligence and Information Systems
    • /
    • v.18 no.3
    • /
    • pp.137-152
    • /
    • 2012
  • In the generation of Web 2.0, as many users start to make lots of web contents called user created contents by themselves, the World Wide Web is overflowing by countless information. Therefore, it becomes the key to find out meaningful information among lots of resources. Nowadays, the information retrieval is the most important thing throughout the whole field and several types of search services are developed and widely used in various fields to retrieve information that user really wants. Especially, the legal information search is one of the indispensable services in order to provide people with their convenience through searching the law necessary to their present situation as a channel getting knowledge about it. The Office of Legislation in Korea provides the Korean Law Information portal service to search the law information such as legislation, administrative rule, and judicial precedent from 2009, so people can conveniently find information related to the law. However, this service has limitation because the recent technology for search engine basically returns documents depending on whether the query is included in it or not as a search result. Therefore, it is really difficult to retrieve information related the law for general users who are not familiar with legal terms in the search engine using simple matching of keywords in spite of those kinds of efforts of the Office of Legislation in Korea, because there is a huge divergence between everyday words and legal terms which are especially from Chinese words. Generally, people try to access the law information using everyday words, so they have a difficulty to get the result that they exactly want. In this paper, we propose a term mapping methodology between everyday words and legal terms for general users who don't have sufficient background about legal terms, and we develop a search service that can provide the search results of law information from everyday words. This will be able to search the law information accurately without the knowledge of legal terminology. In other words, our research goal is to make a law information search system that general users are able to retrieval the law information with everyday words. First, this paper takes advantage of tags of internet blogs using the concept for collective intelligence to find out the term mapping relationship between everyday words and legal terms. In order to achieve our goal, we collect tags related to an everyday word from web blog posts. Generally, people add a non-hierarchical keyword or term like a synonym, especially called tag, in order to describe, classify, and manage their posts when they make any post in the internet blog. Second, the collected tags are clustered through the cluster analysis method, K-means. Then, we find a mapping relationship between an everyday word and a legal term using our estimation measure to select the fittest one that can match with an everyday word. Selected legal terms are given the definite relationship, and the relations between everyday words and legal terms are described using SKOS that is an ontology to describe the knowledge related to thesauri, classification schemes, taxonomies, and subject-heading. Thus, based on proposed mapping and searching methodologies, our legal information search system finds out a legal term mapped with user query and retrieves law information using a matched legal term, if users try to retrieve law information using an everyday word. Therefore, from our research, users can get exact results even if they do not have the knowledge related to legal terms. As a result of our research, we expect that general users who don't have professional legal background can conveniently and efficiently retrieve the legal information using everyday words.