• Title/Summary/Keyword: Science and technology classification

Search Result 1,634, Processing Time 0.027 seconds

Information Extraction and Sentence Classification applied to Clinical Trial MEDLINE Abstracts

  • Hara, Kazuo;Matsumoto, Yuji
    • Proceedings of the Korean Society for Bioinformatics Conference
    • /
    • 2005.09a
    • /
    • pp.85-90
    • /
    • 2005
  • In this paper, firstly we report experimental results on applying information extraction (IE) methodology to the task of summarizing clinical trial design information in focus on ‘Compared Treatment’, ‘Endpoint’ and ‘Patient Population’ from clinical trial MEDLINE abstracts. From these results, we have come to see this problem as one that can be decomposed into a sentence classification subtask and an IE subtask. By classifying sentences from clinical trial abstracts and only performing IE on sentences that are most likely to contain relevant information, we hypothesize that the accuracy of information extracted from the abstracts can be increased. As preparation for testing this theory in the next stage, we conducted an experiment applying state-of-the-art sentence classification techniques to the clinical trial abstracts and evaluated its potential in the original task of the summarization of clinical trial design information.

  • PDF

Classification of Imbalanced Data Based on MTS-CBPSO Method: A Case Study of Financial Distress Prediction

  • Gu, Yuping;Cheng, Longsheng;Chang, Zhipeng
    • Journal of Information Processing Systems
    • /
    • v.15 no.3
    • /
    • pp.682-693
    • /
    • 2019
  • The traditional classification methods mostly assume that the data for class distribution is balanced, while imbalanced data is widely found in the real world. So it is important to solve the problem of classification with imbalanced data. In Mahalanobis-Taguchi system (MTS) algorithm, data classification model is constructed with the reference space and measurement reference scale which is come from a single normal group, and thus it is suitable to handle the imbalanced data problem. In this paper, an improved method of MTS-CBPSO is constructed by introducing the chaotic mapping and binary particle swarm optimization algorithm instead of orthogonal array and signal-to-noise ratio (SNR) to select the valid variables, in which G-means, F-measure, dimensionality reduction are regarded as the classification optimization target. This proposed method is also applied to the financial distress prediction of Chinese listed companies. Compared with the traditional MTS and the common classification methods such as SVM, C4.5, k-NN, it is showed that the MTS-CBPSO method has better result of prediction accuracy and dimensionality reduction.

An Integrated Ontological Approach to Effective Information Management in Science and Technology (과학기술 분야 통합 개념체계의 구축 방안 연구)

  • 정영미;김명옥;이재윤;한승희;유재복
    • Journal of the Korean Society for information Management
    • /
    • v.19 no.1
    • /
    • pp.135-161
    • /
    • 2002
  • This study presents a multilingual integrated ontological approach that enables linking classification systems. thesauri. and terminology databases in science and technology for more effective indexing and information retrieval online. In this integrated system, we designed a thesaurus model with concept as a unit and designated essential data elements for a terminology database on the basis of ISO 12620 standard. The classification system for science and technology adopted in this study provides subject access channels from other existing classification systems through its mapping table. A prototype system was implemented with the field of nuclear energy as an application area.

The Characteristics of Silica Powders Prepared by Spray Pyrolysis Applying Droplet Classification Apparatus (액적 분급 장치를 적용한 분무열분해 공정으로부터 합성된 실리카 분말의 특성)

  • Kang, Yun-Chan;Ju, Seo-Hee;Koo, Hye-Young;Kang, Hee-Sang;Park, Seung-Bin
    • Korean Journal of Materials Research
    • /
    • v.16 no.10
    • /
    • pp.633-638
    • /
    • 2006
  • Silica powders with spherical shape and narrow size distribution were prepared by large-scale ultrasonic spray pyrolysis applying the droplet classification apparatus. On the other hand, silica powders prepared by large-scale ultrasonic spray pyrolysis without droplet classification apparatus had broad size distribution. Droplet classification apparatus used in this paper applied the principles of cyclone and dispersion plate with small holes. The droplets formed from the ultrasonic spray generator applying the droplet classification apparatus had narrow size distribution. The droplets with fine and large sizes were eliminated by droplet classification apparatus. The optimum flow rate of the carrier gas and diameter of the hole of the dispersion plate were studied to reduce the size distribution of the silica powders prepared by large-scale ultrasonic spray pyrolysis. The size distribution of the silica powders prepared by large-scale ultrasonic spray pyrolysis at the optimum preparation conditions was 0.76.

EFTG: Efficient and Flexible Top-K Geo-textual Publish/Subscribe

  • zhu, Hong;Li, Hongbo;Cui, Zongmin;Cao, Zhongsheng;Xie, Meiyi
    • KSII Transactions on Internet and Information Systems (TIIS)
    • /
    • v.12 no.12
    • /
    • pp.5877-5897
    • /
    • 2018
  • With the popularity of mobile networks and smartphones, geo-textual publish/subscribe messaging has attracted wide attention. Different from the traditional publish/subscribe format, geo-textual data is published and subscribed in the form of dynamic data flow in the mobile network. The difference creates more requirements for efficiency and flexibility. However, most of the existing Top-k geo-textual publish/subscribe schemes have the following deficiencies: (1) All publications have to be scored for each subscription, which is not efficient enough. (2) A user should take time to set a threshold for each subscription, which is not flexible enough. Therefore, we propose an efficient and flexible Top-k geo-textual publish/subscribe scheme. First, our scheme groups publish and subscribe based on text classification. Thus, only a few parts of related publications should be scored for each subscription, which significantly enhances efficiency. Second, our scheme proposes an adaptive publish/subscribe matching algorithm. The algorithm does not require the user to set a threshold. It can adaptively return Top-k results to the user for each subscription, which significantly enhances flexibility. Finally, theoretical analysis and experimental evaluation verify the efficiency and effectiveness of our scheme.

The Efficiency of Long Short-Term Memory (LSTM) in Phenology-Based Crop Classification

  • Ehsan Rahimi;Chuleui Jung
    • Korean Journal of Remote Sensing
    • /
    • v.40 no.1
    • /
    • pp.57-69
    • /
    • 2024
  • Crop classification plays a vitalrole in monitoring agricultural landscapes and enhancing food production. In this study, we explore the effectiveness of Long Short-Term Memory (LSTM) models for crop classification, focusing on distinguishing between apple and rice crops. The aim wasto overcome the challenges associatedwith finding phenology-based classification thresholds by utilizing LSTM to capture the entire Normalized Difference Vegetation Index (NDVI)trend. Our methodology involvestraining the LSTM model using a reference site and applying it to three separate three test sites. Firstly, we generated 25 NDVI imagesfrom the Sentinel-2A data. Aftersegmenting study areas, we calculated the mean NDVI values for each segment. For the reference area, employed a training approach utilizing the NDVI trend line. This trend line served as the basis for training our crop classification model. Following the training phase, we applied the trained model to three separate test sites. The results demonstrated a high overall accuracy of 0.92 and a kappa coefficient of 0.85 for the reference site. The overall accuracies for the test sites were also favorable, ranging from 0.88 to 0.92, indicating successful classification outcomes. We also found that certain phenological metrics can be less effective in crop classification therefore limitations of relying solely on phenological map thresholds and emphasizes the challenges in detecting phenology in real-time, particularly in the early stages of crops. Our study demonstrates the potential of LSTM models in crop classification tasks, showcasing their ability to capture temporal dependencies and analyze timeseriesremote sensing data.While limitations exist in capturing specific phenological events, the integration of alternative approaches holds promise for enhancing classification accuracy. By leveraging advanced techniques and considering the specific challenges of agricultural landscapes, we can continue to refine crop classification models and support agricultural management practices.

Alsat-2B/Sentinel-2 Imagery Classification Using the Hybrid Pigeon Inspired Optimization Algorithm

  • Arezki, Dounia;Fizazi, Hadria
    • Journal of Information Processing Systems
    • /
    • v.17 no.4
    • /
    • pp.690-706
    • /
    • 2021
  • Classification is a substantial operation in data mining, and each element is distributed taking into account its feature values in the corresponding class. Metaheuristics have been widely used in attempts to solve satellite image classification problems. This article proposes a hybrid approach, the flower pigeons-inspired optimization algorithm (FPIO), and the local search method of the flower pollination algorithm is integrated into the pigeon-inspired algorithm. The efficiency and power of the proposed FPIO approach are displayed with a series of images, supported by computational results that demonstrate the cogency of the proposed classification method on satellite imagery. For this work, the Davies-Bouldin Index is used as an objective function. FPIO is applied to different types of images (synthetic, Alsat-2B, and Sentinel-2). Moreover, a comparative experiment between FPIO and the genetic algorithm genetic algorithm is conducted. Experimental results showed that GA outperformed FPIO in matters of time computing. However, FPIO provided better quality results with less confusion. The overall experimental results demonstrate that the proposed approach is an efficient method for satellite imagery classification.

A New Model for Connecting the Classification Systems of Knowledge Activities - Linking Research-Technology-Industry and Research-Major-Job - (지식활동의 관계식별을 위한 연계형 분류체계에 관한 연구 - 연구-기술-산업과 연구-전공-취업 연계 -)

  • Seol, Sung-Soo;Song, Choong-Han;Nho, Hwan-Jin
    • Journal of Korea Technology Innovation Society
    • /
    • v.10 no.3
    • /
    • pp.531-554
    • /
    • 2007
  • This paper suggests a new model connecting various knowledge activities through classification systems such as classifications of research, technology, industry, major and job. Although research activities are linked to technology and industry areas or to education and job areas, there is no effort to link these kinds of activities. There are a few studies to link research and technology or research and education respectively. But, there have been no studies to connect technology-industry linkage and education-job linkage. This paper suggests that research area can be a basis of link between technology-industry linkage and education-job linkage. The methods building the links are not simple, but easy; 1) setting up new science/research classification system having two dimensions of research and application, 2) building electronic systems and databases allowing fields for several classification systems, and 3) making rules using multi-dimensional classification systems following the purpose of the programs. The model is designed to meet the needs of nationwide R&D and human resources policies, and for the preparation of knowledge society to grasp the relationship between sequential activities using knowledge. If we know the interactive relationships between various areas, we can trace related phenomena in different activities with restricted information.

  • PDF

Semi-Supervised SAR Image Classification via Adaptive Threshold Selection (선별적인 임계값 선택을 이용한 준지도 학습의 SAR 분류 기술)

  • Jaejun Do;Minjung Yoo;Jaeseok Lee;Hyoi Moon;Sunok Kim
    • Journal of the Korea Institute of Military Science and Technology
    • /
    • v.27 no.3
    • /
    • pp.319-328
    • /
    • 2024
  • Semi-supervised learning is a good way to train a classification model using a small number of labeled and large number of unlabeled data. We applied semi-supervised learning to a synthetic aperture radar(SAR) image classification model with a limited number of datasets that are difficult to create. To address the previous difficulties, semi-supervised learning uses a model trained with a small amount of labeled data to generate and learn pseudo labels. Besides, a lot of number of papers use a single fixed threshold to create pseudo labels. In this paper, we present a semi-supervised synthetic aperture radar(SAR) image classification method that applies different thresholds for each class instead of all classes sharing a fixed threshold to improve SAR classification performance with a small number of labeled datasets.

Visualization and classification of hidden defects in triplex composites used in LNG carriers by active thermography

  • Hwang, Soonkyu;Jeon, Ikgeun;Han, Gayoung;Sohn, Hoon;Yun, Wonjun
    • Smart Structures and Systems
    • /
    • v.24 no.6
    • /
    • pp.803-812
    • /
    • 2019
  • Triplex composite is an epoxy-bonded joint structure, which constitutes the secondary barrier in a liquefied natural gas (LNG) carrier. Defects in the triplex composite weaken its shear strength and may cause leakage of the LNG, thus compromising the structural integrity of the LNG carrier. This paper proposes an autonomous triplex composite inspection (ATCI) system for visualizing and classifying hidden defects in the triplex composite installed inside an LNG carrier. First, heat energy is generated on the surface of the triplex composite using halogen lamps, and the corresponding heat response is measured by an infrared (IR) camera. Next, the region of interest (ROI) is traced and noise components are removed to minimize false indications of defects. After a defect is identified, it is classified as internal void or uncured adhesive and its size and shape are quantified and visualized, respectively. The proposed ATCI system allows the fully automated and contactless detection, classification, and quantification of hidden defects inside the triplex composite. The effectiveness of the proposed ATCI system is validated using the data obtained from actual triplex composite installed in an LNG carrier membrane system.