• Title/Summary/Keyword: data sets

Search Result 3,771, Processing Time 0.029 seconds

INVESTIGATION OF BAIKDU-SAN VOLCANO WITH SPACE-BORNE SAR SYSTEM

  • Kim, Duk-Jin;Feng, Lanying;Moon, Wooil-M.
    • Proceedings of the KSRS Conference
    • /
    • 1999.11a
    • /
    • pp.148-153
    • /
    • 1999
  • Baikdu-san was a very active volcano during the Cenozoic era and is believed to be formed in late Cenozoic era. Recently it was also reported that there was a major eruption in or around 1002 A.D. and there are evidences which indicate that it is still an active volcano and a potential volcanic hazard. Remote sensing techniques have been widely used to monitor various natural hazards, including volcanic hazards. However, during an active volcanic eruption, volcanic ash can basically cover the sky and often blocks the solar radiation preventing any use of optical sensors. Synthetic aperture radar(SAR) is an ideal tool to monitor the volcanic activities and lava flows, because the wavelength of the microwave signal is considerably longer that the average volcanic ash particle size. In this study we have utilized several sets of SAR data to evaluate the utility of the space-borne SAR system. The data sets include JERS-1(L-band) SAR, and RADARSAT(C-band) data which included both standard mode and the ScanSAR mode data sets. We also utilized several sets of auxiliary data such as local geological maps and JERS-1 OPS data. The routine preprocessing and image processing steps were applied to these data sets before any attempts of classifying and mapping surface geological features. Although we computed sigma nought ($\sigma$$^{0}$) values far the standard mode RADARSAT data, the utility of sigma nought image was minimal in this study. Application of various types of classification algorithms to identify and map several stages of volcanic flows was not very successful. Although this research is still in progress, the following preliminary conclusions could be made: (1) sigma nought (RADARSAT standard mode data) and DN (JERS-1 SAR and RADARSAT ScanSAR data) have limited usefulness for distinguishing early basalt lava flows from late trachyte flows or later trachyte flows from the old basement granitic rocks around Baikdu-san volcano, (2) surface geological structure features such as several faults and volcanic lava flow channels can easily be identified and mapped, and (3) routine application of unsupervised classification methods cannot be used for mapping any types of surface lava flow patterns.

  • PDF

DNA Sequence Classification Using a Generalized Regression Neural Network and Random Generator (난수발생기와 일반화된 회귀 신경망을 이용한 DNA 서열 분류)

  • 김성모;김근호;김병환
    • The Transactions of the Korean Institute of Electrical Engineers D
    • /
    • v.53 no.7
    • /
    • pp.525-530
    • /
    • 2004
  • A classifier was constructed by using a generalized regression neural network (GRU) and random generator (RG), which was applied to classify DNA sequences. Three data sets evaluated are eukaryotic and prokaryotic sequences (Data-I), eukaryotic sequences (Data-II), and prokaryotic sequences (Data-III). For each data set, the classifier performance was examined in terms of the total classification sensitivity (TCS), individual classification sensitivity (ICS), total prediction accuracy (TPA), and individual prediction accuracy (IPA). For a given spread, the RG played a role of generating a number of sets of spreads for gaussian functions in the pattern layer Compared to the GRNN, the RG-GRNN significantly improved the TCS by more than 50%, 60%, and 40% for Data-I, Data-II, and Data-III, respectively. The RG-GRNN also demonstrated improved TPA for all data types. In conclusion, the proposed RG-GRNN can effectively be used to classify a large, multivariable promoter sequences.

COMPARISON OF LINEAR AND NON-LINEAR NIR CALIBRATION METHODS USING LARGE FORAGE DATABASES

  • Berzaghi, Paolo;Flinn, Peter C.;Dardenne, Pierre;Lagerholm, Martin;Shenk, John S.;Westerhaus, Mark O.;Cowe, Ian A.
    • Proceedings of the Korean Society of Near Infrared Spectroscopy Conference
    • /
    • 2001.06a
    • /
    • pp.1141-1141
    • /
    • 2001
  • The aim of the study was to evaluate the performance of 3 calibration methods, modified partial least squares (MPLS), local PLS (LOCAL) and artificial neural network (ANN) on the prediction of chemical composition of forages, using a large NIR database. The study used forage samples (n=25,977) from Australia, Europe (Belgium, Germany, Italy and Sweden) and North America (Canada and U.S.A) with information relative to moisture, crude protein and neutral detergent fibre content. The spectra of the samples were collected with 10 different Foss NIR Systems instruments, which were either standardized or not standardized to one master instrument. The spectra were trimmed to a wavelength range between 1100 and 2498 nm. Two data sets, one standardized (IVAL) and the other not standardized (SVAL) were used as independent validation sets, but 10% of both sets were omitted and kept for later expansion of the calibration database. The remaining samples were combined into one database (n=21,696), which was split into 75% calibration (CALBASE) and 25% validation (VALBASE). The chemical components in the 3 validation data sets were predicted with each model derived from CALBASE using the calibration database before and after it was expanded with 10% of the samples from IVAL and SVAL data sets. Calibration performance was evaluated using standard error of prediction corrected for bias (SEP(C)), bias, slope and R2. None of the models appeared to be consistently better across all validation sets. VALBASE was predicted well by all models, with smaller SEP(C) and bias values than for IVAL and SVAL. This was not surprising as VALBASE was selected from the calibration database and it had a sample population similar to CALBASE, whereas IVAL and SVAL were completely independent validation sets. In most cases, Local and ANN models, but not modified PLS, showed considerable improvement in the prediction of IVAL and SVAL after the calibration database had been expanded with the 10% samples of IVAL and SVAL reserved for calibration expansion. The effects of sample processing, instrument standardization and differences in reference procedure were partially confounded in the validation sets, so it was not possible to determine which factors were most important. Further work on the development of large databases must address the problems of standardization of instruments, harmonization and standardization of laboratory procedures and even more importantly, the definition of the database population.

  • PDF

Mean Meridional Circulation-Eddy Interaction in Three Reanalysis Data Sets during the Boreal Winter (세 가지 재분석 자료에서의 겨울철 북반구 평균 자오면 순환-에디 상호작용)

  • Moon, Hyejin;Ha, Kyung-Ja
    • Atmosphere
    • /
    • v.25 no.3
    • /
    • pp.543-557
    • /
    • 2015
  • The present study examines an interaction between the eddy and mean meridional circulation (MMC) comparing the results in three reanalysis data sets including ERA-Interim, NCEP2, and JRA-55 during the boreal winter in the Northern Hemisphere. It is noteworthy that the JRA-55 tends to produce stronger MMC compared to those of others, which is mainly due to the weak eddy flux. ERA-Interim represents the ensemble averages of MMC. The MMC-eddy interaction equation was adopted to investigate the scale interaction of the eddy momentum flux (EMF), eddy heat flux (EHF), and diabatic heating (DHT) with MMC. The EMF (EHF) shows a significant correlation coefficient with streamfunction under (above) 200 hPa-level. The perturbation (time mean) part of each eddy is dominant compared to another part in the EMF (EHF). The DHT is strongly interacted with streamfunction in the region between the equator and extra-tropical latitude over whole vertical column. Thus, the dominant term in each significant region modulates interannual variability of MMC. The inverse (proportional) relationship between MMC and pressure (meridional) derivative of the momentum (heat) divergence contributions is well represented in the three reanalysis data sets. The region modulated interannual variability of MMC by both EMF and DHT (EHF) is similar in ERA-Interim and JRA-55 (ERA-Interim and NCEP2). JRA-55 shows a lack of significant region of EHF due to the high resolution, compared to other data sets.

Data Fusion of Mineral Exploration Data Sets and Its Application Using Fuzzy Set Theory (광물자원탐사 자료에 대한 데이터 통합과 그 응용사례)

  • Sungwon Choi
    • Economic and Environmental Geology
    • /
    • v.32 no.5
    • /
    • pp.537-544
    • /
    • 1999
  • In mineral exploration, there are many data sets which need to be created, processed and analyzed in order to discover a favorable mineralized zone. Recently, with Geographic Information System (GIS), such exploration data sets have been able to be systematically stored and effectively processed using computer technologies. In this study, most exploration data sets were first digitized and then rasterized. Furthermore, they were integrated together by using fuzzy set theory to provide a possibility map toward a target hypothesis. Our target hypothesis is "there is a skarn magnetite deposit in this study" and all fuzzy membership functions were made with respect to the target hypothesis. Test area is extended from 37:00N/l28:30E to 37:20N/I28:45E, approximately 20 km by 40 km. This area is a part of Taebaeksan mineralized areas, where the Sinyemi mine, a skarn magnetite deposit, is located. In final resultant map, high potential or possibility area coincides with the location of the Shinyemi mine. In this regard, we conclude the fuzzy set theory can be effectively applied to this study and provides an excellent example to define potential area for further mineral exploration.

  • PDF

Middle School Students' Statistical Inference Engaged in Comparing Data Sets (자료집합 비교 활동에서 나타나는 중학교 학생들의 통계적 추리(statistical inference)에 대한 연구)

  • Park, Min-Sun;Park, Mi-Mi;Lee, Kyeong-Hwa;Ko, Eun-Sung
    • School Mathematics
    • /
    • v.13 no.4
    • /
    • pp.599-614
    • /
    • 2011
  • According to prior research studies, comparison of two data sets promote informal and formal statistical reasoning, which may mediate descriptive and inferential statistics. However, there has been relatively little attention given to the mediation of both descriptive and inferential statistics. We attempted to identify which statistical concepts or factors students used and how they applied concepts or factors to make decisions when they compared data sets. We also investigated the characteristics and changes of the view of concepts and factors. As a result, we identified that students paid attention to data value, center, spread, and sample, which are important factors of inferential statistics. Students' understanding of each factors were sometimes appropriate for inferential statistics, but sometimes not. From the results, we suggest instructional ideas for a task which can connect descriptive and inferential statistics.

  • PDF

Accuracy Comparison of TOA and TOC Reflectance Products of KOMPSAT-3, WorldView-2 and Pléiades-1A Image Sets Using RadCalNet BTCN and BSCN Data

  • Kim, Kwangseob;Lee, Kiwon
    • Korean Journal of Remote Sensing
    • /
    • v.38 no.1
    • /
    • pp.21-32
    • /
    • 2022
  • The importance of the classical theme of how the Top-of-Atmosphere (TOA) and Top-of-Canopy (TOC) reflectance of high-resolution satellite images match the actual atmospheric reflectance and surface reflectance has been emphasized. Based on the Radiometric Calibration Network (RadCalNet) BTCN and BSCN data, this study compared the accuracy of TOA and TOC reflectance products of the currently available optical satellites, including KOMPSAT-3, WorldView-2, and Pléiades-1A image sets calculated using the absolute atmospheric correction function of the Orfeo Toolbox (OTB) tool. The comparison experiment used data in 2018 and 2019, and the Landsat-8 image sets from the same period were applied together. The experiment results showed that the product of TOA and TOC reflectance obtained from the three sets of images were highly consistent with RadCalNet data. It implies that any imagery may be applied when high-resolution reflectance products are required for a certain application. Meanwhile, the processed results of the OTB tool and those by the Apparent Reflection method of another tool for WorldView-2 images were nearly identical. However, in some cases, the reflectance products of Landsat-8 images provided by USGS sometimes showed relatively low consistency than those computed by the OTB tool, with the reference of RadCalNet BTCN and BSCN data. Continuous experiments on active vegetation areas in addition to the RadCalNet sites are necessary to obtain generalized results.

Bootstrap Method for k-Spatial Medians

  • Jhun, Myoung-Shic
    • Journal of the Korean Statistical Society
    • /
    • v.15 no.1
    • /
    • pp.1-8
    • /
    • 1986
  • The k-medians clustering method is considered to partition observations into k clusters. Consistency and advantage of bootstrap confidence sets of k optimal cluster centers are discussed. The k-medians and k-means clustering methods are compared by using actual data sets.

  • PDF

Canonical Correlation Biplot

  • Park, Mi-Ra;Huh, Myung-Hoe
    • Communications for Statistical Applications and Methods
    • /
    • v.3 no.1
    • /
    • pp.11-19
    • /
    • 1996
  • Canonical correlation analysis is a multivariate technique for identifying and quantifying the statistical relationship between two sets of variables. Like most multivariate techniques, the main objective of canonical correlation analysis is to reduce the dimensionality of the dataset. It would be particularly useful if high dimensional data can be represented in a low dimensional space. In this study, we will construct statistical graphs for paired sets of multivariate data. Specifically, plots of the observations as well as the variables are proposed. We discuss the geometric interpretation and goodness-of-fit of the proposed plots. We also provide a numerical example.

  • PDF

A Comparative Study on Evaluation of Response spectrum accounting for Soil Types (지반 종류별 응답스펙트럼 평가에 대한 비교 연구)

  • 김선우;한상환
    • Proceedings of the Computational Structural Engineering Institute Conference
    • /
    • 2001.04a
    • /
    • pp.433-438
    • /
    • 2001
  • The response spectrum has been widely used to differentiate the significant characteristics of earthquake ground motion and to evaluate the response of structures under ground shaking. Current design response spectrum is based on Seed, Ugas, and Lysmer's study. (1976) In this study, earthquake ground motion data sets adopted by Seed, Miranda, and Riddell is analyzed regards to soil types. And how earthquake data sets effected the design response spectrum is evaluated using acceleration-displacement response spectrum.

  • PDF