• Title/Summary/Keyword: data sets

Search Result 3,771, Processing Time 0.03 seconds

Design of the Integrated Incomplete Information Processing System based on Rough Set

  • Jeong, Gu-Beom;Chung, Hwan-Mook;Kim, Guk-Boh;Park, Kyung-Ok
    • Journal of the Korean Institute of Intelligent Systems
    • /
    • v.11 no.5
    • /
    • pp.441-447
    • /
    • 2001
  • In general, Rough Set theory is used for classification, inference, and decision analysis of incomplete data by using approximation space concepts in information system. Information system can include quantitative attribute values which have interval characteristics, or incomplete data such as multiple or unknown(missing) data. These incomplete data cause tole inconsistency in information system and decrease the classification ability in system using Rough Sets. In this paper, we present various types of incomplete data which may occur in information system and propose INcomplete information Processing System(INiPS) which converts incomplete information system into complete information system in using Rough Sets.

  • PDF

Optimization of parameters in segmentation of large-scale spatial data sets (대용량 공간 자료들의 세그먼테이션에서의 모수들의 최적화)

  • Oh, Mi-Ra;Lee, Hyun-Ju
    • Proceedings of the IEEK Conference
    • /
    • 2008.06a
    • /
    • pp.897-898
    • /
    • 2008
  • Array comparative genomic hybridization (aCGH) has been used to detect chromosomal regions of amplifications or deletions, which allows identification of new cancer related genes. As aCGH, a large-scale spatial data, contains significant amount of noises in its raw data, it has been an important research issue to segment genomic DNA regions to detect its true underlying copy number aberrations (CNAs). In this study, we focus on applying a segmentation method to multiple data sets. We compare two different threshold values for analyzing aCGH data with CBS method [1]. The proposed threshold values are p-value or $Q{\pm}1.5IQR$ and $Q{\pm}1.5IQR$.

  • PDF

Regional Scale Satellite Data Sets for Agricultural, Hydrological and Environmental Applications in Zambia

  • Ngoma, Solomon
    • Proceedings of The Korean Society of Agricultural and Forest Meteorology Conference
    • /
    • 2001.06a
    • /
    • pp.43-48
    • /
    • 2001
  • Many applications in the areas of agricultural, hydrological and environmental resource management require data over very large areas and with a high imaging frequency - monitoring crop growth, water stress, seasonal wetland flooding and natural vegetation development. This precludes the use of fine resolution data (Landsat, Spot) on the grounds of cost, accessibility and low imaging frequency. Meteorological satellites have the potential to fill this need, given their very wide spatial coverage, and high repeat imaging. The Remote Sensing Unit (RSU) at the Zambia Meteorological Department routinely receives, processes and archives imagery from both Meteosat and NOAA AVHRR satellites. Here I wish to present some examples of applications of these data sets that arise from the RSU work - relationships between rainfall and vegetation development as assessed by satellite, derived information and seasonal patterns of flooding in the Barotse floodplain and the Kafue flats. I also wish to outline ways in which a more widespread use of this data by the Zambian institutions canbe achieved.

  • PDF

Short-term Flood Forecasting Using Artificial Neural Networks (인공신경망 이론을 이용한 단기 홍수량 예측)

  • 강문성;박승우
    • Magazine of the Korean Society of Agricultural Engineers
    • /
    • v.45 no.2
    • /
    • pp.45-57
    • /
    • 2003
  • An artificial neural network model was developed to analyze and forecast Short-term river runoff from the Naju watershed, in Korea. Error back propagation neural networks (EBPN) of hourly rainfall and runoff data were found to have a high performance In forecasting runoff. The number of hidden nodes were optimized using total error and Bayesian information criterion. Model forecasts are very accurate (i.e., relative error is less than 3% and $R^2$is greater than 0.99) for calibration and verification data sets. Increasing the time horizon for application data sets, thus mating the model suitable for flood forecasting. decreases the accuracy of the model. The resulting optimal EBPN models for forecasting hourly runoff consists of ten rainfall and four runoff data(ANN0410 model) and ten rainfall and ten runoff data(ANN1010 model). Performances of the ANN0410 and ANN1010 models remain satisfactory up to 6 hours (i.e., $R^2$is greater than 0.92).

Gene Set and Pathway Analysis of Microarray Data (프마이크로어레이 데이터의 유전자 집합 및 대사 경로 분석)

  • Kim Seon-Young
    • KOGO NEWS
    • /
    • v.6 no.1
    • /
    • pp.29-33
    • /
    • 2006
  • Gene set analysis is a new concept and method. to analyze and interpret microarray gene expression data and tries to extract biological meaning from gene expression data at gene set level rather than at gene level. Compared with methods which select a few tens or hundreds of genes before gene ontology and pathway analysis, gene set analysis identifies important gene ontology terms and pathways more consistently and performs well even in gene expression data sets with minimal or moderate gene expression changes. Moreover, gene set analysis is useful for comparing multiple gene expression data sets dealing with similar biological questions. This review briefly summarizes the rationale behind the gene set analysis and introduces several algorithms and tools now available for gene set analysis.

  • PDF

Automatic Change Detection Based on Areal Feature Matching in Different Network Data-sets (이종의 도로망 데이터 셋에서 면 객체 매칭 기반 변화탐지)

  • Kim, Jiyoung;Huh, Yong;Yu, Kiyun;Kim, Jung Ok
    • Journal of the Korean Society of Surveying, Geodesy, Photogrammetry and Cartography
    • /
    • v.31 no.6_1
    • /
    • pp.483-491
    • /
    • 2013
  • By a development of car navigation systems and mobile or positioning technology, it increases interest in location based services, especially pedestrian navigation systems. Updating of digital maps is important because digital maps are mass data and required to short updating cycle. In this paper, we proposed change detection for different network data-sets based on areal feature matching. Prior to change detection, we defined type of updating between different network data-sets. Next, we transformed road lines into areal features(block) that are surrounded by them and calculated a shape similarity between blocks in different data-sets. Blocks that a shape similarity is more than 0.6 are selected candidate block pairs. Secondly, we detected changed-block pairs by bipartite graph clustering or properties of a concave polygon according to types of updating, and calculated Fr$\acute{e}$chet distance between segments within the block or forming it. At this time, road segments of KAIS map that Fr$\acute{e}$chet distance is more than 50 are extracted as updating road features. As a result of accuracy evaluation, a value of detection rate appears high at 0.965. We could thus identify that a proposed method is able to apply to change detection between different network data-sets.

A Study on the Appling National ITS Standardization for Advanced Traffic Information System (통합교통정보체계 추진을 위한 표준안 활용방안 연구)

  • Cho Young-Sung;Lee Sang-Keon;Moon Young-Jun;Jung Hee-Woon
    • The Journal of The Korea Institute of Intelligent Transport Systems
    • /
    • v.2 no.1 s.2
    • /
    • pp.15-24
    • /
    • 2003
  • While public institutions and privite coperations have constructed the data collecting infrastructure and commercialized it to service traffic information, inaccuracy with information, insufficiency with the public sector, overapping investment and the rest are arising from the differences of data treatment, offering method, communication protocal and the like. For these reasons, we need standardization it as a national enterprise. In this paper, we would like to present the unity management and joint method of traffic information based on the present position of system construction and national ITS standardization. At first, we presented the relation between data elements and message sets and then analyzed how message sets have been used to message sets to share traffic information so far in public institutions and private coperations like Korea freeway cooperation-ITS model city-Cheonan-Nonsan expressway, In-cheon international airport expressway-Korea freeway cooperation and Seoul metropolitan police agency-Seoul metropolitan government. As the results of analyzing message sets, it had disclosed that data transmission is impossible or a system is unchangeable because data form and each items to transmission were different from each other and its own address and link IDS were not determined yet. Also it showed these message sets did not abidy by The Draft of National ITS Standards. First of all, we proposed data basic form and elements which were form by items used in each center in common and the elements to manage traffic information suggested by Standards based on the results of analyses.

  • PDF

Modelling the flexural strength of mortars containing different mineral admixtures via GEP and RA

  • Saridemir, Mustafa
    • Computers and Concrete
    • /
    • v.19 no.6
    • /
    • pp.717-724
    • /
    • 2017
  • In this paper, four formulas are proposed via gene expression programming (GEP)-based models and regression analysis (RA) to predict the flexural strength ($f_s$) values of mortars containing different mineral admixtures that are ground granulated blast-furnace slag (GGBFS), silica fume (SF) and fly ash (FA) at different ages. Three formulas obtained from the GEP-I, GEP-II and GEP-III models are constituted to predict the $f_s$ values from the age of specimen, water-binder ratio and compressive strength. Besides, one formula obtained from the RA is constituted to predict the $f_s$ values from the compressive strength. To achieve these formulas in the GEP and RA models, 972 data of the experimental studies presented with mortar mixtures were gathered from the literatures. 734 data of the experimental studies are divided without pre-planned for these formulas achieved from the training and testing sets of GEP and RA models. Beside, these formulas are validated with 238 data of experimental studies un-employed in training and testing sets. The $f_s$ results obtained from the training, testing and validation sets of these formulas are compared with the results obtained from the experimental studies and the formulas given in the literature for concrete. These comparisons show that the results of the formulas obtained from the GEP and RA models appear to well compatible with the experimental results and find to be very credible according to the results of other formulas.

A study on market-production model building for small bar steels (소봉제품의 시장생산 모형 구축에 관한 연구)

  • 김수홍;유정빈
    • Proceedings of the Korean Operations and Management Science Society Conference
    • /
    • 1996.10a
    • /
    • pp.139-145
    • /
    • 1996
  • A forecast on the past output data sets of small bar steels is very important information to make a decision on the future production quantities. In many cases, however, it has been mainly determined by experience (or rule of thumb). In this paper, past basic data sets of each small bar steels are statistically analyzed by some graphical and statistical forecasting methods. This work is mainly done by SAS. Among various quantitative forecasting methods in SAS, STEPAR forecasting method was best performed to the above data sets. By the method, the future production quantities of each small bar steels are forecasted. As a result of this statistical analysis, 95% confidence intervals for future forecast quantities are very wide. To improve this problem, a suitable systematic database system, integrated management system of demand-production-inventory and integrated computer system should be required.

  • PDF

TERRESTRIAL IMPACT CRATERING CHRONOLOGY II: PERIODICITY ANALYSIS WITH THE 2002 DATABASE

  • Moon, Hong-Kyu;Min, Byung-Hee;Lee, Seung-Kim
    • Bulletin of the Korean Space Science Society
    • /
    • 2003.10a
    • /
    • pp.85-85
    • /
    • 2003
  • We examined the hypothesis that the crater formation rate exhibits periodicity, employing data sets of Grieve (1991), Moon et al. (2001), and the Earth Impact Database (2002; DB02). DB02 is known to supercede previous compilations in terms of its accuracy and precision of the ages; it is the first time that this database has been used for periodicity analysis. For data sets comprising impact structures with D$\geq$5km (and also those with $\geq$20km), there is no convincing evidence for periodicities in the crater ages, according to our Fourier analysis. However, we detected two peaks at 16.1Myr and 34.7Myr for craters with D$\geq$30km; we confirm that the age distribution of impact craters with D$\geq$45km has dominant power at 16.1Myr. Thus, we may conjecture a probable periodic shower of Earth impactors with sizes d$\geq$1.5km. In addition, we found that the selection of data sets, the lower limits on the ages and diameters of impact craters, as well as the accuracy and precision of the ages, all constitute crucial factors in reconstructing the impact cratering history of the Earth.

  • PDF