• Title/Summary/Keyword: Outlier Analysis

Search Result 234, Processing Time 0.022 seconds

Selective Histogram Matching of Multi-temporal High Resolution Satellite Images Considering Shadow Effects in Urban Area (도심지역의 그림자 영향을 고려한 다시기 고해상도 위성영상의 선택적 히스토그램 매칭)

  • Yeom, Jun-Ho;Kim, Yong-Il
    • Journal of Korean Society for Geospatial Information Science
    • /
    • v.20 no.2
    • /
    • pp.47-54
    • /
    • 2012
  • Additional high resolution satellite images, other period or site, are essential for efficient city modeling and analysis. However, the same ground objects have a radiometric inconsistency in different satellite images and it debase the quality of image processing and analysis. Moreover, in an urban area, buildings, trees, bridges, and other artificial objects cause shadow effects, which lower the performance of relative radiometric normalization. Therefore, in this study, we exclude shadow areas and suggest the selective histogram matching methods for image based application without supplementary digital elevation model or geometric informations of sun and sensor. We extract the shadow objects first using adjacency informations with the building edge buffer and spatial and spectral attributes derived from the image segmentation. And, Outlier objects like a asphalt roads are removed. Finally, selective histogram matching is performed from the shadow masked multi-temporal Quickbird-2 images.

Regional Analysis of Particulate Matter Concentration Risk in South Korea (국내 지역별 미세먼지 농도 리스크 분석)

  • Oh, Jang Wook;Lim, Tea Jin
    • Journal of the Korean Society of Safety
    • /
    • v.32 no.5
    • /
    • pp.157-167
    • /
    • 2017
  • Millions of People die every year from diseases caused by exposure to outdoor air pollution. Especially, one of the most severe types of air pollution is fine particulate matter (PM10, PM2.5). South Korea also has been suffered from severe PM. This paper analyzes regional risks induced by PM10 and PM2.5 that have affected domestic area of Korea during 2014~2016.3Q. We investigated daily maxima of PM10 and PM2.5 data observed on 284 stations in South Korea, and found extremely high outlier. We employed extreme value distributions to fit the PM10 and PM2.5 data, but a single distribution did not fit the data well. For theses reasons, we implemented extreme mixture models such as the generalized Pareto distribution(GPD) with the normal, the gamma, the Weibull and the log-normal, respectively. Next, we divided the whole area into 16 regions and analyzed characteristics of PM risks by developing the FN-curves. Finally, we estimated 1-month, 1-quater, half year, 1-year and 3-years period return levels, respectively. The severity rankings of PM10 and PM2.5 concentration turned out to be different from region to region. The capital area revealed the worst PM risk in all seasons. The reason for high PM risk even in the yellow dust free season (Jun. ~ Sep.) can be inferred from the concentration of factories in this area. Gwangju showed the highest return level of PM2.5, even if the return level of PM10 was relatively low. This phenomenon implies that we should investigate chemical mechanisms for making PM2.5 in the vicinity of Gwangju area. On the other hand, Gyeongbuk and Ulsan exposed relatively high PM10 risk and low PM2.5 risk. This indicates that the management policy of PM risk in the west side should be different from that in the east side. The results of this research may provide insights for managing regional risks induced by PM10 and PM2.5 in South Korea.

A Study on Quality Control Method for Minutely Rainfall Data (분 단위 강우자료의 품질 개선방안에 관한 연구)

  • Kim, Min-Seok;Moon, Young-Il
    • KSCE Journal of Civil and Environmental Engineering Research
    • /
    • v.35 no.2
    • /
    • pp.319-326
    • /
    • 2015
  • Rainfall data is necessary component for water resources design and flood warning system. Most analysis are used long-term hourly data of surface synoptic stations from the Meteorological Administration, Ministry of land, Infrastructure and Transport and others. However, It will be used minutely data of more high density automatic weather stations than surface synoptic stations expecting to increase the frequency of heavy precipitation. But minutely data has a problem about quality of rainfall data by auto observation. This study analyzed about quality control method using automatic weather station's minutely rainfall data of meteorological administration. It was performed assessment of the quality control that was classified quality control of miss Data, outlier data and rainfall interpolation. This method will be utilized when hydrological analysis uses minute rainfall data.

Design of Heuristic Decision Tree (HDT) Using Human Knowledge (인간 지식을 이용한 경험적 의사결정트리의 설계)

  • Yoon, Tae-Tok;Lee, Jee-Hyong
    • Journal of the Korean Institute of Intelligent Systems
    • /
    • v.19 no.4
    • /
    • pp.525-531
    • /
    • 2009
  • Data mining is the process of extracting hidden patterns from collected data. At this time, for collected data which take important role as the basic information for prediction and recommendation, the process to discriminate incorrect data in order to enhance the performance of analysis result, is needed. The existing methods to discriminate unexpected data from collected data, mainly relies on methods which are based on statistics or simple distance between data. However, for these methods, the problematic point that even meaningful data could be excluded from analysis due that the environment and characteristic of the relevant data are not considered, exists. This study proposes a method to endow human heuristic knowledge with weight value through the comparison between collected data and human heuristic knowledge, and to use the value for creating a decision tree. The data discrimination by the method proposed is more credible as human knowledge is reflected in the created tree. The validity of the proposed method is verified through an experiment.

CUSUM Chart Applied to Monitoring Areal Population Mobility (누적합 관리도를 활용한 생활인구 이상치 탐색)

  • Kim, Hyoung Jun;Sohn, So Young
    • Journal of Korean Society for Quality Management
    • /
    • v.48 no.2
    • /
    • pp.241-256
    • /
    • 2020
  • Purpose: Certain places in Seoul such as Shinchon, Hongdae, and Gangnam, often suffer from sudden overflow of mobile population which can cause serious safety problems. This study suggests the application of spatial CUSUM control chart in monitoring areal population mobility data which is recently provided by Seoul metropolitan government. Methods: Monitoring series of standardized local Moran's I enables one to detect spatio-temporal out-of-control status based on the accumulation of past patterns. Moreover, we visualize such pattern map for more intuitive comprehension of the phenomenon. As a case study, we have analyzed the female mobility population aged 25 to 29 appeared in 51 Jipgyegu near Hongik university on fridays from January, 2017 to June, 2018. They are validated by exploring related articles and through local due diligence. Results: The results of the analysis provide insights in figuring out if the change of the mobility population is short-term by particular incident or long-term by spatial alteration, which allows strategic approach in constructing response system. Specific case near popular downtown near Hongik University has shown that newly opened hotels, shops of global sports brand and franchise bookstores have attracted young female population. Conclusion: We expect that the results of our study contribute to planning effective distribution of administrative resources to prepare against drastic increase in floating population. Furthermore, it can be useful in commercial area analysis and age/gender specific marketing strategy for companies.

STA : Sybil Type-aware Robust Recommender System (시빌 유형을 고려한 견고한 추천시스템)

  • Noh, Taewan;Oh, Hayoung;Noh, Giseop;Kim, Chongkwon
    • KIISE Transactions on Computing Practices
    • /
    • v.21 no.10
    • /
    • pp.670-679
    • /
    • 2015
  • With a rapid development of internet, many users these days refer to various recommender sites when buying items, movies, music and more. However, there are malicious users (Sybil) who raise or lower item ratings intentionally in these recommender sites. And as a result, a recommender system (RS) may recommend incomplete or inaccurate results to normal users. We suggest a recommender algorithm to separate ratings generated by users into normal ratings and outlier ratings, and to minimize the effects of malicious users. Specifically, our algorithm first ensures a stable RS against three kinds of attack models (Random attack, Average attack, and Bandwagon attack) which are the main recent security issues in RS. To prove the performance of the method of suggestion, we conducted performance analysis on real world data that we crawled. The performance analysis demonstrated that the suggested method performs well regardless of Sybil size and type when compared to existing algorithms.

Evaluation of Major Taper Equation Models for Developing a Stem Volume Table of Cryptomeria japonica in Jeju Island (제주도 삼나무 수간재적표 개발을 위한 주요 수간곡선식 비교)

  • Hyun-Soo, Kim;Su-Young, Jung;Kwang-Soo, Lee
    • Journal of Environmental Science International
    • /
    • v.31 no.11
    • /
    • pp.941-950
    • /
    • 2022
  • This study was conducted to provide data and stem information to establish a local volume table of Cryptomeria japonica in Jeju Island. Stem analysis was performed on 26 trees by selecting two average trees from each site of the 13 plots of C. japonica stands in 2021 and 2022. During the analysis stage, one outlier tree was rejected, and a total of 260 observations of the specific stem height of 25 trees were used. Of the seven major taper equation models applied for parameter estimation and statistical verification, the Muhairwe 1999 model was found to be the best fit and selected as the optimal model. Stem shape-related estimates were acquired through the selected model, and sectional measurements according to the Smalian formula applied at an interval of 10 cm from the height of the stem were used to develop a volume table. A paired t-test comparison between the C. japonica volume obtained from the present study and those selected from the current yield table by NIFoS(2020), revealed significant differences (p<0.05), highlighting the necessity of a local volume table for C. japonica in Jeju Island.

The Classification of Forest Cover Types by Consecutive Application of Multivariate Statistical Analysis in the Natural Forest of Western Mt. Jiri (다변량 통계 분석법의 연속 적용에 의한 서부 지리산 천연림의 산림 피복형 분류)

  • Chung, Sang Hoon;Kim, Ji Hong
    • Journal of Korean Society of Forest Science
    • /
    • v.102 no.3
    • /
    • pp.407-414
    • /
    • 2013
  • This study was conducted to classify forest cover types using the multivariate statistical analysis in the natural forest of western Mt. Jiri. On the basis of the vegetation data by point quarter sampling, the adopted analytical methods were species-area curve (SAC), hierarchical cluster analysis (HCA), indicator species analysis (ISA), and multiple discriminant analysis (MDA). SAC selected the outlier tree species which was likely to have no influence on the classification of forest cover types, excluded from all analytical process. Based on forest vegetative information, HCA classified the study area into 2 to 10 clusters and ISA indicated that the optimal number of clusters were seven. MDA was taken to test the clusters that classified with HCA and ISA. The seven clusters were classified appropriately as overall classification success were 91.3%. The classified forest cover types were named by the ratio of the dominant species in the upper layer of each cluster. They were (1) Quercus mongolica Pure forest, (2) Mixed mesophytic forest, (3) Q. mongolica - Q. serrata forest, (4) Abies koreana - Q. mongolica forest, (5) Fraxinus mandshurica forest, (6) Q. serrata forest, and (7) Carpinus laxiflora forest.

Characteristics for the Distribution of Elderly Population by Utilizing the Census Data (센서스 데이터를 활용한 고령인구 분포 특성)

  • Nam, Kwang-Woo;Gwon, Il-Hwa
    • Journal of the Korea Academia-Industrial cooperation Society
    • /
    • v.14 no.1
    • /
    • pp.464-469
    • /
    • 2013
  • After city of Busan has been entered to the aging society in 2000, the city has the highest aging rate among 7 representative cities in 2011. Moreover, while entire population and number of average household are decreasing, over 65 years old of elderly population is rapidly increasing. So, it is possible to enter the super-aged society, where aging rate would be about 20% after 2020. The purpose of this study is that older housing-related analysis is consisted of dong-unit, and this led microscopic analysis has become necessary. Surveys from 2000 through 2010, census aggregate (output area) unit of spatial analysis was conducted. Take advantages of this, aging population and area, soaring area, high-density areas, such as the region of interest were primary extracted, and microscopic location and spatial distribution patterns were analyzed. Upon analysis, aging population is concentrated in the city and adjacent area, the highlands, and 10 years of increasing rate was more than 30 times in certain aggregate. Regarding the characteristic of these areas, the original city center, Busan, especially concentrated and intensified in aging population. Also, 2000 to 2010, the overall distribution pattern of Busan has identified aging population that is increasingly being distributed. This is the result, which is confronted with previous research result. Entering a super aged-society for the future is accordance with migration of social costs and improve the quality of life of elderly. And this could be the basic information to use the spatial dimension for the corresponding.

Optimal National Coordinate System Transform Model using National Control Point Network Adjustment Results (국가지준점 망조정 성과를 활용한 최적 국가 좌표계 변환 모델 결정)

  • Song, Dong-Seob;Jang, Eun-Seok;Kim, Tae-Woo;Yun, Hong-Sic
    • Journal of the Korean Society of Surveying, Geodesy, Photogrammetry and Cartography
    • /
    • v.25 no.6_2
    • /
    • pp.613-623
    • /
    • 2007
  • The main purpose of this study is to investigate the coordinate transformation based on two different systems between local geodetic datum(tokyo datum) and international geocentric datum(new Korea geodetic datum). For this purpose, three methods were used to determine seven parameters as follows: Bursa-Wolf model, Molodensky-Badekas model, and Veis model. Also, we adopted multiple regression equation method to convert from Tokyo datum to KTRF. We used 935 control points as a common points and applied gross error analysis for detecting the outlier among those control points. The coordinate transformation was carried out using similarity transformation applied the obtained seven parameters and the precision of transformed coordinate was evaluated about 9,917 third or forth order control points. From these results, it was found that Bursa-Wolf model and Molodensky-Badekas model are more suitable than other for the determination of transformation parameters in Korea. And, transforming accuracy using MRE is lower than other similarity transformation model.