• Title/Summary/Keyword: spatial data mining

Search Result 167, Processing Time 0.024 seconds

Analysis of Behavior of Seoullo 7017 Visitors - With a Focus on Text Mining and Social Network Analysis - (서울로 7017 방문자들의 이용행태 분석 -텍스트 마이닝과 소셜 네트워크 분석을 중심으로-)

  • Woo, Kyung-Sook;Suh, Joo-Hwan
    • Journal of the Korean Institute of Landscape Architecture
    • /
    • v.48 no.6
    • /
    • pp.16-24
    • /
    • 2020
  • The purpose of this study is to analyze the usage behavior of Seoullo 7017, the first public garden in Korea, to understand the usage status by analyzing blogs, and to present usage behavior and improvement plans for Seoullo 7017. From June 2017 to May 2020, after Seoullo 7017 was open to citizens, character data containing 'Seoullo 7017' in the title and contents of NAVER and·DAUM blogs were converted to text mining and socialization, a Big Data technique. The analysis was conducted using social network analysis. The summary of the research results is as follows. First of all, the ratio of men and women searching for Seoullo 7017 online is similar, and the regions that searched most are in the order of Seoul and Gyeonggi, and those in their 40s and 50s were the most interested. In other words, it can be seen that there is a lack of interest in regions other than Seoul and Gyeonggi and among those in their 10s, 20s, and 30s. The main behaviors of Seoullo 7017 are' night view' and 'walking', and the factors that affect culture and art are elements related to culture and art. If various programs and festivals are opened and actively promoted, the main behavior will be more varied. On the other hand, the main behavior that the users of Seoullo 7017 want is 'sit', which is a static behavior, but the physical conditions are not sufficient for the behavior to occur. Therefore, facilities that can cause sitting behavior, such as shades and benches must be improved to meet the needs of visitors. The peculiarity of the change in the behavior of Seoullo 7017 is that it is recognized as a good place to travel alone and a good place to walk alone as a public multi-use facility and group activities are restricted due to COVID-19. Accordingly, in a situation like the COVD-19 pandemic, more diverse behaviors can be derived in facilities where people can take a walk, etc., and the increase of various attractions and the satisfaction of users can be increased. Seoullo 7017, as Korea's first public pedestrian area, was created for urban regeneration and the efficient use of urban resources in areas beyond the meaning of public spaces and is a place with various values such as history, nature, welfare, culture, and tourism. However, as a result of the use behavior analysis, various behaviors did not occur in Seoullo 7017 as expected, and elements that hinder those major behaviors were derived. Based on these research results, it is necessary to understand the usage behavior of Seoullo 7017 and to establish a plan for spatial system and facility improvement, so that Seoullo 7017 can be an important place for urban residents and a driving force to revitalize the city.

Digital Gravity Anomaly Map of KIGAM (한국지질자원연구원 디지털 중력 이상도)

  • Lim, Mutaek;Shin, Younghong;Park, Yeong-Sue;Rim, Hyoungrea;Ko, In Se;Park, Changseok
    • Geophysics and Geophysical Exploration
    • /
    • v.22 no.1
    • /
    • pp.37-43
    • /
    • 2019
  • We present gravity anomaly maps based on KIGAM's gravity data measured from 2000 to 2018. Until 2016, we acquired gravity data on about 6,400 points for the purpose of regional mapping covering the whole country with data density of at least one point per $4km{\times}4km$ for reducing the time of the data acquisition. In addition, we have performed local gravity surveys for the purpose of mining development in and around the NMC Moland Mine at Jecheon in 2013 and in the Taebaeksan mineralized zone from 2015 to 2018 with data interval of several hundred meters to 2 km. Meanwhile, we carried out precise gravity explorations with data interval of about 250 m on and around epicenter areas of Gyeongju and Pohang earthquakes of relatively large magnitude which occurred in 2016 and in 2017, respectively. Thus we acquired in total about 9,600 points data as the result. We also used additional data acquired by Pusan National University for some local areas. Finally, gravity data more than 16,000 points except for the repetition and temporal control points were available to calculate free-air, Bouguer, and isostatic gravity anomalies. Therefore, the presented anomaly maps are most advanced in spatial distribution and the number of used data so far in Korea.

Sentiment Analysis of Movie Review Using Integrated CNN-LSTM Mode (CNN-LSTM 조합모델을 이용한 영화리뷰 감성분석)

  • Park, Ho-yeon;Kim, Kyoung-jae
    • Journal of Intelligence and Information Systems
    • /
    • v.25 no.4
    • /
    • pp.141-154
    • /
    • 2019
  • Rapid growth of internet technology and social media is progressing. Data mining technology has evolved to enable unstructured document representations in a variety of applications. Sentiment analysis is an important technology that can distinguish poor or high-quality content through text data of products, and it has proliferated during text mining. Sentiment analysis mainly analyzes people's opinions in text data by assigning predefined data categories as positive and negative. This has been studied in various directions in terms of accuracy from simple rule-based to dictionary-based approaches using predefined labels. In fact, sentiment analysis is one of the most active researches in natural language processing and is widely studied in text mining. When real online reviews aren't available for others, it's not only easy to openly collect information, but it also affects your business. In marketing, real-world information from customers is gathered on websites, not surveys. Depending on whether the website's posts are positive or negative, the customer response is reflected in the sales and tries to identify the information. However, many reviews on a website are not always good, and difficult to identify. The earlier studies in this research area used the reviews data of the Amazon.com shopping mal, but the research data used in the recent studies uses the data for stock market trends, blogs, news articles, weather forecasts, IMDB, and facebook etc. However, the lack of accuracy is recognized because sentiment calculations are changed according to the subject, paragraph, sentiment lexicon direction, and sentence strength. This study aims to classify the polarity analysis of sentiment analysis into positive and negative categories and increase the prediction accuracy of the polarity analysis using the pretrained IMDB review data set. First, the text classification algorithm related to sentiment analysis adopts the popular machine learning algorithms such as NB (naive bayes), SVM (support vector machines), XGboost, RF (random forests), and Gradient Boost as comparative models. Second, deep learning has demonstrated discriminative features that can extract complex features of data. Representative algorithms are CNN (convolution neural networks), RNN (recurrent neural networks), LSTM (long-short term memory). CNN can be used similarly to BoW when processing a sentence in vector format, but does not consider sequential data attributes. RNN can handle well in order because it takes into account the time information of the data, but there is a long-term dependency on memory. To solve the problem of long-term dependence, LSTM is used. For the comparison, CNN and LSTM were chosen as simple deep learning models. In addition to classical machine learning algorithms, CNN, LSTM, and the integrated models were analyzed. Although there are many parameters for the algorithms, we examined the relationship between numerical value and precision to find the optimal combination. And, we tried to figure out how the models work well for sentiment analysis and how these models work. This study proposes integrated CNN and LSTM algorithms to extract the positive and negative features of text analysis. The reasons for mixing these two algorithms are as follows. CNN can extract features for the classification automatically by applying convolution layer and massively parallel processing. LSTM is not capable of highly parallel processing. Like faucets, the LSTM has input, output, and forget gates that can be moved and controlled at a desired time. These gates have the advantage of placing memory blocks on hidden nodes. The memory block of the LSTM may not store all the data, but it can solve the CNN's long-term dependency problem. Furthermore, when LSTM is used in CNN's pooling layer, it has an end-to-end structure, so that spatial and temporal features can be designed simultaneously. In combination with CNN-LSTM, 90.33% accuracy was measured. This is slower than CNN, but faster than LSTM. The presented model was more accurate than other models. In addition, each word embedding layer can be improved when training the kernel step by step. CNN-LSTM can improve the weakness of each model, and there is an advantage of improving the learning by layer using the end-to-end structure of LSTM. Based on these reasons, this study tries to enhance the classification accuracy of movie reviews using the integrated CNN-LSTM model.

Application of Time Domain Reflectometry to the Monitoring of Ground Defromation (지반변형측정을 위한 TDR기술의 적용)

  • Lee, Woo-Jin;Kim, Yong-Jin;Lee, Won-Je;Lee, Woong-Joo
    • Journal of the Korean GEO-environmental Society
    • /
    • v.4 no.2
    • /
    • pp.15-25
    • /
    • 2003
  • Time Domain Refletometry, or TDR, is a remote sensing electrical measurement technique that has been used for many years to determine the spatial location and nature of various objects, especially in the United States of America and Australia at mining industry. Since early on 1990, the TDR techniques have been applied to the geotechnical engineering such as : deformation measurement of rock slope and landslide, monitoring of ground water content and ground water level change, investigation of ground contamination and its movement. The first application of this technique, in 1996, to the domestic area is to determine the possibility of ground settlement caused by subsidence from abandoned underground mines at the Tongri and Gosari in Gangwon-d. In this paper, through the results of analysed deformation data between conventional measurements and the TDR, it was concluded that the TDR technique is a useful instrumentation method for the prediction of ground deformation.

  • PDF

An Analysis of National R&D Trends in the Metaverse Field using Topic Modeling (토픽 모델링을 활용한 메타버스 분야 국가 R&D 동향 분석)

  • Lee, Jungwoo;Lee, Soyeon
    • Smart Media Journal
    • /
    • v.11 no.8
    • /
    • pp.9-20
    • /
    • 2022
  • With the rise of the metaverse industry worldwide, relevant national strategies and nurturing systems have been prepared in Korea. As the complexity of policies increases, the importance of establishing data-based policymkaing is growing, and studies diagnosing national R&D trends in the metaverse field are still lacking. Therefore, this paper collected NTIS national R&D information for 9,651 R&D projects promoted from 2002 to 2020. And this study looked at the current status and identified major topics based on the topic modeling, and considered time-series changes in the topics. Eleven major topics of R&D tasks in the metaverse field were derived, hot topics were service/content/platform development and medical/surgical fields of application fields, and cold topics were urban/environment/spatial information fields. Strategic R&D Management, metaverse-related laws, and institutional studies were proposed as policy directions.

A Study on the Soil Contamination(Maps) Using the Handheld XRF and GIS in Abandoned Mining Areas (휴대용 XRF와 GIS를 이용한 폐광산 지역의 토양오염에 관한 연구)

  • Lee, Hyeon-Gyu;Choi, Yo-Soon
    • Journal of the Korean Association of Geographic Information Studies
    • /
    • v.17 no.3
    • /
    • pp.195-206
    • /
    • 2014
  • In this study, soil contamination maps related to Cu and Pb were created at the Busan abandoned mine in Korea using a handheld X-Ray Fluorescence(XRF) and Geographic Information Systems(GIS). Hydrological analysis was performed using the Digital Elevation Model(DEM) of the study area to identify the flow directions of surface runoff where pollutants can be dispersed from the soil contamination sources. 24 locations for measuring the soil contamination related to Cu and Pb were selected by considering the result of hydrological analysis. The results measured at the 24 locations using the handheld XRF showed that the highest value of Cu contamination is 8,255ppm and that of Pb is 2,146ppm. The field investigation data were entered into ArcGIS software, and then soil contamination maps regarding Cu and Pb with a 5m grid-spacing were created after performing spatial interpolations using the ordinary kriging method. As a result, we could know that high concentrations of Cu and Pb are presented at the waste and tailings dumps around the abandoned mine openings. This study also showed that the handheld XRF and GIS can be utilized to create soil contamination maps related to Cu and Pb in the field.

Prediction of Ground Subsidence Hazard Area Using GIS and Probability Model near Abandoned Underground Coal Mine (GIS 및 확률모델을 이용한 폐탄광 지역의 지반침하 위험 예측)

  • Choi, Jong-Kuk;Kim, Ki-Dong;Lee, Sa-Ro;Kim, Il-Soo;Won, Joong-Sun
    • Economic and Environmental Geology
    • /
    • v.40 no.3 s.184
    • /
    • pp.295-306
    • /
    • 2007
  • In this study, we predicted areas vulnerable to ground subsidence near abandoned underground coal mine at Sam-cheok City in Korea using a probability (frequency ratio) model with Geographic Information System (GIS). To extract the factors related to ground subsidence, a spatial database was constructed from a topographical map, geo-logical map, mining tunnel map, land characteristic map, and borehole data on the study area including subsidence sites surveyed in 2000. Eight major factors were extracted from the spatial analysis and the probability analysis of the surveyed ground subsidence sites. We have calculated the decision coefficient ($R^2$) to find out the relationship between eight factors and the occurrence of ground subsidence. The frequency ratio model was applied to deter-mine each factor's relative rating, then the ratings were overlaid for ground subsidence hazard mapping. The ground subsidence hazard map was then verified and compared with the surveyed ground subsidence sites. The results of verification showed high accuracy of 96.05% between the predicted hazard map and the actual ground subsidence sites. Therefore, the quantitative analysis of ground subsidence near abandoned underground coal mine would be possible with a frequency ratio model and a GIS.

A Study on the Contemporary Definition of 'GARDEN' - Keyword Analysis used Literature Research and Big Data - ('정원'의 시대적 정의에 관한 연구 - 문헌연구와 빅데이터를 활용한 키워드 분석을 중심으로-)

  • Woo, Kyungsook;Suh, Joo Hwan
    • Journal of the Korean Institute of Landscape Architecture
    • /
    • v.44 no.5
    • /
    • pp.1-11
    • /
    • 2016
  • There has been an increasingly high interest in gardens and garden design in Korea recently. However, the usage of the term 'garden' is extremely varied and complex, and there has been very little academic research made on the meaning of garden. Therefore, this research attempts to investigate the ideas of current gardens and to elucidate their changing patterns by means of extensive literature research and big data analysis. The notion of garden in the past was broad including not only private space such as Madang(마당) and Teul(뜰), but also even field and grass land as public outdoor space. Yet, the meaning has become smaller to merely private space due to the change of dwelling systems due to high industrial development of the 20th century. Furthermore, the introduction of urban parks as an interactive space between nature and humans, the similar spatial function of gardens, has blurred the boundary between garden and park, which created confusion in understanding the concept of a garden. After all, garden is a subject for humans. The meanings of garden need to be recognized from various points of view since garden itself is a creation by the sum of diverse fields such as natural and social sciences as well as culturology. This discussion on the meaning of garden in the present day will give a conceptual foundation for future research on gardens and garden design. Also, the big data analysis employed here as a research method can help other similar research topics, particularly semantics in landscape architecture.

Top-down Hierarchical Clustering using Multidimensional Indexes (다차원 색인을 이용한 하향식 계층 클러스터링)

  • Hwang, Jae-Jun;Mun, Yang-Se;Hwang, Gyu-Yeong
    • Journal of KIISE:Databases
    • /
    • v.29 no.5
    • /
    • pp.367-380
    • /
    • 2002
  • Due to recent increase in applications requiring huge amount of data such as spatial data analysis and image analysis, clustering on large databases has been actively studied. In a hierarchical clustering method, a tree representing hierarchical decomposition of the database is first created, and then, used for efficient clustering. Existing hierarchical clustering methods mainly adopted the bottom-up approach, which creates a tree from the bottom to the topmost level of the hierarchy. These bottom-up methods require at least one scan over the entire database in order to build the tree and need to search most nodes of the tree since the clustering algorithm starts from the leaf level. In this paper, we propose a novel top-down hierarchical clustering method that uses multidimensional indexes that are already maintained in most database applications. Generally, multidimensional indexes have the clustering property storing similar objects in the same (or adjacent) data pares. Using this property we can find adjacent objects without calculating distances among them. We first formally define the cluster based on the density of objects. For the definition, we propose the concept of the region contrast partition based on the density of the region. To speed up the clustering algorithm, we use the branch-and-bound algorithm. We propose the bounds and formally prove their correctness. Experimental results show that the proposed method is at least as effective in quality of clustering as BIRCH, a bottom-up hierarchical clustering method, while reducing the number of page accesses by up to 26~187 times depending on the size of the database. As a result, we believe that the proposed method significantly improves the clustering performance in large databases and is practically usable in various database applications.

Construction of Precise Mine Geospatial Information and Ore Modeling for Smart Mining (스마트마이닝을 위한 정밀 광산공간정보 구축 및 광체 모델링)

  • Park, Joon Kyu;Jung, Kap Yong
    • Journal of the Korean Society of Surveying, Geodesy, Photogrammetry and Cartography
    • /
    • v.38 no.6
    • /
    • pp.725-731
    • /
    • 2020
  • In mineral resource development, resource exploration is a task to find economical minerals on the surface and underground, and the success rate is low compared to the development and production stages, and it is necessary to collect a lot of data through exploration and accurately analyze the collected information. In this study, mine spatial information was constructed using a 3D (Three-dimensional) laser scanner, and accuracy evaluation was performed to obtain a maximum deviation of 0.140 m and an average of 0.095 m in the X, Y and Z directions, and the possibility of utilizing the construction of mine geospatial information through a 3D laser scanner could be presented. In addition, the ore body modeling was performed by applying the interpolation method of the ore body section using the resource exploration results. The ore body modeling result was superimposed with the modeling result of the mine geospatial information built through the 3D laser scanner to construct the ore body modeling result based on the precise mine geospatial information. The results of ore body modeling based on mine geospatial information built through research can increase the ease of data interpretation and the accuracy of the calculated data, which will greatly increase the efficiency of work related to mineral resource development and mine damage prevention in the future.