• Title/Summary/Keyword: classification map


A Study on Diagnosis of Alzheimer's Disease using Raman Spectra from Platelet (혈소판 라만 스펙트럼을 이용한 알츠하이머병 진단에 관한 연구)

  • Park, Aa-Rron;Heo, Gi-Su;Baek, Seong-Joon
    • Journal of the Institute of Electronics Engineers of Korea SC / v.47 no.4 / pp.40-46 / 2010
  • In this paper, we use Raman spectra measured from platelets for the diagnosis of Alzheimer's disease (AD). The spectra used in the experiments were preprocessed as follows and then fed into the classifier. The first step of the preprocessing is simple smoothing followed by background elimination, which makes the peak intensities easier to measure. The last step is peak alignment with a reference peak. After inspecting the preprocessed spectra, we found that the ratio of the two peak intensities at 743 and 757 cm$^{-1}$ and the peak intensity at 1658 cm$^{-1}$ are the most discriminative features. We then apply the mapstd method for normalization, which returns data with a mean of 0 and a standard deviation of 1. With these two features, classification of 278 spectra with an MLP (multi-layer perceptron) achieved about 95.5% correct classification. This suggests that Raman spectra measured from platelets could be used effectively for the diagnosis of Alzheimer's disease.
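The normalization step described in this abstract can be sketched as follows. This is a minimal illustration rather than the authors' code: the function name `standardize` and the sample values are hypothetical, and the use of the sample standard deviation (n - 1 denominator) is an assumption.

```python
from statistics import mean, stdev


def standardize(values):
    """Rescale one feature to mean 0 and sample standard deviation 1."""
    mu = mean(values)
    sigma = stdev(values)  # assumption: sample standard deviation (n - 1)
    if sigma == 0:
        raise ValueError("a constant feature cannot be standardized")
    return [(v - mu) / sigma for v in values]


# Hypothetical 743/757 peak-intensity ratios for five spectra.
peak_ratio = [0.82, 0.91, 1.10, 1.25, 0.97]
normalized = standardize(peak_ratio)
```

Each feature would be standardized separately before being passed to a classifier such as an MLP.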

Machine Classification in Ship Engine Rooms Using Transfer Learning (전이 학습을 이용한 선박 기관실 기기의 분류에 관한 연구)

  • Park, Kyung-Min
    • Journal of the Korean Society of Marine Environment & Safety / v.27 no.2 / pp.363-368 / 2021
  • Automation systems in ship engine rooms have improved with advances in technology. However, many variables at sea, such as wind, waves, vibration, and equipment aging, cause loosening, cutting, and leakage that automated systems do not measure. In some cases only one engineer is available for patrolling. This entails many risk factors in the engine room, where rotating equipment operates at high temperature and high pressure. When patrolling, the engineer relies on the five senses, and particularly heavily on vision. We present a preliminary study toward an engine-room patrol robot that detects and reports on engine-room machinery while patrolling. Images of ship engine-room equipment were classified using a convolutional neural network (CNN). After an image dataset of the ship engine room was constructed, the network was trained starting from a pre-trained CNN model. The classification performance of the trained model showed high reproducibility. Images were visualized with a class activation map. Although the results cannot be generalized because the amount of data was limited, we expect that if the data of each ship were learned through transfer learning, a model suited to the characteristics of each ship could be built with little time and cost.

Bankruptcy Type Prediction Using A Hybrid Artificial Neural Networks Model (하이브리드 인공신경망 모형을 이용한 부도 유형 예측)

  • Jo, Nam-ok;Kim, Hyun-jung;Shin, Kyung-shik
    • Journal of Intelligence and Information Systems / v.21 no.3 / pp.79-99 / 2015
  • The prediction of bankruptcy has been extensively studied in accounting and finance. It can have an important impact on lending decisions and the profitability of financial institutions in terms of risk management. Many researchers have focused on constructing more robust bankruptcy prediction models. Early studies primarily used statistical techniques such as multiple discriminant analysis (MDA) and logit analysis. However, many studies have demonstrated that, since the 1990s, artificial intelligence (AI) approaches such as artificial neural networks (ANN), decision trees, case-based reasoning (CBR), and support vector machines (SVM) have outperformed statistical techniques on business classification problems, because statistical methods rest on rigid assumptions. In previous studies on corporate bankruptcy, many researchers have focused on developing prediction models using financial ratios. However, few studies address the specific types of bankruptcy. Previous bankruptcy prediction models have generally been concerned with predicting whether or not a firm will become bankrupt, and most studies on bankruptcy types have reviewed the previous literature or performed a case study. This study therefore develops a model, using data mining techniques, that predicts the specific type of bankruptcy as well as its occurrence in Korean small- and medium-sized construction firms in terms of profitability, stability, and activity indices. This would help firms take preventive action before bankruptcy occurs. We propose a hybrid approach using two artificial neural networks (ANNs) for the prediction of bankruptcy types. The first is a back-propagation neural network (BPN) model that uses supervised learning for bankruptcy prediction, and the second is a self-organizing map (SOM) model that uses unsupervised learning to classify bankruptcy data into several types. 
Based on the constructed model, we predict the bankruptcy of companies by applying the BPN model to a validation set that was not used in developing the model. The specific types of bankruptcy can then be identified from the cases that the BPN model predicts as bankrupt. To interpret the characteristics of the clusters derived by the SOM model, we calculated, for each cluster, the averages of the input variables selected through statistical tests. Each cluster represents a bankruptcy type derived from the data of bankrupt firms, and the input variables are the financial ratios used to interpret the meaning of each cluster. The experimental results show that each of the five bankruptcy types has different characteristics in terms of financial ratios. Type 1 (severe bankruptcy) has inferior financial statements except for EBITDA (earnings before interest, taxes, depreciation, and amortization) to sales. Type 2 (lack of stability) has a low quick ratio, low stockholders' equity to total assets, and high total borrowings to total assets. Type 3 (lack of activity) has slightly low total asset turnover and fixed asset turnover. Type 4 (lack of profitability) has low retained earnings to total assets and low EBITDA to sales, which are the indices of profitability. Type 5 (recoverable bankruptcy) includes firms that are in relatively good financial condition compared to the other bankruptcy types even though they are bankrupt. Based on these findings, researchers and practitioners in the credit evaluation field can obtain more useful information about the types of corporate bankruptcy. In this paper, we used the financial ratios of firms to classify bankruptcy types. It is important to select input variables that correctly predict bankruptcy and meaningfully classify its type. In a further study, we will include non-financial factors such as the size, industry, and age of the firms. 
Thus, we can obtain realistic clustering results for bankruptcy types by combining qualitative factors and reflecting the domain knowledge of experts.
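The cluster-interpretation step described in this abstract (averaging the selected financial ratios within each SOM cluster) can be sketched as below. This is only an illustration under stated assumptions: the function name `cluster_means`, the ratio names, the values, and the cluster labels are all hypothetical, and the SOM training itself is not shown.

```python
from collections import defaultdict


def cluster_means(records, labels):
    """Average every financial ratio over the firms assigned to each cluster."""
    grouped = defaultdict(list)
    for record, label in zip(records, labels):
        grouped[label].append(record)
    means = {}
    for label, rows in grouped.items():
        means[label] = {
            key: sum(row[key] for row in rows) / len(rows) for key in rows[0]
        }
    return means


# Hypothetical bankrupt firms with two ratios each, and hypothetical
# cluster labels as they might come out of a trained SOM.
firms = [
    {"quick_ratio": 0.4, "total_asset_turnover": 1.2},
    {"quick_ratio": 0.6, "total_asset_turnover": 1.0},
    {"quick_ratio": 1.5, "total_asset_turnover": 0.3},
]
som_cluster = [2, 2, 3]
profile = cluster_means(firms, som_cluster)
```

Comparing the per-cluster averages against each other is what allows a cluster to be read as, for example, a "lack of stability" or "lack of activity" type.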

Updating DEM for Improving Geomorphic Details (미기복 지형 표현을 위한 DEM 개선)

  • Kim, Nam-Shin
    • Journal of the Korean Association of Geographic Information Studies / v.12 no.1 / pp.64-72 / 2009
  • Generating a digital elevation model (DEM) from contour lines has the problem that low-relief landforms cannot be represented clearly, because the representation of micro-landform elements depends heavily on the contour interval. This study therefore develops a landcover burning method that recovers the micro-relief landforms of a DEM by assigning elevation information to landcover elements and applying buffering and map algebra. The recovery process consists of generating a primary DEM, making a landcover map, and, for the landcover elements that correspond to micro landforms, recovering the DEM using the buffering method and elevation information applied through map algebra. The recovery was applied to stream landforms. For bars and wetlands, which are polygonal elements, the buffering method was used to generate contours at a uniform interval according to whether the feature is concave or convex, and the resulting elevation information was applied to the recovered landform. Linear elements such as banks, roads, waterways, and tributaries were recovered by applying elevation information with a map algebra function. Polygonal elements such as stream channels, river terraces, and artificial objects (farmland) are treated as flat and were recovered by assigning constant elevation values. The original DEM and the recovered DEM were then compared in terms of the degree of landform expression. The DEM produced by the conventional method expressed few micro-landform elements, whereas the method developed in this study described wetlands, bars, landforms around rivers, farmland, banks, river terraces, and artificial objects well. 
The results of this study are expected to contribute to the classification and analysis of micro landforms and plains, and to ecological and environmental studies that require the recovery of micro landforms around streams and rivers.
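The idea of burning landcover elevations into a DEM can be illustrated on a tiny grid. This is a sketch under assumptions, not the method of the paper: the function names `burn_constant` and `burn_offset`, the grid values, and the choice of a simple additive offset for the linear element are all hypothetical, since the abstract does not specify the map algebra function used.

```python
def burn_constant(dem, mask, elevation):
    """Return a copy of dem where masked cells take one constant elevation.

    Corresponds to elements treated as flat (e.g. a stream channel).
    """
    return [
        [elevation if m else z for z, m in zip(dem_row, mask_row)]
        for dem_row, mask_row in zip(dem, mask)
    ]


def burn_offset(dem, mask, offset):
    """Return a copy of dem where masked cells are raised or lowered.

    Assumed map-algebra form for a linear element such as a bank.
    """
    return [
        [z + offset if m else z for z, m in zip(dem_row, mask_row)]
        for dem_row, mask_row in zip(dem, mask)
    ]


# Hypothetical 2 x 3 primary DEM (metres) and rasterized landcover masks.
dem = [[10.0, 10.0, 10.0], [10.0, 10.0, 10.0]]
channel = [[0, 1, 0], [0, 1, 0]]
bank = [[1, 0, 0], [1, 0, 0]]

recovered = burn_offset(burn_constant(dem, channel, 8.5), bank, 2.0)
```

Both helpers return new grids, so the primary DEM stays available for the before/after comparison the abstract describes.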


A Study on Differences of Contents and Tones of Arguments among Newspapers Using Text Mining Analysis (텍스트 마이닝을 활용한 신문사에 따른 내용 및 논조 차이점 분석)

  • Kam, Miah;Song, Min
    • Journal of Intelligence and Information Systems / v.18 no.3 / pp.53-77 / 2012
  • This study analyzes differences in content and tone of argument among three major Korean newspapers: the Kyunghyang Shinmun, the HanKyoreh, and the Dong-A Ilbo. It is commonly accepted that newspapers in Korea explicitly convey their own tone of argument when they cover sensitive issues and topics. This can be problematic if readers are unaware of a newspaper's tone, because content and tone can easily influence them. A tool that informs readers of a newspaper's tone of argument is therefore desirable. This study presents the results of clustering and classification techniques as part of a text mining analysis. We focus on six main subjects (Culture, Politics, International, Editorial-opinion, Eco-business, and National issues) and attempt to identify differences and similarities among the newspapers. The basic unit of analysis is a paragraph of a news article. This study uses a keyword-network analysis tool and visualizes relationships among keywords to make the differences easier to see. Newspaper articles were gathered from KINDS, the Korean integrated news database system, which preserves articles of the three newspapers and makes them open to the public. About 3,030 articles from 2008 to 2012 were used. The International, National issues, and Politics sections were gathered for specific issues: the International section with the keyword 'Nuclear weapon of North Korea,' the National issues section with the keyword '4-major-river,' and the Politics section with the keyword 'Tonghap-Jinbo Dang.' All articles from April 2012 to May 2012 in the Eco-business, Culture, and Editorial-opinion sections were also collected. 
All of the collected data were edited into paragraphs. We removed stop-words using the Lucene Korean Module. We calculated keyword co-occurrence counts from the list of keyword pairs co-occurring in a paragraph and built a co-occurrence matrix from that list. From the co-occurrence matrix we computed a cosine coefficient matrix, which was used as input for PFNet (Pathfinder Network). To analyze the three newspapers and find the significant keywords in each, we examined the 10 highest-frequency keywords and the keyword networks of the 20 highest-frequency keywords, which show the relationships among keywords in a detailed network map. We used NodeXL software to visualize the PFNet. After drawing all the networks, we compared them with the classification results. Classification was first performed to identify how the tone of argument of one newspaper differs from the others. Then, to analyze tones of argument, all paragraphs were divided into two types, Positive and Negative. A supervised learning technique was used to classify the tone of all collected paragraphs and articles: the Naïve Bayes classifier provided in the MALLET package. After classification, precision, recall, and F-value were used to evaluate the results. The results show that three subjects, Culture, Eco-business, and Politics, differed in content and tone of argument among the three newspapers. In addition, for National issues, the tones of argument on the 4-major-rivers project differed from one another. The three newspapers appear to have their own specific tone of argument in those sections. The keyword networks also had different shapes for the same period and the same section. 
This means that the frequently appearing keywords differ across newspapers and that their contents are composed of different keywords. The Positive-Negative classification showed that it is possible to distinguish a newspaper's tone of argument from that of others. These results indicate that the approach is promising as the basis for a new tool that identifies the different tones of argument of newspapers.
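The co-occurrence and cosine steps described in this abstract can be sketched in a few lines. This is an illustrative sketch only: the function names and the toy paragraphs are hypothetical, each keyword pair is counted once per paragraph (an assumption), and the Pathfinder Network pruning that follows in the study is not implemented here.

```python
from collections import Counter
from itertools import combinations
from math import sqrt


def cooccurrence(paragraphs):
    """Count, for each keyword pair, the paragraphs in which both appear."""
    counts = Counter()
    for words in paragraphs:
        for a, b in combinations(sorted(set(words)), 2):
            counts[(a, b)] += 1
    return counts


def cosine_matrix(counts):
    """Build symmetric co-occurrence vectors and compare them by cosine."""
    vocab = sorted({w for pair in counts for w in pair})
    index = {w: i for i, w in enumerate(vocab)}
    vec = {w: [0] * len(vocab) for w in vocab}
    for (a, b), n in counts.items():
        vec[a][index[b]] = n
        vec[b][index[a]] = n

    def cos(u, v):
        dot = sum(x * y for x, y in zip(u, v))
        norm = sqrt(sum(x * x for x in u)) * sqrt(sum(y * y for y in v))
        return dot / norm if norm else 0.0

    return vocab, [[cos(vec[a], vec[b]) for b in vocab] for a in vocab]


# Hypothetical paragraphs after stop-word removal.
paragraphs = [
    ["river", "project", "budget"],
    ["river", "project"],
    ["budget", "election"],
]
counts = cooccurrence(paragraphs)
vocab, sim = cosine_matrix(counts)
```

The resulting similarity matrix is the kind of input a network-pruning step such as PFNet would take before visualization.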

Performance of Investment Strategy using Investor-specific Transaction Information and Machine Learning (투자자별 거래정보와 머신러닝을 활용한 투자전략의 성과)

  • Kim, Kyung Mock;Kim, Sun Woong;Choi, Heung Sik
    • Journal of Intelligence and Information Systems / v.27 no.1 / pp.65-82 / 2021
  • Stock market investors are generally divided into foreign investors, institutional investors, and individual investors. Compared to individual investors, professional investor groups such as foreign investors have an advantage in information and financial power, and foreign investors are consequently known to show good investment performance among market participants. The purpose of this study is to propose an investment strategy that combines investor-specific transaction information with machine learning, and to analyze the portfolio performance of the proposed model using actual stock price and investor-specific transaction data. The Korea Exchange provides securities firms with daily information on the purchase and sale volume of each investor type. We developed a data collection program in C# using an API provided by Daishin Securities Cybosplus, and collected daily opening prices, closing prices, and investor-specific net purchase data for 151 of the 200 KOSPI stocks from January 2, 2007 to July 31, 2017. The self-organizing map model, introduced by Teuvo Kohonen in 1984, is an artificial neural network that performs clustering by unsupervised learning. Neurons within a layer compete with one another, and the network is feedforward, with all connections going from bottom to top. It can be expanded to multiple layers, although a single layer is commonly used. Linear functions are used as the activation functions of the artificial neurons, and the learning rules include the Instar rule as well as general competitive learning. The backpropagation model is an artificial neural network that performs classification by supervised learning. We used the self-organizing map model to group and transform the investor-specific transaction volume data, which were then used to train the backpropagation model. 
Based on the trained model's estimates for the verification data, the portfolios were rebalanced monthly. For performance analysis, a passive portfolio was designated, and the KOSPI 200 and KOSPI index returns were obtained as proxies for market returns. Performance was analyzed using the equally-weighted portfolio return, compound interest rate, annual return, maximum drawdown (MDD), standard deviation, and Sharpe ratio. The buy-and-hold return of the top 10 stocks by market capitalization was designated as the benchmark, since buy and hold is the best strategy under the efficient market hypothesis. The prediction rate of the backpropagation model was very high on the learning data, at 96.61%, and relatively high on the verification data, at 57.1%. The performance of the self-organizing map grouping can be judged from the results of the backpropagation model, because if the grouping had been poor, the learning results of the backpropagation model would also have been poor. In this respect, the machine learning performance is judged to be better than in previous studies. Our portfolio earned twice the return of the benchmark and outperformed the market returns of the KOSPI and KOSPI 200 indexes. The portfolio risk indicators, MDD and standard deviation, were also better than those of the benchmark. The Sharpe ratio was higher than that of the benchmark and the stock market indexes. We thus presented a direction for a portfolio composition program using machine learning and investor-specific transaction information, and showed that it can be used to develop programs for real stock investment. The reported return results from composing the portfolio monthly and rebalancing assets to equal proportions. 
Better results are expected if, when the monthly portfolio is formed, stocks that remain recommended are rebalanced rather than sold and re-bought. The strategy therefore appears applicable to real trading.
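Two of the performance measures named in this abstract, maximum drawdown and the Sharpe ratio, can be computed from a series of periodic returns as sketched below. The abstract does not give the exact formulas used, so this sketch assumes monthly returns, a zero risk-free rate, the sample standard deviation, and annualization by the square root of 12. The function names and return values are hypothetical.

```python
from math import sqrt
from statistics import mean, stdev


def max_drawdown(returns):
    """Largest peak-to-trough loss of the cumulative wealth curve (fraction)."""
    wealth, peak, worst = 1.0, 1.0, 0.0
    for r in returns:
        wealth *= 1.0 + r
        peak = max(peak, wealth)
        worst = max(worst, (peak - wealth) / peak)
    return worst


def sharpe_ratio(returns, risk_free=0.0, periods_per_year=12):
    """Annualized mean excess return divided by its standard deviation."""
    excess = [r - risk_free for r in returns]
    return mean(excess) / stdev(excess) * sqrt(periods_per_year)


# Hypothetical monthly portfolio returns.
monthly = [0.10, -0.20, 0.05, 0.10]
mdd = max_drawdown(monthly)
sharpe = sharpe_ratio(monthly)
```

A lower maximum drawdown and a higher Sharpe ratio than the benchmark are the kind of comparison the abstract reports.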

A Study on the Establishment of Database for the Efficient Management of Unexecuted Urban Planning Facilities (미집행 도시계획시설의 효율적 관리를 위한 DB구축 방안에 관한 연구)

  • KIM, Kwang-Yeol;KIM, Shin-Hey;BAEK, Tae-Kyung
    • Journal of the Korean Association of Geographic Information Studies / v.23 no.3 / pp.1-11 / 2020
  • The purpose of this study is to classify unexecuted urban planning facilities using a Geographic Information System (GIS) in order to prepare measures for their systematic and efficient management, and to find ways to establish national territory information for continuous management and operation by building a database of the spatial data of the classified facilities. For this purpose, the current urban management plan, thematic maps, cadastral maps, and satellite images of the Korea Land Information System (KLIS) were collected from Miryang City. A qualitative analysis of the execution and non-execution of urban planning facilities was conducted by combining the urban planning facility layer, the satellite images, and the continuous cadastral layer of the cadastral maps with classified and processed owner attribute information. In the analysis, facilities were classified as unexecuted when they lay mostly on private land and no existing road or facility was visible in the satellite imagery. Facilities where an existing road had been opened but that included some private land were classified as executed, because the local government is recognized as the de facto controlling entity where the road is in public use. The derived unexecuted urban planning facilities were divided into shape data layers, and the attribute data of the unexecuted facilities were organized so that their status and statistical information can be identified quickly and accurately. This study proposes an analysis approach that applies GIS technology for the scientific and rational analysis of unexecuted urban planning facilities and the establishment of reliable spatial data, and proposes a plan to establish a database for connection with existing systems and use of the information.

Analysis on the Sedimentary Environment Change Induced by Typhoon in the Sacheoncheon, Gangneung using Multi-temporal Remote Sensing Data (태풍 루사에 의한 강릉 사천천 주변 퇴적 환경 변화: 다중 시기 원격탐사 자료를 이용한 정보 분석)

  • Park, No-Wook;Jang, Dong-Ho;Chi, Kwang-Hoon
    • Journal of the Korean earth science society / v.27 no.1 / pp.83-94 / 2006
  • The objective of this paper is to extract and analyze, using multi-temporal remote sensing data, information on sedimentary environment change around the Sacheoncheon in Gangneung, Korea, which was seriously damaged by typhoon Rusa in early September 2002. To extract change information, an unsupervised approach based on the automatic determination of threshold values was applied. The change detection results showed turbidity changes right after typhoon Rusa, a decrease in wetlands, an increase in dry sand and channel width, and changes in relative level in the stream due to seasonal variation. Sedimentation in cultivated areas and restoration works also affected the changes near the Sacheoncheon. In addition to the change detection analysis, several environmental thematic maps, including a microtopographic map, the distribution of estimated flood deposit amounts, and a flood hazard landform classification map, were generated from remote sensing and field survey data. In conclusion, multi-temporal remote sensing data can be used effectively for natural hazard analysis and damage information extraction, and specific data processing techniques for high-resolution remote sensing data should also be developed.

A Study on Establishment of the Levee GIS Database Using LiDAR Data and WAMIS Information (LiDAR 자료와 WAMIS 정보를 활용한 제방 GIS 데이터베이스 구축에 관한 연구)

  • Choing, Yun-Jae
    • Journal of the Korean Association of Geographic Information Studies / v.17 no.3 / pp.104-115 / 2014
  • A levee is defined as a man-made structure that protects areas from temporary flooding. This paper suggests a methodology for establishing a levee GIS database using airborne topographic LiDAR (Light Detection and Ranging) data acquired in the Nakdong river basins and WAMIS (WAter Management Information System) information. First, the National Levee Database (NLD) established by the USACE (United States Army Corps of Engineers) and the levee information tables established by WAMIS are compared and analyzed. To extract levee information from the LiDAR data, a DSM (Digital Surface Model) is generated from the LiDAR point clouds by interpolation. Then, a slope map is generated by calculating the maximum rate of elevation difference between each pixel of the DSM and its neighboring pixels. A slope classification method is employed to extract the levee component polygons, such as the levee crown polygons and the levee slope polygons, from the slope map. The levee information database is then established by integrating the attributes extracted from the identified levee crown and slope polygons with the information provided by WAMIS. Finally, this paper discusses the advantages and limitations of a levee GIS database established using only the LiDAR data and suggests future work for improving the quality of the database.
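The slope-map step described in this abstract (maximum rate of elevation difference between a pixel and its neighbors) can be sketched on a small grid. This is an illustration under assumptions rather than the paper's implementation: the 8-pixel neighborhood, the cell size, the threshold, the class names, and the toy cross-section are all hypothetical.

```python
from math import hypot


def slope_map(dsm, cell_size=1.0):
    """For each pixel, the maximum |dz| / distance over its 8 neighbors."""
    rows, cols = len(dsm), len(dsm[0])
    slopes = [[0.0] * cols for _ in range(rows)]
    for r in range(rows):
        for c in range(cols):
            for dr in (-1, 0, 1):
                for dc in (-1, 0, 1):
                    nr, nc = r + dr, c + dc
                    if (dr or dc) and 0 <= nr < rows and 0 <= nc < cols:
                        dist = cell_size * hypot(dr, dc)
                        rate = abs(dsm[nr][nc] - dsm[r][c]) / dist
                        slopes[r][c] = max(slopes[r][c], rate)
    return slopes


def classify(slopes, threshold):
    """Label each pixel 'steep' or 'flat' by a single slope threshold."""
    return [["steep" if s >= threshold else "flat" for s in row] for row in slopes]


# Hypothetical levee cross-section (metres), repeated over two rows:
# ground, rising slope, flat crown, falling slope, ground.
profile = [0.0, 0.0, 2.0, 4.0, 4.0, 4.0, 2.0, 0.0, 0.0]
dsm = [profile[:], profile[:]]

slopes = slope_map(dsm)
classes = classify(slopes, threshold=1.0)
```

Note that with this definition the pixels at the edge of the crown also pick up a high slope, because one of their neighbors lies on the levee slope. Grouping the classified pixels into crown and slope polygons would be a further step.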

Development of Thermal Comfort Evaluation Map by the Land Cover in Yeongnam Region (영남지역의 토지피복에 따른 열쾌적성평가도 구축)

  • Kang, Dong-Hyun;Choi, Chul-Hyun;Jung, Sung-Gwan
    • Journal of the Korean Association of Geographic Information Studies / v.17 no.2 / pp.136-155 / 2014
  • The purpose of this study is to analyze thermal comfort in the Yeongnam area using climatic data and GIS data in order to identify regions where policies to improve the thermal environment are needed. The calculated PET (physiological equivalent temperature) was high in Daegu city and low in Bonghwa-gun compared to other regions. PET was compared with a typical classification of regional characteristics. Rural areas such as Changnyeong-gun, Haman-gun, and Goryeong-gun had high PET values but much less green space than other rural areas. The Yeongnam area was then classified by PET value using cluster analysis. Areas in the lower grades had a low green space ratio and a high facility area, which indicates a relationship between thermal comfort and land cover. The thermal comfort evaluation map of the Yeongnam area will be useful in urban planning for establishing sustainable cities under climate change.