• 제목/요약/키워드: Data Features

검색결과 6,476건 처리시간 0.03초

Multi- Resolution MSS Image Fusion

  • Ghassemian, Hassan;Amidian, Asghar
    • 대한원격탐사학회:학술대회논문집
    • /
    • 대한원격탐사학회 2003년도 Proceedings of ACRS 2003 ISRS
    • /
    • pp.648-650
    • /
    • 2003
  • Efficient multi-resolution image fusion aims to take advantage of the high spectral resolution of Landsat TM images and high spatial resolution of SPOT panchromatic images simultaneously. This paper presents a multi-resolution data fusion scheme, based on multirate image representation. Motivated by analytical results obtained from high-resolution multispectral image data analysis: the energy packing the spectral features are distributed in the lower frequency bands, and the spatial features, edges, are distributed in the higher frequency bands. This allows to spatially enhancing the multispectral images, by adding the high-resolution spatial features to them, by a multirate filtering procedure. The proposed method is compared with some conventional methods. Results show it preserves more spectral features with less spatial distortion.

  • PDF

특성중요도를 활용한 분류나무의 입력특성 선택효과 : 신용카드 고객이탈 사례 (Feature Selection Effect of Classification Tree Using Feature Importance : Case of Credit Card Customer Churn Prediction)

  • 윤한성
    • 디지털산업정보학회논문지
    • /
    • 제20권2호
    • /
    • pp.1-10
    • /
    • 2024
  • For the purpose of predicting credit card customer churn accurately through data analysis, a model can be constructed with various machine learning algorithms, including decision tree. And feature importance has been utilized in selecting better input features that can improve performance of data analysis models for several application areas. In this paper, a method of utilizing feature importance calculated from the MDI method and its effects are investigated in the credit card customer churn prediction problem with classification trees. Compared with several random feature selections from case data, a set of input features selected from higher value of feature importance shows higher predictive power. It can be an efficient method for classifying and choosing input features necessary for improving prediction performance. The method organized in this paper can be an alternative to the selection of input features using feature importance in composing and using classification trees, including credit card customer churn prediction.

항공레이저측량 자료를 활용한 IKONOS-2 위성영상의 기하보정에 관한 연구 - 선형요소를 기하보정의 기본요소로 활용하여 (Geometric Correction of IKONOS-2 Geo-level Satellite Imagery Using LiDAR Data - Using Linear Features as Registration Primitivess)

  • 이재빈;김용민;이효성;유기윤;김용일
    • 한국측량학회지
    • /
    • 제25권3호
    • /
    • pp.183-190
    • /
    • 2007
  • 최첨단 측량기술로 획득되어진 고해상도 위성영상과 항공레이저측량 자료들을 의미 있는 지리정보로 활용하고 상호보완적인 가치를 창출하기 위해서는 이러한 자료들을 같은 좌표계 상에 표현할 수 있도록 기하보정 하는 과정이 선행되어야 한다. 본 연구에서는 고해상도 위성영상을 항공레이저측량 자료를 활용하여 기하보정하기 위한 방법론을 제안하였다. 이를 위해 항공레이저측량 자료와 고해상도 위성영상인 IKONOS-2 위성영상으로부터 선형 기하요소를 추출하였으며 추출된 선형요소를 기하보정의 기본요소로 활용하여 고해상도 위성영상의 단사진과 항공레이저측량 자료의 기하보정을 수행하였다. 마지막으로 연구를 위하여 수집된 실제 측량자료에 개발된 방법론들을 적용하고 도출된 결과에 대한 통계평가를 수행함으로써 연구결과의 효용성을 입증하였다.

Language- Independent Sentence Boundary Detection with Automatic Feature Selection

  • Lee, Do-Gil
    • Journal of the Korean Data and Information Science Society
    • /
    • 제19권4호
    • /
    • pp.1297-1304
    • /
    • 2008
  • This paper proposes a machine learning approach for language-independent sentence boundary detection. The proposed method requires no heuristic rules and language-specific features, such as part-of-speech information, a list of abbreviations or proper names. With only the language-independent features, we perform experiments on not only an inflectional language but also an agglutinative language, having fairly different characteristics (in this paper, English and Korean, respectively). In addition, we obtain good performances in both languages. We have also experimented with the methods under a wide range of experimental conditions, especially for the selection of useful features.

  • PDF

한국어 화제구문의 운율적 고찰 (The Study of Prosodic Features in Korean Topic Constructions)

  • 황손문
    • 음성과학
    • /
    • 제9권2호
    • /
    • pp.59-68
    • /
    • 2002
  • This paper analyzes the prosodic features distinctively associated with Korean topic constructions (marked by nun or its variant un) and subject constructions (marked by ka or its variant i) as a way of explicating the role that prosody plays in differentially constituting their discourse messages. Using both spoken data elicited in controlled settings and spontaneous conversational data, an attempt is made to identify differentiating prosodic features and intonation contours associated with distinct meanings and functions of nun- and ka-constructions evoked in a variety of discourse contexts.

  • PDF

이동 로봇을 위한 초음파 센서의 완성도 높은 형상지도 작성법 (A Complete Feature Map Building Method of Sonar Sensors for Mobile Robots)

  • 이세진;임종환;조동우
    • 한국정밀공학회지
    • /
    • 제27권1호
    • /
    • pp.64-75
    • /
    • 2010
  • This study introduces a complete feature map building method of sonar sensors for mobile robots. This method enhances the reality of feature maps by extracting even circle features as well as line and point features from sonar data. Edge features are, moreover, generated by combining line features close to circle features extracted around comer sites. The uncertainties of the specular reflection phenomenon and wide beam width of sonar data can be, therefore, reduced through this map building method. The experimental results demonstrate a practical validity of the proposed method in those environments.

Knowledge Recommendation Based on Dual Channel Hypergraph Convolution

  • Yue Li
    • KSII Transactions on Internet and Information Systems (TIIS)
    • /
    • 제17권11호
    • /
    • pp.2903-2923
    • /
    • 2023
  • Knowledge recommendation is a type of recommendation system that recommends knowledge content to users in order to satisfy their needs. Although using graph neural networks to extract data features is an effective method for solving the recommendation problem, there is information loss when modeling real-world problems because an edge in a graph structure can only be associated with two nodes. Because one super-edge in the hypergraph structure can be connected with several nodes and the effectiveness of knowledge graph for knowledge expression, a dual-channel hypergraph convolutional neural network model (DCHC) based on hypergraph structure and knowledge graph is proposed. The model divides user data and knowledge data into user subhypergraph and knowledge subhypergraph, respectively, and extracts user data features by dual-channel hypergraph convolution and knowledge data features by combining with knowledge graph technology, and finally generates recommendation results based on the obtained user embedding and knowledge embedding. The performance of DCHC model is higher than the comparative model under AUC and F1 evaluation indicators, comparative experiments with the baseline also demonstrate the validity of DCHC model.

A Study on Data Mining Application Problem in the TFT-LCD Industry

  • Lee, Hyun-Woo;Nam, Ho-Soo;Kang, Jung-Chul
    • Journal of the Korean Data and Information Science Society
    • /
    • 제16권4호
    • /
    • pp.823-833
    • /
    • 2005
  • This paper deals the TFT-LCD process and quality, process control problems of the process. For improvement of the process quality and yield, we apply a data mining technique to the LCD industry. And some unique quality features of the LCD process are also described. We describe some preceding researches first and relate to the TFT-LCD process and the problems of data mining in the process. Also we tried to observe the problems which need to solve first and the features from description below hazard must be considered a quality mining in LCD industry.

  • PDF

Exploring the Feasibility of Neural Networks for Criminal Propensity Detection through Facial Features Analysis

  • Amal Alshahrani;Sumayyah Albarakati;Reyouf Wasil;Hanan Farouquee;Maryam Alobthani;Someah Al-Qarni
    • International Journal of Computer Science & Network Security
    • /
    • 제24권5호
    • /
    • pp.11-20
    • /
    • 2024
  • While artificial neural networks are adept at identifying patterns, they can struggle to distinguish between actual correlations and false associations between extracted facial features and criminal behavior within the training data. These associations may not indicate causal connections. Socioeconomic factors, ethnicity, or even chance occurrences in the data can influence both facial features and criminal activity. Consequently, the artificial neural network might identify linked features without understanding the underlying cause. This raises concerns about incorrect linkages and potential misclassification of individuals based on features unrelated to criminal tendencies. To address this challenge, we propose a novel region-based training approach for artificial neural networks focused on criminal propensity detection. Instead of solely relying on overall facial recognition, the network would systematically analyze each facial feature in isolation. This fine-grained approach would enable the network to identify which specific features hold the strongest correlations with criminal activity within the training data. By focusing on these key features, the network can be optimized for more accurate and reliable criminal propensity prediction. This study examines the effectiveness of various algorithms for criminal propensity classification. We evaluate YOLO versions YOLOv5 and YOLOv8 alongside VGG-16. Our findings indicate that YOLO achieved the highest accuracy 0.93 in classifying criminal and non-criminal facial features. While these results are promising, we acknowledge the need for further research on bias and misclassification in criminal justice applications

Feature Selection Using Submodular Approach for Financial Big Data

  • Attigeri, Girija;Manohara Pai, M.M.;Pai, Radhika M.
    • Journal of Information Processing Systems
    • /
    • 제15권6호
    • /
    • pp.1306-1325
    • /
    • 2019
  • As the world is moving towards digitization, data is generated from various sources at a faster rate. It is getting humungous and is termed as big data. The financial sector is one domain which needs to leverage the big data being generated to identify financial risks, fraudulent activities, and so on. The design of predictive models for such financial big data is imperative for maintaining the health of the country's economics. Financial data has many features such as transaction history, repayment data, purchase data, investment data, and so on. The main problem in predictive algorithm is finding the right subset of representative features from which the predictive model can be constructed for a particular task. This paper proposes a correlation-based method using submodular optimization for selecting the optimum number of features and thereby, reducing the dimensions of the data for faster and better prediction. The important proposition is that the optimal feature subset should contain features having high correlation with the class label, but should not correlate with each other in the subset. Experiments are conducted to understand the effect of the various subsets on different classification algorithms for loan data. The IBM Bluemix BigData platform is used for experimentation along with the Spark notebook. The results indicate that the proposed approach achieves considerable accuracy with optimal subsets in significantly less execution time. The algorithm is also compared with the existing feature selection and extraction algorithms.