• Title/Summary/Keyword: Heterogeneous data

Search Result 1,320, Processing Time 0.025 seconds

Data-centric Sensor Middleware for Heterogeneous Sensor Networks (이기종 센서 네트워크를 위한 데이터 중심적 센서 미들웨어)

  • Nam, Choon-Sung;Shin, Dong-Ryeol
    • IEMEK Journal of Embedded Systems and Applications
    • /
    • v.7 no.6
    • /
    • pp.323-330
    • /
    • 2012
  • Wireless sensor networks need middleware system for efficiently managing the constrained resource and sensing data because they need different sensing data type and protocol to communicate with heterogeneous sensor networks. Thus this paper proposes data-centric sensor middleware for heterogeneous sensor networks. The proposed middleware have to support various query processing of user applications, high-level request of users, manage heterogeneous sensor systems and universal sensing data type for node and user application.

Heterogeneous Ensemble of Classifiers from Under-Sampled and Over-Sampled Data for Imbalanced Data

  • Kang, Dae-Ki;Han, Min-gyu
    • International journal of advanced smart convergence
    • /
    • v.8 no.1
    • /
    • pp.75-81
    • /
    • 2019
  • Data imbalance problem is common and causes serious problem in machine learning process. Sampling is one of the effective methods for solving data imbalance problem. Over-sampling increases the number of instances, so when over-sampling is applied in imbalanced data, it is applied to minority instances. Under-sampling reduces instances, which usually is performed on majority data. We apply under-sampling and over-sampling to imbalanced data and generate sampled data sets. From the generated data sets from sampling and original data set, we construct a heterogeneous ensemble of classifiers. We apply five different algorithms to the heterogeneous ensemble. Experimental results on an intrusion detection dataset as an imbalanced datasets show that our approach shows effective results.

Energy-efficient Real-time Computing by Utilizing Heterogenous Wireless Interfaces of the Smart Mobile Device in Small-IoT Environments (Small-IoT 환경에서 이기종 네트워크를 활용한 스마트 모바일 단말의 에너지 효율적 실시간 컴퓨팅 기법)

  • Lim, Sung-Hwa
    • Journal of the Semiconductor & Display Technology
    • /
    • v.20 no.3
    • /
    • pp.108-112
    • /
    • 2021
  • For smart mobile devices, the wireless communication module is one of the hardware modules that consume the most energy. If we can build a multi-channel multi-interface environment using heterogeneous communication modules and operate them dynamically, data transmission performance can be highly improved by increasing the parallelism. Also, because these heterogeneous modules have different data rates, transmission ranges, and power consumption, we can save energy by exploiting a power efficient and low speed wireless interface module to transmit/receive sporadic small data. In this paper, we propose a power efficient data transmission method using heterogeneous communication networks. We also compared the performance of our proposed scheme to a conventional scheme, and proved that our proposed scheme can save energy while guaranteeing reasonable data delivery time.

Global Recovery Management Protocol for Heterogeneous System in Security Environments (보안환경에서 이질형 시스템의 전역 복구 관리 프로토콜)

  • Jeong, Hyun Cheol
    • Journal of Korea Society of Digital Industry and Information Management
    • /
    • v.5 no.4
    • /
    • pp.51-59
    • /
    • 2009
  • Many failures are due to incorrectly programmed transactions and data entry errors. System failure causes the loss or corruption of the contents of volatile storage. Although global processing protects data values to detect direct or indirect information effluence, security environments are very important in the recovery management of heterogeneous systems. Although transaction can't control system fault, the restart for the system can cause information effluence by low bandwith. From various faults, it is not easy to maintain the consistency and security of data. This paper proposes recovery management protocols to assure global multilevel secure one-copy quasi-serializability in security environments of heterogeneous systems with replicated data and proves its correctness. The proposed secure protocols guarantee the reliability and security of system when the system fault is happened.

Design of Interworking Technology for Heterogeneous Medical Device Networks in Smart Healthcare Environments (스마트 의료 환경에서 이기종 네트워크 간 연동 기술 설계)

  • Kim, Minjin;Lee, Seunghan;Kim, Jaesoo
    • Journal of Korea Society of Digital Industry and Information Management
    • /
    • v.11 no.4
    • /
    • pp.25-31
    • /
    • 2015
  • Smart healthcare environments which merge medical and IT technology are getting ready for the third generation centering EHR from current second generation. As a basic technology for the introduction and activation of EHR systems it requires heterogeneous network interworking techniques between various wired and wireless medical devices. Interworking technology for heterogeneous network among various medical devices is needed to introduce EHR system. The heterogeneous network interworking technology is needed for construction of a reliable data system to convert each of unstructured data into structured data. Therefore, in this paper, we identify the domestic and international trends of smart medical field and analyze the characteristics of wired and wireless communication technology that is used in a heterogeneous network. and also suggest requirements needed for interworking technology and provide interworking technology based on them. we expect that proposed method which is designed for smart healthcare environments would provide a basic architecture needed for third smart medical technology generation.

Multivariate Procedure for Variable Selection and Classification of High Dimensional Heterogeneous Data

  • Mehmood, Tahir;Rasheed, Zahid
    • Communications for Statistical Applications and Methods
    • /
    • v.22 no.6
    • /
    • pp.575-587
    • /
    • 2015
  • The development in data collection techniques results in high dimensional data sets, where discrimination is an important and commonly encountered problem that are crucial to resolve when high dimensional data is heterogeneous (non-common variance covariance structure for classes). An example of this is to classify microbial habitat preferences based on codon/bi-codon usage. Habitat preference is important to study for evolutionary genetic relationships and may help industry produce specific enzymes. Most classification procedures assume homogeneity (common variance covariance structure for all classes), which is not guaranteed in most high dimensional data sets. We have introduced regularized elimination in partial least square coupled with QDA (rePLS-QDA) for the parsimonious variable selection and classification of high dimensional heterogeneous data sets based on recently introduced regularized elimination for variable selection in partial least square (rePLS) and heterogeneous classification procedure quadratic discriminant analysis (QDA). A comparison of proposed and existing methods is conducted over the simulated data set; in addition, the proposed procedure is implemented to classify microbial habitat preferences by their codon/bi-codon usage. Five bacterial habitats (Aquatic, Host Associated, Multiple, Specialized and Terrestrial) are modeled. The classification accuracy of each habitat is satisfactory and ranges from 89.1% to 100% on test data. Interesting codon/bi-codons usage, their mutual interactions influential for respective habitat preference are identified. The proposed method also produced results that concurred with known biological characteristics that will help researchers better understand divergence of species.

Weighted Adaptive Opportunistic Scheduling Framework for Smartphone Sensor Data Collection in IoT

  • M, Thejaswini;Choi, Bong Jun
    • KSII Transactions on Internet and Information Systems (TIIS)
    • /
    • v.13 no.12
    • /
    • pp.5805-5825
    • /
    • 2019
  • Smartphones are important platforms because of their sophisticated computation, communication, and sensing capabilities, which enable a variety of applications in the Internet of Things (IoT) systems. Moreover, advancements in hardware have enabled sensors on smartphones such as environmental and chemical sensors that make sensor data collection readily accessible for a wide range of applications. However, dynamic, opportunistic, and heterogeneous mobility patterns of smartphone users that vary throughout the day, which greatly affects the efficacy of sensor data collection. Therefore, it is necessary to consider phone users mobility patterns to design data collection schedules that can reduce the loss of sensor data. In this paper, we propose a mobility-based weighted adaptive opportunistic scheduling framework that can adaptively adjust to the dynamic, opportunistic, and heterogeneous mobility patterns of smartphone users and provide prioritized scheduling based on various application scenarios, such as velocity, region of interest, and sensor type. The performance of the proposed framework is compared with other scheduling frameworks in various heterogeneous smartphone user mobility scenarios. Simulation results show that the proposed scheduling improves the transmission rate by 8 percent and can also improve the collection of higher-priority sensor data compared with other scheduling approaches.

Development of Database Broker for Enterprise Integration (기업통합을 위한 데이터베이스 브로커 개발)

  • 신혜균;김정선;우훈식
    • The Journal of Society for e-Business Studies
    • /
    • v.5 no.1
    • /
    • pp.105-122
    • /
    • 2000
  • Enterprise integration and virtual enterprise are practical visions of industrial information implementations in the information society. In these environments, the distributed and heterogeneous data sources should be exchanged and shared in effective and integrated way. However, the distributed and heterogeneous data sources are managed by independent and heterogeneous computer systems, thus system users and developers are faced difficulties in implementing enterprise integration environments, In this study, we designed and developed a database broker system utilizing a routing broker method to provide transparent location access mechanisms for the distributed and heterogeneous data sources. The proposed mechanism is designed to act as a middle tier between clients and multiple servers, and adopts Java, CORBA, and JDBC as its state-of-the-art techniques.

  • PDF

The extension of the largest generalized-eigenvalue based distance metric Dij1) in arbitrary feature spaces to classify composite data points

  • Daoud, Mosaab
    • Genomics & Informatics
    • /
    • v.17 no.4
    • /
    • pp.39.1-39.20
    • /
    • 2019
  • Analyzing patterns in data points embedded in linear and non-linear feature spaces is considered as one of the common research problems among different research areas, for example: data mining, machine learning, pattern recognition, and multivariate analysis. In this paper, data points are heterogeneous sets of biosequences (composite data points). A composite data point is a set of ordinary data points (e.g., set of feature vectors). We theoretically extend the derivation of the largest generalized eigenvalue-based distance metric Dij1) in any linear and non-linear feature spaces. We prove that Dij1) is a metric under any linear and non-linear feature transformation function. We show the sufficiency and efficiency of using the decision rule $\bar{{\delta}}_{{\Xi}i}$(i.e., mean of Dij1)) in classification of heterogeneous sets of biosequences compared with the decision rules min𝚵iand median𝚵i. We analyze the impact of linear and non-linear transformation functions on classifying/clustering collections of heterogeneous sets of biosequences. The impact of the length of a sequence in a heterogeneous sequence-set generated by simulation on the classification and clustering results in linear and non-linear feature spaces is empirically shown in this paper. We propose a new concept: the limiting dispersion map of the existing clusters in heterogeneous sets of biosequences embedded in linear and nonlinear feature spaces, which is based on the limiting distribution of nucleotide compositions estimated from real data sets. Finally, the empirical conclusions and the scientific evidences are deduced from the experiments to support the theoretical side stated in this paper.

Semantic Correspondence of Database Schema from Heterogeneous Databases using Self-Organizing Map

  • Dumlao, Menchita F.;Oh, Byung-Joo
    • Journal of IKEEE
    • /
    • v.12 no.4
    • /
    • pp.217-224
    • /
    • 2008
  • This paper provides a framework for semantic correspondence of heterogeneous databases using self- organizing map. It solves the problem of overlapping between different databases due to their different schemas. Clustering technique using self-organizing maps (SOM) is tested and evaluated to assess its performance when using different kinds of data. Preprocessing of database is performed prior to clustering using edit distance algorithm, principal component analysis (PCA), and normalization function to identify the features necessary for clustering.

  • PDF