• Title/Summary/Keyword: constrained learning

Search Result 63, Processing Time 0.022 seconds

Enhancing LoRA Fine-tuning Performance Using Curriculum Learning

  • Daegeon Kim;Namgyu Kim
    • Journal of the Korea Society of Computer and Information
    • /
    • v.29 no.3
    • /
    • pp.43-54
    • /
    • 2024
  • Recently, there has been a lot of research on utilizing Language Models, and Large Language Models have achieved innovative results in various tasks. However, the practical application faces limitations due to the constrained resources and costs required to utilize Large Language Models. Consequently, there has been recent attention towards methods to effectively utilize models within given resources. Curriculum Learning, a methodology that categorizes training data according to difficulty and learns sequentially, has been attracting attention, but it has the limitation that the method of measuring difficulty is complex or not universal. Therefore, in this study, we propose a methodology based on data heterogeneity-based Curriculum Learning that measures the difficulty of data using reliable prior information and facilitates easy utilization across various tasks. To evaluate the performance of the proposed methodology, experiments were conducted using 5,000 specialized documents in the field of information communication technology and 4,917 documents in the field of healthcare. The results confirm that the proposed methodology outperforms traditional fine-tuning in terms of classification accuracy in both LoRA fine-tuning and full fine-tuning.

Opportunistic Spectrum Access Based on a Constrained Multi-Armed Bandit Formulation

  • Ai, Jing;Abouzeid, Alhussein A.
    • Journal of Communications and Networks
    • /
    • v.11 no.2
    • /
    • pp.134-147
    • /
    • 2009
  • Tracking and exploiting instantaneous spectrum opportunities are fundamental challenges in opportunistic spectrum access (OSA) in presence of the bursty traffic of primary users and the limited spectrum sensing capability of secondary users. In order to take advantage of the history of spectrum sensing and access decisions, a sequential decision framework is widely used to design optimal policies. However, many existing schemes, based on a partially observed Markov decision process (POMDP) framework, reveal that optimal policies are non-stationary in nature which renders them difficult to calculate and implement. Therefore, this work pursues stationary OSA policies, which are thereby efficient yet low-complexity, while still incorporating many practical factors, such as spectrum sensing errors and a priori unknown statistical spectrum knowledge. First, with an approximation on channel evolution, OSA is formulated in a multi-armed bandit (MAB) framework. As a result, the optimal policy is specified by the wellknown Gittins index rule, where the channel with the largest Gittins index is always selected. Then, closed-form formulas are derived for the Gittins indices with tunable approximation, and the design of a reinforcement learning algorithm is presented for calculating the Gittins indices, depending on whether the Markovian channel parameters are available a priori or not. Finally, the superiority of the scheme is presented via extensive experiments compared to other existing schemes in terms of the quality of policies and optimality.

Rear-Approaching Vehicle Detection Research using Region of Interesting based on Faster R-CNN (Faster R-CNN 기반의 관심영역 유사도를 이용한 후방 접근차량 검출 연구)

  • Lee, Yeung-Hak;Kim, Joong-Soo;Shim, Jae-Chnag
    • Journal of IKEEE
    • /
    • v.23 no.1
    • /
    • pp.235-241
    • /
    • 2019
  • In this paper, we propose a new algorithm to detect rear-approaching vehicle using the frame similarity of ROI(Region of Interest) based on deep learning algorithm for use in agricultural machinery systems. Since the vehicle detection system for agricultural machinery needs to detect only a vehicle approaching from the rear. we use Faster R-CNN model that shows excellent accuracy rate in deep learning for vehicle detection. And we proposed an algorithm that uses the frame similarity for ROI using constrained conditions. Experimental results show that the proposed method has a detection rate of 99.9% and reduced the false positive values.

A novel multi-feature model predictive control framework for seismically excited high-rise buildings

  • Katebi, Javad;Rad, Afshin Bahrami;Zand, Javad Palizvan
    • Structural Engineering and Mechanics
    • /
    • v.83 no.4
    • /
    • pp.537-549
    • /
    • 2022
  • In this paper, a novel multi-feature model predictive control (MPC) framework with real-time and adaptive performances is proposed for intelligent structural control in which some drawbacks of the algorithm including, complex control rule and non-optimality, are alleviated. Hence, Linear Programming (LP) is utilized to simplify the resulted control rule. Afterward, the Whale Optimization Algorithm (WOA) is applied to the optimal and adaptive tuning of the LP weights independently at each time step. The stochastic control rule is also achieved using Kalman Filter (KF) to handle noisy measurements. The Extreme Learning Machine (ELM) is then adopted to develop a data-driven and real-time control algorithm. The efficiency of the developed algorithm is then demonstrated by numerical simulation of a twenty-story high-rise benchmark building subjected to earthquake excitations. The competency of the proposed method is proven from the aspects of optimality, stochasticity, and adaptivity compared to the KF-based MPC (KMPC) and constrained MPC (CMPC) algorithms in vibration suppression of building structures. The average value for performance indices in the near-field and far-field (El earthquakes demonstrates a reduction up to 38.3% and 32.5% compared with KMPC and CMPC, respectively.

Lessons Learned from Institutionalization of ML (Machine Learning) Supported HR Services in the Existence of Multiple Institutional Logics

  • Gyeung-min Kim;Heesun Kim
    • Asia pacific journal of information systems
    • /
    • v.33 no.4
    • /
    • pp.1171-1187
    • /
    • 2023
  • This study explores how an organization has successfully implemented ML-supported HR services to resolve high employee turnover problems in the IT sector. The empirical setting of the research is where contradicting institutional logics exist among technical, HR, and business groups regarding the ML model development and use of the model predictions in HR services. Institutional framework is used to identify the roles of organizational actors and the legitimacy structures in the organizational environments that can shape or constrain the ML led organizational changes. In institutional theories, technology adoption and organizational change are not only constrained by organizational context, but also fostered through organizational actors' roles and efforts to increase the legitimacy for the change. This research found that when multiple contradicting institutional logics exist, legitimizing the establishment of an enabling environment for multiple logics to reconcile and for the project to move forward is critical. Industry-wide conditions, previous experiences with the pilot ML project, forming a TFT with clearly defined roles and responsibilities, and relevant KPIs are found to legitimize the HR team and the business division to collaborate with the technical personnel to launch ML-supported HR services.

Analysis and Design of Jumping Robot System Using the Model Transformation Method

  • Suh Jin-Ho;Yamakita Masaki
    • Journal of Electrical Engineering and Technology
    • /
    • v.1 no.2
    • /
    • pp.200-210
    • /
    • 2006
  • This paper proposes the motion generation method in which the movement of the 3-links leg subsystem in constrained to slider-link and a singular posture can be easily avoided. This method is the realization of jumping control moving in a vertical direction, which mimics a cat's behavior. To consider the movement from the point of the constraint mechanical system, a robotics system for realizing the motion will change its configuration according to the position. The effectiveness of the proposed scheme is illustrated by simulation and experimental results.

Beamforming Optimization Using Filterbank-based Frost Algorithm (필터뱅크 기반 프로스트 알고리즘을 이용한 빔포밍 최적화)

  • Park, Ji-Hoon;Lee, Sung-Joo;Hong, Jeong-Pyo;Jeong, Sang-Bae;Hahn, Min-Soo
    • MALSORI
    • /
    • no.66
    • /
    • pp.73-86
    • /
    • 2008
  • Beamforming is one of the spatial filtering techniques which extract only desired signals from noisy environments using microphone arrays. Fixed beamforming is a simple concept and easy to implement. However, it does not show good performance in real noisy conditions. As an adaptive beamforming, Frost algorithm can be a good candidate. It uses the concept of the linearly constrained minimum variance (LCMV) algorithm. The difference between the Frost and the LCMV algorithm is the error correction scheme which is very effective feature in the aspect of performance. In this paper, as quadrature mirror filtering (QMF)-based filterbank is utilized as the pre-processing of the Frost beamformning, the filter length and the learning rate of each band is optimized to improve the performance. The performance is measured by the signal-to-noise ratio (SNR) and the Bark's scale spectral distortion (BSD).

  • PDF

Constrained Learning Method of Bayesian Network Structure for Efficient Context Classification (효율적인 컨텍스트 분류를 위한 베이지안 네트워크 구조의 제한 학습)

  • 황금성;조성배
    • Proceedings of the Korean Information Science Society Conference
    • /
    • 2004.10a
    • /
    • pp.112-114
    • /
    • 2004
  • 지능형 로봇 에이전트 기술이 발전하면서 서비스 질을 높이기 위한 방법으로 컨텍스트의 활용성이 부각되고 있다. 하지만 컨텍스트 분류 기술들은 아직까지 초기 개발 단계이며 다양한 방법들이 시도되고 있다. 본 논문에서는 전문가의 지식과 학습된 지식을 함께 적용할 수 있고 사람이 그 내용을 이해하기 유리한 베이지안 네트워크(BN)를 이용한 컨텍스트 분류 방법을 제안한다. 일반적인 BN 구조 학습에 사전 지식 및 방향성, 연결 관계 범위를 부여할 수 있는 제한(Constraint)을 적용한 효율적인 컨텍스트 분류 방법을 소개하고, 몇 가지 비교 실험을 통해 기존 방법에 비해 전문가의 개입이 줄어들고 좀 더 신뢰성 있는 컨텍스트 분류기를 얻을 수 있음을 보인다.

  • PDF

Synthesis of four-bar linkage motion generation using optimization algorithms

  • Phukaokaew, Wisanu;Sleesongsom, Suwin;Panagant, Natee;Bureerat, Sujin
    • Advances in Computational Design
    • /
    • v.4 no.3
    • /
    • pp.197-210
    • /
    • 2019
  • Motion generation of a four-bar linkage is a type of mechanism synthesis that has a wide range of applications such as a pick-and-place operation in manufacturing. In this research, the use of meta-heuristics for motion generation of a four-bar linkage is demonstrated. Three problems of motion generation were posed as a constrained optimization probably using the weighted sum technique to handle two types of tracking errors. A simple penalty function technique was used to deal with design constraints while three meta-heuristics including differential evolution (DE), self-adaptive differential evolution (JADE) and teaching learning based optimization (TLBO) were employed to solve the problems. Comparative results and the effect of the constraint handling technique are illustrated and discussed.

TinyML Gamma Radiation Classifier

  • Moez Altayeb;Marco Zennaro;Ermanno Pietrosemoli
    • Nuclear Engineering and Technology
    • /
    • v.55 no.2
    • /
    • pp.443-451
    • /
    • 2023
  • Machine Learning has introduced many solutions in data science, but its application in IoT faces significant challenges, due to the limitations in memory size and processing capability of constrained devices. In this paper we design an automatic gamma radiation detection and identification embedded system that exploits the power of TinyML in a SiPM micro radiation sensor leveraging the Edge Impulse platform. The model is trained using real gamma source data enhanced by software augmentation algorithms. Tests show high accuracy in real time processing. This design has promising applications in general-purpose radiation detection and identification, nuclear safety, medical diagnosis and it is also amenable for deployment in small satellites.