Search | Korea Science

Research on Optimization Strategies for Random Forest Algorithms in Federated Learning Environments (연합 학습 환경에서의 랜덤 포레스트 알고리즘 최적화 전략 연구)

InSeo Song;KangYoon Lee
- The Journal of Bigdata
- /
- v.9 no.1
- /
- pp.101-113
- /
- 2024
Federated learning has garnered attention as an efficient method for training machine learning models in a distributed environment while maintaining data privacy and security. This study proposes a novel FedRFBagging algorithm to optimize the performance of random forest models in such federated learning environments. By dynamically adjusting the trees of local random forest models based on client-specific data characteristics, the proposed approach reduces communication costs and achieves high prediction accuracy even in environments with numerous clients. This method adapts to various data conditions, significantly enhancing model stability and training speed. While random forest models consist of multiple decision trees, transmitting all trees to the server in a federated learning environment results in exponentially increasing communication overhead, making their use impractical. Additionally, differences in data distribution among clients can lead to quality imbalances in the trees. To address this, the FedRFBagging algorithm selects only the highest-performing trees from each client for transmission to the server, which then reselects trees based on impurity values to construct the optimal global model. This reduces communication overhead and maintains high prediction performance across diverse data distributions. Although the global model reflects data from various clients, the data characteristics of each client may differ. To compensate for this, clients further train additional trees on the global model to perform local optimizations tailored to their data. This improves the overall model's prediction accuracy and adapts to changing data distributions. Our study demonstrates that the FedRFBagging algorithm effectively addresses the communication cost and performance issues associated with random forest models in federated learning environments, suggesting its applicability in such settings.
https://doi.org/10.36498/kbigdt.2024.9.1.101 인용 PDF

Dietary supplementation of protease and organic acid in poultry by-product meal-based diet in broilers

Muhammad Ahsan Yaseen; Waqar Iqbal;Shaukat Ali Bhatti;Muhammad Saif ur Rehman;Asghar Subhani;Muhammad Shoaib;Muhammad Aziz ur Rahman;Muhammad Umar Yaqoob
- Animal Bioscience
- /
- v.37 no.12
- /
- pp.2145-2154
- /
- 2024
Objective: This study investigated the impact of supplementation of protease and organic acid on growth performance and other biological parameters in broilers fed poultry by-product meal (PBM) based diet. Methods: Five hundred 1-day-old broiler chicks (Ross 308) were distributed into five treatments with 5 replicates, each pen having 20 birds, and fed each group one of five isocaloric and isonitrogenous diets in two phases: stater phase (1 to 21 days) metabolizable energy (ME) 3000 kcal/kg; crude protein (CP) 22%, and a finisher phase (22 to 35 days) ME 3,200 kcal/kg; CP 19.5%. The dietary treatments were: i) standard broiler ration (Cont); ii) The control diet with 25% of the soybean meal replaced by PBM on an equivalent protein basis (PBM); iii) PBM diet supplemented with 0.5 g/kg of protease (PBMP); iv) PBM diet supplemented with 1 g/kg organic acid (PBMO); and v) PBM diet addition with 0.5 g/kg protease and 1 g/kg organic acid (PBMPO). Results: The overall data showed that feed conversion ratio was improved (p<0.05) in the PBMP group. Apparent CP digestibility was higher (p<0.05) in both Cont and PBMP groups. Jejunal villus height increased (p<0.05) in PBMP and PBMPO groups, while only the PBMO group exhibited a higher (p<0.05) crypt depth. Lipase activity was increased (p<0.05) in the PBMP, PBMO, and PBMPO dietary treatments. However, trypsin activity showed a significant increase (p<0.05) in the PBMP and PBMO groups. Serum biochemistry increased (p<0.05) globulin and total protein levels in the PBMP group. Conclusion: PBM could partially replace the soybean meal with supplementation of either protease or organic acid in broiler diets without impairing overall growth performance. Furthermore, careful optimization must be considered when combining protease and organic acids.
https://doi.org/10.5713/ab.24.0136 인용 PDF

Ensemble Learning with Support Vector Machines for Bond Rating (회사채 신용등급 예측을 위한 SVM 앙상블학습)

Kim, Myoung-Jong
- Journal of Intelligence and Information Systems
- /
- v.18 no.2
- /
- pp.29-45
- /
- 2012
Bond rating is regarded as an important event for measuring financial risk of companies and for determining the investment returns of investors. As a result, it has been a popular research topic for researchers to predict companies' credit ratings by applying statistical and machine learning techniques. The statistical techniques, including multiple regression, multiple discriminant analysis (MDA), logistic models (LOGIT), and probit analysis, have been traditionally used in bond rating. However, one major drawback is that it should be based on strict assumptions. Such strict assumptions include linearity, normality, independence among predictor variables and pre-existing functional forms relating the criterion variablesand the predictor variables. Those strict assumptions of traditional statistics have limited their application to the real world. Machine learning techniques also used in bond rating prediction models include decision trees (DT), neural networks (NN), and Support Vector Machine (SVM). Especially, SVM is recognized as a new and promising classification and regression analysis method. SVM learns a separating hyperplane that can maximize the margin between two categories. SVM is simple enough to be analyzed mathematical, and leads to high performance in practical applications. SVM implements the structuralrisk minimization principle and searches to minimize an upper bound of the generalization error. In addition, the solution of SVM may be a global optimum and thus, overfitting is unlikely to occur with SVM. In addition, SVM does not require too many data sample for training since it builds prediction models by only using some representative sample near the boundaries called support vectors. A number of experimental researches have indicated that SVM has been successfully applied in a variety of pattern recognition fields. However, there are three major drawbacks that can be potential causes for degrading SVM's performance. First, SVM is originally proposed for solving binary-class classification problems. Methods for combining SVMs for multi-class classification such as One-Against-One, One-Against-All have been proposed, but they do not improve the performance in multi-class classification problem as much as SVM for binary-class classification. Second, approximation algorithms (e.g. decomposition methods, sequential minimal optimization algorithm) could be used for effective multi-class computation to reduce computation time, but it could deteriorate classification performance. Third, the difficulty in multi-class prediction problems is in data imbalance problem that can occur when the number of instances in one class greatly outnumbers the number of instances in the other class. Such data sets often cause a default classifier to be built due to skewed boundary and thus the reduction in the classification accuracy of such a classifier. SVM ensemble learning is one of machine learning methods to cope with the above drawbacks. Ensemble learning is a method for improving the performance of classification and prediction algorithms. AdaBoost is one of the widely used ensemble learning techniques. It constructs a composite classifier by sequentially training classifiers while increasing weight on the misclassified observations through iterations. The observations that are incorrectly predicted by previous classifiers are chosen more often than examples that are correctly predicted. Thus Boosting attempts to produce new classifiers that are better able to predict examples for which the current ensemble's performance is poor. In this way, it can reinforce the training of the misclassified observations of the minority class. This paper proposes a multiclass Geometric Mean-based Boosting (MGM-Boost) to resolve multiclass prediction problem. Since MGM-Boost introduces the notion of geometric mean into AdaBoost, it can perform learning process considering the geometric mean-based accuracy and errors of multiclass. This study applies MGM-Boost to the real-world bond rating case for Korean companies to examine the feasibility of MGM-Boost. 10-fold cross validations for threetimes with different random seeds are performed in order to ensure that the comparison among three different classifiers does not happen by chance. For each of 10-fold cross validation, the entire data set is first partitioned into tenequal-sized sets, and then each set is in turn used as the test set while the classifier trains on the other nine sets. That is, cross-validated folds have been tested independently of each algorithm. Through these steps, we have obtained the results for classifiers on each of the 30 experiments. In the comparison of arithmetic mean-based prediction accuracy between individual classifiers, MGM-Boost (52.95%) shows higher prediction accuracy than both AdaBoost (51.69%) and SVM (49.47%). MGM-Boost (28.12%) also shows the higher prediction accuracy than AdaBoost (24.65%) and SVM (15.42%)in terms of geometric mean-based prediction accuracy. T-test is used to examine whether the performance of each classifiers for 30 folds is significantly different. The results indicate that performance of MGM-Boost is significantly different from AdaBoost and SVM classifiers at 1% level. These results mean that MGM-Boost can provide robust and stable solutions to multi-classproblems such as bond rating.
https://doi.org/10.13088/jiis.2012.18.2.029 인용 PDF KSCI

Performance Evaluation of User Mobility Management Scheme based-on Dwell Time Optimization for Effective Inter-working with Heterogeneous Networks under Cognitive Networking Environments (인지 네트워킹 환경 하에서 체류시간 관리 최적화를 통한 사용자 이동성 모델 기반 이동성 관리방법의 성능평가)

Choi, Yu-Mi;Kim, Jeong-Ho
- Journal of the Institute of Electronics Engineers of Korea TC
- /
- v.49 no.5
- /
- pp.77-83
- /
- 2012
The importance of mobility management is becoming to be one of the upcomming issues to be addressed to provide the converged services and the convergence of the heterogeneous network environments. In this paper, the new user mobility management scheme which can be utilized to model the user's mobility behaviors for interworking with heterogeneous overlay convergent networks under the time-varying radio propagation environment has been proposed. Thus user mobility management scheme based on user mobility model is considered in order to optimize the dwell time of users in the overlay convergent networks. This Mobile IP user mobility management will be very useful to model the user mobility behaviors and can be used to estimate the signaling traffic and frequency spectrum demands for massive data transfer for the heterogeneous overlay convergent networks.
PDF KSCI

The Study on Marker-less Tracking Algorithm Performance based on Mobile Augmented Reality (모바일 증강현실 기반의 마커리스 추적 알고리즘 성능 연구)

Yoon, Ji-Yean;Moon, Il-Young
- Journal of Advanced Navigation Technology
- /
- v.16 no.6
- /
- pp.1032-1037
- /
- 2012
Augmented reality (AR) is augmented virtual information on the real world with real-time. And user can interact with information. In this paper, Marker-less tracking algorithm has been studied, for implement the augmented reality system on a mobile environment. In marker-less augmented reality, users do not need to attach the markers, and constrained the location. So, it's convenient to use. For marker-less tracking, I use the SURF algorithm based on feature point extraction in this paper. The SURF algorithm can be used on mobile devices because of the computational complexity is low. However, the SURF algorithm optimization work is not suitable for mobile devices. Therefore, in this paper, in order to the suitable tracking in mobile devices, the SURF algorithm was tested in a variety of environments. And ways to optimize has been studied.
https://doi.org/10.12673/jkoni.2012.16.6.1032 인용 PDF KSCI

Symbolizing Numbers to Improve Neural Machine Translation (숫자 기호화를 통한 신경기계번역 성능 향상)

Kang, Cheongwoong;Ro, Youngheon;Kim, Jisu;Choi, Heeyoul
- Journal of Digital Contents Society
- /
- v.19 no.6
- /
- pp.1161-1167
- /
- 2018
The development of machine learning has enabled machines to perform delicate tasks that only humans could do, and thus many companies have introduced machine learning based translators. Existing translators have good performances but they have problems in number translation. The translators often mistranslate numbers when the input sentence includes a large number. Furthermore, the output sentence structure completely changes even if only one number in the input sentence changes. In this paper, first, we optimized a neural machine translation model architecture that uses bidirectional RNN, LSTM, and the attention mechanism through data cleansing and changing the dictionary size. Then, we implemented a number-processing algorithm specialized in number translation and applied it to the neural machine translation model to solve the problems above. The paper includes the data cleansing method, an optimal dictionary size and the number-processing algorithm, as well as experiment results for translation performance based on the BLEU score.
https://doi.org/10.9728/dcs.2018.19.6.1161 인용 KSCI

FinFET Gate Resistance Modeling and Optimization (FinFET 게이트 저항 압축 모델 개발 및 최적화)

Lee, SoonCheol;Kwon, Kee-Won;Kim, SoYoung
- Journal of the Institute of Electronics and Information Engineers
- /
- v.51 no.8
- /
- pp.30-37
- /
- 2014
In this paper, the compact model for FinFET gate resistance is developed. Based on the FinFET geometry and material, the value of the gate resistance is extracted by Y-parameter analysis using 3D device simulator, Sentaurus. By dividing the gate resistance into horizontal and vertical components, the proposed gate resistance model captures the non-linear characteristics. The proposed compact model reflects the realistic gate structure which has two different materials (Tungsten, TiN) stacked. Using the proposed model, the number of fins for the minimum gate resistance can be proposed based on the variation of gate geometrical parameters. The proposed gate resistance model is implemented in BSIM-CMG. A ring-oscillator is designed, and its delay performance is compared with and without gate resistance.
https://doi.org/10.5573/ieie.2014.51.8.030 인용 PDF KSCI

QoS-, Energy- and Cost-efficient Resource Allocation for Cloud-based Interactive TV Applications

Kulupana, Gosala;Talagala, Dumidu S.;Arachchi, Hemantha Kodikara;Fernando, Anil
- IEIE Transactions on Smart Processing and Computing
- /
- v.6 no.3
- /
- pp.158-167
- /
- 2017
Internet-based social and interactive video applications have become major constituents of the envisaged applications for next-generation multimedia networks. However, inherently dynamic network conditions, together with varying user expectations, pose many challenges for resource allocation mechanisms for such applications. Yet, in addition to addressing these challenges, service providers must also consider how to mitigate their operational costs (e.g., energy costs, equipment costs) while satisfying the end-user quality of service (QoS) expectations. This paper proposes a heuristic solution to the problem, where the energy incurred by the applications, and the monetary costs associated with the service infrastructure, are minimized while simultaneously maximizing the average end-user QoS. We evaluate the performance of the proposed solution in terms of serving probability, i.e., the likelihood of being able to allocate resources to groups of users, the computation time of the resource allocation process, and the adaptability and sensitivity to dynamic network conditions. The proposed method demonstrates improvements in serving probability of up to 27%, in comparison with greedy resource allocation schemes, and a several-orders-of-magnitude reduction in computation time, compared to the linear programming approach, which significantly reduces the service-interrupted user percentage when operating under variable network conditions.
https://doi.org/10.5573/IEIESPC.2017.6.3.158 인용 PDF KSCI

The Way of Establishing Weights for IS Evaluation Areas and Items by means of AHP : Focusing on Public Sector (계층분석기법을 이용한 정보시스템 평가영역 및 평가항목별 가중치 설정 방안： 공공부문을 중심으로)

Jung Haeyong;Kim Sanghoon
- Journal of Information Technology Applications and Management
- /
- v.11 no.4
- /
- pp.61-85
- /
- 2004
It is tried that evaluation areas and items of information system in public sector are derived ration-ally and its weight value call be applied differently to type of information system to enhance validity and objectiveness of measurement in evaluating IS in this research. To obtain the goal of this research, firstly, five sectors - system sector, user sector, organization and management sector, the degree of strategic contribution to IS, and the degree of optimizing re-source in IS - are categorized based on broadly reviewing previous theoretical and practical research. Secondly, IS type in public sector is divided into internal operation one and customer oriented one that is object of the IS, and divided into application oriented and IT infrastructure oriented which are influence by IS. Thirdly, evaluation areas and its items are measured by 5 point scales (Likert summated scales) in addition to analysis of validity and reliability to improve objectiveness of establishing evaluation areas and its items. Fourthly, the weight values in the evaluation areas and its items are derived by using analytic hierarchy process. According to the results of analysis of weight value through AHP, it were found to be 30.4% to organization and management sector. 25.5% to degree of strategic contribution, 21.0% to user sector, 13.5% to degree of optimization of resource management, and 9.6% to system sector. and. different weight values each of the four IS type are proposed which establishing in this research. The main implications of this study is that the criteria by which IS in public sector can be categorized 4 ones is suggested and The weighted evaluation for four types of IS based on the AHP analysis enables proposing an objective evaluation method of IS in public sector for considering individual IS characterics.
PDF

A Study on the Redesign of the Two-Stage Axial Compressor for Helicopter Engines (헬리콥터용 2단 축류압축기의 재설계에 관한 연구)

Kim, Jin-Han;Choi, Chang-Ho;Kim, Chul-Taek;Yang, Sooseok;Lee, Daesung
- The KSFM Journal of Fluid Machinery
- /
- v.4 no.1 s.10
- /
- pp.7-13
- /
- 2001
In developing a multistage compressor, the stage matching is one of the critical design issues. The mismatching can be often observed even if each stage has been proven good and then used as part of a compression system. A good matching among the stages can be achieved by changing various design parameters (i.e., passage cross sectional areas, blades angles, stagger angles, curvature, solidity, etc.). Therefore, designers need to find out what parameters must be changed and how much. In this study, a method to search the design parameters for optimum stage matching has been used based on an 1-D mathematical model of a compressor, which uses the data obtained from the preliminary test to identify the design parameters. This methodology is applied with a two-stage axial compressor, which was originally designed for a helicopter gas turbine engine. After identifying design parameters using preliminary test data, an optimization process has been employed to achieve the best matching between the stages (i.e., maximum efficiency of the compressor at its operation modes within a given range of the rotor speed under given restrictions for required stall margins and mass flow). 3-D flow calculations have been performed to confirm the usefulness of the corrections based on the 1-D mathematical model. Calculational results agree well with the experimental data in view of the performance characteristics. Some promising results were produced through the methodology proposed in this paper in conjunction with flow calculations.
PDF

Search Result 2,574, Processing Time 0.035 seconds

이메일무단수집거부

이용약관

제 1 장 총칙

제 2 장 이용계약의 체결

제 3 장 계약 당사자의 의무

제 4 장 서비스의 이용

제 5 장 계약 해지 및 이용 제한

제 6 장 손해배상 및 기타사항

Detail Search

Image Search (β)