Search | Korea Science

Applying CEE (CrossEntropyError) to improve performance of Q-Learning algorithm (Q-learning 알고리즘이 성능 향상을 위한 CEE(CrossEntropyError)적용)

Kang, Hyun-Gu;Seo, Dong-Sung;Lee, Byeong-seok;Kang, Min-Soo
- Korean Journal of Artificial Intelligence
- /
- v.5 no.1
- /
- pp.1-9
- /
- 2017
Recently, the Q-Learning algorithm, which is one kind of reinforcement learning, is mainly used to implement artificial intelligence system in combination with deep learning. Many research is going on to improve the performance of Q-Learning. Therefore, purpose of theory try to improve the performance of Q-Learning algorithm. This Theory apply Cross Entropy Error to the loss function of Q-Learning algorithm. Since the mean squared error used in Q-Learning is difficult to measure the exact error rate, the Cross Entropy Error, known to be highly accurate, is applied to the loss function. Experimental results show that the success rate of the Mean Squared Error used in the existing reinforcement learning was about 12% and the Cross Entropy Error used in the deep learning was about 36%. The success rate was shown.
https://doi.org/10.24225/kjai.2017.5.1.1 인용 PDF

Deriving a New Divergence Measure from Extended Cross-Entropy Error Function

Oh, Sang-Hoon;Wakuya, Hiroshi;Park, Sun-Gyu;Noh, Hwang-Woo;Yoo, Jae-Soo;Min, Byung-Won;Oh, Yong-Sun
- International Journal of Contents
- /
- v.11 no.2
- /
- pp.57-62
- /
- 2015
Relative entropy is a divergence measure between two probability density functions of a random variable. Assuming that the random variable has only two alphabets, the relative entropy becomes a cross-entropy error function that can accelerate training convergence of multi-layer perceptron neural networks. Also, the n-th order extension of cross-entropy (nCE) error function exhibits an improved performance in viewpoints of learning convergence and generalization capability. In this paper, we derive a new divergence measure between two probability density functions from the nCE error function. And the new divergence measure is compared with the relative entropy through the use of three-dimensional plots.
https://doi.org/10.5392/IJoC.2015.11.2.057 인용 PDF KSCI KPUBS HTML

A Modified Error Function to Improve the Error Back-Propagation Algorithm for Multi-Layer Perceptrons

Oh, Sang-Hoon;Lee, Young-Jik
- ETRI Journal
- /
- v.17 no.1
- /
- pp.11-22
- /
- 1995
This paper proposes a modified error function to improve the error back-propagation (EBP) algorithm for multi-Layer perceptrons (MLPs) which suffers from slow learning speed. It can also suppress over-specialization for training patterns that occurs in an algorithm based on a cross-entropy cost function which markedly reduces learning time. In the similar way as the cross-entropy function, our new function accelerates the learning speed of the EBP algorithm by allowing the output node of the MLP to generate a strong error signal when the output node is far from the desired value. Moreover, it prevents the overspecialization of learning for training patterns by letting the output node, whose value is close to the desired value, generate a weak error signal. In a simulation study to classify handwritten digits in the CEDAR [1] database, the proposed method attained 100% correct classification for the training patterns after only 50 sweeps of learning, while the original EBP attained only 98.8% after 500 sweeps. Also, our method shows mean-squared error of 0.627 for the test patterns, which is superior to the error 0.667 in the cross-entropy method. These results demonstrate that our new method excels others in learning speed as well as in generalization.
PDF

Multi-labeled Domain Detection Using CNN (CNN을 이용한 발화 주제 다중 분류)

Choi, Kyoungho;Kim, Kyungduk;Kim, Yonghe;Kang, Inho
- 한국어정보학회:학술대회논문집
- /
- 2017.10a
- /
- pp.56-59
- /
- 2017
CNN(Convolutional Neural Network)을 이용하여 발화 주제 다중 분류 task를 multi-labeling 방법과, cluster 방법을 이용하여 수행하고, 각 방법론에 MSE(Mean Square Error), softmax cross-entropy, sigmoid cross-entropy를 적용하여 성능을 평가하였다. Network는 음절 단위로 tokenize하고, 품사정보를 각 token의 추가한 sequence와, Naver DB를 통하여 얻은 named entity 정보를 입력으로 사용한다. 실험결과 cluster 방법으로 문제를 변형하고, sigmoid를 output layer의 activation function으로 사용하고 cross entropy cost function을 이용하여 network를 학습시켰을 때 F1 0.9873으로 가장 좋은 성능을 보였다.
PDF

Application of Subarray Averaging and Entropy Minimization Algorithm to Stepped-Frequency ISAR Autofocus (부배열 평균과 엔트로피 최소화 기법을 이용한 stepped-frequency ISAR 자동초점 기법 성능 향상 연구)

Jeong, Ho-Ryung;Kim, Kyung-Tae;Lee, Dong-Han;Seo, Du-Chun;Song, Jeong-Heon;Choi, Myung-Jin;Lim, Hyo-Suk
- Proceedings of the KSRS Conference
- /
- 2008.03a
- /
- pp.158-163
- /
- 2008
In inverse synthetic aperture radar (ISAR) imaging, An ISAR autofocusing algorithm is essential to obtain well-focused ISAR images. Traditional methods have relied on the approximation that the phase error due to target motion is a function of the cross-range dimension only. However, in the stepped-frequency radar system, it tends to become a two-dimensional function of both down-range and cross-range, especially when target's movement is very fast and the pulse repetition frequency (PRF) is low. In order to remove the phase error along down-range, this paper proposes a method called SAEM (subarray averaging and entropy minimization) [1] that uses a subarray averaging concept in conjunction with the entropy cost function in order to find target motion parameters, and a novel 2-D optimization technique with the inherent properties of the proposed entropy-based cost function. A well-focused ISAR image can be obtained from the combination of the proposed method and a traditional autofocus algorithm that removes the phase error along the cross-range dimension. The effectiveness of this method is illustrated and analyzed with simulated targets comprised of point scatters.
PDF

Comparative Analysis on Error Back Propagation Learning and Layer By Layer Learning in Multi Layer Perceptrons (다층퍼셉트론의 오류역전파 학습과 계층별 학습의 비교 분석)

곽영태
- Journal of the Korea Institute of Information and Communication Engineering
- /
- v.7 no.5
- /
- pp.1044-1051
- /
- 2003
This paper surveys the EBP(Error Back Propagation) learning, the Cross Entropy function and the LBL(Layer By Layer) learning, which are used for learning the MLP(Multi Layer Perceptrons). We compare the merits and demerits of each learning method in the handwritten digit recognition. Although the speed of EBP learning is slower than other learning methods in the initial learning process, its generalization capability is better. Also, the speed of Cross Entropy function that makes up for the weak points of EBP learning is faster than that of EBP learning. But its generalization capability is worse because the error signal of the output layer trains the target vector linearly. The speed of LBL learning is the fastest speed among the other learning methods in the initial learning process. However, it can't train for more after a certain time, it has the lowest generalization capability. Therefore, this paper proposes the standard of selecting the learning method when we apply the MLP.
PDF KSCI

Contour Plots of Objective Functions for Feed-Forward Neural Networks

Oh, Sang-Hoon
- International Journal of Contents
- /
- v.8 no.4
- /
- pp.30-35
- /
- 2012
Error surfaces provide us with very important information for training of feed-forward neural networks (FNNs). In this paper, we draw the contour plots of various error or objective functions for training of FNNs. Firstly, when applying FNNs to classifications, the weakness of mean-squared error is explained with the viewpoint of error contour plot. And the classification figure of merit, mean log-square error, cross-entropy error, and n-th order extension of cross-entropy error objective functions are considered for the contour plots. Also, the recently proposed target node method is explained with the viewpoint of contour plot. Based on the contour plots, we can explain characteristics of various error or objective functions when training of FNNs proceeds.
https://doi.org/10.5392/IJoC.2012.8.4.030 인용 PDF KSCI

Tri-training algorithm based on cross entropy and K-nearest neighbors for network intrusion detection

Zhao, Jia;Li, Song;Wu, Runxiu;Zhang, Yiying;Zhang, Bo;Han, Longzhe
- KSII Transactions on Internet and Information Systems (TIIS)
- /
- v.16 no.12
- /
- pp.3889-3903
- /
- 2022
To address the problem of low detection accuracy due to training noise caused by mislabeling when Tri-training for network intrusion detection (NID), we propose a Tri-training algorithm based on cross entropy and K-nearest neighbors (TCK) for network intrusion detection. The proposed algorithm uses cross-entropy to replace the classification error rate to better identify the difference between the practical and predicted distributions of the model and reduce the prediction bias of mislabeled data to unlabeled data; K-nearest neighbors are used to remove the mislabeled data and reduce the number of mislabeled data. In order to verify the effectiveness of the algorithm proposed in this paper, experiments were conducted on 12 UCI datasets and NSL-KDD network intrusion datasets, and four indexes including accuracy, recall, F-measure and precision were used for comparison. The experimental results revealed that the TCK has superior performance than the conventional Tri-training algorithms and the Tri-training algorithms using only cross-entropy or K-nearest neighbor strategy.
https://doi.org/10.3837/tiis.2022.12.006 인용 PDF KSCI HTML

Comparison of Objective Functions for Feed-forward Neural Network Classifiers Using Receiver Operating Characteristics Graph

Oh, Sang-Hoon;Wakuya, Hiroshi
- International Journal of Contents
- /
- v.10 no.1
- /
- pp.23-28
- /
- 2014
When developing a classifier using various objective functions, it is important to compare the performances of the classifiers. Although there are statistical analyses of objective functions for classifiers, simulation results can provide us with direct comparison results and in this case, a comparison criterion is considerably critical. A Receiver Operating Characteristics (ROC) graph is a simulation technique for comparing classifiers and selecting a better one based on a performance. In this paper, we adopt the ROC graph to compare classifiers trained by mean-squared error, cross-entropy error, classification figure of merit, and the n-th order extension of cross-entropy error functions. After the training of feed-forward neural networks using the CEDAR database, the ROC graphs are plotted to help us identify which objective function is better.
https://doi.org/10.5392/IJoC.2014.10.1.023 인용 PDF KSCI KPUBS HTML

Multi-labeled Domain Detection Using CNN (CNN을 이용한 발화 주제 다중 분류)

Choi, Kyoungho;Kim, Kyungduk;Kim, Yonghe;Kang, Inho
- Annual Conference on Human and Language Technology
- /
- 2017.10a
- /
- pp.56-59
- /
- 2017
CNN(Convolutional Neural Network)을 이용하여 발화 주제 다중 분류 task를 multi-labeling 방법과, cluster 방법을 이용하여 수행하고, 각 방법론에 MSE(Mean Square Error), softmax cross-entropy, sigmoid cross-entropy를 적용하여 성능을 평가하였다. Network는 음절 단위로 tokenize하고, 품사정보를 각 token의 추가한 sequence와, Naver DB를 통하여 얻은 named entity 정보를 입력으로 사용한다. 실험결과 cluster 방법으로 문제를 변형하고, sigmoid를 output layer의 activation function으로 사용하고 cross entropy cost function을 이용하여 network를 학습시켰을 때 F1 0.9873으로 가장 좋은 성능을 보였다.
PDF

Search Result 23, Processing Time 0.022 seconds

이메일무단수집거부

이용약관

제 1 장 총칙

제 2 장 이용계약의 체결

제 3 장 계약 당사자의 의무

제 4 장 서비스의 이용

제 5 장 계약 해지 및 이용 제한

제 6 장 손해배상 및 기타사항

Detail Search

Image Search (β)