Online VQ Codebook Generation using a Triangle Inequality

  • Lee, Hyunjin (Dept. of Computer Science & Software, Korea Soongsil Cyber University)
  • Received : 2015.04.28
  • Accepted : 2015.05.30
  • Published : 2015.06.30

Abstract

In this paper, we propose an online VQ codebook generation method that updates an existing VQ codebook in real time as new data arrive. The target setting is one in which text data (news documents, web pages, blogs, tweets) and IoT data (sensor and machine data) are generated continuously, and each newly added item must be incorporated into the codebook that has already been built. The method modifies the codebook progressively using the newly added data, without degrading the performance of the codebook that was generated in batch mode from the existing data, and it uses the triangle inequality to achieve high accuracy and speed. When applied to test data, the method performed similarly to the batch method while running faster than other online K-Means methods.
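The abstract does not give the algorithm itself, so the following is only a minimal sketch of the general idea, not the paper's implementation. It assumes two standard ingredients: a sequential running-mean update of the nearest codeword, and the well-known triangle-inequality bound that if d(c_a, c_b) >= 2 d(x, c_a), then d(x, c_b) >= d(x, c_a), so codeword c_b can be skipped without computing its distance to x. The names `OnlineCodebook`, `nearest`, `update`, `gaps`, and `skipped` are hypothetical and chosen for illustration.

```python
import math


def dist(a, b):
    """Euclidean distance between two equal-length vectors."""
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))


class OnlineCodebook:
    """Hypothetical online VQ codebook (illustrative, not the paper's code)."""

    def __init__(self, codewords, counts):
        # codewords: initial codebook, e.g. produced by a batch run.
        # counts: number of training vectors already assigned to each codeword.
        self.codewords = [list(c) for c in codewords]
        self.counts = list(counts)
        self.skipped = 0  # distance computations avoided by the bound
        k = len(self.codewords)
        # Cache of codeword-to-codeword distances used by the bound.
        self.gaps = [[dist(self.codewords[i], self.codewords[j])
                      for j in range(k)] for i in range(k)]

    def nearest(self, x):
        """Index of the closest codeword, pruning with the triangle inequality."""
        best = 0
        best_d = dist(x, self.codewords[0])
        for j in range(1, len(self.codewords)):
            # If d(c_best, c_j) >= 2 * d(x, c_best), then c_j cannot be
            # closer to x than c_best, so its distance is never computed.
            if self.gaps[best][j] >= 2.0 * best_d:
                self.skipped += 1
                continue
            d = dist(x, self.codewords[j])
            if d < best_d:
                best, best_d = j, d
        return best

    def update(self, x):
        """Assign x to its nearest codeword and move that codeword toward x."""
        i = self.nearest(x)
        self.counts[i] += 1
        rate = 1.0 / self.counts[i]  # running-mean step (an assumption)
        self.codewords[i] = [c + rate * (v - c)
                             for c, v in zip(self.codewords[i], x)]
        # Only codeword i moved, so only its row and column need refreshing.
        for j, c in enumerate(self.codewords):
            g = dist(self.codewords[i], c)
            self.gaps[i][j] = g
            self.gaps[j][i] = g
        return i


book = OnlineCodebook([[0.0, 0.0], [10.0, 0.0], [0.0, 10.0]], counts=[4, 4, 4])
assigned = book.update([1.0, 0.0])
print(assigned, book.codewords[assigned], book.skipped)
```

Because the pruning test only discards codewords that provably cannot beat the current best, `nearest` returns the same index as an exhaustive search. The running-mean step size is one plausible choice for keeping the batch-built codebook stable; a fixed learning rate would be the usual alternative for nonstationary data.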
