DOI QR코드

DOI QR Code

A Study on the Development of Product Planning Prediction Model Using Logistic Regression Algorithm

로지스틱 회귀 알고리즘을 활용한 상품 기획 예측 모형 개발에 관한 연구

  • Ahn, Yeong-Hwil (Division of Computer Engineering, Kongju National University) ;
  • Park, Koo-Rack (Division of Computer Science & Engineering, Kongju National University) ;
  • Kim, Dong-Hyun (Dept. of IT Artificial Intelligence, Korea Nazarene University) ;
  • Kim, Do-Yeon (Division of Computer Engineering, Kongju National University)
  • 안영휘 (공주대학교 대학원) ;
  • 박구락 (공주대학교 컴퓨터공학부) ;
  • 김동현 (나사렛대학교 IT인공지능학부) ;
  • 김도연 (공주대학교 대학원)
  • Received : 2021.08.31
  • Accepted : 2021.09.20
  • Published : 2021.09.28

Abstract

This study was conducted to propose a product planning prediction model using logistic regression algorithm to predict seasonal factors and rapidly changing product trends. First, we collected unstructured data of consumers in portal sites and online markets using web crawling, and analyzed meaningful information about products through preprocessing for transformation of standardized data. The datasets of 11,200 were analyzed by Logistic Regression to analyze consumer satisfaction, frequency analysis, and advantages and disadvantages of products. The result of analysis showed that the satisfaction of consumers was 92% and the defective issues of products were confirmed through frequency analysis. The results of analysis on the use satisfaction, system efficiency, and system effectiveness items of the developed product planning prediction program showed that the satisfaction was high. Defective issues are very meaningful data in that they provide information necessary for quickly recognizing the current problem of products and establishing improvement strategies.

본 연구에서는 계절적인 요인과 급변하는 상품의 트렌드를 사전예측하기 위해 로지스틱 회귀 알고리즘을 이용한 상품기획 예측 모형을 제안하고자 수행되었다. 먼저 웹크롤링을 이용하여 포털 사이트 및 온라인 마켓의 소비자의 비정형 데이터를 수집하고 정형 데이터 변환을 위한 전처리 작업을 통해 상품에 대한 의미 있는 정보를 분석하였다. 최종 수집된 11,200개의 데이터셋은 Logistic Regression을 이용하여 상품에 대한 소비자의 만족도, 빈도분석, 상품에 대한 장점과 단점을 분석할 수 있었다. 분석 결과 소비자의 만족도는 92%이었으며, 빈도분석을 통해 상품에 대한 불량이슈를 확인할 수 있었다. 또한, 개발된 상품 기획 예측 프로그램에 대한 사용 만족도, 시스템 효율성, 시스템 효과성 항목에 대한 분석결과에서도 만족도가 높게 나타났다. 특히, 불량이슈는 상품에 대한 현 문제를 신속히 인지하고 개선 전략을 수립하는데 필요한 정보를 제공한다는 점에서 매우 의미 있는 자료가 된다.

Keywords

References

  1. H, Evanschitzky., M., Eisend, R, Calantone, (2012). Success factors of product innovation: An updated meta-analysis. Journal of product innovation management, 29, .21-37. DOI:10.1111/j.1540-5885.2012.00964.x
  2. K. B. Kahn, Barczak, G., Nicholas, J., Ledwith, A., & Perks, H., 2012. An examination of new product development best practice. Journal of product innovation management, 29(2), 180-192. DOI:10.1111/j.1540-5885.2011.00888.x
  3. M. J. J. Lin, & C. H. Huang. (2013). The impact of customer participation on NPD performance: The mediating role of inter-organisation relationship. Journal of Business & Industrial Marketing, 28(1), 3-15. https://doi.org/10.1108/08858621311285679
  4. P. Trotter, & J. Vaughan. (2012). Innovation in UK companies. An evaluation of the implementation of best practice in front end innovation processes and methodologies. International Journal of Innovation Science, 4(4), 191-204. DOI:10.1260/1757-2223.4.4.191
  5. S. H. Kim, & K. H. Huarng. (2011). Winning strategies for innovation and high-technology products management. Journal of Business Research, 64(11), 1147-1150. DOI:10.1016/j.jbusres.2011.06.013
  6. B. Jeong, J. Yoon, & J. M. Lee. (2019). Social media mining for product planning A product opportunity mining approach based on topic modeling and sentiment analysis. International Journal of Information Management, 48, 280-290. DOI:10.1016/j.ijinfomgt.2017.09.009
  7. D. Silverstein, P. Samuel, & N. DeCarlo. (2013). The innovator's toolkit: 50+ techniques for predictable and sustainable organic growth. John Wiley & Sons.
  8. W. Medhat, A. Hassan, & H. Korashy, (2014). Sentiment analysis algorithms and applications: A survey. Ain Shams engineering journal, 5(4), 1093-1113. DOI:10.1016/j.asej.2014.04.011
  9. B. Pang, & L. Lee. (2004). A sentimental education: Sentiment analysis using subjectivity summarization based on minimum cuts. arXiv preprint cs/0409058. DOI: 10.3115/1218955.1218990
  10. S. M. Kim, & E. Hovy. (2004). Determining the sentiment of opinions. In COLING 2004: Proceedings of the 20th International Conference on Computational Linguistics, 1367-1373.
  11. A. Agarwal, F. Biadsy, & K. Mckeown. (2009). Contextual phrase-level polarity analysis using lexical affect scoring and syntactic n-grams. In Proceedings of the 12th Conference of the European Chapter of the ACL (EACL 2009), 24-32.
  12. G. Pant. P. Srinivasan. & F. Menczer. (2004). Crawling the web. In Web dynamics, Springer, Berlin, Heidelberg, 153-177.
  13. V. Shkapenyuk, & T. Suel. (2002). Design and implementation of a high-performance distributed web crawler. In Proceedings 18th International Conference on Data Engineering, IEEE, 357-368. DOI:10.1109/ICDE.2002.994750
  14. F. E. Harrell. (2015). Ordinal logistic regression. Regression modeling strategies, Springer, Cham, 311-325.
  15. C. M. Lewis, & J. Knight (2012). Introduction to genetic association studies. Cold Spring Harbor Protocols, 2012(3), pdb-top068163.
  16. S. Agarwal, P. Kachroo, & E. Regentova. (2016). A hybrid model using logistic regression and wavelet transformation to detect traffic incidents. Iatss Research, 40(1), 56-63. DOI:10.1016/j.iatssr.2016.06.001
  17. W. Chen, et al. (2018). GIS-based groundwater potential analysis using novel ensemble weights-of-evidence with logistic regression and functional tree models. Science of the Total Environment, 634, 853-867. DOI:10.1016/j.scitotenv.2018.04.055