I. Introduction
1. Introductory Remarks
Big data play a critical role in today’s unpredictable socio-economic environment and will continue to do so in the post-COVID-19 times. Data generate variations on two grounds: its production and its application. With the proper processing and the storage support of edge hardware, big data control and usage will be more important than ever especially from the business perspective. As the world adapts to the post-COVID-19 environment, data may hold the core of our new everyday life [1,2]. The outbreak of the novel COVID-19 has changed the worldwide daily lives of citizens globally. As COVID-19 has found to be highly contagious, public anxiety levels rise and many policies have been implemented to control the spread of COVID-19 at the global level. Public health precautions include containment, social distancing, and home residency orders, which have been implemented at the local, regional and national levels. These policies have changed the way of life of citizens around the world, and gradually transformed a powerful socio-economic model into a contactless society. Many industrial sectors have been greatly influenced by these changes, especially in the trade and distribution of goods, education and business [3, 4, 5].
In Korea, social distancing has been adopted as a primary guideline to slow the spread of infection. Under these environments, the most common example of contactless service in Korea is food delivery. Consumers prefer to order food online through a mobile application and take it directly in their home. In addition, the delivery business has expanded its services to groceries and errands etc., and the consumption of these delivery services is expected to become increasing greatly as the pandemic continues [6, 7, 8].
Information on disease coronics and disasters provided by the government in Covid-19 pandemic era has become agile and accessible, which implies the possibility of indiscriminate infection in the community. With more than half of the medical community predicting a prolonged Covid-19 and cases of cross-regional mass infection still prevalent around the world, consumers have brought increased demand of contactless services considering consumers’ psychological stability, and companies are investing heavily the services by predicting these situations and needs [9, 10, 11, 12]. According to the report by Samsung Electronics [13], untact consumption and satisfaction among all age groups increased by 31.2 ~ 72 percent compared to a year earlier.
The research will analyze the relationship of untact consumption and satisfaction by sales data of 'Baedarui Minjok', Woowa Brothers Corp.'s catering delivery application, showing a rapid growth of more than 50% each year [14]. Especially, it will analyze the relationship among the sales data of untact consumption, Covid-19 confirmed patient data, Covid-19 media reports, weather information and week data, identify their relevance and key factors among the untact services and the number of sales, and provide useful results.
This paper is organized as follows: Section 1 describes introductory remarks, big data sources and previous research. Section 2 describes system architecture of this research including E-R (Entity-Relation) Model. Section 3 provides representative examples of utilization plans. Finally, discussions and contributions are described in Section 4.
2. Big Data Sources and Previous Research
2.1. Big Data Sources Utilized in the Research
The big data used in the analysis were provided by Naver Data Lab, Public Data Portal, Financial Data Exchange, the Korea Meteorological Administration, the Statistics Korea, three major broadcasting companies and etc. [15, 16, 17, 18, 19, 20, 21, 22]. The research will use sample data provided by the Financial Data Exchange, delivery sales data paid by only KB Kookmin Card from 'Baedarui Minjok', Woowa Brothers Corp.'s catering delivery application [14]. In reality, the price of the whole data from 'Baedarui Minjok' was traded between about 15 million won and about 20 million won, which was not suitable for the researchers with operational budget caps. In the future, the data hub center to be established is expected to provide relatively cheap financial data to the researchers.
2.2. Analysis on the Survival Strategies of Restaurant Companies with the Appearance of Covid-19
The previous work [23] was performed to analyze important factors that change consumers' consumption trends with focusing on the survey-based structural approach, and provided the desirable response strategies for the Korean restaurant companies to the risk of Covid-19. As a result, the strategies revealed were that maintaining the distance among customers and hygiene management in the store were more important than customer sanitary services, that the most important factor to reduce fixed costs was hot potatoes in offline stores, and that increasing sales outside restaurant were due to the customer service with quick response. Instead of focusing on Korean restaurants, this paper will analyze important factors of the trends of consumers’ consumption focusing on delivery services.
2.3. Analysis of the Impact of Covid-19 Occurrence on Regional Business Area Using Credit Card Big Data (A Case of Suwon City)
A previous study [24] analyzed through credit card sales, the impact of Covid-19 on the local economy and business districts in Suwon, where sales amount of business fell sharply after Covid-19. The direction of analyzing how much Covid-19 in the post-Covid-19 era directly affected sales amount was similar to that of this research, but the results were specifying detail by limiting the scope of the project to Suwon City, and the project data used in the analysis were given by direct support of credit card big data from the Korea Data Industry Promotion Agency. Eventually, since April, 2020 the overall consumption activity in Suwon City recovered to the previous level before Covid-19, but the recovery was not sufficient for the consumption of the area which had showed a significant decline compared to the sales during proliferation. In addition, it was confirmed that commercial districts such as Suwon Station that relied on floating populations and relatively young weekend consumers were directly affected and saw a sharp drop in sales.
2.4. Effects of Social Risk from Covid-19 on Consumer Sentiment and HMR Purchase Patterns
A previous work [25] analyzed the purchase patterns and the threat of disease changed by Covid-19 on HMR (Home Meal Replacement), which drew a sharp upward linear graph in sales amount along with delivery food. And the way how it changed consumers were the topic of the research. Various quantitative data such as gender, monthly income level, occupation, child status and marital status were used to present specific conclusions. Qualitative analysis methods together with quantitative data were also used to analyze purchase patterns of consumers in the post-Covid-19 era in various ways.
Referring to the previous studies, the research will analyze the relationship among the sales data of untact consumption, Covid-19 confirmed patient data, Covid-19 media reports, weather information and week data etc.
II. System Architecture
To analyze the relationship among the sales data of untact consumption, Covid-19 confirmed patient data, Covid-19 media reports, weather information and week data, the database system was designed and implemented. The Entity-Relation Diagram (E-R diagram) was developed for the system as in Fig. 1.
Fig. 1. E-R Diagram
The E-R Diagram shown in Fig. 1 investigated the daily food delivery & consumption trend in the post-Covid-19 era by sales data paid by only KB Credit Card from 'Baedarui Minjok', Woowa Brothers Corp.'s catering delivery application, from Jan. 1, 2020 to July 1, 2020 [14]. The system created 8 entities with the data: Population confirmed by Covid-19 (Coronic), Number of orders delivered (PMT), Media reporting related news (SBS, KBS, MBC), Analysis of search figures for set search terms (LAB), daily weather data(Weather), and population per region (POP).
The CORONIC relation has CODE, DATE, DAY, AREA, and NEW_CORONIC attributes, where CODE is the primary key. The MEDIA relation has AT_ID, DATE, DAY, and AT_NM attributes, where DATE was given as the primary key. The LAB relation has DATE, DAY, L1, L2, L3, L4, L5, L6 attributes, and L1~L6 refers to the search terms we set. L1 is ‘Covid-19’, L2 is ‘Confirmed cases’, L3 is ‘Self quarantine’, L4 is ‘Regional infection’, L5 is ‘Foreign inflow’, and L6 is ‘Social distancing’. The PMT (PAYMENT) relation has CODE, DATE, DAY, AREA, and PMT_GS attributes, where CODE is the primary key. The WEATHER relation has WeatherID, AREA, DATE, TEMPERATURE, PRECIPITATION, and RELATIVE_HUMIDITY attributes, where we set WeatherID as the primary key. Lastly, the POPULATION relation has AREA and POPULATION attributes, where AREA is the primary key.
There is a ‘Provide’ relationship between the Coronic entity and the Media (SBS, KBS, MBC) entity, and this relationship is a one-to-many (1:n) relationship. There is a ‘Fluctuate’ relationship between the Coronic entity and the LAB entity, which is a one-to-many (1:n) relationship. Also, there is a ‘Confirmed’ relationship between the Coronic entity and the Population entity, and this relationship is a one-to-many (1:n) relationship. There is an ‘Order’ relationship between the Population entity and the Payment entity, and this relationship is a one-to-many (1:n) relationship. Finally, there is an ‘Influence’ relationship between the Payment entity and the Weather entity, which is a one-to-one (1:1) relationship.
III. Representatives of Useful Information
The database system was implemented for analyzing the relationship among the sales data of untact consumption, Covid-10 confirmed patient data, Covid-10 media reports, weather information and week data, based on the E-R Diagram in Fig. 1. The system used sales data provided by the Financial Data Exchange, delivery sales data paid by only KB Kookmin Card from 'Baedarui Minjok', Woowa Brothers Corp.'s catering delivery application. The representative valuable information drawn from the database system is as follow.
1. Daily and Regional Covid-19 Confirmed Cases
The regions of the top 10 most new confirmed cases were all Daegu. On February 29, there were 741 new confirmed cases in Daegu, which recorded the highest number as illustrated in Fig. 2. The second place was 520 new confirmed cases in Daegu on March 3, and the third place was 514 new confirmed cases in Daegu on March 1. We could figure out that the highest number of new confirmed cases showed from late February to early March.
Fig. 2. Top 10 Daily and Regional Covid-19 Confirmed Cases
2. Daily and Regional Delivery Cases
In Fig. 3 the top 10 delivery cases were in Daegu, Gyeongbuk, and Seoul. Most of the time, there was a lot of delivery cases in Daegu. On February 29, 94 delivery cases were reported in Daegu, recording the highest figure. Fig. 3 showed that the highest delivery cases were made at the end of February.
Fig. 3. Top 10 Daily and Regional Delivery Cases
3. The Number of Deliveries in Rainy Weather by Region
There were deliveries on rainy days as shown in Fig. 4, but it was hard to conclude that the more precipitation the more delivery. Therefore, the delivery data said that there was little correlation between precipitation and the delivery case.
Fig. 4. The Number of Deliveries in Rainy Weather by Region
4. Headlines with Top 10 Delivery Days
Delivery cases on February 29 recorded the highest figure with 171 cases. Except 4 of the 30 headlines on KBS, MBC, and SBS as shown in Fig. 5, Fig. 6 and Fig. 7, the rest dealt with Covid-19. In other words, news headlines on days with a high number of delivery cases mostly consisted of Covid-19-related content. Also, the more news about Covid-19 was reported, the higher the number of deliveries were made.
Select top 10 PMT.Date, SUM(PMT.PMT_GS) as "Total number of deliveries(cases)", KBS.AT_NM as "KBS headline", MBC.AT_NM as "MBC headline", SBS.AT_NM as "SBS headline"
From PMT, KBS, MBC, SBS
Where PMT.date = KBS.date and PMT.date=MBC.date and PMT.date=sbs.date
Group by PMT.Date, KBS.AT_NM, MBC.AT_NM, SBS.AT_NM
Order by [Total number of deliveries(cases)] desc;
Fig. 5. Headlines with Top 10 Delivery Days (KBS)
Fig. 6. Headlines with Top 10 Delivery Days (MBC)
Fig. 7. Headlines with Top 10 Delivery Days (SBS)
5. Top 10 Delivery Cases of Daily and Regional Covid-19 Confirmed Days
On February 29, Daegu recorded the highest number of new confirmed persons compared to the other region as illustrated in Fig. 8. And the number of Daegu delivery persons was 94 and also recorded the highest figure on the same day. Next, on February 27, there were 340 new confirmed persons with 68 delivery cases in Daegu. Therefore, the number of deliveries increased as the number of new confirmed persons increased.
Fig. 8. Top 10 Delivery Cases of Daily and Regional Covid-19 Confirmed Days
6. Number of Daily Deliveries Based on ‘Covid-19’ Search Volume
The criteria of search volume were relatively measured from January 1 to July 1, 2020 and the highest search volume was 100%. The search for ‘Covid-19 (Corona)’ on February 25 was set to 100%, the highest rate. The number of delivery cases was 91, as shown in Fig. 9. On February 24, the search volume for ‘Covid-19’ was 91.59% and the number of delivery cases was 119. If the rate of ‘Covid-19’ search volume exceeded 50%, the number of delivery cases was mostly more than 100. The higher rate of ‘Covid-19’ search volume, the higher number of daily deliveries.
Fig. 9. Number of Daily Deliveries based on ‘Covid-19’ Search Volume
Select top 10 LAB.L1 as "Covid-19(%)", PMT.DATE, Sum(PMT.PMT_GS) as "Total number of deliveries(cases)"
From PMT, LAB
Where LAB.DATE= PMT.DATE
Group by PMT.DATE, LAB.L1
Order by [Covid-19(%)]desc;
7. Covid-19 related Keywords Search Volumes based on the Number of New Confirmed Cases
Select top 10 Coronic.DATE, Sum(Coronic.New_Coronic) as "New coronic(persons)", LAB.L1 as "Covid-19(%)", LAB.L2 as "Confirmed cases(%)", LAB.L3 as "Self quarantine(%)", LAB.L4 as "Regional infection(%)", LAB.L6 as "Social distancing(%)"
From Coronic, LAB
Where Coronic.DATE = LAB.DATE
Group by Coronic.DATE, Coronic.New_Coronic, LAB.L1, LAB.L2, LAB.L3, LAB.L4, LAB.L6
Order by [New coronic(persons)] desc;
The research selected ‘Covid-19 (Corona)’ search volumes, ‘Confirmed Case’, ‘Self-quarantine’, ‘Community Spread’, and ‘Distancing’ for Covid-19 related search terms, as illustrated in Fig. 10. The terms were also relatively measured based on 100% of February 29, 2020, when the number of new confirmed cases reached the highest figure with 68% of ‘Covid-19’ search volumes, ‘21%’ of ‘Confirmed Case’ search volumes, ‘4%’ of ‘Self-quarantine’, 5% of ‘Community Spread’, and 0.9% of ‘Distancing’ search volumes. The more Covid-19 confirmed, the more Covid-19 search volumes. It was difficult to identify a linear correlation between the keyword search volume and the number of the new confirmed case. However, keywords related to Covid-19 were constantly being searched.
Fig. 10. Covid-19-related Keywords Search Volumes based on the Number of New Confirmed Cases
IV. Conclusions
The research suggested the method of analysis for daily food delivery and consumption trends through bid data of the post-Covid-19 era. It was confirmed that using sales data of the food delivery application 'Baedarui Minjok' serviced by 'Woowa Brothers Corporation, ' four factors, except weather, had a significant correlation with the sales data. Among them, the most closely correlated factor was data from three broadcasting companies (KBS, MBC, SBS), that is, media. On the day after the Covid-19 related new article was released, food delivery sales soared up to about 60%, showing remarkable results. In addition, it was confirmed that a significant increase in sales occurred on the days before and after the relative figures of the Naver search results replacing SNS data soared up.
During the research, it was confirmed that factors such as mobile media and web surfing were also the main factors in the increase in sales of delivery applications. This suggested that crawling and analyzing data from social network services (SNS) used by Y, Z, generation and millennials could also be a major factor in increasing food delivery sales. Therefore, it can be seen that viral marketing can be an important key for market exploration and product positioning for companies that want to launch similar types of applications. In addition, it is expected that delivery companies in the post-Covid-19 era can be useful in business operation based on meaningful information derived from finding the association between the set external factor (entity) and the increase of delivery sales.
The research can contribute the companies in the economic recession era to survive by providing the method for analyzing the big data and increasing their sales.
Limitation of the research exists in collecting delivery application sales data. The price of the actual data was traded between about 15 million won and about 20 million won, which was not suitable for the researchers with operational budget caps. Therefore, the biggest limitation of this research is that the results are a little insufficient by using the delivery sales data paid by only KB Kookmin Card instead of the whole sales data from 'Baedarui Minjok'.
ACKNOWLEDGEMENT
This work was supported by Hankuk University of Foreign Studies Research Fund of 2020.
References
- Bruce Lehrman, "Big Data's Role in the Post-COVID Era", Data Agility, Vol. 16, Issue 11, Sept. 2020. www.pipelinepub.com.
- Jun Wu, Jian Wang, Stephen Nicholas, Elizabeth Maitland, and Qiuyan Fan, "Application of Big Data Technology for COVID-19 Prevention and Control in China: Lessons and Recommendations", Journal of Medical Internet Research, Vol. 22, No. 10: e21980, Oct. 2020. doi:10.2196/21980.
- Hanghun Jo, Eunha Shin, and Heungsoon Kim, "Changes in Consumer Behaviour in the Post-COVID-19 Era in Seoul, South Korea", Sustainability, Vol. 13, No. 136, 2021. https://dx.doi.org/10.3390/su13010136
- Jagdish Sherth, "Impact of Covid-19 on Consumer Behavior: Will the Old Habits Return or Die?", Journal of Business Research, Vol. 117, pp. 280-283, Sept. 2020. https://doi.org/10.1016/j.jbusres.2020.05.059
- Victor Fabius, Sajal Kohli, Bjorn Timelin, and sofia Moulvad Varanen, "How COVID-19 is Changing Consumer Behavior-Now and Forever", McKinsey & Company, July 2020.
- Y. Bae and H. Shin, "COVID-19, Accelerating the Non-contact Society", GRI Issue Analysis, Vol. 416, pp. 1-21, 2020.
- Hung-Hao Chang and Chad D. Meyerhoffer, "COVID-19 and the Demand for Online Food Shopping Services: Empirical Evidence From Taiwan", American Journal of Agricultural Economics, Nov. 2020. https://doi.org/10.1111/ajae.12170
- Duralia Oana, "The Impact of the Current Crisis Generated by the Pandemic on Consumer Behavior", Studies in Business and Economics, Vol. 15, Issue 2, Oct. 2020. https://doi.org/10.2478/sbe.2020-0027
- Job News, "71% of Adults -- Covid-19 increased 'untact consumption'!", 2020. http://www.saramin.co.kr/zf_user/help/live/view?idx=105407&listType=news
- Jihye Oh, "More than half of the doctors said, domestic Covid-19 will be prolonged... 85.4% of them said. reduced patients and sales will be cotinued", Medi-Consumer News. http://www.medisobizanews.com/news/articleView.html?idxno=67288
- Bonggi Kim, "It's stuffy outside the blanket.-- Untact consumption explosion.", https://biz.chosun.com/site/data/html_dir/2020/02/02/2020020201462.html
- Suwan Kim, "When is it going to be over? -- Covid-19, Red to Black…Mind health -- What do I do?", https://view.asiae.co.kr/article/2020111315215069790
- Samsung Electronics, "How far have you come with the untact technology?", https://m.post.naver.com/viewer/postView.nhn?volumeNo=23486419&memberNo=36410102
- Sungyong Hong, "Woowa Brothers Corp., 500 Billion in Sales...Turn to Deficit in Four Years", https://www.mk.co.kr/news/business/view/2020/03/290043/
- Naver Data Lab, https://datalab.naver.com/
- Meteorological Data Open Portal, Automated Surface Observing System(ASOS).
- Financial Data Exchange, Samples of Sales Data for Customers and Restaurants Using KB Card Delivery Application.
- Korean Statistical Information Service, Statistical Database: Population. 2020. https://kosis.kr/statHtml/statHtml.do?orgId=101&tblId=DT_1B040A3
- Statistics Korea, Covid-19 Occurrence Status, https://kosis.kr/statHtml/statHtml.do?orgId=101&tblId=DT_COVID19_005_D
- KBS, https://news.kbs.co.kr/vod/program.do?bcd=0001&ref=pGnb#20201109&1
- SBS, https://news.sbs.co.kr/news/programMain.do?prog_cd=R1&plink=GNB&cooper=SBSNEWS
- MBC, https://imnews.imbc.com/replay/2020/nwdesk/
- Gyusung Lee, "Analysis on the Survival Strategies of Restaurant Companies with the Appearance of Covid-19", Master Thesis, Graduate School of Knowledge Services & Consulting, Hansung University, 2020.
- Sangho Lee and Seokhwan Choi, "Analysis of the Impact of Covid-19 Occurrence on Regional Commercial Area Using Credit Card Big Data -- Focusing on Suwon City's Case)", Space and Society, Vol. 30, No. 3, pp. 167-208, 2020.
- Jaeyun Choi, "Effects of Social Risk from Corona-19 (COVID-19) on Consumer Sentiment and HMR Purchasing Patterns", Graduate School of Living and Environment, Yonsei University, 2020.