• Title/Summary/Keyword: Statistical Forecasting

Search Result 480, Processing Time 0.027 seconds

Analysis and Forecast of Venture Capital Investment on Generative AI Startups: Focusing on the U.S. and South Korea (생성 AI 스타트업에 대한 벤처투자 분석과 예측: 미국과 한국을 중심으로)

  • Lee, Seungah;Jung, Taehyun
    • Asia-Pacific Journal of Business Venturing and Entrepreneurship
    • /
    • v.18 no.4
    • /
    • pp.21-35
    • /
    • 2023
  • Expectations surrounding generative AI technology and its profound ramifications are sweeping across various industrial domains. Given the anticipated pivotal role of the startup ecosystem in the utilization and advancement of generative AI technology, it is imperative to cultivate a deeper comprehension of the present state and distinctive attributes characterizing venture capital (VC) investments within this domain. The current investigation delves into South Korea's landscape of VC investment deals and prognosticates the projected VC investments by juxtaposing these against the United States, the frontrunner in the generative AI industry and its associated ecosystem. For analytical purposes, a compilation of 286 investment deals originating from 117 U.S. generative AI startups spanning the period from 2008 to 2023, as well as 144 investment deals from 42 South Korean generative AI startups covering the years 2011 to 2023, was amassed to construct new datasets. The outcomes of this endeavor reveal an upward trajectory in the count of VC investment deals within both the U.S. and South Korea during recent years. Predominantly, these deals have been concentrated within the early-stage investment realm. Noteworthy disparities between the two nations have also come to light. Specifically, in the U.S., in contrast to South Korea, the quantum of recent VC deals has escalated, marking an augmentation ranging from 285% to 488% in the corresponding developmental stage. While the interval between disparate investment stages demonstrated a slight elongation in South Korea relative to the U.S., this discrepancy did not achieve statistical significance. Furthermore, the proportion of VC investments channeled into generative AI enterprises, relative to the aggregate number of deals, exhibited a higher quotient in South Korea compared to the U.S. Upon a comprehensive sectoral breakdown of generative AI, it was discerned that within the U.S., 59.2% of total deals were concentrated in the text and model sectors, whereas in South Korea, 61.9% of deals centered around the video, image, and chat sectors. Through forecasting, the anticipated VC investments in South Korea from 2023 to 2029 were derived via four distinct models, culminating in an estimated average requirement of 3.4 trillion Korean won (ranging from at least 2.408 trillion won to a maximum of 5.919 trillion won). This research bears pragmatic significance as it methodically dissects VC investments within the generative AI domain across both the U.S. and South Korea, culminating in the presentation of an estimated VC investment projection for the latter. Furthermore, its academic significance lies in laying the groundwork for prospective scholarly inquiries by dissecting the current landscape of generative AI VC investments, a sphere that has hitherto remained void of rigorous academic investigation supported by empirical data. Additionally, the study introduces two innovative methodologies for the prediction of VC investment sums. Upon broader integration, application, and refinement of these methodologies within diverse academic explorations, they stand poised to enhance the prognosticative capacity pertaining to VC investment costs.

  • PDF

Rice Yield Estimation of South Korea from Year 2003-2016 Using Stacked Sparse AutoEncoder (SSAE 알고리즘을 통한 2003-2016년 남한 전역 쌀 생산량 추정)

  • Ma, Jong Won;Lee, Kyungdo;Choi, Ki-Young;Heo, Joon
    • Korean Journal of Remote Sensing
    • /
    • v.33 no.5_2
    • /
    • pp.631-640
    • /
    • 2017
  • The estimation of rice yield affects the income of farmers as well as the fields related to agriculture. Moreover, it has an important effect on the government's policy making including the control of supply demand and the price estimation. Thus, it is necessary to build the crop yield estimation model and from the past, many studies utilizing empirical statistical models or artificial neural network algorithms have been conducted through climatic and satellite data. Presently, scientists have achieved successful results with deep learning algorithms in the field of pattern recognition, computer vision, speech recognition, etc. Among deep learning algorithms, the SSAE (Stacked Sparse AutoEncoder) algorithm has been confirmed to be applicable in the field of forecasting through time series data and in this study, SSAE was utilized to estimate the rice yield in South Korea. The climatic and satellite data were used as the input variables and different types of input data were constructed according to the period of rice growth in South Korea. As a result, the combination of the satellite data from May to September and the climatic data using the 16 day average value showed the best performance with showing average annual %RMSE (percent Root Mean Square Error) and region %RMSE of 7.43% and 7.16% that the applicability of the SSAE algorithm could be proved in the field of rice yield estimation.

Development of Multiple Regression Models for the Prediction of Daily Ammonia Nitrogen Concentrations (일별 암모니아성 질소(NH3-N)농도 예측을 위한 다중회귀모형 개발)

  • Chug, Se-Woong
    • Journal of Korea Water Resources Association
    • /
    • v.36 no.6
    • /
    • pp.1047-1058
    • /
    • 2003
  • Seasonal occurrence of high ammonia nitrogen(NH3-N) concentrations has hampered chemical treatment processes of a water plant that intakes water at Buyeo site of Geum river. Thus it is often needed to quantify the effect of Daecheong Dam ouflow on the mitigation of $NH_3$-N contamination. In this study, multiple regression models were developed for forecasting daily $NH_3$-N concentrations using 8 years of water quality and dam outflow data, and verified with another 2 years of data set. During model development, the coefficients of determination($R^2$) and model efficiency($E_{m}$) were greater than 0.95. The verification results were also satisfactory although those statistical indices were slightly reduced to 0.84∼0.94 and 0.77∼0.93, respectively. The validated model was applied to assess the effect of different amounts of dam outflow on the reduction of $NH_3$-N concentrations in 2002. The NH3-N concentrations dropped by 0.332∼0.583 mg/L on average during January∼March as outflow increases from 5 to 50cms, and was most significant on February. The results of this research show that the multiple regression approach has potential for efficient cause and effect analysis between dam outflow and downstream water quality.

Comparison of Daily Rainfall Interpolation Techniques and Development of Two Step Technique for Rainfall-Runoff Modeling (강우-유출 모형 적용을 위한 강우 내삽법 비교 및 2단계 일강우 내삽법의 개발)

  • Hwang, Yeon-Sang;Jung, Young-Hun;Lim, Kwang-Suop;Heo, Jun-Haeng
    • Journal of Korea Water Resources Association
    • /
    • v.43 no.12
    • /
    • pp.1083-1091
    • /
    • 2010
  • Distributed hydrologic models typically require spatial estimates of precipitation interpolated from sparsely located observational points to the specific grid points. However, widely used estimation schemes fail to describe the realistic variability of daily precipitation field. We compare and contrast the performance of statistical methods for the spatial estimation of precipitation in two hydrologically different basins, and propose a two-step process for effective daily precipitation estimation. The methods assessed are: (1) Inverse Distance Weighted Average (IDW); (2) Multiple Linear Regression (MLR); (3) Climatological MLR; and (4) Locally Weighted Polynomial Regression (LWP). In the suggested simple two-step estimation process, precipitation occurrence is first generated via a logistic regression model before applying IDW scheme (one of the local scheme) to estimate the amount of precipitation separately on wet days. As the results, the suggested method shows the better performance of daily rainfall interpolation which has spatial differences compared with conventional methods. And this technique can be used for streamflow forecasting and downscaling of atmospheric circulation model effectively.

Forecasting Future Market Share between Online-and Offline-Shopping Behavior of Korean Consumers with the Application of Double-Cohort and Multinomial Logit Models (생잔효과와 다중로짓모형으로 분석한 구매형태별 시장점유율 예측)

  • Lee, Seong-Woo;Yun, Seong-Do
    • Journal of Distribution Research
    • /
    • v.14 no.1
    • /
    • pp.45-65
    • /
    • 2009
  • As a number of people using the internet for their shopping steadily rises, it is increasingly important for retailers to understand why consumers decide to buy products via online or offline. The main purpose of this study is to develop and test a model that enhance our understanding of how consumers respond future online and offline channels for their purchasing. Rather than merely adopting statistical models like most other studies in this field, the present study develops a model that combines double-cohort method with multinomial logit model. It is desirable if one can adopt an overall encompassing criterion in the study of consumer behaviors form diverse sales channels. This study uses the concept of cohort or aging to enable this comparison. It enables us to analyze how consumers respond to online and offline channels as people aged by measuring their shopping behavior for an online and offline retailers and their subsequent purchase intentions. Based on some empirical findings, this study concludes with policy implications and some necessary fields of future studies desirable.

  • PDF

Development and Application of the Mode Choice Models According to Zone Sizes (분석대상 규모에 따른 수단분담모형의 추정과 적용에 관한 연구)

  • Kim, Ju-Yeong;Lee, Seung-Jae;Kim, Do-Gyeong;Jeon, Jang-U
    • Journal of Korean Society of Transportation
    • /
    • v.29 no.6
    • /
    • pp.97-106
    • /
    • 2011
  • Mode choice model is an essential element for estimating- the demand of new means of transportation in the planning stage as well as in the establishment phase. In general, current demand analysis model developed for the mode choice analysis applies common parameters of utility function in each region which causes inaccuracy in forecasting mode choice behavior. Several critical problems from using common parameters are: a common parameter set can not reflect different distribution of coefficient for travel time and travel cost by different population. Consequently, the resulting model fails to accurately explain policy variables such as travel time and travel cost. In particular, the nonlinear logit model applied to aggregation data is vulnerable to the aggregation error. The purpose of this paper is to consider the regional characteristics by adopting the parameters fitted to each area, so as to reduce prediction errors and enhance accuracy of the resulting mode choice model. In order to estimate parameter of each area, this study used Household Travel Survey Data of Metropolitan Transportation Authority. For the verification of the model, the value of time by marginal rate of substitution is evaluated and statistical test for resulting coefficients is also carried out. In order to crosscheck the applicability and reliability of the model, changes in mode choice are analyzed when Seoul subway line 9 is newly opened and the results are compared with those from the existing model developed without considering the regional characteristics.

Development of a Freeway Travel Time Estimating and Forecasting Model using Traffic Volume (차량검지기 교통량 데이터를 이용한 고속도로 통행시간 추정 및 예측모형 개발에 관한 연구)

  • 오세창;김명하;백용현
    • Journal of Korean Society of Transportation
    • /
    • v.21 no.5
    • /
    • pp.83-95
    • /
    • 2003
  • This study aims to develop travel time estimation and prediction models on the freeway using measurements from vehicle detectors. In this study, we established a travel time estimation model using traffic volume which is a principle factor of traffic flow changes by reviewing existing travel time estimation techniques. As a result of goodness of fit test. in the normal traffic condition over 70km/h, RMSEP(Root Mean Square Error Proportion) from travel speed is lower than the proposed model, but the proposed model produce more reliable travel times than the other one in the congestion. Therefore in cases of congestion the model uses the method of calculating the delay time from excess link volumes from the in- and outflow and the vehicle speeds from detectors in the traffic situation at a speed of over 70km/h. We also conducted short term prediction of Kalman Filtering to forecast traffic condition and more accurate travel times using statistical model The results of evaluation showed that the lag time occurred between predicted travel time and estimated travel time but the RMSEP values of predicted travel time to observations are as 1ow as that of estimation.

Estimation and assessment of long-term drought outlook information using the long-term forecasting data (장기예보자료를 활용한 장기 가뭄전망정보 산정 및 평가)

  • So, Jae-Min;Oh, Taesuk;Bae, Deg-Hyo
    • Journal of Korea Water Resources Association
    • /
    • v.50 no.10
    • /
    • pp.691-701
    • /
    • 2017
  • The objective of this study is to evaluate the long-term drought outlook information based on long-term forecast data for the 2015 drought event. In order to estimate the Standardized Precipitation Index (SPI) for different durations (3-, 6-, 9-, 12-months), we used the observation precipitation of 59 Automated Synoptic Observing System (ASOS) sites, forecast and hindcast data of GloSea5. The Receiver Operating Characteristic (ROC) analysis and statistical analysis (Correlation Coefficient, CC; Root Mean Square Error, RMSE) were used to evaluate the utilization of drought outlook information for the forecast lead-times (1~6months). As a result of ROC analysis, ROC scores of SPI(3), SPI(6), SPI(9) and SPI(12) were estimated to be over 0.70 until the 2-, 3-, 4- and 5-months. The CC and RMSE values of SPI(3), SPI(6), SPI(9) and SPI(12) for forecast lead-time were estimated as (0.60, 0.87), (0.72, 0.95), (0.75, 0.95) and (0.77, 0.89) until the 2-, 4-, 5- and 6-months respectively.

Development of Urban Flood Forecasting Model using Statistical Method (통계학적 기법을 이용한 도시홍수 예.경보모형의 개발)

  • Lee, Beum-Hee;Lim, Jong-Il
    • Proceedings of the Korea Water Resources Association Conference
    • /
    • 2007.05a
    • /
    • pp.805-809
    • /
    • 2007
  • 최근 도시의 발달은 하상공간에 대한 이용도를 높이는 방향으로 개발이 진행되어가는 추세이며, 하상도로 및 하상주차장의 이용은 이제 도시 내에서 이용 가능한 마지막 여유 공간으로 인식될 정도로 그 의존도가 높아져가고 있다. 그러나 하상공간의 활용도가 높아져갈수록 도시홍수의 발생으로 인한 대피문제가 발생하게 되고 돌발홍수로 인하여 하상도로의 차단 혹은 하상 주차장에 주차된 차량의 소거가 늦어지는 경우 고스란히 피해를 보게 되는 등 그 부작용도 계속 증가되고 있다. 도시홍수의 특성을 살펴보면 국지성 돌발 강우에 의한 유량의 급격한 증가와 짧은 유하시간, 작은 유역면적 등에 의하여 주요 예보지점까지의 도달시간이 매우 짧아 수문학적 홍수예측 모형을 이용하여 홍수예측 업무를 수행하는데 선행시간을 충분히 확보할 수 없다는 단점을 지니고 있다. 이에 따라 본 연구에서는 기존의 하천시스템에 대한 설계 등을 목적으로 하여 모형의 적용을 통한 시뮬레이션 기법을 적용하고 이를 통하여 홍수 예경보를 발령하기에는 선행시간의 확보(대피시간의 확보)라는 측면에서 상당한 어려움을 지닐 수 있으므로 시시각각으로 측정되는 실시간 수위측정 자료 및 실시간 강우자료를 이용하여 모형의 수행과정을 생략하고 하천의 수위변동을 직접 예측하고 대피할 수 있는 시나리오 기반의 수문모형을 개발하였다. SPSS를 사용한 통계학적 모형을 대전광역시 3대 하천에 대하여 적용한 결과 예측자료가 실측자료를 고수위 및 저수위 부근에서 정확히 모의하지 못하는 경향이 나타났으나 경계 및 위험수위를 설정하고 이를 넘어가는 시점에 대한 예측을 하는 홍수경보 시점 예측에는 효율적인 적용성을 나타내었다.씬 간편하면서도 정확도가 높아서, 환경방사성 스트론튬의 정량분석에 적절히 사용될 수 있다.e form of Jones matrix, which allows a new interpretation in the conversion efficiency of the thin-film optical waveguides.있다는 장점이 있었다. 따라서 소아에서 복막투석도관 수술 시 복강경적 방법을 이용하는 것이 효율적인 복막 투석을 위해 유용하다고 생각된다.상부 방광천자에 비해 민감도 59.5%(25/42), 특이도 86.6%(13/15)였고 위양성률 13.3%(2/15), 위음성률 40.5%(17/42) 로 정확도가 낮았다. 결론 : 소변을 가리지 못하는 영유아에서 요로 감염을 진단하기 위해서는 도뇨관 채뇨에 비해 초음파 감시하 치골상부 방광천자가 정확하고 안전한 채뇨법으로 권장되어야 한다고 생각한다.應裝置) 및 운용(運用)에 별다른 어려움이 없고, 내열성(耐熱性)이 강(强)하므로 쉬운 조건하(條件下)에서 경제적(經濟的)으로 공업적(工業的) 이용(利用)에 유리(有利)하다고 판단(判斷)되어진다.reatinine은 함량이 적었다. 관능검사결과(官能檢査結果) 자가소화(自家消化)시킨 크릴간장은 효소(酵素)처리한 것이나 재래식 콩간장에 비하여 품질 면에서 손색이 없고 저장성(貯藏性)이 좋은 크릴간장을 제조(製造)할 수 있다는 결론을 얻었다.이 있음을 확인할 수 있었다.에 착안하여 침전시 슬러지층과 상등액의 온도차를 측정하여 대사열량의 발생량을 측정하고 슬러지의 활성을 측정할 수 있는 방법을 개발하였다.enin과 Rhaponticin의 작용(作用)에 의(依)한 것이며,

  • PDF

Analysis the Impact of Topographic Factors on the Structure of Forest Vegetation in Deogyusan National Park (덕유산 국립공원 산림식생구조의 지형적 영향 분석)

  • Kim, Tae-Geun;Noh, Il;Jeong, Jong-Chul;Cho, Young-Hwan;Oh, Jang-Geun
    • Korean Journal of Ecology and Environment
    • /
    • v.46 no.1
    • /
    • pp.53-59
    • /
    • 2013
  • The purpose of this study was to analyze the topographic effect of the LAI (Leaf Area Index), which has been widely used as an index that quantifies the structure of forest vegetation in Deogyusan National Park. With this aim, the study was conducted through a regression analysis which took as explanation the following variables: the elevation, slope, aspect, and soil moisture conditions. The LAI was taken as the response variable. Overall, the correlation between the Field-LAI and topographic factors was less than 0.5, which was relatively low. Except for topographic altitude, there was no statistical significance regarding the correlation with other factors. Meanwhile, regarding the orientation of the correlation, the higher the attitude, the steeper slope, the lower the soil moist, the lower the LAI value. The topographic altitude was found as a statistically significant explanation variable. The TWI (Topographic Wetness Index), which was used in this study to explain the soil moisture conditions, was not significantly related to the LAI distribution. The results of this study are expected to be utilized as basic data in more accurate forecasting the LAI distribution using remote sensing data.