• Title/Summary/Keyword: 득점 모형

Search Result 15, Processing Time 0.022 seconds

Evaluating the quality of baseball pitch using PITCHf/x (PITCHf/x를 이용한 투구의 질 평가)

  • Park, Sungmin;Jang, Woncheol
    • The Korean Journal of Applied Statistics
    • /
    • v.33 no.2
    • /
    • pp.171-184
    • /
    • 2020
  • Major League Baseball (MLB) records and releases the trajectory data for every baseball pitch, called the PITCHf/x, using three high-speed cameras installed in every stadium. In a previous study, the quality of the pitch was assessed as the expected number of bases yielded using PITCHf/x data. However, the number of bases yielded does not always lead to baseball scores, or runs. In this paper, we assess the quality of a pitch by combining baseball analytics metric Run Expectancy and Run Value using a Random Forests model. We compare the quality of pitches evaluated with Run Value to the quality of pitches evaluated with the expected number of bases yielded.

Analysis of Scoring Difficulty in Different Match Situations in Relation to First Athlete to Score in World Taekwondo Athletes (세계태권도 겨루기 선수들의 선제득점에 따른 경기 내용별 득점 난이도 분석)

  • Mi-Na Jin;Jung-Hyun Yun;Chang-Jin Lee
    • Journal of Industrial Convergence
    • /
    • v.22 no.4
    • /
    • pp.21-29
    • /
    • 2024
  • This study analyzed the difficulty of scoring in different match situations in relation to which competitor scored first. The study analyzed the data from the 2022 Guadalajara World Taekwondo Championships. The analysis was performed for two separate weight classes: lightweight and heavyweight. Four game content variables were used: whether the athlete scored first, attack type, attack area, and game situation. Descriptive statistics, the Rasch model, and discrimination function questions were applied for data processing. SPSS and Winsteps were used for the statistical analysis, and the statistical significance level was set at 0.05. Consequently, in the lightweight class, the scoring frequency of the first scorer was high for all the game variables. In the heavyweight class, the scoring frequency for the first scorer was high for the attack type and attack area. By contrast, those who did not score first were more frequently found to be in a loss situation. By analyzing the scoring difficulties in different match situations based on whether the competitor scored first, the athletes who scored first in attack type most easily scored first. In losing situations, the athletes who scored first in attack area scored most easily, whereas those who did not score first scored most easily in body and match situations. For the heavyweight class, those who scored first in terms of attack type, counter-attack, and attack area scored the most easily while winning in body and match situations.

Measuring the accuracy of the Pythagorean theorem in Korean pro-baseball (한국프로야구에서의 피타고라스 정리의 정확도 측정)

  • Lee, Jangtaek
    • Journal of the Korean Data and Information Science Society
    • /
    • v.26 no.3
    • /
    • pp.653-659
    • /
    • 2015
  • The Pythagorean formula for baseball postulated by James (1982) indicates the winning percentage as a function of runs scored and runs allowed. However sometimes, the Pythagorean formula gives a less accurate estimate of winning percentage. We use the records of team vs team historic win loss records of Korean professional baseball clubs season from 2005 and 2014. Using assumption that the difference between winning percentage and pythagorean expectation are affected by unusual distribution of runs scored and allowed, we suppose that difference depends on mean, standard deviation, and coefficient of variation of runs scored per game and runs allowed per game, respectively. In conclusion, the discrepancy is mainly related to the coefficient of variation and standard deviation for run allowed per game regardless of run scored per game.

Analyzing records of Korean pro-basketball using general linear model (일반선형모형을 적용한 한국남자프로농구 경기기록분석 : 2014-2015 정규리그)

  • Kim, Sae Hyung
    • Journal of the Korean Data and Information Science Society
    • /
    • v.26 no.4
    • /
    • pp.957-970
    • /
    • 2015
  • The purpose of this study was to analyze records of Korean pro-basketball using general linear model (two-way ANOVA and hierarchical multiple regression analysis). Korea Basketball League (KBL) informed the records (2014-2015 season) of this study. The eight variables (TA, 2PA, 3PA, 2P, 3P, Ast, TFB, CH) were selected in content validity. SPSS program was used to analyze general linear model. All alpha level was set at 0.05. Major results were as follow. 3PA had significant interaction effect between victory & defeat variable and home & away variable. Victory teams showed that 3PA was higher in home games than away games, and defeat teams was the other. 2PA, AS, TFB, and CH were selected significant variables affecting victory and defeat. In result of hierarchical regression, Ast had significant moderation effect between 3PA and TS. TFB also had significant moderation effect between AS between 2P. The other construct (Ast between 2PA and TS; TFB between AS between 3P) had no significant moderation effect. In the effect of 2PA, 3PA and Ast to TS, CH also had no significant moderation effect.

Analysis of the Korean Baseball League using a Markov Chain Model (마르코프 연쇄를 이용한 한국 프로야구 경기 분석)

  • Moon, Hyung Woo;Woo, Yong Tae;Shin, Yang Woo
    • The Korean Journal of Applied Statistics
    • /
    • v.26 no.4
    • /
    • pp.649-659
    • /
    • 2013
  • We use a Markov chain model to analyze the Korean Baseball League. We derive the distributions of the number of runs scored and the number of batters that complete their turn at bat in a baseball game using the time inhomogeneous Markov chain. The model is tested with real data produced from the 2011 Korean Baseball League.

Performances analysis of football matches (축구경기의 경기력분석)

  • Min, Dae Kee;Lee, Young-Soo;Kim, Yong-Rae
    • Journal of the Korean Data and Information Science Society
    • /
    • v.26 no.1
    • /
    • pp.187-196
    • /
    • 2015
  • The team's performances were analyzed by evaluating the scores gained by their offense and the scores allowed by their defense. To evaluate the team's attacking and defending abilities, we also considered the factors that contributed the team's gained points or the opposing team's gained points? In order to analyze the outcome of the games, three prediction models were used such as decision trees, logistic regression, and discriminant analysis. As a result, the factors associated with the defense showed a decisive influence in determining the game results. We analyzed the offense and defense by using the response variable. This showed that the major factors predicting the offense were non-stop pass and attack speed and the major factor predicting the defense were the distance between right and left players and the distance between front line attackers and rearmost defenders during the game.

Prediction of K-league soccer scores using bivariate Poisson distributions (이변량 포아송분포를 이용한 K-리그 골 점수의 예측)

  • Lee, Jang Taek
    • Journal of the Korean Data and Information Science Society
    • /
    • v.25 no.6
    • /
    • pp.1221-1229
    • /
    • 2014
  • In this paper we choose the best model among several bivariate Poisson models on Korean soccer data. The models considered allow for correlation between the number of goals of two competing teams. We use an R package called bivpois for bivariate Poisson regression models and the data of K-league for season 1983-2012. Finally we conclude that the best fitted model supported by the AIC and BIC is the bivariate Poisson model with constant covariance. The zero and diagonal inflated models did not improve the model fit. The model can be used to examine home-away effect, goodness of fit, attack and defense parameters.

통계모형의 전문스포츠 현장 적용 사례

  • Eom, Han-Ju;Jo, Jeong-Hwan;Sin, Seung-Yun
    • Proceedings of the Korean Statistical Society Conference
    • /
    • 2002.11a
    • /
    • pp.51-59
    • /
    • 2002
  • 스포츠 통계와 관련된 주제들은 경기결과의 설명 예측력 분석, 선수와 팀 평가, 경기내용의 요약, 경기의 환경적 요인 평가, 경기 규칙 분석, 경기결과의 시각적 표현, 토너먼트 구조 연구 등의 영역으로 다양하다. 이 글에서는 야구 축구 농구 테니스 종목을 중심으로 통계적 방법을 적용하여 스포츠 현장의 문제를 어떻게 접근하고 있는지를 개괄하였다. 전통적으로 스포츠 통계는 선수와 팀 그리고 전략의 평가가 주를 이루었지만, 오늘날 스포츠통계는 경기의 잠재적, 외적 요인 등의 주제로 관심영역을 넓혀가고 있다. 그러나 국내에서는 경기내용의 기술적(descriptive) 분석이 주류를 이루고 있으며, 전문적 수준의 통계적 접근은 활발하지 못하고 있다. 현장과 자료의 특성을 고려하여 통계와 스포츠(체육)분야의 다양한 협동연구가 필요하다.

  • PDF

Predication of win/lose of Professional baseball using Heuristic model (Heuristic model를 이용한 프로야구 승패 예측)

  • Kim, Dong-Sik;Hong, Seok-Mi;Jung, Tae-Chung
    • Proceedings of the Korea Information Processing Society Conference
    • /
    • 2000.10a
    • /
    • pp.325-328
    • /
    • 2000
  • 프로야구경기의 승패 예측의 문제는 그리 쉬운 일이 아니다. 왜냐하면 경기에 영향을 미치는 요소가 무한하기 때문이다. 예를 들어, 경기당일의 선수들의 컨디션이나 사기, 경기당일의 날씨, 구장요건, 상대팀에 대한 심리적 요인등 사전에 경기영향을 미치는 요소가 무한하다. 본 연구실에서는 과거 경기기록 자료를 기반으로 유용한 규칙을 찾아내어 분류트리를 만들어 학습하는 ID3 알고리즘을 프로야구 승패예측 시스템 구성에 사용하여 보았으나, 이산적인 자료의 처리로 인해 연속적인 경기자료를 고려하지 못하는 문제로 예측율이 더이상 향상되지 않았다. 따라서, 본 논문에서는 휴리스틱 방법을 이용한 경기전 예측과 경기중 예측을 이닝별 득점으로 세분화하여, 실제 경기상황을 고려한 일반적인 예측모형을 만들어 예측율을 향상시키고자 한다. 향후에는 더욱 세분화시켜 Case-based에 의한 예측을 하고자 한다.

  • PDF

Value Evaluation Model for Korean Professional Baseball Players (한국프로야구선수의 가치평가모형)

  • Oh, Taeyeon;Lee, Young Hoon
    • Journal of Labour Economics
    • /
    • v.39 no.2
    • /
    • pp.113-139
    • /
    • 2016
  • The purpose of this study is to establish evaluation model that can explain marginal effects of baseball players of Korean Baseball Organization (KBO) on their team winnings in terms of productivity. We proposed econometric model with using variables that are provided from official homepage of KBO to overcome the complexity of mainly used productivity index: wins above replacement (WAR). Also, compare to the previous studies such as Scully(1974) or Krautmann(1999) that using limited indices of baseball stats, this study included 61 indices that are provided from official homepage of KBO. We estimated regression based WAR(RBWAR) by conducting panel regression with each team's statistics data of 2002 to 2014. As a results, RBWAR shows 0.869 correlation coefficient for batters and 0.882 for pitchers with WAR in 2014 that can be concluded that two indices shows similar results. From the results of estimation, we analyze the relationship between productivity and actual contract of free agent players in 2015 and it showed that teams have contracted reasonably.

  • PDF