• Title/Summary/Keyword: 야구 데이터 분석

Search Result 54, Processing Time 0.023 seconds

Measurements for hitting ability in the Korean pro-baseball (한국프로야구에서 타자능력의 측정)

  • Lee, Jang Taek
    • Journal of the Korean Data and Information Science Society
    • /
    • v.25 no.2
    • /
    • pp.349-356
    • /
    • 2014
  • In baseball, sabermetric batting statistics are used to compare an offensive performance of players. There exist dozens of sabermetric statistics, but baseball fans don't like the complexity of an abundance of measures. This paper provides a batting grade index (BGI) using principal component based on eight batting statistics. These are OPS, ISO, SECA, TA, RC, RC/27, wOBA and XR. We show that how standardized batting statistics are aggregated and weighted to arrive at a single composite measure of BGI. Also our result allows for segmentation of players into groups using the K-means clustering algorithm.

Prediction of KBO playoff Using the Deep Neural Network (DNN을 활용한 'KBO' 플레이오프진출 팀 예측)

  • Ju-Hyeok Park;Yang-Jae Lee;Hee-Chang Han;Yoo-Lim Jun;Yoo-Jin Moon
    • Proceedings of the Korean Society of Computer Information Conference
    • /
    • 2023.01a
    • /
    • pp.315-316
    • /
    • 2023
  • 본 논문에서는 딥러닝을 활용하여 KBO (Korea Baseball Organization)의 다음 시즌 플레이오프 진출 확률을 예측하는 Deep Neural Network (DNN) 시스템을 설계하고 구현하는 방법을 제안한다. 연구 방법으로 KBO 각 시즌별 데이터를 1999년도 데이터부터 수집하여 분석한 결과, 각 시즌 데이터 중 경기당 평균 득점, 타자 OPS, 투수 WHIP 등이 시즌 결과에 유의미한 영향을 끼치는 것을 확인하였다. 모델 설계는 linear, softmax 함수를 사용하는 것보다 relu, tanh, sigmoid 함수를 사용했을 때 더 높은 정확도를 얻을 수 있었다. 실제 2022 시즌 결과를 예측한 결과 88%의 정확도를 도출했다. 폭투의 수, 피홈런 등 가중치가 높은 변수의 값이 우수할 경우 시즌 결과가 좋게 나온다는 것이 증명되었다. 본 논문에서 설계한 이 시스템은 KBO 구단만이 아닌 모든 야구단에서 선수단을 구성하는데 활용 가능하다고 사료된다.

  • PDF

The Effect of Daily Average Humidity on Pitcher's Stats of Strike-Out (일일 평균 습도가 투수의 탈삼진 기록에 미치는 영향)

  • Kim, Semin;You, Kangsoo
    • Journal of Industrial Convergence
    • /
    • v.18 no.1
    • /
    • pp.65-71
    • /
    • 2020
  • Recently, the field of using data has begun to attract attention in professional sports. In the field of data utilization, in addition to the classic records obtained within the economy, secondary records that emphasize efficiency are also actively used. Therefore, in this study, we try to study the correlation with the pitcher's strikeout ability through the daily average humidity, which is data outside the competition. For this reason, referring to the daily average record of the area of the home base of 10 teams belonging to the KBO league and the auxiliary stadium, the top 5 in the win, hold, save section to grasp the characteristics of the starting pitcher and the rescue pitcher We analyzed K / 9 records for each person. Through the results of this study, we found a significant difference in the K / 9 record between the starting pitcher and the rescue pitcher, and we can expect to investigate the use of professional sports data and develop the industry in general.

Implementation of Mahalanobis-Taguchi System for the Election of Major League Baseball Hitters to the Hall of Fame (메이저리그 타자들의 명예의 전당 입성과 탈락에 대한 Mahalanobis-Taguchi System의 적용과 비교)

  • Kim, Su Whan;Park, Changsoon
    • The Korean Journal of Applied Statistics
    • /
    • v.26 no.2
    • /
    • pp.223-236
    • /
    • 2013
  • Various statistical classification methods to predict election to the Major League Baseball hall of fame of are implemented and their accuracies are compared. Seventeen independent variables are selected from the data of candidates eligible for the hall of fame and well-known classification methods such as discriminant analysis and logistic regression as well as the recently proposed Mahalanobis-Taguchi system(MTS). The MTS showed a better performance than the others in classification accuracy because it is especially efficient in cases where multivariate data does not constitute directionally geographical groups according to attributes.

Sport Situational Analysis Using Artificial Intelligence : Focused on Football Expected Goal (인공지능을 이용한 스포츠 상황 분석 서비스 : 축구의 기대 득점을 중심으로)

  • Kim, Jin Sob;Kim, Min Jun;Lee, Kwanhyeong;Yoon, Yongsoo;Moon, Jaehyun
    • Proceedings of the Korea Information Processing Society Conference
    • /
    • 2020.11a
    • /
    • pp.826-829
    • /
    • 2020
  • 스포츠팀 운영에 있어서 경기 중 상황에 대한 통계와 분석을 통해 좋은 성과를 내는 것은 스포츠 야구 종목의 Sabermetrics를 통해 이미 증명된 바가 있다. 한편, 축구에서는 최근 들어 선수의 역량을 평가하기 위하여 객관적인 시각에서 슈터(Shooter)에게 주어진 기회, 즉 슈팅 상황을 바라보는 기대 득점(Expected Goal; 이하 xG)이라는 지표가 등장하였으나, 객관성이라는 평가 의도와 다르게 경기 내 각각의 슈팅 상황을 정의하는 것에 있어 축구 분석관들의 주관성에 의존하는 한계성을 지녔다. 본 논문은 xG를 산출하는 방식에 있어서 기존의 주관성을 배제하고 인공지능을 통해 상황을 정의하여 객관적인 평가지표를 지향하며 유의미한 통계적 수치를 지닌 xG를 도출함으로써 결과 위주의 분석만이 존재하던 축구 종목에 있어서 경기 중 상황에 대한 객관적인 판단 및 정의에 대한 방향성을 제시한다. 또한, 본 논문에서의 인공지능은 국내 K리그 슈팅 데이터를 통해 학습되어 K리그 내 전략적인 상황들에 대한 특화된 xG를 도출하며, 이를 웹을 통해 K리그 내 선수 개개인에 대해서 시계열, 상대 팀, 슈팅 위치별 그래프로 시각화하여 제공하는 시스템을 구축함으로써 K리그를 기준으로 선수에 대한 평가 및 경기 운영에 기여할 수 있는 기대 득점 분석 서비스를 제공한다.

The Analysis on Sport Emotion Type by Sport Game Characteristics: with Social Big-Data (스포츠 경기의 특성에 따른 스포츠 감정 유형 분석 : 소셜 빅데이터를 중심으로)

  • Kim, Young-Mee;Yang, Jae-Sik
    • Journal of Digital Convergence
    • /
    • v.19 no.7
    • /
    • pp.371-377
    • /
    • 2021
  • This study tried to analyze the types of sport emotion by sport game characteristics. For that, 7 soccer games and 6 baseball games of Korean team in 2018 Asian Games were selected, and the articles and their replies about those on social network services were collected as study materials. Python was used for the collecting and expert group meeting was held for the emotion analysis. As the results of the analysis on sport emotion types by win or lose, the level of opponents and the performance of Korean team as game characteristics, the following conclusions were drawn. First, it was hard to say that win or lose and opponent's level make certain sport emotion type. Second, The performance could made contended, enthusiastic and joyful emotions when judged good, but frustrated, angry, humiliated emotions when bad. Third, social·cultural background or certain event of the games also could effect on the sport emotion types. Follow-up studies with the other game characteristics and more game cases were needed to find out more clear causal relationship.

Norm-referenced criteria for strength of the elbow joint for the korean high school baseball players using the isokinetic equipment: (Focusing on seoul and gyeonggi-do) (등속성 장비를 이용하여 한국고교야구선수 주관절 근력 평가기준치 설정: (서울 및 경기도 중심으로))

  • Kim, Su-Hyun;Lee, Jin-Wook
    • Journal of the Korea Academia-Industrial cooperation Society
    • /
    • v.18 no.10
    • /
    • pp.442-447
    • /
    • 2017
  • The purpose of this study was to establish norm-referenced criteria for the isokinetic strength of the elbow joint in Korean high school baseball players. Two hundred and one high school baseball players participated in this study, none of whom had any medical problem with their upper limbs. The elbow flexion/extension test was conducted four times at a speed of $60^{\circ}/sec$. The HUMAC NORM (CSMI, USA) system was used to obtain the values of the peak torque and peak torque per body weight. The results were presented as norm-referenced criterion valuesusing the 5-point scale of Cajori which consists of five stages (6.06%, 24.17%, 38.30%, 24.17%, and 6.06%). In the results of this study, the peak torques of the elbow (flexor and extensor?) at an angular velocity of $60^{\circ}/sec$ were $37.88{\pm}8.14Nm$ and $44.59{\pm}11.79Nm$, and the peak torque per body weight of the elbow (flexor and extensor?) were $50.06{\pm}8.66Nm$ and $58.28{\pm}12.84Nm$, respectively. The reference values of the peak torque and peak torque per body weight of the elbow flexor and extensor were setat an angular velocity of $60^{\circ}/sec$. On the basis of the results analyzed in this study, the following conclusions were drawn. There is a lack of proper studies on the elbow joint strength, even though the most common injury in baseball players occurs in the elbow joint. Therefore, we need to establish a standard muscle strength in order to prevent elbow joint injuries and improve their performance. The criteria for the peak torque and peak torque per body weight established here in will provide useful information for high school baseball players, baseball coaches, athletic trainers and sports injury rehabilitation specialists in injury recovery and return to rehabilitation, which can beutilized as objective clinical assessment data.

Variable selection with quantile regression tree (분위수 회귀나무를 이용한 변수선택 방법 연구)

  • Chang, Youngjae
    • The Korean Journal of Applied Statistics
    • /
    • v.29 no.6
    • /
    • pp.1095-1106
    • /
    • 2016
  • The quantile regression method proposed by Koenker et al. (1978) focuses on conditional quantiles given by independent variables, and analyzes the relationship between response variable and independent variables at the given quantile. Considering the linear programming used for the estimation of quantile regression coefficients, the model fitting job might be difficult when large data are introduced for analysis. Therefore, dimension reduction (or variable selection) could be a good solution for the quantile regression of large data sets. Regression tree methods are applied to a variable selection for quantile regression in this paper. Real data of Korea Baseball Organization (KBO) players are analyzed following the variable selection approach based on the regression tree. Analysis result shows that a few important variables are selected, which are also meaningful for the given quantiles of salary data of the baseball players.

Top batter select through the BAI in 2016 KBO -Focusing on the sabermetrics statistics WAR (2016 KBO 최고 타자의 타격능력선수는? - 대체선수대비승수 (WAR)을 중심으로)

  • Kim, Hyeon-Gyu;Lee, Jea-Young;Cho, Gyu-Tae
    • Journal of the Korean Data and Information Science Society
    • /
    • v.28 no.6
    • /
    • pp.1501-1509
    • /
    • 2017
  • Wins above replacement (WAR) is the most commonly used statistics of the sabermetrics that measure baseball players' abilities. The advantage of a WAR is that it enables to compare performances of players even though they have different roles such as pitcher and hitter. However, WAR is difficult to obtain with common records. Thus, a past studies (Lee and Kim, 2016) suggested the batting ability index to determine the ability of the batter focused on the sabermetrics statistics WAR. In this paper, we selected the best hitter with applying Korea baseball 2016 data based on a proposed model and then observed a total raking of others according to BAI. We are assured that BAI is very excellent statistics through comparing BAI and WAR which is in the spotlight in evaluating performances of players.

The Structural Relationships among the Variables of Fan Attachment, Location-Based Service, and Future Fan Behavior by Utilizing Technology Acceptance Model (TAM) (스마트경기장 환경에 따른 위치기반서비스 품질이 구단애착심 및 미래행동에 미치는 효과 분석)

  • Chang, Deok-Seon;Kwon, Tae-Geun;Jeon, Jong-Hwan;Park, Sung-Bae Roger
    • The Journal of the Convergence on Culture Technology
    • /
    • v.6 no.2
    • /
    • pp.231-238
    • /
    • 2020
  • The main purpose of this current study was to identify the structural relationships between the variables of team attachment, location-based service, and future fan behaviors by utilizing Technology Acceptance Model (TAM). Among the 10 KBO franchises, SK Wyverns and KT Wiz were qualified to have their own smart applications programs and the relevant infrastructure at their home venues. Thus, a total of 500 surveys were collected from SK Wyverns and KT Wiz games during September of 2019 and a total of 448 were used for data analysis after deleting 52 surveys due to the missing data. According to the results of a structural equation modeling, 12 positive (+) causality out of 14 hypotheses were confirmed that there must be causal relationships among the variables of location-based service at the smart stadium, TAM, fan attachment, and future fan behavior. It is hoped that this study can be contributing to the foundational developments of marketing strategies by adopting the new technological advancement in the Korean sport industry in the future.