NBA Basketball Analytics
  • Home
  • About
  • Contact

Tracking the Miami Heat with Machine Learning

11/17/2017

0 Comments

 
Embed from Getty Images
My last few posts have been based around producing a scouting report from a machine learning algorithm I have put together.
The algorithm comes up with the biggest factors that are causing teams to win or lose games. I take those factors and find data based evidence to support the algorithms findings. Finally then producing a scouting report much like a coaches/scouting departments would for their team.

These machine learning scouts not only provide the biggest factors for winning and losing, but also provide teams with specific and tangible targets to hit. eg "We are 7-1 this season when we get to the free throw line 18 times a game" .

Coaches can use these targets to form offensive/defensive schemes or use certain lineups to try and hit these targets.

These targets can also be used to incentivize players before and during games, where coaches can track and continually harp on them during the game. Putting them up on the whiteboard pre-game and at half time to show how the team is tracking.

To give you an idea of how long these machine learning scouts are taking to put together:

Gather data, format data, run it through algorithm to get winning/losing factors = 10 minutes

Find data backed evidence to support the factors, set targets, write scout = 45 minutes

All up the machine learning based scout is put together in under an hour. It would interesting to compare  against the traditional process that scouting/coaches go through especially on back to back games where time is of the essence.

To stress test the algorithm a little further, I decided to run it and produce a machine learning scout for the Miami Heat, based on their first 10 games of the season. Then I have tracked the Heat vs the targets the scout set to see how they have fared over the next 5 games of the season.

When training the algorithm based on the Heats first 10 games, it produced a means squared error value of 1.84.

For non statistic nerds, basically this means the algorithm takes the actual statistics from the Heats games and without knowing the outcome, it is coming within 1.84 points of predicting the actual score. I'm pretty surprised with the performance to be honest, as it only has a small sample size of 10 games to work with.

Running the algorithm for other teams after 10 games, it usually gets somewhere between 5-10 points of the actual score, which is still pretty accurate but not at the accuracy I am seeing with Miami.

Here is the Miami Heat Scout I put together after the first 10 games:
Picture

There are 4 targets set for the Heat:
  • Take 34 or more 3's a game and I have assumed they hit them at their average rate of 39.6%
  • <10% of the Heats points to come from the mid-range
  • Grab 75% or more defensive rebounds
  • Get to the free throw line 18 or more times a game

As Miami were playing a rare back to back series against Washington, I have also put together a machine learning scout of the Washington Wizards and set the Heat some targets.

Wizards scout:
Picture

Targets for the Heat:
  • Keep Washington under 50% of total rebounds 
  • Miami to score 48 or more points in the paint
  • Limit Washington to 19.5% of points from the free throw line
  • Protect the ball and give up less than 7 steals

So we have all our targets, these can't just magically happen for teams. Coaches and staff will need to put things into place to help make them happen and communicate them to the players.

I have provided some data to help:
  • Washington get a lot of steals but give up the most points per possession on back cuts, so this can be exploited.
  • I have provided Miami's best lineup for when Hassan Whiteside is out of the game and the heat subsequently struggle on the boards.

Targets such as getting to the free throw line 18 times a game, again you can't just magically click your fingers and get to the free throw line more often. The Heat staff need to look deeper at why they aren't getting to the line and put things in place to improve it.

So after another 5 games, how did the heat fare?

Picture

There is a pretty clear trend happening. The factors the algorithm have found and subsequent targets I have set are co-relating to the outcome of the game.

When the Heat have only hit 2 or less targets they have lost the game. When they have hit the majority of the targets they have won.

The results are looking very favorable and are showing that the algorithm is pointing to the right factors that are causing teams to win and lose.

For my next post, I will re-train the algorithm. Adding the Heats last 5 games and see if the scout changes, and also monitor how the Heat fare in the next batch of games.
0 Comments



Leave a Reply.

    Archives:

    September 2017
    March 2017
    January 2017
    December 2016
    October 2016
    September 2016
    July 2016
    May 2016
    April 2016
    March 2016

    Social Stuff:
    Tweets by @zigzaganalytics
    Follow @zigzaganalytics

    RSS Feed

      Sign up for the latest posts:

    Subscribe
© COPYRIGHT 2015. ALL RIGHTS RESERVED.
  • Home
  • About
  • Contact