Is there a reason you're saying this? I'm not saying your not right, I just was wondering what the background is here.
(And if you say, "I walked by his office and he was playing solitaire on windows 3.1 - I will accept that)
He got started in 2004 with Dodgers, using a different suite of stats to pick out players and this was revolutionary at the time.
But 15 years later, it's... underwhelming. I've never seen anything to indicate he's moved beyond that approach in both finding "new" statistics (ie espn' "Expected Gains" which is a respin of Basketball's Plus-Minus reflected in entire Offense / Defense / ST units) ... or in being focused on finding the right player more frequently.
One problem is that Baseball is a 1v1 sport most of the time. Pitcher vs Batter. It's an oversimplification, but it's one that's useful for statistics. Basketball becomes a bit more complex with 5v5... but Football jumps the shark completely. There's so much more going on in any given football play... the guys who don't have the ball are just as important as the one who does. DePodesta's approach seems utterly incapable of capturing the trenches, or the influence of OC vs DC mind games.
And I suspect this complexity is why a lot of us love football. It's the only sport I can think of where the Coaching Staff is a vital part of the team, a player themselves - though not on the field.
There's a lot of interesting things you could do, even with just vision processing that focuses on all22 footage, tracking the players, and having an intern add a few pieces of information (down and distance, yards gained, type of play - designed pass/run/broken)... and see what the computer finds as "meaningful" predictors. There's 256 games a season... go back 10 seasons, run the tape, and see what it picks out. Then start refining what the ML identified to pick out better features and rerun the same tests. You can run these tests 24hrs a day 365 days a year. Move to simulating 3 weeks into a season against a team with a new QB. The amount of cash, time, and good data available... everyone already has box scores and all22 footage... could make t a hotbed for Deep Learning and Neural Networks. Maybe I'm wrong, but I assume the guys in the booth are allowed to use a computer?
But even this raises some questions. Let's say your model predicts 80/20 run from a given formation and set with certain tells. And your DC lines up to stop the run, and they throw over your head with no Safety protection for a back-breaking 86 yard TD -- because the OC noted your reaction to his tendencies at halftime.
ML wouldn't pick up on things like Bud Foster running bear against us in 2014 either... if there's no sample data from somebody else doing it first, it's highly unlikely a model could predict that as an effective counter... and if you had a model that somehow did, it's unlikely to know the risk associated with doing it.
Upvote
0