• Hi Guest Just in case you were not aware I wanted to highlight that you can now get a free 7 day trial of Horseracebase here.
    We have a lot of members who are existing users of Horseracebase so help is always available if needed, as well as dedicated section of the fourm here.
    Best Wishes
    AR

AD's daily selections

We had two winners with Romeo Coolio and Francesco Barracca. What this is telling us is that the models can identify winners with Italian names but are otherwise crap. They are going in the bin. We are 9 of 20 for the thread but a -10% ROI and its because my effort to look at top weights earlier identified that in races over 14 runners almost half the time it just picks the top weight. It is making the exercise pointless. So I'm probably about 95% of the way back to the drawing board. Again, suspending and this time I sadly expect the break to be a lot longer.
 
These were the previous 2 days ratings which take account of a vast array of different things. What the model does is takes the racecards and my personal HRB ratings for the day, loads them all up and compares them with the vast database I've now compiled, and checks for all kinds of possible features of importance in a deterministic regressor before presenting the output. So for Flat Handicaps run today, it decided that these were the main important factors. I create my own race z-scores and ELO ratings for horses, jockeys, trainers, stallions and damstallions, so you'd have to create your own, and if you know HRB you'll see it has a stat attack feature and I get it to calculate the different stat attacks myself but it also considers things like more recent form, percentage of runners beaten (and squared), and previous odds / pedigree. You can see from this model that for these types of race today it places great weight on the last HRB score achieved.

You'll also see how often my top rated horse is the market leader, but in the list of determining factors below, the model is completely blind to today's actual odds.

1777412742094.png

So the previous two days are attached, the ratings are rudimentary and converted to resemble Timeform figures as opposed to being on anything like their scale, but even so, it's achieved a 37% strike rate and a 22% return on investment. Where horses haven't won, it hasn't had to go too far down the list most often to find the success. I'll try and post tomorrow's sheet before going off to work.

The HTML is still being worked on but I am developing it to provide commentary on each horse's chances. When done it will rate the horses accurately based on their chances on the day, and give strong evidence for why a horse is being backed.
 

Attachments

  • Live_Analysis_2026-04-28.xlsx
    32.7 KB · Views: 5
  • Live_Analysis_2026-04-27.xlsx
    24.9 KB · Views: 4
1777413364819.png
An example of my HTML output. You can see how it matches the spreadsheet ratings and this one went to On Message. But you can see how it will look at all the data and draft basic comments on how the horse might perform. This is an area in development.
 
Trying a second time. Today's full ratings attached. For now don't worry too much about the ratings, they are fairly proprietory and it's more getting horses in a ranking order though the gaps between horses are going to likely be more telling. If a horse is highlighted in yellow, the model will see it as more of an each way bet.

I am working on getting the htmls converted to PDFs when the model runs and when done for individual meetings the PDFs should be small enough for me to introduce. I think we'll get a lot of winners today but the question of whether they will be profitable is another matter of course.

These are the main features informing today's handicaps over jumps - it looks at the last 4 years of those races and identifies the most important factors. As a slight shock, I've found it's not actually pulled the 14, 30 and 180 day trainer and jockey stats through - I think it will see the z-scores for those periods and start weaving them into the forecasts.

1777447954458.png
 

Attachments

  • Live_Analysis_2026-04-29.xlsx
    32.5 KB · Views: 6
It also has an 82% correlation with morning line odds across all runners, which seems almost unbelievable, so I am going to try and run the model later but strip those morning odds out of today's info all together to make sure it's not just inadvertently peeking at that information too. However, if it was, it'd be the sort of red flag that would appear in the feature engineering.

To clarify the more recent trainer jockey info is missing from the master matrix which is why it appears not be in the feature importance list.
 
Back
Top