• Hi Guest, The forum will be moving hosts on 26 July and as such will be closed from Midday until the move has completed.
    As we will be with new hosts it may take a while before DNS get updated so it could take while before you can get back on the forum.
    I think it will take at least 4 hours but could easily be 48!
    Ark Royal
  • There seems to be a problem with some alerts not being emailed to members. I have told the hosts and they are investigating.
  • Sorry for the ongoing issues that you may have been experiencing whilst using the forum lately

    It really is frustrating when the forum slows down or Server Error 500 pops up.

    Apparently the hosts acknowledge there is a problem.
    Thank you for using our services and sorry for the experienced delay!
    Unfortunately, these errors are due to a higher server load. Our senior department knows about the issue and they are working towards a permanent resolution of the issue, however, I'd advise you to consider using our new cPanel cloud solutions: https://www.tsohost.com/web-hosting


    I will have to investigate what the differences are with what We have know compared to the alternative service they want us to migrate to.
    Keep safe.
    AR

Scraper/Method to obtain Top Speed and RPR from Racing Post Website into Excel

daveat91

Yearling
Hi Jock12345

I can get the top rated no problem with the formula that I have I just can't get it correct when there are multiple horses rated the same so they need to have the same rating and it's only for OR TS RPR that I need to do it for...What I am doing at the moment is getting the data using the formula I have to rank them, then I manually have to go through each race to see if more that one horse is on the same rating and correct it manually.. I have to do this for each of the three columns which is a pain, would be much easier with a better formula
 
112​
1​
112​
1​
<----------IF(A2=A1,B1,COUNT($A$1:A2))
110​
3​
109​
4​
109​
4​
109​
4​
108​
7​
107​
8​
90​
9​
90​
9​
87​
11​
 

daveat91

Yearling
Thanks

If I have the data like this where do I put the formula I tried putting it in the cell after RPR but nothing happened

CourseTimeNameORTSRPR
Goodwood01:10Declared Interest8371103
Goodwood01:10Separate9579102
Goodwood01:10Secret Return7977101
Goodwood01:10Im Available7781101
Goodwood01:10Wasaayef9391100
Goodwood01:10Angel Power9094100
Goodwood01:10Tomorrows Dream8471100
Goodwood01:10Agent Of Fortune7659100
Goodwood01:10Queen Of Silca7793100
Goodwood01:10Hateya957898
 

daveat91

Yearling
CourseTimeNameORTSRPROrder
Goodwood01:10Declared Interest8371103IF(A2=A1,B1,COUNT($A$1:A2))
Goodwood01:10Separate9579102
Goodwood01:10Secret Return7977101
Goodwood01:10Im Available7781101
Goodwood01:10Wasaayef9391100
Goodwood01:10Angel Power9094100
Goodwood01:10Tomorrows Dream8471100
Goodwood01:10Agent Of Fortune7659100
 
1595972330592.png

G2 =IF(F2="","",1)
Then put the formula in bar into G3 and fill down (make sure $ signs are around $F$2 in count formula only ).
***You have to link the formula with the corresponding cells***

Greenfield...... Saddleworth ??????
Ex Saddleworth myself, more Hare&Hounds Uppermill than Wellington Greenfield
 
Last edited:

daveat91

Yearling
Small world eh I live on Beech Ave... Used to go in The Railway for the live turns

Thanks that works great now...much appreciated :):)


 

daveat91

Yearling
Actually no it didn't work...it did for the first race then it just kept going and did not reset for the next race...do I need to put the first formula in for the start of each race?

CourseTimeNameORTSRPROrder
Goodwood01:10Declared Interest8371103
1​
Goodwood01:10Separate9579102
2​
Goodwood01:10Secret Return7977101
3​
Goodwood01:10Im Available7781101
3​
Goodwood01:10Wasaayef9391100
5​
Goodwood01:10Angel Power9094100
5​
Goodwood01:10Tomorrows Dream8471100
5​
Goodwood01:10Agent Of Fortune7659100
5​
Goodwood01:10Queen Of Silca7793100
5​
Goodwood01:10Hateya957898
10​
Goodwood01:10Alabama Whitman888296
11​
Goodwood01:10Vividly847295
12​
Ayr01:30Epeius68073
13​
Ayr01:30Highland Acclaim597073
13​
Ayr01:30Prissy Missy626973
13​
Ayr01:30Dilithium634272
16​
Ayr01:30Jessie Allan466772
16​
Ayr01:30Star Cracker516071
18​
Ayr01:30Sienna Dream566269
19​
Ayr01:30Rapid Russo654968
20​
Ayr01:30Garnock Valley524868
20​
Ayr01:30Lady Calcaria664167
22​
Ayr01:30Reely Bonnie635064
23​
 
Actually no it didn't work...it did for the first race then it just kept going and did not reset for the next race...do I need to put the first formula in for the start of each race?

CourseTimeNameORTSRPROrder
Goodwood01:10Declared Interest8371103
1​
Goodwood01:10Separate9579102
2​
Goodwood01:10Secret Return7977101
3​
Goodwood01:10Im Available7781101
3​
Goodwood01:10Wasaayef9391100
5​
Goodwood01:10Angel Power9094100
5​
Goodwood01:10Tomorrows Dream8471100
5​
Goodwood01:10Agent Of Fortune7659100
5​
Goodwood01:10Queen Of Silca7793100
5​
Goodwood01:10Hateya957898
10​
Goodwood01:10Alabama Whitman888296
11​
Goodwood01:10Vividly847295
12​
Ayr01:30Epeius68073
13​
Ayr01:30Highland Acclaim597073
13​
Ayr01:30Prissy Missy626973
13​
Ayr01:30Dilithium634272
16​
Ayr01:30Jessie Allan466772
16​
Ayr01:30Star Cracker516071
18​
Ayr01:30Sienna Dream566269
19​
Ayr01:30Rapid Russo654968
20​
Ayr01:30Garnock Valley524868
20​
Ayr01:30Lady Calcaria664167
22​
Ayr01:30Reely Bonnie635064
23​
Try paste this in G2, and copy down
=1+SUMPRODUCT(($B$2:$B$2000=B2)*($F$2:$F$2000>F2))
 
Last edited:

JOCK12345

Yearling
Hi Jock12345

I can get the top rated no problem with the formula that I have I just can't get it correct when there are multiple horses rated the same so they need to have the same rating and it's only for OR TS RPR that I need to do it for...What I am doing at the moment is getting the data using the formula I have to rank them, then I manually have to go through each race to see if more that one horse is on the same rating and correct it manually.. I have to do this for each of the three columns which is a pain, would be much easier with a better formula
Hi Dave, I am having the opposite problem when i rank the columns using the cloth numbers and you have two with the same percents ... say sire at the cse the sire may have 3 runners in the race but they all have the same percentage but using the cloth numbers i get the ranked 1,2,3. My level of expertise is copy and paste in excel lol . So I am afraid i cant help on that front.
 

daveat91

Yearling
=1+SUMPRODUCT(($B$2:$B$2000=B2)*($F$2:$F$2000>F2))

Great stuff Outlander that has done the trick :):) Thanks all for your much appreciated help
 

craigmuzza

Yearling
Thanks

If I have the data like this where do I put the formula I tried putting it in the cell after RPR but nothing happened

CourseTimeNameORTSRPR
Goodwood01:10Declared Interest8371103
Goodwood01:10Separate9579102
Goodwood01:10Secret Return7977101
Goodwood01:10Im Available7781101
Goodwood01:10Wasaayef9391100
Goodwood01:10Angel Power9094100
Goodwood01:10Tomorrows Dream8471100
Goodwood01:10Agent Of Fortune7659100
Goodwood01:10Queen Of Silca7793100
Goodwood01:10Hateya957898
Hi Dave,

How are you managing to extract that data from Racing post?

Thanks
 

daveat91

Yearling
Yes very easy to use all you have to do is put the date in you want and presto you have it...But watch out for duplicates... I have noticed at the big meetings it does duplicate the horses...I use conditional formatting to find them and delete
 

craigmuzza

Yearling
Looking back at this again, just getting around to try it out for the first time, the attached file opens, and automatically closes? Would you have any idea on what that could be?
 

daveat91

Yearling
Yes you should find the file in the unzipped folder that you unzipped the file to it will be there...I created a folder called Racing Post and unzipped the file into there...so now I just go to the folder and each day's file gets put in there once I have run the program
 

craigmuzza

Yearling
Yes you should find the file in the unzipped folder that you unzipped the file to it will be there...I created a folder called Racing Post and unzipped the file into there...so now I just go to the folder and each day's file gets put in there once I have run the program
Between me replying to you, and me seeing this, i had worked it out myself! Thanks for the help anyhow :)
 
T Theloantrader I suspect you need an Excel guru - which isn't my forte.

Scraping RP, Timeform etc depends largely on what you want to scrape from them - stuff they hide (like Timeform's ratings that are reserved for paying customers) can't be accessed unless you're into hacking, but the easily accessed stuff like racecards and results are generally available in exchange for time and effort in coding.

Somebody posted an Excel based results scraper on here the other day, check the other 'scraper' threads - I think it was on the 'which language is best for...' thread. (I'll see if I can find it for you after posting this).

To be absolutely honest other than haunting the scraper threads the only real option is to learn how to do it yourself, at least then you'd be able to fix things when sites get updated.

For your bit regarding how they change the URL's daily, the 'trick' (if you can call it that) here is that there will be a home page for a site that then links to the information you want, and that page will be constant. For example, scraping RP cards my program simply asks for the date (and I have coded in a 'press return if you just want tomorrow', as after all that is the day I am usually after) and I know the URL for the 'master' cards page will be at
https://www.racingpost.com/racecards/+the date I just entered, so for tomorrow that's
XXXhttps://www.racingpost.com/racecards/2020-07-20 - note the XXX is dummy stuff to stop the forum software turning the address into a link!

My program goes to that one page, reads through the text of the page to find the addresses of the individual race cards, then goes to each in turn and downloads them, stripping the information from each page and storing it into a spreadsheet. How others do the job I haven't a clue, but that's how I do the job in Python.

Each site, each job within a site, takes a bit of analysis to figure out formats and where/how to identify information, and then a fair bit of coding to make it all work. If you want to code the sort of way I do, then first you need to learn to program. For an Excel based sheet, you need somebody who uses Excel to do what I use Python for.

Dave

ps - I was right, it's post 8 in the 'preferred language for scraping' thread that has a link to an RP results scraper.
Hello, does anyone know how I can extract the following information from Timeform Race Passes in order to keep a results database Date, Horse, Course, Time Going, Raw rating, raw Timefigure, Any symbol All this information is available in my race passes but trying to harness it together to be...
 
Top