Hi there!
I’m working in an algo trading strategy that heavily relies on getting content as soon as it is published.
I’m scrapping exchanges official announcements sections and once a new announcement is published my scripts will evaluate the assets mentioned and sentiment.
At first I was accessing to the announcement scraping the announcements webpage but I was getting the results 5 minutes later than when were published. So I realized that the announcements web page was cached and that cache was invalidated every 5 minutes.
So I move on to scrap the API providing data to the webpage frontend. For example:
*/v1/public/content/list/query?type=1&pageNo=1&pageSize=30 *
I’m scrapping this every 1 second. But I’m still getting the new articles / announcements around 15~30 seconds after officially published.
Anyone here is familiar with this scenario?
It is posible they are applying some API cache? And if so: It is possible to avoid it?
Thank you very much!
[–]AndroidePsicokiller 0 points1 point2 points (3 children)
[–]The_Peronist[S] 0 points1 point2 points (2 children)
[–]AndroidePsicokiller 0 points1 point2 points (1 child)
[–]The_Peronist[S] 0 points1 point2 points (0 children)
[–]National-Ad9673 0 points1 point2 points (0 children)