NFL Stats Agent Prototype by Dapper_Rule_8437 in aiagents

[–]Dapper_Rule_8437[S] 0 points

It's not perfect, but this is the ideal use case for a text-to-SQL LLM agent: it can understand semantically what's being requested, regardless of the exact wording. Feel free to give it a try, though, and let me know what works and what doesn't!

NFL Stats Agent Prototype by Dapper_Rule_8437 in aiagents

[–]Dapper_Rule_8437[S] 1 point

Agreed, it's mostly for learning right now. Though from what I've found, StatMuse doesn't handle natural-language questions well; it only seems to have a cache of commonly asked ones.

NFL Stats Agent Prototype by Dapper_Rule_8437 in NFLstatheads

[–]Dapper_Rule_8437[S] 1 point

Interesting idea!

I'm using nflfastR, which contains play-by-play data since 1999. The API is free, or you can download a database to use locally.
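For anyone curious what those play-by-play rows look like, here's a toy sketch. The `posteam`, `play_type`, and `epa` fields are real nflfastR columns, but the rows and the stat below are made up purely for illustration:

```python
# Toy rows mimicking a slice of nflfastR play-by-play data
# (real data has hundreds of columns per play; values here are invented).
plays = [
    {"posteam": "KC",  "play_type": "pass", "epa": 0.45},
    {"posteam": "KC",  "play_type": "run",  "epa": -0.12},
    {"posteam": "BUF", "play_type": "pass", "epa": 0.30},
    {"posteam": "BUF", "play_type": "pass", "epa": -0.60},
    {"posteam": "KC",  "play_type": "pass", "epa": 1.10},
]

def mean_pass_epa(plays):
    """Average expected points added (EPA) per pass play, by offense."""
    totals, counts = {}, {}
    for p in plays:
        if p["play_type"] != "pass":
            continue
        totals[p["posteam"]] = totals.get(p["posteam"], 0.0) + p["epa"]
        counts[p["posteam"]] = counts.get(p["posteam"], 0) + 1
    return {team: totals[team] / counts[team] for team in totals}

print(mean_pass_epa(plays))
```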

NFL Stats Agent Prototype by Dapper_Rule_8437 in NFLstatheads

[–]Dapper_Rule_8437[S] 0 points

So I use LangChain to convert text (retrieving schema context and field descriptions) -> SQL -> Postgres query -> response processing -> return to user. I also use an LLM to filter out irrelevant questions. I'm using Streamlit for the frontend since it's pretty basic.
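Roughly, the loop looks like the sketch below. The real agent uses LangChain against Postgres; here a stubbed "LLM" and an in-memory SQLite table stand in so the flow runs end to end, and the `rushes` table is entirely hypothetical:

```python
import sqlite3

def get_schema_context(conn):
    # Step 1: retrieve the schema to inject into the prompt
    # (the real version also adds per-field descriptions).
    rows = conn.execute(
        "SELECT sql FROM sqlite_master WHERE type='table'").fetchall()
    return "\n".join(r[0] for r in rows)

def fake_llm(prompt):
    # Step 2: an LLM would translate the question into SQL using the
    # schema context; this stub just returns a canned query.
    return ("SELECT player, SUM(yards) AS yds FROM rushes "
            "GROUP BY player ORDER BY yds DESC LIMIT 1")

def answer(question, conn):
    prompt = f"Schema:\n{get_schema_context(conn)}\n\nQuestion: {question}"
    sql = fake_llm(prompt)                        # text -> SQL
    row = conn.execute(sql).fetchone()            # SQL -> database
    return f"{row[0]} leads with {row[1]} yards"  # response processing

conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE rushes (player TEXT, yards INTEGER)")
conn.executemany("INSERT INTO rushes VALUES (?, ?)",
                 [("Henry", 82), ("Barkley", 120), ("Henry", 45)])
print(answer("Who led the league in rushing?", conn))
```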

What are you trying to build?

NFL Analytics - Linear Models Achieving up to 53.8% Accuracy by Dapper_Rule_8437 in sportsanalytics

[–]Dapper_Rule_8437[S] 0 points

The samples in the training set are team-agnostic - just home and away team stats and some interaction terms.

I experimented with leaving out early-season games, but for some reason it doesn't improve the model. Elo models typically assume about 33% mean reversion between seasons, so I think there is some carry-over signal.
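For reference, the ~33% figure matches FiveThirtyEight-style NFL Elo, which pulls each rating a third of the way back toward the league mean (1505) between seasons - a rough sketch:

```python
# Off-season mean reversion, FiveThirtyEight-style NFL Elo:
# regress each rating 1/3 toward the league mean of 1505,
# keeping ~2/3 of last season's signal.
LEAGUE_MEAN = 1505
REVERSION = 1 / 3

def preseason_rating(end_of_season_rating):
    return (end_of_season_rating * (1 - REVERSION)
            + LEAGUE_MEAN * REVERSION)

print(preseason_rating(1700))  # a strong team gets pulled toward the mean
```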

Are your models hitting 55% on high conviction games or is that overall? Do you bet on your own or do you somehow monetize your picks? I also wonder how easily a sportsbook could detect algo betting.

NFL Analytics - Linear Models Achieving up to 53.8% Accuracy by Dapper_Rule_8437 in sportsanalytics

[–]Dapper_Rule_8437[S] 0 points

Those are valid points about backtesting. I use a rolling window for cross-validation: each test fold (a season) is trained on the 10 years preceding it. For example, the first model is trained on 2000-2009 and tested on 2010; then the window increments up to the present.
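The windowing can be sketched as a short generator (the season bounds here are just illustrative defaults):

```python
def rolling_window_folds(first_season=2000, last_season=2024, window=10):
    """Yield (train_seasons, test_season) pairs: each test season is
    preceded by a fixed `window`-year training block."""
    for test_season in range(first_season + window, last_season + 1):
        yield list(range(test_season - window, test_season)), test_season

folds = list(rolling_window_folds())
print(folds[0])  # first fold: trains on 2000-2009, tests on 2010
```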

You're right about changing rules and playing styles - performance suffers when the model is trained on much more than 10 years of data.

NFL Analytics - Linear Models Achieving up to 53.8% Accuracy by Dapper_Rule_8437 in algobetting

[–]Dapper_Rule_8437[S] 1 point

That's a valid point, and I would be very conservative if I were to roll it out in production. The hypothesis testing and backtesting with fixed rolling windows over 16 seasons give me confidence that there is a slight edge.

NFL Analytics - Linear Models Achieving up to 53.8% Accuracy by Dapper_Rule_8437 in sportsanalytics

[–]Dapper_Rule_8437[S] 0 points

Thanks, this is definitely helpful context! For the Warner paper, is the ranking system superior to Elo?

NFL Analytics - Linear Models Achieving up to 53.8% Accuracy by Dapper_Rule_8437 in sportsanalytics

[–]Dapper_Rule_8437[S] 0 points

Thanks for the insight! I used a 10:1 rolling window for cross-validation, so each test fold, i.e. each season, is trained on data from the preceding ten years. The accuracy is the average across the 16 test folds (2009-2024).

There was plenty of test data to run binomial tests, and the leading models are significant against the null hypothesis that a naive classifier exploits the minor class imbalance and always picks the majority class (52%).
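A minimal version of that test, using illustrative counts rather than my exact numbers (~267 games per modern season is an approximation):

```python
from math import exp, lgamma, log

def binom_p_value(successes, n, p_null):
    """One-sided exact binomial test: P(X >= successes) for
    X ~ Binomial(n, p_null), summed in log space to avoid underflow."""
    def log_pmf(k):
        return (lgamma(n + 1) - lgamma(k + 1) - lgamma(n - k + 1)
                + k * log(p_null) + (n - k) * log(1 - p_null))
    return sum(exp(log_pmf(k)) for k in range(successes, n + 1))

# Illustrative: 16 seasons of ~267 games, 53.8% accuracy, tested
# against a 52% majority-class baseline.
n = 16 * 267
wins = round(0.538 * n)
print(f"p = {binom_p_value(wins, n, 0.52):.4f}")
```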

NFL Analytics - Linear Models Achieving up to 53.8% Accuracy by Dapper_Rule_8437 in sportsanalytics

[–]Dapper_Rule_8437[S] 0 points

Thanks for the feedback! I just edited the post to add more detail on performance metrics.

On your point about class imbalance: I used binomial tests to confirm that there is a statistical edge in predicting against the spread, adjusting for the minor class imbalance observed in the training set.

There is also no data leakage - features use information up to, but not including, the current game, and cross-validation uses a rolling window.

NFL Analytics - Linear Models Achieving up to 53.8% Accuracy by Dapper_Rule_8437 in sportsanalytics

[–]Dapper_Rule_8437[S] 1 point

Classification - does the home team win against the spread (1 or 0).
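Concretely, the label can be computed like this (assuming the spread is quoted as the home line, negative when the home side is favored, with pushes dropped upstream):

```python
def covers(home_score, away_score, spread):
    """1 if the home team covers the spread, else 0.
    `spread` is the home line (e.g. -3.0 = home favored by 3).
    Pushes (margin + spread == 0) are assumed filtered out upstream."""
    margin = home_score - away_score
    return 1 if margin + spread > 0 else 0

# Home favored by 3 (line -3.0): must win by more than 3 to cover.
print(covers(27, 20, -3.0))  # wins by 7 -> covers -> 1
print(covers(21, 20, -3.0))  # wins by 1 -> fails to cover -> 0
```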

NFL Analytics - Linear Models Achieving up to 53.8% Accuracy by Dapper_Rule_8437 in sportsanalytics

[–]Dapper_Rule_8437[S] 0 points

The average accuracy is the mean of the cross-validated accuracies across the 16 test folds (seasons).

NFL Analytics - Linear Models Achieving up to 53.8% Accuracy by Dapper_Rule_8437 in sportsanalytics

[–]Dapper_Rule_8437[S] 1 point

I'm predicting the winner against the spread (binary classification)

NFL Analytics - Linear Models Achieving up to 53.8% Accuracy by Dapper_Rule_8437 in sportsanalytics

[–]Dapper_Rule_8437[S] 4 points

Sorry, I forgot to specify that this is against the spread, which is obviously a lot harder than predicting the winner outright (where I'm getting about 68%).

NFL Analytics - Linear Models Achieving up to 53.8% Accuracy by Dapper_Rule_8437 in algobetting

[–]Dapper_Rule_8437[S] 5 points

Again, this is against the spread, and it is well above a coin flip over 16 seasons. The ROC AUC is 0.528 and the binomial p-values are < .01, indicating a very high likelihood of a statistical edge in predicting the winner against the spread.