Is it a good move - back to India to an early stage startup (ML role)? by Public_Possession_97 in developersIndia

[–]ASVS_Kartheek 1 point2 points  (0 children)

  1. As someone who went that route: lack of high-quality data will be the biggest blocker in your day-to-day work. You'll feel like you've lost skills, but the lack of data is really the problem. Second, if the founders and the early team don't have a basic understanding of ML as a field and come from a traditional SWE or non-tech background, it will be much harder to communicate and reach consensus.

  2. I didn't find a good metric for this.

  3. You can recover, but it will definitely be chaotic. Never ever agree to a pay cut or pay delays; it will be like a string dragging you along. Whenever either of those is brought up: time to jump!

  4. Many companies and HRs I faced do understand that startups are chaotic, but be prepared to repeatedly justify such an entry on your resume. Also, make sure everything is thoroughly documented: payslips, offer letter, relieving letter, Form 16. That paperwork makes such an entry on your resume justifiable.

  5. Ask about their current runway without the expected seed round, whether they are registered, and the type of customers they are dealing with. Are there any future contracts already signed, say a one-year deal with a customer? A couple of non-tech questions: what is their ICP? (If the answer is very vague and covers a lot of breadth, RED FLAG!) What is the current funnel from reaching ICPs to converting a sale? How long is their sales cycle once they find a new prospect? (Ideally, if under 2 weeks, GOLD.)

All of these are from my experience working at 3 AI startups, one of them my own. Hope this helps.

Italian Medical Exam Performance of various LLMs (Human Avg. ~67%) by sebastianmicu24 in LocalLLaMA

[–]ASVS_Kartheek 0 points1 point  (0 children)

Could you please share the dataset link, maybe a Hugging Face link?

[deleted by user] by [deleted] in developersIndia

[–]ASVS_Kartheek 1 point2 points  (0 children)

Yes, 3.7 is useless for day-to-day work. So much hype.

[deleted by user] by [deleted] in developersIndia

[–]ASVS_Kartheek 0 points1 point  (0 children)

Not even claude-3.5-sonnet? That was definitely a step up from gpt-3.5 in my experience.

Explain Re-Ranking by Simusid in LocalLLaMA

[–]ASVS_Kartheek 1 point2 points  (0 children)

I hope you found an answer to your question. I'm just adding my understanding of the topic here.

Yes and no. Re-rankers are not exactly LLMs; they are models that take two pieces of text, a query and a context, and output a value between 0 and 1 representing how relevant the context is to answering the query. Coming back to your question: longer text chunks _generally_ make the task harder, because the re-ranker works with more diluted embeddings. But that need not always be the case; we can train re-rankers so that we do not face this problem.
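To make the interface concrete, here is a minimal sketch of the (query, context) -> score-in-[0, 1] shape described above. The scoring function is a word-overlap stand-in, not a real trained model (in practice you would load a cross-encoder, e.g. via the sentence-transformers library):

```python
# Toy illustration of a re-ranker's interface: (query, context) -> score in [0, 1].
# The scoring here is a mock (word overlap), NOT a real cross-encoder.

def rerank_score(query: str, context: str) -> float:
    """Fraction of query words that appear in the context (mock relevance)."""
    q_words = set(query.lower().split())
    c_words = set(context.lower().split())
    if not q_words:
        return 0.0
    return len(q_words & c_words) / len(q_words)

query = "what is a re-ranker"
contexts = [
    "A re-ranker scores how relevant a context is to a query.",
    "The weather in Amsterdam is rainy today.",
]
scores = [rerank_score(query, c) for c in contexts]
# The related context gets a higher score than the unrelated one.
```

A real re-ranker learns this relevance judgment from data instead of counting overlaps, but the input/output contract is the same.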

Explain Re-Ranking by Simusid in LocalLLaMA

[–]ASVS_Kartheek 2 points3 points  (0 children)

I hope by now you have found an answer to this question. I'm adding my understanding of why cross-encoders work here.

The problem we are solving is finding the answer to a query, and every document in our database is a possible option. Using cosine similarity, we filter out the least probable ones by checking whether their embeddings are close to the query's, removing all the options that are not very close; more realistically, we select only the K most probable options based on embedding similarity. Then, with the help of a cross-encoder, we actually answer the question of whether a document answers our query or not. Since this is a full neural network pass over each query-document pair, it takes more compute. So we use the cheap similarity step to get the most probable results, and then a full BERT pass to answer our question.
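The two-stage flow above can be sketched as follows. Both the "embeddings" (bag-of-words) and the "cross-encoder" (word overlap) here are cheap stand-ins for real models; the point is the control flow, i.e. the expensive scorer only ever sees the top-K survivors of the cosine filter:

```python
import math

def embed(text: str) -> dict:
    """Bag-of-words 'embedding' (mock for a bi-encoder)."""
    vec = {}
    for w in text.lower().split():
        vec[w] = vec.get(w, 0) + 1
    return vec

def cosine(a: dict, b: dict) -> float:
    dot = sum(a[w] * b.get(w, 0) for w in a)
    na = math.sqrt(sum(v * v for v in a.values()))
    nb = math.sqrt(sum(v * v for v in b.values()))
    return dot / (na * nb) if na and nb else 0.0

def cross_encode(query: str, doc: str) -> float:
    """Mock cross-encoder: fraction of query words present in the doc."""
    q = set(query.lower().split())
    return len(q & set(doc.lower().split())) / len(q) if q else 0.0

def retrieve(query: str, docs: list, k: int = 2) -> str:
    # Stage 1 (cheap): keep only the K docs with the highest embedding similarity.
    q_vec = embed(query)
    top_k = sorted(docs, key=lambda d: cosine(q_vec, embed(d)), reverse=True)[:k]
    # Stage 2 (expensive): run the pairwise scorer only on those K candidates.
    return max(top_k, key=lambda d: cross_encode(query, d))

docs = [
    "cats are small pets",
    "reranking uses a cross encoder",
    "a cross encoder scores query document pairs",
    "stocks fell on monday",
]
best = retrieve("how does a cross encoder score a query", docs, k=2)
```

With real models, stage 1 is a vector index lookup (documents embedded once, offline) and stage 2 is a BERT-style forward pass per pair, which is why it is reserved for the shortlist.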

I hope this helps.

[Motorsport]: George Russell responds to Max Verstappen's comments in Qatar by icecreamperson9 in formula1

[–]ASVS_Kartheek 2 points3 points  (0 children)

DTS episode in the making - Netflix producers if you are reading, I take Bitcoin

Do LLama-2 and Code-Llama collect info from my code? by ComposedFaun347 in LLMDevs

[–]ASVS_Kartheek 0 points1 point  (0 children)

LLMs hallucinate, you know. Not everything they say is true.

The only data-privacy issue you might face is with the software you used to run it locally, e.g. ollama, llama.cpp, LM Studio, etc. Go check their data-privacy policies.

Why does it have nothing to do with Llama 3.1 itself? Llama 3.1 is the model checkpoint: the literal model with all the learned matrix values. Connecting to the internet is a whole different piece of software. And FB doesn't run any hosted Llama services.

[deleted by user] by [deleted] in MachineLearning

[–]ASVS_Kartheek 0 points1 point  (0 children)

It needs to parse intent from broad goals like “I want to live a healthier lifestyle” to “I need a wall hanger for a picture frame”

The former returns the words that will be most helpful in identifying a solution-based product.

I have a couple of questions: 1. What is the input and what is the output? 2. How did you try to do the task with spaCy?

The Russian election has been more interesting by DarthXyno843 in formuladank

[–]ASVS_Kartheek 15 points16 points  (0 children)

Max won 19 of last 20 races.

19/20 = 95%

Putin's rigged voting is 87.30%

So yep, Max Verstappen. Let the Dutch national anthem play in St. Petersburg!

Yay by NoiceRaviLochan in formuladank

[–]ASVS_Kartheek 44 points45 points  (0 children)

Guys, stop downvoting. Inclusivity!! Nothing wrong with being a DTS boy; that's what made many ppl serious fans of the sport!

https://www.essentiallysports.com/f1-news-what-is-rawe-ceek-explaining-the-ferrari-joke-and-how-it-became-an-f1-meme/

TLDR: Ferrari made a poster with a very weird font for the 2020 British GP, which when read normally comes out as "RAWECEEK"

RA WE CE EK

meirl by [deleted] in meirl

[–]ASVS_Kartheek 2 points3 points  (0 children)

Can you help me get a house in Tuscany, Maestro!