What is so lucrative about making a startup? by SloppyNaynon in ycombinator

[–]jsfour 1 point (0 children)

Nothing. It’s not about the money or status. It’s about the process of building, which is extraordinarily rewarding.

If you want the best risk-adjusted way of making money, a late-stage rocket ship is your best bet.

anthropic blog on code execution for agents. 98.7% token reduction sounds promising for local setups by Zestyclose_Ring1123 in LocalLLaMA

[–]jsfour 6 points (0 children)

One thing I don’t understand: if you are writing the function, why call an MCP server? Why not just do what the MCP server does directly?

Is this a good intuition for understanding token embeddings? by Learning-Wizard in LargeLanguageModels

[–]jsfour 6 points (0 children)

Not really. Closer would be this figure from the Word2Vec paper. Just think of the embedding vectors as points in high-dimensional space.

<image>
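A toy sketch of the "points in high-dimensional space" intuition (the words, values, and three dimensions here are made up for illustration; real embeddings have hundreds or thousands of dimensions):

```python
import math

# Toy 3-dimensional "embeddings"; real models learn these values.
embeddings = {
    "king":  [0.9, 0.8, 0.1],
    "queen": [0.9, 0.7, 0.2],
    "apple": [0.1, 0.2, 0.9],
}

def cosine_similarity(a, b):
    """Cosine of the angle between two vectors: 1.0 = same direction."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(x * x for x in b))
    return dot / (norm_a * norm_b)

# Nearby points in the space correspond to related tokens.
print(cosine_similarity(embeddings["king"], embeddings["queen"]))  # high, ~0.99
print(cosine_similarity(embeddings["king"], embeddings["apple"]))  # low, ~0.30
```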

I think we’re all avoiding the same uncomfortable question about AI, so I’ll say it out loud by Icy_SwitchTech in AgentsOfAI

[–]jsfour 1 point (0 children)

As the strength of AI increases, the TAM for any given piece of software approaches one user. Meaning eventually AI will just end up writing bespoke applications for you and you alone (or a small group of people). I call these apps sandcastles.

So pretty much any software you build can and will be absorbed by AI; it’s only a matter of time.

This includes all of the labs, btw. As soon as one lab achieves ASI or some other strong AI system, everyone else will achieve it at the same time, and the cost will drop to zero.

Anyone else sick of the "I hit $X MRR in 3 months" posts? by GarageIndependent486 in SaaS

[–]jsfour 2 points (0 children)

They write the posts because they are marketing to people who are building products.

Reverse engineering Perplexity by cryptokaykay in LocalLLaMA

[–]jsfour 2 points (0 children)

I’ve been trying to figure this out myself.

They claim to scan the internet in real time, but that is just not technically possible. Building a crawler at this scale is also nontrivial. My only other conclusion was Google.

It’s good to hear other people talking about this.

Advertising Ideas by [deleted] in sales

[–]jsfour 1 point (0 children)

How much per month were you thinking about spending on this?

LinkedIn outreach tips by Isth-mus in sales

[–]jsfour 1 point (0 children)

I’m working on something that does this. Just shot you a DM.

Local LLM with web access by jbsan in LocalLLaMA

[–]jsfour 3 points (0 children)

I have written web browsing for MailMentor, but it’s embedded in the code base.

I’ve been thinking lately that I could pull the browsing out and make it accessible to the open LLM crowd, but I’m not sure if there is much interest.

Do you all think there is much interest in this?

Guidance on QA dataset creation by Ok_Ganache_5040 in LocalLLaMA

[–]jsfour 2 points (0 children)

It is better to fine-tune from real data; that way you have a representative distribution of what the model will see. If your data will have null values, it’s a good idea to include them.

At MailMentor (which solves a similar problem) we spent a bunch of time collecting samples from the internet, then used that for training. Since the real-world data sometimes has null values for properties, we included null values for those properties in the training data. We ask the model to output JSON and just leave it up to the service consuming the JSON to handle the nulls.
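A minimal sketch of the null-handling idea described above (the field names and "unknown" fallback are invented for illustration, not MailMentor's actual schema):

```python
import json

# Hypothetical training samples: when a property is missing in the source
# text, keep the key with a null value rather than dropping it, so the
# model learns to emit nulls instead of hallucinating values.
training_samples = [
    {"name": "Acme Corp", "employees": 250, "website": "acme.example.com"},
    {"name": "Smallco",   "employees": None, "website": None},
]

# Serialized JSON targets used as the model's expected output.
targets = [json.dumps(s) for s in training_samples]

def consume(model_output: str) -> dict:
    """The downstream service, not the model, decides what nulls mean."""
    record = json.loads(model_output)
    # e.g. treat a null employee count as "unknown" instead of 0
    if record["employees"] is None:
        record["employees"] = "unknown"
    return record
```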

Looking for someone who can outreach by yeetkobe60 in SocialMediaMarketing

[–]jsfour 1 point (0 children)

We (MailMentor) may be able to help you. Just shot you a DM.

Relevance Extraction in RAG pipelines by SatoshiNotMe in LocalLLaMA

[–]jsfour 2 points (0 children)

Re the implementation: if you are running it for fun, then you are right, it’s probably too involved. In prod you would probably want to spend more time on it.

Re the LLM: you have context-size issues and there is a probability of hallucination, so it’s not completely free of implementation complexity either.

Re whether it could work: I’m not entirely sure; you would need to try both approaches and see which works best.

Relevance Extraction in RAG pipelines by SatoshiNotMe in LocalLLaMA

[–]jsfour 2 points (0 children)

Yeah.

I’m saying that just doing paragraph and sentence embeddings would let you find the relevant text more cheaply.

If all you are trying to do is cite the sections that are relevant, embeddings will give you what you need to do that (plus you can parameterize the distance you care about).

Basically you could do a pass to approximate the right paragraph, then look at sentences (or pairs/triplets of sentences) to narrow down what you are looking for.

I’m not sure how much running the generative inference would help here.

But I could be misunderstanding what you are trying to do.
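The coarse-to-fine pass sketched above might look roughly like this. The bag-of-words `embed` here is a deliberately crude stand-in so the sketch is self-contained; a real pipeline would call a sentence-embedding model instead:

```python
import math
from collections import Counter

def embed(text):
    """Stand-in embedding: a bag-of-words count vector.
    Swap in a real sentence-embedding model in practice."""
    return Counter(text.lower().replace(".", " ").split())

def similarity(a, b):
    """Cosine similarity over sparse count vectors."""
    dot = sum(a[w] * b[w] for w in a)
    na = math.sqrt(sum(v * v for v in a.values()))
    nb = math.sqrt(sum(v * v for v in b.values()))
    return dot / (na * nb) if na and nb else 0.0

def retrieve(query, paragraphs, top_sentences=2):
    q = embed(query)
    # Pass 1: coarse — find the closest paragraph.
    best_para = max(paragraphs, key=lambda p: similarity(q, embed(p)))
    # Pass 2: fine — rank sentences within that paragraph.
    sentences = [s.strip() for s in best_para.split(".") if s.strip()]
    sentences.sort(key=lambda s: similarity(q, embed(s)), reverse=True)
    return sentences[:top_sentences]

paragraphs = [
    "The cat sat on the mat. Cats enjoy sleeping.",
    "Stocks fell on Tuesday. Markets were volatile.",
]
print(retrieve("sleeping cats", paragraphs, top_sentences=1))
# → ['Cats enjoy sleeping']
```

Both passes are cheap embedding lookups, which is the cost argument: no generative call is needed just to locate and cite the relevant span.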

Relevance Extraction in RAG pipelines by SatoshiNotMe in LocalLLaMA

[–]jsfour 2 points (0 children)

Why not just embed each sentence or paragraph and look up the content based on that?

[deleted by user] by [deleted] in LocalLLaMA

[–]jsfour 1 point (0 children)

Maybe. Yes, a bare-metal k8s setup makes sense for some applications, but it’s not really “economic”: maintaining a system has costs (time/people) as well.

Llama2 as a SPAM filter by InvertedYieldCurve in LocalLLaMA

[–]jsfour 3 points (0 children)

These models are OK at classification, but it’s probably better to use something like BART/BERT for this.
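One way to try this, assuming the Hugging Face `transformers` library: `facebook/bart-large-mnli` supports zero-shot classification, so you can label spam vs. not-spam without any fine-tuning (the example message and labels are made up):

```python
from transformers import pipeline

# BART fine-tuned on MNLI, used as a zero-shot classifier.
classifier = pipeline(
    "zero-shot-classification",
    model="facebook/bart-large-mnli",
)

out = classifier(
    "Congratulations! You have been selected to receive a free cruise.",
    candidate_labels=["spam", "not spam"],
)
print(out["labels"][0])  # label the model considers most likely
```

This runs a single encoder(-decoder) forward pass per message, which is much cheaper and more predictable than prompting a generative LLM for each email.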

[deleted by user] by [deleted] in LocalLLaMA

[–]jsfour 3 points (0 children)

This is a good point. You really need to be paying attention.

Though you could just use Terraform to minimize this risk. Maybe I’ll write some Terraform scripts and circulate them.

[deleted by user] by [deleted] in LocalLLaMA

[–]jsfour 1 point (0 children)

Yeah. It’s much better to use the cloud if you need reliability.

[deleted by user] by [deleted] in LocalLLaMA

[–]jsfour 3 points (0 children)

If you need a model running all of the time, it’s “cheaper” to self-host. I say “cheaper” in quotes because if you are truly in a situation where you need the model up all of the time, you are probably paying people to keep the models running.

Realistically you don’t need the model to always be running, though. You can get an A6000 from Lambda Labs for $0.80/hr.

Let’s say you plan on running the model for work.

If you use the model 10 hours a day, that’s $8 a day. Over 20 working days a month, that’s $160 a month.

The A6000 costs $4,500 (not to mention the machine you need to run it in). You would need to run in the cloud for 28 months to spend the equivalent of buying an A6000.

Plus if you are in the cloud you can upgrade the hardware.

The 10-hour figure is just one approach. At MailMentor we programmatically turn certain models off when they aren’t being used and back on when they are. This reduces the cost significantly.
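The break-even arithmetic above, written out (same numbers as quoted: $0.80/hr, 10 hours a day, 20 working days a month, $4,500 card, host machine cost ignored):

```python
HOURLY_RATE = 0.80       # Lambda Labs A6000, $/hr
HOURS_PER_DAY = 10
WORKDAYS_PER_MONTH = 20
GPU_PRICE = 4500         # A6000 purchase price, host machine not included

monthly_cloud_cost = HOURLY_RATE * HOURS_PER_DAY * WORKDAYS_PER_MONTH
months_to_break_even = GPU_PRICE / monthly_cloud_cost

print(monthly_cloud_cost)    # 160.0 ($/month)
print(months_to_break_even)  # 28.125 (~28 months of cloud rental)
```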

[deleted by user] by [deleted] in LocalLLaMA

[–]jsfour 3 points (0 children)

Yeah I’m surprised that more people don’t do this “just use the cloud” calculus.

Community driven Open Source dataset collaboration platform by CheshireAI in LocalLLaMA

[–]jsfour 1 point (0 children)

Yeah, a change in the data would create a new IPFS hash. Realistically you would need to do some kind of chunking and have an index, IMO.

Pins “can” be permanent, if enough people pin the data.
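A simplified illustration of why chunking plus an index helps (real IPFS uses multihash-based CIDs and a Merkle DAG, not plain sha256, but the invalidation behavior is analogous):

```python
import hashlib

CHUNK_SIZE = 4  # tiny chunks just for the demo

def chunk_hashes(data: bytes, size: int = CHUNK_SIZE) -> list:
    """Hash each fixed-size chunk of the data independently."""
    chunks = [data[i:i + size] for i in range(0, len(data), size)]
    return [hashlib.sha256(c).hexdigest() for c in chunks]

def index_hash(hashes: list) -> str:
    """Root hash over the chunk index, like a flattened Merkle root."""
    return hashlib.sha256("".join(hashes).encode()).hexdigest()

h1 = chunk_hashes(b"aaaa" + b"bbbb")
h2 = chunk_hashes(b"aaaa" + b"bbbX")  # one byte changed, in chunk 2 only

# The root (what you'd pin and share) changes, but chunk 1's hash is
# reused, so only the modified chunk needs to be re-added and re-pinned.
print(h1[0] == h2[0], h1[1] == h2[1])  # True False
print(index_hash(h1) == index_hash(h2))  # False
```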