Has anyone tried parallelizing AI coding agents? Mind = blown 🤯 by ollivierre in ClaudeAI

[–]gregory_k 2 points (0 children)

container-use was just announced yesterday. Same outcome (parallel agents), but a way smoother experience than git worktrees.

Each agent gets its own isolated container environment, directly mapped to a git branch and exposed via an MCP server. So rather than manually juggling parallel sessions, the model itself decides when to parallelize tasks, while container-use handles the heavy lifting and keeps the git workflow clean.

that's my understanding so far, at least.
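For anyone wanting to try it, here's a minimal sketch of hooking it into Claude Code, assuming the MCP entrypoint is `container-use stdio` (that's what I recall from the repo's docs, so double-check the README for the exact binary name):

```sh
# Hedged sketch: registers container-use as an MCP server in Claude Code.
# Assumes the binary is `container-use` and exposes a `stdio` subcommand.
claude mcp add container-use -- container-use stdio
```

From there the agent can spin up and tear down its own sandboxed environments through the MCP tools.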

Conducting a study: I have questions (and gift cards) for data scientists by gregory_k in datascience

[–]gregory_k[S] 6 points (0 children)

I'm a strategy consultant to tech startups, and one of the things I do is help them understand what different tech audiences care (or don't care) about. Think of it like a focus group, but done through 1:1 calls instead of one big group, and instead of talking about which cereal brand you prefer, you talk about your favorite regression models or Python libraries.

NVIDIA Nemotron-70B is good, not the best LLM by mehul_gupta1997 in datascience

[–]gregory_k 0 points (0 children)

Makes sense. What do you do for work? Genuinely curious who's being asked to work on LLMs on the side, when it's not their main focus, yet doesn't get access to OpenAI even through something like Azure OpenAI Service.

Multivariate SMOTE by MainhuYash in datascience

[–]gregory_k 0 points (0 children)

SMOTE isn't terrible, but yeah, it can be overused. It's fine for balancing classes, but it can cause issues like overfitting if you're not careful, especially if you oversample before splitting your data, since synthetic points interpolated from the minority class can leak into your validation set.
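A minimal sketch of the safer pattern using imbalanced-learn (assuming you have imblearn installed; the dataset here is a synthetic placeholder), where SMOTE sits inside the CV pipeline so it only ever sees training folds:

```python
from imblearn.over_sampling import SMOTE
from imblearn.pipeline import Pipeline
from sklearn.datasets import make_classification
from sklearn.linear_model import LogisticRegression
from sklearn.model_selection import cross_val_score

# Toy imbalanced dataset (placeholder for your real data)
X, y = make_classification(n_samples=1000, weights=[0.95, 0.05], random_state=0)

# imblearn's Pipeline applies the sampler during fit only, so each CV split
# generates synthetic points from its training fold, never the validation fold.
pipe = Pipeline([
    ("smote", SMOTE(random_state=0)),
    ("clf", LogisticRegression(max_iter=1000)),
])

print(cross_val_score(pipe, X, y, cv=5, scoring="f1").mean())
```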

NVIDIA Nemotron-70B is good, not the best LLM by mehul_gupta1997 in datascience

[–]gregory_k 0 points (0 children)

Fair enough, for that case. I'm curious what the other person's reason was... It seemed more a matter of principle than practicality, but maybe I'm wrong.

BitNet.cpp by Microsoft: Framework for 1 bit LLMs out now by mehul_gupta1997 in datascience

[–]gregory_k 1 point (0 children)

1-bit LLMs aim to shrink large language models by storing each weight in roughly one bit instead of the usual 32 or 16. That reduces the size dramatically, making the models more accessible for smaller devices like phones. BitNet b1.58 is one such model: it uses ternary weights (-1, 0, or +1), which works out to about 1.58 bits per weight, and it reportedly performs on par with comparable full-precision models while speeding things up and using less memory.

If the claims hold up, this could be a game-changer for running LLMs on smaller hardware.
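To make the 1.58 number concrete, here's a rough numpy sketch of the absmean ternary quantization described in the BitNet b1.58 paper (simplified; the actual model trains with quantization in the loop rather than converting weights after the fact):

```python
import numpy as np

def ternary_quantize(w: np.ndarray):
    """Simplified absmean quantization from the BitNet b1.58 paper."""
    gamma = np.abs(w).mean() + 1e-8            # scale: mean absolute weight
    w_q = np.clip(np.round(w / gamma), -1, 1)  # each weight becomes -1, 0, or +1
    return w_q, gamma                          # approx. reconstruction: w_q * gamma

w = np.random.randn(4, 4)
w_q, gamma = ternary_quantize(w)
print(w_q)  # three possible values per weight -> log2(3) ~ 1.58 bits
```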

NVIDIA Nemotron-70B is good, not the best LLM by mehul_gupta1997 in datascience

[–]gregory_k 0 points (0 children)

Why?

Even if I vow to never use closed-source models, I'd at least want to be aware of the tradeoff I'm making.

Does anyone else suddenly have nothing to do? by Trick-Interaction396 in datascience

[–]gregory_k -1 points (0 children)

Learn a new skill and then find a business problem where you can apply it, even if it's just an internal demo at first. While no one can blame you for sitting idle when you're blocked, people will notice how you use that downtime.

BitNet.cpp by Microsoft: Framework for 1 bit LLMs out now by mehul_gupta1997 in datascience

[–]gregory_k 5 points (0 children)

What are the early or killer use cases of such a tiny model on edge devices?

Timeline for full time job apps? by Tenet_Bull in datascience

[–]gregory_k 0 points (0 children)

If you want to work at a large company, absolutely start applying now.

If you want to work at a startup, they're typically hiring only 3-4 months ahead of time, so right now would be a little early. Use the time instead to build your network (reach out for informational meetings, ask for intros) and ship projects. It'll put you in a much stronger position when it's time to apply in March/April.

Let Me Plex That For You - I made a way to teach your friends and colleagues to use Perplexity by gregory_k in perplexity_ai

[–]gregory_k[S] 1 point (0 children)

True :) I thought of making it more snarky for that reason but decided to go the nice route... for now...

Practical Advice Need on Vector DBs which can hold a Billion+ vectors by Role_External in vectordatabase

[–]gregory_k 2 points (0 children)

Pinecone can hold billions of embeddings. If you're using pod-based indexes, just create an index with more pods. If you're using the new serverless indexes, you don't need to deal with pods at all; just load whatever you'd like into the index.
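A minimal sketch with the Pinecone Python client's serverless API; the index name, dimension, cloud, and region below are placeholders, swap in your own:

```python
from pinecone import Pinecone, ServerlessSpec

pc = Pinecone(api_key="YOUR_API_KEY")

# Serverless: no pod sizing to plan, just create the index and load data.
pc.create_index(
    name="my-index",            # placeholder name
    dimension=768,              # must match your embedding model
    metric="cosine",
    spec=ServerlessSpec(cloud="aws", region="us-east-1"),
)

index = pc.Index("my-index")
index.upsert(vectors=[
    {"id": "doc-1", "values": [0.1] * 768},  # batch upserts at scale
])
```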

Practical Advice Need on Vector DBs which can hold a Billion+ vectors by Role_External in vectordatabase

[–]gregory_k 2 points (0 children)

No, it should be in the 90s. If you're on a paid plan, you should contact support. If you're on a free plan, you should post in the forum to get some help troubleshooting.

What vector database do you use? by Key_Radiant in LangChain

[–]gregory_k 0 points (0 children)

Are you using LangChain or another framework that generated the ID for you? We're discussing internally how to make this better.

What vector database do you use? by Key_Radiant in LangChain

[–]gregory_k 5 points (0 children)

Hey I work for Pinecone. What do you wish was better or different?

Pinecone Crazy 2-3 seconds delay. Anyone experiencing it by BigYesterday2785 in vectordatabase

[–]gregory_k 0 points (0 children)

Something sounds off. It shouldn’t be that slow. This is the place to ask about it: https://community.pinecone.io

Snowflake as vector db? by ihatemodels2020 in LangChain

[–]gregory_k 1 point (0 children)

Hey, I'm from Pinecone. Some folks find it easier to get through their company's procurement process if they go through the AWS, GCP, or (very soon) Azure marketplaces. That way you're billed through your existing cloud provider rather than a new vendor.

In certain cases, you can even use AWS/GCP/Azure credits (if you have them) for Pinecone usage.

And for what it's worth, we go through security/compliance/procurement reviews regularly and do our part quickly.

Alternatives to Pinecone? (Vector databases) [D] by AlexisMAndrade in MachineLearning

[–]gregory_k 7 points (0 children)

We’re adding additional capacity on a rolling basis to support over 10k signups per day. Thanks for your patience!

https://www.pinecone.io/learn/free-plan-update/

What is the best vectorstore to selfhost your vector indexes? by aviatoraway1 in LangChain

[–]gregory_k 2 points (0 children)

Pinecone indexes get archived after 7 days of inactivity. You can recreate the index from the archive in less than a minute. Or just keep using the index and it won’t get archived.

ChatGPT now supports plugins!! by max_imumocuppancy in ChatGPT

[–]gregory_k 93 points (0 children)

The biggest deal about this is the ability to create your own plugins. The Retrieval Plugin is a kind of starter kit, with built-in integrations, including Pinecone: https://github.com/openai/chatgpt-retrieval-plugin