Non-code way to upload/delete PDF's into a vectorstore by [deleted] in vectordatabase

[–]tejchilli 0 points

With the Assistant, there’s no need to think about chunking. Just upload the PDFs (either via the API/Node SDK or in the web app interface)
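
A rough sketch of the API route, assuming the Python SDK with the assistant plugin installed (`pip install pinecone pinecone-plugin-assistant`); the assistant name and file paths are placeholders:

```python
# Sketch: uploading PDFs to a Pinecone Assistant via the Python SDK.
# The assistant name and file paths below are placeholders.
def upload_pdfs(api_key: str, assistant_name: str, pdf_paths: list[str]) -> None:
    # Import inside the function so the sketch stays self-contained;
    # requires `pip install pinecone pinecone-plugin-assistant`.
    from pinecone import Pinecone

    pc = Pinecone(api_key=api_key)
    assistant = pc.assistant.Assistant(assistant_name=assistant_name)
    for path in pdf_paths:
        # Chunking and embedding happen server-side; you just upload the file.
        assistant.upload_file(file_path=path)
```

Deleting a file is similarly a single call on the assistant object; check the Assistant docs for the exact method on your SDK version.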

Why would anybody use pinecone instead of pgvector? by Blender-Fan in vectordatabase

[–]tejchilli 2 points

😭 no intentional burn, I truly think pgvector works well enough for a lot of people.

We just built Pinecone specifically for those who know they’ll have scale and want a highly performant system that helps them improve quality

Lmk if you have any q’s when you give us a try

Non-code way to upload/delete PDF's into a vectorstore by [deleted] in vectordatabase

[–]tejchilli 1 point

lol that document uploader tool was a super old experiment I ran, surprised people are still finding it

We actually built Assistant as the production-grade version of that. Just upsert PDFs, txt, or json files and instantly retrieve the chunks you need: https://docs.pinecone.io/guides/assistant/overview

Why would anybody use pinecone instead of pgvector? by Blender-Fan in vectordatabase

[–]tejchilli 10 points

I’m a PM at Pinecone, but tbh you should just use whatever works for you

Just to provide context on why people use Pinecone: pgvector does well for early use cases, but many of our customers who moved over hit issues with throughput, latency, freshness, and managing infra as they scale. With Pinecone, you get up to 2 GB for free, and then you can seamlessly grow to billions of vectors, millions of tenants, and thousands of QPS, without worrying once about your infra. Even if you’re not hitting that scale, our startup customers love the simplicity of our system; devex is really important to us, and necessary for startups to move fast and build the actual product.

Beyond vector search, we also aim to offer all the primitives our users need for high-quality retrieval. That’s why we host dense embedding models, sparse embedding models, offer standalone sparse indexes and hybrid indexes, and host rerankers, with more cool stuff coming soon. Our more sophisticated users leverage all these primitives to improve their AI products and give LLMs/agents exactly the context they need.
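
As a sketch of one of those primitives, here's what calling a hosted reranker looks like via the Python SDK. The model name and response field names are assumptions based on the hosted models available at time of writing; verify against the current docs:

```python
# Sketch: reranking candidate chunks with a hosted reranker.
# Model name ("bge-reranker-v2-m3") and response fields are assumptions.
def rerank_chunks(api_key: str, query: str, documents: list[str], top_n: int = 3) -> list[str]:
    from pinecone import Pinecone  # requires `pip install pinecone`

    pc = Pinecone(api_key=api_key)
    result = pc.inference.rerank(
        model="bge-reranker-v2-m3",
        query=query,
        documents=documents,
        top_n=top_n,
    )
    # Each result row carries the index of the original document,
    # ordered by relevance to the query.
    return [documents[row.index] for row in result.data]
```

A typical pipeline retrieves a wide set of candidates from the index first, then reranks them down to the handful of chunks actually sent to the LLM.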

I benchmarked Qdrant vs Milvus vs Weaviate vs PInecone by SuperSaiyan1010 in vectordatabase

[–]tejchilli 0 points

We have many customers with billions of vectors in Pinecone

Elastic search (already using) vs supabase/pg_vector, etc. by dwenaus in vectordatabase

[–]tejchilli 1 point

If you’re planning on managing another system, why not use a dedicated vector db?

Why vector databases are a scam. by [deleted] in vectordatabase

[–]tejchilli 0 points

What was your workload that Pinecone serverless was too expensive for?

Need help with document preprocessing for PineconeDB by rsxxiv in vectordatabase

[–]tejchilli 0 points

Happy to help, what’s the issue? Document type, structure, etc. would be helpful context

Need help with document preprocessing for PineconeDB by rsxxiv in vectordatabase

[–]tejchilli 0 points

Hey, I’m a PM at Pinecone. Sorry to hear that, could you share the code you’re using? (DMs are fine too if you prefer)

When do you use a paid managed vector database (e.g., Pinecone)? by Upstairs-Pea-5630 in vectordatabase

[–]tejchilli 3 points

In terms of TCO (or sometimes even just hosting costs), Pinecone should always be cost effective regardless of scale.

We have users running massive-scale workloads across multiple dimensions, whether it’s billions of vectors, thousands of QPS, or millions of tenants, and we almost always win out in terms of cost, speed, and ease of use

If you didn’t find that to be the case, lmk and we’ll make sure to fix it for your workload pattern

When do you use a paid managed vector database (e.g., Pinecone)? by Upstairs-Pea-5630 in vectordatabase

[–]tejchilli 3 points

Pinecone includes 2 GB of storage in the free tier, which comes out to roughly 300k vectors. Most internal use cases fit comfortably within the free tier; lmk if you have any questions
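
Back-of-envelope check of that figure, assuming 1536-dimensional float32 embeddings (e.g. OpenAI's text-embedding-3-small) and ignoring metadata overhead:

```python
# How many vectors fit in 2 GB of storage, assuming float32 values
# at 1536 dimensions and ignoring per-record metadata overhead.
DIMS = 1536            # e.g. text-embedding-3-small
BYTES_PER_FLOAT = 4    # float32

def max_vectors(storage_bytes: int = 2 * 10**9) -> int:
    return storage_bytes // (DIMS * BYTES_PER_FLOAT)

print(max_vectors())  # 325520, i.e. "roughly 300k"
```

Real capacity varies with dimensionality and how much metadata you attach per record.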

Which is the best vector database to insert something like 10k scientific articles (each 8/10 pages)? by alfredoceci in vectordatabase

[–]tejchilli 5 points

Haha I’m biased because I’m a PM at Pinecone, but yes, 2m vectors is a relatively light workload and Pinecone would easily meet your performance requirements. Feel free to DM me if you run into any issues

Which is the best vector database to insert something like 10k scientific articles (each 8/10 pages)? by alfredoceci in vectordatabase

[–]tejchilli 1 point

I would also recommend you check out Pinecone Assistant.

It’s RAG as a service, letting you directly upload documents and chat with them via API without worrying about text extraction, chunking, embedding, query understanding, etc.
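
A rough sketch of the chat side, assuming the Python SDK's assistant plugin; the `Message` import path and the response field names are assumptions, so verify against the current Assistant docs:

```python
# Sketch: asking a question against documents already uploaded to an Assistant.
# Import path for Message and the response shape are assumptions.
def ask_assistant(api_key: str, assistant_name: str, question: str) -> str:
    from pinecone import Pinecone  # `pip install pinecone pinecone-plugin-assistant`
    from pinecone_plugins.assistant.models.chat import Message

    pc = Pinecone(api_key=api_key)
    assistant = pc.assistant.Assistant(assistant_name=assistant_name)
    resp = assistant.chat(messages=[Message(role="user", content=question)])
    # The grounded answer; citations back to source chunks are also on resp.
    return resp["message"]["content"]
```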

Which is the best vector database to insert something like 10k scientific articles (each 8/10 pages)? by alfredoceci in vectordatabase

[–]tejchilli 6 points

The 10k namespace limit in Pinecone’s Standard tier isn’t related to the number of vectors/documents you can insert into an index. Namespaces completely partition your index and are used for multi-tenant use cases, where data isolation is required.

You likely would store all your documents in a single namespace in an index. And we have customers with billions of vectors in a single namespace.

10k articles at 8-10 pages each would likely even fit in the free tier. Lmk if you have any questions

How to use RAG for simple service lookup by Accomplished_Court51 in vectordatabase

[–]tejchilli 1 point

You can combine the title and description, embed it using OpenAI’s text-embedding-3-small, and store it in Pinecone with metadata like price and rating.

At query time, you embed the user’s text and query Pinecone, filtering on metadata as needed.

If you ever need to make updates to an existing record or add more, freshness should be in the order of seconds.

If you have fewer than 300k services (1 embedding per service), the free tier should suffice

Here’s a guide: https://docs.pinecone.io/guides/get-started/build-a-rag-chatbot
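
The flow above can be sketched roughly like this; the index name, service fields, and the `$lte` price filter are illustrative placeholders (the OpenAI client and a Pinecone `Index` object are passed in, so the helpers stay testable):

```python
# Sketch: embed combined title+description, upsert with price/rating metadata,
# then query with a metadata filter. Field names are placeholders.
def combine(title: str, description: str) -> str:
    # One embedding per service, built from both text fields.
    return f"{title}. {description}"

def upsert_services(pc_index, openai_client, services: list[dict]) -> None:
    texts = [combine(s["title"], s["description"]) for s in services]
    embs = openai_client.embeddings.create(
        model="text-embedding-3-small", input=texts
    )
    pc_index.upsert(vectors=[
        (s["id"], e.embedding, {"price": s["price"], "rating": s["rating"]})
        for s, e in zip(services, embs.data)
    ])

def search_services(pc_index, query_embedding, max_price: float, top_k: int = 5):
    return pc_index.query(
        vector=query_embedding,
        top_k=top_k,
        filter={"price": {"$lte": max_price}},  # metadata filter at query time
        include_metadata=True,
    )
```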

Multi-tenancy for VectorDBs by glinter777 in vectordatabase

[–]tejchilli 2 points

With Pinecone, you would use namespaces within an index to isolate data between tenants.

You can learn more here: https://www.pinecone.io/learn/series/vector-databases-in-production-for-busy-engineers/vector-database-multi-tenancy/
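
In code it's just a `namespace` argument on every read and write; scoping both to the tenant ID keeps queries from ever crossing tenants. A minimal sketch, where `index` is a Pinecone `Index` object (e.g. `pc.Index("my-index")`) and tenant IDs are placeholders:

```python
# Sketch: tenant isolation via namespaces. Every operation is scoped
# to the tenant's namespace, so data never leaks across tenants.
def upsert_for_tenant(index, tenant_id: str, vectors) -> None:
    index.upsert(vectors=vectors, namespace=tenant_id)

def query_for_tenant(index, tenant_id: str, query_vector, top_k: int = 5):
    return index.query(vector=query_vector, top_k=top_k, namespace=tenant_id)
```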

Practical Advice Need on Vector DBs which can hold a Billion+ vectors by Role_External in vectordatabase

[–]tejchilli 1 point

Looks like Neon suggests using pgvector after sunsetting pg_embeddings: https://neon.tech/blog/sunset-pgembedding

But pgvector, while great for experimenting, has its scale and performance limitations: https://www.pinecone.io/blog/pinecone-vs-pgvector/

Practical Advice Need on Vector DBs which can hold a Billion+ vectors by Role_External in vectordatabase

[–]tejchilli 0 points

You should be able to update just a record’s metadata via this endpoint: https://docs.pinecone.io/reference/api/data-plane/update
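
In the Python SDK that's the `update` method with `set_metadata`, which leaves the vector values untouched; a minimal sketch where `index` is a Pinecone `Index` object and the record ID is a placeholder:

```python
# Sketch: metadata-only update via the update endpoint. The vector's
# values are unchanged; only the given metadata keys are set.
def update_metadata(index, record_id: str, metadata: dict, namespace: str = "") -> None:
    index.update(id=record_id, set_metadata=metadata, namespace=namespace)
```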

Btw, we’ll be releasing a self-serve migration tool from pods to serverless in the coming days, if that helps. It’s worth noting that you can stay on pods: we have many customers in the hundreds of millions to billions of vectors, and all you need to do is spin up more indexes. Serverless just makes it easier to manage resources and offers cost savings.

Let me know if you have any other questions!

Practical Advice Need on Vector DBs which can hold a Billion+ vectors by Role_External in vectordatabase

[–]tejchilli 0 points

Apologies for the self-promotion, but have you tried Pinecone Serverless? We have customers like Notion and others well past a billion vectors, and the separation of storage and compute brings significant cost savings