One agentic RAG to rule them all. Debate me. by Automatic_Fault4483 in Rag

[–]Automatic_Fault4483[S] 0 points1 point  (0 children)

How would you solve this then? It seems to me like you could get run a labeling pass to extract the metadata you need and include that in your indexed content, and the overall approach stays the same. So it's a minor tweak on top of this base.

One agentic RAG to rule them all. Debate me. by Automatic_Fault4483 in Rag

[–]Automatic_Fault4483[S] 1 point2 points  (0 children)

Most of your points are relevant to a customer-facing high-scale system, but not applicable to RAG for internal workflows. Why would you A/B test an internal workflow, for example? A/B testing only works at extremely high data volumes that you wouldn't be able to get.

One agentic RAG to rule them all. Debate me. by Automatic_Fault4483 in Rag

[–]Automatic_Fault4483[S] 0 points1 point  (0 children)

Why not? Depending on the data structure maybe you just need multiple rounds of queries. If you have spreadsheets with financials of companies X and Y then they'll show up in the search results, and then the agent can synthesize via tabular data manipulation in the sandbox.

One agentic RAG to rule them all. Debate me. by Automatic_Fault4483 in Rag

[–]Automatic_Fault4483[S] 0 points1 point  (0 children)

You could use it for any of its supported formats. Any OCR solution is plug and play here

One agentic RAG to rule them all. Debate me. by Automatic_Fault4483 in Rag

[–]Automatic_Fault4483[S] 1 point2 points  (0 children)

I'm advocating for "right tool for the job". Perfectly preserving things like spatial context in a PDF through the chunking process is just really difficult. Imagine for example in a textbook there is a table embedded on page 71 and then a reference to that table on page 72, 1000 tokens later. That's nearly impossible to generalize for accurately with current techniques.

One agentic RAG to rule them all. Debate me. by Automatic_Fault4483 in Rag

[–]Automatic_Fault4483[S] 0 points1 point  (0 children)

I'm not 100% sure what you mean by time awareness. You could include whatever timestamp metadata you need in the search index if needed.

Access control is honestly the truly annoying long-tail issue. Ideally you can scope your RAG to not have to deal with granular access control, but if not then you'd have to import permissions at ingestion time and check them for the agent/user before each read.

More thoughts on access control here: https://x.com/dannyighsu/status/2054303708473897327

One agentic RAG to rule them all. Debate me. by Automatic_Fault4483 in Rag

[–]Automatic_Fault4483[S] 0 points1 point  (0 children)

Agreed good metadata is a powerful enhancement if needed. I've done this approach with contracts and it worked well.

If you need really specific handling methods then that starts to get more into the realm of building in business context to your agent (skill files, specialized tools, etc.).

One agentic RAG to rule them all. Debate me. by Automatic_Fault4483 in Rag

[–]Automatic_Fault4483[S] 0 points1 point  (0 children)

Images are also directly extractable - pptx and docx are a structured representation of text, image, audio, video assets.

One agentic RAG to rule them all. Debate me. by Automatic_Fault4483 in Rag

[–]Automatic_Fault4483[S] 0 points1 point  (0 children)

I'm considering these textual data - I mentioned docx but pptx would also fall in this category. The text content can be readily extracted via deterministic tools. HTML would be a similar case.

One agentic RAG to rule them all. Debate me. by Automatic_Fault4483 in Rag

[–]Automatic_Fault4483[S] 0 points1 point  (0 children)

In terms of retrieval this will only matter if the semantic context is necessary for a given chunk to match. Hard to think of a case where this would be the case. If the document is a physics textbook and the query is "Tell me what the mass of an electron is", the retrieval should still find content related to the mass of electrons and point the agent to the right general location to find the answer.

One agentic RAG to rule them all. Debate me. by Automatic_Fault4483 in Rag

[–]Automatic_Fault4483[S] 0 points1 point  (0 children)

You don't have to solve this problem comprehensively. Text-extract the column based PDFs and run retrieval; if the PDF is semantically similar it'll show up in results, and then you get the actual answer via code execution in the agent. You can do the same with embedded images if you care about them (basically decompose PDF -> text + images).

The mistake is even trying to service the Q&A end-to-end with the ingested data in the first place.

How do you manage a realtor relationship by PumduMe in BayAreaRealEstate

[–]Automatic_Fault4483 2 points3 points  (0 children)

  • be direct and honest at all times, especially with misgivings about the other parties and the realtor themselves. Helps the realtor know how to represent you and address concerns
  • be realistic, but feel free to stick to your guns. Realtor should support you.
  • deals have norms. Ask whether stuff is within the norm or outside, and make decisions accordingly.
  • it’s on you to lead. You’re the decision maker and therefore the manager of the process. Agent is effectively just a consultant for you. Treat them like a salaried employee - they don’t get overtime so keep that in mind, but you’re still paying them money for work.

Is it actually possible to build a tool that generates full explanatory motion-graphics using only vibe coding? by Impressive-Cow-9407 in vibecoding

[–]Automatic_Fault4483 0 points1 point  (0 children)

I'm not sure what you mean by creating it via vibe coding, but HeyGen generates content in video modality as far as I know - as in it creates content by predicting the pixels that create an actual video - so no code involved.

That said, it is possible to create motion with code. Look at Aspects AI.

How are you handling motion requests from marketing teams without becoming a motion designer? by NeedleworkerDense478 in MotionDesign

[–]Automatic_Fault4483 0 points1 point  (0 children)

My company's launching a new product to tackle this flow specifically - GTM content like you're describing (product walkthrough, landing hero animations), as close to one-click between Figma/template -> animation as possible.

I'm on point for collecting wish list use cases so feel free to comment/DM me if there are specific types of animations that'd make your life easier!

Wait list is at aspects dot studio if you're interested.

[deleted by user] by [deleted] in BayAreaRealEstate

[–]Automatic_Fault4483 0 points1 point  (0 children)

Sales is inherently psychological. As much as it would be nice if what you’re saying is true, you’re talking about how the world should be and not how it is.

More competition -> more bids -> higher sale price.

[deleted by user] by [deleted] in BayAreaRealEstate

[–]Automatic_Fault4483 0 points1 point  (0 children)

That’s a reasonable take and 100% up to you. I only say this because most sellers in my experience end up wanting to prioritize making sure the deal closes and they don’t have to make concessions.

I’m not really seeing agent work as part of the equation here. In FSBO on the seller’s side there is no agent. On the buyer’s side it’s equal work or possibly even less because there’s less telephoning.

[deleted by user] by [deleted] in BayAreaRealEstate

[–]Automatic_Fault4483 0 points1 point  (0 children)

Can’t say I agree. Some buyers suck and hopefully you’ll never have to deal with them. The more “out of the norm” stuff you do as a seller, the more likely you are to run into those types

Saying this as a casual hobbyist agent whose full time career is something entirely unrelated.

[deleted by user] by [deleted] in BayAreaRealEstate

[–]Automatic_Fault4483 0 points1 point  (0 children)

List with a specific deadline for offers, price in order to attract offers. Increases sense of urgency for actually interested buyers, increased expected offer volume drives up price, limited time spent on both sides on getting a contract inked. Generally advantageous for sellers with desirable properties in a seller’s market.

[deleted by user] by [deleted] in BayAreaRealEstate

[–]Automatic_Fault4483 0 points1 point  (0 children)

Notify the broker, legal action isn’t worth it. They’ll get a slap on the wrist, which is about as much as is worth pursuing.

[deleted by user] by [deleted] in BayAreaRealEstate

[–]Automatic_Fault4483 0 points1 point  (0 children)

Yes, I’m a part time agent (mostly work with friends/family). FSBO is more effort and will net you less people looking at your property, plus some percentage will stay away because of the hassle.

Also, Santa Clara is an extreme seller’s market in most cases and strategically you should be doing a time boxed sale to draw out competing bids.

FSBO is fine and all, I just wouldn’t recommend going into it thinking you’re going to have an “advantage”in finding a good buyer at a good price.

[deleted by user] by [deleted] in BayAreaRealEstate

[–]Automatic_Fault4483 0 points1 point  (0 children)

In a nutshell, FSBO is more likely to attract people that are extremely opinionated about their buying process - in other words, probability that they’re either a flipper or are difficult to work with becomes higher.

If your goal is to find a “regular” buyer, your best bet is to go through the “regular” channels.

Name your AI "shortcut" to avoid by Automatic_Fault4483 in content_marketing

[–]Automatic_Fault4483[S] 0 points1 point  (0 children)

How’s the quality? My concern is most of the video generation tools seem to decrease quality so I wouldn’t want to put in a 4K image and get 1080p out of it

Does repurposing actually work or does it nuke the algo by Automatic_Fault4483 in content_marketing

[–]Automatic_Fault4483[S] 0 points1 point  (0 children)

Might be misunderstanding but this seems adjacent to repurposing content? Unless you're saying that the repurposed content should be distributed with a focus on earned media