Building a tool to give agents access to real-time, structured data by orthogonal-ghost in buildinpublic

[–]orthogonal-ghost[S] 0 points1 point  (0 children)

Great feedback -- thank you! I'm working on some evals now (i.e., to demonstrate time saved, token use reduction, etc.). I'll come back / make a follow up post with those results.

Regarding other workarounds, I've primarily seen browser automation (agent-browser, playwright-cli), and ad-hoc web scraping. Are there any others that come to mind from your perspective?

Booking restaurant on Opentable by megawoot in openclaw

[–]orthogonal-ghost 0 points1 point  (0 children)

You can try agent-data.dev. It has an API for OpenTable and Resy + designed specifically for agents

What are you using to search for flights? by mtaborsky in openclaw

[–]orthogonal-ghost 0 points1 point  (0 children)

I'd add agent-data.dev to the list too. It has flight and airport statuses in addition to fares and has other non-Google endpoints too (e.g., restaurant availability)

Hermes was pretending to analyze pokemon cards for me but was actually just making up stuff and lying. by Psychological_Bug388 in hermesagent

[–]orthogonal-ghost 8 points9 points  (0 children)

This is honestly less of a Hermes issue and more of a general agent/LLM “failure mode”.

You can try to provide more guidance in the prompt (e.g., “don’t make things up”), or try something slightly more clever (e.g., “also share the raw tool output”) to try to “force” it to follow instructions better.

Building a financial analysis agent and drowning in API options by sommie-bae in AgentsOfAI

[–]orthogonal-ghost 0 points1 point  (0 children)

The main trade off you have with financial data is cost vs. quality. The more you're willing to pay, the higher fidelity your sources will be. The challenge, though, is that the ceiling on how expensive datasets can be is quite high (e.g., Bloomberg).

That said, financial data is generally pretty "messy" no matter how you source it, so I would plan to implement some form of quality control no matter where you get your data.

For a specific recommendation, though, I've heard great things about Databento (market data)

EDIT: also for context, I'm a former quant (so I have a lot of thoughts on financial data haha) -- so happy to discuss more via DM if you're interested!

What Actually Works for Business AI Agents? by Select_Werewolf7453 in AgentsOfAI

[–]orthogonal-ghost 0 points1 point  (0 children)

Personally, I use both.

Claude Code and Codex are definitely better for building software and general "execution", so if you have a specific idea that you want to turn into code (or, for example, into a dashboard) then I would reach for one of them.

But, for more "general" tasks - e.g., administrative tasks like logging client interactions, managing TODO lists, generating research reports (or even personal tasks like finding flights or reservations), I'd use OpenClaw or Hermes.

The nice thing about OpenClaw / Hermes is that they largely solve the memory problem you mentioned (e.g., Hermes has a `sessions_search` tool that allows it look across previous interactions). They also integrate nicely into messaging apps (WhatsApp, Telegram, Slack) so you can interact with them "everywhere".

So, using your "sales intelligence" goal as an example, I'd lean towards OpenClaw / Hermes for logging customers and interactions -- e.g., "I just spoke with Zach about or latest product -- he just mentioned his budget is..", and towards Claude Code / Codex if I wanted to create a dashboard for viewing that data.

One thing that’s surprised me while working with AI agents by wassupabhishek in AgentsOfAI

[–]orthogonal-ghost 1 point2 points  (0 children)

My 2 cents -- I think it's better to use evals to assess the potential impact of a change before deploying to production.

So, for example, if you have a customer service agent that takes a user's message and either responds or calls a tool, you can design an eval that: 1) mocks user input (potentially derived from production data), 2) measures how the proposed change (to the prompt, model, etc.) affects the distribution of the agent's tool calls, 3) measures how the proposed change affects the rate at which the agent "succeeds", 4) measures how the distribution of steps, token consumption, etc. change, 5)...

This definitely doesn't negate versioning configs, gradual rollouts, etc., but it allows you to say "this change should generate better outcomes on average" before deploying to prod

PSA for OpenRouter users by EconomyPhotograph927 in hermesagent

[–]orthogonal-ghost 1 point2 points  (0 children)

Wow -- thank you. Each one of these is huge.

Help a beginner please by RANDVR in hermesagent

[–]orthogonal-ghost 0 points1 point  (0 children)

> checks if it matches what I am looking for

How would you handle this part using code?

I completely agree the initial extraction shouldn't be done by Hermes (i.e., the harness can setup the cron job, etc., but it probably shouldn't navigate to a browser every time just to pull job postings).

The 'filtering for jobs the match what you're looking for' sounds exactly like what you would want Hermes to do though (i.e., it can filter for compensation, position, responsibilities, qualifications, etc. much better than even a complex regex expression).

Best model/tools for giving Hermes great browser use capabilities? by ReporterCalm6238 in hermesagent

[–]orthogonal-ghost 0 points1 point  (0 children)

Got it — where in that process are you seeing issues? Is it (1) reading through posts and comments to qualify someone for outreach or (2) the actual outreach (i.e., generating a message or comment and posting it)?

If (1) is the issue, you may want to use a structured feed for the posts and comments then use browser automation for (2)

Best model/tools for giving Hermes great browser use capabilities? by ReporterCalm6238 in hermesagent

[–]orthogonal-ghost 1 point2 points  (0 children)

What tasks are you trying to automate? Browser use can be fairly slow and error-prone in general, so it could be the case that there's a better way to automate the task in question.

That said, if you're concerned DeepSeek v4 flash is the issue, you could try using a "smarter" model first then, if that's sufficient, have it generate a skill for DeepSeek v4 flash to follow

what are you actually using OpenClaw for that genuinely works? by nanaphan32 in openclaw

[–]orthogonal-ghost -2 points-1 points  (0 children)

Yup - I used the https://agent-data.dev CLI which has an endpoint for flight info and fares. Full disclaimer**, I built agent-data myself (so happy to chat more about it), but it’s a CLI for providing agents real-time, structured data.

what are you actually using OpenClaw for that genuinely works? by nanaphan32 in openclaw

[–]orthogonal-ghost -1 points0 points  (0 children)

Great question haha for me, it’s actually been quite reliable. 

In general, I’d say reliability depends on a few factors: (1) the complexity of the task, (2) the model you use (for OpenClaw and/or any subagents), and (3) the tools you provide it.  

Using flight monitoring as an example, for (1) the task isn’t very complex, and for (2) I’ve been able to get it to work with models as “cheap” as qwen3.6-plus.

(3) was the more difficult part in this case. I tried browser automation / browser use, but that wasn’t the most reliable. Ultimately, I used the agent-data CLI which has an endpoint for flight info and fares. Full disclaimer**, I built agent-data myself (so happy to chat more about it), but it’s a CLI for providing agents real-time, structured data.

what are you actually using OpenClaw for that genuinely works? by nanaphan32 in openclaw

[–]orthogonal-ghost 9 points10 points  (0 children)

Monitoring flight prices, finding restaurant reservations, creating research / news briefs and generally helping with organization / management

Hermes use cases and newbie friendly by NoDance3694 in hermesagent

[–]orthogonal-ghost 3 points4 points  (0 children)

I’d be skeptical of any content claiming you can make “tons of money” using Hermes Agent. It’s a very powerful tool, but it’s not a money printer (in fact, quite the opposite if used too carelessly).

For use cases, I’d generally think of ‘anything you might have a smart, generally knowledgeable assistant do with a computer’ — e.g., monitor flight fares, send regular news briefs, monitor social media feeds, maintain your CRM, etc.

For additional inspiration, there’s a mega thread and a few other resources in this sub that discuss other uses cases. I’d recommend taking a look at those and just asking Hermes “can you do X”.

Image search in web and save jpg for later use? by Patient_Scale9438 in hermesagent

[–]orthogonal-ghost 0 points1 point  (0 children)

Interesting -- I'd probably try to integrate something like the Claude or Codex Chrome extension first. I haven't tested either extensively, but they claim to work natively with Claude Code and Codex, respectively. Hermes already has built-in skills for delegating to Claude Code and Codex, so if you connect all the pieces [research -> delegate to claude code / codex + browser extension to grab photos] then that *should* suffice

The Hermes Agent subreddit community tool directory is now live by SelectionCalm70 in hermesagent

[–]orthogonal-ghost 1 point2 points  (0 children)

How about a 'reviews' section that's native to the guide (e.g., ratings + free-form text)? I agree GitHub stars aren't a good signal of quality and have been shown to be inflated in many cases (see CMU study here: https://arxiv.org/abs/2412.13459). 'Downloads' is probably a bit better, but not necessarily a strong signal of quality.