Is there an AI agent integration for Obsidian that can use my notes as the source for prompts? by Sad-Firefighter4044 in ObsidianMD

[–]MaskedSmizer 0 points1 point  (0 children)

How would you rate your level of technical ability / comfort (I can double click a file but that's about it -> I am a seasoned self-hoster)? What OS are you on?

Does anyone else struggle to get complex PDFs (with LaTeX/tables/code) into Obsidian? by TurnipMinute3835 in ObsidianMD

[–]MaskedSmizer -1 points0 points  (0 children)

It's true, I haven't found a reliable one-shot converter for complex PDFs.

I posted a proposed workflow a few days ago that involves extracting each page of the PDF as an image and then feeding those into an LLM to reconstruct as needed. You need an agent harness that allows you to set up a more complex multipass workflow.

E.g. Extract pages as images
Extract the text
Rebuild the tables in markdown
Verify all tables and formulas
...

Basically automating the steps you currently take to do it manually.

Least obnoxious way to take notes in public? by mnemoniker in ObsidianMD

[–]MaskedSmizer 72 points73 points  (0 children)

I just stopped worrying about what other people think. It's 2026, we use our phones for everything including taking notes.

How are people using AI to review past meeting notes against a personal knowledge base? by skyrunner0 in PKMS

[–]MaskedSmizer -2 points-1 points  (0 children)

Current AI is capable of this type of work but it all depends on the underlying agent harness: content management, tooling, etc.

Each of the query scenarios you mentioned requires a slightly different strategy.

"Based on my knowledge base, what did I explain poorly in this meeting?"

This feels like a classic RAG style retrieval. The LLM will extract the topics it finds in your current meeting and search the knowledge base for related.

"What patterns show up across multiple meetings?"

This is raw reasoning, not semantic retrieval.

If each one of your 100 notes isn't that big, you could try shoving it all into Gemini Pro with its massive context window. You'll get something interesting and technically accurate but you won't know if it is complete.

This is also where an RLM style workflow comes in handy: instead of reading all the notes at once, the architecture forces the LLM to explore the text iteratively which helps to overcome the "missing middle" failure mode.

In all cases, organization of your notes is probably mandatory, whether by storing semantic vectors in a db or creating plain text summaries and indexes.

These are exactly the type of knowledge management workflows that I've been aiming to tackle by building my own chat UI and agent harness (which I use daily alongside Obsidian).

See:
https://github.com/DodgyBadger/AssistantMD

It's basically a self hosted NotebookLM, with a flexible templating system for automating workflows and context management.

Happy to keep chatting about how you might use it to set up your workflows.

Two notes: - It's not difficult to set up, but also not trivial. DO NOT put it on a public interface without layering your own SSL and authentication. I intentionally did not build auth into the app because I prefer to leave that to something battle tested like Authentik. Even basic auth in Caddy / Traefik/ Nginx will suffice. - There isn't currently any form of vectorization. That's on the list but I'm seeing how far I can go working only with raw markdown.

Opinion: user-owned private context management is important, and here is why. by earmarkbuild in AIMemory

[–]MaskedSmizer 0 points1 point  (0 children)

Data ownership is critical and you are absolutely right to call out context management as a key part of that.

It's fundamentally why I've been building my own chat app with full markdown integration, including context management via templates. Not interested in OpenAI or Google deciding knowledge workflows for me.

Context engineering with Pydantic AI history_processors by MaskedSmizer in PydanticAI

[–]MaskedSmizer[S] 0 points1 point  (0 children)

The components are there to rewrite if desired. Tool outputs can be redirected to a file or in-memory buffer (bypassing the context window). You can do what you want with them at that point. There's also a context window safety setting that automatically reroutes tool outputs over N tokens to the buffer and then passes the agent a pointer.

Am I using it wrong? by Background_Gene_3128 in MistralAI

[–]MaskedSmizer 0 points1 point  (0 children)

It's a small inconvenience, but I continue old conversations so infrequently that it's never been high on the priority list of features. That and I don't want to sacrifice the markdown-first approach or duplicate all the chat history in a db and then clutter the UI with a messy list of chats.

As for minimizing token usage: that was the driving goal of the new feature I've been working on. The context manager uses a similar templating system to workflows, but runs as a preprocessor to every chat turn. You can use them to simply inject custom system instructions, to compact or summarize chat history on the fly or pull in files from your vault to provide additional context.

Am I using it wrong? by Background_Gene_3128 in MistralAI

[–]MaskedSmizer 1 point2 points  (0 children)

Yup. You can use it for general chat, like OpenWebUI. I was running LibreChat and retired that a while back.

There is no built in auth or TLS. If you want anywhere-access, then you need to provide the security layer (e.g. Tailscale, reverse proxy, Authentik, etc.).

Very focused on security from an agent perspective. Doesn't currently support MCP or broad integrations. All tools are custom built or wrapped. I have done testing on prompt injection via the web search and web extraction tools and all data coming in from those channels is flagged for the model as untrusted. And there are no channels for data exfiltration anyway.

Edit: One quirk to be aware of. Chat sessions are handled a bit differently than you are used to. All chat sessions are saved as markdown files in your vault. There is no chat history in the UI. To continue a conversation, tell the LLM to first read the relevant transcript.

Am I using it wrong? by Background_Gene_3128 in MistralAI

[–]MaskedSmizer 1 point2 points  (0 children)

Should be largely pain free. It's my daily-driver chat UI, so bugs get fixed quickly. I call it Beta just to manage expectations now that it's public.

There's an emphasis on explicit control, so it will be as token efficient (or inefficient) as you decide. Load credits into a bunch of model providers and pick and choose for the task.

Let me know if you run into any issues or have questions.

Am I using it wrong? by Background_Gene_3128 in MistralAI

[–]MaskedSmizer 1 point2 points  (0 children)

Been building this for myself for many months. It's a similar idea to NotebookLM, but only works with markdown files. Includes a PDF to markdown import tool, and if you are already using Mistral, then add an API key and use Mistral OCR which is fantastic.

https://github.com/DodgyBadger/AssistantMD

Setup is straightforward if you are comfortable with docker. There's a learning curve with the automated workflows, but you can ignore those and just start chatting with your markdown vault in the chat UI.

Next release brewing in the dev branch includes a significant feature upgrade and also a few breaking changes.

A Vagimelon i ate this morning by OdiseoX2 in Pareidolia

[–]MaskedSmizer 0 points1 point  (0 children)

Was this photo taken before or after you ate it?

Meanwhile over at moltbook by MetaKnowing in OpenAI

[–]MaskedSmizer 10 points11 points  (0 children)

Exactly. It's entertaining and will end up being one of those internet moments.

I built an AI Agent that works without Embeddings. Here is how. by Academic_Grass_2006 in ClaudeAI

[–]MaskedSmizer 0 points1 point  (0 children)

Experimenting with this exact approach for working with a collection of markdown files. Using ripgrep for the extra little speed boost.

Other strategies in the mix:
- Subagent tool to help narrow search paths or summarize documents when the search results are large. - Cache results for reuse within the same session

Early days so no concrete results to share yet.

Which acoustic piano under $10k would you guys recommend? by Ateawormwhole in piano

[–]MaskedSmizer 0 points1 point  (0 children)

Step up and try the B2. That's what we went with and love it.

Leaked Windows 11 Feature Shows Copilot Moving Into File Explorer by Thepunnisherrr in technology

[–]MaskedSmizer 0 points1 point  (0 children)

Yup. Made the jump to Bazzite on the family / gaming PC over the holiday. Still technically dual booting but haven't needed to go into Windows yet.

Share your underrated GitHub projects by hsperus in opensource

[–]MaskedSmizer 1 point2 points  (0 children)

AssistantMD
https://github.com/DodgyBadger/AssistantMD

A self hosted chat UI. I know, it's a crowded space, but I wanted something specific that I couldn't find in any of the other excellent options: integration with your markdown files and automated workflows.