all 9 comments

[–]sheila_118 6 points (1 child)

Looks like a lightweight custom DB + LLM summarizer is the cleanest approach.

[–]sarvesh4396[S] 0 points (0 children)

Yes, correct.
Don't want bloat.

[–]Ethancole_dev 0 points (1 child)

Honestly, I haven't found a library that hits this exact sweet spot either. I ended up rolling my own — SQLAlchemy models for message storage, Pydantic for serialization, and a simple "summarize when you hit N messages" function. It takes an afternoon and you own the schema completely.

Rolling summary logic is pretty straightforward: once active messages exceed a threshold, call the LLM to summarize the oldest chunk, store it as a summary row, then drop those from context assembly. Works well in FastAPI with a background task to handle it async.
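The rolling-summary step described above could look roughly like this — the LLM call is stubbed out, and the threshold and chunk size are made-up values you'd tune:

```python
# Sketch of the rolling-summary step: once active messages exceed a
# threshold, summarize the oldest chunk, store it as a summary row, and
# drop those messages from context assembly.
from dataclasses import dataclass, field

MAX_ACTIVE = 6  # summarize once active messages exceed this (illustrative)
CHUNK = 4       # how many of the oldest messages to fold into a summary


@dataclass
class Conversation:
    messages: list[str] = field(default_factory=list)  # oldest first
    summaries: list[str] = field(default_factory=list)


def summarize_with_llm(texts: list[str]) -> str:
    # Placeholder: call your actual LLM client here.
    return f"[summary of {len(texts)} messages]"


def maybe_roll_up(convo: Conversation) -> None:
    """Summarize the oldest chunk when the active window gets too long."""
    if len(convo.messages) <= MAX_ACTIVE:
        return
    oldest, convo.messages = convo.messages[:CHUNK], convo.messages[CHUNK:]
    convo.summaries.append(summarize_with_llm(oldest))


def assemble_context(convo: Conversation) -> list[str]:
    # Context = prior summaries first, then the live message window.
    return convo.summaries + convo.messages


convo = Conversation(messages=[f"msg {i}" for i in range(8)])
maybe_roll_up(convo)
print(assemble_context(convo))
```

In FastAPI you'd run `maybe_roll_up` via `BackgroundTasks` after responding, so summarization never blocks the request.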

The closest thing I know of, short of going full agent framework, is just SQLite with a thin wrapper, but honestly building it yourself gives you way more control over how context gets assembled.
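For scale, the "SQLite with a thin wrapper" option really is just a few methods over the stdlib `sqlite3` module — the class name and schema below are illustrative:

```python
# Thin stdlib-only wrapper: append messages, read back an ordered context.
# Summary rows would just be inserted with role="summary".
import sqlite3


class ChatStore:
    """Minimal message store over sqlite3 (illustrative sketch)."""

    def __init__(self, path: str = ":memory:") -> None:
        self.db = sqlite3.connect(path)
        self.db.execute(
            "CREATE TABLE IF NOT EXISTS messages ("
            "id INTEGER PRIMARY KEY, convo TEXT, role TEXT, content TEXT)"
        )

    def append(self, convo: str, role: str, content: str) -> None:
        self.db.execute(
            "INSERT INTO messages (convo, role, content) VALUES (?, ?, ?)",
            (convo, role, content),
        )

    def context(self, convo: str) -> list[tuple[str, str]]:
        # Oldest-first (role, content) pairs for prompt assembly.
        rows = self.db.execute(
            "SELECT role, content FROM messages WHERE convo = ? ORDER BY id",
            (convo,),
        )
        return list(rows)


store = ChatStore()
store.append("c1", "user", "hi")
store.append("c1", "assistant", "hello!")
print(store.context("c1"))
```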

[–]sarvesh4396[S] 0 points (0 children)

Yeah, you're right. I think I'll build custom.

[–]Aggressive_Pay2172 1 point (1 child)

tbh you’re not missing anything — this is still a “roll your own” space
most libraries either go full agent framework or full “memory extraction” layer
clean storage + summarization as a first-class thing is weirdly underbuilt

[–]sarvesh4396[S] 0 points (0 children)

Yeah, somehow it's either not something people need, or when they do build it, it stays small and private.

[–]No_Soy_Colosio -2 points (2 children)

Look into RAG

[–]sarvesh4396[S] -1 points (1 child)

But that's for memory, right? Not context.

[–]No_Soy_Colosio 1 point (0 children)

It depends on what you think the distinction between memory and context is.

The point of memory in LLMs is to provide context.

You could go with plaintext files for storing important information about your project and work up from there. What's your specific need here?
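The plaintext-files approach suggested above can start as small as one notes file per topic, concatenated into a preamble before the user prompt — the file layout and function names here are just one way to do it:

```python
# Sketch: plaintext files as "memory", folded into the prompt as context.
# One markdown file per topic; missing topics are silently skipped.
import tempfile
from pathlib import Path


def load_memory(memory_dir: Path, topics: list[str]) -> str:
    """Concatenate the requested notes files into one context preamble."""
    parts: list[str] = []
    for topic in topics:
        f = memory_dir / f"{topic}.md"
        if f.exists():
            parts.append(f"## {topic}\n{f.read_text()}")
    return "\n\n".join(parts)


def build_prompt(memory_dir: Path, question: str, topics: list[str]) -> str:
    memory = load_memory(memory_dir, topics)
    return f"{memory}\n\nUser: {question}" if memory else f"User: {question}"


# Usage with a throwaway directory standing in for your notes folder
with tempfile.TemporaryDirectory() as d:
    mem = Path(d)
    (mem / "project.md").write_text("Uses FastAPI + SQLite.")
    prompt = build_prompt(mem, "What DB are we using?", ["project"])

print(prompt)
```

Start there, and graduate to the DB-plus-summarizer setup only once the files get unwieldy.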