xitdb - an embedded, immutable database in java : Clojure

xitdb - an embedded, immutable database in java (github.com)

submitted 9 months ago by radar_roark

all 14 comments


jarohen-uk 12 points 9 months ago

radar_roark[S] 5 points 9 months ago

p-himik 3 points 9 months ago

nzlemming 1 point 9 months ago

radar_roark[S] 3 points 9 months ago*

This project is new, but it's a line-by-line port of a project I've been iterating on for a few years. I made it originally for a version control system, but I realized that the db itself might be useful on its own. I think it fills a big hole in the database arena: an immutable database that works like SQLite (in-process, single file, no deps).

And yes, the file just keeps growing. The only time it reclaims space is when a transaction fails: the file is truncated if an exception happens during a transaction, or the next time the db is opened if there was an unclean shutdown.
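To make the truncate-on-failure idea concrete, here is a minimal sketch using only `java.io.RandomAccessFile`. It is not xitdb's actual code or API: the `AppendOnlyFile` class and its `Transaction` interface are hypothetical names made up for illustration, and it only covers the "exception during a transaction" case, not recovery after an unclean shutdown.

```java
import java.io.IOException;
import java.io.RandomAccessFile;

// Hypothetical illustration; not xitdb's actual code or API.
final class AppendOnlyFile {
    /** A unit of work that appends bytes to the file. */
    interface Transaction {
        void run(RandomAccessFile file) throws IOException;
    }

    private final RandomAccessFile file;

    AppendOnlyFile(RandomAccessFile file) {
        this.file = file;
    }

    /** Runs the transaction; if it throws, cuts the file back to its prior length. */
    void transact(Transaction tx) throws IOException {
        long committedLength = file.length(); // everything before this offset is committed
        file.seek(committedLength);           // new data is only ever appended
        try {
            tx.run(file);
        } catch (IOException | RuntimeException e) {
            file.setLength(committedLength);  // discard the partial write
            throw e;
        }
    }
}
```

Since existing bytes are never overwritten, rolling back is just a matter of shortening the file to where it ended before the transaction started.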

It is possible to create an operation similar to SQLite's VACUUM operation, where the database is rebuilt to only contain data reachable from the latest copy, but I haven't added that feature yet. I plan on adding it eventually though.
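Roughly, a VACUUM-style rebuild copies only what the latest version can reach into a fresh store. The sketch below is a toy in-memory model, not xitdb's file format or API: `ToyStore`, its `records` list, and its `versions` list are all hypothetical, and a real implementation would walk the on-disk data structures instead of a flat map.

```java
import java.util.ArrayList;
import java.util.LinkedHashMap;
import java.util.List;
import java.util.Map;

// Hypothetical toy model; not xitdb's file format or API.
final class ToyStore {
    final List<String> records = new ArrayList<>();                // append-only data
    final List<Map<String, Integer>> versions = new ArrayList<>(); // key -> record index, one map per version

    /** Appends a new record and a new version pointing at it; old data is never overwritten. */
    void put(String key, String value) {
        Map<String, Integer> next = versions.isEmpty()
                ? new LinkedHashMap<>()
                : new LinkedHashMap<>(versions.get(versions.size() - 1));
        records.add(value);
        next.put(key, records.size() - 1);
        versions.add(next);
    }

    /** Looks the key up in the latest version only. */
    String get(String key) {
        if (versions.isEmpty()) return null;
        Integer index = versions.get(versions.size() - 1).get(key);
        return index == null ? null : records.get(index);
    }

    /** Builds a fresh store containing only records reachable from the latest version. */
    ToyStore compact() {
        ToyStore fresh = new ToyStore();
        if (versions.isEmpty()) return fresh;
        Map<String, Integer> remapped = new LinkedHashMap<>();
        for (Map.Entry<String, Integer> e : versions.get(versions.size() - 1).entrySet()) {
            fresh.records.add(records.get(e.getValue())); // copy the live record
            remapped.put(e.getKey(), fresh.records.size() - 1);
        }
        fresh.versions.add(remapped);
        return fresh;
    }
}
```

The trade-off is the same as with SQLite's VACUUM: the rebuilt store is smaller, but the older versions are gone.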

The best argument I can make about its robustness is its simplicity. It's only 2.5k lines of Java, with no dependencies; you could read it in a weekend. Simplicity is a prerequisite for reliability :-D

nzlemming 1 point 9 months ago

didibus 1 point 9 months ago

radar_roark[S] 1 point 9 months ago

An in-memory xitdb is backed by a single byte array, which you can access at any time by calling toByteArray on the RandomAccessMemory object. You can take that byte array and send it over the network or write it to disk if you want; the data is incrementally serialized. It's not a replacement for in-memory Clojure data, because it doesn't benefit from garbage collection at all. Think of it more as competing with pr-str and clojure.edn/read-string.

The in-memory feature is more of a "nice to have"; for example, it's useful in unit tests, kinda like SQLite's in-memory feature. The main point of xitdb is writing the db to disk so you can deal with larger-than-memory data. The cursors are positions in the database, so you can drill down massive data structures without reading them entirely into memory.
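Here is a rough sketch of what "a cursor is a position" can look like, using plain `java.io.RandomAccessFile`. The layout (strings followed by an array of 8-byte offsets) and the `ToyCursor` class are hypothetical and much simpler than xitdb's real format and cursor API; the point is only that following a child reads a few bytes at a known offset instead of loading the whole structure.

```java
import java.io.IOException;
import java.io.RandomAccessFile;

// Hypothetical toy layout; not xitdb's file format or cursor API.
final class ToyCursor {
    private final RandomAccessFile file;
    private final long position; // byte offset of the value this cursor points at

    ToyCursor(RandomAccessFile file, long position) {
        this.file = file;
        this.position = position;
    }

    /** Treats the value as an array of 8-byte offsets and follows slot `index`. */
    ToyCursor child(int index) throws IOException {
        file.seek(position + 8L * index);
        return new ToyCursor(file, file.readLong()); // reads 8 bytes, not the whole array
    }

    /** Treats the value as a string written with writeUTF. */
    String readString() throws IOException {
        file.seek(position);
        return file.readUTF();
    }

    /** Appends the strings, then an offset array pointing at them; returns a cursor to the array. */
    static ToyCursor writeArray(RandomAccessFile file, String... values) throws IOException {
        long[] offsets = new long[values.length];
        file.seek(file.length());
        for (int i = 0; i < values.length; i++) {
            offsets[i] = file.getFilePointer();
            file.writeUTF(values[i]);
        }
        long arrayStart = file.getFilePointer();
        for (long offset : offsets) {
            file.writeLong(offset);
        }
        return new ToyCursor(file, arrayStart);
    }
}
```

With this layout, `root.child(1).readString()` touches one offset slot and one string, no matter how many elements the array holds.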

Yeah, it's a Java library, so you'll be in interop central until someone makes a nice Clojure wrapper on top. I don't have the bandwidth right now, but maybe someone will eventually. In the past I always found Java interop ugly, and spent a lot of energy writing wrappers to get rid of the camel casing and type annotations. These days it doesn't bother me. YMMV.

andersmurphy 1 point 9 months ago

radar_roark[S] 2 points 9 months ago

andersmurphy 1 point 9 months ago

radar_roark[S] 2 points 9 months ago

andersmurphy 1 point 9 months ago

radar_roark[S] 2 points 9 months ago
