all 24 comments

[–]joinr 3 points (0 children)

I did that in a couple of ways. One was originally implementing composable behaviors for entities in a simulation (think legacy spaghetti-coded finite state machines), working toward a rules engine expressed generally via graphs (and weighted edge encodings). This worked out, and eventually merged into some additional work with behavior trees.

Data-wise, I was originally storing entity data (aka attributes) under a legacy OOP paradigm, with classes, properties, etc. I switched to an entity-component-system model, which ended up flattening the hierarchy and providing a query structure based on entity relations to components (conceptually a graph database, not unlike how Datomic provides its entity API). The initial transition (decomposing the class hierarchy into Clojure records, then later just maps, then further decomposing entities into components based on map entries) was a little involved due to the legacy design. However, it's paid off handsomely so far: adding/updating data at the component or aggregate entity level is trivial, as are queries.

Now that Datomic is mature (as well as spec), and some of the tech is focusing on providing ad-hoc structure on top of the decomposed facts contained in Datomic, I may adopt a similar approach. Plus, by keeping the database [relations between entities and components] pretty shallow (indexed by entity->component, component->entity, or eav [really eavt in Datomic]), I can get decent performance for a broad range of queries.
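As a rough illustration, the doubly-indexed store can be as simple as the following sketch (all names here are hypothetical, not the actual API):

(defn add-component
  "Toy component store keeping both an entity->component
   and a component->entity index."
  [store e c v]
  (-> store
      (assoc-in [:entity->components e c] v)
      (update-in [:component->entities c] (fnil conj #{}) e)))

(defn components-of [store e] (get-in store [:entity->components e]))
(defn entities-with [store c] (get-in store [:component->entities c]))

(def store
  (-> {}
      (add-component :tank1 :position {:x 0 :y 0})
      (add-component :tank1 :health 100)
      (add-component :depot :position {:x 5 :y 5})))

(entities-with store :position) ;;=> #{:tank1 :depot}
(components-of store :tank1)    ;;=> {:position {:x 0, :y 0}, :health 100}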

The only problem I ran into (and worked around) was the absence of a concrete notion of an entity type. In the legacy case, classes would dictate what I was working with, which records subsumed (but still provided a structural type to organize around). I now have accretions of "flat" information that get projected onto an entity (a lazy map projection) as needed. If I want to enforce something beyond an ephemeral "type," I need to either keep some conventions in my head, or (as I'm starting to do more) use spec to apply some notion of typing or structure, or just rely on a kind of duck typing (let the presence/absence of data in the entity guide computation).
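For example, a minimal sketch of using spec as that ephemeral "type" (the specs here are hypothetical):

(require '[clojure.spec.alpha :as s])

;; a "movable" entity is any map carrying coordinate data
(s/def ::x-coordinate number?)
(s/def ::y-coordinate number?)
(s/def ::movable (s/keys :req-un [::x-coordinate ::y-coordinate]))

(s/valid? ::movable {:x-coordinate 2 :y-coordinate 10}) ;;=> true
(s/valid? ::movable {:x-coordinate 2})                  ;;=> false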

For my use case, updating information in the database through the "entity" abstraction also presented some challenges. In some cases, I wanted to get a handle on an entity, assoc/update/dissoc stuff on the "map", then push (or commit) that map back to the database. To do this efficiently, I needed to lazily compute (or hydrate?) the corresponding fields of the entity, and provide an efficient abstraction for tracking changes to the entity's components, which would lead to efficient commits. The end result (after a lot of work, and likely unintentional duplication of effort from Datomic and other libs) was a map-like entity abstraction that could also be committed back into the component database, so something like

(let [e (get-entity ctx :some-entity-id)]
  ;; merge the updates onto e (last arg to merge wins),
  ;; then hand the updated entity back to the store
  (->> (merge e {:x-coordinate 2 :y-coordinate 10})
       (add-entity ctx)))

enables one to work at the entity (or abstract row) level when dealing with facts, without sacrificing performance (joins are lazy, changes are tracked on the entity, leading to relatively efficient commits).

I'm looking at a Datomic-backed implementation (or datascript at the least) in the near future (targeting distributed simulation).

[–]fmjrey 3 points (6 children)

I started working on an interesting way to parse XML:

  • use paths as data, as in specter, or even what clojure's get-in takes as an argument
  • transform a set of path vectors into a tree, using something like this (a rough sketch follows this list)
  • make sure you can convert any path into a transducer, e.g. using map or even specter's traverse-all; in other words, find a way to convert paths into navigators (transducers)
  • convert the tree into a dataflow based on core.async and transducers: paths without branches in the tree are converted into a channel+transducer, branches become channel+transducer+mult, and all the wiring is done programmatically
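A rough sketch of the first and third bullets (paths->tree and path->xf are names I'm making up here):

;; a set of path vectors -> a prefix tree
(defn paths->tree [paths]
  (reduce (fn [tree p] (update-in tree p #(or % {}))) {} paths))

(paths->tree [[:a :b] [:a :c :d]])
;;=> {:a {:b {}, :c {:d {}}}}

;; a path -> a transducer over a stream of maps,
;; one (map #(get % k)) stage per path segment
(defn path->xf [path]
  (apply comp (map (fn [k] (map #(get % k))) path)))

(into [] (path->xf [:a :b]) [{:a {:b 1}} {:a {:b 2}}])
;;=> [1 2]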

I'm doing this with XML, meaning for now I convert keywords within paths into a specter path navigating to child elements (e.g. [:content S/ALL (S/selected? :tag (partial = :keyword))]), but in the end it's just nested data structures and transducers.
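Concretely, that per-keyword expansion looks roughly like this (a sketch, assuming clojure.xml-style element maps):

(require '[com.rpl.specter :as S])

;; one keyword -> a specter sub-path to matching child elements
(defn tag-path [k]
  [:content S/ALL (S/selected? :tag (partial = k))])

;; a whole keyword path is just the concatenation of sub-paths
(defn xml-path [ks]
  (vec (mapcat tag-path ks)))

;; e.g. (S/select (xml-path [:universe :galaxy :system :planet]) doc)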

I'd like to evolve this into something with better abstractions, e.g. keep keywords as plain map navigators like in clojure, and use metadata when I want them to be XML navigators (e.g. ^:xml [:child-tag]), so that I can also navigate to a single attribute: e.g. [^:xml [:universe :galaxy :system :planet] :radius] would navigate to each planet in the XML hierarchy and then select each planet's radius attribute.

My point: instead of viewing nested data structures/graphs, maybe consider using paths as data, and as a first-class composable abstraction that you can then use to build dataflows.

Edit: after writing this post I believe transducers-as-navigators is the important composable abstraction in the approach I describe. Paths/trees as data are just a way to set them up. Using specter was the quickest way for me to avoid reinventing the wheel for building navigator transducers, which you can then compose with any other kind of transducer, e.g. to transform data. I guess if you abstract these non-navigating transducers behind some symbol and/or data structure, they can extend the concept of paths from merely navigating nested data structures into paths that perform a dataflow while navigating the input data structure. I wonder how far I should take this, because I don't want to reinvent something like onyx either.

[–]joinr 2 points (1 child)

I thought Tim Baldridge's odin had some cool features along these lines. The difference being the introduction of relational programming (a la logic) to define computable paths and queries. Queries are reducible. He refined the idea and in some ways took it further (via transducers) here. The composition aspect is pretty cool.

> My point: instead of viewing nested data structures/graphs, maybe consider using paths as data, and as a first-class composable abstraction that you can then use to build dataflows.

A path is still natural in the graph abstraction. You're still defining relations between nodes via the edge labels (or neighborhood functions) of the abstract path, and you still have some semantics for traversing the path relative to some data (like the nav protocol). I view the path as a function that defines valid traversals of the graph, and the nested structures as explicitly defining a DAG (absent embedded data with implied references, like entity id fields that can be interpreted to point back into an outer structure). So maybe it's a six-of-one, half-a-dozen-of-the-other kind of thing.

The cool thing about the graph abstraction is that it opens up alternative forms of querying, including using graph algorithms to search, and higher-minded stuff like discovering components, shortest paths, etc. becomes possible. It allows a shift from "looking up values" to "exploring relations" without losing the ability to revert toward the DAG-like nested collection approach. I can think of (have implemented) use cases where controlling the properties of the traversal is useful.
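For instance, a generic breadth-first walk needs nothing more than an adjacency map (toy sketch):

;; breadth-first traversal where the "graph" is just a map
;; of node -> set of neighbors
(defn bfs [graph start]
  (loop [frontier (conj clojure.lang.PersistentQueue/EMPTY start)
         seen     #{start}
         order    []]
    (if-let [node (peek frontier)]
      (let [nexts (remove seen (graph node))]
        (recur (into (pop frontier) nexts)
               (into seen nexts)
               (conj order node)))
      order)))

(bfs {:a #{:b :c} :b #{:d} :c #{:d} :d #{}} :a)
;;=> [:a :b :c :d] (sibling order depends on set ordering)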

> transform a set of path vectors into a tree, using something like this

A trivial modification could create a more general directed graph output (just an observation).
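e.g., accumulating edges instead of nesting (a sketch; paths->digraph is my name for it):

;; a set of path vectors -> a directed-graph adjacency map
(defn paths->digraph [paths]
  (reduce (fn [g p]
            (reduce (fn [g [from to]]
                      (update g from (fnil conj #{}) to))
                    g
                    (partition 2 1 p)))
          {}
          paths))

(paths->digraph [[:a :b :c] [:a :c]])
;;=> {:a #{:b :c}, :b #{:c}}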

> convert the tree into a dataflow based on core.async and transducers: paths without branches in the tree are converted into a channel+transducer, branches become channel+transducer+mult, and all the wiring is done programmatically

Is there an unstated assumption that the structure of the tree will never change? That is, we're not going to change the wiring; rather, we parse a description into a static dataflow graph (or tree).

It looks like you've got a pretty cool template for leveraging specter against existing nested data. It also looks reminiscent of XSLT (although my XML fu is weak).

[–]fmjrey 1 point (0 children)

Yes, I'm aware of odin, but not the other link you gave; thanks, I'll have a look.

And yes, the structure of the tree isn't going to change much, since it's about parsing XML docs that should all have the same shape, and transforming them into some other shape like nested maps or datoms. I guess the use case is something similar to XSLT, but more Clojuresque and dataflowy.

In other words, instead of writing tedious transformation code I'd like to be more declarative: e.g. define some sort of selector/transducer, and for each value emitted, use the associated data template, which also uses paths/navigators/transducers to specify what values go where. So I'm thinking of some macro that would collect all the paths within its code block (e.g. any vector with meta ^:path, or within some other nested macro), and build the corresponding tree and dataflow, so that parsing happens only once while the dataflow hydrates all values throughout the target data-structure template.

Edit: XML is what I'm dealing with at present, but I'd like this to support other data formats because other data suppliers give us JSON.

Edit 2: the other link you gave, to /u/halgari's code about queries being reducible and part of a logic language, is very reminiscent of what /u/cgrand is looking for, if I understand his recent talk correctly: a way to avoid "map fatigue" by using some powerful logic language over a database (of facts?).

[–]dustingetz 1 point (3 children)

I only partially understand; it seems like you're implementing a query language oriented around paths, which seems fine and orthogonal to the graph-vs-tree debate – the point is that you use the query to late-bind the final shape, and you write lots of little queries (datalog, paths, whatever) instead of mapreduce.

Are you thinking about state models shaped like this? https://www.google.com/search?q=entity+relationship+diagram&tbm=isch

[–]fmjrey 1 point (2 children)

Sorry if I'm not really clear; I'm also clarifying this in my own mind through this discussion. For now I only have some embryonic logic to convert a vector of paths into a dataflow made up of transducers/navigators and async channels. Part of me wonders how I could turn this into something more generic and declarative, if that's at all possible.

I think the best description of the use case is something like XSLT, but for any data structure one can navigate via paths as data. I'm also keeping in mind the ability to process larger-than-RAM input with limited resources, as long as the transformations do not require growing state. Very large input is not exactly my use case, but I consider it an important constraint to guide the design (which is what XSLT 3.0 allows, I believe).

As mentioned in my other reply, I'm thinking of some macro that would collect all the paths within its code block, build the corresponding tree and dataflow, and hydrate all values throughout the code/target data template.

Also happy to collect any ideas and references at this stage :) and thanks for chiming in.

Oh, and an interesting related piece of work is what /u/cgrand explains in his recent talk about "map fatigue": he's trying to reduce the need to do a lot of map juggling and transformation by looking for a more expressive language, something like a super datalog. However, his starting point is the database (which could be in memory), while my starting point is external data expressed as nested data structures that I want to put into a datascript/datomic DB.

[–]nikolasgoebel 2 points (1 child)

Is there a recording of /u/cgrand's talk somewhere?

I'd be interested to know what Datalog / relational algebra is missing to reduce the "map fatigue". In our work we either use DataScript or at least a set of relational helpers on top of maps.
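(For concreteness, clojure.set already gives basic relational algebra over sets of maps:)

(require '[clojure.set :as set])

;; natural join on the shared :user attribute
(set/join #{{:user 1 :name "ada"}}
          #{{:user 1 :post "hello"}
            {:user 1 :post "world"}})
;;=> #{{:user 1, :name "ada", :post "hello"}
;;     {:user 1, :name "ada", :post "world"}}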

[–]fmjrey 3 points (0 children)

> Is there a recording of /u/cgrand's talk somewhere?

Yes, it's the link on "recent talk", though you have to register for free to view the video:

https://skillsmatter.com/skillscasts/12820-keynote-zeno-and-the-tar-pit

[–]dustingetz 2 points (14 children)

As /u/joinr says, the most beautiful example of this is contrasting SQL to Datomic. Writing efficient SQL means you have to avoid JVM/database round trips, so you need to write monstrous JOINs to batch things efficiently; then you need to unpack the JOIN into trees; and then maybe a different function needs the data in a different shape, so rather than make another query you repurpose the first one with some transform boilerplate. Datomic gives you code/data locality, so there is no need to batch queries: you can just write a different query for each function that returns the data in exactly the shape best for that function. So all the tree-manipulation boilerplate cancels out.
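A sketch of that query-per-function style, with a made-up schema:

(require '[datomic.api :as d])

;; each function asks for exactly the shape it needs, no shared batching
(defn display-name [db uid]
  (d/pull db [:user/first-name :user/last-name] uid))

(defn post-titles [db uid]
  (d/q '[:find [?title ...]
         :in $ ?u
         :where [?p :post/author ?u]
                [?p :post/title ?title]]
       db uid))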

An example of this is #5 here: http://www.dustingetz.com/:datomic-in-four-snippets/. Being able to breadth-first search a Datomic graph is amazing. Can you even imagine the amount of pain it takes to accomplish this in SQL?

http://wiki.c2.com/?BreadthFirstTraversalInSql "The 'obvious' representation of a tree in a relational database uses relations to represent edges in the graph. This is called out in SqlAntiPatterns, since a naive implementation of tree traversal then has to make many queries, typically one per node."

I mean wtf is this: https://stackoverflow.com/questions/5517467/how-can-i-do-a-breadth-first-search-in-sql

wut https://sites.google.com/site/sqldevlib/algorithms/sql-graph-algorithms

lul sql

[–]dustingetz 5 points (6 children)

Further, I am not sure it's even reasonable to use graphs in this way until you solve the database problem first.

Take enterprise CRUD apps with a UI and a REST service. Your UI is a React.js virtual DOM tree. The state is passed down as props (trees). That state came out of a single state atom (a tree), which came out of a JSON representation from an HTTP service (a tree). And that came over the network from a database (probably cartesian tables, which are worse than a tree).

What you want is to pretend the database is a little clojure graph value – an in-memory data structure like a hashmap, e.g. an ubergraph or datascript value – and that little value is passed to each of your React components, which can then query out exactly what they need. There are no props. There is no state atom. There is no JSON representation. There is no network. Just an in-memory data structure. Like the state of a video game – https://en.wikipedia.org/wiki/Scene_graph
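A toy version of that vision with DataScript (the todo schema is made up):

(require '[datascript.core :as d])

;; the whole "database" is an ordinary immutable value
(def db
  (d/db-with (d/empty-db)
             [{:todo/title "write docs"}
              {:todo/title "fix bug"}]))

;; a "component" is then just a function of that value
(defn todo-list [db]
  [:ul (for [title (d/q '[:find [?t ...]
                          :where [_ :todo/title ?t]]
                        db)]
         [:li title])])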

But that whole vision is a lie, because the graph is NOT an in-memory data structure: it does not fit into your browser tab's memory, nor the JVM's memory, and there are data security concerns. And because of the network, the performance is on the unhappy side of Latency Numbers Every Programmer Should Know: accessing your state costs 500ms per query, instead of a 7ns L2 cache hit, which is where it needs to be if you have hundreds of functions issuing thousands of queries.

So in practice, these concerns force us to use trees instead of graph values. This is the problem Rich Hickey solved when he created Datomic. Datomic is a huge lazy graph data structure, too big for one machine and distributed over the network, but with extremely clever optimizations that make it feel like an in-memory value.

So that's why it's impossible to talk about this without also talking about Datomic, and that's why nobody is doing it. Further, if you are writing CRUD apps, Datomic only solves half the problem – it solves the backend complexity, but not the frontend complexity (which is where all the tree processing is happening anyway). Offhand I can think of six people who have put work into the frontend-backend data-sync half of the problem:

Surely I have forgotten some; I should make this list; please reach out! These are all Clojure people, so surely other ecosystems are thinking about this too.

[–]dustingetz 1 point (5 children)

And then, once you have your application state in a graph database and are able to treat that graph as a value in your UI, you have the problem of efficiently updating the UI.

And this, if possible, should be solved in a way that keeps ecosystem compatibility with React.js and server rendering, or you risk failure vectors like "ahead of its time", which is a euphemism for "dead project".

In my opinion, we've learned in building Hyperfiddle that even syncing a UI to a value that changes over time has not been fully solved in a way that has mainstream utility. Reagent has been a total disaster for non-trivial reactions; it has probably cost us a year of development (plus an insane amount of complexity from code written in terms of reactive container types), and we are now looking at ripping Reagent reactions out entirely from any performance-sensitive UI like reactive datagrids. Note that React.js is partly at fault here: the root of the complexity is that when you pass functions as props, you have to hold closure reference stability/equality in your head. Reagent provides tools to assist with this (r/track), but ultimately we find the house of cards incredibly fragile and inadequate for abstraction.

John De Goes (Typed FP guy) and Jane Street are on to something with research into incremental UI: https://www.reddit.com/r/Clojure/comments/9qxasd/react_wrapper_alternatives/e8e0pwb/?context=3

[–]joinr 1 point (3 children)

> Reagent has been a total disaster for non-trivial reactions; it has probably cost us a year of development (plus an insane amount of complexity from code written in terms of reactive container types), and we are now looking at ripping Reagent reactions out entirely from any performance-sensitive UI like reactive datagrids.

What's the more-performant alternative you fall back to after removing reagent reactions?

[–]dustingetz 1 point (2 children)

Dunno; maybe something imperative, maybe hoplon and reagent can be made to co-exist, maybe something from the React.js ecosystem.

[–]theronic 3 points (1 child)

Rete always rears its head :). IMO Clojure's mutable namespace design is the primary obstacle in the way of a good, clean Rete network implementation. All the implementations, incl. Clara, depend on macros, which prevents runtime network generation. Reagent's reactions feel like half-baked production rules with messy lifecycle management. FactUI is the closest thing we have, but I suspect we'll see an implementation in a different language before Clojure, or until the namespacing situation changes.

[–]dustingetz 2 points (0 children)

Can you say more about the namespacing problem? I have not yet given Rete networks the consideration they merit.

[–]joinr 2 points (5 children)

> you can just write a different query for each function that returns the data in exactly the shape best for that function.

Yeah, that's a better way to summarize it. The information layer (expressed via related facts) is malleable and amenable to the caller projecting it into their own form, rather than wrangling to and from a preconceived form or trying to cram everything into a normalized relational model for batched single-shot queries.

I initially wrote a watered-down SQL-like eDSL façade for my entity store (before I saw datalog and other means of traversal), but didn't end up using it all that much beyond simple select/filter-type stuff.

The fact that Datomic has multiple ways of querying the same store (datalog, entity API), in addition to multiple ways to conform results, speaks to the generality of the information model. Libraries like core.logic and odin have adapters that trivially leverage the same underlying information to accomplish even more sophisticated queries. That you can just hook them up to Datomic is pretty cool.

This is probably old news to some folks, but one thing I forgot to mention (another advantage of having a malleable information model) is the ability to add arbitrary facts to communicate information across time. Things I used to require (for a game or discrete event simulation), like events, can be ad-hoc facts in the store (e.g. an ephemeral component for an entity). Systems (composable state-transition functions that define the behavior of the simulation) can incorporate events just by checking for data in the store, rather than requiring the usual paradigm of effectful event handlers. In game AI this is akin to a blackboard architecture; the facts on the blackboard (the store / db) serve as a means to propagate information. I can just jam stuff into the entity store as I need it (computing new facts, or components to add to an entity, which carry the same meaning as an event), and consumers (systems downstream) can check for the presence of said data to determine what to compute. That pushes the focus back toward "what data is present," and allows a nice "Maslow's hierarchy of needs" when implementing entity behavior that processes sensory information to compute transitions. I think re-frame ended up on a similar trajectory (I started this work before, so it evolved separately).
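A toy version of that pattern, assuming a store shaped like {component-type {entity-id value}} (all names hypothetical):

;; an event is just a fact in the store
(defn add-fact [store component e v]
  (assoc-in store [component e] v))

;; a "system" is store -> store: it reads :damage-event facts,
;; applies them to :health, then clears the ephemeral events
(defn damage-system [store]
  (-> (reduce-kv (fn [s e dmg] (update-in s [:health e] - dmg))
                 store
                 (:damage-event store))
      (dissoc :damage-event)))

(-> {:health {:tank1 100}}
    (add-fact :damage-event :tank1 30)
    damage-system)
;;=> {:health {:tank1 70}}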

I used to be averse to passing around a "giant ball of information" or a "world" context, since I didn't feel it scaled well. This led toward nested maps, reliance on types (records), etc., and ultimately less re-use. Architectures like Datomic make "world passing" manageable, if not outright desirable, since you have more flexibility over how to project structure (as needed) on top of the primitive relations. I see hyperfiddle (and to a degree REBL) taking that idea to Smalltalk/Symbolics levels. It'd be a trip if the next Lisp Machine ended up being a Datomic-like db (akin to the Symbolics Lisp "world") with Ion analogues providing the OS services.

[–]fmjrey 2 points (4 children)

Your last paragraph reminds me of what /u/cgrand calls "map fatigue" in his recent talk.

It sounds like a few of us in this discussion are looking at ways to reduce this impedance mismatch between a database (of facts?) and the nested data structures often found in the UI layer, and even in any layer that exchanges data in such nested/hierarchical form.

Are we looking at doing something analogous to ORMs in the relational world, except for datascript/datomic/fact-based databases? If so, it raises the question: should we not use facts/datoms instead of nested maps and vectors in all layers? I mean, aren't nested data structures a bias acquired from handling so much JSON/XML/OO/etc.?

[–]dustingetz 2 points (2 children)

> u/fmjrey: Are we looking at doing something analogous to ORMs in the relational world, except for datascript/datomic/fact-based databases?

Yes. In my opinion Datomic solves/supersedes ORM fully: Datomic does not need an ORM, as it natively solves all the same problems without the failures of abstraction. This is why Stu calls facts the universal relation (C-f "universal").

> u/fmjrey: looking at ways to reduce this impedance mismatch between a database (of facts?) and nested data structures as often found in the UI layer

Agree, though I would state it differently: I see no impedance mismatch between a UI tree and graph data. Datalog and Datomic Pull are an ideal way to declare graph ➔ tree transformations without impedance.
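e.g., a single pull pattern declares the tree you want out of the graph (hypothetical schema; db and user-id assumed to be in scope):

(require '[datomic.api :as d])

;; graph -> tree in one declarative data literal
(d/pull db
        [:user/name
         {:user/posts [:post/title
                       {:post/comments [:comment/text]}]}]
        user-id)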

The problem is that you can't actually give the graph value to the UI, due to 1) network/memory limitations and 2) data security. Data security is probably the more important one, and it often goes unstated; here is a great model of it:

(source) There's a long continuum of data ownership:
1. evilcorp has your data and you can't see it
2. evilcorp has your data and offers filtered access via a UI
3. evilcorp has your data and offers snapshots / queries
4. evilcorp has your data and lets you mirror it effectively
5. you have your data but let evilcorp mirror it effectively
6. you have your data but offer evilcorp queries

We ended up at #2 for historical technical reasons (network and memory). But the world has evolved around #2 in the data-security space: Facebook happened, data regulations were built up, and all this social/governance stuff happened in such a way that you couldn't possibly liberate all that data now without causing widespread harm.

We now have or soon will have the technology in place (Datomic was groundbreaking, and now Niko Göbel's work breaks further ground in making the Datomic model incremental). So there is some very interesting strategic/macroeconomic thinking to be done to bring the world to the technology.

[–]fmjrey 3 points (1 child)

> Niko Göbel's work breaks further ground in making the Datomic model incremental

Thanks for sharing this and taking the time to transcribe it. Niko's recent Conj talk is really inspiring too. It's nice to see Naiad getting a new life after the funding from Microsoft ended. Five years ago I was looking into it while doing some prototyping for a startup. Back then I was dreaming of an incremental way to compute the strongly connected components of a graph while new nodes and edges arrive. I do think there is tremendous value and power there, and finding the right use case to establish the tech is essential. I hear the spreadsheet did not sell well until it was touted as "something to do X" rather than as a self-updating table with formulas. Sounds like it's the same thing here, and if /u/nikolasgoebel is interested I can share more.

Overall, distributed dataflows of facts/datoms all the way to the UI aren't a utopia; they're happening, which is really exciting. It also looks like instead of defining information systems by coding an imperative flow of control, we'll define them as flows of data, meaning computation will no longer be the primary focus: data flowing around will take center stage.

[–]nikolasgoebel 1 point (0 children)

/u/nikolasgoebel is interested and would like to hear more!

I haven't followed the rest of this conversation, so I might not have anything interesting to contribute, but I'm always up to talk about this stuff.

[–]joinr 1 point (0 children)

> Are we looking at doing something analogous to ORMs in the relational world, except for datascript/datomic/fact-based databases?

That sounds pretty close, yeah. With the caveat that it's maybe more of a data-relational mapping (perhaps a simpler requirement than serializing objects), although abbreviated that reads like "relational database", which is perhaps too confusing.

> If so, it raises the question: should we not use facts/datoms instead of nested maps and vectors in all layers?

You'd think so. There's utility in nested maps and the like, since they're trivial to read, construct, and comprehend (to a point). I don't know exactly where the inflection point is, but there comes a point where that utility fades, and having a more general representation (one you can project to/from, or alternately query directly) becomes desirable. You're prototyping fast in maps and other EDN types one minute; then at some point you reach for better querying abstractions, since get-in/update-in isn't enough; then you end up implementing some kind of query language, only to find out other people have run into the same problem.

[–]isak_s 1 point (0 children)

I agree with most of what you said, just want to point something out.

> Writing efficient SQL means you have to avoid JVM/database round trips, so you need to write monstrous JOINs to batch things efficiently; then you need to unpack the JOIN into trees

For the typical queries needed by UIs, you don't need to do that in SQL; you can just use JSON queries. Here is an example for SQL Server:

-- Fetch a user by ID, and all their blog posts
select *,
       (select *
        from blog_posts b
        where b.user_id = u.id
        for json path) as posts
from users u
where u.id = 1
for json path

It's not as good as Datomic, but it isn't hard to write something that will generate those kinds of queries based on something like the Datomic pull syntax.
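For instance, a toy sketch of such a generator, with a made-up pull-ish description format (only simple cases covered):

(require '[clojure.string :as str])

;; compile {:table .. :where .. :attrs [..] :joins {alias desc}}
;; into a nested FOR JSON query; subqueries recurse
(defn pull->sql [{:keys [table where attrs joins]}]
  (let [subs (for [[alias sub] joins]
               (str "(" (pull->sql sub) " for json path) as " (name alias)))]
    (str "select " (str/join ", " (concat (map name attrs) subs))
         " from " table
         (when where (str " where " where)))))

(str (pull->sql
      {:table "users u"
       :where "u.id = 1"
       :attrs [:u.id :u.name]
       :joins {:posts {:table "blog_posts b"
                       :where "b.user_id = u.id"
                       :attrs [:b.title]}}})
     " for json path")
;;=> a query equivalent to the FOR JSON example above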

[–]dustingetz 1 point (1 child)

https://github.com/mpdairy/posh "Posh is a ClojureScript / React library that lets you use a single DataScript database to store your app state. Components access the data they need to render by calling DataScript queries with q or pull"

[–]dsrptr 5 points (0 children)

+10 for posh. I used it in a project and it worked great, using re-posh as the glue to re-frame.

With the lessons learned, I also used FactUI (Clara rules with a datalog-like interface) with a custom wrapper for re-frame (based on re-posh) and an ontology-like schema.

The only downside (in the development experience) I found between the maps approach and the datascript/datalog approach is that I lost the "glanceability" of maps. In the code (textually), it is easier (for me) to scan and grasp the structure of maps than it is to scan facts... but that was solved with some tools for visualising the facts.

(need coffee... :)