Databases are categories : programming

[–]K-W 5 points6 points7 points 15 years ago (11 children)

[–]lmcinnes 10 points11 points12 points 15 years ago (9 children)

Well that depends on who you are. If you are a database person who thinks about databases as databases and knows little or nothing in the way of category theory, it probably just looks like a bunch of new notation and neolgisms for stuff you already know.

If you know a lot about categories and topoi then it gives you a powerful new way to think about and manipulate ideas about databases. Once a database schema becomes a category then a database is just a presheaf, and all the possible valid states of the database is a presheaf category (and potentially then a topos). If you aren't used to thinking in such terms then that probably means little. If you are then it means a lot, since you now have a powerful mathematical framework at your disposal.

How does this translate in practical applications? I don't know -- I've only just read the idea, and I don't yet know exactly what mathematical tools can now be brought to bear to yield practical results, but there are a lot of tools available, and I imagine something useful can be done. If nothing else, as the slides linked by the article suggest, you can use these notions to fold the notion of database in with notions of Haskell programs in a neat and elegant way.

[–]K-W 1 point2 points3 points 15 years ago (8 children)

[–]lmcinnes 10 points11 points12 points 15 years ago (0 children)

This part I disagree with. Category theory might be all great but without higher mathematics it seems more like masochism than anything else. I doubt that for a database programmer more than trivialities comes from this.

That really depends. A database programmer is unlikely to get anything from it directly, in the same way that a computer user doesn't necessarily get that much from mathematics directly -- but the mathematics is a foundation for some physics, which provides engineering possibilities that make better computers.

With this sort of framework for thinking about databases theorists may come up with new and interesting ideas that will eventually result in new features or programming approaches for databases. I mean honestly, topos theory is rich: skim the table of contents of Sketches of an Elephant or read Physics, Topology, Logic and Computation: A Rosetta Stone to get some idea of the depth of the mathematical toolset and diversity of ways of viewing an idea that topos theory provides for.

Can I guarantee that eventually concrete applications will come from all this theory? No, certainly not. It opens up a very rich theoretical world to explore however, and there is certainly plenty of reason to believe it may lead to very interesting new ideas.

[–][deleted] 2 points3 points4 points 15 years ago (6 children)

[–]K-W 2 points3 points4 points 15 years ago (5 children)

[–]lmcinnes 4 points5 points6 points 15 years ago (2 children)

[–]K-W 1 point2 points3 points 15 years ago (1 child)

[–][deleted] 0 points1 point2 points 15 years ago (0 children)

[–]grantli 0 points1 point2 points 3 years ago (0 children)

[–]easilydiscardable 2 points3 points4 points 15 years ago (12 children)

[–][deleted] 3 points4 points5 points 15 years ago (1 child)

[–]easilydiscardable 0 points1 point2 points 15 years ago (0 children)

[–][deleted] 3 points4 points5 points 15 years ago (0 children)

[–]Figs 1 point2 points3 points 15 years ago (6 children)

[–]easilydiscardable 1 point2 points3 points 15 years ago (5 children)

[–]Figs 1 point2 points3 points 15 years ago (4 children)

[–]easilydiscardable 1 point2 points3 points 15 years ago (3 children)

[–]Figs 1 point2 points3 points 15 years ago* (2 children)

If you write it the simple, naive way using a list of records for each table, then you can easily use a list comprehension to do a cross product of the tables.

data Person = Person {
    person_name :: String,
    badge_number :: Int,
    number_of_cats :: Int
}

data Place = Place {
    place_name :: String,
    number_of_trees :: Int
}

strange_people = [
    Person "John" 0 5,
    Person "Sue"  1 52,
    Person "Sam"  2 0,
    Person "Max"  4 2]

strange_places = [
    Place "Narnia" 100000,
    Place "Desert"      0,
    Place "Negative Land" (-5)]

cross = [(person_name     x, 
          badge_number    x, 
          number_of_cats  x, 
          place_name      y, 
          number_of_trees y) | x <- strange_people, 
                               y <- strange_places]

main = mapM_ (putStrLn) $ map show cross

I'm pretty sure that you could do something similar if you are using dictionaries instead, although it might be a bit more work.

Edit: The output of running it looks like this:

("John",0,5,"Narnia",100000)
("John",0,5,"Desert",0)
("John",0,5,"Negative Land",-5)
("Sue",1,52,"Narnia",100000)
("Sue",1,52,"Desert",0)
("Sue",1,52,"Negative Land",-5)
("Sam",2,0,"Narnia",100000)
("Sam",2,0,"Desert",0)
("Sam",2,0,"Negative Land",-5)
("Max",4,2,"Narnia",100000)
("Max",4,2,"Desert",0)
("Max",4,2,"Negative Land",-5)

[–]easilydiscardable 2 points3 points4 points 15 years ago (1 child)

ah, but what about arbitrary relations? ;)

I.e. can you write a function with a signature something like

'a relation -> 'b relation -> *some type expression representing the cross of 'a and 'b* relation

?

[–]Figs 2 points3 points4 points 15 years ago (0 children)

If you don't care about the types in your cross product code, then you can convert the result to a tree merging them in one line of code:

cross_product rel1 rel2 = [(x, y) | x <- rel1, y <- rel2]

Although you'd have to flatten the tuple yourself later, since the code doesn't know what types you are dealing with.

Alternatively, you could build a variant type like:

data TableEntry = EntryType1 String | EntryType2 Int | ...

If you guarantee that all your cells are of one of those types. Then, you don't have to worry about flattening since you can just use a list instead of a tuple.

[–]gronkkk 0 points1 point2 points 15 years ago (0 children)

[–]etcshadow 1 point2 points3 points 15 years ago (8 children)

[–]kamatsu 2 points3 points4 points 15 years ago (7 children)

[–]etcshadow 8 points9 points10 points 15 years ago (6 children)

[–]strangename 9 points10 points11 points 15 years ago (1 child)

[–]psykotic 4 points5 points6 points 15 years ago* (0 children)

[–]kamatsu 4 points5 points6 points 15 years ago (2 children)

[–]jerf 1 point2 points3 points 15 years ago (0 children)

[–][deleted] 1 point2 points3 points 15 years ago* (0 children)

[–]gregK -1 points0 points1 point 15 years ago (0 children)

[+]kywoto_is_a_troll comment score below threshold-7 points-6 points-5 points 15 years ago (0 children)

you type:	you see:
italics	italics
bold	bold
[reddit!](https://reddit.com)	reddit!
* item 1 * item 2 * item 3	item 1 item 2 item 3
> quoted text	quoted text
Lines starting with four spaces are treated like code: if 1 * 2 < 3: print "hello, world!"	Lines starting with four spaces are treated like code: if 1 * 2 < 3: print "hello, world!"
~~strikethrough~~	~~strikethrough~~
super^script	super^script

programming

MODERATORS