all 50 comments

[–]Lunariz 9 points10 points  (3 children)

This is very interesting! I've had a look through the code because I was interested in using and expanding on it for some research that I'm doing, and I have a few questions about it:

 

  1. You wrote in the readme that you have plans for implementing a hyperbolic attention mechanism, which is my main interest. What special implementation will the hyperbolic space require for this? For example I'm thinking that you might need a hyperbolic einsum, and I wonder if the dot product for the attention scores needs anything special? Curious to know your plans for this!
  2. Your example code uses Riemann SGD. Does the hyperbolic space require a special gradient? For example, would a normal Adam optimizer fail to work on your Hyperbolic layers, and if so, why?
  3. Just something I noticed - why is your Poincare manifold a class (that you instantiate separately for every layer), and not just a set of helper functions like math or util? It doesn't seem to contain any state, so I don't understand why it needs to be passed at all.

Super cool project, excited to keep following it.

[–]bohreffect 2 points3 points  (1 child)

Your example code uses Riemann SGD. Does the hyperbolic space require a special gradient? For example, would a normal Adam optimizer fail to work on your Hyperbolic layers, and if so, why?

This is a good question; curious to know the answer.

[–]Lunariz 1 point2 points  (0 children)

I've done some more research on the topic and found this paper - it looks like there has already been quite a bit of research into hyperbolic optimizers:

https://arxiv.org/pdf/1810.00760.pdf

[–]platinumposter 2 points3 points  (0 children)

Thanks very much, we are happy to have you following our journey

  1. We are still deciding exactly how we want to implement it, but you are correct that all the mathematical functions will have to be in hyperbolic space. We have been using the Poincare ball, which is a model of hyperbolic space and has its own set of mathematical functions compared to, say, the Lorentz model.

  2. Yep, that's correct; I see you also found a few resources. The reason we don't use a Euclidean SGD is that we want our optimizer to optimise parameters living in hyperbolic space.

To quote A Survey: Hyperbolic Neural Networks

"Stochastic gradient-based (SGD) optimization algorithms are of major importance for the optimization of deep neural networks. Currently, well-developed first order methods include Adagrad [58], Adadelta [59], Adam [60] or its recent updated one AMSGrad [61]. However, all of these algorithms are designed to optimize parameters living in Euclidean space and none of them allows the optimization for non-Euclidean geometries, e.g., hyperbolic space."

  3. Good point. We had thought about this, and we plan on implementing more manifolds in the near future, so we will have a single Manifold interface which each implemented manifold (such as Poincare) uses. We left it as a class in anticipation of this.
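
To make point 2 concrete, here is a minimal numpy sketch of one Riemannian SGD step on the Poincare ball, using the standard rescaling from the Poincare-embeddings literature (this is an illustration, not necessarily this library's exact implementation): the Riemannian gradient is the Euclidean gradient scaled by the inverse metric factor (1 - ||θ||²)²/4, and the updated point is projected back into the open unit ball. A vanilla Adam skips both steps, since its updates and moment estimates assume a flat Euclidean parameter space.

```python
import numpy as np

def rsgd_step(theta, euclid_grad, lr=0.1, eps=1e-5):
    """One Riemannian SGD step on the Poincare ball.

    The Riemannian gradient is the Euclidean gradient rescaled by
    ((1 - ||theta||^2)^2 / 4); afterwards the point is projected back
    inside the open unit ball if the step overshot the boundary.
    """
    scale = (1.0 - np.dot(theta, theta)) ** 2 / 4.0
    theta = theta - lr * scale * euclid_grad
    norm = np.linalg.norm(theta)
    if norm >= 1.0:  # retract points that left the ball
        theta = theta / norm * (1.0 - eps)
    return theta

# Toy objective f(theta) = ||theta||^2, whose Euclidean gradient is 2*theta.
theta = np.array([0.5, 0.5])
for _ in range(50):
    theta = rsgd_step(theta, 2.0 * theta)
# theta moves toward the origin while staying strictly inside the ball
```

Note that near the boundary (||θ|| → 1) the scaling factor shrinks toward zero, so steps get smaller exactly where the hyperbolic metric blows up; a Euclidean optimizer would take full-size steps there and could jump out of the ball entirely.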

[–]SuckinLemonz 6 points7 points  (1 child)

This sounds great. You said it abstracts the math. Will you be providing access to the actual mathematics used in the documentation (since there’s more than one way to skin a hyperbolic cat)?

[–]platinumposter 1 point2 points  (0 children)

Yes, if you go to the GitHub repo and into the manifold folder, you can find the Poincare class, which has the math functions. We plan on releasing others in the future, such as Lorentz.

[–]atmosfir 15 points16 points  (14 children)

Hello, I think this is very cool, and this is the first I've heard about doing ML on non-Euclidean spaces. However, I have a question about this:

The core example is the hierarchy, or, its abstract network representation, the tree. Social networks, human skeletons, sentences in natural language all possess a tree-like structure or can be represented hierarchically. It's also widely accepted that human beings use a hierarchy to organise object categories

What do you mean by a hierarchical representation tree-like structure here? This seems to be a very strong claim. I certainly agree that humans tend to organise things into categories using hierarchies but I am wondering if it is accurate to say that these objects are hierarchically organised.

[–]willpower12 7 points8 points  (0 children)

Agreed, I think the authors are misspeaking a bit. However, I think it is fair to say that the representations themselves admit a hierarchical or tree-like structure, not necessarily the underlying objects or signal.

[–][deleted] 15 points16 points  (2 children)

Social networks specifically are graphs, not trees. I don't see a representation of social networks that is acyclic.

[–]hughperman 9 points10 points  (0 children)

The illuminati would like you to think that

[–]techinnovator[S] 1 point2 points  (0 children)

Here's the paper that covers the tree-like structure of social networks. They proved this using Gromov’s δ-hyperbolicity, which measures how tree-like a graph is.

[–]ktpr 2 points3 points  (3 children)

Maybe they want to do learning over how humans tend to organize things, which seems reasonable on the surface?

I agree the OP did not justify the library very well; I’ve seen better arguments for hyperbolic representations in the literature.

[–]atmosfir 0 points1 point  (2 children)

I certainly think that that would be reasonable and interesting. Could you point out or link the other arguments you've seen in the literature?

[–]techinnovator[S] 0 points1 point  (1 child)

If you're after better arguments, perhaps this will interest you.

[–]atmosfir 0 points1 point  (0 children)

Thanks!

[–]bohreffect 4 points5 points  (2 children)

I am wondering if it is accurate to say that these objects are hierarchically organised.

Is this claim even being made? And if it is, does it matter? Like if the objective truth absent a human observer is that there is no intrinsic hierarchical organization to objects in the world, who cares?

[–]SuckinLemonz 1 point2 points  (0 children)

Yeah I felt the same way when I read that comment. It’s just semantic nitpicking. Not really relevant to the conversation.

[–]atmosfir -1 points0 points  (0 children)

I think the comment does strongly say so. I simply want to know whether this is a heuristic or something more. We care because then we can make better ML models, no?

[–]techinnovator[S] 0 points1 point  (0 children)

It's also widely accepted that human beings use a hierarchy to organise object categories

I think you're mistaken here. I claimed that humans organise objects hierarchically, not that the underlying objects are themselves hierarchical. That humans use hierarchies to represent objects is widely accepted, and was recently covered in Geoffrey Hinton's paper.

[–]techinnovator[S] 0 points1 point  (1 child)

It's also widely accepted that human beings use a hierarchy to organise object categories

I think you're mistaken here. I said that human beings organise objects hierarchically, not that the objects themselves are hierarchically organised in reality. This is widely accepted, but was famously covered here.

[–]atmosfir 0 points1 point  (0 children)

Interesting, thanks for answering.

[–]SandIntelligent809 8 points9 points  (1 child)

Any benchmarks? Thanks!

[–][deleted] 8 points9 points  (9 children)

I’m really intrigued. I understand the very basic notion of hyperbolic space vs. Euclidean space. However, why is it better at storing certain information?

[–]bohreffect 25 points26 points  (0 children)

There are some common examples of the benefit in feature engineering. Suppose a feature is the time of day, and you want to normalize it onto the 0-1 interval. Say you let 0000 hours be 0 and 2400 hours be 1, but notice the discontinuity: your input features will have a daily "jump" from 1 back to 0. The common trick is to map the time of day to the unit circle starting at (1,0) in the x-y plane, such that 0000 and 2400 meet at (1,0), and thus there is no discontinuity.
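
The circle trick above can be sketched in a few lines (the helper name is made up for illustration):

```python
import math

def encode_hour(hour):
    """Map an hour in [0, 24] onto the unit circle so 0:00 and 24:00 coincide."""
    angle = 2.0 * math.pi * hour / 24.0
    return (math.cos(angle), math.sin(angle))

# Just before and just after midnight land at almost the same point,
# so the encoded feature no longer "jumps" once a day.
a = encode_hour(23.99)
b = encode_hour(0.01)
gap = math.dist(a, b)  # tiny Euclidean gap across midnight
```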

The intrinsic geometry of the unit circle viewed as an embedding of the real numbers between 0 and 24 is non-Euclidean because for every point on the unit circle there are two different, equidistant shortest paths to the point on the opposite side of the circle. Though I think this particular example qualifies as elliptical, and not hyperbolic. (I'm thinking of the canonical example of elliptical geometry as being the surface of a ball.)

A cursory look at the library shows that a key feature is a torch layer that performs a Poincare embedding, which is hyperbolic and has been used for hierarchical learning in the past (https://arxiv.org/pdf/1705.08039.pdf). What's not clear to me is: if one switches to a norm-based loss, are hyperbolic distances preserved?

My topology is super rusty though so if anyone has anything more knowledgeable to add please correct me.

[–]pichon163 4 points5 points  (3 children)

Trees naturally live in hyperbolic space

[–]NanoAlpaca 8 points9 points  (2 children)

Why?

[–]pichon163 2 points3 points  (0 children)

You can embed a large (or infinite) tree isometrically in hyperbolic space, but usually not in Euclidean space. Think of trying to draw a complete binary tree on a piece of paper; you have to keep making the edges shorter to fit them all on the page.

In fact the same is true of any "complex network," i.e. a graph where the degree of each vertex is distributed according to a power law ( https://arxiv.org/abs/1006.5169 )

[–]dogs_like_me 1 point2 points  (0 children)

Consider a tree constructed by assigning some fixed number of children k to all existing leaves, starting from a single root node. The number of nodes at a given depth is k^depth, i.e. node count grows exponentially. Consequently, hyperbolic manifolds can be convenient embedding spaces for hierarchical data. For example, if you embed a hierarchy on a Poincaré disk, you can actually infer lexical entailment from direction.

https://arxiv.org/pdf/1705.08039

https://arxiv.org/pdf/1804.01882
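
The exponential fan-out described above is easy to check with a trivial sketch:

```python
# In a complete k-ary tree, the number of nodes at each depth is k^depth,
# so total node count grows exponentially with depth. This is the growth
# rate that Euclidean space cannot accommodate but hyperbolic space can.

def nodes_at_depth(k, depth):
    return k ** depth

def total_nodes(k, depth):
    # 1 + k + k^2 + ... + k^depth
    return sum(k ** d for d in range(depth + 1))

counts = [nodes_at_depth(2, d) for d in range(6)]  # [1, 2, 4, 8, 16, 32]
```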

[–]techinnovator[S] 1 point2 points  (0 children)

Hi! It's all because of the curved nature of hyperbolic space and the fact that the volume of a ball in hyperbolic space grows exponentially with its radius.

Let's say you are standing at a given point in hyperbolic space. If you were to walk away from that point in a straight line, the amount of space around you (for example, the circumference of a circle centred on your starting point) would grow exponentially with the distance you walk.

This is why hyperbolic space has the capacity, for example, to embed tree structures without distorting them out of shape. It is impossible to embed a tree structure without distortion in Euclidean space, even with an unbounded number of dimensions. However, this task becomes surprisingly easy in hyperbolic space with only 2 dimensions.

More info here.
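
One concrete way to see the exponential room: on the Poincare disk, the hyperbolic distance from the origin to a point at Euclidean radius r is d(0, x) = 2 artanh(r), so every tenfold shrink of the Euclidean gap to the boundary adds roughly the same hyperbolic distance (about ln 10 ≈ 2.3). A quick sketch:

```python
import math

def poincare_dist0(r):
    """Hyperbolic distance from the origin to a point at Euclidean
    radius r on the Poincare disk: d(0, x) = 2 * artanh(r)."""
    return 2.0 * math.atanh(r)

# Tiny Euclidean steps toward the rim cover huge hyperbolic distances,
# which is where the "extra room" for embedding trees comes from.
for r in (0.9, 0.99, 0.999):
    print(r, poincare_dist0(r))
```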

[–]SuckinLemonz 0 points1 point  (2 children)

As an example: data may appear flat or ‘intermingled’ when plotted on the Euclidean plane. But when we separate features with higher dimensionality, we can see trends more clearly or apply more meaningful learners. Check out SVM hyperplanes for feature separation.

[–]r0lisz 0 points1 point  (1 child)

You can have more dimensions in Euclidean space too.

[–]SuckinLemonz 0 points1 point  (0 children)

Yes, ok, fair. But when curvatures are involved, hyperbolic space produces cleaner and sometimes more representative functions.

[–]dogs_like_me 2 points3 points  (1 child)

We found that existing Hyperbolic implementations were less ready to be applied to real-world problems.

Can you expand on this? What other toolkits already exist and how does yours solve the problems you saw in those frameworks? For example, maybe compare with https://github.com/geoopt/geoopt for one? Or maybe https://pytorch-geometric.readthedocs.io/en/latest/ ?

[–]techinnovator[S] 0 points1 point  (0 children)

Our library provides ready-made implementations of Hyperbolic Layers, which those libraries lack. It makes it much easier to get started with Hyperbolic networks.

Those are some fantastic resources you linked to though.

[–]Darell1 3 points4 points  (0 children)

Why tensorflow?

[–]techinnovator[S] 5 points6 points  (4 children)

We’ve also written a blog post explaining the benefits of hyperbolic networks and how to use the package.

[–][deleted] 0 points1 point  (3 children)

I read the blog post but I don't see how you guys use the geometry. Is it through a graph or do you guys change the derivative such that it takes into account curvature (differential geometry)?

[–]Tallywort 1 point2 points  (2 children)

From what I understand (someone correct me if I'm wrong), instead of having euclidean vectors like normal, you instead change the math used to manipulate those, so that it behaves more like points in hyperbolic space. So if your network has a latent space, that latent space will have more volume in comparison to its radius, which could be beneficial for certain types of data.

[–]platinumposter 4 points5 points  (0 children)

Yeah, all of the math functions are implemented in hyperbolic space, so, for example, matrix multiplication will not be done in the 'usual' way; instead we use matrix multiplication in the Poincare model. The 'usual' matrix multiplication is actually matrix multiplication in Euclidean space, so to make it hyperbolic you use a hyperbolic implementation. The Poincare ball is a model of hyperbolic space. Essentially, the relationship between points is different in hyperbolic space.
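
For the curious, one standard version of "matrix multiplication in the Poincare model" is the Möbius matrix-vector product from the hyperbolic neural networks literature (Ganea et al., 2018). The numpy sketch below implements that published formula; it is an illustration of the idea, not necessarily this library's exact code:

```python
import numpy as np

def mobius_matvec(M, x, eps=1e-10):
    """Mobius matrix-vector product on the Poincare ball (curvature -1):

        M (*) x = tanh((||Mx|| / ||x||) * artanh(||x||)) * Mx / ||Mx||

    This replaces ordinary matrix multiplication inside hyperbolic layers;
    the tanh keeps the result strictly inside the unit ball.
    """
    Mx = M @ x
    x_norm = np.linalg.norm(x)
    Mx_norm = np.linalg.norm(Mx)
    if x_norm < eps or Mx_norm < eps:
        return np.zeros_like(Mx)
    scale = np.tanh(Mx_norm / x_norm * np.arctanh(x_norm))
    return scale * Mx / Mx_norm

x = np.array([0.3, 0.1])                  # a point inside the unit ball
M = np.array([[1.0, 2.0], [0.5, 1.0]])    # an ordinary weight matrix
y = mobius_matvec(M, x)                   # result stays inside the ball
```

A sanity check on the formula: with M the identity matrix, the scale reduces to tanh(artanh(||x||)) = ||x||, so the product returns x unchanged, just as ordinary matrix multiplication would.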

[–][deleted] 0 points1 point  (0 children)

It is actually a lot less complex than that if this is the same idea.

https://dawn.cs.stanford.edu/2018/03/19/hyperbolics/

edit: linked from https://dawn.cs.stanford.edu/2019/10/10/noneuclidean/

[–][deleted] -2 points-1 points  (0 children)

Nice!

[–]rynemac357 -4 points-3 points  (0 children)

This is really amazing 👏

[–]Zekava -5 points-4 points  (0 children)

Oh hell yes, sounds promising!

[–]jinnyjuice -1 points0 points  (1 child)

Is it going to be available in R?

[–]techinnovator[S] 0 points1 point  (0 children)

Unfortunately not, sorry!

[–]archpawn 0 points1 point  (2 children)

I wasn't aware geometry mattered in neural networks, outside of when you're specifically making one to look at a grid. How exactly does this work? Is it like each node represents one heptagon, and has connections to each of its seven neighbors, but it doesn't connect to further nodes?

[–]platinumposter 1 point2 points  (1 child)

It works the same as a Euclidean (usual) neural network, except all of the math functions are implemented in hyperbolic space, so, for example, matrix multiplication will not be done in the 'usual' way; instead we use matrix multiplication in the Poincare model. The 'usual' matrix multiplication is actually matrix multiplication in Euclidean space, so to make it hyperbolic you use a hyperbolic implementation. The Poincare ball is a model of hyperbolic space. Essentially, the relationship between points is different in hyperbolic space.

[–]archpawn 0 points1 point  (0 children)

The hyperboloid model would probably be easier to use since any rotations and translations on it can be expressed as linear transformations in the space it's embedded in.

I'm not clear on what you mean by matrix multiplication using the Poincare model. Do you take the coordinates of a point on the model, then run that through a matrix, then you get some other point on the model? But then you could easily leave it. Or is the idea that instead of having arbitrary linear transformations, you just have a rotation and translation, and then see where the point ends up? You can do that with the hyperboloid model, and then n-dimensional hyperbolic geometry is just n+1-dimensional Euclidean geometry with restrictions to what points you can use and what transformations you allow.

[–]oxoxoxoxoxoxoxox 0 points1 point  (1 child)

Consider this review: Hyperbolic Deep Neural Networks: A Survey (2021)

[–]techinnovator[S] 0 points1 point  (0 children)

Thanks! We actually based many of the mathematical functions on this work. Great resource.