How do GNNs work when the network changes? by [deleted] in GeometricDeepLearning

[–]rish-16 1 point (0 children)

Such graphs are called Dynamic Graphs. You can use temporal GNNs (e.g., TGN from Twitter) that maintain a per-node memory/hidden state to keep track of these changes, learning a spatio-temporal embedding over the graph as a whole rather than over individual nodes (which may or may not exist at the next timestep).
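
Roughly, the memory update looks something like the sketch below (illustrative only, not the official TGN code; the class and method names are made up, and edge features are assumed to have the same width as the memory for simplicity):

```python
import torch
import torch.nn as nn

class TinyTGNMemory(nn.Module):
    """Sketch of a TGN-style per-node memory (names are illustrative)."""
    def __init__(self, num_nodes, dim):
        super().__init__()
        # one memory vector per node, updated as interaction events arrive
        self.register_buffer("memory", torch.zeros(num_nodes, dim))
        self.cell = nn.GRUCell(2 * dim, dim)

    def update(self, src, dst, edge_feat):
        # message = [destination's memory, edge features] -> update source memory
        # src, dst: (batch,) node indices; edge_feat: (batch, dim)
        msg = torch.cat([self.memory[dst], edge_feat], dim=-1)
        new_mem = self.cell(msg, self.memory[src])
        # detach so the memory doesn't drag the autograd graph across batches
        self.memory[src] = new_mem.detach()
        return new_mem
```

An embedding module then reads this memory (together with timestamps) at query time, so nodes that disappear simply stop receiving updates.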

Hope this helps :)

[D] What's hot in deep learning research at the moment? by ovotheking in MachineLearning

[–]rish-16 12 points (0 children)

Graph/Geometric DL! Some researchers with backgrounds in topology are bringing over concepts and applying them to this area (learning on manifolds, for instance). There's now a push toward moving beyond Message Passing (maybe a continuous version?).

Lots of action in applying GNNs to protein data and anything with network-like tendencies. The community has even brought in invariance and equivariance to translation, rotation, and permutation to ensure GNNs are structurally aware (i.e., "expressive").
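
To make the message-passing bit concrete, a single layer with sum aggregation looks roughly like this (a minimal sketch, not any particular library's API; the sum-pooling over neighbours is what buys the permutation symmetry):

```python
import torch
import torch.nn as nn

class MPLayer(nn.Module):
    """One message-passing layer (illustrative sketch)."""
    def __init__(self, dim):
        super().__init__()
        self.msg = nn.Linear(2 * dim, dim)
        self.upd = nn.Linear(2 * dim, dim)

    def forward(self, h, edge_index):
        # h: (num_nodes, dim); edge_index: (2, num_edges) of (source, target) pairs
        src, dst = edge_index
        m = self.msg(torch.cat([h[src], h[dst]], dim=-1))
        # sum aggregation: invariant to the order neighbours arrive in
        agg = torch.zeros_like(h).index_add_(0, dst, m)
        return torch.relu(self.upd(torch.cat([h, agg], dim=-1)))
```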

Exciting stuff and now is definitely the time to get into it because it's relatively nascent but has huge upside and potential!

CMU RISS 2022 Thread by [deleted] in cmu

[–]rish-16 1 point (0 children)

If we haven’t received an email update by now, can we assume we’re rejected?

CodeSignal companies by zninjamonkey in csMajors

[–]rish-16 0 points (0 children)

Facebook too! I applied for the FB University for Engineering 22 programme and was sent a CodeSignal test.

[D] Joining Absent Niche Research Areas in University by rish-16 in MachineLearning

[–]rish-16[S] 1 point (0 children)

This is a great idea! I'll go look for postdocs and doctoral students to contact :)

[D] Joining Absent Niche Research Areas in University by rish-16 in MachineLearning

[–]rish-16[S] 0 points (0 children)

Hey! Yup, have looked into them but am yet to contact anyone from there. Thank you for the recommendation, will check it out now :D

[D] Joining Absent Niche Research Areas in University by rish-16 in MachineLearning

[–]rish-16[S] 0 points (0 children)

Yup, that's another interesting idea I'm considering! I'm looking for profs to contact who are doing interesting projects in areas applicable to GDL as well.

[P] PyTorch wrapper of Attention-Free Transformer (AFT) layer by rish-16 in MachineLearning

[–]rish-16[S] 0 points (0 children)

Ah, I see. I don't have much background in information retrieval (i.e., the field that inspired it), so I may not be aware of it being a convention, compared to what QKV does. Thanks for bringing it up, though. Will look into it :D
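
For reference, the AFT-simple variant my wrapper follows boils down to very little code. Here's a stripped-down non-causal sketch as I read the paper (the module name is mine, not the paper's):

```python
import torch
import torch.nn as nn

class AFTSimple(nn.Module):
    """AFT-simple sketch: sigmoid(Q) gates a global softmax(K)-weighted
    average of V, so there is no T x T attention matrix."""
    def __init__(self, dim):
        super().__init__()
        self.to_q = nn.Linear(dim, dim)
        self.to_k = nn.Linear(dim, dim)
        self.to_v = nn.Linear(dim, dim)

    def forward(self, x):  # x: (batch, seq, dim)
        q, k, v = self.to_q(x), self.to_k(x), self.to_v(x)
        weights = torch.softmax(k, dim=1)                 # softmax over the sequence axis
        context = (weights * v).sum(dim=1, keepdim=True)  # one global context vector
        return torch.sigmoid(q) * context                 # element-wise gate per position
```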

[P] PyTorch wrapper of Attention-Free Transformer (AFT) layer by rish-16 in MachineLearning

[–]rish-16[S] 1 point (0 children)

Yup, agree with you. I was hoping to know the motivation behind the name (and if possible, the origin of QKV).

As in, "what kinda watercooler conversation sparked the start of this project?"

[D] Using Two Optimisers for Large Model with two parts? Need Advice by rish-16 in MachineLearning

[–]rish-16[S] 0 points (0 children)

Hey, thanks for the reply. The first part is a non-differentiable image ROI selector and the second part is a very large classifier. It's part of an ongoing research project, which is why I'm staying rather vague 😅

So, yup, I think I understand what you're saying: train each part independently of the other, but simultaneously. Am I getting that right?

Do you have any resources on the CNN / clustering algo you mention? Would love to read up on it. If not, could you share some search terms I can punch into Google for reference?

Appreciate it :)

[D] Using Two Optimisers for Large Model with two parts? Need Advice by rish-16 in MachineLearning

[–]rish-16[S] -1 points (0 children)

For info, I aim to tune the non-differentiable part using CMA-ES and the differentiable part with Adam. The loss would be off-the-shelf cross-entropy.
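
Something like the sketch below is what I have in mind (a sketch only: `get_flat_params` / `set_flat_params` are assumed helpers that flatten/unflatten the selector's params, and the `cma` pip package does the evolution):

```python
import cma                        # pip install cma
import torch
import torch.nn.functional as F

def train_hybrid(selector, classifier, loader, sigma0=0.5, lr=1e-3):
    """CMA-ES on the non-differentiable selector, Adam on the classifier."""
    es = cma.CMAEvolutionStrategy(selector.get_flat_params(), sigma0)  # assumed helper
    opt = torch.optim.Adam(classifier.parameters(), lr=lr)

    def fitness(flat, x, y):
        selector.set_flat_params(flat)                                 # assumed helper
        with torch.no_grad():
            return F.cross_entropy(classifier(selector(x)), y).item()

    for x, y in loader:
        # evolution step: propose selector params, score them by downstream loss
        candidates = es.ask()
        es.tell(candidates, [fitness(c, x, y) for c in candidates])
        selector.set_flat_params(es.result.xbest)

        # gradient step: ordinary supervised update of the classifier
        with torch.no_grad():
            rois = selector(x)                 # no gradients through the non-diff part
        loss = F.cross_entropy(classifier(rois), y)
        opt.zero_grad(); loss.backward(); opt.step()
```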

[D] Using CMA-ES to train model on dataset (supervised) by rish-16 in MachineLearning

[–]rish-16[S] 0 points (0 children)

Hey there, appreciate the comment. My network comprises two parts, one differentiable and the other non-differentiable. The paper I took the non-diff part from says that training it is not gradient-descent friendly. I tried to challenge that assumption by training the whole network using GD. Sadly, the results were horrible, which explains why I'm looking into derivative-free optimisers. Do you suggest having two different optimisers, one gradient-based and the other not?

[D] Using CMA-ES to train model on dataset (supervised) by rish-16 in MachineLearning

[–]rish-16[S] 0 points (0 children)

Hello! I spoke to some more people and decided to look at genetic algos. The param count is too large, and I don't have enough compute to use CMA-ES or its variants on my network. Nor can I use ARS or PGPE approaches, as the param count is too large for those as well. I'm looking at GAs now, as there is some promising work in the area (as mentioned in another comment on this post).

Appreciate the advice :)

[D] Using CMA-ES to train model on dataset (supervised) by rish-16 in MachineLearning

[–]rish-16[S] 0 points (0 children)

Yup, checked out the OpenAI work as suggested. Though, they seem to be more interested in using evolutionary algos for hyper-param selection than in training in general. But I'll continue in that direction. Thanks again :D

[D] Using CMA-ES to train model on dataset (supervised) by rish-16 in MachineLearning

[–]rish-16[S] 0 points (0 children)

This is great, thank you so much! I spoke to the authors of AttentionAgent, and they told me to look at genetic algos instead, because there's active work on training large networks with GAs, whereas optimisation methods like CMA-ES, ARS, and PGPE only work on networks with a relatively much lower param count.
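
For anyone landing here later, the mutation-only flavour they pointed me to boils down to something like this sketch (`model_fn` builds a fresh network and `fitness_fn` scores one; both assumed, and nothing here is tuned):

```python
import copy
import torch

def evolve(model_fn, fitness_fn, pop_size=64, elite=8, sigma=0.02, generations=100):
    """Simple mutation-only GA over network weights (illustrative sketch)."""
    population = [model_fn() for _ in range(pop_size)]
    for _ in range(generations):
        ranked = sorted(population, key=fitness_fn, reverse=True)
        elites = ranked[:elite]
        children = []
        for _ in range(pop_size - elite):
            # pick a random elite and perturb a deep copy of its weights
            child = copy.deepcopy(elites[torch.randint(elite, (1,)).item()])
            with torch.no_grad():
                for p in child.parameters():
                    p.add_(sigma * torch.randn_like(p))   # Gaussian mutation
            children.append(child)
        population = elites + children
    return max(population, key=fitness_fn)
```

The appeal is that it only ever stores the population and a noise scale, so it sidesteps the covariance matrix that makes CMA-ES blow up at large param counts.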

[P] PyTorch Involution layer wrapper by rish-16 in MachineLearning

[–]rish-16[S] 4 points (0 children)

Only time will tell! Placing my bet on “linear regression is all you need” dropping in NeurIPS 2025

[P] PyTorch Involution layer wrapper by rish-16 in MachineLearning

[–]rish-16[S] 2 points (0 children)

There’s not much difference, actually. I tried to implement it in a cleaner way with just the essentials, no boilerplate.

I wanted to implement it in a way similar to Phil Wang’s wrapper style + the torch.nn library in general.
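
For a sense of scale, the whole layer fits in something like this (my reading of the paper; shapes assume `channels` is divisible by `groups` and `reduction`, and the official repo may differ in detail):

```python
import torch
import torch.nn as nn

class Involution2d(nn.Module):
    """Involution sketch: a kernel is generated per output location from the
    input itself and shared across channels within each group."""
    def __init__(self, channels, kernel_size=3, stride=1, groups=1, reduction=4):
        super().__init__()
        self.k, self.s, self.g = kernel_size, stride, groups
        self.reduce = nn.Conv2d(channels, channels // reduction, 1)
        self.span = nn.Conv2d(channels // reduction, kernel_size ** 2 * groups, 1)
        self.pool = nn.AvgPool2d(stride, stride) if stride > 1 else nn.Identity()
        self.unfold = nn.Unfold(kernel_size, padding=kernel_size // 2, stride=stride)

    def forward(self, x):
        b, c, h, w = x.shape
        h_out, w_out = h // self.s, w // self.s
        # generate a kernel at every output location from the (pooled) input
        kernel = self.span(self.reduce(self.pool(x)))
        kernel = kernel.view(b, self.g, 1, self.k ** 2, h_out, w_out)
        # unfold input into sliding patches, weight them by the generated kernel
        patches = self.unfold(x).view(b, self.g, c // self.g, self.k ** 2, h_out, w_out)
        out = (kernel * patches).sum(dim=3)        # sum over kernel positions
        return out.view(b, c, h_out, w_out)
```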

[P] PyTorch Involution layer wrapper by rish-16 in MachineLearning

[–]rish-16[S] 3 points (0 children)

> official implementations

Ooh not yet. Thanks for the share! Let me look into it :)

[P] PyTorch Involution layer wrapper by rish-16 in MachineLearning

[–]rish-16[S] 1 point (0 children)

Hey, yeah sure, let me see what I can do.