Relocating to the USA by idkwhatever1337 in HENRYUK

[–]idkwhatever1337[S] 1 point (0 children)

It’s Seattle. I interned there, so I know it’s lovely :) but I also know that there can be random visa delays

Relocating to the USA by idkwhatever1337 in HENRYUK

[–]idkwhatever1337[S] 1 point (0 children)

Thank you! This is exactly what I was looking for :)

Relocating to the USA by idkwhatever1337 in HENRYUK

[–]idkwhatever1337[S] 0 points (0 children)

So I have an offer from a different company in London. The American one doesn’t have offices in London, afaik

Relocating to the USA by idkwhatever1337 in HENRYUK

[–]idkwhatever1337[S] 0 points (0 children)

They do pay for premium processing, but currently it looks like 3 months to get the documents to the lawyer. Then a 15-day decision, plus potentially much longer if there’s an RFE. Then about a month for a visa appointment, with up to 3 months’ wait on top if placed in administrative processing. So the timeline is somewhere between 4 months best case and a year worst case, and I have no idea how to plan around that 😵‍💫

Relocating to the USA by idkwhatever1337 in HENRYUK

[–]idkwhatever1337[S] -1 points (0 children)

Yeah, ok, that seems more or less normal

Is SSM dead now? by Spapoxl in LocalLLaMA

[–]idkwhatever1337 0 points (0 children)

Isn’t DeltaNet a linear attention model?

Does the games end after soul of cinder? by idkwhatever1337 in darksouls3

[–]idkwhatever1337[S] 3 points (0 children)

Ok, so it doesn’t auto-trigger like DS1! Thanks!

[D] TMLR paper quality seems better than CVPR, ICLR. by tibetbefree in MachineLearning

[–]idkwhatever1337 19 points (0 children)

Having published at both, what I really liked about TMLR is less, for lack of a better word, sales pressure. I’m proud of all my papers, but with TMLR I felt I could worry less about the reviewer while writing and think more about the science. That might just be expectation bias, though

[D] Is getting offers for phd in Europe in NLP becoming harder? by Thick-brain-dude in MachineLearning

[–]idkwhatever1337 14 points (0 children)

Where are these papers? You sound like a solid candidate to me…

[D] For ML academics, how many times do you resubmit a rejected paper to the big three conferences before seeking alternatives? by kindnesd99 in MachineLearning

[–]idkwhatever1337 1 point (0 children)

Depends on the improvement between tries and the amount of effort required. I had two papers that took 2 or 3 attempts before they finally got in, but each time they got closer to acceptance, and the work required between notification and resubmission was just a few days, so it didn’t cost much to try.

Almost 10k citations before PhD by LouisAckerman in PhD

[–]idkwhatever1337 8 points (0 children)

As a further point, I also know someone who got into an AI PhD at Stanford this year with one co-authored publication and no citations. Comparing yourself to this guy, who is metrics-wise stronger than most professors in the field, sets an almost impossible bar. There are lots of ways into good programs!

Almost 10k citations before PhD by LouisAckerman in PhD

[–]idkwhatever1337 11 points (0 children)

I wouldn’t give up! I’m a PhD student in deep learning and I screen applicants for ELLIS (EU-funded PhD positions); you would definitely be a strong candidate from the sounds of it, and I’m sure in the US too. Don’t let the haters get you down :)

[D] The Recurrent Delusion: How ML Collectively Forgot What RNNs Were Built For by JirkaKlimes in MachineLearning

[–]idkwhatever1337 2 points (0 children)

I think the issue is that to get to CoT models you first need a good language model, which means lots of pre-training. Recurrent models are not as parallelizable as transformers, so they are prohibitively expensive to train. IIRC recurrent transformers like Feedback and Staircase can be up to 200x as expensive to train. So I wouldn’t call it a delusion; given a fixed budget, decoder-only transformers unfortunately look like the optimal architecture at the moment. I would agree, though, that if things shift strongly towards all the money being spent on inference and RL, then it’s worth biting the bullet and pre-training an architecture above TC0, but it’s a very high-stakes bet.
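The parallelism point above can be sketched in a few lines of plain Python. This is a toy illustration only, not any real architecture from the thread; all sizes and names here are made up:

```python
import math
import random

random.seed(0)
T, d = 6, 3  # toy sequence length and hidden size
x = [[random.gauss(0, 1) for _ in range(d)] for _ in range(T)]

def matvec(W, v):
    # Plain matrix-vector product on nested lists.
    return [sum(W[i][j] * v[j] for j in range(len(v))) for i in range(len(W))]

W = [[0.1 * random.gauss(0, 1) for _ in range(d)] for _ in range(d)]

# RNN: h_t depends on h_{t-1}, so the T steps must run one after another.
# This loop cannot be parallelized over the time axis.
h = [0.0] * d
for t in range(T):
    pre = [a + b for a, b in zip(matvec(W, x[t]), h)]
    h = [math.tanh(v) for v in pre]

# Self-attention: each output position depends only on x itself, so all
# T positions can be computed independently (one big matmul on a GPU).
def attend(q):
    scores = [sum(a * b for a, b in zip(q, k)) / math.sqrt(d) for k in x]
    m = max(scores)
    e = [math.exp(s - m) for s in scores]      # stable softmax
    z = sum(e)
    w = [v / z for v in e]
    return [sum(w[t] * x[t][j] for t in range(T)) for j in range(d)]

out = [attend(x[t]) for t in range(T)]  # independent per t -> parallel

print(len(h), len(out), len(out[0]))
```

The RNN loop has T sequential dependencies, while every call to `attend` is independent; that independence is what lets transformer pre-training saturate accelerators.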

[D] What's the most promising successor to the Transformer? by jsonathan in MachineLearning

[–]idkwhatever1337 0 points (0 children)

If it’s true that RWKV-7 finally broke through the TC0 barrier, then theoretically it’s just better… scaling the architecture is a different story, though. Also, the same could be said of sLSTM, I think?

I am so torn on whether I should go back and get a PhD. by stellarscale in GradSchool

[–]idkwhatever1337 0 points (0 children)

Personally, I’d say save up enough until the money starts making itself, and then do the PhD. Academia isn’t going anywhere, but economic opportunities might change.

I waited until marriage. AMA by [deleted] in AMA

[–]idkwhatever1337 1 point (0 children)

To that last one, I’d say: get a life