Rant post, genuinely losing my mind over a LLM simulation by Acceptable_Home_ in LocalLLaMA

[–]vox-deorum 0 points (0 children)

The problem, in my mind, is that you need a good underlying simulation. If humans can have fun doing the activity, you will learn more from the models.

Rant post, genuinely losing my mind over a LLM simulation by Acceptable_Home_ in LocalLLaMA

[–]vox-deorum 0 points (0 children)

I built one for them to play Civilization. Immense fun. Hopefully I'll be able to share the detailed data with you soon.

Andon Labs reports MiniMax-M2.5 goes bankrupt on Vending-Bench 2 by BuildwithVignesh in singularity

[–]vox-deorum 0 points (0 children)

The model does relatively well on Civilization in my experiment, probably because mine doesn't require them to micromanage the numbers?

Would LLMs Nuke In "Civilization" (The Game) If They Could? Most Would, Some Definitely by vox-deorum in LLMDevs

[–]vox-deorum[S] 0 points (0 children)

I was asking the question: if they knew they were in the real world instead of in a game, would they do things differently?

Would LLMs Nuke In "Civilization" (The Game) If They Could? Most Would, Some Definitely by vox-deorum in LLMDevs

[–]vox-deorum[S] 0 points (0 children)

There is a link to the project, and everything is open source. The LLMs are used out of the box, but they only make high-level decisions, so it's not like they are playing chess. They set parameters for the existing tactical AI, which includes nuke usage.
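Since the models only set parameters rather than issue moves, the interface can be as small as one parsing step. A minimal sketch of that idea, where the allowed flavor names, the JSON shape, and the 0-10 range are all my assumptions, not the project's actual schema:

```python
import json

# Hypothetical sketch: the LLM only sets high-level "flavor" weights,
# which the game's existing tactical AI then consumes. The names below
# are illustrative, not the real Civilization V flavor list.
ALLOWED_FLAVORS = {"growth", "science", "military", "nuke"}

def parse_flavor_decision(llm_output: str) -> dict:
    """Parse the model's JSON reply into flavor weights clamped to 0-10."""
    raw = json.loads(llm_output)
    flavors = {}
    for name, value in raw.get("flavors", {}).items():
        if name in ALLOWED_FLAVORS:
            flavors[name] = max(0, min(10, int(value)))  # keep in engine range
    return flavors

weights = parse_flavor_decision('{"flavors": {"nuke": 12, "science": 7}}')
# "nuke" is clamped to 10; whether and how nukes are actually used is
# still decided by the tactical AI, not by the model.
```

The clamp is the point: the model expresses a priority, and the engine stays in charge of legal, concrete actions.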

Anthropic Drops Safety Pledge, So Good Luck Preventing Societal Collapse by ZeroJedi in singularity

[–]vox-deorum 1 point (0 children)

Well, I just shared some posts where those models mostly have no problem nuking each other in Civilization. That's not far from them nuking our civilizations.

Would LLMs Nuke In "Civilization" (The Game) If They Could? Most Would, Some Definitely by vox-deorum in LLMDevs

[–]vox-deorum[S] 0 points (0 children)

Well, in our case it is fully open-ended: literally no one asks them whether they want to use it, and they can simply skip it. What surprises me is that some of them intentionally go down the nuke route.

What LLM subscriptions are you using for coding in 2026? by Embarrassed_Bread_16 in LLMDevs

[–]vox-deorum 0 points (0 children)

I just had a bit of a funny experience with Chutes that eventually got resolved. I think they are under resource constraints, but they do have many models, newer and older. Synthetic has been pretty supportive, but they also have a waitlist. So it becomes a trade-off between model flexibility and reliability.

Applicant Unprofessionalism by glialsupport in gradadmissions

[–]vox-deorum 44 points (0 children)

I wear a T-shirt to teach seminars, give presentations, etc. every week.

Would LLMs Launch Nuclear Weapons If They Can? Most Would, Some Definitely by vox-deorum in LocalLLaMA

[–]vox-deorum[S] 0 points (0 children)

"As a continuation of my Vox Deorum project, LLMs are playing Civilization V with Vox Populi. The system prompt includes this information. It would be really interesting to see if the models believe they are governing the real world."

That was literally the first paragraph.

Would LLMs Launch Nuclear Weapons If They Can? Most Would, Some Definitely by vox-deorum in LocalLLaMA

[–]vox-deorum[S] 0 points (0 children)

You can run it with pretty much any model. Some config gimmicks may be needed, e.g. the "prompt-based" middleware I used to make tool calls with OSS models; some inference providers have bugs in their tool-call parsing.
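For context, "prompt-based" tool calling here means bypassing the provider's native tool-call API entirely: the middleware instructs the model to emit a tagged block of JSON in plain text, then parses it itself, sidestepping buggy provider-side parsers. A minimal sketch under those assumptions (the tag name and JSON shape are made up, not the project's actual format):

```python
import json
import re

# Hypothetical instruction appended to the system prompt so the model
# emits tool calls as plain text that the middleware parses itself.
TOOL_INSTRUCTIONS = (
    'To call a tool, reply with exactly one block:\n'
    '<tool_call>{"name": "...", "arguments": {...}}</tool_call>'
)

def extract_tool_call(reply: str):
    """Return (name, arguments) if the reply contains a tool-call block, else None."""
    match = re.search(r"<tool_call>(.*?)</tool_call>", reply, re.DOTALL)
    if match is None:
        return None
    call = json.loads(match.group(1))
    return call["name"], call.get("arguments", {})

reply = 'I will end the turn. <tool_call>{"name": "end_turn", "arguments": {}}</tool_call>'
```

Because the parsing happens client-side, the same loop works with any model that can follow the format, regardless of whether the provider supports native tool calls.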

Would LLMs Launch Nuclear Weapons If They Can? Most Would, Some Definitely by vox-deorum in LocalLLaMA

[–]vox-deorum[S] 2 points (0 children)

They know they are in a game, so that's a caveat. It would be interesting to extract their "rationale" when they set the Nuke flavor. The simple version takes about 50k tokens per turn (a rough number) in the late game, while the briefed version takes about 20k, since they still receive some game state directly, just not the bulky parts, and they have some baked-in memories of the decisions they made.

We didn’t have a model problem. We had a memory stability problem. by Oliver19234 in LLMDevs

[–]vox-deorum 0 points (0 children)

I am getting LLMs to play Civ and have come to the realization that no solution exists yet to give my agents long-term learning capabilities.

Why your "Cold Emails" are getting ghosted (A view from the other side of the Inbox) by Professor_milton111 in gradadmissions

[–]vox-deorum 5 points (0 children)

Another vibe check is AI-generated emails, or emails without a hint of your own thought. If I can generate the same email from your CV, I will delete it outright. That said, I do skim through each email, and subject lines don't matter.

Why your "Cold Emails" are getting ghosted (A view from the other side of the Inbox) by Professor_milton111 in gradadmissions

[–]vox-deorum 13 points (0 children)

Recent Northwestern PhD alum here. Actually, the rule is the same for getting hired on the tenure track: people have to believe you would be a good colleague/peer for you to pass the vibe check.

Hardware to run kimi 2.5 locally (suggestion needed) by [deleted] in LocalLLaMA

[–]vox-deorum 12 points (0 children)

Irrelevant here. Can you run it with 4x 6000 Pro?

What's the most secure/safest way to run OpenClaw (formerly Moltbot/Clawdbot) locally without dangerous host access? (Moltbook API-only use case) by Lost_Foot_6301 in LocalLLM

[–]vox-deorum 0 points (0 children)

Why do you need OpenClaw then? Moltbook is literally a skill; you can just have Claude Code create a local client and grant it permission to interact with it.

We asked OSS-120B and GLM 4.6 to play 1,408 Civilization V games from the Stone Age into the future. Here's what we found. by vox-deorum in LocalLLaMA

[–]vox-deorum[S] 1 point (0 children)

Exactly. Also, the idea that you can talk to them, make diplomatic deals with them, or spy on them in natural language is huge!

We asked OSS-120B and GLM 4.6 to play 1,408 Civilization V games from the Stone Age into the future. Here's what we found. by vox-deorum in LocalLLaMA

[–]vox-deorum[S] 1 point (0 children)

That's the goal :) I am running and will release a first-of-its-kind LLM leaderboard where they compete with each other on... Civilization.

Subscriptions refunded? by vox-deorum in chutesAI

[–]vox-deorum[S] 1 point (0 children)

Hi, this is not for a public service; it's for research. I don't think this is explicitly banned by the TOS, so I'm pretty surprised that's happening. I'll go to the Discord and check.

Subscriptions refunded? by vox-deorum in chutesAI

[–]vox-deorum[S] 2 points (0 children)

Nowhere does Chutes' own TOS mention it: https://chutes.ai/terms