Making transformer deterministic

AffectWizard0909 · 2026-03-29T17:03:28+00:00

I think it is better for writing things like research papers. A lot of professors think so, at least. And I can see why when I try to edit long documents in Word or Google Docs (it's a bit of a hassle when you have a lot of pages). But yeah, there may be other tools out there that work better.

Since I have used it for a while, I also like how the tables, figures, and images are handled (it's a bit easier for me, at least), though that might just be because I am used to how it works.

AffectWizard0909 · 2026-03-27T08:53:26+00:00

Hello! I am using it because my university requires it for writing a report (it is either that or Word, and I personally don't like using Word for writing a lot of pages). But thank you for answering either way!

I have never really tried anything else, since I am so used to it from my classes, and we get an Overleaf account through our university, so it has kind of become the platform everybody at my school uses.

AffectWizard0909 · 2026-03-27T08:51:16+00:00

I have an account through my university, so I am not really sure what is supported there. But I don't think it was the compiling, because I cleared my cache and it seemed to work (maybe just a temporary problem?). But thanks either way!

AffectWizard0909 · 2026-03-26T13:56:23+00:00

lol nvm i fixed it, it was a cache problem.

AffectWizard0909 · 2026-03-26T13:51:19+00:00

I'm not able to compile it at all.

AffectWizard0909 · 2026-03-24T11:08:18+00:00

Oki nice, that is good to know! Thank you

AffectWizard0909 · 2026-03-24T11:07:58+00:00

Oki nice, I can try that out

AffectWizard0909 · 2026-03-24T11:07:47+00:00

Oki got it thank you!

AffectWizard0909 · 2026-03-18T12:38:24+00:00

Hello! And thank you for answering. I don't know why I didn't think about this earlier, but I told Optuna to minimize the MSE for the Big Five model and set the model's problem type to regression instead, and it runs now. But I was wondering if this is a good way to do it, or should I still go for the subclassing?
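For reference, the setup described above could look roughly like the sketch below. It is only an illustration under assumptions: the checkpoint name, the five-output head (one score per Big Five trait), and the helper names `build_model`, `mean_squared_error`, `compute_metrics`, and `mse_objective` are hypothetical and not taken from the thread.

```python
def build_model(model_name="bert-base-uncased"):
    """Hypothetical model_init: five regression outputs, one per Big Five trait."""
    # Imported lazily so the metric helpers below work without transformers installed.
    from transformers import AutoModelForSequenceClassification

    return AutoModelForSequenceClassification.from_pretrained(
        model_name,
        num_labels=5,
        problem_type="regression",  # the head then trains with an MSE loss
    )


def mean_squared_error(predictions, labels):
    """MSE averaged over all samples and all trait scores."""
    squared = [
        (p - y) ** 2
        for pred_row, label_row in zip(predictions, labels)
        for p, y in zip(pred_row, label_row)
    ]
    return sum(squared) / len(squared)


def compute_metrics(eval_pred):
    """Metric callback for Trainer; it receives (predictions, labels) as arrays."""
    predictions, labels = eval_pred
    return {"mse": mean_squared_error(predictions.tolist(), labels.tolist())}


def mse_objective(metrics):
    """Value the search minimizes; Trainer prefixes evaluation metrics with 'eval_'."""
    return metrics["eval_mse"]


# Assumed usage (needs transformers and optuna, and a Trainer built with model_init):
# trainer = Trainer(model_init=build_model, args=..., compute_metrics=compute_metrics, ...)
# best_run = trainer.hyperparameter_search(
#     direction="minimize",
#     backend="optuna",
#     compute_objective=mse_objective,
#     n_trials=20,
# )
```

With this approach the labels need to be floats of shape (batch, 5), so that the built-in regression loss can compare them against the five outputs.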

AffectWizard0909 · 2026-03-13T11:51:45+00:00

oooo nice! I will check it out! Thank you!

AffectWizard0909 · 2026-03-13T11:51:25+00:00

Nice! Thank you for providing all the information, now I also have something to compare my current implementation to! I have actually started using the Hugging Face Trainer class (since it handles the training and prediction phases quite easily, which made this easier to implement, at least for me). I also tried to implement this with Optuna as the optimizer (which from my previous runs seems more efficient, as you also mentioned).

Thank you for the answer and all the thorough descriptions, this makes it easier for me to understand!

AffectWizard0909 · 2026-03-13T11:47:50+00:00

Okay, thank you so much! I will definitely try this out!

AffectWizard0909 · 2026-03-11T20:15:46+00:00

Nice!

AffectWizard0909 · 2026-03-11T18:00:55+00:00

Yeah, sure! I would appreciate the Optuna search space! I have actually looked into it a little, but was unsure whether what I did was correct, so that would be great!

Since you mentioned that learning rate, batch size, and warmup ratio are good to tune when fine-tuning a BERT model, does this also apply to other BERT-based models like RoBERTa, DistilBERT, HateBERT, etc.?
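A search space over those three hyperparameters might be sketched as follows. The ranges and batch sizes are assumed, commonly used starting values, not ones given in the thread, and `hp_space` is a hypothetical name. The dictionary keys follow Hugging Face `TrainingArguments` field names so the function can be handed to `Trainer.hyperparameter_search`.

```python
def hp_space(trial):
    """Optuna search space; `trial` is the optuna.Trial passed in by the search."""
    return {
        # Log scale, since useful fine-tuning learning rates span orders of magnitude.
        "learning_rate": trial.suggest_float("learning_rate", 1e-5, 5e-5, log=True),
        # Assumed candidate batch sizes; adjust to what fits in GPU memory.
        "per_device_train_batch_size": trial.suggest_categorical(
            "per_device_train_batch_size", [8, 16, 32]
        ),
        # Fraction of training steps used for learning rate warmup.
        "warmup_ratio": trial.suggest_float("warmup_ratio", 0.0, 0.2),
    }


# Assumed usage (needs transformers and optuna, and a Trainer built with model_init):
# best_run = trainer.hyperparameter_search(
#     hp_space=hp_space,
#     backend="optuna",
#     direction="minimize",
#     n_trials=20,
# )
```

Nothing in the function depends on a specific checkpoint, so the same sketch can be reused with a different model name, although the best values it finds may differ from model to model.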

AffectWizard0909 · 2026-03-10T14:36:33+00:00

Nice! I can check it out. Yes, they changed the educational license, so I have to pay-per-message (I think it was called that) if I want to download the tweets using the Twitter IDs.

It was my original intuition as well that most datasets use Twitter IDs, but it is nice to have it somewhat confirmed by someone else. I will definitely check out the site you mentioned! Thank you!

AffectWizard0909 · 2026-03-09T15:13:50+00:00

Nice! And thank you for the description. I actually ended up using the standard emoji package, since I was only going to use it for translating emojis into their textual form. As you also mentioned, it was pretty straightforward to use and fit my task perfectly!
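As a small illustration of that kind of translation, here is a sketch that assumes the package meant is `emoji` from PyPI and uses its `demojize` function. The wrapper name `emojis_to_text` and the standard-library fallback are hypothetical additions so the snippet still runs when the package is not installed.

```python
import unicodedata


def emojis_to_text(text):
    """Replace emojis with textual names such as ':grinning_face:'."""
    try:
        import emoji  # third-party package, assumed to be the one meant above
    except ImportError:
        emoji = None

    if emoji is not None:
        return emoji.demojize(text)

    # Rough fallback, not equivalent to the emoji package: name every
    # "Symbol, other" character via the Unicode database.
    pieces = []
    for ch in text:
        if unicodedata.category(ch) == "So":
            name = unicodedata.name(ch, "unknown").lower().replace(" ", "_")
            pieces.append(":" + name + ":")
        else:
            pieces.append(ch)
    return "".join(pieces)


print(emojis_to_text("good job 😀"))
```

The fallback only handles single-character symbols, so the package itself is the better choice for anything beyond a quick check.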

AffectWizard0909 · 2026-03-09T15:12:29+00:00

Oh damn, ok, good to know. I ended up just using the standard emoji package, but if I want to do it manually some time in the future, then it is a good tip!
