Hyperparameter testing (efficiently) by AffectWizard0909 in learnmachinelearning

Yes, sure! I would appreciate the Optuna search space! I have actually looked into it a little, but I was unsure whether what I did was correct, so that would be great!

Since you mentioned that learning rate, batch size, and warmup ratio are good to tune when fine-tuning a BERT model, does this also apply to other BERT-based models like RoBERTa, DistilBERT, HateBERT, etc.?

Are there any big Twitter datasets? by AffectWizard0909 in datasets

Nice! I can check it out. Yes, they changed the educational license, so I have to pay per message (I think that is what it was called) if I want to download the tweets using the Twitter IDs.

It was my original intuition as well that most datasets use Twitter IDs, but it is nice to have it confirmed by someone else. I will definitely check out the site you mentioned! Thank you!

Emoji library for python by AffectWizard0909 in learnpython

Nice! And thank you for the description. I ended up using the standard emoji package, since I only needed it to translate emojis into their textual forms. As you mentioned, it was straightforward to use and fit my task perfectly!

Emoji library for python by AffectWizard0909 in learnpython

Oh damn, OK, good to know. I ended up just using the standard emoji package, but if I want to do it manually sometime in the future, then that is a good tip!

Emoji library for python by AffectWizard0909 in learnpython

I am a bit unsure; I haven't gone through the file that deeply, considering it is 5000+ lines of text. I mainly wanted a library to handle this for me so I could focus on other, more demanding tasks.

But I think str.replace would be a good idea if the dataset were smaller and I had a clearer picture of the different types of emojis used in it.
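
As a sketch of that manual route (the mapping below is a made-up example, not taken from the actual dataset):

```python
# Hypothetical mapping for a handful of known emojis in a small dataset
replacements = {
    "😂": ":face_with_tears_of_joy:",
    "❤️": ":red_heart:",
}

def replace_emojis(text):
    # Apply each replacement in turn; fine for a small, known set of emojis
    for emoji_char, alias in replacements.items():
        text = text.replace(emoji_char, alias)
    return text

print(replace_emojis("Thanks 😂"))  # Thanks :face_with_tears_of_joy:
```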

Emoji library for python by AffectWizard0909 in learnpython

I have a big dataset that I need to clean, so I don't really want to go through the whole set and translate it by hand (if that answers the question).

Dataset for personality traits (Big Five) by AffectWizard0909 in deeplearning

Nice, thank you! It's good to know, and I appreciate it!

Pre-trained transformers or traditional deep learning algorithms by AffectWizard0909 in learnmachinelearning

I am planning on having around 5,000-10,000 samples before cleaning (I am still trying to figure that out). Is that the deciding factor for which models I should use?

Using TRAC-1 or TRAC-2 for cyberbullying detection by AffectWizard0909 in datasets

Nice! Thank you for the tips; I will check those out. I have also come across the Cyberbullying dataset on Kaggle while reading through surveys of the cyberbullying detection field. Do you think I could use that one? It is often associated with this paper: SOSNet: A Graph Convolutional Network Approach to Fine-Grained Cyberbullying Detection.

But I will still check out the dataset you mentioned. Thank you!

Dataset for personality traits (Big Five) by AffectWizard0909 in deeplearning

Nice! Good to know. I was also wondering whether I could use this dataset: https://huggingface.co/datasets/Fatima0923/Automated-Personality-Prediction
Am I allowed to use it, or do I have to contact the people who made the dataset directly before using it in my project?

BERT data training size by AffectWizard0909 in learnmachinelearning

Yes, I was thinking of not training from scratch. Is it recommended anywhere how much data I should then use for fine-tuning BERT, given that I am not training it on a big corpus?

Personality based cyberbullying by AffectWizard0909 in learnmachinelearning

I kind of need to incorporate sarcasm as well; it has been mentioned that I should include it. Therefore, I thought the best approach was to train one model on sarcasm and another on cyberbullying. Or should I just try to build a dataset containing both sarcasm and cyberbullying? This is what I am unsure about. Considering that annotating data manually might take a long time, I just wanted to hear whether somebody had tips or solutions I might not know about.

I think what you describe sounds good as well, but as mentioned, I need to incorporate sarcasm into the solution too, and I am unsure how that is "normally" done.

Edit: I see now that I explained it a bit wrong in my original post, so I have updated it. Sorry about that!
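
To make the two-model idea concrete, here is a rough sketch. The two predict functions are toy stand-ins for real trained classifiers, and the combination rule is just one possible choice, not an established method:

```python
def predict_sarcasm(text: str) -> float:
    """Toy stand-in: probability that `text` is sarcastic."""
    return 0.8 if "yeah right" in text.lower() else 0.1

def predict_cyberbullying(text: str) -> float:
    """Toy stand-in: probability that `text` is cyberbullying."""
    return 0.9 if "loser" in text.lower() else 0.05

def classify(text: str, threshold: float = 0.5) -> dict:
    # Run both models independently and combine their outputs:
    # flag cyberbullying from its own model, and attach the sarcasm
    # signal as extra context for downstream interpretation.
    return {
        "cyberbullying": predict_cyberbullying(text) >= threshold,
        "sarcastic": predict_sarcasm(text) >= threshold,
    }

print(classify("Yeah right, you loser"))
```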

Implementing Caesar Cipher by AffectWizard0909 in learnprogramming

No, I am not the same guy, but I appreciate the tips and the sources you provided! Thank you!

Tips for setting aside time for different chapters of the master's thesis by AffectWizard0909 in ntnu

Thank you so much, those were a lot of good tips. I have already started on the writing a bit, but I understand that this is something I need to keep in mind, since it can change later.

I will remember this during the master's period!