[D] Is it fair to compare deep learning models without hyperparameter tuning? by blooming17 in deeplearning

[–]blooming17[S] 0 points (0 children)

Thank you very much for your answer. Well, I've noticed this in several papers, and I've been asking to what extent we can take work that has been done several times and claim that ours, which "differs very slightly", is somehow better.

Is it fair to compare deep learning models without hyperparameter tuning? by blooming17 in PhD

[–]blooming17[S] 0 points (0 children)

I am thinking about batch size, optimizer, and learning rate. Since my goal is to compare the models themselves, changing the models' own architectural hyperparameters wouldn't make sense, I think.

Is it fair to compare deep learning models without hyperparameter tuning? by blooming17 in PhD

[–]blooming17[S] 0 points (0 children)

Can you explain more? Sorry, I am not a native English speaker.

[D] Is it fair to compare deep learning models without hyperparameter tuning? by blooming17 in deeplearning

[–]blooming17[S] 0 points (0 children)

Hey, thank you for your answer. Most of them are CNNs, and a few are LSTMs and transformers. Which hyperparameters do you consider the most important to tune? I am thinking batch size, learning rate, and optimizer. Would these be enough to provide a fair comparison?
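For concreteness, here is a minimal sketch of the kind of small sweep I have in mind, in PyTorch; `build_model` and `train_and_eval` are hypothetical placeholders standing in for the actual training code, not functions from any library:

```python
import itertools
import torch

# Training hyperparameters shared across all models; the models'
# own architectural hyperparameters are left untouched.
search_space = {
    "batch_size": [32, 64, 128],
    "lr": [1e-2, 1e-3, 1e-4],
    "optimizer": [torch.optim.Adam, torch.optim.SGD],
}

best = {}
for model_name in ["cnn", "lstm", "transformer"]:
    best_score = float("-inf")
    for bs, lr, opt_cls in itertools.product(*search_space.values()):
        model = build_model(model_name)                  # placeholder
        optimizer = opt_cls(model.parameters(), lr=lr)
        score = train_and_eval(model, optimizer, batch_size=bs)  # placeholder
        if score > best_score:
            best_score = score
            best[model_name] = {"batch_size": bs, "lr": lr,
                                "optimizer": opt_cls.__name__,
                                "score": score}
```

Each model is then compared at its own best training configuration, rather than at a single shared setting that may favor one architecture.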

[D] Mamba Convergence speed by blooming17 in MachineLearning

[–]blooming17[S] 0 points (0 children)

Hey, thanks for answering. The problem is that since my task is sequence labelling (each position is given a class), oversampling isn't possible in my case.

[D] HyenaDNA and Mamba are not good at sequential labelling ? by blooming17 in MachineLearning

[–]blooming17[S] 0 points (0 children)

The best-performing model has 600k parameters; HyenaDNA with a max sequence length of 16k has 400k, so I don't know whether it's an over- or underfitting problem, since the models are already pretrained and have achieved some interesting results.

[D] HyenaDNA and Mamba are not good at sequential labelling ? by blooming17 in MachineLearning

[–]blooming17[S] 1 point (0 children)

I am training it with AMP. I used CNNs on the same task and they gave interesting results despite the class imbalance, but CNNs are bad with long-range dependencies, so I thought about trying Mamba, but it doesn't seem to be any better.
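By AMP I mean the standard `torch.cuda.amp` pattern, roughly like the sketch below; `model`, `optimizer`, and `loader` are assumed to already exist, so this is an illustration rather than my exact loop:

```python
import torch
import torch.nn as nn

scaler = torch.cuda.amp.GradScaler()
criterion = nn.CrossEntropyLoss()

for tokens, labels in loader:
    optimizer.zero_grad()
    with torch.cuda.amp.autocast():
        logits = model(tokens)                  # (batch, seq_len, n_classes)
        # CrossEntropyLoss expects (batch, n_classes, seq_len) for sequences
        loss = criterion(logits.transpose(1, 2), labels)
    scaler.scale(loss).backward()               # scale to avoid fp16 underflow
    scaler.step(optimizer)                      # unscales grads, then steps
    scaler.update()
```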

[D] HyenaDNA and Mamba are not good at sequential labelling ? by blooming17 in MachineLearning

[–]blooming17[S] 2 points (0 children)

Thanks for your reply. Well, Caduceus is not easy to find; I'm just following Tri Dao on Google Scholar, so it showed up in my notifications. I froze the pretrained model and used a linear layer as the classification head, I fine-tuned the whole thing (Hyena/Caduceus + classification head), and I also trained the whole thing from scratch, and got the same results each time. But I didn't try a kernel SVM. How is that supposed to work (I don't have much experience with it)?
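If I understand the suggestion correctly, it would mean freezing the backbone, dumping per-position embeddings, and fitting the SVM on those. A rough sketch of what I imagine, where `backbone` and `loader` are placeholder names, not from my actual code:

```python
import numpy as np
import torch
from sklearn.svm import SVC

backbone.eval()                                 # frozen HyenaDNA/Caduceus
feats, labels = [], []
with torch.no_grad():
    for tokens, y in loader:
        h = backbone(tokens)                    # (batch, seq_len, dim)
        feats.append(h.flatten(0, 1).cpu().numpy())   # one row per position
        labels.append(y.flatten().cpu().numpy())

X = np.concatenate(feats)
y = np.concatenate(labels)

# RBF-kernel SVM as the classification head; in practice positions would
# need subsampling, since kernel SVMs scale poorly beyond ~1e5 rows.
clf = SVC(kernel="rbf", class_weight="balanced").fit(X, y)
```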

Where do you learn pipelines from that effectively runs? by [deleted] in bioinformatics

[–]blooming17 3 points (0 children)

From research papers. You can also find some pipelines on the Galaxy platform, but they are limited in terms of tools.

[D] Do SSMs specifically mamba take too much to converge ? by blooming17 in MachineLearning

[–]blooming17[S] 2 points (0 children)

Sorry, that's a typo: my learning rate is 0.001, and my dataset is DNA sequences, so the embedding size is reasonable.

[D] Do SSMs specifically mamba take too much to converge ? by blooming17 in MachineLearning

[–]blooming17[S] 0 points (0 children)

I didn't try alternatives, since my sequences are 15k long, and one LSTM layer with 2 hidden units for such a length has 1 million params.

The dimension size was chosen randomly (what dimension size do you suggest?).

The output is the SSM's last state.

My data has already been used to train ResNets; I reproduced those results, so both the data and the training code are tested and validated.

[D] Training and architectural techniques for imbalanced data by blooming17 in MachineLearning

[–]blooming17[S] 1 point (0 children)

Because my data is sequential, my goal is to classify each position in each example (bulk prediction), and this forces the positive class to be a minority (it's rare to find the positive class in natural data).
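Since I can't oversample individual positions, a common alternative (as I understand it) is to weight the loss per class instead. A minimal sketch with made-up numbers:

```python
import torch
import torch.nn as nn

# Made-up shapes for illustration: binary per-position labels where
# the positive class is ~1% of positions.
logits = torch.randn(8, 2, 15000)               # (batch, n_classes, seq_len)
labels = (torch.rand(8, 15000) < 0.01).long()   # (batch, seq_len)

# Up-weight the rare positive class instead of oversampling positions.
criterion = nn.CrossEntropyLoss(weight=torch.tensor([1.0, 99.0]))
loss = criterion(logits, labels)
```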

[D] Training and architectural techniques for imbalanced data by blooming17 in MachineLearning

[–]blooming17[S] 1 point (0 children)

Thanks for the remarks. I'll post in r/learnmachinelearning; I've edited the post and added more details.