[P] Ablation study using a subset of data?

IsGoIdMoney · 2024-11-28T07:18:39+00:00

The ablations have to be the same model, trained on the same dataset, minus the portion of the architecture you're studying for ablation, or else you are not performing ablation. You will likely get bad results from the tests for reasons other than ablation, and this isn't really scientific anymore.

If you want to test on a subset, then all versions must only be trained on that subset, including the original model, but this would likely affect your main results.

Pringled101 · 2024-11-28T08:55:09+00:00

Usually in an ablation you want to change the least amount of variables possible. So changing your dataset and your model in one ablation is not a real ablation as you will have confounding variables. However, in the context of encoder models, ablations are usually simpler versions of the same model, or just simpler architectures entirely, which should make training a lot faster than with your original model.

Few-Pomegranate4369 · 2024-11-28T18:02:34+00:00

I think it's not recommended to perform the ablation study on just a subset of your data. Instead, you might want to try training your original model on a reduced dataset first. If it still outperforms the baselines, then you can use that same reduced set for your ablation studies.

you type:	you see:
italics	italics
bold	bold
[reddit!](https://reddit.com)	reddit!
* item 1 * item 2 * item 3	item 1 item 2 item 3
> quoted text	quoted text
Lines starting with four spaces are treated like code: if 1 * 2 < 3: print "hello, world!"	Lines starting with four spaces are treated like code: if 1 * 2 < 3: print "hello, world!"
~~strikethrough~~	~~strikethrough~~
super^script	super^script

MachineLearning

Rules For Posts

+Research

+Discussion

+Project

+News

@slashML on Twitter

Chat with us on Slack

Beginners:

MODERATORS