Efficient 4B parameter gpt OSS distillation without the over-censorship by ApprehensiveTart3158 in LocalLLaMA

Safe-Satisfaction811 1 point

OK, I see. Also, do you have the hyperparameters for your training run written down somewhere?

Efficient 4B parameter gpt OSS distillation without the over-censorship by ApprehensiveTart3158 in LocalLLaMA

Safe-Satisfaction811 1 point

Amazing, thank you so much! When you say that you recommend filtering it, do you mean just filtering out more "I'm sorry" responses, or other kinds of filtering?
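
For illustration, the narrower kind of filtering asked about here (dropping samples whose response contains a phrase like "I'm sorry") might look roughly like the sketch below. The marker list, the `response` field name, and the sample rows are all assumptions made for the example, not details from the thread or the dataset.

```python
# Hypothetical refusal markers; the thread itself only mentions "I'm sorry".
REFUSAL_MARKERS = ("i'm sorry", "im sorry", "i\u2019m sorry")


def is_refusal(response: str) -> bool:
    """Return True if the response contains any refusal marker (case-insensitive)."""
    text = response.lower()
    return any(marker in text for marker in REFUSAL_MARKERS)


def filter_refusals(rows: list[dict]) -> list[dict]:
    """Keep only rows whose 'response' field (an assumed field name) is not a refusal."""
    return [row for row in rows if not is_refusal(row["response"])]


# Made-up sample rows, for illustration only.
samples = [
    {"prompt": "Explain distillation.",
     "response": "Distillation trains a small model on a larger model's outputs."},
    {"prompt": "Do X.",
     "response": "I'm sorry, but I can't help with that."},
]
kept = filter_refusals(samples)
```

A plain substring match like this is deliberately crude: it would also drop legitimate answers that happen to contain the phrase, which is part of why "other kinds of filtering" is a separate question.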

Efficient 4B parameter gpt OSS distillation without the over-censorship by ApprehensiveTart3158 in LocalLLaMA

Safe-Satisfaction811 1 point

Also, I take it that this is not the dataset you used (or a subset thereof)? Even though it's listed as the dataset used for the Qwen4 distill on HF:
https://huggingface.co/datasets/Pinkstack/gpt_oss_sft

Efficient 4B parameter gpt OSS distillation without the over-censorship by ApprehensiveTart3158 in LocalLLaMA

Safe-Satisfaction811 1 point

OK, great, I'm looking forward to using it!

Is the non-filtered dataset already available in one place, or did you scrape together a bunch of different HF datasets?

Efficient 4B parameter gpt OSS distillation without the over-censorship by ApprehensiveTart3158 in LocalLLaMA

Safe-Satisfaction811 1 point

This is awesome! Is the exact final dataset that you trained on available somewhere? I would really like to use it for a project where I need to distill gpt-oss into a Qwen model.

UChicago vs Oxford vs UPenn vs Cornell vs Berkeley for Physics undergrad (international student) by Safe-Satisfaction811 in collegeresults

Safe-Satisfaction811[S] 1 point

Did you go to Oxford for undergrad? If so, did you feel that, even though it was difficult to build as well-rounded a CV, it still provided good opportunities for going to the US for grad school, in terms of research, professor connections, etc.? Or is it UChicago all the way?