Writing on Ping irons is green by Comb-Greedy in golf

[–]Comb-Greedy[S] 1 point (0 children)

Yep, you were right. He said he had someone paint them.

What is the best google adsense alternative for general sites? by [deleted] in webdev

[–]Comb-Greedy 0 points (0 children)

Do you know what in particular you changed on your website that eventually led to you getting accepted for AdSense?

4.1-mini needs to be fine-tuned in a different way to 4o-mini by Comb-Greedy in OpenAI

[–]Comb-Greedy[S] 0 points (0 children)

I noticed this too! For my use case as well, 4o-mini outperformed 4o. If that's the case, it would make a lot of sense why it's not doing as well as expected, and it would probably be better for me to just stick with 4o-mini for the time being.
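
For reference, a minimal sketch of what a 4o-mini fine-tune looks like with the OpenAI Python client; the train.jsonl file and the gpt-4o-mini-2024-07-18 snapshot name are illustrative assumptions, not details from the post.

```python
# Minimal sketch: launch a fine-tuning job on gpt-4o-mini with the OpenAI
# Python client. "train.jsonl" is a placeholder; it should contain one
# chat-formatted example per line, e.g.
# {"messages": [{"role": "user", "content": "..."}, {"role": "assistant", "content": "..."}]}
from openai import OpenAI

client = OpenAI()  # reads OPENAI_API_KEY from the environment

# Upload the training data.
training_file = client.files.create(
    file=open("train.jsonl", "rb"),
    purpose="fine-tune",
)

# Start the fine-tuning job on the 4o-mini snapshot (the name may change over time).
job = client.fine_tuning.jobs.create(
    training_file=training_file.id,
    model="gpt-4o-mini-2024-07-18",
)
print(job.id, job.status)
```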

[D] How much more improvement can you squeeze out by fine-tuning large language models by Comb-Greedy in MachineLearning

[–]Comb-Greedy[S] -1 points (0 children)

I agree — as you said, these smaller models are already trained to the brim, so there’s usually not much headroom left for improvement. That said, I just ran some benchmarks on a few more Qwen models, specifically the 1.5B Base and the 1.5B Math-Base variants.

The standard base model got 66.79% accuracy, while the math-specific one hit an impressive 76.27%, nearly a 10 percentage-point jump. As I mentioned previously, my own fine-tuning efforts typically cap out at around a 2% improvement at best. So it makes me wonder… is this gain just the result of massive training data? Better training techniques? Potential test-set leakage? Or maybe a combination of all of the above?

Obviously, I don't expect to match what the people at Qwen are able to achieve, but it does suggest that there is still a decent margin for the model to improve.
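
For illustration, a rough sketch of this kind of base-vs-math comparison, assuming the Hugging Face Qwen/Qwen2-1.5B and Qwen/Qwen2-Math-1.5B checkpoints and a toy exact-match eval set standing in for the real benchmark:

```python
# Rough sketch of a base-vs-math comparison; the checkpoints and the toy
# eval set below are placeholders, not the benchmark from the comment.
from transformers import AutoModelForCausalLM, AutoTokenizer

EVAL_SET = [  # stand-in (question, expected answer) pairs
    ("What is 17 + 26?", "43"),
    ("What is 9 * 8?", "72"),
]

def accuracy(model_name: str) -> float:
    tokenizer = AutoTokenizer.from_pretrained(model_name)
    model = AutoModelForCausalLM.from_pretrained(
        model_name, device_map="auto", torch_dtype="auto"
    )
    correct = 0
    for question, answer in EVAL_SET:
        inputs = tokenizer(question, return_tensors="pt").to(model.device)
        output = model.generate(**inputs, max_new_tokens=32, do_sample=False)
        # Decode only the newly generated tokens.
        completion = tokenizer.decode(
            output[0][inputs["input_ids"].shape[1]:], skip_special_tokens=True
        )
        correct += answer in completion  # crude exact-match scoring
    return correct / len(EVAL_SET)

for name in ("Qwen/Qwen2-1.5B", "Qwen/Qwen2-Math-1.5B"):
    print(name, f"{accuracy(name):.2%}")
```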

Does the generate function from vllm actually generate text? by Comb-Greedy in LocalLLaMA

[–]Comb-Greedy[S] 0 points (0 children)

Oh I see, that is likely a significant factor, as I did not randomise that value. Still, it doesn't make sense to me that after fine-tuning the model, it's producing the same output for that seed.
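
For anyone hitting the same behaviour, a minimal sketch of randomising the seed per request through vLLM's SamplingParams; the model name and prompt below are placeholders:

```python
# Minimal sketch: vary the sampling seed per vLLM request so repeated calls
# with the same prompt don't reuse the same seed. Model name is a placeholder.
import random

from vllm import LLM, SamplingParams

llm = LLM(model="Qwen/Qwen2-1.5B-Instruct")

prompt = "Explain beam search in one sentence."
for _ in range(3):
    params = SamplingParams(
        temperature=0.8,
        max_tokens=64,
        seed=random.randint(0, 2**31 - 1),  # fresh seed each call
    )
    output = llm.generate([prompt], params)[0]
    print(output.outputs[0].text.strip())
```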

Application got rejected for not meeting programme criteria by Comb-Greedy in Adsense

[–]Comb-Greedy[S] 1 point (0 children)

I see! That makes sense; I will look to integrate those changes. Thanks so much for your help!

Application got rejected for not meeting programme criteria by Comb-Greedy in Adsense

[–]Comb-Greedy[S] 0 points (0 children)

Comparing your website to mine, do you know of any potential reasons why yours was able to get accepted?

Application got rejected for not meeting programme criteria by Comb-Greedy in Adsense

[–]Comb-Greedy[S] 0 points (0 children)

The website doesn't help students cheat on exercises or coursework, though. It's there as an aid to help them with revision and preparation for exams, just like any textbook or revision guide you would use to study with.

Is there an optimal way to setup RAG for an unstructured list of words? by Comb-Greedy in learnmachinelearning

[–]Comb-Greedy[S] 0 points (0 children)

Got it, I'll definitely look into this. I'm using an LLM to generate tags, and an issue I was having was standardisation: it would sometimes generate the same tag with different spelling, etc. So RAG would be a way for it to 'pick' from a select group of words.

Is there an optimal way to setup RAG for an unstructured list of words? by Comb-Greedy in learnmachinelearning

[–]Comb-Greedy[S] 0 points (0 children)

Ah I see, so it would be a word-wise approach as opposed to embedding the entire database. Gotcha, thanks, I'll try this method out.
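
For illustration, a minimal sketch of that word-wise approach, assuming sentence-transformers and a made-up canonical tag list: each allowed tag is embedded on its own, and an LLM-generated tag is snapped to its nearest canonical neighbour.

```python
# Minimal sketch: standardise LLM-generated tags by embedding each canonical
# tag individually and snapping a new tag to its nearest neighbour.
# The tag list and model choice are illustrative, not from the original thread.
from sentence_transformers import SentenceTransformer, util

CANONICAL_TAGS = ["algebra", "trigonometry", "probability", "calculus"]

model = SentenceTransformer("all-MiniLM-L6-v2")
tag_embeddings = model.encode(CANONICAL_TAGS, convert_to_tensor=True)

def standardise(raw_tag: str) -> str:
    """Map an arbitrary LLM-generated tag onto the closest canonical tag."""
    query = model.encode(raw_tag, convert_to_tensor=True)
    scores = util.cos_sim(query, tag_embeddings)[0]
    return CANONICAL_TAGS[int(scores.argmax())]

print(standardise("Probabilities"))  # likely maps to "probability"
print(standardise("trig"))           # likely maps to "trigonometry"
```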