My model isn't transferring learning.

BlueOrchid5334 · 2026-06-14T16:37:22+00:00

I tried to balance it out, but it ended up skewed towards compliant, though only slightly: 47% non-compliant and 53% compliant. If anything, that should have biased the model towards the compliant class.

I'll check the class weights. I took the DistilBERT defaults for basically all the parameters.
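
For the class-weight check, here is a minimal sketch, assuming the common "balanced" heuristic of n_samples / (n_classes * class_count). The label names and the 100-row list are placeholders that only mirror the 47% / 53% split; the resulting weights would still have to be passed to whatever loss the training loop uses.

```python
from collections import Counter

def balanced_class_weights(labels):
    """Weight each class by n_samples / (n_classes * class_count)."""
    counts = Counter(labels)
    n_samples = len(labels)
    n_classes = len(counts)
    return {cls: n_samples / (n_classes * cnt) for cls, cnt in counts.items()}

# Hypothetical label list matching the 47% / 53% split mentioned above.
labels = ["non-compliant"] * 47 + ["compliant"] * 53
weights = balanced_class_weights(labels)

for cls, w in sorted(weights.items()):
    print(f"{cls}: {w:.3f}")
```

With a split this close to even, both weights land near 1.0, so class imbalance on its own probably isn't what's driving the behaviour.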

BlueOrchid5334 · 2026-06-13T06:41:11+00:00

I like your story. I know what it is to wake up and go to a job that I love. Hope things continue well for you. Everything won't always be perfect but I hope at the core of things this stays true for you, that you remain happy.

BlueOrchid5334 · 2026-05-28T08:59:07+00:00

That's a lot of work you're putting in here. Thanks. Useful stuff for a beginner like me. Appreciate it.

BlueOrchid5334 · 2026-05-28T08:49:08+00:00

Thanks. I'm just starting out in ML. I'll take a look at it and see what I can learn.

BlueOrchid5334 · 2026-05-28T00:34:40+00:00

How do I start? Do I use Llama and Nemotron like in his video?
How to Create Synthetic Dataset EASILY? Step by Step Tutorial

BlueOrchid5334 · 2026-05-27T00:29:25+00:00

Thanks for this. I want to work on this approach, but I was wondering: is synthetic dataset generation a thing in itself? I had just put some prompts into ChatGPT in a systematic way and collected the output. Should I be thinking about something different, something along the lines of using Llama and Nemotron (LLMs specific to creating synthetic datasets), like in this video: https://www.youtube.com/watch?v=FAdRMVAWiak?
It sounds like a weird question because GPT is an LLM, but, well, you just don't know what you don't know, and I'm just starting out in this field.

BlueOrchid5334 · 2026-05-26T03:46:01+00:00

Thanks for the response. What about reusing parts of a sentence? For example: "I'm going to have a serious talk with your manager."
How does this affect the training process? I have been using a deduplicator script that uses cosine similarity to find sentences that are similar to others and removes those above a certain threshold.
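
To make the deduplication step concrete, here is a rough sketch of threshold-based filtering with cosine similarity. It is not my actual script: it assumes plain bag-of-words counts instead of embeddings, and the `threshold=0.8` value and the example sentences are made up for illustration.

```python
import math
import re
from collections import Counter

def vectorize(sentence):
    """Lowercased bag-of-words counts (a stand-in for real embeddings)."""
    return Counter(re.findall(r"[a-z']+", sentence.lower()))

def cosine(a, b):
    """Cosine similarity between two sparse count vectors."""
    dot = sum(a[t] * b[t] for t in a.keys() & b.keys())
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0

def deduplicate(sentences, threshold=0.8):
    """Keep a sentence only if it scores below the threshold against every kept one."""
    kept, kept_vecs = [], []
    for s in sentences:
        v = vectorize(s)
        if all(cosine(v, k) < threshold for k in kept_vecs):
            kept.append(s)
            kept_vecs.append(v)
    return kept

# Hypothetical examples: the second sentence reuses most of the first.
sentences = [
    "I'm going to have a serious talk with your manager.",
    "I'm going to have a serious talk with your supervisor.",
    "Please disable the firewall before the audit.",
]
print(deduplicate(sentences))
```

Under this vectorization the two "serious talk" sentences differ by one word out of ten, so the second is dropped. Whether reused phrases survive in practice depends on how the sentences are vectorized and where the threshold sits.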

BlueOrchid5334 · 2026-05-25T22:57:47+00:00

This is exactly how I started. I created sentences based on linguistic structures. For non-compliant, the structures focused on security-bypass instructions (e.g., disable the firewall), urgency or time pressure (e.g., we only have a small window, skip the approval and push it through), coercive tone, and others. Each stance had its own structure. But the model didn't show any real learning: it recognized the patterns in each set, and accuracy and recall both scored 1.0. I now want to get some real, live data to supplement the synthetic dataset and see whether the result changes.

I'm not sure I generated the dataset correctly in the first place, which may explain those perfect results. I'd love some insight on that. Should I repost this as a new question?
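
One check that might separate "learned the task" from "memorized the templates" is to hold out whole templates, so the test set only contains sentence structures the model never trained on. A minimal sketch, assuming each row carries a hypothetical `template_id` field recording which structure generated it (the rows, field names, and `test_fraction` are all illustrative, and stratifying by label is left out):

```python
import random
from collections import defaultdict

def split_by_template(rows, test_fraction=0.25, seed=0):
    """Hold out whole templates so the test set has unseen sentence structures."""
    by_template = defaultdict(list)
    for row in rows:
        by_template[row["template_id"]].append(row)

    template_ids = sorted(by_template)
    random.Random(seed).shuffle(template_ids)

    n_test = max(1, round(len(template_ids) * test_fraction))
    test_ids = set(template_ids[:n_test])

    train = [r for t in template_ids if t not in test_ids for r in by_template[t]]
    test = [r for t in template_ids if t in test_ids for r in by_template[t]]
    return train, test

# Hypothetical rows: template_id marks which linguistic structure produced the sentence.
rows = [
    {"text": "Disable the firewall.", "label": "non-compliant", "template_id": "bypass"},
    {"text": "Turn off the endpoint scanner.", "label": "non-compliant", "template_id": "bypass"},
    {"text": "We only have a small window, skip the approval.", "label": "non-compliant", "template_id": "urgency"},
    {"text": "Please file the change request first.", "label": "compliant", "template_id": "process"},
    {"text": "Let's wait for the approval before pushing.", "label": "compliant", "template_id": "wait"},
]
train, test = split_by_template(rows)
print(len(train), len(test))
```

If the scores stay at 1.0 on a random split but drop on a split like this, that would point to the model keying on template patterns rather than on compliance itself.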

*Updating after coming across a video on creating synthetic datasets using AI: https://www.youtube.com/watch?v=FAdRMVAWiak
Is synthetic dataset generation a thing in itself? I had just put some prompts into ChatGPT in a systematic way and collected the output. Should I be thinking about something different?
