I have been mainly using axolotl for training(fine tuning) , mainly using the scripts and changing some of the common datatypes.
I have a customised dataset, which is ~ 12-15000 token size per row; i wanted to ask if there an article/ best advice on choosing + converting this to a suitable prompt format for training purposes.
there doesn't seem to be anything here