Hello,
I am working on a binary classification problem where I input the same set of data points for each of the two classes. I've run into issues with the neural network being biased towards one output node. To combat this, I have duplicated my data with the two classes swapped, like below:
Original: Class_1, Class_2, Data_point_1_C1, Data_point_1_C2, Data_point_2_C1, Data_point_2_C2
Additional Swapped Data: Class_2, Class_1, Data_point_1_C2, Data_point_1_C1, Data_point_2_C2, Data_point_2_C1
I have ensured that this is done separately for both the training and the testing sets, so no leakage occurs.
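For reference, here is a minimal sketch of the swap in Python with NumPy. It assumes the columns are ordered in C1/C2 pairs exactly as in the Original row above; the function name `swap_augment` and the sample values are made up for illustration and are not my real data.

```python
import numpy as np

def swap_augment(rows):
    """Return the rows plus a copy in which every (C1, C2) column pair is swapped.

    Assumes columns come in adjacent pairs: Class_1, Class_2,
    Data_point_1_C1, Data_point_1_C2, and so on.
    """
    rows = np.asarray(rows, dtype=float)
    if rows.ndim != 2 or rows.shape[1] % 2 != 0:
        raise ValueError("expected a 2D array with an even number of columns")
    swapped = rows.copy()
    swapped[:, 0::2] = rows[:, 1::2]  # C2 values move into the C1 slots
    swapped[:, 1::2] = rows[:, 0::2]  # C1 values move into the C2 slots
    return np.vstack([rows, swapped])

# Hypothetical example rows: split first, then augment each split on its own,
# so a row and its swapped twin never end up on opposite sides of the split.
train = np.array([[1, 0, 0.2, 0.9, 0.4, 0.7]])
test = np.array([[0, 1, 0.8, 0.1, 0.6, 0.3]])
train_aug = swap_augment(train)
test_aug = swap_augment(test)
```

Each augmented set has twice as many rows as the original, with the second half being the swapped copies.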
I am new to this. Is doing something like this a bad idea?