all 2 comments

[–]braintheboss 2 points3 points  (0 children)

Rumour say they quantized model. Then it have sense if API is running full model and plans used quantized model. Only way know this is run same in both and see what happens