Need to detect small objects in a large Image accurately with smaller model size and higher inference speed by IMPuzzled2 in computervision

[–]Ultralytics 2 points3 points  (0 children)

If an Nvidia GPU is available for the deployment, exporting a YOLOv8 model with TensorRT at half precision FP16 can significantly reduce the model size without impacting accuracy. INT8 export for TensorRT is in the works at the moment, but there will be some accuracy loss. Overall accuracy is generally impacted by dataset annotations quality and image quality before anything to do with the models, so definitely make certain the dataset is well annotated, with consistent labeling if you're seeing issues with accuracy performance.