

All checkpoints are trained to 300 epochs with SGD optimizer with lr0=0.01 and weight_decay=5e-5 at image size 640 and all default settings.We ran all speed tests on Google Colab Pro notebooks for easy reproducibility. We exported all models to ONNX FP32 for CPU speed tests and to TensorRT FP16 for GPU speed tests. We trained YOLOv5 segmentations models on COCO for 300 epochs at image size 640 using A100 GPUs. Reproduce by python val.py -data coco.yaml -img 1536 -iou 0.7 -augment TTA Test Time Augmentation includes reflection and scale augmentations.Reproduce by python val.py -data coco.yaml -img 640 -task speed -batch 1


