YOLO Detection Models Benchmark Report

Generated: January 17, 2026
Benchmark Version: 2.0.0
Framework: YOLOs-CPP (C++ ONNX Runtime Inference)

System Specifications

Component	Details
OS	Ubuntu 24.04 LTS (Linux 6.14.0-37-generic)
Architecture	x86_64
CPU	Intel Core i7-1185G7 @ 3.00GHz (11th Gen Tiger Lake)
CPU Cores	4 physical cores, 8 threads
CPU Frequency	400 MHz - 4800 MHz (Turbo)
RAM	38 GB DDR4
GPU	Integrated Intel Iris Xe (CPU inference only)
Storage	NVMe SSD
OpenCV	4.6.0
ONNX Runtime	1.20.1 (CPU)
Compiler	GCC with C++17

Benchmark Configuration

Parameter	Value
Input Image	dog_bike_car.jpg (768×576 pixels)
Model Input Size	320×320
Precision	FP32
Warmup Iterations	20
Benchmark Iterations	100
Confidence Threshold	0.25
NMS Threshold	0.45
Device	CPU
Dataset	Pascal VOC (20 classes)

Performance Results Summary

FPS Comparison (Higher is Better)

Model	FPS	Rank
YOLOv11n	97.18	🥇 1st
YOLOv8n	85.55	🥈 2nd
YOLOv12n	80.99	🥉 3rd
YOLO26n	78.31	4th
YOLOv5nu	77.22	5th
YOLOv6n	76.85	6th
YOLOv10n	69.27	7th
YOLOv9t	46.28	8th

Detailed Benchmark Results

Latency Statistics (milliseconds)

Model	Avg	StdDev	Min	Max	P50	P90	P95	P99
YOLOv11n	10.29	0.96	8.47	12.65	10.25	11.59	12.11	12.57
YOLOv8n	11.69	1.25	9.86	16.08	11.56	13.10	14.18	15.73
YOLOv12n	12.35	0.99	10.36	18.76	12.14	13.33	13.95	15.32
YOLO26n	12.77	0.94	11.70	18.49	12.56	13.96	14.34	15.21
YOLOv5nu	12.95	1.30	11.32	19.56	12.70	13.98	15.28	19.39
YOLOv6n	13.01	1.26	11.61	19.57	12.49	14.36	15.00	17.36
YOLOv10n	14.44	8.13	9.12	55.35	11.45	23.08	33.35	48.04
YOLOv9t	21.61	11.06	15.48	87.97	17.44	32.16	41.20	67.76

Load & Warmup Times (milliseconds)

Model	Load Time	Warmup Time	Total Startup
YOLOv8n	55.34	228.29	283.63
YOLOv11n	59.09	230.38	289.47
YOLOv6n	61.70	261.34	323.04
YOLOv5nu	67.94	278.62	346.56
YOLO26n	70.16	279.41	349.57
YOLOv10n	74.65	200.34	274.99
YOLOv9t	98.71	402.03	500.74
YOLOv12n	103.60	243.54	347.14

Memory Usage

Model	Peak Memory (MB)	Memory Delta (MB)	CPU Usage (%)
YOLOv5nu	110.2	5.2	84.7
YOLOv10n	111.4	7.3	94.0
YOLOv8n	116.8	9.4	86.6
YOLOv11n	119.4	13.8	85.2
YOLO26n	120.1	14.8	85.6
YOLOv6n	121.2	4.1	85.6
YOLOv12n	121.2	13.4	85.6
YOLOv9t	123.4	14.8	90.4

Model File Sizes

Model	ONNX Size	Architecture
YOLOv9t	7.8 MB	Gelan-based
YOLOv10n	8.8 MB	End-to-end NMS-free
YOLO26n	9.3 MB	End-to-end NMS-free
YOLOv5nu	9.7 MB	CSPDarknet
YOLOv11n	10 MB	C3k2 blocks
YOLOv12n	10 MB	Area Attention
YOLOv8n	12 MB	C2f blocks
YOLOv6n	17 MB	EfficientRep

Analysis & Insights

🏆 Performance Leaders

YOLOv11n achieves the highest FPS (97.18) with excellent latency consistency (σ=0.96ms)
YOLOv8n follows closely at 85.55 FPS with good stability
YOLOv12n and YOLO26n perform similarly (~80 FPS)

⚡ Latency Stability

Model	Stability Rating	Notes
YOLO26n	⭐⭐⭐⭐⭐	Lowest std dev (0.94ms), most predictable
YOLOv11n	⭐⭐⭐⭐⭐	Very stable (0.96ms std dev)
YOLOv12n	⭐⭐⭐⭐⭐	Excellent consistency (0.99ms)
YOLOv8n	⭐⭐⭐⭐	Good stability (1.25ms)
YOLOv5nu	⭐⭐⭐⭐	Acceptable variance (1.30ms)
YOLOv6n	⭐⭐⭐⭐	Good for production (1.26ms)
YOLOv10n	⭐⭐	High variance (8.13ms), outliers up to 55ms
YOLOv9t	⭐	Very high variance (11.06ms), not production-ready

💾 Memory Efficiency

YOLOv5nu and YOLOv10n are most memory-efficient (~110 MB)
All models fit comfortably in typical embedded systems with 256MB+ RAM
Memory delta during inference is minimal (<15 MB)

🎯 End-to-End NMS-Free Models

YOLO26n and YOLOv10n feature built-in NMS: - ✅ No separate NMS postprocessing step - ✅ Simpler deployment pipeline - ✅ Consistent latency (no NMS variance) - ⚠️ YOLOv10n shows higher variance likely due to model complexity

Recommendations

For Real-Time Applications (>60 FPS required)

Best Choice: YOLOv11n (97 FPS, excellent stability)
Runner-up: YOLOv8n (86 FPS, proven reliability)

For Edge Deployment (Memory Constrained)

Best Choice: YOLOv5nu (110 MB, 77 FPS)
Alternative: YOLOv10n (111 MB, end-to-end architecture)

For Predictable Latency (Production Systems)

Best Choice: YOLO26n (lowest variance, 78 FPS)
Alternative: YOLOv11n (stable with higher throughput)

For Simplest Deployment (No NMS handling)

Best Choice: YOLO26n (end-to-end, stable, modern)
Alternative: YOLOv10n (end-to-end, but less stable)

Raw Data

Full benchmark CSV available at: benchmarks/results/benchmark_results.csv

Notes

All models were fine-tuned on Pascal VOC dataset (20 classes)
Benchmarks run on CPU-only configuration
Results may vary with GPU acceleration (typically 3-5x faster)
Input image resolution affects performance proportionally
Models exported with ONNX opset 12 for maximum compatibility

Report generated by YOLOs-CPP Unified Benchmark Suite v2.0.0