fpgaConvNet

MLPerf Tiny Benchmark

The results presented in this table are from our submission to MLPerf Tiny's benchmark, as part of the round v1.1 results. The MLPerf Tiny v1.1 submission demonstrates the versatility of the fpgaConvNet toolflow by targeting a range of low-cost FPGAs, whilst achieving ultra low latency across these devices. The obtained high performance is due to the exploration of the reconfigurability feature of FPGAs, allowing the tool to create highly tailored accelerator designs for each specific task and device. fpgaConvNet showcases the potential of FPGA devices for TinyML applications, as performance similar to that of ASICs is achieved whilst having the programmability of MCUs.

Device	Task	Latency	LUT	DSP	BRAM	Freq.
ZC706	Image Classification	0.15 ms	108K	564	281	187 MHz	link
	Visual Wake Word	0.72 ms	133K	564	366	200 MHz	link
ZedBoard	Image Classification	0.41 ms	47K	211	93	143 MHz	link
	Visual Wake Word	9.49 ms	34K	189	123	111 MHz	link
	Keyword Spotting	0.32 ms	37K	188	97	143 MHz	link
ZyBo	Image Classification	3.15 ms	16K	78	36	125 MHz	link
	Keyword Spotting	2.15 ms	15K	60	16	125 MHz	link
Cora-Z7	Keyword Spotting	4.21 ms	13K	55	28	143 MHz	link

Instructions on how to use the bitstream can be found in the MLPerf-Tiny repo.

Latest Research Results

The results presented in this table are from our recently published research papers.

Year	Task	Network	Accuracy	Latency	Throughput	Device	LUT	DSP	BRAM18k	URAM	Freq.	Cite
2024	CamVid	UNet	71.75%	-	16.96fps (2206GOP/s)	VCU1525	993k	6019	3715	864	200MHz	link
2024	CamVid	UNet	71.75%	-	21.21fps (2758GOP/s)	U200	1040k	6062	3654	864	250MHz	link
2024	CamVid	UNet	71.75%	-	1.28fps (166GOP/s)	ZCU102	213k	1461	1368	-	200MHz	link
2024	COCO	YOLOv8n	35.10%	-	184.27fps (808GOP/s)	VCU118	543k	5061	1813	431	250MHz	link
2024	UCF101	X3D-M	96.29%	-	13.44fps (86GOP/s)	ZCU102	235k	932	857	-	200MHz	link
2024	BraTS2020	UNet3D	85.34%	-	1.75fps (1595GOP/s)	U200	289k	5677	2980	528	250MHz	link
2024	ImageNet	ResNet18	70.3%	27.0ms	-	ZC706	38k	150	1709	-	200MHz	link
2024	ImageNet	ResNet18	70.5%	7.0ms	-	ZCU102	127k	1251	2318	-	200MHz	link
2024	ImageNet	ResNet18	70.0%	1.3ms	-	U50	704k	5817	2490	576	250MHz	link
2024	ImageNet	ResNet50	76.0%	3.4ms	-	U50	867k	3807	2698	640	250MHz	link
2024	ImageNet	ResNet50	76.0%	1.8ms	-	U250	1714k	7804	4025	967	250MHz	link
2024	ImageNet	MobileNetV2	65.6%	4.8ms	-	ZC706	219k	391	1084	-	200MHz	link
2024	ImageNet	MobileNetV2	65.7%	2.3ms	-	ZCU102	273k	1222	1428	-	200MHz	link
2023	COCO	YOLOv3-Tiny	33.9%	14.3ms	418.9GOP/s	VCU110	127k	1780	4181	-	220MHz	link
2023	COCO	YOLOv3-Tiny	33.9%	6.8ms	875.7GOP/s	VCU118	431k	6687	4296	90	255MHz	link
2023	COCO	YOLOv5s	56.2%	46.4ms	392.0GOP/s	VCU110	602k	1794	3776	-	200MHz	link
2023	COCO	YOLOv5s	56.2%	14.9ms	1219.8GOP/s	VCU118	117k	5077	4052	33	270MHz	link
2023	COCO	YOLOv8s	61%	122.8ms	248.2GOP/s	VCU110	629k	1767	5565	-	200MHz	link
2023	COCO	YOLOv8s	61%	24.5ms	1244GOP/s	VCU118	1023k	6815	1322	713	240MHz	link