LLM Agent Network Automation Benchmark

Compare AI agent performance on real-world networking applications

Routing Configuration Capacity Planning K8s Policy Troubleshooting
News:

Welcome to the NetArena Leaderboard!

Top Correctness
-
-
Top Safety
-
-
Model Correctness Safety Latency Organization Date

Cite NetArena

@inproceedings{zhou2026netarena,
title={NetArena: Dynamic Benchmarks for AI Agents in Network Automation},
author={Yajie Zhou and Jiajun Ruan and Eric S. Wang and Sadjad Fouladi and Francis Y. Yan and Kevin Hsieh and Zaoxing Liu},
booktitle={The Fourteenth International Conference on Learning Representations},
year={2026}
}