ai-infrastructure 3 gateway API inference extension Aug 27, 2025 Kubernetes dynamic resource allocation Nov 16, 2024 rdma overview Aug 21, 2024