add backoff, use direct requests instead of inference client (875e2f3), zulissimeta committed on Jun 6, 2025
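
The commit message does not show the change itself, so the following is only a minimal sketch of what "add backoff" could look like, assuming exponential backoff with a little jitter around a request callable. The helper name `call_with_backoff` and its parameters (`send`, `max_attempts`, `base_delay`, `max_delay`, `sleep`) are hypothetical and are not taken from the repository.

```python
import random
import time


def call_with_backoff(send, max_attempts=5, base_delay=1.0, max_delay=30.0,
                      sleep=time.sleep):
    """Call send() and retry with exponential backoff when it raises.

    Hypothetical helper for illustration; not the code from commit 875e2f3.
    """
    for attempt in range(max_attempts):
        try:
            return send()
        except Exception:
            # Assumption: every exception is retryable. Real code would
            # likely narrow this to timeouts, HTTP 429 and 5xx responses.
            if attempt == max_attempts - 1:
                raise  # out of attempts, surface the last error
            # Double the wait on each failed attempt, capped at max_delay.
            delay = min(max_delay, base_delay * 2 ** attempt)
            # Add up to 10% jitter so parallel clients do not retry in sync.
            sleep(delay + random.uniform(0, delay * 0.1))
```

Under these assumptions, `send` would wrap the direct HTTP call, for example a function that runs `requests.post(url, json=payload, timeout=30)` and then calls `raise_for_status()` on the response, so that a failed request raises and triggers the next retry.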