Introducing WebSockets in the Responses API. Built for low-latency, long-running agents with heavy tool calls.
WebSockets keep a persistent connection to the Responses API, allowing you to send only new inputs instead of round-tripping the entire context on every turn. By maintaining in-memory state across interactions, it avoids repeated work and speeds up agentic runs with 20+ tool calls by 20%-40%.
19