Current Support
| Input type | Current behavior |
|---|---|
| Plain text | Supported for request, tool-result, and output checkpoints. |
| Image data URLs | Supported on documented OpenAI rich-media paths. |
| Anthropic base64 image blocks | Supported for request preflight when the MIME type is supported. |
| Raw API image media | Supported through the media array on request checkpoints. |
| Remote image URLs | Not treated as supported guarded media. |
| Provider file IDs | Not treated as supported guarded media. |
| Audio, video, documents, and GIFs | Not supported by the current wrappers. |
image/jpegimage/pngimage/webp
How Media Is Sent
The request checkpoint separates text from media:Wrapper Behavior
OpenAI rich-media request preflight supports image data URLs on the documented Responses API and Chat Completions paths in both JavaScript and Python. Remote image URLs, OpenAI file IDs, file content parts, and audio parts are rejected before preflight. Anthropic request preflight supports base64 image content blocks with supported image MIME types. URL image sources, GIF images, document blocks, and other non-text media fail closed before provider execution. Do not assume provider parity outside the documented paths. If a page does not list a media path as supported, treat it as unguarded or unsupported until tested.Design Guidance
For multimodal agents:- keep user-visible text separate from media attachments
- normalize images before calling the provider
- avoid remote media URLs when you expect Averta to inspect the content
- test blocked and unsupported media paths in staging
- inspect Events to confirm the request checkpoint includes media
Next Steps
Create decision API
See the raw
media payload shape.OpenAI Responses
Review OpenAI rich-media limits.
Anthropic Messages
Review Anthropic image block limits.
Fail-closed behavior
Understand unsupported-path failures.