feat: Support images and PDFs in tool results #735
Add this suggestion to a batch that can be applied as a single commit.
This suggestion is invalid because no changes were made to the code.
Suggestions cannot be applied while the pull request is closed.
Suggestions cannot be applied while viewing a subset of changes.
Only one suggestion per line can be applied in a batch.
Add this suggestion to a batch that can be applied as a single commit.
Applying suggestions on deleted lines is not supported.
You must change the existing code in this line in order to create a valid suggestion.
Outdated suggestions cannot be applied.
This suggestion has been applied or marked resolved.
Suggestions cannot be applied from pending reviews.
Suggestions cannot be applied on multi-line comments.
Suggestions cannot be applied while the pull request is queued to merge.
Suggestion cannot be applied right now. Please check back later.
This PR adds support for tool results to return images or PDF results.
This isn't a feature that's widely supported in provider APIs, but we get around this limitation by moving image and PDF content out of the tool result and into the abstract user turn that carries the tool results.
We support two cases:
content_image()
orcontent_pdf()
as a tool result.In all cases, we replace the value in the tool result with
"[see below]"
(or"[see below: item N]"
in the list case) and we wrap the extra content in<content tool-call-id="abc123" item="N">...content...</content>
XML tags.Notes
as_json()
methods forTurn
and updated them to returntool_message, user_message
.tool_string()
doesn't support having these content types in the tool result because it callsjsonlite::toJSON()
. I updated this function so that internally we can force the JSON conversion for printing, but require this work for the actual tool results that we send across the wire. If it fails, it now fails with a more informative error message. (Internally we call this function when echoing the tool result, before we've pulled out the content types.)Example