feat(agentkit/local): add MultiModalRead for images and PDFs #812
JonXSnow wants to merge 4 commits into
Need to create a new tag
The following modules have changes and may need version updates:
Add MultiModalRead support to both agentkit and local backends, enabling sandboxed reading of images and PDFs with configurable size and page limits. PDF rendering honors context cancellation, and the size field is widened to int64 to support larger files.
d3b5ea9 to d99cf36
d99cf36 to 1bcfb71
1bcfb71 to e53270b
…Template
Align readPythonCodeTemplate with other templates (readAllBytes, glob, execute) by base64-encoding the file_path parameter, preventing path injection when paths contain special characters like single quotes.
e4f933e to b8e8189
Replace "PNGs" with "images" in comments to avoid typo-checker false positive. Add unit tests for isImageExt, isPDFExt, newImageContentPart, and renderPDFPagesToImages to meet the 70% coverage threshold.
Add MultiModalRead to SandboxTool that returns structured multimodal content for image and PDF files, with fallback to Read for other types.
Images:
PDFs:
Size enforcement:
Also:
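The dispatch in the description (structured content for images and PDFs, fallback to Read for everything else) might look roughly like the sketch below. `isImageExt` and `isPDFExt` are named in the commits above, but their signatures here are assumed, and the extension set, `fileKind`, and `classify` are hypothetical.

```go
package main

import (
	"fmt"
	"path/filepath"
	"strings"
)

// fileKind is a hypothetical classification used only in this sketch.
type fileKind int

const (
	kindOther fileKind = iota // handled by the plain Read fallback
	kindImage
	kindPDF
)

// imageExts is an assumed set of extensions; the PR's actual list is not shown.
var imageExts = map[string]bool{
	".png": true, ".jpg": true, ".jpeg": true, ".gif": true, ".webp": true,
}

// isImageExt reports whether ext (including the leading dot) names an image
// type, ignoring case.
func isImageExt(ext string) bool { return imageExts[strings.ToLower(ext)] }

// isPDFExt reports whether ext is ".pdf", ignoring case.
func isPDFExt(ext string) bool { return strings.EqualFold(ext, ".pdf") }

// classify picks the handling path from the file extension alone.
func classify(path string) fileKind {
	ext := filepath.Ext(path)
	switch {
	case isImageExt(ext):
		return kindImage
	case isPDFExt(ext):
		return kindPDF
	default:
		return kindOther
	}
}

func main() {
	for _, p := range []string{"chart.PNG", "report.pdf", "notes.txt", "Makefile"} {
		fmt.Printf("%s -> kind %d\n", p, classify(p))
	}
}
```

Classifying by extension is cheap but trusts the file name; sniffing the leading bytes with `net/http.DetectContentType` would be a stricter alternative.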