
Amazon SageMaker AI Async Inference now supports inline request payloads
Quick Answer
Amazon SageMaker AI Async Inference now supports inline request payloads, allowing users to send inference data directly in the InvokeEndpointAsync API request body.
Quick Take
Amazon SageMaker AI Async Inference now supports inline request payloads, allowing users to send inference data directly in the InvokeEndpointAsync API request body. This enhancement eliminates the need for prior uploads to Amazon S3, streamlining the inference process for customers.
Key Points
- Inline payloads enhance efficiency by removing S3 upload requirements.
- Users can now directly send inference data in API requests.
- This update simplifies the overall workflow for SageMaker users.
- Amazon continues to improve its AI services with user-friendly features.
Article Excerpt
From source RSS / original summaryToday, we’re announcing inline payload support for Amazon SageMaker AI Async Inference. Customers can now send inference payloads directly in the request body of the InvokeEndpointAsync API, removing the need to upload input data to Amazon Simple Storage Service (Amazon S3) before each invocation.
Reader Mode unavailable (could not extract clean content).
Want this in your inbox every morning?
Daily brief at your local 8am — bilingual EN/中文, free.
More from AWS Machine Learning
See more →
Build context-rich research agents with Deep Agents and Bedrock AgentCore
AWS introduces a method to build context-rich research agents using Deep Agents and Bedrock AgentCore. This guide is aimed at developers creating multi-step AI workflows requiring isolated execution environments, allowing deployment to Bedrock AgentCore Runtime via AgentCore CLI for managed services.

