Amazon SageMaker AI Async Inference now supports inline request payloads

6/17/2026


Quick Answer

Amazon SageMaker AI Async Inference now supports inline request payloads, allowing users to send inference data directly in the InvokeEndpointAsync API request body.

Quick Take

Customers no longer need to upload input data to Amazon S3 before each invocation, which removes a step from the async inference workflow.

Key Points

Inline payloads remove the requirement to upload input data to Amazon S3 before each invocation.
Inference data is sent directly in the request body of the InvokeEndpointAsync API.
Callers no longer have to stage input in S3 first, so each async invocation involves one step fewer.
Amazon continues to improve its AI services with user-friendly features.

Source Excerpt

Today, we’re announcing inline payload support for Amazon SageMaker AI Async Inference. Customers can now send inference payloads directly in the request body of the InvokeEndpointAsync API, removing the need to upload input data to Amazon Simple Storage Service (Amazon S3) before each invocation.
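To make the difference concrete, here is a minimal sketch that contrasts the two request shapes. The S3-based form uses the existing `InputLocation` parameter of `invoke_endpoint_async` in boto3. The excerpt above does not name the field that carries the inline payload, so `INLINE_PAYLOAD_PARAM = "Body"` is a placeholder assumption, and the endpoint name and bucket path are hypothetical too. Check the current SageMaker Runtime API reference before relying on any of these names.

```python
import json
from typing import Any, Dict

# ASSUMPTION: the announcement excerpt does not name the inline-payload field.
# "Body" is a placeholder; replace it with the name from the API reference.
INLINE_PAYLOAD_PARAM = "Body"


def build_s3_request(endpoint_name: str, input_location: str) -> Dict[str, Any]:
    """Earlier flow: the input was uploaded to S3 first and is passed by location."""
    return {
        "EndpointName": endpoint_name,
        "ContentType": "application/json",
        "InputLocation": input_location,  # e.g. "s3://bucket/key.json"
    }


def build_inline_request(endpoint_name: str, record: Dict[str, Any]) -> Dict[str, Any]:
    """Inline flow: the serialized payload travels in the request itself."""
    return {
        "EndpointName": endpoint_name,
        "ContentType": "application/json",
        INLINE_PAYLOAD_PARAM: json.dumps(record).encode("utf-8"),
    }


def invoke(request: Dict[str, Any]) -> Dict[str, Any]:
    """Send a prepared request. Needs boto3, AWS credentials, and a live endpoint."""
    import boto3  # imported lazily so the builders above work without boto3

    client = boto3.client("sagemaker-runtime")
    return client.invoke_endpoint_async(**request)


if __name__ == "__main__":
    # Hypothetical endpoint name and input record, for illustration only.
    request = build_inline_request("my-async-endpoint", {"inputs": "hello"})
    print(sorted(request))
```

Keeping the request builders separate from the network call lets the payload shape be checked locally, and confines the assumed parameter name to a single constant that is easy to correct.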

Read the full article on aws.amazon.com
