deploy¶

TimeSeriesFoundationModel.deploy(instance_type: str | None = None, endpoint_name: str | None = None, hyperparameters: Dict[str, Any] | None = None, framework_version: str = 'latest', custom_image_uri: str | None = None, wait: bool = True, inference_mode: Literal['realtime', 'serverless'] = 'realtime', inference_config: Dict[str, Any] | None = None, **backend_kwargs) → TimeSeriesEndpoint[source]¶

Deploy model to an inference endpoint.

Parameters:

instance_type – Instance type for the endpoint. Defaults to the model registry value. Must be None when inference_mode="serverless".
endpoint_name – Custom endpoint name. If None, will auto-generate a unique name.
hyperparameters – Model hyperparameters for inference. Overrides values passed to the constructor.
framework_version – Container framework version. If ‘latest’, uses the most recent available.
custom_image_uri – Custom Docker image URI for the inference container.
wait – Whether to block until the endpoint is ready.
inference_mode – Endpoint type. "serverless" provisions a SageMaker Serverless Inference endpoint (no instance management, scales to zero).
inference_config – Mode-specific overrides forwarded to sagemaker.serverless.ServerlessInferenceConfig (e.g. memory_size_in_mb, max_concurrency).
**backend_kwargs – Backend-specific arguments (e.g., initial_instance_count, volume_size, model_kwargs, deploy_kwargs).