deploy¶
- TimeSeriesFoundationModel.deploy(instance_type: str | None = None, endpoint_name: str | None = None, hyperparameters: Dict[str, Any] | None = None, framework_version: str = 'latest', custom_image_uri: str | None = None, wait: bool = True, inference_mode: Literal['realtime', 'serverless'] = 'realtime', inference_config: Dict[str, Any] | None = None, **backend_kwargs) TimeSeriesEndpoint[source]¶
Deploy model to an inference endpoint.
- Parameters:
instance_type – Instance type for the endpoint. Defaults to the model registry value. Must be
Nonewheninference_mode="serverless".endpoint_name – Custom endpoint name. If None, will auto-generate a unique name.
hyperparameters – Model hyperparameters for inference. Overrides values passed to the constructor.
framework_version – Container framework version. If ‘latest’, uses the most recent available.
custom_image_uri – Custom Docker image URI for the inference container.
wait – Whether to block until the endpoint is ready.
inference_mode – Endpoint type.
"serverless"provisions a SageMaker Serverless Inference endpoint (no instance management, scales to zero).inference_config – Mode-specific overrides forwarded to
sagemaker.serverless.ServerlessInferenceConfig(e.g.memory_size_in_mb,max_concurrency).**backend_kwargs – Backend-specific arguments (e.g., initial_instance_count, volume_size, model_kwargs, deploy_kwargs).