deploy

TimeSeriesFoundationModel.deploy(instance_type: str | None = None, endpoint_name: str | None = None, hyperparameters: Dict[str, Any] | None = None, framework_version: str = 'latest', custom_image_uri: str | None = None, wait: bool = True, inference_mode: Literal['realtime', 'serverless'] = 'realtime', inference_config: Dict[str, Any] | None = None, **backend_kwargs) TimeSeriesEndpoint[source]

Deploy model to an inference endpoint.

Parameters:
  • instance_type – Instance type for the endpoint. Defaults to the model registry value. Must be None when inference_mode="serverless".

  • endpoint_name – Custom endpoint name. If None, will auto-generate a unique name.

  • hyperparameters – Model hyperparameters for inference. Overrides values passed to the constructor.

  • framework_version – Container framework version. If ‘latest’, uses the most recent available.

  • custom_image_uri – Custom Docker image URI for the inference container.

  • wait – Whether to block until the endpoint is ready.

  • inference_mode – Endpoint type. "serverless" provisions a SageMaker Serverless Inference endpoint (no instance management, scales to zero).

  • inference_config – Mode-specific overrides forwarded to sagemaker.serverless.ServerlessInferenceConfig (e.g. memory_size_in_mb, max_concurrency).

  • **backend_kwargs – Backend-specific arguments (e.g., initial_instance_count, volume_size, model_kwargs, deploy_kwargs).