Sagemaker-V1 18 0
Sagemaker-V1 18 0
Release 1.18.0
1 Overview 3
1.1 Estimators . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 3
1.2 HyperparameterTuner . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 20
1.3 Model . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 26
1.4 PipelineModel . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 29
1.5 Predictors . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 30
1.6 Transformer . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 31
1.7 Session . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 33
1.8 Analytics . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 46
2 MXNet 49
2.1 MXNet . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 49
3 TensorFlow 53
3.1 TensorFlow . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 53
4 Scikit-Learn 61
4.1 Scikit Learn . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 61
5 PyTorch 65
5.1 PyTorch . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 65
6 Chainer 69
6.1 Chainer . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 69
7 Reinforcement Learning 73
7.1 RLEstimator . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 73
8 SparkML Serving 77
8.1 SparkML Serving . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 77
i
9.7 LinearLearner . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 110
9.8 NTM . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 119
9.9 Object2Vec . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 124
9.10 PCA . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 130
9.11 Random Cut Forest . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 136
10 Workflows 143
10.1 Airflow . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 143
ii
sagemaker, Release 1.18.0
Amazon SageMaker Python SDK is an open source library for training and deploying machine-learned models on
Amazon SageMaker.
With the SDK, you can train and deploy models using popular deep learning frameworks, algorithms provided by
Amazon, or your own algorithms built into SageMaker-compatible Docker images.
Here you’ll find API docs for SageMaker Python SDK. The project homepage is in Github: https://github.com/aws/
sagemaker-python-sdk, where you can find the SDK source, installation instructions and a general overview of the
library.
Contents 1
sagemaker, Release 1.18.0
2 Contents
CHAPTER 1
Overview
1.1 Estimators
3
sagemaker, Release 1.18.0
training data and model artifacts. After the endpoint is created, the inference code might use
the IAM role, if it needs to access an AWS resource.
• train_instance_count (int) – Number of Amazon EC2 instances to use for train-
ing.
• train_instance_type (str) – Type of EC2 instance to use for training, for example,
‘ml.c4.xlarge’.
• train_volume_size (int) – Size in GB of the EBS volume to use for storing input
data during training (default: 30). Must be large enough to store training data if File Mode
is used (which is the default).
• train_volume_kms_key (str) – Optional. KMS key ID for encrypting EBS volume
attached to the training instance (default: None).
• train_max_run (int) – Timeout in seconds for training (default: 24 * 60 * 60). After
this amount of time Amazon SageMaker terminates the job regardless of its current status.
• input_mode (str) – The input mode that the algorithm supports (default: ‘File’). Valid
modes: ‘File’ - Amazon SageMaker copies the training dataset from the S3 location to
a local directory. ‘Pipe’ - Amazon SageMaker streams data directly from S3 to the con-
tainer via a Unix-named pipe. This argument can be overriden on a per-channel basis using
sagemaker.session.s3_input.input_mode.
• output_path (str) – S3 location for saving the trainig result (model artifacts and output
files). If not specified, results are stored to a default bucket. If the bucket with the specific
name does not exist, the estimator creates the bucket during the fit() method execution.
• output_kms_key (str) – Optional. KMS key ID for encrypting the training output
(default: None).
• base_job_name (str) – Prefix for training job name when the fit() method launches.
If not specified, the estimator generates a default job name, based on the training image name
and current timestamp.
• sagemaker_session (sagemaker.session.Session) – Session object which
manages interactions with Amazon SageMaker APIs and any other AWS services needed.
If not specified, the estimator creates one using the default AWS configuration chain.
• tags (list[dict]) – List of tags for labeling a training job. For more, see https://docs.
aws.amazon.com/sagemaker/latest/dg/API_Tag.html.
• subnets (list[str]) – List of subnet ids. If not specified training job will be created
without VPC config.
• security_group_ids (list[str]) – List of security group ids. If not specified
training job will be created without VPC config.
• model_uri (str) – URI where a pre-trained model is stored, either locally or in S3
(default: None). If specified, the estimator will create a channel pointing to the model so the
training job can download it. This model can be a ‘model.tar.gz’ from a previous training
job, or other artifacts coming from a different source.
In local mode, this should point to the path in which the model is located and not the file
itself, as local Docker containers will try to mount the URI as a volume.
More information: https://docs.aws.amazon.com/sagemaker/latest/dg/cdf-training.html#
td-deserialization
• model_channel_name (str) – Name of the channel where ‘model_uri’ will be down-
loaded (default: ‘model’).
4 Chapter 1. Overview
sagemaker, Release 1.18.0
1.1. Estimators 5
sagemaker, Release 1.18.0
• job_name (str) – Training job name. If not specified, the estimator generates a default
job name, based on the training image name and current timestamp.
compile_model(target_instance_family, input_shape, output_path, framework=None, frame-
work_version=None, compile_max_run=300, tags=None, **kwargs)
Compile a Neo model using the input model.
Parameters
• target_instance_family (str) – Identifies the device that you want to run
your model after compilation, for example: ml_c5. Allowed strings are: ml_c5,
ml_m5, ml_c4, ml_m4, jetsontx1, jetsontx2, ml_p2, ml_p3, deeplens, rasp3b
• input_shape (dict) – Specifies the name and shape of the expected inputs for
your trained model in json dictionary form, for example: {‘data’:[1,3,1024,1024]}, or
{‘var1’: [1,1,28,28], ‘var2’:[1,1,28,28]}
• output_path (str) – Specifies where to store the compiled model
• framework (str) – The framework that is used to train the original model. Allowed
values: ‘mxnet’, ‘tensorflow’, ‘pytorch’, ‘onnx’, ‘xgboost’
• framework_version (str) – The version of the framework
• compile_max_run (int) – Timeout in seconds for compilation (default: 3 * 60).
After this amount of time Amazon SageMaker Neo terminates the compilation job
regardless of its current status.
• tags (list[dict]) – List of tags for labeling a compilation job. For more, see
https://docs.aws.amazon.com/sagemaker/latest/dg/API_Tag.html.
• **kwargs – Passed to invocation of create_model(). Implementations may
customize create_model() to accept **kwargs to customize model creation
during deploy. For more, see the implementation docs.
Returns A SageMaker Model object. See Model() for full details.
Return type sagemaker.model.Model
classmethod attach(training_job_name, sagemaker_session=None,
model_channel_name=’model’)
Attach to an existing training job.
Create an Estimator bound to an existing training job, each subclass is responsible to implement
_prepare_init_params_from_job_description() as this method delegates the actual con-
version of a training job description to the arguments that the class constructor expects. After attaching,
if the training job has a Complete status, it can be deploy() ed to create a SageMaker Endpoint and
return a Predictor.
If the training job is in progress, attach will block and display log messages from the training job, until
the training job completes.
Parameters
• training_job_name (str) – The name of the training job to attach to.
• sagemaker_session (sagemaker.session.Session) – Session object
which manages interactions with Amazon SageMaker APIs and any other AWS ser-
vices needed. If not specified, the estimator creates one using the default AWS con-
figuration chain.
• model_channel_name (str) – Name of the channel where pre-trained model
data will be downloaded (default: ‘model’). If no channel with the same name exists
in the training job, this option will be ignored.
6 Chapter 1. Overview
sagemaker, Release 1.18.0
Examples
>>> my_estimator.fit(wait=False)
>>> training_job_name = my_estimator.latest_training_job.name
Later on:
>>> attached_estimator = Estimator.attach(training_job_name)
>>> attached_estimator.deploy()
Returns Instance of the calling Estimator Class with the attached training job.
1.1. Estimators 7
sagemaker, Release 1.18.0
delete_endpoint()
Delete an Amazon SageMaker Endpoint.
Raises ValueError – If the endpoint does not exist.
transformer(instance_count, instance_type, strategy=None, assemble_with=None,
output_path=None, output_kms_key=None, accept=None, env=None,
max_concurrent_transforms=None, max_payload=None, tags=None, role=None,
volume_kms_key=None)
Return a Transformer that uses a SageMaker Model based on the training job. It reuses the SageMaker
Session and base job name used by the Estimator.
Parameters
• instance_count (int) – Number of EC2 instances to use.
• instance_type (str) – Type of EC2 instance to use, for example, ‘ml.c4.xlarge’.
• strategy (str) – The strategy used to decide how to batch records in a single re-
quest (default: None). Valid values: ‘MULTI_RECORD’ and ‘SINGLE_RECORD’.
• assemble_with (str) – How the output is assembled (default: None). Valid
values: ‘Line’ or ‘None’.
• output_path (str) – S3 location for saving the transform result. If not specified,
results are stored to a default bucket.
• output_kms_key (str) – Optional. KMS key ID for encrypting the transform
output (default: None).
• accept (str) – The content type accepted by the endpoint deployed during the
transform job.
• env (dict) – Environment variables to be set for use during the transform job (de-
fault: None).
• max_concurrent_transforms (int) – The maximum number of HTTP re-
quests to be made to each individual transform container at one time.
• max_payload (int) – Maximum size of the payload in a single HTTP request to
the container in MB.
• tags (list[dict]) – List of tags for labeling a transform job. If none specified,
then the tags used for the training job are used for the transform job.
• role (str) – The ExecutionRoleArn IAM Role ARN for the Model, which is
also used during transform jobs. If not specified, the role from the Estimator will be
used.
• volume_kms_key (str) – Optional. KMS key ID for encrypting the volume at-
tached to the ML compute instance (default: None).
training_job_analytics
Return a TrainingJobAnalytics object for the current training job.
get_vpc_config(vpc_config_override=’VPC_CONFIG_DEFAULT’)
Returns VpcConfig dict either from this Estimator’s subnets and security groups, or else validate and
return an optional override value.
8 Chapter 1. Overview
sagemaker, Release 1.18.0
1.1. Estimators 9
sagemaker, Release 1.18.0
• base_job_name (str) – Prefix for training job name when the fit() method
launches. If not specified, the estimator generates a default job name, based on the train-
ing image name and current timestamp.
• sagemaker_session (sagemaker.session.Session) – Session object which
manages interactions with Amazon SageMaker APIs and any other AWS services needed.
If not specified, the estimator creates one using the default AWS configuration chain.
• hyperparameters (dict) – Dictionary containing the hyperparameters to initialize
this estimator with.
• tags (list[dict]) – List of tags for labeling a training job. For more, see https:
//docs.aws.amazon.com/sagemaker/latest/dg/API_Tag.html.
• subnets (list[str]) – List of subnet ids. If not specified training job will be created
without VPC config.
• security_group_ids (list[str]) – List of security group ids. If not specified
training job will be created without VPC config.
• model_uri (str) – URI where a pre-trained model is stored, either locally or in S3
(default: None). If specified, the estimator will create a channel pointing to the model
so the training job can download it. This model can be a ‘model.tar.gz’ from a previous
training job, or other artifacts coming from a different source.
In local mode, this should point to the path in which the model is located and not the file
itself, as local Docker containers will try to mount the URI as a volume.
More information: https://docs.aws.amazon.com/sagemaker/latest/dg/cdf-training.html#
td-deserialization
• model_channel_name (str) – Name of the channel where ‘model_uri’ will be
downloaded (default: ‘model’).
• metric_definitions (list[dict]) – A list of dictionaries that defines the met-
ric(s) used to evaluate the training jobs. Each dictionary contains two keys: ‘Name’ for
the name of the metric, and ‘Regex’ for the regular expression used to extract the metric
from the logs. This should be defined only for jobs that don’t use an Amazon algorithm.
• encrypt_inter_container_traffic (bool) – Specifies whether traffic be-
tween training containers is encrypted for the training job (default: False).
train_image()
Returns the docker image to use for training.
The fit() method, that does the model training, calls this method to find the image to use for model training.
set_hyperparameters(**kwargs)
hyperparameters()
Returns the hyperparameters as a dictionary to use for training.
The fit() method, that does the model training, calls this method to find the hyperparameters you specified.
create_model(role=None, image=None, predictor_cls=None, serial-
izer=None, deserializer=None, content_type=None, accept=None,
vpc_config_override=’VPC_CONFIG_DEFAULT’, **kwargs)
Create a model to deploy.
Parameters
• role (str) – The ExecutionRoleArn IAM Role ARN for the Model, which is
also used during transform jobs. If not specified, the role from the Estimator will be
used.
10 Chapter 1. Overview
sagemaker, Release 1.18.0
• image (str) – An container image to use for deploying the model. Defaults to the
image used for training.
• predictor_cls (RealTimePredictor) – The predictor class to use when de-
ploying the model.
• serializer (callable) – Should accept a single argument, the input data, and
return a sequence of bytes. May provide a content_type attribute that defines the
endpoint request content type
• deserializer (callable) – Should accept two arguments, the result data and
the response content type, and return a sequence of bytes. May provide a content_type
attribute that defines th endpoint response Accept content type.
• content_type (str) – The invocation ContentType, overriding any content_type
from the serializer
• accept (str) – The invocation Accept, overriding any accept from the deserializer.
• vpc_config_override (dict[str, list[str]]) – Optional override for
VpcConfig set on the model. Default: use subnets and security groups from this
Estimator. * ‘Subnets’ (list[str]): List of subnet ids. * ‘SecurityGroupIds’ (list[str]):
List of security group ids.
• serializer, deserializer, content_type, and accept
arguments are only used to define a default (The) –
• They are ignored if an explicit predictor class is
passed in. Other arguments (RealTimePredictor.) –
• passed through to the Model class. (are) –
Returns: a Model ready for deployment.
classmethod attach(training_job_name, sagemaker_session=None,
model_channel_name=’model’)
Attach to an existing training job.
Create an Estimator bound to an existing training job, each subclass is responsible to implement
_prepare_init_params_from_job_description() as this method delegates the actual con-
version of a training job description to the arguments that the class constructor expects. After attaching,
if the training job has a Complete status, it can be deploy() ed to create a SageMaker Endpoint and
return a Predictor.
If the training job is in progress, attach will block and display log messages from the training job, until
the training job completes.
Parameters
• training_job_name (str) – The name of the training job to attach to.
• sagemaker_session (sagemaker.session.Session) – Session object
which manages interactions with Amazon SageMaker APIs and any other AWS ser-
vices needed. If not specified, the estimator creates one using the default AWS con-
figuration chain.
• model_channel_name (str) – Name of the channel where pre-trained model
data will be downloaded (default: ‘model’). If no channel with the same name exists
in the training job, this option will be ignored.
1.1. Estimators 11
sagemaker, Release 1.18.0
Examples
>>> my_estimator.fit(wait=False)
>>> training_job_name = my_estimator.latest_training_job.name
Later on:
>>> attached_estimator = Estimator.attach(training_job_name)
>>> attached_estimator.deploy()
Returns Instance of the calling Estimator Class with the attached training job.
12 Chapter 1. Overview
sagemaker, Release 1.18.0
1.1. Estimators 13
sagemaker, Release 1.18.0
get_vpc_config(vpc_config_override=’VPC_CONFIG_DEFAULT’)
Returns VpcConfig dict either from this Estimator’s subnets and security groups, or else validate and
return an optional override value.
model_data
The model location in S3. Only set if Estimator has been fit().
Type str
training_job_analytics
Return a TrainingJobAnalytics object for the current training job.
transformer(instance_count, instance_type, strategy=None, assemble_with=None,
output_path=None, output_kms_key=None, accept=None, env=None,
max_concurrent_transforms=None, max_payload=None, tags=None, role=None,
volume_kms_key=None)
Return a Transformer that uses a SageMaker Model based on the training job. It reuses the SageMaker
Session and base job name used by the Estimator.
Parameters
• instance_count (int) – Number of EC2 instances to use.
• instance_type (str) – Type of EC2 instance to use, for example, ‘ml.c4.xlarge’.
• strategy (str) – The strategy used to decide how to batch records in a single re-
quest (default: None). Valid values: ‘MULTI_RECORD’ and ‘SINGLE_RECORD’.
• assemble_with (str) – How the output is assembled (default: None). Valid
values: ‘Line’ or ‘None’.
• output_path (str) – S3 location for saving the transform result. If not specified,
results are stored to a default bucket.
• output_kms_key (str) – Optional. KMS key ID for encrypting the transform
output (default: None).
• accept (str) – The content type accepted by the endpoint deployed during the
transform job.
• env (dict) – Environment variables to be set for use during the transform job (de-
fault: None).
• max_concurrent_transforms (int) – The maximum number of HTTP re-
quests to be made to each individual transform container at one time.
• max_payload (int) – Maximum size of the payload in a single HTTP request to
the container in MB.
• tags (list[dict]) – List of tags for labeling a transform job. If none specified,
then the tags used for the training job are used for the transform job.
• role (str) – The ExecutionRoleArn IAM Role ARN for the Model, which is
also used during transform jobs. If not specified, the role from the Estimator will be
used.
• volume_kms_key (str) – Optional. KMS key ID for encrypting the volume at-
tached to the ML compute instance (default: None).
class sagemaker.estimator.Framework(entry_point, source_dir=None, hyperparame-
ters=None, enable_cloudwatch_metrics=False,
container_log_level=20, code_location=None, im-
age_name=None, dependencies=None, **kwargs)
Bases: sagemaker.estimator.EstimatorBase
14 Chapter 1. Overview
sagemaker, Release 1.18.0
>>> $ ls
>>> opt/ml/code
>>> |------ train.py
>>> |------ common
>>> |------ virtual-env
1.1. Estimators 15
sagemaker, Release 1.18.0
hyperparameters()
Return the hyperparameters as a dictionary to use for training.
The fit() method, which trains the model, calls this method to find the hyperparameters.
Returns The hyperparameters.
Return type dict[str, str]
train_image()
Return the Docker image to use for training.
The fit() method, which does the model training, calls this method to find the image to use for model
training.
Returns The URI of the Docker image.
Return type str
classmethod attach(training_job_name, sagemaker_session=None,
model_channel_name=’model’)
Attach to an existing training job.
Create an Estimator bound to an existing training job, each subclass is responsible to implement
_prepare_init_params_from_job_description() as this method delegates the actual con-
version of a training job description to the arguments that the class constructor expects. After attaching,
if the training job has a Complete status, it can be deploy() ed to create a SageMaker Endpoint and
return a Predictor.
If the training job is in progress, attach will block and display log messages from the training job, until
the training job completes.
Parameters
• training_job_name (str) – The name of the training job to attach to.
• sagemaker_session (sagemaker.session.Session) – Session object
which manages interactions with Amazon SageMaker APIs and any other AWS ser-
vices needed. If not specified, the estimator creates one using the default AWS con-
figuration chain.
• model_channel_name (str) – Name of the channel where pre-trained model
data will be downloaded (default: ‘model’). If no channel with the same name exists
in the training job, this option will be ignored.
Examples
>>> my_estimator.fit(wait=False)
>>> training_job_name = my_estimator.latest_training_job.name
Later on:
>>> attached_estimator = Estimator.attach(training_job_name)
>>> attached_estimator.deploy()
Returns Instance of the calling Estimator Class with the attached training job.
16 Chapter 1. Overview
sagemaker, Release 1.18.0
1.1. Estimators 17
sagemaker, Release 1.18.0
18 Chapter 1. Overview
sagemaker, Release 1.18.0
1.1. Estimators 19
sagemaker, Release 1.18.0
1.2 HyperparameterTuner
20 Chapter 1. Overview
sagemaker, Release 1.18.0
1.2. HyperparameterTuner 21
sagemaker, Release 1.18.0
Examples
>>> my_tuner.fit()
>>> job_name = my_tuner.latest_tuning_job.name
Later on:
>>> attached_tuner = HyperparameterTuner.attach(job_name)
>>> attached_tuner.deploy()
Returns
A HyperparameterTuner instance with the attached hyperparameter tuning
job.
Return type sagemaker.tuner.HyperparameterTuner
22 Chapter 1. Overview
sagemaker, Release 1.18.0
1.2. HyperparameterTuner 23
sagemaker, Release 1.18.0
Examples
Later On:
>>> transfer_learning_tuner.fit(inputs={})
identical_dataset_and_algorithm_tuner(additional_parents=None)
Creates a new HyperparameterTuner by copying the request fields from the provided parent
to the new instance of HyperparameterTuner. Followed by addition of warm start configu-
ration with the type as “IdenticalDataAndAlgorithm” and parents as the union of provided list of
additional_parents and the self
Parameters
• additional_parents (set{str}) – Set of additional parents along with the
self to be used in warm starting
• identical dataset and algorithm tuner. (the) –
Returns HyperparameterTuner instance which can be used to launch identical dataset and
algorithm tuning job.
Return type sagemaker.tuner.HyperparameterTuner
Examples
>>> additional_
˓→parents={"parent-job-2"})
Later On:
>>> identical_dataset_algo_tuner.fit(inputs={})
24 Chapter 1. Overview
sagemaker, Release 1.18.0
1.2. HyperparameterTuner 25
sagemaker, Release 1.18.0
1.3 Model
26 Chapter 1. Overview
sagemaker, Release 1.18.0
1.3. Model 27
sagemaker, Release 1.18.0
28 Chapter 1. Overview
sagemaker, Release 1.18.0
1.4 PipelineModel
1.4. PipelineModel 29
sagemaker, Release 1.18.0
The name of the created endpoint is accessible in the endpoint_name field of this Model after deploy
returns.
Parameters
• instance_type (str) – The EC2 instance type to deploy this Model to. For
example, ‘ml.p2.xlarge’.
• initial_instance_count (int) – The initial number of instances to run in
the Endpoint created from this Model.
• endpoint_name (str) – The name of the endpoint to create (default: None). If
not specified, a unique endpoint name will be created.
• tags (List[dict[str, str]]) – The list of tags to attach to this specific end-
point.
Returns
Invocation of self.predictor_cls on the created endpoint name, if self.
predictor_cls is not None. Otherwise, return None.
Return type callable[string, sagemaker.session.Session] or None
1.5 Predictors
30 Chapter 1. Overview
sagemaker, Release 1.18.0
1.6 Transformer
1.6. Transformer 31
sagemaker, Release 1.18.0
• output_kms_key (str) – Optional. KMS key ID for encrypting the transform output
(default: None).
• accept (str) – The content type accepted by the endpoint deployed during the trans-
form job.
• max_concurrent_transforms (int) – The maximum number of HTTP requests
to be made to each individual transform container at one time.
• max_payload (int) – Maximum size of the payload in a single HTTP request to the
container in MB.
• env (dict) – Environment variables to be set for use during the transform job (default:
None).
• tags (list[dict]) – List of tags for labeling a transform job (default: None). For
more, see https://docs.aws.amazon.com/sagemaker/latest/dg/API_Tag.html.
• base_transform_job_name (str) – Prefix for the transform job when the
transform() method launches. If not specified, a default prefix will be generated
based on the training image name that was used to train the model associated with the
transform job.
• sagemaker_session (sagemaker.session.Session) – Session object which
manages interactions with Amazon SageMaker APIs and any other AWS services needed.
If not specified, the estimator creates one using the default AWS configuration chain.
• volume_kms_key (str) – Optional. KMS key ID for encrypting the volume attached
to the ML compute instance (default: None).
transform(data, data_type=’S3Prefix’, content_type=None, compression_type=None,
split_type=None, job_name=None)
Start a new transform job.
Parameters
• data (str) – Input data location in S3.
• data_type (str) – What the S3 location defines (default: ‘S3Prefix’). Valid val-
ues:
– ’S3Prefix’ - the S3 URI defines a key name prefix. All objects with this prefix will be used as
inputs for the transform job.
– ’ManifestFile’ - the S3 URI points to a single manifest file listing each S3 object to use as
an input for the transform job.
• content_type (str) – MIME type of the input data (default: None).
• compression_type (str) – Compression type of the input data, if compressed
(default: None). Valid values: ‘Gzip’, None.
• split_type (str) – The record delimiter for the input object (default: ‘None’).
Valid values: ‘None’, ‘Line’, ‘RecordIO’, and ‘TFRecord’.
• job_name (str) – job name (default: None). If not specified, one will be generated.
wait()
classmethod attach(transform_job_name, sagemaker_session=None)
Attach an existing transform job to a new Transformer instance
Parameters
• transform_job_name (str) – Name for the transform job to be attached.
32 Chapter 1. Overview
sagemaker, Release 1.18.0
1.7 Session
class sagemaker.session.LogState
Bases: object
STARTING = 1
WAIT_IN_PROGRESS = 2
TAILING = 3
JOB_COMPLETE = 4
COMPLETE = 5
class sagemaker.session.Session(boto_session=None, sagemaker_client=None, sage-
maker_runtime_client=None)
Bases: object
Manage interactions with the Amazon SageMaker APIs and any other AWS services needed.
This class provides convenient methods for manipulating entities and resources that Amazon SageMaker uses,
such as training jobs, endpoints, and input datasets in S3.
AWS service calls are delegated to an underlying Boto3 session, which by default is initialized using the AWS
configuration chain. When you make an Amazon SageMaker API call that accesses an S3 bucket location and
one is not specified, the Session creates a default bucket based on a naming convention which includes the
current AWS account ID.
Initialize a SageMaker Session.
Parameters
• boto_session (boto3.session.Session) – The underlying Boto3 session
which AWS service calls are delegated to (default: None). If not provided, one is created
with default AWS configuration chain.
• sagemaker_client (boto3.SageMaker.Client) – Client which makes Ama-
zon SageMaker service calls other than InvokeEndpoint (default: None). Estimators
created using this Session use this client. If not provided, one will be created using
this instance’s boto_session.
• sagemaker_runtime_client (boto3.SageMakerRuntime.Client) –
Client which makes InvokeEndpoint calls to Amazon SageMaker (default: None).
Predictors created using this Session use this client. If not provided, one will be
created using this instance’s boto_session.
boto_region_name
upload_data(path, bucket=None, key_prefix=’data’)
Upload local file or directory to S3.
1.7. Session 33
sagemaker, Release 1.18.0
If a single file is specified for upload, the resulting S3 object key is {key_prefix}/{filename}
(filename does not include the local path, if any specified).
If a directory is specified for upload, the API uploads all content, recursively, preserving rel-
ative structure of subdirectories. The resulting object key names are: {key_prefix}/
{relative_subdirectory_path}/filename.
Parameters
• path (str) – Path (absolute or relative) of local file or directory to upload.
• bucket (str) – Name of the S3 Bucket to upload to (default: None). If not speci-
fied, the default bucket of the Session is used (if default bucket does not exist, the
Session creates it).
• key_prefix (str) – Optional S3 object key name prefix (default: ‘data’). S3 uses
the prefix to create a directory structure for the bucket content that it display in the S3
console.
Returns
The S3 URI of the uploaded file(s). If a file is specified in the path argument, the URI format is:
s3://{bucket name}/{key_prefix}/{original_file_name}. If a
directory is specified in the path argument, the URI format is s3://{bucket
name}/{key_prefix}.
Return type str
default_bucket()
Return the name of the default bucket to use in relevant Amazon SageMaker interactions.
Returns The name of the default bucket, which is of the form:
sagemaker-{region}-{AWS account ID}.
Return type str
train(input_mode, input_config, role, job_name, output_config, resource_config, vpc_config, hy-
perparameters, stop_condition, tags, metric_definitions, enable_network_isolation=False, im-
age=None, algorithm_arn=None, encrypt_inter_container_traffic=False)
Create an Amazon SageMaker training job.
Parameters
• input_mode (str) – The input mode that the algorithm supports. Valid modes:
– ’File’ - Amazon SageMaker copies the training dataset from the S3 location to
a directory in the Docker container.
– ’Pipe’ - Amazon SageMaker streams data directly from S3 to the container via a
Unix-named pipe.
• input_config (list) – A list of Channel objects. Each channel is a named input
source. Please refer to the format details described: https://botocore.readthedocs.io/
en/latest/reference/services/sagemaker.html#SageMaker.Client.create_training_job
• role (str) – An AWS IAM role (either name or full ARN). The Amazon Sage-
Maker training jobs and APIs that create Amazon SageMaker endpoints use this role
to access training data and model artifacts. You must grant sufficient permissions to
this role.
• job_name (str) – Name of the training job being created.
• output_config (dict) – The S3 URI where you want to store the training results
and optional KMS key ID.
34 Chapter 1. Overview
sagemaker, Release 1.18.0
1.7. Session 35
sagemaker, Release 1.18.0
36 Chapter 1. Overview
sagemaker, Release 1.18.0
1.7. Session 37
sagemaker, Release 1.18.0
Parameters
• container_defs (list[dict[str, str]] or [dict[str, str]])
– A single container definition or a list of container definitions which will
be invoked sequentially while performing the prediction. If the list con-
tains only one container, then it’ll be passed to SageMaker Hosting as the
PrimaryContainer and otherwise, it’ll be passed as Containers.You can
also specify the return value of sagemaker.get_container_def() or
sagemaker.pipeline_container_def(), which will used to create more
advanced container configurations ,including model containers which need artifacts
from S3.
• vpc_config (dict[str, list[str]]) – The VpcConfig set on the model
(default: None) * ‘Subnets’ (list[str]): List of subnet ids. * ‘SecurityGroupIds’
(list[str]): List of security group ids.
• enable_network_isolation (bool) – Wether the model requires network
isolation or not.
• primary_container (str or dict[str, str]) – Docker image which
defines the inference code. You can also specify the return value of sagemaker.
container_def(), which is used to create more advanced container configura-
tions, including model containers which need artifacts from S3. This field is depre-
cated, please use container_defs instead.
Returns Name of the Amazon SageMaker Model created.
Return type str
38 Chapter 1. Overview
sagemaker, Release 1.18.0
Parameters
• training_job_name (str) – The Amazon SageMaker Training Job name.
• name (str) – The name of the SageMaker Model to create (default: None). If not
specified, the training job name is used.
• role (str) – The ExecutionRoleArn IAM Role ARN for the Model, specified
either by an IAM role name or role ARN. If None, the RoleArn from the SageMaker
Training Job will be used.
• primary_container_image (str) – The Docker image reference (default:
None). If None, it defaults to the Training Image in training_job_name.
• model_data_url (str) – S3 location of the model data (default: None). If None,
defaults to the ModelS3Artifacts of training_job_name.
• env (dict[string,string]) – Model environment variables (default: {}).
• vpc_config_override (dict[str, list[str]]) – Optional override for
VpcConfig set on the model. Default: use VpcConfig from training job. * ‘Subnets’
(list[str]): List of subnet ids. * ‘SecurityGroupIds’ (list[str]): List of security group
ids.
Returns The name of the created Model.
Return type str
create_model_package_from_algorithm(name, description, algorithm_arn, model_data)
Create a SageMaker Model Package from the results of training with an Algorithm Package
Parameters
• name (str) – ModelPackage name
• description (str) – Model Package description
• algorithm_arn (str) – arn or name of the algorithm used for training.
• model_data (str) – s3 URI to the model artifacts produced by training
wait_for_model_package(model_package_name, poll=5)
Wait for an Amazon SageMaker endpoint deployment to complete.
Parameters
• endpoint (str) – Name of the Endpoint to wait for.
• poll (int) – Polling interval in seconds (default: 5).
Returns Return value from the DescribeEndpoint API.
Return type dict
create_endpoint_config(name, model_name, initial_instance_count, instance_type, accelera-
tor_type=None)
Create an Amazon SageMaker endpoint configuration.
The endpoint configuration identifies the Amazon SageMaker model (created using the CreateModel
API) and the hardware configuration on which to deploy the model. Provide this endpoint configuration
to the CreateEndpoint API, which then launches the hardware and deploys the model.
Parameters
• name (str) – Name of the Amazon SageMaker endpoint configuration to create.
• model_name (str) – Name of the Amazon SageMaker Model.
1.7. Session 39
sagemaker, Release 1.18.0
40 Chapter 1. Overview
sagemaker, Release 1.18.0
1.7. Session 41
sagemaker, Release 1.18.0
42 Chapter 1. Overview
sagemaker, Release 1.18.0
• wait (bool) – Whether to wait for the endpoint deployment to complete before
returning (default: True).
• model_environment_vars (dict[str, str]) – Environment variables to
set on the model container (default: None).
• model_vpc_config (dict[str, list[str]]) – The VpcConfig set on the
model (default: None) * ‘Subnets’ (list[str]): List of subnet ids. * ‘SecurityGroupIds’
(list[str]): List of security group ids.
• accelerator_type (str) – Type of Elastic Inference accelerator to attach to the
instance. For example, ‘ml.eia1.medium’. For more information: https://docs.aws.
amazon.com/sagemaker/latest/dg/ei.html
Returns Name of the Endpoint that is created.
Return type str
endpoint_from_production_variants(name, production_variants, tags=None, wait=True)
Create an SageMaker Endpoint from a list of production variants.
Parameters
• name (str) – The name of the Endpoint to create.
• production_variants (list[dict[str, str]]) – The list of production
variants to deploy.
• tags (list[dict[str, str]]) – A list of key-value pairs for tagging the end-
point (default: None).
• wait (bool) – Whether to wait for the endpoint deployment to complete before
returning (default: True).
Returns The name of the created Endpoint.
Return type str
expand_role(role)
Expand an IAM role name into an ARN.
If the role is already in the form of an ARN, then the role is simply returned. Otherwise we retrieve the
full ARN and return it.
Parameters role (str) – An AWS IAM role (either name or full ARN).
Returns The corresponding AWS IAM role ARN.
Return type str
get_caller_identity_arn()
Returns the ARN user or role whose credentials are used to call the API. :returns: The ARN user or role
:rtype: (str)
logs_for_job(job_name, wait=False, poll=10)
Display the logs for a given training job, optionally tailing them until the job is complete. If the output is
a tty or a Jupyter cell, it will be color-coded based on which instance the log entry is from.
Parameters
• job_name (str) – Name of the training job to display the logs for.
• wait (bool) – Whether to keep looking for new log entries until the job completes
(default: False).
1.7. Session 43
sagemaker, Release 1.18.0
• poll (int) – The interval in seconds between polling for new log entries and job
completion (default: 5).
Raises ValueError – If waiting and the training job fails.
sagemaker.session.container_def(image, model_data_url=None, env=None)
Create a definition for executing a container as part of a SageMaker model. :param image: Docker image to run
for this container. :type image: str :param model_data_url: S3 URI of data required by this container,
e.g. SageMaker training job model artifacts (default: None).
Parameters env (dict[str, str]) – Environment variables to set inside the container (de-
fault: None).
Returns A complete container definition object usable with the CreateModel API if passed via
PrimaryContainers field.
Return type dict[str, str]
sagemaker.session.pipeline_container_def(models, instance_type=None)
Create a definition for executing a pipeline of containers as part of a SageMaker model. :param models: this
will be a list of sagemaker.Model objects in the order the inference :type models: list[sagemaker.Model]
:param should be invoked.: :param instance_type: The EC2 instance type to deploy this Model to. For example,
‘ml.p2.xlarge’ (default: None). :type instance_type: str
Returns list of container definition objects usable with with the CreateModel API for inference
pipelines if passed via Containers field.
Return type list[dict[str, str]]
sagemaker.session.production_variant(model_name, instance_type, initial_instance_count=1,
variant_name=’AllTraffic’, initial_weight=1, accelera-
tor_type=None)
Create a production variant description suitable for use in a ProductionVariant list as part of a
CreateEndpointConfig request.
Parameters
• model_name (str) – The name of the SageMaker model this production variant refer-
ences.
• instance_type (str) – The EC2 instance type for this production variant. For ex-
ample, ‘ml.c4.8xlarge’.
• initial_instance_count (int) – The initial instance count for this production
variant (default: 1).
• variant_name (string) – The VariantName of this production variant (default:
‘AllTraffic’).
• initial_weight (int) – The relative InitialVariantWeight of this produc-
tion variant (default: 1).
• accelerator_type (str) – Type of Elastic Inference accelerator for this produc-
tion variant. For example, ‘ml.eia1.medium’. For more information: https://docs.aws.
amazon.com/sagemaker/latest/dg/ei.html
Returns An SageMaker ProductionVariant description
Return type dict[str, str]
44 Chapter 1. Overview
sagemaker, Release 1.18.0
sagemaker.session.get_execution_role(sagemaker_session=None)
Return the role ARN whose credentials are used to call the API. Throws an exception if :param sage-
maker_session: Current sagemaker session :type sagemaker_session: Session
Returns The role ARN
Return type (str)
class sagemaker.session.s3_input(s3_data, distribution=’FullyReplicated’, compres-
sion=None, content_type=None, record_wrapping=None,
s3_data_type=’S3Prefix’, input_mode=None, at-
tribute_names=None, shuffle_config=None)
Bases: object
Amazon SageMaker channel configurations for S3 data sources.
config
A SageMaker DataSource referencing a SageMaker S3DataSource.
Type dict[str, dict]
Create a definition for input data used by an SageMaker training job.
See AWS documentation on the CreateTrainingJob API for more details on the parameters.
Parameters
• s3_data (str) – Defines the location of s3 data to train on.
• distribution (str) – Valid values: ‘FullyReplicated’, ‘ShardedByS3Key’ (default:
‘FullyReplicated’).
• compression (str) – Valid values: ‘Gzip’, None (default: None). This is used only
in Pipe input mode.
• content_type (str) – MIME type of the input data (default: None).
• record_wrapping (str) – Valid values: ‘RecordIO’ (default: None).
• s3_data_type (str) – Valid values: ‘S3Prefix’, ‘ManifestFile’, ‘AugmentedMani-
festFile’. If ‘S3Prefix’, s3_data defines a prefix of s3 objects to train on. All ob-
jects with s3 keys beginning with s3_data will be used to train. If ‘ManifestFile’ or
‘AugmentedManifestFile’, then s3_data defines a single s3 manifest file or augmented
manifest file (respectively), listing the s3 data to train on. Both the ManifestFile and
AugmentedManifestFile formats are described in the SageMaker API
documentation: https://docs.aws.amazon.com/sagemaker/latest/dg/API_
S3DataSource.html
• input_mode (str) – Optional override for this channel’s input mode (default: None).
By default, channels will use the input mode defined on sagemaker.estimator.
EstimatorBase.input_mode, but they will ignore that setting if this parameter is
set. * None - Amazon SageMaker will use the input mode specified in the Estimator.
* ‘File’ - Amazon SageMaker copies the training dataset from the S3 location to a local
directory. * ‘Pipe’ - Amazon SageMaker streams data directly from S3 to the container
via a Unix-named pipe.
• attribute_names (list[str]) – A list of one or more attribute names to use that
are found in a specified AugmentedManifestFile.
• shuffle_config (ShuffleConfig) – If specified this configuration enables shuf-
fling on this channel. See the SageMaker API documentation for more info: https:
//docs.aws.amazon.com/sagemaker/latest/dg/API_ShuffleConfig.html
1.7. Session 45
sagemaker, Release 1.18.0
class sagemaker.session.ShuffleConfig(seed)
Bases: object
Used to configure channel shuffling using a seed. See SageMaker documentation for more detail: https://docs.
aws.amazon.com/sagemaker/latest/dg/API_ShuffleConfig.html
Create a ShuffleConfig. :param seed: the long value used to seed the shuffled sequence. :type seed: long
class sagemaker.session.ModelContainer(model_data, image, env=None)
Bases: object
Amazon SageMaker Model configurations for inference pipelines. .. attribute:: model_data
S3 Model artifact location
type str
image
Docker image URL in ECR
Type str
env
Environment variable mapping
Type dict[str,str]
Create a definition of a model which can be part of an Inference Pipeline :param model_data: The S3 location
of a SageMaker model data .tar.gz file. :type model_data: str :param image: A Docker image URI. :type
image: str :param env: Environment variables to run with image when hosted in SageMaker (default: None).
:type env: dict[str, str]
1.8 Analytics
class sagemaker.analytics.AnalyticsMetricsBase
Bases: object
Base class for tuning job or training job analytics classes. Understands common functionality like persistence
and caching.
export_csv(filename)
Persists the analytics dataframe to a file.
Parameters filename (str) – The name of the file to save to.
dataframe(force_refresh=False)
A pandas dataframe with lots of interesting results about this object. Created by calling SageMaker List
and Describe APIs and converting them into a convenient tabular summary.
Parameters force_refresh (bool) – Set to True to fetch the latest data from SageMaker
API.
clear_cache()
Clear the object of all local caches of API methods, so that the next time any properties are accessed they
will be refreshed from the service.
class sagemaker.analytics.HyperparameterTuningJobAnalytics(hyperparameter_tuning_job_name,
sage-
maker_session=None)
Bases: sagemaker.analytics.AnalyticsMetricsBase
Fetch results about a hyperparameter tuning job and make them accessible for analytics.
46 Chapter 1. Overview
sagemaker, Release 1.18.0
1.8. Analytics 47
sagemaker, Release 1.18.0
clear_cache()
Clear the object of all local caches of API methods, so that the next time any properties are accessed they
will be refreshed from the service.
48 Chapter 1. Overview
CHAPTER 2
MXNet
2.1 MXNet
49
sagemaker, Release 1.18.0
50 Chapter 2. MXNet
sagemaker, Release 1.18.0
Initialize an MXNetModel.
Parameters
• model_data (str) – The S3 location of a SageMaker model data .tar.gz file.
• role (str) – An AWS IAM role (either name or full ARN). The Amazon SageMaker
training jobs and APIs that create Amazon SageMaker endpoints use this role to access
training data and model artifacts. After the endpoint is created, the inference code might
use the IAM role, if it needs to access an AWS resource.
• entry_point (str) – Path (absolute or relative) to the Python source file which
should be executed as the entry point to model hosting. This should be compatible with
either Python 2.7 or Python 3.5.
• image (str) – A Docker image URI (default: None). If not specified, a default image
for MXNet will be used.
• py_version (str) – Python version you want to use for executing your model training
code (default: ‘py2’).
• framework_version (str) – MXNet version you want to use for executing your
model training code.
• predictor_cls (callable[str, sagemaker.session.Session]) – A
function to call to create a predictor with an endpoint name and SageMaker Session. If
specified, deploy() returns the result of invoking this function on the created endpoint
name.
• model_server_workers (int) – Optional. The number of worker processes used
by the inference server. If None, server will use one worker per vCPU.
• **kwargs – Keyword arguments passed to the FrameworkModel initializer.
prepare_container_def(instance_type, accelerator_type=None)
Return a container definition with framework configuration set in model environment variables.
Parameters
• instance_type (str) – The EC2 instance type to deploy this Model to. For
example, ‘ml.p2.xlarge’.
• accelerator_type (str) – The Elastic Inference accelerator type to deploy
to the instance for loading and making inferences to the model. For example,
‘ml.eia1.medium’.
Returns A container definition object usable with the CreateModel API.
Return type dict[str, str]
2.1. MXNet 51
sagemaker, Release 1.18.0
52 Chapter 2. MXNet
CHAPTER 3
TensorFlow
3.1 TensorFlow
Parameters
• checkpoint_path (str) – Identifies S3 location where checkpoint data during
model training can be saved (default: None). For distributed model training, this pa-
rameter is required.
• py_version (str) – Python version you want to use for executing your model training
code (default: ‘py2’).
53
sagemaker, Release 1.18.0
To enable MPI:
{ ‘mpi’: {
’enabled’: True
}
}
LATEST_VERSION = '1.12'
fit(inputs=None, wait=True, logs=True, job_name=None, run_tensorboard_locally=False)
Train a model using the input training dataset.
See fit() for more details.
Parameters
• inputs (str or dict or sagemaker.session.s3_input) – Informa-
tion about the training data. This can be one of three types: (str) - the S3 location
where training data is saved. (dict[str, str] or dict[str, sagemaker.session.s3_input]) -
If using multiple channels for
training data, you can specify a dict mapping channel names to strings or
s3_input() objects.
54 Chapter 3. TensorFlow
sagemaker, Release 1.18.0
• wait (bool) – Whether the call should wait until the job completes (default: True).
• logs (bool) – Whether to show the logs produced by the job. Only meaningful
when wait is True (default: True).
• job_name (str) – Training job name. If not specified, the estimator generates a
default job name, based on the training image name and current timestamp.
• run_tensorboard_locally (bool) – Whether to execute TensorBoard in a
different process with downloaded checkpoint information (default: False). This is
an experimental feature, and requires TensorBoard and AWS CLI to be installed. It
terminates TensorBoard when execution ends.
create_model(model_server_workers=None, role=None, vpc_config_override=’VPC_CONFIG_DEFAULT’,
endpoint_type=None)
Create a SageMaker TensorFlowModel object that can be deployed to an Endpoint.
Parameters
• role (str) – The ExecutionRoleArn IAM Role ARN for the Model, which is
also used during transform jobs. If not specified, the role from the Estimator will be
used.
• model_server_workers (int) – Optional. The number of worker processes
used by the inference server. If None, server will use one worker per vCPU.
• vpc_config_override (dict[str, list[str]]) – Optional override for
VpcConfig set on the model. Default: use subnets and security groups from this
Estimator. * ‘Subnets’ (list[str]): List of subnet ids. * ‘SecurityGroupIds’ (list[str]):
List of security group ids.
• endpoint_type – Optional. Selects the software stack used by the inference
server. If not specified, the model will be configured to use the default SageMaker
model server. If ‘tensorflow-serving’, the model will be configured to use the Sage-
Maker Tensorflow Serving container.
Returns
A SageMaker TensorFlowModel object. See TensorFlowModel() for full de-
tails.
Return type sagemaker.tensorflow.model.TensorFlowModel
hyperparameters()
Return hyperparameters used by your custom TensorFlow code during model training.
train_image()
Return the Docker image to use for training.
The fit() method, which does the model training, calls this method to find the image to use for model
training.
Returns The URI of the Docker image.
Return type str
3.1. TensorFlow 55
sagemaker, Release 1.18.0
56 Chapter 3. TensorFlow
sagemaker, Release 1.18.0
3.1. TensorFlow 57
sagemaker, Release 1.18.0
prepare_container_def(instance_type, accelerator_type=None)
Return a dict created by sagemaker.container_def() for deploying this model to a specified
instance type.
Subclasses can override this to provide custom container definitions for deployment to a specific instance
type. Called by deploy().
Parameters
• instance_type (str) – The EC2 instance type to deploy this Model to. For
example, ‘ml.p2.xlarge’.
• accelerator_type (str) – The Elastic Inference accelerator type to deploy
to the instance for loading and making inferences to the model. For example,
‘ml.eia1.medium’.
Returns A container definition object usable with the CreateModel API.
Return type dict
58 Chapter 3. TensorFlow
sagemaker, Release 1.18.0
predict(data, initial_args=None)
Return the inference from the specified endpoint.
Parameters
• data (object) – Input data for which you want the model to provide inference.
If a serializer was specified when creating the RealTimePredictor, the result of the
serializer is sent as input data. Otherwise the data must be sequence of bytes, and the
predict method then sends the bytes in the request body as is.
• initial_args (dict[str,str]) – Optional. Default arguments for boto3
invoke_endpoint call. Default is None (no default arguments).
Returns
Inference for the given input. If a deserializer was specified when creating the Real-
TimePredictor, the result of the deserializer is returned. Otherwise the response re-
turns the sequence of bytes as is.
Return type object
3.1. TensorFlow 59
sagemaker, Release 1.18.0
60 Chapter 3. TensorFlow
CHAPTER 4
Scikit-Learn
61
sagemaker, Release 1.18.0
62 Chapter 4. Scikit-Learn
sagemaker, Release 1.18.0
64 Chapter 4. Scikit-Learn
CHAPTER 5
PyTorch
5.1 PyTorch
65
sagemaker, Release 1.18.0
66 Chapter 5. PyTorch
sagemaker, Release 1.18.0
• role (str) – An AWS IAM role (either name or full ARN). The Amazon SageMaker
training jobs and APIs that create Amazon SageMaker endpoints use this role to access
training data and model artifacts. After the endpoint is created, the inference code might
use the IAM role, if it needs to access an AWS resource.
• entry_point (str) – Path (absolute or relative) to the Python source file which
should be executed as the entry point to model hosting. This should be compatible with
either Python 2.7 or Python 3.5.
• image (str) – A Docker image URI (default: None). If not specified, a default image
for PyTorch will be used.
• py_version (str) – Python version you want to use for executing your model training
code (default: ‘py3’).
• framework_version (str) – PyTorch version you want to use for executing your
model training code.
• predictor_cls (callable[str, sagemaker.session.Session]) – A
function to call to create a predictor with an endpoint name and SageMaker Session. If
specified, deploy() returns the result of invoking this function on the created endpoint
name.
• model_server_workers (int) – Optional. The number of worker processes used
by the inference server. If None, server will use one worker per vCPU.
• **kwargs – Keyword arguments passed to the FrameworkModel initializer.
prepare_container_def(instance_type, accelerator_type=None)
Return a container definition with framework configuration set in model environment variables.
Parameters
• instance_type (str) – The EC2 instance type to deploy this Model to. For
example, ‘ml.p2.xlarge’.
• accelerator_type (str) – The Elastic Inference accelerator type to deploy
to the instance for loading and making inferences to the model. For example,
‘ml.eia1.medium’.
Returns A container definition object usable with the CreateModel API.
Return type dict[str, str]
5.1. PyTorch 67
sagemaker, Release 1.18.0
68 Chapter 5. PyTorch
CHAPTER 6
Chainer
6.1 Chainer
69
sagemaker, Release 1.18.0
• use_mpi (bool) – If true, entry point is run as an MPI script. By default, the Chainer
Framework runs the entry point with ‘mpirun’ if more than one instance is used.
• num_processes (int) – Total number of processes to run the entry point with. By
default, the Chainer Framework runs one process per GPU (on GPU instances), or one
process per host (on CPU instances).
• process_slots_per_host (int) – The number of processes that can run on each
instance. By default, this is set to the number of GPUs on the instance (on GPU in-
stances), or one (on CPU instances).
• additional_mpi_options (str) – String of options to the ‘mpirun’ command
used to run the entry point. For example, ‘-X NCCL_DEBUG=WARN’ will pass that
option string to the mpirun command.
• source_dir (str) – Path (absolute or relative) to a directory with any other training
source code dependencies aside from tne entry point file (default: None). Structure within
this directory are preserved when training on Amazon SageMaker.
• hyperparameters (dict) – Hyperparameters that will be used for training (default:
None). The hyperparameters are made accessible as a dict[str, str] to the training code
on SageMaker. For convenience, this accepts other types for keys and values, but str()
will be called to convert them before training.
• py_version (str) – Python version you want to use for executing your model training
code (default: ‘py2’). One of ‘py2’ or ‘py3’.
• framework_version (str) – Chainer version you want to use for execut-
ing your model training code. List of supported versions https://github.com/aws/
sagemaker-python-sdk#chainer-sagemaker-estimators
• image_name (str) – If specified, the estimator will use this image for training and
hosting, instead of selecting the appropriate SageMaker official image based on frame-
work_version and py_version. It can be an ECR url or dockerhub image and tag. ..
rubric:: Examples
123.dkr.ecr.us-west-2.amazonaws.com/my-custom-image:1.0 custom-image:latest.
• **kwargs – Additional kwargs passed to the Framework constructor.
LATEST_VERSION = '5.0.0'
hyperparameters()
Return hyperparameters used by your custom Chainer code during training.
create_model(model_server_workers=None, role=None, vpc_config_override=’VPC_CONFIG_DEFAULT’)
Create a SageMaker ChainerModel object that can be deployed to an Endpoint.
Parameters
• role (str) – The ExecutionRoleArn IAM Role ARN for the Model, which is
also used during transform jobs. If not specified, the role from the Estimator will be
used.
• model_server_workers (int) – Optional. The number of worker processes
used by the inference server. If None, server will use one worker per vCPU.
• vpc_config_override (dict[str, list[str]]) – Optional override for
VpcConfig set on the model. Default: use subnets and security groups from this
Estimator. * ‘Subnets’ (list[str]): List of subnet ids. * ‘SecurityGroupIds’ (list[str]):
List of security group ids.
Returns
70 Chapter 6. Chainer
sagemaker, Release 1.18.0
6.1. Chainer 71
sagemaker, Release 1.18.0
72 Chapter 6. Chainer
CHAPTER 7
Reinforcement Learning
A managed environment for Reinforcement Learning training and hosting on Amazon SageMaker
7.1 RLEstimator
73
sagemaker, Release 1.18.0
• toolkit_version (str) – RL toolkit version you want to be use for executing your
model training code.
• framework (sagemaker.rl.RLFramework) – Framework (MXNet or Tensor-
Flow) you want to be used as a toolkit backed for reinforcement learning training.
• source_dir (str) – Path (absolute or relative) to a directory with any other training
source code dependencies aside from the entry point file (default: None). Structure within
this directory is preserved when training on Amazon SageMaker.
• hyperparameters (dict) – Hyperparameters that will be used for training (default:
None). The hyperparameters are made accessible as a dict[str, str] to the training code on
SageMaker. For convenience, this accepts other types for keys and values.
• image_name (str) – An ECR url. If specified, the estimator will use this im-
age for training and hosting, instead of selecting the appropriate SageMaker official
image based on framework_version and py_version. Example: 123.dkr.ecr.us-west-
2.amazonaws.com/my-custom-image:1.0
• metric_definitions (list[dict]) – A list of dictionaries that defines the met-
ric(s) used to evaluate the training jobs. Each dictionary contains two keys: ‘Name’ for
the name of the metric, and ‘Regex’ for the regular expression used to extract the metric
from the logs. This should be defined only for jobs that don’t use an Amazon algorithm.
• **kwargs – Additional kwargs passed to the Framework constructor.
COACH_LATEST_VERSION = '0.11.0'
RAY_LATEST_VERSION = '0.5.3'
create_model(role=None, vpc_config_override=’VPC_CONFIG_DEFAULT’, entry_point=None,
source_dir=None, dependencies=None)
Create a SageMaker RLEstimatorModel object that can be deployed to an Endpoint.
Parameters
• role (str) – The ExecutionRoleArn IAM Role ARN for the Model, which is
also used during transform jobs. If not specified, the role from the Estimator will be
used.
• vpc_config_override (dict[str, list[str]]) – Optional override for
VpcConfig set on the model. Default: use subnets and security groups from this
Estimator.
– ’Subnets’ (list[str]): List of subnet ids.
– ’SecurityGroupIds’ (list[str]): List of security group ids.
• entry_point (str) – Path (absolute or relative) to the Python source file which
should be executed as the entry point for MXNet hosting. This should be compatible
with Python 3.5 (default: self.entry_point)
• source_dir (str) – Path (absolute or relative) to a directory with any other train-
ing source code dependencies aside from tne entry point file (default: self.source_dir).
Structure within this directory are preserved when hosting on Amazon SageMaker.
• dependencies (list[str]) – A list of paths to directories (absolute or rel-
ative) with any additional libraries that will be exported to the container (default:
self.dependencies). The library folders will be copied to SageMaker in the same
folder where the entry_point is copied. If the `source_dir` points to S3, code
will be uploaded and the S3 location will be used instead.
Returns
7.1. RLEstimator 75
sagemaker, Release 1.18.0
SparkML Serving
Parameters
• role (str) – An AWS IAM role (either name or full ARN). The Amazon SageMaker
training jobs and APIs that create Amazon SageMaker endpoints use this role to access
training data and model artifacts. After the endpoint is created, the inference code might
use the IAM role, if it needs to access an AWS resource.
• spark_version (str) – Spark version you want to use for executing the inference
(default: ‘2.2’).
• sagemaker_session (sagemaker.session.Session) – Session object which
manages interactions with Amazon SageMaker APIs and any other AWS services needed.
If not specified, the estimator creates one using the default AWS configuration chain. For
local mode, please do not pass this variable.
77
sagemaker, Release 1.18.0
Amazon provides implementations of some common machine learning algortithms optimized for GPU architecture
and massive datasets.
79
sagemaker, Release 1.18.0
The fit() method, which does the model training, calls this method to find the image to use for model
training.
Returns The URI of the Docker image.
Return type str
hyperparameters()
Return the hyperparameters as a dictionary to use for training.
The fit() method, which trains the model, calls this method to find the hyperparameters.
Returns The hyperparameters.
Return type dict[str, str]
data_location
fit(records, mini_batch_size=None, wait=True, logs=True, job_name=None)
Fit this Estimator on serialized Record objects, stored in S3.
records should be an instance of RecordSet. This defines a collection of S3 data files to train this
Estimator on.
Training data is expected to be encoded as dense or sparse vectors in the “values” feature on each Record.
If the data is labeled, the label is expected to be encoded as a list of scalas in the “values” feature of the
Record label.
More information on the Amazon Record format is available at: https://docs.aws.amazon.com/sagemaker/
latest/dg/cdf-training.html
See record_set() to construct a RecordSet object from ndarray arrays.
Parameters
• records (RecordSet) – The records to train this Estimator on
• mini_batch_size (int or None) – The size of each mini-batch to use when
training. If None, a default value will be used.
• wait (bool) – Whether the call should wait until the job completes (default: True).
• logs (bool) – Whether to show the logs produced by the job. Only meaningful
when wait is True (default: True).
• job_name (str) – Training job name. If not specified, the estimator generates a
default job name, based on the training image name and current timestamp.
record_set(train, labels=None, channel=’train’)
Build a RecordSet from a numpy ndarray matrix and label vector.
For the 2D ndarray train, each row is converted to a Record object. The vector is stored in the
“values” entry of the features property of each Record. If labels is not None, each corresponding
label is assigned to the “values” entry of the labels property of each Record.
The collection of Record objects are protobuf serialized and uploaded to new S3 locations. A manifest
file is generated containing the list of objects created and also stored in S3.
The number of S3 objects created is controlled by the train_instance_count property on this
Estimator. One S3 object is created per training instance.
Parameters
• train (numpy.ndarray) – A 2D numpy array of training data.
9.2 FactorizationMachines
9.2. FactorizationMachines 81
sagemaker, Release 1.18.0
Examples
>>> my_estimator.fit(wait=False)
>>> training_job_name = my_estimator.latest_training_job.name
Later on:
>>> attached_estimator = Estimator.attach(training_job_name)
>>> attached_estimator.deploy()
Returns Instance of the calling Estimator Class with the attached training job.
9.2. FactorizationMachines 83
sagemaker, Release 1.18.0
• framework (str) – The framework that is used to train the original model. Allowed
values: ‘mxnet’, ‘tensorflow’, ‘pytorch’, ‘onnx’, ‘xgboost’
• framework_version (str) – The version of the framework
• compile_max_run (int) – Timeout in seconds for compilation (default: 3 * 60).
After this amount of time Amazon SageMaker Neo terminates the compilation job
regardless of its current status.
• tags (list[dict]) – List of tags for labeling a compilation job. For more, see
https://docs.aws.amazon.com/sagemaker/latest/dg/API_Tag.html.
• **kwargs – Passed to invocation of create_model(). Implementations may
customize create_model() to accept **kwargs to customize model creation
during deploy. For more, see the implementation docs.
Returns A SageMaker Model object. See Model() for full details.
Return type sagemaker.model.Model
data_location
delete_endpoint()
Delete an Amazon SageMaker Endpoint.
Raises ValueError – If the endpoint does not exist.
deploy(initial_instance_count, instance_type, accelerator_type=None, endpoint_name=None,
use_compiled_model=False, **kwargs)
Deploy the trained model to an Amazon SageMaker endpoint and return a sagemaker.
RealTimePredictor object.
More information: http://docs.aws.amazon.com/sagemaker/latest/dg/how-it-works-training.html
Parameters
• initial_instance_count (int) – Minimum number of EC2 instances to de-
ploy to an endpoint for prediction.
• instance_type (str) – Type of EC2 instance to deploy to an endpoint for pre-
diction, for example, ‘ml.c4.xlarge’.
• accelerator_type (str) – Type of Elastic Inference accelerator to attach to
an endpoint for model loading and inference, for example, ‘ml.eia1.medium’. If not
specified, no Elastic Inference accelerator will be attached to the endpoint. For more
information: https://docs.aws.amazon.com/sagemaker/latest/dg/ei.html
• endpoint_name (str) – Name to use for creating an Amazon SageMaker end-
point. If not specified, the name of the training job is used.
• use_compiled_model (bool) – Flag to select whether to use compiled (opti-
mized) model. Default: False.
• **kwargs – Passed to invocation of create_model(). Implementations may
customize create_model() to accept **kwargs to customize model creation
during deploy. For more, see the implementation docs.
Returns
A predictor that provides a predict() method, which can be used to send requests
to the Amazon SageMaker endpoint and obtain inferences.
Return type sagemaker.predictor.RealTimePredictor
enable_network_isolation()
Return True if this Estimator will need network isolation to run.
Returns Whether this Estimator needs network isolation or not.
Return type bool
fit(records, mini_batch_size=None, wait=True, logs=True, job_name=None)
Fit this Estimator on serialized Record objects, stored in S3.
records should be an instance of RecordSet. This defines a collection of S3 data files to train this
Estimator on.
Training data is expected to be encoded as dense or sparse vectors in the “values” feature on each Record.
If the data is labeled, the label is expected to be encoded as a list of scalas in the “values” feature of the
Record label.
More information on the Amazon Record format is available at: https://docs.aws.amazon.com/sagemaker/
latest/dg/cdf-training.html
See record_set() to construct a RecordSet object from ndarray arrays.
Parameters
• records (RecordSet) – The records to train this Estimator on
• mini_batch_size (int or None) – The size of each mini-batch to use when
training. If None, a default value will be used.
• wait (bool) – Whether the call should wait until the job completes (default: True).
• logs (bool) – Whether to show the logs produced by the job. Only meaningful
when wait is True (default: True).
• job_name (str) – Training job name. If not specified, the estimator generates a
default job name, based on the training image name and current timestamp.
get_vpc_config(vpc_config_override=’VPC_CONFIG_DEFAULT’)
Returns VpcConfig dict either from this Estimator’s subnets and security groups, or else validate and
return an optional override value.
hyperparameters()
Return the hyperparameters as a dictionary to use for training.
The fit() method, which trains the model, calls this method to find the hyperparameters.
Returns The hyperparameters.
Return type dict[str, str]
model_data
The model location in S3. Only set if Estimator has been fit().
Type str
record_set(train, labels=None, channel=’train’)
Build a RecordSet from a numpy ndarray matrix and label vector.
For the 2D ndarray train, each row is converted to a Record object. The vector is stored in the
“values” entry of the features property of each Record. If labels is not None, each corresponding
label is assigned to the “values” entry of the labels property of each Record.
The collection of Record objects are protobuf serialized and uploaded to new S3 locations. A manifest
file is generated containing the list of objects created and also stored in S3.
9.2. FactorizationMachines 85
sagemaker, Release 1.18.0
• tags (list[dict]) – List of tags for labeling a transform job. If none specified,
then the tags used for the training job are used for the transform job.
• role (str) – The ExecutionRoleArn IAM Role ARN for the Model, which is
also used during transform jobs. If not specified, the role from the Estimator will be
used.
• volume_kms_key (str) – Optional. KMS key ID for encrypting the volume at-
tached to the ML compute instance (default: None).
create_model(vpc_config_override=’VPC_CONFIG_DEFAULT’)
Return a FactorizationMachinesModel referencing the latest s3 model data produced by this
Estimator.
Parameters vpc_config_override (dict[str, list[str]]) – Optional over-
ride for VpcConfig set on the model. Default: use subnets and security groups from
this Estimator. * ‘Subnets’ (list[str]): List of subnet ids. * ‘SecurityGroupIds’ (list[str]):
List of security group ids.
class sagemaker.FactorizationMachinesModel(model_data, role, sagemaker_session=None,
**kwargs)
Bases: sagemaker.model.Model
Reference S3 model data created by FactorizationMachines estimator. Calling deploy() creates an Endpoint
and returns FactorizationMachinesPredictor.
class sagemaker.FactorizationMachinesPredictor(endpoint, sagemaker_session=None)
Bases: sagemaker.predictor.RealTimePredictor
Performs binary-classification or regression prediction from input vectors.
The implementation of predict() in this RealTimePredictor requires a numpy ndarray as input. The array
should contain the same number of columns as the feature-dimension of the data used to fit the model this
Predictor performs inference on.
predict() returns a list of Record objects, one for each row in the input ndarray. The prediction is
stored in the "score" key of the Record.label field. Please refer to the formats details described: https:
//docs.aws.amazon.com/sagemaker/latest/dg/fm-in-formats.html
9.3 IP Insights
9.3. IP Insights 87
sagemaker, Release 1.18.0
For further information on the AWS IPInsights algorithm, please consult AWS technical documentation: https:
//docs.aws.amazon.com/sagemaker/latest/dg/ip-insights-hyperparameters.html
Parameters
• role (str) – An AWS IAM role (either name or full ARN). The Amazon SageMaker
training jobs and APIs that create Amazon SageMaker endpoints use this role to access
training data and model artifacts. After the endpoint is created, the inference code might
use the IAM role, if accessing AWS resource.
• train_instance_count (int) – Number of Amazon EC2 instances to use for
training.
• train_instance_type (str) – Type of EC2 instance to use for training, for exam-
ple, ‘ml.m5.xlarge’.
• num_entity_vectors (int) – Required. The number of embeddings to train for
entities accessing online resources. We recommend 2x the total number of unique entity
IDs.
• vector_dim (int) – Required. The size of the embedding vectors for both entity and
IP addresses.
• batch_metrics_publish_interval (int) – Optional. The period at which to
publish metrics (batches).
• epochs (int) – Optional. Maximum number of passes over the training data.
• learning_rate (float) – Optional. Learning rate for the optimizer.
• num_ip_encoder_layers (int) – Optional. The number of fully-connected layers
to encode IP address embedding.
• random_negative_sampling_rate (int) – Optional. The ratio of random neg-
ative samples to draw during training. Random negative samples are randomly drawn
IPv4 addresses.
• shuffled_negative_sampling_rate (int) – Optional. The ratio of shuffled
negative samples to draw during training. Shuffled negative samples are IP addresses
picked from within a batch.
• weight_decay (float) – Optional. Weight decay coefficient. Adds L2 regulariza-
tion.
• **kwargs – base class keyword argument values.
repo_name = 'ipinsights'
repo_version = 1
MINI_BATCH_SIZE = 10000
classmethod attach(training_job_name, sagemaker_session=None,
model_channel_name=’model’)
Attach to an existing training job.
Create an Estimator bound to an existing training job, each subclass is responsible to implement
_prepare_init_params_from_job_description() as this method delegates the actual con-
version of a training job description to the arguments that the class constructor expects. After attaching,
if the training job has a Complete status, it can be deploy() ed to create a SageMaker Endpoint and
return a Predictor.
If the training job is in progress, attach will block and display log messages from the training job, until
the training job completes.
Parameters
• training_job_name (str) – The name of the training job to attach to.
• sagemaker_session (sagemaker.session.Session) – Session object
which manages interactions with Amazon SageMaker APIs and any other AWS ser-
vices needed. If not specified, the estimator creates one using the default AWS con-
figuration chain.
• model_channel_name (str) – Name of the channel where pre-trained model
data will be downloaded (default: ‘model’). If no channel with the same name exists
in the training job, this option will be ignored.
Examples
>>> my_estimator.fit(wait=False)
>>> training_job_name = my_estimator.latest_training_job.name
Later on:
>>> attached_estimator = Estimator.attach(training_job_name)
>>> attached_estimator.deploy()
Returns Instance of the calling Estimator Class with the attached training job.
9.3. IP Insights 89
sagemaker, Release 1.18.0
delete_endpoint()
Delete an Amazon SageMaker Endpoint.
Raises ValueError – If the endpoint does not exist.
deploy(initial_instance_count, instance_type, accelerator_type=None, endpoint_name=None,
use_compiled_model=False, **kwargs)
Deploy the trained model to an Amazon SageMaker endpoint and return a sagemaker.
RealTimePredictor object.
More information: http://docs.aws.amazon.com/sagemaker/latest/dg/how-it-works-training.html
Parameters
• initial_instance_count (int) – Minimum number of EC2 instances to de-
ploy to an endpoint for prediction.
• instance_type (str) – Type of EC2 instance to deploy to an endpoint for pre-
diction, for example, ‘ml.c4.xlarge’.
• accelerator_type (str) – Type of Elastic Inference accelerator to attach to
an endpoint for model loading and inference, for example, ‘ml.eia1.medium’. If not
specified, no Elastic Inference accelerator will be attached to the endpoint. For more
information: https://docs.aws.amazon.com/sagemaker/latest/dg/ei.html
• endpoint_name (str) – Name to use for creating an Amazon SageMaker end-
point. If not specified, the name of the training job is used.
• use_compiled_model (bool) – Flag to select whether to use compiled (opti-
mized) model. Default: False.
• **kwargs – Passed to invocation of create_model(). Implementations may
customize create_model() to accept **kwargs to customize model creation
during deploy. For more, see the implementation docs.
Returns
A predictor that provides a predict() method, which can be used to send requests
to the Amazon SageMaker endpoint and obtain inferences.
Return type sagemaker.predictor.RealTimePredictor
enable_network_isolation()
Return True if this Estimator will need network isolation to run.
Returns Whether this Estimator needs network isolation or not.
Return type bool
fit(records, mini_batch_size=None, wait=True, logs=True, job_name=None)
Fit this Estimator on serialized Record objects, stored in S3.
records should be an instance of RecordSet. This defines a collection of S3 data files to train this
Estimator on.
Training data is expected to be encoded as dense or sparse vectors in the “values” feature on each Record.
If the data is labeled, the label is expected to be encoded as a list of scalas in the “values” feature of the
Record label.
More information on the Amazon Record format is available at: https://docs.aws.amazon.com/sagemaker/
latest/dg/cdf-training.html
See record_set() to construct a RecordSet object from ndarray arrays.
Parameters
9.3. IP Insights 91
sagemaker, Release 1.18.0
9.4 K-means
9.4. K-means 93
sagemaker, Release 1.18.0
Examples
>>> my_estimator.fit(wait=False)
>>> training_job_name = my_estimator.latest_training_job.name
Later on:
>>> attached_estimator = Estimator.attach(training_job_name)
>>> attached_estimator.deploy()
Returns Instance of the calling Estimator Class with the attached training job.
9.4. K-means 95
sagemaker, Release 1.18.0
get_vpc_config(vpc_config_override=’VPC_CONFIG_DEFAULT’)
Returns VpcConfig dict either from this Estimator’s subnets and security groups, or else validate and
return an optional override value.
model_data
The model location in S3. Only set if Estimator has been fit().
Type str
record_set(train, labels=None, channel=’train’)
Build a RecordSet from a numpy ndarray matrix and label vector.
For the 2D ndarray train, each row is converted to a Record object. The vector is stored in the
“values” entry of the features property of each Record. If labels is not None, each corresponding
label is assigned to the “values” entry of the labels property of each Record.
The collection of Record objects are protobuf serialized and uploaded to new S3 locations. A manifest
file is generated containing the list of objects created and also stored in S3.
The number of S3 objects created is controlled by the train_instance_count property on this
Estimator. One S3 object is created per training instance.
Parameters
• train (numpy.ndarray) – A 2D numpy array of training data.
• labels (numpy.ndarray) – A 1D numpy array of labels. Its length must be
equal to the number of rows in train.
• channel (str) – The SageMaker TrainingJob channel this RecordSet should be
assigned to.
Returns A RecordSet referencing the encoded, uploading training and label data.
Return type RecordSet
train_image()
Return the Docker image to use for training.
The fit() method, which does the model training, calls this method to find the image to use for model
training.
Returns The URI of the Docker image.
Return type str
training_job_analytics
Return a TrainingJobAnalytics object for the current training job.
transformer(instance_count, instance_type, strategy=None, assemble_with=None,
output_path=None, output_kms_key=None, accept=None, env=None,
max_concurrent_transforms=None, max_payload=None, tags=None, role=None,
volume_kms_key=None)
Return a Transformer that uses a SageMaker Model based on the training job. It reuses the SageMaker
Session and base job name used by the Estimator.
Parameters
• instance_count (int) – Number of EC2 instances to use.
• instance_type (str) – Type of EC2 instance to use, for example, ‘ml.c4.xlarge’.
• strategy (str) – The strategy used to decide how to batch records in a single re-
quest (default: None). Valid values: ‘MULTI_RECORD’ and ‘SINGLE_RECORD’.
9.4. K-means 97
sagemaker, Release 1.18.0
Parameters
• train_instance_type (str) – Type of EC2 instance to use for training, for exam-
ple, ‘ml.c4.xlarge’.
• k (int) – Required. Number of nearest neighbors.
• sample_size (int) – Required. Number of data points to be sampled from the train-
ing data set.
• predictor_type (str) – Required. Type of inference to use on the data’s labels,
allowed values are ‘classifier’ and ‘regressor’.
• dimension_reduction_type (str) – Optional. Type of dimension reduction
technique to use. Valid values: “sign”, “fjlt”
• dimension_reduction_target (int) – Optional. Target dimension to reduce to.
Required when dimension_reduction_type is specified.
• index_type (str) – Optional. Type of index to use. Valid values are “faiss.Flat”,
“faiss.IVFFlat”, “faiss.IVFPQ”.
• index_metric (str) – Optional. Distance metric to measure between points when
finding nearest neighbors. Valid values are “COSINE”, “INNER_PRODUCT”, “L2”
• faiss_index_ivf_nlists (str) – Optional. Number of centroids to construct in
the index if index_type is “faiss.IVFFlat” or “faiss.IVFPQ”.
• faiss_index_pq_m (int) – Optional. Number of vector sub-components to con-
struct in the index, if index_type is “faiss.IVFPQ”.
• **kwargs – base class keyword argument values.
repo_name = 'knn'
repo_version = 1
classmethod attach(training_job_name, sagemaker_session=None,
model_channel_name=’model’)
Attach to an existing training job.
Create an Estimator bound to an existing training job, each subclass is responsible to implement
_prepare_init_params_from_job_description() as this method delegates the actual con-
version of a training job description to the arguments that the class constructor expects. After attaching,
if the training job has a Complete status, it can be deploy() ed to create a SageMaker Endpoint and
return a Predictor.
If the training job is in progress, attach will block and display log messages from the training job, until
the training job completes.
Parameters
• training_job_name (str) – The name of the training job to attach to.
• sagemaker_session (sagemaker.session.Session) – Session object
which manages interactions with Amazon SageMaker APIs and any other AWS ser-
vices needed. If not specified, the estimator creates one using the default AWS con-
figuration chain.
• model_channel_name (str) – Name of the channel where pre-trained model
data will be downloaded (default: ‘model’). If no channel with the same name exists
in the training job, this option will be ignored.
Examples
>>> my_estimator.fit(wait=False)
>>> training_job_name = my_estimator.latest_training_job.name
Later on:
>>> attached_estimator = Estimator.attach(training_job_name)
>>> attached_estimator.deploy()
Returns Instance of the calling Estimator Class with the attached training job.
• role (str) – The ExecutionRoleArn IAM Role ARN for the Model, which is
also used during transform jobs. If not specified, the role from the Estimator will be
used.
• volume_kms_key (str) – Optional. KMS key ID for encrypting the volume at-
tached to the ML compute instance (default: None).
create_model(vpc_config_override=’VPC_CONFIG_DEFAULT’)
Return a KNNModel referencing the latest s3 model data produced by this Estimator.
Parameters vpc_config_override (dict[str, list[str]]) – Optional over-
ride for VpcConfig set on the model. Default: use subnets and security groups from
this Estimator. * ‘Subnets’ (list[str]): List of subnet ids. * ‘SecurityGroupIds’ (list[str]):
List of security group ids.
class sagemaker.KNNModel(model_data, role, sagemaker_session=None, **kwargs)
Bases: sagemaker.model.Model
Reference S3 model data created by KNN estimator. Calling deploy() creates an Endpoint and returns
KNNPredictor.
class sagemaker.KNNPredictor(endpoint, sagemaker_session=None)
Bases: sagemaker.predictor.RealTimePredictor
Performs classification or regression prediction from input vectors.
The implementation of predict() in this RealTimePredictor requires a numpy ndarray as input. The array
should contain the same number of columns as the feature-dimension of the data used to fit the model this
Predictor performs inference on.
predict() returns a list of Record objects, one for each row in the input ndarray. The prediction is stored
in the "predicted_label" key of the Record.label field.
9.6 LDA
LDA Estimators can be configured by setting hyperparameters. The available hyperparameters for LDA are
documented below.
For further information on the AWS LDA algorithm, please consult AWS technical documentation: https://docs.
aws.amazon.com/sagemaker/latest/dg/lda.html
Parameters
• role (str) – An AWS IAM role (either name or full ARN). The Amazon SageMaker
training jobs and APIs that create Amazon SageMaker endpoints use this role to access
training data and model artifacts. After the endpoint is created, the inference code might
use the IAM role, if accessing AWS resource.
• train_instance_type (str) – Type of EC2 instance to use for training, for exam-
ple, ‘ml.c4.xlarge’.
• num_topics (int) – The number of topics for LDA to find within the data.
• alpha0 (float) – Optional. Initial guess for the concentration parameter
• max_restarts (int) – Optional. The number of restarts to perform during the Alter-
nating Least Squares (ALS) spectral decomposition phase of the algorithm.
• max_iterations (int) – Optional. The maximum number of iterations to perform
during the ALS phase of the algorithm.
• tol (float) – Optional. Target error tolerance for the ALS phase of the algorithm.
• **kwargs – base class keyword argument values.
repo_name = 'lda'
repo_version = 1
create_model(vpc_config_override=’VPC_CONFIG_DEFAULT’)
Return a LDAModel referencing the latest s3 model data produced by this Estimator.
Parameters vpc_config_override (dict[str, list[str]]) – Optional over-
ride for VpcConfig set on the model. Default: use subnets and security groups from
this Estimator. * ‘Subnets’ (list[str]): List of subnet ids. * ‘SecurityGroupIds’ (list[str]):
List of security group ids.
classmethod attach(training_job_name, sagemaker_session=None,
model_channel_name=’model’)
Attach to an existing training job.
Create an Estimator bound to an existing training job, each subclass is responsible to implement
_prepare_init_params_from_job_description() as this method delegates the actual con-
version of a training job description to the arguments that the class constructor expects. After attaching,
if the training job has a Complete status, it can be deploy() ed to create a SageMaker Endpoint and
return a Predictor.
If the training job is in progress, attach will block and display log messages from the training job, until
the training job completes.
Parameters
• training_job_name (str) – The name of the training job to attach to.
• sagemaker_session (sagemaker.session.Session) – Session object
which manages interactions with Amazon SageMaker APIs and any other AWS ser-
vices needed. If not specified, the estimator creates one using the default AWS con-
figuration chain.
Examples
>>> my_estimator.fit(wait=False)
>>> training_job_name = my_estimator.latest_training_job.name
Later on:
>>> attached_estimator = Estimator.attach(training_job_name)
>>> attached_estimator.deploy()
Returns Instance of the calling Estimator Class with the attached training job.
• job_name (str) – Training job name. If not specified, the estimator generates a
default job name, based on the training image name and current timestamp.
get_vpc_config(vpc_config_override=’VPC_CONFIG_DEFAULT’)
Returns VpcConfig dict either from this Estimator’s subnets and security groups, or else validate and
return an optional override value.
hyperparameters()
Return the hyperparameters as a dictionary to use for training.
The fit() method, which trains the model, calls this method to find the hyperparameters.
Returns The hyperparameters.
Return type dict[str, str]
model_data
The model location in S3. Only set if Estimator has been fit().
Type str
record_set(train, labels=None, channel=’train’)
Build a RecordSet from a numpy ndarray matrix and label vector.
For the 2D ndarray train, each row is converted to a Record object. The vector is stored in the
“values” entry of the features property of each Record. If labels is not None, each corresponding
label is assigned to the “values” entry of the labels property of each Record.
The collection of Record objects are protobuf serialized and uploaded to new S3 locations. A manifest
file is generated containing the list of objects created and also stored in S3.
The number of S3 objects created is controlled by the train_instance_count property on this
Estimator. One S3 object is created per training instance.
Parameters
• train (numpy.ndarray) – A 2D numpy array of training data.
• labels (numpy.ndarray) – A 1D numpy array of labels. Its length must be
equal to the number of rows in train.
• channel (str) – The SageMaker TrainingJob channel this RecordSet should be
assigned to.
Returns A RecordSet referencing the encoded, uploading training and label data.
Return type RecordSet
train_image()
Return the Docker image to use for training.
The fit() method, which does the model training, calls this method to find the image to use for model
training.
Returns The URI of the Docker image.
Return type str
training_job_analytics
Return a TrainingJobAnalytics object for the current training job.
transformer(instance_count, instance_type, strategy=None, assemble_with=None,
output_path=None, output_kms_key=None, accept=None, env=None,
max_concurrent_transforms=None, max_payload=None, tags=None, role=None,
volume_kms_key=None)
Return a Transformer that uses a SageMaker Model based on the training job. It reuses the SageMaker
9.7 LinearLearner
If the training job is in progress, attach will block and display log messages from the training job, until
the training job completes.
Parameters
• training_job_name (str) – The name of the training job to attach to.
• sagemaker_session (sagemaker.session.Session) – Session object
which manages interactions with Amazon SageMaker APIs and any other AWS ser-
vices needed. If not specified, the estimator creates one using the default AWS con-
figuration chain.
• model_channel_name (str) – Name of the channel where pre-trained model
data will be downloaded (default: ‘model’). If no channel with the same name exists
in the training job, this option will be ignored.
Examples
>>> my_estimator.fit(wait=False)
>>> training_job_name = my_estimator.latest_training_job.name
Later on:
>>> attached_estimator = Estimator.attach(training_job_name)
>>> attached_estimator.deploy()
Returns Instance of the calling Estimator Class with the attached training job.
data_location
delete_endpoint()
Delete an Amazon SageMaker Endpoint.
Raises ValueError – If the endpoint does not exist.
deploy(initial_instance_count, instance_type, accelerator_type=None, endpoint_name=None,
use_compiled_model=False, **kwargs)
Deploy the trained model to an Amazon SageMaker endpoint and return a sagemaker.
RealTimePredictor object.
More information: http://docs.aws.amazon.com/sagemaker/latest/dg/how-it-works-training.html
Parameters
• initial_instance_count (int) – Minimum number of EC2 instances to de-
ploy to an endpoint for prediction.
• instance_type (str) – Type of EC2 instance to deploy to an endpoint for pre-
diction, for example, ‘ml.c4.xlarge’.
• accelerator_type (str) – Type of Elastic Inference accelerator to attach to
an endpoint for model loading and inference, for example, ‘ml.eia1.medium’. If not
specified, no Elastic Inference accelerator will be attached to the endpoint. For more
information: https://docs.aws.amazon.com/sagemaker/latest/dg/ei.html
• endpoint_name (str) – Name to use for creating an Amazon SageMaker end-
point. If not specified, the name of the training job is used.
• use_compiled_model (bool) – Flag to select whether to use compiled (opti-
mized) model. Default: False.
• **kwargs – Passed to invocation of create_model(). Implementations may
customize create_model() to accept **kwargs to customize model creation
during deploy. For more, see the implementation docs.
Returns
A predictor that provides a predict() method, which can be used to send requests
to the Amazon SageMaker endpoint and obtain inferences.
Return type sagemaker.predictor.RealTimePredictor
enable_network_isolation()
Return True if this Estimator will need network isolation to run.
Returns Whether this Estimator needs network isolation or not.
Return type bool
fit(records, mini_batch_size=None, wait=True, logs=True, job_name=None)
Fit this Estimator on serialized Record objects, stored in S3.
records should be an instance of RecordSet. This defines a collection of S3 data files to train this
Estimator on.
Training data is expected to be encoded as dense or sparse vectors in the “values” feature on each Record.
If the data is labeled, the label is expected to be encoded as a list of scalas in the “values” feature of the
Record label.
More information on the Amazon Record format is available at: https://docs.aws.amazon.com/sagemaker/
latest/dg/cdf-training.html
See record_set() to construct a RecordSet object from ndarray arrays.
Parameters
• records (RecordSet) – The records to train this Estimator on
• mini_batch_size (int or None) – The size of each mini-batch to use when
training. If None, a default value will be used.
• wait (bool) – Whether the call should wait until the job completes (default: True).
• logs (bool) – Whether to show the logs produced by the job. Only meaningful
when wait is True (default: True).
• job_name (str) – Training job name. If not specified, the estimator generates a
default job name, based on the training image name and current timestamp.
get_vpc_config(vpc_config_override=’VPC_CONFIG_DEFAULT’)
Returns VpcConfig dict either from this Estimator’s subnets and security groups, or else validate and
return an optional override value.
hyperparameters()
Return the hyperparameters as a dictionary to use for training.
The fit() method, which trains the model, calls this method to find the hyperparameters.
Returns The hyperparameters.
Return type dict[str, str]
model_data
The model location in S3. Only set if Estimator has been fit().
Type str
record_set(train, labels=None, channel=’train’)
Build a RecordSet from a numpy ndarray matrix and label vector.
For the 2D ndarray train, each row is converted to a Record object. The vector is stored in the
“values” entry of the features property of each Record. If labels is not None, each corresponding
label is assigned to the “values” entry of the labels property of each Record.
The collection of Record objects are protobuf serialized and uploaded to new S3 locations. A manifest
file is generated containing the list of objects created and also stored in S3.
The number of S3 objects created is controlled by the train_instance_count property on this
Estimator. One S3 object is created per training instance.
Parameters
• train (numpy.ndarray) – A 2D numpy array of training data.
• labels (numpy.ndarray) – A 1D numpy array of labels. Its length must be
equal to the number of rows in train.
• channel (str) – The SageMaker TrainingJob channel this RecordSet should be
assigned to.
Returns A RecordSet referencing the encoded, uploading training and label data.
Return type RecordSet
train_image()
Return the Docker image to use for training.
The fit() method, which does the model training, calls this method to find the image to use for model
training.
num_point_for_scaler
An algorithm hyperparameter with optional validation. Implemented as a python descriptor object.
margin
An algorithm hyperparameter with optional validation. Implemented as a python descriptor object.
quantile
An algorithm hyperparameter with optional validation. Implemented as a python descriptor object.
loss_insensitivity
An algorithm hyperparameter with optional validation. Implemented as a python descriptor object.
huber_delta
An algorithm hyperparameter with optional validation. Implemented as a python descriptor object.
early_stopping_patience
An algorithm hyperparameter with optional validation. Implemented as a python descriptor object.
early_stopping_tolerance
An algorithm hyperparameter with optional validation. Implemented as a python descriptor object.
num_classes
An algorithm hyperparameter with optional validation. Implemented as a python descriptor object.
accuracy_top_k
An algorithm hyperparameter with optional validation. Implemented as a python descriptor object.
f_beta
An algorithm hyperparameter with optional validation. Implemented as a python descriptor object.
balance_multiclass_weights
An algorithm hyperparameter with optional validation. Implemented as a python descriptor object.
create_model(vpc_config_override=’VPC_CONFIG_DEFAULT’)
Return a LinearLearnerModel referencing the latest s3 model data produced by this Estimator.
Parameters vpc_config_override (dict[str, list[str]]) – Optional over-
ride for VpcConfig set on the model. Default: use subnets and security groups from
this Estimator. * ‘Subnets’ (list[str]): List of subnet ids. * ‘SecurityGroupIds’ (list[str]):
List of security group ids.
class sagemaker.LinearLearnerModel(model_data, role, sagemaker_session=None, **kwargs)
Bases: sagemaker.model.Model
Reference LinearLearner s3 model data. Calling deploy() creates an Endpoint and returns a
LinearLearnerPredictor
class sagemaker.LinearLearnerPredictor(endpoint, sagemaker_session=None)
Bases: sagemaker.predictor.RealTimePredictor
Performs binary-classification or regression prediction from input vectors.
The implementation of predict() in this RealTimePredictor requires a numpy ndarray as input. The array
should contain the same number of columns as the feature-dimension of the data used to fit the model this
Predictor performs inference on.
predict() returns a list of Record objects, one for each row in the input ndarray. The prediction is stored
in the "predicted_label" key of the Record.label field.
9.8 NTM
Examples
>>> my_estimator.fit(wait=False)
>>> training_job_name = my_estimator.latest_training_job.name
Later on:
>>> attached_estimator = Estimator.attach(training_job_name)
>>> attached_estimator.deploy()
Returns Instance of the calling Estimator Class with the attached training job.
• input_shape (dict) – Specifies the name and shape of the expected inputs for
your trained model in json dictionary form, for example: {‘data’:[1,3,1024,1024]}, or
{‘var1’: [1,1,28,28], ‘var2’:[1,1,28,28]}
• output_path (str) – Specifies where to store the compiled model
• framework (str) – The framework that is used to train the original model. Allowed
values: ‘mxnet’, ‘tensorflow’, ‘pytorch’, ‘onnx’, ‘xgboost’
• framework_version (str) – The version of the framework
• compile_max_run (int) – Timeout in seconds for compilation (default: 3 * 60).
After this amount of time Amazon SageMaker Neo terminates the compilation job
regardless of its current status.
• tags (list[dict]) – List of tags for labeling a compilation job. For more, see
https://docs.aws.amazon.com/sagemaker/latest/dg/API_Tag.html.
• **kwargs – Passed to invocation of create_model(). Implementations may
customize create_model() to accept **kwargs to customize model creation
during deploy. For more, see the implementation docs.
Returns A SageMaker Model object. See Model() for full details.
Return type sagemaker.model.Model
data_location
delete_endpoint()
Delete an Amazon SageMaker Endpoint.
Raises ValueError – If the endpoint does not exist.
deploy(initial_instance_count, instance_type, accelerator_type=None, endpoint_name=None,
use_compiled_model=False, **kwargs)
Deploy the trained model to an Amazon SageMaker endpoint and return a sagemaker.
RealTimePredictor object.
More information: http://docs.aws.amazon.com/sagemaker/latest/dg/how-it-works-training.html
Parameters
• initial_instance_count (int) – Minimum number of EC2 instances to de-
ploy to an endpoint for prediction.
• instance_type (str) – Type of EC2 instance to deploy to an endpoint for pre-
diction, for example, ‘ml.c4.xlarge’.
• accelerator_type (str) – Type of Elastic Inference accelerator to attach to
an endpoint for model loading and inference, for example, ‘ml.eia1.medium’. If not
specified, no Elastic Inference accelerator will be attached to the endpoint. For more
information: https://docs.aws.amazon.com/sagemaker/latest/dg/ei.html
• endpoint_name (str) – Name to use for creating an Amazon SageMaker end-
point. If not specified, the name of the training job is used.
• use_compiled_model (bool) – Flag to select whether to use compiled (opti-
mized) model. Default: False.
• **kwargs – Passed to invocation of create_model(). Implementations may
customize create_model() to accept **kwargs to customize model creation
during deploy. For more, see the implementation docs.
Returns
A predictor that provides a predict() method, which can be used to send requests
to the Amazon SageMaker endpoint and obtain inferences.
Return type sagemaker.predictor.RealTimePredictor
enable_network_isolation()
Return True if this Estimator will need network isolation to run.
Returns Whether this Estimator needs network isolation or not.
Return type bool
fit(records, mini_batch_size=None, wait=True, logs=True, job_name=None)
Fit this Estimator on serialized Record objects, stored in S3.
records should be an instance of RecordSet. This defines a collection of S3 data files to train this
Estimator on.
Training data is expected to be encoded as dense or sparse vectors in the “values” feature on each Record.
If the data is labeled, the label is expected to be encoded as a list of scalas in the “values” feature of the
Record label.
More information on the Amazon Record format is available at: https://docs.aws.amazon.com/sagemaker/
latest/dg/cdf-training.html
See record_set() to construct a RecordSet object from ndarray arrays.
Parameters
• records (RecordSet) – The records to train this Estimator on
• mini_batch_size (int or None) – The size of each mini-batch to use when
training. If None, a default value will be used.
• wait (bool) – Whether the call should wait until the job completes (default: True).
• logs (bool) – Whether to show the logs produced by the job. Only meaningful
when wait is True (default: True).
• job_name (str) – Training job name. If not specified, the estimator generates a
default job name, based on the training image name and current timestamp.
get_vpc_config(vpc_config_override=’VPC_CONFIG_DEFAULT’)
Returns VpcConfig dict either from this Estimator’s subnets and security groups, or else validate and
return an optional override value.
hyperparameters()
Return the hyperparameters as a dictionary to use for training.
The fit() method, which trains the model, calls this method to find the hyperparameters.
Returns The hyperparameters.
Return type dict[str, str]
model_data
The model location in S3. Only set if Estimator has been fit().
Type str
record_set(train, labels=None, channel=’train’)
Build a RecordSet from a numpy ndarray matrix and label vector.
For the 2D ndarray train, each row is converted to a Record object. The vector is stored in the
“values” entry of the features property of each Record. If labels is not None, each corresponding
label is assigned to the “values” entry of the labels property of each Record.
The collection of Record objects are protobuf serialized and uploaded to new S3 locations. A manifest
file is generated containing the list of objects created and also stored in S3.
The number of S3 objects created is controlled by the train_instance_count property on this
Estimator. One S3 object is created per training instance.
Parameters
• train (numpy.ndarray) – A 2D numpy array of training data.
• labels (numpy.ndarray) – A 1D numpy array of labels. Its length must be
equal to the number of rows in train.
• channel (str) – The SageMaker TrainingJob channel this RecordSet should be
assigned to.
Returns A RecordSet referencing the encoded, uploading training and label data.
Return type RecordSet
train_image()
Return the Docker image to use for training.
The fit() method, which does the model training, calls this method to find the image to use for model
training.
Returns The URI of the Docker image.
Return type str
training_job_analytics
Return a TrainingJobAnalytics object for the current training job.
transformer(instance_count, instance_type, strategy=None, assemble_with=None,
output_path=None, output_kms_key=None, accept=None, env=None,
max_concurrent_transforms=None, max_payload=None, tags=None, role=None,
volume_kms_key=None)
Return a Transformer that uses a SageMaker Model based on the training job. It reuses the SageMaker
Session and base job name used by the Estimator.
Parameters
• instance_count (int) – Number of EC2 instances to use.
• instance_type (str) – Type of EC2 instance to use, for example, ‘ml.c4.xlarge’.
• strategy (str) – The strategy used to decide how to batch records in a single re-
quest (default: None). Valid values: ‘MULTI_RECORD’ and ‘SINGLE_RECORD’.
• assemble_with (str) – How the output is assembled (default: None). Valid
values: ‘Line’ or ‘None’.
• output_path (str) – S3 location for saving the transform result. If not specified,
results are stored to a default bucket.
• output_kms_key (str) – Optional. KMS key ID for encrypting the transform
output (default: None).
• accept (str) – The content type accepted by the endpoint deployed during the
transform job.
• env (dict) – Environment variables to be set for use during the transform job (de-
fault: None).
• max_concurrent_transforms (int) – The maximum number of HTTP re-
quests to be made to each individual transform container at one time.
9.9 Object2Vec
Examples
>>> my_estimator.fit(wait=False)
>>> training_job_name = my_estimator.latest_training_job.name
Later on:
>>> attached_estimator = Estimator.attach(training_job_name)
>>> attached_estimator.deploy()
Returns Instance of the calling Estimator Class with the attached training job.
get_vpc_config(vpc_config_override=’VPC_CONFIG_DEFAULT’)
Returns VpcConfig dict either from this Estimator’s subnets and security groups, or else validate and
return an optional override value.
hyperparameters()
Return the hyperparameters as a dictionary to use for training.
The fit() method, which trains the model, calls this method to find the hyperparameters.
Returns The hyperparameters.
Return type dict[str, str]
model_data
The model location in S3. Only set if Estimator has been fit().
Type str
record_set(train, labels=None, channel=’train’)
Build a RecordSet from a numpy ndarray matrix and label vector.
For the 2D ndarray train, each row is converted to a Record object. The vector is stored in the
“values” entry of the features property of each Record. If labels is not None, each corresponding
label is assigned to the “values” entry of the labels property of each Record.
The collection of Record objects are protobuf serialized and uploaded to new S3 locations. A manifest
file is generated containing the list of objects created and also stored in S3.
The number of S3 objects created is controlled by the train_instance_count property on this
Estimator. One S3 object is created per training instance.
Parameters
• train (numpy.ndarray) – A 2D numpy array of training data.
• labels (numpy.ndarray) – A 1D numpy array of labels. Its length must be
equal to the number of rows in train.
• channel (str) – The SageMaker TrainingJob channel this RecordSet should be
assigned to.
Returns A RecordSet referencing the encoded, uploading training and label data.
Return type RecordSet
train_image()
Return the Docker image to use for training.
The fit() method, which does the model training, calls this method to find the image to use for model
training.
Returns The URI of the Docker image.
Return type str
training_job_analytics
Return a TrainingJobAnalytics object for the current training job.
transformer(instance_count, instance_type, strategy=None, assemble_with=None,
output_path=None, output_kms_key=None, accept=None, env=None,
max_concurrent_transforms=None, max_payload=None, tags=None, role=None,
volume_kms_key=None)
Return a Transformer that uses a SageMaker Model based on the training job. It reuses the SageMaker
Session and base job name used by the Estimator.
Parameters
9.10 PCA
This Estimator may be fit via calls to fit_ndarray() or fit(). The former allows a PCA model to be fit
on a 2-dimensional numpy array. The latter requires Amazon Record protobuf serialized data to be stored in
S3.
To learn more about the Amazon protobuf Record class and how to prepare bulk data in this format, please
consult AWS technical documentation: https://docs.aws.amazon.com/sagemaker/latest/dg/cdf-training.html
After this Estimator is fit, model data is stored in S3. The model may be deployed to an Amazon SageMaker
Endpoint by invoking deploy(). As well as deploying an Endpoint, deploy returns a PCAPredictor object
that can be used to project input vectors to the learned lower-dimensional representation, using the trained PCA
model hosted in the SageMaker Endpoint.
PCA Estimators can be configured by setting hyperparameters. The available hyperparameters for PCA are
documented below. For further information on the AWS PCA algorithm, please consult AWS technical docu-
mentation: https://docs.aws.amazon.com/sagemaker/latest/dg/pca.html
This Estimator uses Amazon SageMaker PCA to perform training and host deployed models. To
learn more about Amazon SageMaker PCA, please read: https://docs.aws.amazon.com/sagemaker/latest/dg/
how-pca-works.html
Parameters
• role (str) – An AWS IAM role (either name or full ARN). The Amazon SageMaker
training jobs and APIs that create Amazon SageMaker endpoints use this role to access
training data and model artifacts. After the endpoint is created, the inference code might
use the IAM role, if accessing AWS resource.
• train_instance_count (int) – Number of Amazon EC2 instances to use for
training.
• train_instance_type (str) – Type of EC2 instance to use for training, for exam-
ple, ‘ml.c4.xlarge’.
• num_components (int) – The number of principal components. Must be greater than
zero.
• algorithm_mode (str) – Mode for computing the principal components. One of
‘regular’ or ‘randomized’.
• subtract_mean (bool) – Whether the data should be unbiased both during train and
at inference.
• extra_components (int) – As the value grows larger, the solution becomes more
accurate but the runtime and memory consumption increase linearly. If this value is unset
or set to -1, then a default value equal to the maximum of 10 and num_components will
be used. Valid for randomized mode only.
• **kwargs – base class keyword argument values.
repo_name = 'pca'
repo_version = 1
DEFAULT_MINI_BATCH_SIZE = 500
create_model(vpc_config_override=’VPC_CONFIG_DEFAULT’)
Return a PCAModel referencing the latest s3 model data produced by this Estimator.
Parameters vpc_config_override (dict[str, list[str]]) – Optional over-
ride for VpcConfig set on the model. Default: use subnets and security groups from
this Estimator. * ‘Subnets’ (list[str]): List of subnet ids. * ‘SecurityGroupIds’ (list[str]):
List of security group ids.
Examples
>>> my_estimator.fit(wait=False)
>>> training_job_name = my_estimator.latest_training_job.name
Later on:
>>> attached_estimator = Estimator.attach(training_job_name)
>>> attached_estimator.deploy()
Returns Instance of the calling Estimator Class with the attached training job.
• tags (list[dict]) – List of tags for labeling a compilation job. For more, see
https://docs.aws.amazon.com/sagemaker/latest/dg/API_Tag.html.
• **kwargs – Passed to invocation of create_model(). Implementations may
customize create_model() to accept **kwargs to customize model creation
during deploy. For more, see the implementation docs.
Returns A SageMaker Model object. See Model() for full details.
Return type sagemaker.model.Model
data_location
delete_endpoint()
Delete an Amazon SageMaker Endpoint.
Raises ValueError – If the endpoint does not exist.
deploy(initial_instance_count, instance_type, accelerator_type=None, endpoint_name=None,
use_compiled_model=False, **kwargs)
Deploy the trained model to an Amazon SageMaker endpoint and return a sagemaker.
RealTimePredictor object.
More information: http://docs.aws.amazon.com/sagemaker/latest/dg/how-it-works-training.html
Parameters
• initial_instance_count (int) – Minimum number of EC2 instances to de-
ploy to an endpoint for prediction.
• instance_type (str) – Type of EC2 instance to deploy to an endpoint for pre-
diction, for example, ‘ml.c4.xlarge’.
• accelerator_type (str) – Type of Elastic Inference accelerator to attach to
an endpoint for model loading and inference, for example, ‘ml.eia1.medium’. If not
specified, no Elastic Inference accelerator will be attached to the endpoint. For more
information: https://docs.aws.amazon.com/sagemaker/latest/dg/ei.html
• endpoint_name (str) – Name to use for creating an Amazon SageMaker end-
point. If not specified, the name of the training job is used.
• use_compiled_model (bool) – Flag to select whether to use compiled (opti-
mized) model. Default: False.
• **kwargs – Passed to invocation of create_model(). Implementations may
customize create_model() to accept **kwargs to customize model creation
during deploy. For more, see the implementation docs.
Returns
A predictor that provides a predict() method, which can be used to send requests
to the Amazon SageMaker endpoint and obtain inferences.
Return type sagemaker.predictor.RealTimePredictor
enable_network_isolation()
Return True if this Estimator will need network isolation to run.
Returns Whether this Estimator needs network isolation or not.
Return type bool
fit(records, mini_batch_size=None, wait=True, logs=True, job_name=None)
Fit this Estimator on serialized Record objects, stored in S3.
records should be an instance of RecordSet. This defines a collection of S3 data files to train this
Estimator on.
Training data is expected to be encoded as dense or sparse vectors in the “values” feature on each Record.
If the data is labeled, the label is expected to be encoded as a list of scalas in the “values” feature of the
Record label.
More information on the Amazon Record format is available at: https://docs.aws.amazon.com/sagemaker/
latest/dg/cdf-training.html
See record_set() to construct a RecordSet object from ndarray arrays.
Parameters
• records (RecordSet) – The records to train this Estimator on
• mini_batch_size (int or None) – The size of each mini-batch to use when
training. If None, a default value will be used.
• wait (bool) – Whether the call should wait until the job completes (default: True).
• logs (bool) – Whether to show the logs produced by the job. Only meaningful
when wait is True (default: True).
• job_name (str) – Training job name. If not specified, the estimator generates a
default job name, based on the training image name and current timestamp.
get_vpc_config(vpc_config_override=’VPC_CONFIG_DEFAULT’)
Returns VpcConfig dict either from this Estimator’s subnets and security groups, or else validate and
return an optional override value.
hyperparameters()
Return the hyperparameters as a dictionary to use for training.
The fit() method, which trains the model, calls this method to find the hyperparameters.
Returns The hyperparameters.
Return type dict[str, str]
model_data
The model location in S3. Only set if Estimator has been fit().
Type str
record_set(train, labels=None, channel=’train’)
Build a RecordSet from a numpy ndarray matrix and label vector.
For the 2D ndarray train, each row is converted to a Record object. The vector is stored in the
“values” entry of the features property of each Record. If labels is not None, each corresponding
label is assigned to the “values” entry of the labels property of each Record.
The collection of Record objects are protobuf serialized and uploaded to new S3 locations. A manifest
file is generated containing the list of objects created and also stored in S3.
The number of S3 objects created is controlled by the train_instance_count property on this
Estimator. One S3 object is created per training instance.
Parameters
• train (numpy.ndarray) – A 2D numpy array of training data.
• labels (numpy.ndarray) – A 1D numpy array of labels. Its length must be
equal to the number of rows in train.
• volume_kms_key (str) – Optional. KMS key ID for encrypting the volume at-
tached to the ML compute instance (default: None).
class sagemaker.PCAModel(model_data, role, sagemaker_session=None, **kwargs)
Bases: sagemaker.model.Model
Reference PCA s3 model data. Calling deploy() creates an Endpoint and return a Predictor that transforms
vectors to a lower-dimensional representation.
class sagemaker.PCAPredictor(endpoint, sagemaker_session=None)
Bases: sagemaker.predictor.RealTimePredictor
Transforms input vectors to lower-dimesional representations.
The implementation of predict() in this RealTimePredictor requires a numpy ndarray as input. The array
should contain the same number of columns as the feature-dimension of the data used to fit the model this
Predictor performs inference on.
predict() returns a list of Record objects, one for each row in the input ndarray. The lower dimension
vector result is stored in the projection key of the Record.label field.
• train_instance_type (str) – Type of EC2 instance to use for training, for exam-
ple, ‘ml.c4.xlarge’.
• num_samples_per_tree (int) – Optional. The number of samples used to build
each tree in the forest. The total number of samples drawn from the train dataset is
num_trees * num_samples_per_tree.
• num_trees (int) – Optional. The number of trees used in the forest.
• eval_metrics (list) – Optional. JSON list of metrics types to be used for reporting
the score for the model. Allowed values are “accuracy”, “precision_recall_fscore”: pos-
itive and negative precision, recall, and f1 scores. If test data is provided, the score shall
be reported in terms of all requested metrics.
• **kwargs – base class keyword argument values.
repo_name = 'randomcutforest'
repo_version = 1
create_model(vpc_config_override=’VPC_CONFIG_DEFAULT’)
Return a RandomCutForestModel referencing the latest s3 model data produced by this Estimator.
Parameters vpc_config_override (dict[str, list[str]]) – Optional over-
ride for VpcConfig set on the model. Default: use subnets and security groups from
this Estimator. * ‘Subnets’ (list[str]): List of subnet ids. * ‘SecurityGroupIds’ (list[str]):
List of security group ids.
classmethod attach(training_job_name, sagemaker_session=None,
model_channel_name=’model’)
Attach to an existing training job.
Create an Estimator bound to an existing training job, each subclass is responsible to implement
_prepare_init_params_from_job_description() as this method delegates the actual con-
version of a training job description to the arguments that the class constructor expects. After attaching,
if the training job has a Complete status, it can be deploy() ed to create a SageMaker Endpoint and
return a Predictor.
If the training job is in progress, attach will block and display log messages from the training job, until
the training job completes.
Parameters
• training_job_name (str) – The name of the training job to attach to.
• sagemaker_session (sagemaker.session.Session) – Session object
which manages interactions with Amazon SageMaker APIs and any other AWS ser-
vices needed. If not specified, the estimator creates one using the default AWS con-
figuration chain.
• model_channel_name (str) – Name of the channel where pre-trained model
data will be downloaded (default: ‘model’). If no channel with the same name exists
in the training job, this option will be ignored.
Examples
>>> my_estimator.fit(wait=False)
>>> training_job_name = my_estimator.latest_training_job.name
Later on:
>>> attached_estimator = Estimator.attach(training_job_name)
>>> attached_estimator.deploy()
Returns Instance of the calling Estimator Class with the attached training job.
model_data
The model location in S3. Only set if Estimator has been fit().
Type str
record_set(train, labels=None, channel=’train’)
Build a RecordSet from a numpy ndarray matrix and label vector.
For the 2D ndarray train, each row is converted to a Record object. The vector is stored in the
“values” entry of the features property of each Record. If labels is not None, each corresponding
label is assigned to the “values” entry of the labels property of each Record.
The collection of Record objects are protobuf serialized and uploaded to new S3 locations. A manifest
file is generated containing the list of objects created and also stored in S3.
The number of S3 objects created is controlled by the train_instance_count property on this
Estimator. One S3 object is created per training instance.
Parameters
• train (numpy.ndarray) – A 2D numpy array of training data.
• labels (numpy.ndarray) – A 1D numpy array of labels. Its length must be
equal to the number of rows in train.
• channel (str) – The SageMaker TrainingJob channel this RecordSet should be
assigned to.
Returns A RecordSet referencing the encoded, uploading training and label data.
Return type RecordSet
train_image()
Return the Docker image to use for training.
The fit() method, which does the model training, calls this method to find the image to use for model
training.
Returns The URI of the Docker image.
Return type str
training_job_analytics
Return a TrainingJobAnalytics object for the current training job.
transformer(instance_count, instance_type, strategy=None, assemble_with=None,
output_path=None, output_kms_key=None, accept=None, env=None,
max_concurrent_transforms=None, max_payload=None, tags=None, role=None,
volume_kms_key=None)
Return a Transformer that uses a SageMaker Model based on the training job. It reuses the SageMaker
Session and base job name used by the Estimator.
Parameters
• instance_count (int) – Number of EC2 instances to use.
• instance_type (str) – Type of EC2 instance to use, for example, ‘ml.c4.xlarge’.
• strategy (str) – The strategy used to decide how to batch records in a single re-
quest (default: None). Valid values: ‘MULTI_RECORD’ and ‘SINGLE_RECORD’.
• assemble_with (str) – How the output is assembled (default: None). Valid
values: ‘Line’ or ‘None’.
• output_path (str) – S3 location for saving the transform result. If not specified,
results are stored to a default bucket.
Workflows
SageMaker APIs to export configurations for creating and managing Airflow workflows.
10.1 Airflow
10.1.1 training_config
143
sagemaker, Release 1.18.0
– (list[sagemaker.amazon.amazon_estimator.RecordSet]) - A list of
:class:~‘sagemaker.amazon.amazon_estimator.RecordSet‘ objects, where each
instance is a different channel of training data.
10.1.2 tuning_config
10.1.3 model_config
• instance_type (str) – The EC2 instance type to deploy this Model to. For example,
‘ml.p2.xlarge’
• model (sagemaker.model.FrameworkModel) – The SageMaker model to ex-
port Airflow config from
• role (str) – The ExecutionRoleArn IAM Role ARN for the model
• image (str) – An container image to use for deploying the model
Returns Model config that can be directly used by SageMakerModelOperator in Airflow. It can
also be part of the config used by SageMakerEndpointOperator and SageMakerTransformOp-
erator in Airflow.
Return type dict
10.1.4 model_config_from_estimator
sagemaker.workflow.airflow.model_config_from_estimator(instance_type, es-
timator, task_id,
task_type, role=None,
image=None, name=None,
model_server_workers=None,
vpc_config_override=’VPC_CONFIG_DEFAULT’)
Export Airflow model config from a SageMaker estimator
Parameters
• instance_type (str) – The EC2 instance type to deploy this Model to. For example,
‘ml.p2.xlarge’
• estimator (sagemaker.model.EstimatorBase) – The SageMaker estimator
to export Airflow config from. It has to be an estimator associated with a training job.
• task_id (str) – The task id of any airflow.contrib.operators.SageMakerTrainingOperator
or airflow.contrib.operators.SageMakerTuningOperator that generates training jobs in
the DAG. The model config is built based on the training job generated in this operator.
• task_type (str) – Whether the task is from SageMakerTrainingOperator or Sage-
MakerTuningOperator. Values can be ‘training’, ‘tuning’ or None (which means training
job is not from any task).
• role (str) – The ExecutionRoleArn IAM Role ARN for the model
• image (str) – An container image to use for deploying the model
• name (str) – Name of the model
• model_server_workers (int) – The number of worker processes used by the infer-
ence server. If None, server will use one worker per vCPU. Only effective when estimator
is a SageMaker framework.
• vpc_config_override (dict[str, list[str]]) – Override for VpcConfig
set on the model. Default: use subnets and security groups from this Estimator. * ‘Sub-
nets’ (list[str]): List of subnet ids. * ‘SecurityGroupIds’ (list[str]): List of security group
ids.
Returns
Model config that can be directly used by SageMakerModelOperator in Airflow. It can also
be part of the config used by SageMakerEndpointOperator.
SageMakerTransformOperator in Airflow.
Return type dict
10.1.5 transform_config
10.1.6 transform_config_from_estimator
sagemaker.workflow.airflow.transform_config_from_estimator(estimator, task_id,
task_type, in-
stance_count, in-
stance_type, data,
data_type=’S3Prefix’,
content_type=None,
compres-
sion_type=None,
split_type=None,
job_name=None,
model_name=None,
strategy=None,
assem-
ble_with=None, out-
put_path=None, out-
put_kms_key=None,
accept=None,
env=None,
max_concurrent_transforms=None,
max_payload=None,
tags=None,
role=None, vol-
ume_kms_key=None,
model_server_workers=None,
image=None,
vpc_config_override=None)
Export Airflow transform config from a SageMaker estimator
Parameters
• estimator (sagemaker.model.EstimatorBase) – The SageMaker estimator
to export Airflow config from. It has to be an estimator associated with a training job.
• task_id (str) – The task id of any airflow.contrib.operators.SageMakerTrainingOperator
or airflow.contrib.operators.SageMakerTuningOperator that generates training jobs in the
DAG. The transform config is built based on the training job generated in this operator.
• task_type (str) – Whether the task is from SageMakerTrainingOperator or Sage-
MakerTuningOperator. Values can be ‘training’, ‘tuning’ or None (which means training
job is not from any task).
• instance_count (int) – Number of EC2 instances to use.
• instance_type (str) – Type of EC2 instance to use, for example, ‘ml.c4.xlarge’.
• data (str) – Input data location in S3.
• data_type (str) – What the S3 location defines (default: ‘S3Prefix’). Valid values:
– ’S3Prefix’ - the S3 URI defines a key name prefix. All objects with this prefix will be used as
inputs for the transform job.
– ’ManifestFile’ - the S3 URI points to a single manifest file listing each S3 object to use as
an input for the transform job.
• content_type (str) – MIME type of the input data (default: None).
10.1.7 deploy_config
10.1.8 deploy_config_from_estimator
sagemaker.workflow.airflow.deploy_config_from_estimator(estimator, task_id,
task_type, ini-
tial_instance_count,
instance_type,
model_name=None,
endpoint_name=None,
tags=None, **kwargs)
Export Airflow deploy config from a SageMaker estimator
Parameters
• estimator (sagemaker.model.EstimatorBase) – The SageMaker estimator
to export Airflow config from. It has to be an estimator associated with a training job.
• task_id (str) – The task id of any airflow.contrib.operators.SageMakerTrainingOperator
or airflow.contrib.operators.SageMakerTuningOperator that generates training jobs in the
DAG. The endpoint config is built based on the training job generated in this operator.
• task_type (str) – Whether the task is from SageMakerTrainingOperator or Sage-
MakerTuningOperator. Values can be ‘training’, ‘tuning’ or None (which means training
job is not from any task).
• initial_instance_count (int) – Minimum number of EC2 instances to deploy
to an endpoint for prediction.
• instance_type (str) – Type of EC2 instance to deploy to an endpoint for prediction,
for example, ‘ml.c4.xlarge’.
• model_name (str) – Name to use for creating an Amazon SageMaker model. If not
specified, one will be generated.
s
sagemaker.session, 33
151
sagemaker, Release 1.18.0
A B
accuracy_top_k (sagemaker.LinearLearner at- balance_multiclass_weights (sage-
tribute), 118 maker.LinearLearner attribute), 118
AmazonAlgorithmEstimatorBase (class in sage- best_training_job() (sage-
maker.amazon.amazon_estimator), 79 maker.tuner.HyperparameterTuner method),
analytics() (sagemaker.tuner.HyperparameterTuner 23
method), 23 boto_region_name (sagemaker.session.Session at-
AnalyticsMetricsBase (class in sage- tribute), 33
maker.analytics), 46
as_json_range() (sage- C
maker.tuner.CategoricalParameter method), cast_to_type() (sage-
25 maker.tuner.CategoricalParameter class
as_tuning_range() (sage- method), 25
maker.tuner.CategoricalParameter method), cast_to_type() (sage-
25 maker.tuner.ContinuousParameter class
attach() (sagemaker.estimator.Estimator class method), 24
method), 11 cast_to_type() (sagemaker.tuner.IntegerParameter
attach() (sagemaker.estimator.EstimatorBase class class method), 25
method), 6 CategoricalParameter (class in sagemaker.tuner),
attach() (sagemaker.estimator.Framework class 25
method), 16 Chainer (class in sagemaker.chainer.estimator), 69
attach() (sagemaker.FactorizationMachines class ChainerModel (class in sagemaker.chainer.model), 71
method), 83 ChainerPredictor (class in sage-
attach() (sagemaker.IPInsights class method), 88 maker.chainer.model), 72
attach() (sagemaker.KMeans class method), 94 classify() (sagemaker.tensorflow.serving.Predictor
attach() (sagemaker.KNN class method), 100 method), 58
attach() (sagemaker.LDA class method), 105 clear_cache() (sage-
attach() (sagemaker.LinearLearner class method), maker.analytics.AnalyticsMetricsBase method),
113 46
attach() (sagemaker.NTM class method), 120 clear_cache() (sage-
attach() (sagemaker.Object2Vec class method), 126 maker.analytics.HyperparameterTuningJobAnalytics
attach() (sagemaker.PCA class method), 131 method), 47
attach() (sagemaker.RandomCutForest class method), clear_cache() (sage-
137 maker.analytics.TrainingJobAnalytics method),
attach() (sagemaker.transformer.Transformer class 47
method), 32 CLOUDWATCH_NAMESPACE (sage-
attach() (sagemaker.tuner.HyperparameterTuner maker.analytics.TrainingJobAnalytics at-
class method), 21 tribute), 47
COACH_LATEST_VERSION (sage-
153
sagemaker, Release 1.18.0
154 Index
sagemaker, Release 1.18.0
Index 155
sagemaker, Release 1.18.0
156 Index
sagemaker, Release 1.18.0
Index 157
sagemaker, Release 1.18.0
158 Index
sagemaker, Release 1.18.0
Index 159
sagemaker, Release 1.18.0
160 Index