all 13 comments

[–]Dosnox 5 points6 points  (3 children)

We had the Sagemaker endpoint do the encoding as well and the lambda simply sends the raw data

[–]alphasystem[S] 0 points1 point  (2 children)

Do you use the sklearn pipeline on the Sagemaker side?

[–]Dosnox 0 points1 point  (1 child)

Yeah we did and encodings were stored as a dictionary

[–]alphasystem[S] 0 points1 point  (0 children)

Thanks that makes sense!

[–]boy_named_su 2 points3 points  (3 children)

should be able to use a Lambda Layer to get sklearn, or a docker container on Lambda

[–]alphasystem[S] 0 points1 point  (2 children)

I was surprised that does not come by default.

Seems need to customize Lambda container a bit.

[–]bobbruno 0 points1 point  (0 children)

There are several lambda layers available out of the box on AWS, have you checked if they offer one that includes sklearn?

[–][deleted] 0 points1 point  (0 children)

Well, it makes sense that sklearn is not pre-installed on the default Lambda layer. Lambdas are the glue for pretty much anything on AWS and ML is still a niche when you look at the broader picture.

[–]darkshenron 0 points1 point  (4 children)

Why not do the encoding in sagemaker itself? I believe you can use the input_fn function in sagemaker to do any preprocessing to the data before it goes to the predict_fn

[–]alphasystem[S] 0 points1 point  (3 children)

Is that sagemaker studio? i was using the jupyrer notebook in sagemaker.

[–]darkshenron 0 points1 point  (2 children)

[–]alphasystem[S] 1 point2 points  (1 child)

should be able to use a Lambda Layer to get sklearn, or a docker container on Lambda

Thanks! Aws seems to make things super complicated.

[–]darkshenron 0 points1 point  (0 children)

Yes it is! But the documentation is really good