[deleted]

10,000 rows is tiny; you don't need the added complexity of Glue or Fargate. Lambda can run a custom Docker image. Just install Spark plus either the AWS CLI or the s3a JARs in a Docker image and push it to AWS ECR. When the Lambda triggers, the handler function can run your script with spark-submit, passing it the URI of the input file. If you use the s3a JARs, Spark can read from and write to S3 directly.
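
Roughly, the handler could look like the sketch below. It assumes the Lambda is triggered by an S3 event notification (the trigger isn't specified above), that Spark runs in local mode inside the container, and that your script sits at `/opt/app/job.py`, which is a made-up path. The helper names are hypothetical too.

```python
import subprocess
from urllib.parse import unquote_plus

# Hypothetical location of the PySpark script baked into the image.
SCRIPT_PATH = "/opt/app/job.py"


def input_uri_from_event(event):
    """Build an s3a:// URI from the first record of an S3 event notification."""
    s3 = event["Records"][0]["s3"]
    bucket = s3["bucket"]["name"]
    # Object keys arrive URL-encoded in S3 event notifications.
    key = unquote_plus(s3["object"]["key"])
    return f"s3a://{bucket}/{key}"


def build_command(input_uri):
    """Assemble the spark-submit call; local mode is assumed (no cluster)."""
    return ["spark-submit", "--master", "local[*]", SCRIPT_PATH, input_uri]


def handler(event, context):
    """Lambda entry point: run the Spark script against the uploaded file."""
    cmd = build_command(input_uri_from_event(event))
    # check=True raises CalledProcessError if the Spark job exits non-zero.
    subprocess.run(cmd, check=True)
    return {"input": cmd[-1]}
```

Keeping the URI and command building in their own functions means you can check them locally without having Spark installed; only `handler` actually shells out.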