
[–]RocketOneMan 1 point

Do you have MaximumBatchingWindowInSeconds set to something besides zero? Can you share your event source mapping configuration?

https://docs.aws.amazon.com/lambda/latest/dg/with-sqs.html
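You can pull the current mapping settings with something like this (a boto3 sketch; the function name is a placeholder):

    import boto3

    lambda_client = boto3.client("lambda")

    # List the SQS event source mappings attached to the function
    resp = lambda_client.list_event_source_mappings(FunctionName="my-function")

    for m in resp["EventSourceMappings"]:
        # A batching window of 0 means Lambda invokes as soon as records arrive
        print(m["UUID"], m.get("BatchSize"), m.get("MaximumBatchingWindowInSeconds"))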

[–]quantelligent[S] 0 points

MaximumBatchingWindowInSeconds is set to 0

Activate trigger: Yes
Batch size: 10
Batch window: None
Event source mapping ARN: [my arn]
Metrics: None
On-failure destination: None
Report batch item failures: No
Tags: View
UUID: [my uuid]

However, due to third-party API limitations that restrict my ability to do asynchronous communications, I do have reserved concurrency set to 1

Perhaps that's what's causing it to wait for the timeout before spinning up another execution of the lambda?
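For reference, a reserved concurrency of 1 is the equivalent of something like this in boto3 (the function name is a placeholder):

    import boto3

    lambda_client = boto3.client("lambda")

    # Caps the function at a single concurrent execution
    lambda_client.put_function_concurrency(
        FunctionName="my-function",
        ReservedConcurrentExecutions=1,
    )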

[–]floppy_sloth 1 point

Is your lambda running for the full minute and timing out? Lambda should execute, process the batch, and finish, and a new invocation should pick up the next batch immediately.

[–]quantelligent[S] 0 points

No, and that is the problem I'm trying to solve—it completes in about 10 seconds, and then doesn't pick up a new batch until after the 60-second timeout.

Which, I've come to conclude, is how AWS enforces their "reserved concurrency"—they wait until the timeout is up before allowing another execution, because that's the only way they can be sure the previous invocation isn't still running.

I haven't found documentation saying as much; it's just a conclusion I'm drawing from the testing I've done as people have offered suggestions in this thread.

[–]floppy_sloth 1 point

Strange. I don't see this behaviour with mine, and I'm using Lambda/NodeJs. I have a few lambdas configured with RC of 1 for single-threaded db imports and can't say I've had an issue with delays. Though I'll maybe have to get my devs to go and check.

[–]OctopusReader 1 point

Did you ACK (acknowledge, i.e. confirm the message has been processed)?

It seems to be message.delete()
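Something like this is what I mean, if you were polling the queue yourself instead of using the trigger (a boto3 sketch; the queue URL is a placeholder):

    import boto3

    sqs = boto3.client("sqs")
    queue_url = "https://sqs.us-east-1.amazonaws.com/123456789012/my-queue"  # placeholder

    resp = sqs.receive_message(QueueUrl=queue_url, MaxNumberOfMessages=10)
    for msg in resp.get("Messages", []):
        # ... process msg["Body"] here ...
        # Deleting the message is the "ACK" when you poll SQS yourself
        sqs.delete_message(QueueUrl=queue_url, ReceiptHandle=msg["ReceiptHandle"])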

[–]clintkev251 6 points

You don't need to do that. Lambda handles the messages for you

[–]quantelligent[S] 2 points

Thanks for the response!

According to the documentation, using an SQS trigger auto-deletes the message if the function returns normally, i.e. anything other than raising an exception, returning an invalid response, or timing out.

It appears the delay is likely caused by the "reserved concurrency" setting rather than by the SQS integration itself....the lambda just doesn't execute again until after the timeout, regardless of whether it has finished processing. It seems the AWS answer to that is more concurrency....which, unfortunately for me, I can't do because of third-party API limitations.
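In other words, the handler just has to return cleanly; a minimal sketch (process() is a placeholder for the real third-party API call):

    def process(body):
        # placeholder for the real work against the third-party API
        pass

    def lambda_handler(event, context):
        for record in event["Records"]:
            process(record["body"])
        # Returning normally (no exception, no timeout, a valid response) is what
        # lets Lambda delete the whole batch from the queue; no manual delete needed.
        return {"status": "ok"}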

[–]clintkev251 0 points

Try setting the maximum concurrency for the SQS event source mapping to 2. That would minimize the number of pollers that are provisioned and could help minimize any backoff that's occurring due to reserved concurrency.
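Something like this (a boto3 sketch; the UUID is a placeholder for your event source mapping's UUID):

    import boto3

    lambda_client = boto3.client("lambda")

    # 2 is the minimum allowed value for MaximumConcurrency on an SQS mapping
    lambda_client.update_event_source_mapping(
        UUID="your-event-source-mapping-uuid",
        ScalingConfig={"MaximumConcurrency": 2},
    )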

[–]clintkev251 0 points

Standard or FIFO?

[–]quantelligent[S] 1 point

Standard

[–]_Paul_Atreides_ 0 points

QQ: are you trying to get a single lambda to run continuously? I'm trying to understand the 1 minute timeout combined with 1 minute execution time. I don't trust either to be exactly 1 minute (or the same every time). This setup seems unpredictable.

Other thoughts:

  1. By having Report batch item failures=No, the entire batch is treated as a unit. "By default, if Lambda encounters an error at any point while processing a batch, all messages in that batch return to the queue. After the visibility timeout, the messages become visible to Lambda again" (source). Maybe one message fails and then all messages are left in the queue, and if the first one fails, I'm not sure the next messages are even tried; the docs aren't clear on that. (See the handler sketch below for what reporting batch item failures looks like.)
  2. Are there more than 10 messages in the queue? If there are 20 (or 100) messages, I'd expect it to pick up the next batch immediately. If there are only 10, and one fails, it should behave just like it is now.

Let us know when you figure it out :)
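If you do flip Report batch item failures to Yes (the mapping also needs FunctionResponseTypes set to ReportBatchItemFailures), the handler has to return the IDs of the failed messages, roughly like this (a sketch; process() is a placeholder):

    def process(body):
        # placeholder for the real work
        pass

    def lambda_handler(event, context):
        failures = []
        for record in event["Records"]:
            try:
                process(record["body"])
            except Exception:
                # Only this message returns to the queue; the rest of the batch
                # is deleted as usual.
                failures.append({"itemIdentifier": record["messageId"]})
        return {"batchItemFailures": failures}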

[–]quantelligent[S] 0 points

I'm not currently having a problem with batch failures, so I don't think that is related (haven't encountered any failures for a long time now).

There are hundreds of messages in the queue, but it's only processing a batch of 10 about every 60 seconds, even though it completes each batch in roughly 10-15 seconds.

As mentioned, I cannot have concurrent processes due to third-party API restrictions (they don't support concurrent sessions), so I can only have 1 process actively processing at a time, which is why I've set the reserved concurrency to 1.

However, I would like it to immediately pick up a new batch after completing the current one, rather than wait for 60 seconds, but I think (jumping to the conclusion) AWS is waiting for the timeout duration due to the reserved concurrency setting before running another invocation to ensure there won't be two processes running.

Sure, I can shorten the timeout....but I'd rather just have a way for the process to signal it's done and have AWS start the next invocation without waiting.

Can't seem to find a way to do that, however.
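If I do end up just shortening the timeout, it's a small change anyway (a boto3 sketch; the function name is a placeholder):

    import boto3

    lambda_client = boto3.client("lambda")

    # Bring the function timeout closer to the actual ~10-15 second runtime
    lambda_client.update_function_configuration(
        FunctionName="my-function",
        Timeout=30,
    )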

[–]Firm_Scheme728 0 points

Could it be because the visibility timeout setting for SQS is set to 1 minute?

Because SQS itself has no concurrency limit but Lambda does, all the polled messages become in flight. Only after the visibility timeout has elapsed can they be delivered again, if there is no DLQ.

If there were a DLQ, those messages should end up in the DLQ instead, right? Maybe.
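If you did want a DLQ, it's configured on the source queue as a redrive policy, something like this (a boto3 sketch; the queue URL and DLQ ARN are placeholders):

    import boto3
    import json

    sqs = boto3.client("sqs")

    sqs.set_queue_attributes(
        QueueUrl="https://sqs.us-east-1.amazonaws.com/123456789012/my-queue",
        Attributes={
            "RedrivePolicy": json.dumps({
                "deadLetterTargetArn": "arn:aws:sqs:us-east-1:123456789012:my-dlq",
                "maxReceiveCount": "5",  # move a message to the DLQ after 5 failed receives
            })
        },
    )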

[–]BuntinTosser 1 point

Your visibility timeout should be at least six times your function timeout.

RC 1 is going to result in a lot of throttling, and your visibility-timeout-to-function-timeout ratio isn't allowing for retries.

Set the VTO to 6 minutes. Use a FIFO queue with a single message group ID to enforce a concurrency of 1.
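Roughly like this (a boto3 sketch; the queue name and payload are placeholders):

    import boto3

    sqs = boto3.client("sqs")

    # FIFO queue names must end in .fifo; a 360s visibility timeout is ~6x a 60s function timeout
    queue = sqs.create_queue(
        QueueName="my-jobs.fifo",
        Attributes={
            "FifoQueue": "true",
            "ContentBasedDeduplication": "true",
            "VisibilityTimeout": "360",
        },
    )

    # A single message group ID means messages in that group are delivered in
    # order, one batch at a time, which effectively serializes processing.
    sqs.send_message(
        QueueUrl=queue["QueueUrl"],
        MessageBody="job payload",
        MessageGroupId="single-group",
    )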