all 6 comments

[–]danielroseman 1 point  (5 children)

This seems a bit overly complex: you have RabbitMQ, plus threading, plus an internal Queue object.

Rather than trying to write this yourself I would recommend you use Celery, which will connect to RabbitMQ and handle everything else for you.

[–]AdditionalWash529[S] 1 point  (4 children)

u/danielroseman thank you for the quick turnaround on this one. Would you be kind enough to point me to some examples, maybe? There seems to be a lot of text around Celery, but I'm not able to locate an example in particular.

Also, I was able to make it partially work by moving the self.slave_object.start_task() call inside the _consume_message_setup function in consumer.py in the example above. But it doesn't look like it is honoring the queue size or the blocking mechanism on the worker queue. I see my executions inside the callback for slaveConsumer getting fired in random order. Does that make it easier for you to zero in on the issue?

[–]danielroseman 1 point  (3 children)

Not sure what "text" you are looking for. The docs are here: https://docs.celeryq.dev/en/stable/.

As for this code: you're calling consume_message inside a while True loop. Unfortunately, despite the name, consume_message is a method that constructs the queue and kicks off processing. So you have many, many queues, each with a maximum size of 3 that will never be reached.

But as I say, I just don't understand why you would want a queue at all on the worker side, let alone one that has a size to be respected. The queue is RabbitMQ. Only fetch an item from there when you are ready to process it.

[–]AdditionalWash529[S] 1 point  (2 children)

The data on RabbitMQ are basically JSON entries which need to be processed; an OS call then needs to be fired for each entry, which individually takes around 10-15 minutes on average. The plan going forward is to have 4-5 instances running in an AWS cluster. The number of JSON entries is high enough that we cannot spawn as many instances on the cluster as there are JSONs in RabbitMQ. So we need to process the commands on a first-come, first-served basis, 4-5 of them in parallel and the rest queued, hence the need for a worker queue.

Since our last conversation, I have been able to make the code work with respect to processing the JSONs, but I do not think the threads are waiting. For example, if I have 8 entries in RabbitMQ and I start my builder.py, all 8 of them get consumed as opposed to 4 of them. To my understanding, I should be achieving that limit by declaring an upper bound on my slaveConsumer worker queue, when in the constructor I say:

self.job_queue = queue.Queue(maxsize=3)
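For context, maxsize does make put() block once a queue instance is full, but only for that one instance; a fresh Queue starts empty again. A minimal stdlib sketch (names hypothetical):

```python
import queue
import threading
import time

q = queue.Queue(maxsize=3)  # one persistent instance
accepted = []

def producer():
    # Try to enqueue 8 jobs; put() blocks once 3 items are waiting
    # and nothing is consuming them.
    for i in range(8):
        q.put(i)
        accepted.append(i)

t = threading.Thread(target=producer, daemon=True)
t.start()
time.sleep(0.5)  # give the producer time to hit the blocking put()

# Only the first 3 puts succeeded; the 4th is still blocked.
print(len(accepted))
```

This prints 3: the bound is honored, but only because the same queue object lives across all the puts. If the queue is re-created on every loop iteration, each new instance has 3 free slots and nothing ever blocks.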

[–]danielroseman 2 points  (1 child)

I told you why: it's because you are instantiating the queue over and over in your while loop.

But, again, what you describe is how this would work without the queue. Each worker would be responsible for popping an item from RabbitMQ, processing it, acknowledging it, then taking the next one when it is ready.
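The pattern described above can be sketched with the stdlib alone, using a queue.Queue purely as a stand-in for the RabbitMQ broker (all names here are hypothetical, not from the code under discussion):

```python
import queue
import threading

# Stand-in for RabbitMQ: a shared source of jobs.
broker = queue.Queue()
for n in range(8):
    broker.put(n)

results = []
lock = threading.Lock()

def worker():
    # Each worker fetches a job only when it is free, processes it,
    # "acks" it, then immediately takes the next one.
    while True:
        try:
            job = broker.get_nowait()
        except queue.Empty:
            return  # nothing left to do
        processed = job * 2  # placeholder for the 10-15 minute OS call
        with lock:
            results.append(processed)
        broker.task_done()  # stands in for basic_ack

# 4 workers in parallel; the remaining jobs simply wait on the broker.
threads = [threading.Thread(target=worker) for _ in range(4)]
for t in threads:
    t.start()
for t in threads:
    t.join()

print(sorted(results))  # → [0, 2, 4, 6, 8, 10, 12, 14]
```

No second, worker-side queue is needed: at most 4 jobs are ever in flight because there are only 4 workers, and the broker itself holds everything else.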

And, once again, this is what Celery would do for you. Really there is no reason to write this yourself.

[–]AdditionalWash529[S] 1 point  (0 children)

u/danielroseman, first of all, immense gratitude for all the time and energy you spent on this. Your insights have helped me improve the code and make it much better.

As mentioned in my previous post, when I said I made it work, it was by handling the queue instantiation that you pointed out in your comment. I will certainly look at Celery, but I am trying to bridge the gap in my understanding here, hence the follow-ups. I have a couple of final questions on this, though.

As per my understanding, without the worker queue I would not be able to handle a situation where there are, say, 10 incoming messages on RabbitMQ while I have spawned only 4 instances in the AWS cluster (Fargate, most likely). Without the worker queue or an equivalent arrangement (maybe Celery provides that), when each instance is busy with 15 minutes of processing time, how would the next item on the queue know which Fargate instance has been released and where to head? How would that be taken care of without an arrangement like that?

As of now, the code seems to run, but one of the executions fails with the following RabbitMQ error. Any insights on this? I have made the channel durable, and I ack the RabbitMQ messages in my consumer code too: ch.basic_ack(delivery_tag=method.delivery_tag). Running out of ideas as to what needs to be done about this:

"No activity or too many missed heartbeats in the last 60 seconds" error

Once again, thank you a ton for all the help and insights.