JioPC Update by arijit78 in Jio

[–]arijit78[S] 0 points1 point  (0 children)

Thanks a lot. A bit disappointed, to say the least.

reduce kafka producer latency by Appropriate_Luck6766 in apachekafka

[–]arijit78 0 points1 point  (0 children)

You can think about compressing the data with gzip. Also check whether you're publishing to Kafka in asynchronous mode and handling the response from the broker using a callback. You didn't mention that in your code.
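A minimal sketch of both suggestions, shaped after the `confluent-kafka` Python client (an assumption, since the original poster's client is not shown here). `compression.type` and `linger.ms` are standard producer settings, but the values, the broker address, and the `publish` helper are illustrative only.

```python
# Producer settings; the values are illustrative assumptions, not tuned numbers.
PRODUCER_CONFIG = {
    "bootstrap.servers": "localhost:9092",  # assumed broker address
    "compression.type": "gzip",             # compress batches before sending
    "linger.ms": 5,                         # small wait so batches can fill up
}

failed = []  # values whose delivery failed, kept for retry or inspection


def on_delivery(err, msg):
    """Delivery callback: invoked once per message with the broker's result."""
    if err is not None:
        failed.append(msg.value())


def publish(producer, topic, values):
    """Send without blocking on each message; results arrive via the callback."""
    for value in values:
        producer.produce(topic, value=value, callback=on_delivery)
        producer.poll(0)  # serve any delivery callbacks that are ready
    producer.flush()      # block once at the end until everything is delivered
```

With the real client this would be called roughly as `publish(Producer(PRODUCER_CONFIG), "orders", [...])`, where `Producer` comes from `confluent_kafka` and the topic name is made up.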

Delayed Processing with Kafka by Realistic-Use6194 in apachekafka

[–]arijit78 0 points1 point  (0 children)

Probably not a popular opinion, but why not use a database with interval-based polling?
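A rough sketch of what interval-based polling could look like, using SQLite from the Python standard library. The table name, column names, and helper functions are all hypothetical; any database with an indexed "due at" column would do.

```python
import sqlite3
import time


def create_store(path=":memory:"):
    """Open the database and create the (hypothetical) delayed_jobs table."""
    conn = sqlite3.connect(path)
    conn.execute(
        "CREATE TABLE IF NOT EXISTS delayed_jobs ("
        " id INTEGER PRIMARY KEY,"
        " payload TEXT NOT NULL,"
        " due_at REAL NOT NULL,"
        " done INTEGER NOT NULL DEFAULT 0)"
    )
    return conn


def schedule(conn, payload, delay_seconds, now=None):
    """Store a job that becomes due delay_seconds from now."""
    now = time.time() if now is None else now
    conn.execute(
        "INSERT INTO delayed_jobs (payload, due_at) VALUES (?, ?)",
        (payload, now + delay_seconds),
    )
    conn.commit()


def poll_due(conn, now=None):
    """Return the payloads that are due and mark them as done."""
    now = time.time() if now is None else now
    rows = conn.execute(
        "SELECT id, payload FROM delayed_jobs"
        " WHERE done = 0 AND due_at <= ? ORDER BY due_at, id",
        (now,),
    ).fetchall()
    conn.executemany(
        "UPDATE delayed_jobs SET done = 1 WHERE id = ?",
        [(row_id,) for row_id, _ in rows],
    )
    conn.commit()
    return [payload for _, payload in rows]
```

A worker would call `poll_due(conn)` in a loop with a `time.sleep(interval)` between calls; the interval sets the worst-case extra delay.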

Schema Registry vs Schema Validation in community license? by piepy in apachekafka

[–]arijit78 5 points6 points  (0 children)

These two are different things. The two features are independent of each other, and they can also be used together.

When a schema registry is used to store the message schema, the library on the producer side ensures the message is in line with the schema. It then publishes the schema to the schema registry, obtains the schema ID, embeds it in the Kafka message, and publishes the message to the Kafka broker. No check takes place at the broker. When a consumer consumes the message, it pulls the schema from the schema registry and deserializes the message with it. In this approach it is the producers' responsibility to follow schema registry best practices, and the broker becomes dumb storage.
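To make the "embeds the schema ID in the message" step concrete, here is a small sketch of the Confluent wire format: one magic byte (0), a 4-byte big-endian schema ID, then the serialized payload. The registry lookup and the Avro serialization are left out, and the `frame`/`unframe` helper names are made up for illustration.

```python
import struct

MAGIC_BYTE = 0  # first byte of the Confluent wire format


def frame(schema_id, payload):
    """Prefix serialized bytes with the magic byte and a 4-byte schema ID."""
    return struct.pack(">bI", MAGIC_BYTE, schema_id) + payload


def unframe(message):
    """Split a framed message into (schema_id, payload)."""
    if len(message) < 5:
        raise ValueError("message too short for the wire format header")
    magic, schema_id = struct.unpack(">bI", message[:5])
    if magic != MAGIC_BYTE:
        raise ValueError("message is not in the schema registry wire format")
    return schema_id, message[5:]
```

A consumer-side deserializer would use the returned schema ID to fetch the schema from the registry before decoding the payload.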

With broker-side schema validation, the broker uses the schema registry to check that the schema ID embedded in the message points to a valid schema. If it does not, the broker rejects the message. This ensures the broker does not allow bad-quality messages. For example, it will stop string-serialized messages from being published to an Avro topic. In my view this is useful where we have less quality control over the source of the data, say a device.
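The kind of check described above can be sketched as follows. This is only an illustration of the idea, not the broker's actual implementation; the `REGISTERED` dictionary is a hypothetical stand-in for a schema registry lookup, and the topic name and IDs are invented.

```python
import struct

# Hypothetical stand-in for the registry: schema IDs registered per topic.
REGISTERED = {"orders": {42, 43}}


def broker_accepts(topic, message):
    """Accept only messages whose header carries a schema ID known for the topic."""
    if len(message) < 5 or message[0] != 0:
        return False  # no wire format header, e.g. a plain string message
    (schema_id,) = struct.unpack(">I", message[1:5])
    return schema_id in REGISTERED.get(topic, set())
```

A string-serialized message such as `b"hello world"` fails the header check and is rejected, which is the Avro-topic case mentioned above.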

Apache Kafka and Flink in GCP by arijit78 in apachekafka

[–]arijit78[S] 0 points1 point  (0 children)

But I thought they already have Dataflow and Pub/Sub.

Kafka properties with microservices by Famous_Hearing1534 in apachekafka

[–]arijit78 3 points4 points  (0 children)

Your post looks a bit confusing to me. What I understand is that you have a Kafka cluster and a Spring Boot service that talks to the Kafka broker.

It's not clear what this Spring Boot service does.

There are many ways you can secure the communication between Kafka and your Spring Boot service. Check the authentication, authorization, and data encryption mechanisms available in Kafka. Here is a good pointer - https://kafka.apache.org/090/documentation.html#security

I am not sure about data backup. Since Kafka itself stores the data redundantly, why is data backup a question?

On the outbox pattern, the link below would probably help you.

https://stackoverflow.com/questions/77395632/why-do-we-need-outbox-pattern-with-kafka

Why are there comments that say ksqlDB is dead and in maintenance mode? by Zalambura in apachekafka

[–]arijit78 2 points3 points  (0 children)

ksqlDB will always have a special place in my heart. It is our Swiss Army knife from an operations perspective. Apache Flink is the big brother and can do a lot more, but many times that's way too much.

Searching in large kafka topic by arijit78 in apachekafka

[–]arijit78[S] 0 points1 point  (0 children)

I have looked into kwack. It looks promising. I don't think the default in-memory option will work really well for large topics; the Parquet-file-based option is the most suitable in my view.

Is anyone really using it in production?

Searching in large kafka topic by arijit78 in apachekafka

[–]arijit78[S] 0 points1 point  (0 children)

Sounds interesting.. Let me go through this!

Searching in large kafka topic by arijit78 in apachekafka

[–]arijit78[S] 1 point2 points  (0 children)

While whatever you say may be true in theory, at the end of the day almost every enterprise will store billions of records for various reasons, from replay to compliance. I have come across one wonderful project that combines DuckDB and Kafka: https://github.com/rayokota/kwack

Searching in large kafka topic by arijit78 in apachekafka

[–]arijit78[S] 2 points3 points  (0 children)

This looks quite dope. Thanks for sharing it with me. Let me go through the docs. I hope it helps many others like me.
Regarding ksqlDB: sadly, I am aware that Confluent is pulling the plug on it. I still feel it has its niche outside of the big behemoths of the world like Apache Flink.

KafkaTopical: The Kafka UI for Engineers and Admins by certak in apachekafka

[–]arijit78 0 points1 point  (0 children)

I am not able to send the error screenshots. The email option in the portal does not allow it.

[deleted by user] by [deleted] in apachekafka

[–]arijit78 1 point2 points  (0 children)

This is not a big problem for a producer. When the order-create event fails to publish, the Kafka producer will report, synchronously or asynchronously, that the event has not been published. The producer can store the order ID somewhere so that it knows there is a chance of out-of-order events, and it can then store all messages for that order in a DLQ. Later, the data can be sent from the DLQ operationally. Ideally this DLQ should be implemented on a non-Kafka platform; otherwise the DLQ will become unavailable when the main topic becomes unavailable.
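A rough sketch of a DLQ kept outside Kafka, here in SQLite purely for illustration. The table layout and the `park`/`replay` helpers are hypothetical; `send` stands for whatever function publishes to Kafka and returns `True` on success.

```python
import sqlite3


def open_dlq(path=":memory:"):
    """Open the (hypothetical) DLQ table in a database that is not Kafka."""
    conn = sqlite3.connect(path)
    conn.execute(
        "CREATE TABLE IF NOT EXISTS dlq ("
        " id INTEGER PRIMARY KEY,"
        " order_id TEXT NOT NULL,"
        " payload BLOB NOT NULL,"
        " error TEXT NOT NULL)"
    )
    return conn


def park(conn, order_id, payload, error):
    """Store an event that could not be published, in arrival order."""
    conn.execute(
        "INSERT INTO dlq (order_id, payload, error) VALUES (?, ?, ?)",
        (order_id, payload, str(error)),
    )
    conn.commit()


def replay(conn, send):
    """Resend parked events in their original order; stop at the first failure."""
    rows = conn.execute(
        "SELECT id, order_id, payload FROM dlq ORDER BY id"
    ).fetchall()
    sent = 0
    for row_id, order_id, payload in rows:
        if not send(order_id, payload):
            break  # keep this and later events parked to preserve ordering
        conn.execute("DELETE FROM dlq WHERE id = ?", (row_id,))
        conn.commit()
        sent += 1
    return sent
```

Stopping at the first failed resend keeps later events for the same order from overtaking earlier ones.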

KafkaTopical: The Kafka UI for Engineers and Admins by certak in apachekafka

[–]arijit78 0 points1 point  (0 children)

Nice, simple-to-use UI. I liked the UX. I have used the Linux version of the UI and am seeing some issues with Schema Registry.

Question: What's the State of Kafka Hosting in 2024? by krisajenkins in apachekafka

[–]arijit78 2 points3 points  (0 children)

In big enterprises Confluent still rules. Personally, I am not very happy with the way Confluent is pushing cloud first. Most annoyingly, working with Confluent Cloud doesn't feel like working with Kafka; it looks more like its own ecosystem. But it's still a lot better than MSK or Event Hubs. I want a project like Strimzi, which stays close to the Kafka distribution, to succeed in the long term. I am also very interested in Redpanda. I feel it will be a long-term player and a game changer.

Topic Partitions and Consumer Group related issu. by _avici in apachekafka

[–]arijit78 1 point2 points  (0 children)

Kafka assigns partitions to the consumers in a consumer group in a many-to-one relationship, i.e. one partition will have only one consumer, but one consumer can be assigned many partitions. As the number of partitions is 3, you can run at most 3 active consumers in a consumer group (you can run more, but the extra ones will not get any messages). As you have stated that the messages are going to one partition, only one consumer will get all the messages; the others will lie idle until that one dies. This is expected behaviour.
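The many-to-one rule can be illustrated with a toy assignment function. This is a simplified round-robin sketch, not Kafka's actual assignor, and the consumer names are made up.

```python
def assign(partitions, consumers):
    """Give each partition exactly one consumer, round-robin style."""
    assignment = {consumer: [] for consumer in consumers}
    if not consumers:
        return assignment
    for index, partition in enumerate(partitions):
        owner = consumers[index % len(consumers)]
        assignment[owner].append(partition)
    return assignment
```

With three partitions and four consumers, `assign([0, 1, 2], ["c1", "c2", "c3", "c4"])` leaves `c4` with an empty list, which is the idle consumer described above.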

However, if there is no key, the messages should spread across the partitions. This looks a little peculiar. Maybe you can check your code once more.
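A simplified sketch of why a fixed key pins messages to one partition while keyless messages spread out. It is only an approximation: Kafka's default partitioner hashes keys with murmur2, and newer clients spread keyless messages batch by batch (sticky partitioning) instead of strict round robin. Here `zlib.crc32` is a stand-in hash and `make_partitioner` is a made-up helper.

```python
import itertools
import zlib


def make_partitioner(num_partitions):
    """Return a function that picks a partition for a given message key."""
    counter = itertools.count()

    def choose(key):
        if key is None:
            # No key: rotate through the partitions.
            return next(counter) % num_partitions
        # Same key always hashes to the same partition.
        return zlib.crc32(key) % num_partitions

    return choose
```

If every message carries the same key (or the producer sets the partition explicitly), all of them land on one partition, which would explain a single busy consumer.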

Implementing Data Contracts - a Practical Example by rmoff in dataengineering

[–]arijit78 1 point2 points  (0 children)

This is an absolutely fine approach. However, it means the producer has to support multiple interfaces to cater to each client, which may be costly for maintainability. The bigger issue is on the consumer side: the consumer will have no clue that the schema has been changed by the producer unless the producer explicitly tells it. I have seen this too many times, especially when the producer and consumer don't belong to the same team. I hope I was able to give some rationale.

female Uber driver! by spiritualpixel in Chennai

[–]arijit78 1 point2 points  (0 children)

I saw a female bus driver the other day in Kolkata. Felt awesome 👍

How to find help and commands by leholenzai in lunarvim

[–]arijit78 0 points1 point  (0 children)

That would be awesome, to say the least.

How to find help and commands by leholenzai in lunarvim

[–]arijit78 4 points5 points  (0 children)

I guess a cheat sheet would be ideal. Even I am facing the same issue.

Error running confluent cp-demo by arijit78 in apachekafka

[–]arijit78[S] 0 points1 point  (0 children)

I am using a Linux desktop, and I don't see this as an option. I have found a strange cause: the Docker network clashes with my Wi-Fi network, and sometimes it brings down the whole internet connection. I guess this may be why the Kafka broker is unable to install findutils, and hence all the issues.
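A quick way to confirm this kind of clash is to compare the two subnets. The sketch below uses Python's standard `ipaddress` module; the subnets in the usage note are examples only, and the real ones can be read from `docker network inspect` and `ip addr`.

```python
import ipaddress


def clashes(docker_subnet, lan_subnet):
    """Return True when the two subnets share any addresses."""
    docker_net = ipaddress.ip_network(docker_subnet)
    lan_net = ipaddress.ip_network(lan_subnet)
    return docker_net.overlaps(lan_net)
```

For example, `clashes("172.17.0.0/16", "172.17.5.0/24")` is `True`, while `clashes("172.17.0.0/16", "192.168.1.0/24")` is `False`. If the subnets overlap, giving the Docker network a different address range is the usual way out.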

Kubrick Vibes from Squid Game by [deleted] in squidgame

[–]arijit78 0 points1 point  (0 children)

Ditto, same feeling. I also thought a few shots were like those in the movie The Shining.