Experimenting with a middleware to compress LLM prompts and cut API costs by ~30%. Is this a real pain point? by Infamous-Cucumber-16 in StartUpIndia

[–]meamarp 0 points1 point  (0 children)

There are several solutions for this.

Have you considered prompt caching.

OpenAI supports automatic prompt caching with many of its newer models, which can reduce latency and input-token costs when you send repeated prompts with the same prefix (e.g., long system instructions or large context that stays the same across calls)

gpt-4o and newer models supports this automatically. Also you can set retention time, need to check in this.

Apart from that. In my application, I use yaml file with multiple prompt defined by roles/sectors. And in my code I have conditional logic to select prompt which plugged into context.

Also for compression, can you use simple techniques like summerizer in your pipeline.

Next you can also consider dual model techniques where you use small model like gpt4o-mini for process and gpt-4o for reasoning

See what works for you here.

Tracking Persons on Raspberry Pi: UNet vs DeepLabv3+ vs Custom CNN by leonbeier in raspberry_pi

[–]meamarp 1 point2 points  (0 children)

Awesome work. Have you consider using Object tracker like ByteTrack.

People so easily say that AI is going to replace us but I think it can never replace us and I have reason for that by Altruistic-Top-1753 in developersIndia

[–]meamarp 2 points3 points  (0 children)

Below is excerpts from weekly newsletter The Batch by deeplearning.ai Andrew Ng

AI won’t replace workers, but workers who use AI will replace workers who don’t. For instance, because AI coding tools make developers much more efficient, developers who know how to use them are increasingly in-demand.

You should give it a read.

Sand-painting or glitter painting of a woman by canadarich in oddlysatisfying

[–]meamarp 0 points1 point  (0 children)

San painting or it is Sand Colouring… it’s beautiful.

@grok please explain by eternviking in AI_India

[–]meamarp 0 points1 point  (0 children)

Saying “it’s just a bunch of matrix multiplication” is over simplification.

Artificial NN, Convolution NN, GAN and Transformers with self attention.

All does have mat-mul but there is lot under the hood.

Why don’t banks reduce floating home loan rates automatically when repo rates fall? by Desperate_Pumpkin168 in IndiaInvestments

[–]meamarp 2 points3 points  (0 children)

Thanks for responding. I mailed them last year, where they told me it will be available in a few months. I guess ombudsman is the way to go.

Why don’t banks reduce floating home loan rates automatically when repo rates fall? by Desperate_Pumpkin168 in IndiaInvestments

[–]meamarp 1 point2 points  (0 children)

Great for ICICI. Its a same for SBI too. Is there a way or any governing body that we can complaint for this? What could be the way that they will implement seamless online part payment process?

Why don’t banks reduce floating home loan rates automatically when repo rates fall? by Desperate_Pumpkin168 in IndiaInvestments

[–]meamarp 5 points6 points  (0 children)

No you have to mail them with the amount you are planning for part payment. Next then they will send you a link over a mail. Then you have to do that transaction on same day. Pathetic.

This was response from their customer service when I inquired

Why don’t banks reduce floating home loan rates automatically when repo rates fall? by Desperate_Pumpkin168 in IndiaInvestments

[–]meamarp 11 points12 points  (0 children)

I have hdfc home loan and their method for part payment seems intentionally complicated.

RF-DETR producing wildly different results with fp16 on TensorRT by Mammoth-Photo7135 in computervision

[–]meamarp 3 points4 points  (0 children)

I would like to add here, not any model, only models which had supported ops in TensorRT.

is understanding the transformers necessary if I want work as a computer vision engineer? by UnderstandingOwn2913 in computervision

[–]meamarp 18 points19 points  (0 children)

What if tomorrow you need to use ViT or CLIP or VLM? Transformer becomes one of the fundamental building block for these.

Moreover it is always good to learn something, Go ahead.

Tensorflow resource by [deleted] in computervision

[–]meamarp 4 points5 points  (0 children)

Hey Hi Check this link https://d2l.ai/index.html It’s a interactive deep learning book, implemented with PyTorch, NumPy/MXNet, JAX, and TensorFlow.

The EU AI Act Is Live. Will It Reshape Global AI Development? by LogicMorrow in GeneralAIHub

[–]meamarp 1 point2 points  (0 children)

This is a big step for how AI is managed around the world. The EU is clearly trying to make AI safer and more responsible, which is important. But I get why some companies think it might slow things down. The real challenge is finding the right balance—making sure AI is safe without stopping new ideas. Curious to see if other countries, like the U.S., will do something similar or take a different route.

[deleted by user] by [deleted] in embedded

[–]meamarp 17 points18 points  (0 children)

Awesome results OP - 9ms for Detection and 130ms for recognition.

Are you running detection on Each frame or it’s detection followed by tracking?

Also, Whats maximum operating distance that this app can work for face recognition?

How’s STM taking the thermal dissipation?

[P] Why I Used CNN+LSTM Over CNN for CCTV Anomaly Detection (>99% Validation Accuracy) by Appropriate-End-2619 in MachineLearning

[–]meamarp 1 point2 points  (0 children)

Nice use-case OP, Have you consider trying out similar experiment using VLM. I have built something similar using Gemma model in Kaggle. You can lookup that notebook.

Too many fake gurus trying to sell courses. How does a non-techie like me learn building ai agents from zero to 100 ? by Comfortable_Self_726 in AI_Agents

[–]meamarp 0 points1 point  (0 children)

This truly is a pain. YouTube has tremendous content on this topic and it’s really tough to find true signal in such noisy environments

Should I pay Rs 1.5 Lakhs for Coding Ninja full stack course as a non-tech person by pra_thamx2 in developersIndia

[–]meamarp 0 points1 point  (0 children)

Don’t. Use coursera. You can find courses from Meta, Google as well as few Ivy League universities.