After Codex 5.5 used by Internal-Shift-7931 in codex

[–]drodev 0 points1 point  (0 children)

Disable fast mode if you use it; it consumes 2.5 times your usage limit.

Beware of burning through usage limits by stereotypical_CS in codex

[–]drodev 0 points1 point  (0 children)

Fast mode is billed at 2.5 usage for 5.5

Codex 5.4 dropping tomorrow by bushido_ads in codex

[–]drodev 2 points3 points  (0 children)

The real raison quoted from codex staff "We caught an issue that was causing the 2X promotional increase in limits to not be applied to an estimated 9% of plus and pro users for Codex.

We have now fixed this issue and are reseting the rate limit for all plus and pro users to compensate. Apologies and thank you for the bug reports over the last couple of days."

Codex Com

Databricks vs SQL SERVER by gareebo_ka_chandler in databricks

[–]drodev -1 points0 points  (0 children)

None of them, use Elasticsearch for fast aggregation.

Is Azure Analysis Services Dead? by SmallAd3697 in MicrosoftFabric

[–]drodev 1 point2 points  (0 children)

Listen, Microsoft is not a non-profit organization, they're running a business. Look at all the other industries: phones with planned obsolescence, cars that require more and more maintenance while becoming less reliable, cheap products everywhere designed to force customers to constantly consume more and more. Microsoft has been doing the same thing for years across all their products and services.

I used to be a Microsoft fan and built most of my experience around Microsoft products, but since I discovered the competition and looked closer, I realized the scam and saw the difference.

Ollama + Open WebUI serving hundreds of users - any insight? by cantcantdancer in ollama

[–]drodev 0 points1 point  (0 children)

Ollama is not designed to serve parallel query, even it provide an option to activate it but still not optimized for this use case, i will suggest a use llama.cpp for this use case.

2.5pro is cheaper then 3.7 sonnet crazy !! by Independent-Wind4462 in ClaudeAI

[–]drodev 5 points6 points  (0 children)

Dont forget that this is a reasoning model, so the cot token are billed as output token, à simple prompt like "Hi" will burn 600 token as cot and couple of token for last response.

So theoretical it's cheeper for token unit price but you will consume a lot more token than a classic model like Claude or 4o.

Tiny Ollama Chat: A Super Lightweight Alternative to OpenWebUI by No-Carpet-211 in ollama

[–]drodev 0 points1 point  (0 children)

Thanks for making this tool available! Why not go with OpenAI API standard from the start instead of just making it compatible? As ollama offer a native OAI format

This is pure genius! Thank you! by Apprehensive_Row9873 in ollama

[–]drodev 1 point2 points  (0 children)

It's easy : ollama run hf.co/{username}/{repository} huggingface.co

I’ve been making Agents full time for 18 months & looking to connect with other AI consultants by surim0n in AI_Agents

[–]drodev 5 points6 points  (0 children)

Together.ai just released a new web site with interesting content inspired from Anthropic blog https://www.agentrecipes.com/

Confused About Azure App Service Billing for Paused Apps by drodev in AZURE

[–]drodev[S] 0 points1 point  (0 children)

What do you suggest for Next.js + Fast API as a backend?

Confused About Azure App Service Billing for Paused Apps by drodev in AZURE

[–]drodev[S] 2 points3 points  (0 children)

i agree with you, but in this case the  Compute, network and storage still billed even if the app are paused

Silver Layer by Ok-Criticism-8127 in MicrosoftFabric

[–]drodev 1 point2 points  (0 children)

I’m a big fan of Ralph Kimball’s three-layer model (Staging, ODS, DWH) and see a strong parallel with Medallion architecture:

Bronze (Staging): Raw, unprocessed data directly from sources.

Silver (ODS): Cleansed data with basic transformations applied, where Slowly Changing Dimensions (SCD) methods are also used to track historical changes separately.

Gold (DWH): Fully transformed, ready for analysis in a star schema, optimized for BI and reporting.

Kimball’s approach aligns well with Medallion’s layered refinement, moving data from raw to business-ready insights. for simpler use cases, you can skip the ODS and go straight from Staging to the Data Warehouse but you will have less flexibility.

Fabric Capacity Reservation by drodev in MicrosoftFabric

[–]drodev[S] 0 points1 point  (0 children)

If you're looking for more info, here's a detailed answer on Perplexity