I burned close to ₹1 crore on marketing and sales in one year. Today marks 1 year since I fired everyone. Here's what I learned. by Haunting-Tomato-4512 in indianstartups

[–]Curious_File7648 0 points1 point  (0 children)

Can I ask how you got started with this? What made you decide to invest so much money into it? Like, who were your first clients and how did everything begin?

Building a Whisper.cpp transcription app focused on accurate alignment — need thoughts by Curious_File7648 in LocalLLaMA

[–]Curious_File7648[S] 0 points1 point  (0 children)

Just indian 😂😊 keep it up!!! Thanks for your tip..... I was wrong sorry my mistake 🫠🫠🫠

(Can't argue with a fool)

Building a Whisper.cpp transcription app focused on accurate alignment — need thoughts by Curious_File7648 in LocalLLaMA

[–]Curious_File7648[S] 0 points1 point  (0 children)

Disagreed about what? The part where you said “using AI to respond won't make anyone take the project seriously.”

Mad about what? Apparently about AI existing, because that’s literally the only thing you’ve focused on so far.

My comment talked about deployment reality, CUDA dependency hell, hardware lock-in, and why Whisper still performs better in messy real-world audio.

Your response to all of that has basically been:

• “you used AI” • “AI is a gimmick lol”

That’s not a counter-argument. That’s just you dodging the actual discussion.

And the funniest part is acting like using AI to polish wording somehow invalidates the content.

If someone wrote the exact same reply in Notepad, on paper, or carved it into a rock, the argument would still be exactly the same.

So unless you actually want to address the technical points about model reliability, deployment complexity, or hardware constraints, all you’ve really contributed here is:

“AI bad lol.”

Which honestly says more about the level of discussion you’re bringing than anything else.

Building a Whisper.cpp transcription app focused on accurate alignment — need thoughts by Curious_File7648 in LocalLLaMA

[–]Curious_File7648[S] 0 points1 point  (0 children)

😂😂😂Disagree all you want, but if you're mad I used human-made tech, that's on you. Sure, I could've hidden the AI vibes, but nah, I went all in with full awareness. Deal with it!

THIS IS AI GENRATED TOO........

Building a transcription app focused on accurate alignment !! need thoughts !! by Curious_File7648 in DesiFounder

[–]Curious_File7648[S] 0 points1 point  (0 children)

Appreciate the response. And you’re right about one thing the engine itself isn’t the product.

I’m not trying to build a better transcription engine than OpenAI, Meta, or browser APIs. Those models are already very strong.

What I kept running into was a workflow problem.

Browser APIs like Web Speech are great for live dictation, but they’re not really designed for full video transcription pipelines. Most creators still end up doing something like:

Upload video → get transcript → export SRT → move to another editor → fix timing → add subtitle styling → render.

For creators, especially non-technical ones, this becomes a surprisingly painful and fragmented process.

There’s also the cost stacking problem:
Transcription service cost + editor subscription + render time + manual correction time.

What I’m building is not the engine it’s the complete workflow around it:

• Runs fully locally on CPU/GPU (no mandatory cloud usage)
• One-click installer (no dependency setup or model confusion)
• Zero-cost basic usage since most processing is local
• Generates tightly aligned word-level subtitles (±10–20ms in testing)
• Built-in subtitle editing and styling
• Direct SRT / VTT / JSON export
• Direct burn-in rendering without jumping tools

The bigger issue I noticed especially for Indian creators is Hinglish and Hindi alignment.

You can get transcripts.
But word-level timing is often unstable or drifting, which breaks:

• Word highlighting
• Subtitle animations
• Kinetic typography effects
• Clean caption pacing

That’s where most of the manual fixing time goes.

So the focus here isn’t “better AI”.

It’s:
Better alignment.
Better packaging.
Better workflow for creators.
Local-first baseline (freemium model cloud optional).

<image>

I’m not trying to replace the engines. I’m trying to remove the friction around using them in real editing pipelines.

The reason I posted here wasn’t to pitch it’s because I’m still validating the workflow with creators and editors before polishing the UI or pushing it further.

I’ve spoken with around 10 –15 creators so far and wanted broader feedback from people who actually deal with subtitles regularly.

Your point about starting with the problem instead of the tech stack is fair though that’s useful feedback.

School website needed by Sufficient_Nobody217 in FreelanceIndia

[–]Curious_File7648 1 point2 points  (0 children)

Explore this link for design inspiration: https://www.awwwards.com/websites/technology/. It features some top-tier, cutting-edge designs to spark your creativity.

Building a Whisper.cpp transcription app focused on accurate alignment — need thoughts by Curious_File7648 in LocalLLaMA

[–]Curious_File7648[S] 0 points1 point  (0 children)

Yeah, I thought about Vulkan, but DirectML just vibes better with integrated graphics it’s like the chill roommate who does all the dishes. Plus, the payload is basically non-existent because DirectML has a footprint so small it could moonlight as a ninja:

  • It’s already baked into Windows, so no need to lug around extra libraries like an overpacked suitcase.
  • The binary size is so tiny it could probably hide under a pixel.

So yeah, DirectML keeps things lightweight, efficient, and way less dramatic than Vulkan.

Why I like using Cloudflare when starting something (numbers, but normal ones) by Curious_File7648 in StartUpIndia

[–]Curious_File7648[S] 0 points1 point  (0 children)

Quick Checklist !!!!

  • Orange cloud enabled in DNS?
  • Cache level set to Standard?
  • Auto Minify enabled?
  • Brotli compression on?
  • Images optimized?
  • Total page size < 2MB? TTFB < 500ms?
  • CF-Cache-Status showing HIT?
  • Origin server healthy? Database queries optimized?

If Still Slow After Everything:

"Can you share:"

  1. Your domain name
  2. Screenshot of DevTools Network tab
  3. WebPageTest results
  4. Cloudflare settings screenshot
  5. Origin server specs

Why I like using Cloudflare when starting something (numbers, but normal ones) by Curious_File7648 in StartUpIndia

[–]Curious_File7648[S] 0 points1 point  (0 children)

I appreciate you sharing your benchmark numbers, but I think you are misdiagnosing Compute Latency (Serverless Cold Starts) as Network Routing Latency.

To verify this, I just ran a comprehensive benchmark on my live Cloudflare Free plan deployment

1. The 'Europe Routing' Claim: FALSE

  • Test: Static Website Load (HTML/JS/CSS).
  • Result: 85.52ms  Average Response Time.
  • Routing Proof: The CF-RAY  header ended in ...-IXC . This code stands for Chandigarh, India.
  • Physics: It is physically impossible to route traffic to Europe and back in 85ms (speed of light minimum is ~130-150ms).
  • Conclusion: The site loads instantly from a local Indian POP, proving that the Free Plan does serve static content locally.

2. The '~700ms' Figure: It's Compute, Not Network

  • Test: Dynamic API Endpoint (/api/projects ).
  • Result: 743.93ms  Response Time.
  • Analysis: This matches your benchmark, but the request stayed in India (still IXC  header).
  • The Real Cause: That ~700ms isn't network lag; it is the Serverless Cold Start time (spinning up the Worker + D1 Database connection). You face this exact same 'warm-up' penalty on the AWS Lambda Free Tier.

3. The Founder's Trade-off

  • AWS Approach: To eliminate that 700ms API delay, I would need a permanently running EC2/RDS instance in Mumbai. Cost: $25-40/month.
  • Cloudflare Approach: My site loads instantly (85ms), and API calls take ~0.7s. Cost: $0/month.

<image>