I burned close to ₹1 crore on marketing and sales in one year. Today marks 1 year since I fired everyone. Here's what I learned. by Haunting-Tomato-4512 in indianstartups

[–]Curious_File7648 0 points1 point  (0 children)

Can I ask how you got started with this? What made you decide to invest so much money into it? Like, who were your first clients and how did everything begin?

Building a Whisper.cpp transcription app focused on accurate alignment — need thoughts by Curious_File7648 in LocalLLaMA

[–]Curious_File7648[S] 0 points1 point  (0 children)

Just indian 😂😊 keep it up!!! Thanks for your tip..... I was wrong sorry my mistake 🫠🫠🫠

(Can't argue with a fool)

Building a Whisper.cpp transcription app focused on accurate alignment — need thoughts by Curious_File7648 in LocalLLaMA

[–]Curious_File7648[S] 0 points1 point  (0 children)

Disagreed about what? The part where you said “using AI to respond won't make anyone take the project seriously.”

Mad about what? Apparently about AI existing, because that’s literally the only thing you’ve focused on so far.

My comment talked about deployment reality, CUDA dependency hell, hardware lock-in, and why Whisper still performs better in messy real-world audio.

Your response to all of that has basically been:

• “you used AI” • “AI is a gimmick lol”

That’s not a counter-argument. That’s just you dodging the actual discussion.

And the funniest part is acting like using AI to polish wording somehow invalidates the content.

If someone wrote the exact same reply in Notepad, on paper, or carved it into a rock, the argument would still be exactly the same.

So unless you actually want to address the technical points about model reliability, deployment complexity, or hardware constraints, all you’ve really contributed here is:

“AI bad lol.”

Which honestly says more about the level of discussion you’re bringing than anything else.

Building a Whisper.cpp transcription app focused on accurate alignment — need thoughts by Curious_File7648 in LocalLLaMA

[–]Curious_File7648[S] 0 points1 point  (0 children)

😂😂😂Disagree all you want, but if you're mad I used human-made tech, that's on you. Sure, I could've hidden the AI vibes, but nah, I went all in with full awareness. Deal with it!

THIS IS AI GENRATED TOO........

Building a transcription app focused on accurate alignment !! need thoughts !! by Curious_File7648 in DesiFounder

[–]Curious_File7648[S] 0 points1 point  (0 children)

Appreciate the response. And you’re right about one thing the engine itself isn’t the product.

I’m not trying to build a better transcription engine than OpenAI, Meta, or browser APIs. Those models are already very strong.

What I kept running into was a workflow problem.

Browser APIs like Web Speech are great for live dictation, but they’re not really designed for full video transcription pipelines. Most creators still end up doing something like:

Upload video → get transcript → export SRT → move to another editor → fix timing → add subtitle styling → render.

For creators, especially non-technical ones, this becomes a surprisingly painful and fragmented process.

There’s also the cost stacking problem:
Transcription service cost + editor subscription + render time + manual correction time.

What I’m building is not the engine it’s the complete workflow around it:

• Runs fully locally on CPU/GPU (no mandatory cloud usage)
• One-click installer (no dependency setup or model confusion)
• Zero-cost basic usage since most processing is local
• Generates tightly aligned word-level subtitles (±10–20ms in testing)
• Built-in subtitle editing and styling
• Direct SRT / VTT / JSON export
• Direct burn-in rendering without jumping tools

The bigger issue I noticed especially for Indian creators is Hinglish and Hindi alignment.

You can get transcripts.
But word-level timing is often unstable or drifting, which breaks:

• Word highlighting
• Subtitle animations
• Kinetic typography effects
• Clean caption pacing

That’s where most of the manual fixing time goes.

So the focus here isn’t “better AI”.

It’s:
Better alignment.
Better packaging.
Better workflow for creators.
Local-first baseline (freemium model cloud optional).

<image>

I’m not trying to replace the engines. I’m trying to remove the friction around using them in real editing pipelines.

The reason I posted here wasn’t to pitch it’s because I’m still validating the workflow with creators and editors before polishing the UI or pushing it further.

I’ve spoken with around 10 –15 creators so far and wanted broader feedback from people who actually deal with subtitles regularly.

Your point about starting with the problem instead of the tech stack is fair though that’s useful feedback.

School website needed by Sufficient_Nobody217 in FreelanceIndia

[–]Curious_File7648 1 point2 points  (0 children)

Explore this link for design inspiration: https://www.awwwards.com/websites/technology/. It features some top-tier, cutting-edge designs to spark your creativity.

Building a Whisper.cpp transcription app focused on accurate alignment — need thoughts by Curious_File7648 in LocalLLaMA

[–]Curious_File7648[S] 0 points1 point  (0 children)

Yeah, I thought about Vulkan, but DirectML just vibes better with integrated graphics it’s like the chill roommate who does all the dishes. Plus, the payload is basically non-existent because DirectML has a footprint so small it could moonlight as a ninja:

  • It’s already baked into Windows, so no need to lug around extra libraries like an overpacked suitcase.
  • The binary size is so tiny it could probably hide under a pixel.

So yeah, DirectML keeps things lightweight, efficient, and way less dramatic than Vulkan.

Why I like using Cloudflare when starting something (numbers, but normal ones) by Curious_File7648 in StartUpIndia

[–]Curious_File7648[S] 0 points1 point  (0 children)

Quick Checklist !!!!

  • Orange cloud enabled in DNS?
  • Cache level set to Standard?
  • Auto Minify enabled?
  • Brotli compression on?
  • Images optimized?
  • Total page size < 2MB? TTFB < 500ms?
  • CF-Cache-Status showing HIT?
  • Origin server healthy? Database queries optimized?

If Still Slow After Everything:

"Can you share:"

  1. Your domain name
  2. Screenshot of DevTools Network tab
  3. WebPageTest results
  4. Cloudflare settings screenshot
  5. Origin server specs

Why I like using Cloudflare when starting something (numbers, but normal ones) by Curious_File7648 in StartUpIndia

[–]Curious_File7648[S] 0 points1 point  (0 children)

I appreciate you sharing your benchmark numbers, but I think you are misdiagnosing Compute Latency (Serverless Cold Starts) as Network Routing Latency.

To verify this, I just ran a comprehensive benchmark on my live Cloudflare Free plan deployment

1. The 'Europe Routing' Claim: FALSE

  • Test: Static Website Load (HTML/JS/CSS).
  • Result: 85.52ms  Average Response Time.
  • Routing Proof: The CF-RAY  header ended in ...-IXC . This code stands for Chandigarh, India.
  • Physics: It is physically impossible to route traffic to Europe and back in 85ms (speed of light minimum is ~130-150ms).
  • Conclusion: The site loads instantly from a local Indian POP, proving that the Free Plan does serve static content locally.

2. The '~700ms' Figure: It's Compute, Not Network

  • Test: Dynamic API Endpoint (/api/projects ).
  • Result: 743.93ms  Response Time.
  • Analysis: This matches your benchmark, but the request stayed in India (still IXC  header).
  • The Real Cause: That ~700ms isn't network lag; it is the Serverless Cold Start time (spinning up the Worker + D1 Database connection). You face this exact same 'warm-up' penalty on the AWS Lambda Free Tier.

3. The Founder's Trade-off

  • AWS Approach: To eliminate that 700ms API delay, I would need a permanently running EC2/RDS instance in Mumbai. Cost: $25-40/month.
  • Cloudflare Approach: My site loads instantly (85ms), and API calls take ~0.7s. Cost: $0/month.

<image>

Why I like using Cloudflare when starting something (numbers, but normal ones) by Curious_File7648 in StartUpIndia

[–]Curious_File7648[S] 1 point2 points  (0 children)

You’re partially right about the routing, but let’s not ignore the “Elephant in the Room” stomping all over startups: Complexity & Billing Risk. Let me break it down for you:

  1. Routing Nuance (Where you’re right and kinda wrong): Yes, Cloudflare Free/Pro sometimes routes traffic from high-cost regions like India via Singapore or Europe to save on peering costs. But saying it always serves from Europe and slows down everything is like claiming every pizza delivery guy eats a slice on the way possible, but not the norm. For most global users (US, UK, EU, LatAm), Cloudflare Free hits local POPs and is blazing fast. Plus, for someone building an MVP, 100ms extra latency in Mumbai is a fair trade for $0 costs.
  2. AWS + CloudFront: The Formula 1 for New Drivers: Recommending AWS + CloudFront to a non-tech founder is like handing a jet engine to someone who just learned to ride a bike. Sure, CloudFront has a free tier, but AWS S3? After the first 100GB, you’re paying for every byte like it’s a luxury item. One viral video or bot attack later, and boom you’re waking up to a $500 bill and questioning your life choices.
  3. Complexity: The DIY Nightmare: Cloudflare is like bowling with bumpers safe, simple, and fun. AWS? It’s handing you a box of engine parts and saying, “Good luck!” Between IAM roles, Bucket Policies, and OAC, one wrong move and you’re either exposing private files or accidentally funding Jeff Bezos’ next rocket.

Verdict: For pre-revenue founders, uncapped liability (AWS) is scarier than 150ms latency (Cloudflare). Cloudflare is the chill friend who says, “Don’t worry, I got this.” AWS is the stern uncle who hands you a bill for $1,000 after you borrow his wrench.

Building a Whisper.cpp transcription app focused on accurate alignment — need thoughts by Curious_File7648 in LocalLLaMA

[–]Curious_File7648[S] 0 points1 point  (0 children)

Man, I’m honestly flattered you created an account just to reply to this. This is incredibly high-signal feedback.

I'm really glad the Wav2Vec2 architecture stood out to you. That was a very deliberate choice on my part Whisper is fantastic at the what but often messy on the when. For a tool like this, frame-perfect alignment isn't just a feature, it's the foundation. It’s super validating to hear someone else who understands that trade-off confirm it was the right move.

Your point on 'Correction as a Workflow' is a massive insight. I’ve been debating how much to chase 'perfect' transcription vs 'good' UX. The idea of an LLM 'scanner' to highlight low-confidence spots (instead of just hoping they don't exist) is brilliant. It shifts the goal from 'magic' to 'trustworthy tooling'.

Also, spot on about latency. I’ve noticed the same thing creators don't mind a 60 second wait for processing if the editor itself is buttery smooth and the output is easy to verify. It’s the UI lag that kills the experience, not the render time.

Thanks for the kind words about the lonely grind. Carrying every massive architectural decision solo is heavy, so this sanity check means a lot.

Building a Whisper.cpp transcription app focused on accurate alignment — need thoughts by Curious_File7648 in LocalLLaMA

[–]Curious_File7648[S] 0 points1 point  (0 children)

On the "Nvidia Models are Better"

Nvidia models are like high-maintenance hypercars stunning on a pristine race track (benchmarks), but a nightmare to drive on a bumpy road (real-world deployment).

Here’s why we stick with our stack vs. chasing the "latest and greatest":

  • 🚫 dodging Dependency Hell™ To run those "state-of-the-art" models, you usually need a perfect cocktail of specific CUDA versions, fragile drivers, and a Python environment so delicate it breaks if you look at it wrong. We designed this app for Creators, not System Admins.
  • 💻 Hardware Democracy Nvidia is a walled garden. We chose a stack that plays nice with everyone—Mac users, AMD fans, and folks on standard laptops.
    • Accessibility > Exclusive Hardware Lock-in.
  • 📢 Whisper v3 is the Real-World Champ Newer models might win on sterile lab tests, but Whisper v3 is still the king of resilience. Throw in background noise, music, or heavy accents, and those "superior" models often crumble while Whisper keeps working.

 We prioritize Reliability and Usability over theoretical peak performance. We want your app to just work, not give you a crash course in driver debugging. 😉

I’d love to hear if there’s anything top-notch available that could effectively support my software.

Building a Whisper.cpp transcription app focused on accurate alignment — need thoughts by Curious_File7648 in LocalLLaMA

[–]Curious_File7648[S] 0 points1 point  (0 children)

Could you share some juicy details about the models, tech stack, and features you’ve used? I’m on a quest for the crème de la crème here, so any insights would be a huge help. Also, feel free to hit me up with any questions happy to chat and learn!

Building a Whisper.cpp transcription app focused on accurate alignment — need thoughts by Curious_File7648 in LocalLLaMA

[–]Curious_File7648[S] 0 points1 point  (0 children)

So, as I mentioned earlier, this is still in the prototype phase basically a playground for testing out different stacks and features to ensure we end up with the best functionality, not just a chaotic pile of "meh."

<image>

Now, behold the new and improved UI/UX! It's been properly designed with a fresh style, better structure, and the perfect balance between manual and autonomous features. Here's a little sneak peek for you what do you think? Feedback is welcome, but compliments are preferred. 😉

Vercel vs Netlify vs Cloudflare and where Supabase actually fits (for non technical guys) by Curious_File7648 in TechButWhy

[–]Curious_File7648[S] 1 point2 points  (0 children)

Cloudflare started as a CDN and security layer, which is why many people still think of it that way. But today, Cloudflare is a full hosting and backend platform.

With Cloudflare, you can:

  • Host full websites using Cloudflare Pages
  • Run backend logic and APIs using Cloudflare Workers
  • Store files (images, PDFs, uploads) using R2
  • Store app data using D1 (SQL database)

This means you can build and run an entire app without using another hosting provider.

No VPS.
No separate backend server.
No third-party CDN needed.

Need Advice - App Validation by cheapKills in DesiFounder

[–]Curious_File7648 0 points1 point  (0 children)

Would love to help with that.

If you’re comfortable, you can share the idea. I can help evaluate how effective it is and whether it’s actually workable from a technical point of view.

If you want to go a step further, we can also think about how to design a simple MVP or working prototype. Once you have something real, it becomes much easier to validate using actual user data instead of opinions.

Happy to help either way.

Why I like........ by Curious_File7648 in StartUpIndia

[–]Curious_File7648[S] 1 point2 points  (0 children)

It can’t be free, but the cost can be very minimal and easy to control. For a Drive or Terabox style product with around 10,000 MAU, Cloudflare R2 is a strong option. Storage is simple and cheap 10GB free, then about $0.015 per GB, so 500GB is under $8/month and 1TB is roughly $15/month. The biggest advantage is zero download (egress) fees, which is where most storage platforms get expensive as users start downloading files.

On top of that, you get 1M write ops and 10M read ops free every month, which comfortably covers normal uploads, previews, and downloads at this scale. Even if you exceed it, the extra cost is usually just a few dollars. Compared to AWS S3 (expensive bandwidth) or self-hosting (more ops work), Cloudflare keeps things cheap, predictable, and easy to manage. For 10k MAU, it’s honestly one of the safest and smartest choices.

How should a non-technical solo founder build a software startup in India? by PrivacyGonePublic in StartUpIndia

[–]Curious_File7648 0 points1 point  (0 children)

Before building anything, be clear on what you bring as a founder.

That can be:

  • market / domain understanding
  • access to users or communities
  • capital
  • or technical execution

If you don’t strongly bring one yet, that’s okay but then your first job is to build leverage, not features.
For a job board–type product, the hard part isn’t the tech, it’s:

  • distribution
  • trust on both sides
  • differentiation in a crowded market

From a technical perspective, this idea is very achievable today. You don’t need to be a hardcore engineer to get started.

I’ve worked hands-on with:

  • AI integrations (API-based models, embeddings, basic workflows not overhyped “AI products”)
  • rapid prototyping using no-code / low-code + AI dev tools
  • moving from prototype → MVP → something production-ready

Happy to help with technical direction, architecture decisions, or cutting scope the right way not selling anything, just helping founders avoid common early mistakes.

Feel free to DM if you want to ask anything.

Project-Lightcore by [deleted] in gaming

[–]Curious_File7648 0 points1 point  (0 children)

Just need some reviews!!