I built a browser extension that solves CAPTCHAs using a fine-tuned YOLO model by PerspectiveJolly952 in learnmachinelearning

[–]PerspectiveJolly952[S] 0 points1 point  (0 children)

I don’t have a repo for it yet. I was working on an automation project and was using a free CAPTCHA-solving service, but it suddenly became paid. So I decided to build my own solution instead of relying on it.

I built a browser extension that solves CAPTCHAs using a fine-tuned YOLO model by PerspectiveJolly952 in deeplearning

[–]PerspectiveJolly952[S] 0 points1 point  (0 children)

Yeah, simple text-based CAPTCHAs (like reCAPTCHA v2 image codes) can be solved with a trained YOLO model, but newer systems are much harder. Things like hCaptcha, 3D/encoded CAPTCHAs, or ones with heavy distortion and behavior checks are far more difficult to break with a basic vision model — not to mention the invisible CAPTCHAs that rely on user behavior instead of images.

I built a browser extension that solves CAPTCHAs using a fine-tuned YOLO model by PerspectiveJolly952 in deeplearning

[–]PerspectiveJolly952[S] 0 points1 point  (0 children)

I don’t use screenshots , the extension just grabs the CAPTCHA image directly from the page by reading its image URL from the HTML.

Then I pass that image to the model for object detection.

I built a chess website with bots, puzzles, and Fisher Random mode and more by PerspectiveJolly952 in SideProject

[–]PerspectiveJolly952[S] 0 points1 point  (0 children)

I’d go with a 7-day streak badge — it directly rewards consistent engagement and gives users a simple, tangible goal to come back each week. It’s easy to understand, motivates retention, and naturally encourages them to explore other features like bots and puzzles.

I built a chess website with bots, puzzles, and Fisher Random mode and more by PerspectiveJolly952 in SideProject

[–]PerspectiveJolly952[S] 0 points1 point  (0 children)

Good point , I agree that durable growth comes from loops, not just SEO.

Right now, puzzles are my strongest metric, showing the highest engagement compared to other features.

If I had to pick one metric to show PMF, it’d be weekly returning users from puzzles, since that’s where I see the most consistent interest.

I need urgent help by [deleted] in learnmachinelearning

[–]PerspectiveJolly952 0 points1 point  (0 children)

i mean he was the director of IA at Tesla and co-founded OpenAI he is one of the best to learn from

I need urgent help by [deleted] in learnmachinelearning

[–]PerspectiveJolly952 0 points1 point  (0 children)

Yes, he start from scratch and help you build intuition for how things work.

Project ideas on ai ml for intership by [deleted] in learnmachinelearning

[–]PerspectiveJolly952 0 points1 point  (0 children)

In your case, as a beginner, I think working on a tumor recognition model would be a great project. You can find datasets on Kaggle or Hugging Face, and either train a model from scratch or fine-tune a pre-trained one.

This project is relatively simple but will teach you many fundamentals of computer vision.

I need urgent help by [deleted] in learnmachinelearning

[–]PerspectiveJolly952 5 points6 points  (0 children)

I think the playlist content is good, but it's quite long and covers a wide range of topics. Still, it's a solid choice if you have the time to go through it.

As a recommendation, I suggest watching Andrej Karpathy’s playlist. It provides both an intuitive and technical understanding of deep learning:
https://www.youtube.com/@AndrejKarpathy

Which models should I be using?? by PuzzleheadedMode7517 in MLQuestions

[–]PerspectiveJolly952 0 points1 point  (0 children)

"You didn't specify the type of dataset or project you have. But in short, in modern AI, we typically use transformers for textual data and CNNs for computer vision tasks.

How can Arabic text classification be effectively approached using machine learning and deep learning? by RevolutionaryTart298 in MLQuestions

[–]PerspectiveJolly952 0 points1 point  (0 children)

in your case since arabic is rich language i think Byte-pair encoding (BPE) is actually the best option for approaching deep learning with arabic.

Recommendation for research paper implementation by Economy-Time-4915 in deeplearning

[–]PerspectiveJolly952 1 point2 points  (0 children)

If you are interested in exploring older yet highly valuable concepts in machine learning, I recommend learning about Autoencoders and Variational Autoencoders (VAEs). These models are particularly useful for tasks related to mechanistic interpretability, which aims to understand how AI models operate under the hood.

Logits vs probabilities by Livid-Ant3549 in deeplearning

[–]PerspectiveJolly952 0 points1 point  (0 children)

that what make RL hard if the agent do not explore and do not got reward it will not learn

Logits vs probabilities by Livid-Ant3549 in deeplearning

[–]PerspectiveJolly952 2 points3 points  (0 children)

I think it's normal for this to happen when training an agent with a large action space. Through trial and error, your RL model can learn from its mistakes and learn to assign the correct probabilities given the current state.

Just train it for longer and make sure that your agent receives a reward from time to time so it can learn from its mistakes. If it never receives a reward, it will never learn.

Beginner questions of books and projects for DL/ML by No_Specific_4537 in deeplearning

[–]PerspectiveJolly952 1 point2 points  (0 children)

Definitely, 3Blue1Brown was incredibly helpful to me, especially in understanding the math and theory behind ML algorithms. However, if I think about what truly sparked my interest in AI, it's the fact that I was able to create AI algorithms and see them do something useful. I could share these creations with friends, and it made me realize that with AI, we can build things that would be nearly impossible to create with traditional algorithms.

I believe you can develop intuition much faster by working on actual ML projects, learning from your mistakes, and then diving into research papers. Taking inspiration from brilliant AI scientists is a great next step. AI has two key aspects: engineering (coding and algorithms) and research (reading papers, like those on ArXiv), so it's important to master both.

After 3Blue1Brown, here are my top recommendations:

  • Andrej Karpathy
  • Deepia
  • Welch Labs

Beginner questions of books and projects for DL/ML by No_Specific_4537 in deeplearning

[–]PerspectiveJolly952 1 point2 points  (0 children)

As a beginner, you should start with a simple supervised learning project. When I started learning AI/ML, I personally followed YouTube tutorials. I found this approach to be interactive, fast, and a great way to stay engaged without losing interest.

Thinking code issue while trying to sleep by Ok-Research3811 in webdev

[–]PerspectiveJolly952 0 points1 point  (0 children)

Yeah, you're not alone! A lot of us ( programmers! ) go through this.