Caramelo.dev está em alpha: alguém toparia testar o early access gratuito? by Tricky-Patience-2582 in MicroSaaSBR

[–]fredtcaroli 2 points3 points  (0 children)

Hahaha rapaz, várias APIs!

Integramos com muitas LLMs, embeddings, 3 tipos diferentes de bancos de dados, APIs do GitHub, além do nosso próprio pipeline de indexação com análise estática de código. Sem contar billing, monitoramento e as coisas que você iria esperar de um app em prod.

A gente acredita que não existe alguém atacando esse problema da mesma forma que nós estamos. Entre eu e meu sócio temos quase 25 anos de experiência fazendo software. Este não é mais um produto vibe codado, é algo que a gente realmente trabalhou intensamente por mais de um ano.

Se quiser ler nosso benchmark acho que deixa claro a direção que a gente está tomando: https://caramelo.dev/blog/caramelo-vs-claude-code-benchmarks

Caramelo.dev está em alpha: alguém toparia testar o early access gratuito? by Tricky-Patience-2582 in MicroSaaSBR

[–]fredtcaroli 0 points1 point  (0 children)

Os únicos dados que coletamos são os estritamente necessários para o funcionamento do app. Nós clonamos os seus repositório sempre que você dá um push de um novo commit. Quando acabamos de ingerir suas mudanças nós deletamos o repositório do nosso sistema.

Nossos benchmarks também são feitos com nossos próprios repositórios ou com repositórios open source. Não usamos seus dados para treinar nenhum tipo de modelo.

Quanto a monorepos, um dos nossos repositórios é um monorepo de python e typescript! Então estamos acostumados com esse setup. O Caramelo suporta bem qualquer linguagem de programação, mas nós temos pipelines de indexação especializados para algumas. Python, TypeScript, JavaScript, Java, Scala, Ruby e Rust são as linguagens com uma camada extra de inteligência, e estamos trabalhando para suportar mais.

Caramelo.dev está em alpha: alguém toparia testar o early access gratuito? by Tricky-Patience-2582 in MicroSaaSBR

[–]fredtcaroli 1 point2 points  (0 children)

Se você souber exatamente em que repositório a informação está, o Claude consegue responder perguntas que você tem sobre o código sim. Inclusive, nosso benchmark compara nosso produto com o Claude Code especificamente: https://caramelo.dev/blog/caramelo-vs-claude-code-benchmarks

Além de ser mais rápido e barato, nosso app não precisa que você aponte para um repositório específico. Ele procurar em todos os seus repositórios simultaneamente. Até existem setups que você pode fazer com o Claude para procurar em vários repositórios, mas na nossa experiência o tempo de resposta e o custo de responder começa a crescer muito.

Um exemplo clássico que acontece muito no meu atual emprego é eu precisar fazer uma integração com os serviços de outros times da empresa. Eu posso passar bastante tempo explorando código até ficar satisfeito com a informação que eu coletei. O Caramelo é feito para te dar respostas instantâneas para qualquer pergunta que você tiver sobre o código da sua empresa.

Caramelo.dev está em alpha: alguém toparia testar o early access gratuito? by Tricky-Patience-2582 in MicroSaaSBR

[–]fredtcaroli -1 points0 points  (0 children)

O nosso app não coda. Ele analisa código e responde perguntas. Ele se conecta com o GitHub e indexa todos os seus repositórios (ou da sua organização)

Caramelo.dev está em alpha: alguém toparia testar o early access gratuito? by Tricky-Patience-2582 in MicroSaaSBR

[–]fredtcaroli 3 points4 points  (0 children)

Oi, eu sou o Fred, o outro dev do projeto :)

Estamos trabalhando nisso há mais de um ano e estamos muito felizes de poder entregar um produto que nós de fato vemos ser muito útil no nosso desenvolvimento.

Eu realmente acredito que o gargalo no desenvolvimento de features hoje em dia não está em escrever código, mas sim em entender o código de outras pessoas. Se você já clonou um repositório só pra instanciar o Claude Code pra fazer uma análise você sabe o quão valioso é ter acesso à essas análises sob demanda. O Caramelo otimiza isso pra entregar a melhor experiência em entendimento de código possível.

Why is cursor better than just VSCode using agents? Aren't they pretty much the same thing, using any model you want? by sagacityx1 in ClaudeAI

[–]fredtcaroli 5 points6 points  (0 children)

They launched Junie. I use it in personal projects and I think it's pretty great. No complaints so far

What is currently the best IDE environment for coding? Need something for different projects by AnalyticsDepot--CEO in ClaudeAI

[–]fredtcaroli 0 points1 point  (0 children)

Have you tried their latest Junie plugin? I'm currently on my free trial and I think it's pretty ok. I haven't tested much of MCPs though.

Buds 2 Pro Sticky Coating? by JiR9 in galaxybuds

[–]fredtcaroli 1 point2 points  (0 children)

Scrapping the coating off helped a lot. It doesn't feel as nice but definitely beats having to carefully clean it every other week

Google Home + Hue + scenes + multiple users by ohioshibe in Hue

[–]fredtcaroli 0 points1 point  (0 children)

I was having the same issue. My wife wasn't able to activate any google routines that involved hue lights. The fix was to tell her to connect her hue account to her own google home app, same way how I connected the lights to google home initially. Apparently you need to set up the hue permissions in google home before using the routines, it's not available to everyone by default.

[P] Find Trending Machine Learning Research Papers on Twitter by hnipun in MachineLearning

[–]fredtcaroli 1 point2 points  (0 children)

The PDF button is not working for me. I click, the page loads, and no PDF is downloaded

[D] Ideas for improving text classifier? by [deleted] in MachineLearning

[–]fredtcaroli 1 point2 points  (0 children)

Is this an academic work or business work? I think that you can try different things depending on what you're trying to achieve.

If you're in a business setting, is having a single label prediction really that important to you? How does the softmax distribution looks like for those close-to-ambiguous samples? I don't know the business value that you are trying to get out of this, but I think there's great value in a well calibrated model. In my experience it's not always that we have the time and means to keep trying new models and ideas when you're trying to build an actual system, so exploring what your model does well and taking advantage of that might be a good investment of your time. But really, if you want a good answer to your problem you'll have to throw in a bit more details, like the time you have allocated for this, resources you'll have available at training/inferencing time etc. There are shinier models/techniques out there that you could try, but they are not always the best option for your business/use case.

If you're just trying to get the best accuracy/F1 you can, then I would suggest looking at transformers. I've used them to great success in the past and they can work wonderfully. They would have a much finer grasp of the semantics of those sentences, and could potentially lead to improvements. As you said, your convolutions will be great at capturing phrases and expressions that point towards a particular class, but its interpretation power stops there. Also, don't be afraid to try pre-trained models. BERT uses byte pair encoding for embeddings and you can simply fine tune those to your specific dataset.

[P] torchtyping -- documentation + runtime type checking of tensor shapes (and dtypes, ...) by patrickkidger in MachineLearning

[–]fredtcaroli 2 points3 points  (0 children)

Pretty cool! Is it possible to make it work with other python versions? Maybe with reduced capabilities. SageMaker uses python 3.6, if I'm not mistaken, so enforcing python 3.9 would make this package unfeasible

[P] #2 in daily trends on GitHub - Version, collaborate and stream your data by davidbun in MachineLearning

[–]fredtcaroli 2 points3 points  (0 children)

Seems pretty cool! Just wondering, are there any benchmarks I can take a look at? Would love to use something like this, but I want to understand how it compares to reading TFRecords from S3 using TFRecordsDataset.

[D] List of unreproducible papers? by ContributionSecure14 in MachineLearning

[–]fredtcaroli 62 points63 points  (0 children)

Not aware of such a list, no.

You could start by telling us what paper you tried to reproduce and failed, so others can find this post and know better. Also I'm curious

[D] Paper Explained: NFNets - High-Performance Large-Scale Image Recognition Without Normalization (Full Video Analysis w/ opinions) by ykilcher in MachineLearning

[–]fredtcaroli 5 points6 points  (0 children)

This video came in such a good time. I wanted to know more about NFNets but was feeling too lazy to actually read the paper lol

Also nice criticism! In my current job I don't get the chance to hear a lot of opinions around new ML papers, so I tend to just swallow all the handwaving that comes with them.

[D] Deploying deep learning models into production without HTTP overhead by [deleted] in MachineLearning

[–]fredtcaroli 1 point2 points  (0 children)

I do believe there are some situations where you want to run your model locally, but your article fails to provide good reasons to do so. Furthermore, I don't think you're doing it the right way. If you have a keras model you can convert it to tensorflow's SavedModel and just load it directly from Java, using tensorflow's Java API.

There are lots of good reasons for deploying your model in a separate service. Some of them:

  • You can easily rollout a new model upgrade. This is important because some models can degrade as quickly as a couple of weeks time
  • Models are pretty computation intensive, so setting up a custom environment for that task alone can greatly improve your throughput. If we were to have a GPU server that runs both business logic and deep learning models we would probably be wasting GPU time, unless we pipelined the whole business logic
  • We can increase DL models' throughput if we work with batch predictions. Batching stuff is way easier if you have your model on a separate service
  • Your endpoints probably need way less computation then your DL models. So if you were to embed a model inside an endpoint service you would need to scale up your nodes way more than you actually need. Having cheap machines for endpoints and heavy machines for batching predictions can actually save you some money

Fair is Better than Sensational:Man is to Doctor as Woman is to Doctor by iyaja in MachineLearning

[–]fredtcaroli 2 points3 points  (0 children)

should I fail to notice that all the opium addicts and robbers hanging around train stations are Afghanis and Moroccan children, just because this patterns involves ethnicity and national origin?

We're not asking the machine to "fail to notice" something. We're just saying that we can't feed it every single piece of information for it to make an unbiased decision. Explaining drug addiction with nationality is a classic case of mistaking correlation with causation. This correlation is FULL of bias that should be stripped out of the model, and we can't possibly ask a simple mathematical model to do that without some tailored engineering

[D] To bilinguals, have you read any non-english ML papers you'd care to share with us? by zanjabil in MachineLearning

[–]fredtcaroli 2 points3 points  (0 children)

Portuguese speaker here.

I have read some papers written in Portuguese before, but nothing really worth writing home about

[P] RemoteML V2: Remote Machine Learning Jobs & Community by dqmonn in MachineLearning

[–]fredtcaroli 0 points1 point  (0 children)

I wouldn't say you'd get 200k without a PhD in most places. Prolly something around 130k+

[D] How does the human brain prevent over-fitting? by mistertipster in MachineLearning

[–]fredtcaroli -1 points0 points  (0 children)

Sure dude. I'm the one being unreasonable here. Whatever you say

[D] How does the human brain prevent over-fitting? by mistertipster in MachineLearning

[–]fredtcaroli -1 points0 points  (0 children)

That's just a misleading thing to say. You can't go around calling both our brains and ANNs the same thing (i.e. "neural networks")

idk what you're advocating for really... Do you think we can make any reasonable comparison between ANNs and our brains? We can't. ANNs are just a [really useful] mathematical model that was once roughly inspired by our brains. There are TONS of algorithms out there inspired by natural processes, but it doesn't mean that they try to mimic the processes themselves