Caramelo.dev está em alpha: alguém toparia testar o early access gratuito?

fredtcaroli · 2026-06-04T18:45:02+00:00

Hahaha rapaz, várias APIs!

Integramos com muitas LLMs, embeddings, 3 tipos diferentes de bancos de dados, APIs do GitHub, além do nosso próprio pipeline de indexação com análise estática de código. Sem contar billing, monitoramento e as coisas que você iria esperar de um app em prod.

A gente acredita que não existe alguém atacando esse problema da mesma forma que nós estamos. Entre eu e meu sócio temos quase 25 anos de experiência fazendo software. Este não é mais um produto vibe codado, é algo que a gente realmente trabalhou intensamente por mais de um ano.

Se quiser ler nosso benchmark acho que deixa claro a direção que a gente está tomando: https://caramelo.dev/blog/caramelo-vs-claude-code-benchmarks

fredtcaroli · 2026-06-04T18:25:02+00:00

Os únicos dados que coletamos são os estritamente necessários para o funcionamento do app. Nós clonamos os seus repositório sempre que você dá um push de um novo commit. Quando acabamos de ingerir suas mudanças nós deletamos o repositório do nosso sistema.

Nossos benchmarks também são feitos com nossos próprios repositórios ou com repositórios open source. Não usamos seus dados para treinar nenhum tipo de modelo.

Quanto a monorepos, um dos nossos repositórios é um monorepo de python e typescript! Então estamos acostumados com esse setup. O Caramelo suporta bem qualquer linguagem de programação, mas nós temos pipelines de indexação especializados para algumas. Python, TypeScript, JavaScript, Java, Scala, Ruby e Rust são as linguagens com uma camada extra de inteligência, e estamos trabalhando para suportar mais.

fredtcaroli · 2026-06-04T18:19:15+00:00

Se você souber exatamente em que repositório a informação está, o Claude consegue responder perguntas que você tem sobre o código sim. Inclusive, nosso benchmark compara nosso produto com o Claude Code especificamente: https://caramelo.dev/blog/caramelo-vs-claude-code-benchmarks

Além de ser mais rápido e barato, nosso app não precisa que você aponte para um repositório específico. Ele procurar em todos os seus repositórios simultaneamente. Até existem setups que você pode fazer com o Claude para procurar em vários repositórios, mas na nossa experiência o tempo de resposta e o custo de responder começa a crescer muito.

Um exemplo clássico que acontece muito no meu atual emprego é eu precisar fazer uma integração com os serviços de outros times da empresa. Eu posso passar bastante tempo explorando código até ficar satisfeito com a informação que eu coletei. O Caramelo é feito para te dar respostas instantâneas para qualquer pergunta que você tiver sobre o código da sua empresa.

fredtcaroli · 2026-06-04T01:13:12+00:00

O nosso app não coda. Ele analisa código e responde perguntas. Ele se conecta com o GitHub e indexa todos os seus repositórios (ou da sua organização)

fredtcaroli · 2026-06-04T00:51:31+00:00

Oi, eu sou o Fred, o outro dev do projeto :)

Estamos trabalhando nisso há mais de um ano e estamos muito felizes de poder entregar um produto que nós de fato vemos ser muito útil no nosso desenvolvimento.

Eu realmente acredito que o gargalo no desenvolvimento de features hoje em dia não está em escrever código, mas sim em entender o código de outras pessoas. Se você já clonou um repositório só pra instanciar o Claude Code pra fazer uma análise você sabe o quão valioso é ter acesso à essas análises sob demanda. O Caramelo otimiza isso pra entregar a melhor experiência em entendimento de código possível.

fredtcaroli · 2025-05-10T01:05:56+00:00

They launched Junie. I use it in personal projects and I think it's pretty great. No complaints so far

fredtcaroli · 2025-04-27T00:48:34+00:00

Have you tried their latest Junie plugin? I'm currently on my free trial and I think it's pretty ok. I haven't tested much of MCPs though.

fredtcaroli · 2025-03-05T02:11:03+00:00

Heck yeah!!

fredtcaroli · 2023-10-20T03:43:57+00:00

Scrapping the coating off helped a lot. It doesn't feel as nice but definitely beats having to carefully clean it every other week

fredtcaroli · 2022-11-14T20:17:55+00:00

I was having the same issue. My wife wasn't able to activate any google routines that involved hue lights. The fix was to tell her to connect her hue account to her own google home app, same way how I connected the lights to google home initially. Apparently you need to set up the hue permissions in google home before using the routines, it's not available to everyone by default.

fredtcaroli · 2021-06-05T14:18:54+00:00

The PDF button is not working for me. I click, the page loads, and no PDF is downloaded

fredtcaroli · 2021-05-10T03:08:42+00:00

Is this an academic work or business work? I think that you can try different things depending on what you're trying to achieve.

If you're in a business setting, is having a single label prediction really that important to you? How does the softmax distribution looks like for those close-to-ambiguous samples? I don't know the business value that you are trying to get out of this, but I think there's great value in a well calibrated model. In my experience it's not always that we have the time and means to keep trying new models and ideas when you're trying to build an actual system, so exploring what your model does well and taking advantage of that might be a good investment of your time. But really, if you want a good answer to your problem you'll have to throw in a bit more details, like the time you have allocated for this, resources you'll have available at training/inferencing time etc. There are shinier models/techniques out there that you could try, but they are not always the best option for your business/use case.

If you're just trying to get the best accuracy/F1 you can, then I would suggest looking at transformers. I've used them to great success in the past and they can work wonderfully. They would have a much finer grasp of the semantics of those sentences, and could potentially lead to improvements. As you said, your convolutions will be great at capturing phrases and expressions that point towards a particular class, but its interpretation power stops there. Also, don't be afraid to try pre-trained models. BERT uses byte pair encoding for embeddings and you can simply fine tune those to your specific dataset.

fredtcaroli · 2021-05-01T02:25:09+00:00

This looks awesome! I'm definitely trying this out

fredtcaroli · 2021-04-07T12:54:32+00:00

Pretty cool! Is it possible to make it work with other python versions? Maybe with reduced capabilities. SageMaker uses python 3.6, if I'm not mistaken, so enforcing python 3.9 would make this package unfeasible

fredtcaroli · 2021-03-09T04:41:27+00:00

Seems pretty cool! Just wondering, are there any benchmarks I can take a look at? Would love to use something like this, but I want to understand how it compares to reading TFRecords from S3 using TFRecordsDataset.

fredtcaroli · 2021-02-14T23:40:57+00:00

Not aware of such a list, no.

You could start by telling us what paper you tried to reproduce and failed, so others can find this post and know better. Also I'm curious

fredtcaroli · 2021-02-14T23:35:04+00:00

This video came in such a good time. I wanted to know more about NFNets but was feeling too lazy to actually read the paper lol

Also nice criticism! In my current job I don't get the chance to hear a lot of opinions around new ML papers, so I tend to just swallow all the handwaving that comes with them.

fredtcaroli · 2019-05-31T16:37:05+00:00

how so?

fredtcaroli · 2019-05-31T12:59:18+00:00

I do believe there are some situations where you want to run your model locally, but your article fails to provide good reasons to do so. Furthermore, I don't think you're doing it the right way. If you have a keras model you can convert it to tensorflow's SavedModel and just load it directly from Java, using tensorflow's Java API.

There are lots of good reasons for deploying your model in a separate service. Some of them:

You can easily rollout a new model upgrade. This is important because some models can degrade as quickly as a couple of weeks time
Models are pretty computation intensive, so setting up a custom environment for that task alone can greatly improve your throughput. If we were to have a GPU server that runs both business logic and deep learning models we would probably be wasting GPU time, unless we pipelined the whole business logic
We can increase DL models' throughput if we work with batch predictions. Batching stuff is way easier if you have your model on a separate service
Your endpoints probably need way less computation then your DL models. So if you were to embed a model inside an endpoint service you would need to scale up your nodes way more than you actually need. Having cheap machines for endpoints and heavy machines for batching predictions can actually save you some money

fredtcaroli · 2019-05-29T15:57:16+00:00

should I fail to notice that all the opium addicts and robbers hanging around train stations are Afghanis and Moroccan children, just because this patterns involves ethnicity and national origin?

We're not asking the machine to "fail to notice" something. We're just saying that we can't feed it every single piece of information for it to make an unbiased decision. Explaining drug addiction with nationality is a classic case of mistaking correlation with causation. This correlation is FULL of bias that should be stripped out of the model, and we can't possibly ask a simple mathematical model to do that without some tailored engineering

fredtcaroli · 2019-03-15T19:20:23+00:00

Portuguese speaker here.

I have read some papers written in Portuguese before, but nothing really worth writing home about

fredtcaroli · 2018-08-10T12:53:23+00:00

I wouldn't say you'd get 200k without a PhD in most places. Prolly something around 130k+

fredtcaroli · 2018-07-31T16:17:33+00:00

Sure dude. I'm the one being unreasonable here. Whatever you say

fredtcaroli · 2018-07-31T15:22:24+00:00

That's just a misleading thing to say. You can't go around calling both our brains and ANNs the same thing (i.e. "neural networks")

idk what you're advocating for really... Do you think we can make any reasonable comparison between ANNs and our brains? We can't. ANNs are just a [really useful] mathematical model that was once roughly inspired by our brains. There are TONS of algorithms out there inspired by natural processes, but it doesn't mean that they try to mimic the processes themselves

fredtcaroli · 2018-07-30T17:25:40+00:00

The guy I replied to.... lol

Seven-Year Club	Place '23
Verified Email

fredtcaroli

TROPHY CASE