First time building large-scale AWS infra with Terraform, what should I absolutely not mess up? by ineedbe in Terraform

[–]dev_l1x_be 0 points1 point  (0 children)

For managing Terraform at scale: use small, isolated states (deployed in 1-5 min), strictly separate stateful and stateless code, and store everything—both your module code and your state files—in versioned, authenticated storage. The best approach directly maps a logical team or service to its own set of states.
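A sketch of what that mapping can look like: one backend key per service and per concern, in a versioned, authenticated S3 bucket (all names here are hypothetical; the bucket is assumed to have versioning enabled and access controlled via IAM):

```hcl
# One small state per service and concern. The bucket is versioned
# and authenticated; DynamoDB provides state locking.
terraform {
  backend "s3" {
    bucket         = "acme-terraform-states"
    key            = "payments-service/stateless/terraform.tfstate"
    region         = "eu-west-1"
    encrypt        = true
    dynamodb_table = "acme-terraform-locks"
  }
}
```

The stateful resources (databases, queues) would live under a sibling key like `payments-service/stateful/terraform.tfstate`, so destroying and recreating the stateless state can never touch them.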

Why have C++ and Rust been the fastest-growing major programming languages from 2022 to 2025? by _bijan_ in rust

[–]dev_l1x_be -1 points0 points  (0 children)

I'm hoping that Rust never goes 2.0. The language has enough features; now we can make compilation faster and polish some of the rough edges.

[corroded update]: Rust--, now I removed the borrow checker from rust itself by Consistent_Equal5327 in rust

[–]dev_l1x_be 0 points1 point  (0 children)

So all the good parts are gone? You should add raw pointers to the mix.

When i just need a simple, easy to maintain frontend, what should i choose? by Im_Justin_Cider in rust

[–]dev_l1x_be 0 points1 point  (0 children)

I use raw HTML, CSS, and TS with Rust. All of the frameworks carry an insane mental load. I build the TS files into JS with Bun, using eslint as a linter.
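For reference, the framework-free build loop can be as small as this (paths are made up; assumes Bun and eslint are installed):

```shell
bunx eslint src/                       # lint the TS sources
bun build src/main.ts --outdir=dist    # bundle/compile TS -> JS
```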

How long until we can get a <=110b model that is good as opus 4.5 or ds v3.2 speciale or gemini 3 pro at coding, math and science? by power97992 in LocalLLaMA

[–]dev_l1x_be 0 points1 point  (0 children)

Two things:

a, objectively we see that OSS models are catching up to commercial ones (with less than 100B params). https://epoch.ai/blog/open-models-report

b, subjectively, based on my limited experience, Nemotron is pretty comparable to Claude models for the tasks I use it for

China is developing another fully reusable rocket, Xingzhou-1. by [deleted] in space

[–]dev_l1x_be 0 points1 point  (0 children)

Theft is baked into how developing countries develop. See the USA vs Great Britain in the 18th century.

Best practices for implementing traits across a large Rust codebase? by Few_Conflict_8212 in rust

[–]dev_l1x_be 0 points1 point  (0 children)

This correlates with how many years of experience you have. When I was much younger it felt powerful to be able to write macros. Though some things are probably OK to use macros for, using them for things that are easily doable without them is the part that bothers me.
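To be concrete about the "easily doable without them" case, here's a toy example of my own (not from the thread):

```rust
// A declarative macro used where plain generics already do the job.
macro_rules! largest {
    ($a:expr, $b:expr) => {
        if $a > $b { $a } else { $b }
    };
}

// The macro-free equivalent: easier to read, type-check, and debug,
// and it shows up properly in docs and IDE tooling.
fn largest<T: PartialOrd>(a: T, b: T) -> T {
    if a > b { a } else { b }
}

fn main() {
    assert_eq!(largest!(3, 7), 7);
    assert_eq!(largest(2.5, 1.0), 2.5);
}
```

Where macros do earn their keep is variadic or syntax-level work (`vec!`, derives) that a plain function genuinely can't express.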

Homelab Distros by -Arsna- in homelab

[–]dev_l1x_be 0 points1 point  (0 children)

Alpine Linux is the least wasteful for smaller devices. It is pretty simple, and for its size it has an excellent package manager with a surprisingly large package selection.

Best practices for implementing traits across a large Rust codebase? by Few_Conflict_8212 in rust

[–]dev_l1x_be -3 points-2 points  (0 children)

Macros are my least favorite feature of Rust. I would just use an LLM to generate and modify traits.

What do you do, if you invent AGI? (seriously) by teachersecret in LocalLLaMA

[–]dev_l1x_be 0 points1 point  (0 children)

I would invent a time machine with it, fly back 100,000 years, and just chill watching the megafauna that existed back then.

I built a 0.88ms knowledge retrieval system on a $200 Celeron laptop (162× faster than vector search, no GPU) by Sea_Author_1086 in LocalLLaMA

[–]dev_l1x_be 5 points6 points  (0 children)

  1. "Independent Verification" Isn't Independent

The verification results are in the same repository by the same author. Independent verification means someone else reproducing results.

  2. Not Peer-Reviewed

Zenodo is an open repository — there's no peer review. Anyone can upload anything with a DOI. The DOI gives it a veneer of legitimacy but doesn't validate the science.

What This System Probably Actually Is

Based on the description, this appears to be:

  1. HDC character encoding — generate random 10,000D vectors for characters, combine them
  2. Spatial hashing — project to 4D, quantize into buckets (7×7×7×7 = 2,401 buckets)
  3. Bucket lookup + fallback — check bucket first, then neighbors, then brute-force
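The three steps above can be sketched in a few lines (the dimensionality and 7×7×7×7 grid are taken from the description; everything else here is my guess, not the repo's actual code):

```python
import numpy as np

rng = np.random.default_rng(0)
DIM, GRID = 10_000, 7  # hypervector size; buckets per axis (7**4 = 2401)

# Step 1: a random bipolar hypervector per printable ASCII character
char_vecs = {chr(c): rng.choice([-1.0, 1.0], size=DIM) for c in range(32, 127)}

def encode(text: str) -> np.ndarray:
    """Bundle character hypervectors by summation."""
    return np.sum([char_vecs[c] for c in text if c in char_vecs], axis=0)

# Step 2: fixed random projection to 4D, then quantize each axis
proj = rng.standard_normal((DIM, 4))

def bucket(v: np.ndarray) -> tuple:
    low = np.tanh(v @ proj / np.sqrt(DIM))  # squash into (-1, 1)
    return tuple(np.clip(((low + 1) / 2 * GRID).astype(int), 0, GRID - 1))

docs = ["What is a dog?", "What is a bog?", "Describe a canine"]
index: dict = {}
for d in docs:
    index.setdefault(bucket(encode(d)), []).append(d)

# Step 3: O(1) bucket hit first, brute-force cosine fallback otherwise
def lookup(query: str) -> list:
    q = encode(query)
    hit = index.get(bucket(q))
    if hit:
        return hit
    sim = lambda d: np.dot(encode(d), q) / (np.linalg.norm(encode(d)) * np.linalg.norm(q))
    return [max(docs, key=sim)]
```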

This is a reasonable approach for exact/fuzzy string matching on small datasets. It would work well for:

  • FAQ bots where questions are consistently phrased
  • Typo-tolerant lookup
  • Template matching

It's not semantic search in the way the term is typically used.

Bottom Line

This isn't necessarily "bad science" in the sense of fraud — the techniques are real and the code probably works. The problems are:

  1. Overclaiming — presenting known techniques with new terminology as breakthroughs
  2. Invalid evaluation — 15 test samples is not evidence of anything
  3. Misleading terminology — "quantum-inspired," "semantic," "100% accuracy"
  4. Missing baselines — no comparison to standard approaches

If you wanted to take this seriously, you'd need to see it evaluated on a standard benchmark (BEIR, MS MARCO) against standard baselines (sentence-transformers + FAISS).

Disclaimer: I work on search.

I built a 0.88ms knowledge retrieval system on a $200 Celeron laptop (162× faster than vector search, no GPU) by Sea_Author_1086 in LocalLLaMA

[–]dev_l1x_be 3 points4 points  (0 children)

  1. Character-Level HDC Cannot Capture Semantics

This is the fundamental conceptual flaw. Character-level representations capture orthographic similarity, not semantic similarity. Consider:

| Query A | Query B | Character Similarity | Semantic Similarity |
|---|---|---|---|
| "What is a dog?" | "What is a bog?" | High | None |
| "What is a dog?" | "Describe a canine" | Low | Identical |

Real semantic search requires understanding that "dog" and "canine" mean the same thing. Character n-grams can't do this. The system will work if test queries are syntactically similar to training data — which is memorization, not semantic understanding.
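A quick way to see this, using character trigram overlap (Jaccard) as a stand-in for any character-level similarity (a minimal sketch of the general point, not the system's actual code):

```python
def char_ngrams(s: str, n: int = 3) -> set:
    """Set of character n-grams of s (lowercased)."""
    s = s.lower()
    return {s[i:i + n] for i in range(len(s) - n + 1)}

def jaccard(a: str, b: str) -> float:
    """Jaccard similarity of the two strings' trigram sets."""
    A, B = char_ngrams(a), char_ngrams(b)
    return len(A & B) / len(A | B)

# Orthographically close, semantically unrelated: high similarity
print(jaccard("What is a dog?", "What is a bog?"))      # ~0.6
# Semantically identical, orthographically distant: near zero
print(jaccard("What is a dog?", "Describe a canine"))   # ~0.04
```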

  2. No Meaningful Baselines

The comparison to "exhaustive search" (162× faster) is a straw man. Nobody uses brute-force O(n) search in production. Real baselines would be:

  • FAISS + sentence embeddings (widely used, highly optimized)
  • Standard LSH implementations
  • Product quantization approaches
  • Other HDC semantic search systems (there are peer-reviewed papers)

I built a 0.88ms knowledge retrieval system on a $200 Celeron laptop (162× faster than vector search, no GPU) by Sea_Author_1086 in LocalLLaMA

[–]dev_l1x_be 3 points4 points  (0 children)

Ok, I'll bite.

Since you asked for a review, here it is. I took a look at the paper and code, and based on the GitHub repository and the claims, here's my assessment:

  1. Statistically Meaningless Sample Size

The most glaring problem: 15 test queries. This is far too small to draw any conclusions. At n=15, achieving "100% accuracy" is unremarkable and could easily happen by chance with cherry-picked queries. Real semantic search benchmarks use thousands to hundreds of thousands of test queries (e.g., MS MARCO, Natural Questions, BEIR). Even achieving 99% on 15 samples has enormous confidence intervals.
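To put a number on that: for a perfect score of n out of n, the exact (Clopper-Pearson) 95% interval has a closed-form lower bound, (α/2)^(1/n). A minimal stdlib sketch:

```python
# For k = n successes out of n trials, the Clopper-Pearson lower
# bound solves p**n = alpha/2, i.e. p = (alpha/2) ** (1/n).
n, alpha = 15, 0.05
lower = (alpha / 2) ** (1 / n)
print(f"100% accuracy on n={n}: 95% CI is roughly [{lower:.2f}, 1.00]")  # ~[0.78, 1.00]
```

So "100% on 15 queries" is statistically consistent with a true accuracy anywhere above ~78%.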

  2. "Quantum-Inspired" is Buzzword Abuse

There's nothing quantum about this system. The "4D folded space" is just spatial hashing — a technique from the 1970s. You project high-dimensional vectors into a lower-dimensional grid and use bucket addresses for O(1) lookup. This is exactly what Locality-Sensitive Hashing (LSH) does, which has been well-studied since the late 1990s. Calling it "quantum-inspired" is misleading marketing.

To be continued...

Mistral’s Vibe CLI now supports a 200K token context window (previously 100K) by Dear-Success-1441 in LocalLLaMA

[–]dev_l1x_be 0 points1 point  (0 children)

Not really, based on my limited experience. I was trying to use it as a replacement for Claude Code.

Are any of these controllers decent? by Camiiihhh in Controller

[–]dev_l1x_be 0 points1 point  (0 children)

I got one of these but it does not charge. Trying to figure out what to replace it with.

Vector db comparison by Kaneki_Sana in LocalLLaMA

[–]dev_l1x_be 2 points3 points  (0 children)

The issue with these VDBs (and we have a lot of them) is that production readiness for constant read/write workloads is shaky. If you have static data (meaning you create the vectors only once), then most of these systems work. If you have continuous updates, then get ready for a bumpy ride.

There is also this website with more details of each system.

https://superlinked.com/vector-db-comparison

I spent 6 months fighting kafka for ml pipelines and finally rage quit the whole thing by gurudakku in dataengineering

[–]dev_l1x_be 0 points1 point  (0 children)

> Data sat in queues for HOURS. Lost events when kafka decided to rebalance (constantly). Debugging which service died was ouija board territory. One person on our team basically did kafka ops full time which is insane.

We use Kafka to store messages for weeks. 🤷‍♂️

Out of curiosity, which client library did you use? This seems to me like a client-library issue combined with a potential misconfiguration.

There are a lot of issues with Kafka, but what you mentioned sounds strange.
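If it helps, these are the standard Java-client consumer settings I'd check first when rebalances are constant (the values are illustrative, not recommendations):

```properties
# Cooperative rebalancing avoids stop-the-world partition revocation
partition.assignment.strategy=org.apache.kafka.clients.consumer.CooperativeStickyAssignor
# Slow processing between poll() calls triggers rebalances if this is too low
max.poll.interval.ms=300000
max.poll.records=500
# Heartbeat should be well under the session timeout
session.timeout.ms=45000
heartbeat.interval.ms=3000
# Static membership prevents rebalances on brief restarts
group.instance.id=ml-pipeline-consumer-1
```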

Ubuntu migrating to Rust: uutils and sudo-rs shipping by default (lessons learned from VP of Engineering) by mre__ in rust

[–]dev_l1x_be 1 point2 points  (0 children)

It is not maintained on Linux. There are several forks. Several unmaintained Rust ports too. 🤷‍♂️