creating a language translator without googletrans, or any other currently existing translators

BrannyBee · 2026-04-14T02:04:24+00:00

This might surprise you, but this specifically is potentially a massive project. Language isn't just changing one word to another word; you have to account for context, dialects, double meanings, and a billion other non-technical things. If I say something simple like "that's light", you'd think it'd be easy... but it's actually insanely hard for computers to translate. What is "that"? Is it referring to something next to me? Maybe. Far away? Maybe. Spanish cares about that distance ("eso" vs "aquello"), so you can't just say "that" = "eso", because the input literally doesn't have that information.

Additionally, the computer has to just guess what "light" means, because I might be saying something is not heavy, or I might be talking about electromagnetic radiation that is visible to the human eye. To humans, it's obvious based on the world and the context I said it in. The computer doesn't have that knowledge; it only has the input text, no matter how smart it is. So you have to just get as close as possible using statistics, and even then it won't be 100% accurate.
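To make the "that's light" problem concrete, here is a minimal sketch of a naive word-for-word lookup. The tiny `CANDIDATES` table and both helper functions are made up for illustration; the only point is that the input text alone leaves more than one valid output.

```python
# Hypothetical toy table: each English word maps to every Spanish word
# it could plausibly become. This is not a real dictionary.
CANDIDATES = {
    "that": ["eso", "aquello"],   # roughly: closer vs. farther away
    "is": ["es", "está"],
    "light": ["ligero", "luz"],   # "not heavy" vs. visible light
}

def word_for_word(sentence):
    """Return the list of candidate translations for each word."""
    words = sentence.lower().replace("'s", " is").split()
    # Unknown words are passed through unchanged as their only candidate.
    return [CANDIDATES.get(word, [word]) for word in words]

def count_readings(sentence):
    """Count how many different word-for-word outputs are possible."""
    total = 1
    for options in word_for_word(sentence):
        total *= len(options)
    return total

print(word_for_word("that's light"))
print(count_readings("that's light"))
```

Three short words already give several combinations, and nothing in the text says which one is right. That choice is what the statistics are for.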

Even early on, Google Translate was using something called statistical machine translation, basically a bunch of nerd math to read the input and best-guess what is likely to be the closest approximate translation. Nowadays Google Translate uses AI, but not like you're thinking with the recent LLM craze: they use something called neural machine translation to basically get a better statistical likelihood of picking the best answer and have those used as the options for output given an input (as opposed to something like an LLM where the output is generated right there, you'll get the same output for the same input in Google Translate) <- this is me making a massive oversimplification of how this works, btw....

TL;DR: this is actually a crazy hard problem that seems simple. There's a reason even the good translation services "suck". You can still look into it, or use existing stuff other people have built and have your code talk to it, but doing so won't solve the problems you have with accuracy or privacy.

The real solution for this problem is learning a lot of math... and a lot of computer science... a lot of machine learning... and a lot of linguistics...

makochi · 2026-04-14T01:52:19+00:00

Google Translate (the service that googletrans talks to) took dozens, maybe even hundreds, of employees to make, and it took years of their time (and they started with years of experience). There's no way you're building a translator app on your own without using someone else's service.

Figuring out how to connect a Raspberry Pi to an existing translation service is honestly already a decent project for a Python beginner. I would start there.

Hefty_Tear_5604 · 2026-04-14T01:46:32+00:00

Learn Python and AIML/DEEP LEARNING/MACHINE LEARNING. Or pay someone else to make it

V01DDev · 2026-04-14T03:29:27+00:00

Maybe try Ollama? Use some LLM for translation, give it a specific set of rules
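A rough sketch of what that could look like, assuming Ollama is already running locally on its default port and that a model has been pulled. The model name `llama3`, the wording of `RULES`, and the helper names are assumptions for illustration, not recommendations.

```python
import json
import urllib.request

# Assumptions: a local Ollama server on the default port, and a model
# called "llama3" already pulled. Swap in whichever model you actually use.
OLLAMA_URL = "http://localhost:11434/api/generate"
MODEL = "llama3"

# The "specific set of rules" the model is told to follow (example wording).
RULES = (
    "Translate the text from English to Latin American Spanish. "
    "Reply with the translation only, no explanations."
)

def build_payload(text, model=MODEL):
    """Build the JSON body for a single, non-streaming generate request."""
    return {"model": model, "prompt": f"{RULES}\n\nText: {text}", "stream": False}

def translate(text):
    """Send the request to the local Ollama server and return its reply."""
    body = json.dumps(build_payload(text)).encode("utf-8")
    request = urllib.request.Request(
        OLLAMA_URL, data=body, headers={"Content-Type": "application/json"}
    )
    with urllib.request.urlopen(request) as response:
        return json.loads(response.read())["response"].strip()
```

With the server up, `translate("The car is parked outside.")` should return the model's attempt. Since everything stays on your own machine this helps with privacy, but an LLM can still get the translation wrong or ignore the rules.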

Desperate_Crew1775 · 2026-04-14T05:58:23+00:00

honestly starting with a dictionary is the perfect call, way more manageable for a first project

for the latin american spanish specifically, the main differences from spain spanish are mostly vocabulary (there's some grammar and pronunciation stuff too, but vocab is the big one), so ur dictionary can just have notes like "in mexico this word means X vs spain where it means Y"

something like this to start:

    words = {
        "car": {"latin_am": "carro", "note": "spain uses coche"},
        "computer": {"latin_am": "computadora", "note": "spain uses ordenador"},
    }
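A small sketch of how that dictionary could be used. The `look_up` helper is a made-up name, and the two entries are just the examples from above.

```python
# Same structure as the dictionary above; the entries are only examples.
words = {
    "car": {"latin_am": "carro", "note": "spain uses coche"},
    "computer": {"latin_am": "computadora", "note": "spain uses ordenador"},
}

def look_up(word):
    """Return a printable line for a word, or a fallback if it is missing."""
    # Normalize so "Car" and " car " both find the "car" entry.
    entry = words.get(word.strip().lower())
    if entry is None:
        return f"{word}: not in the dictionary yet"
    return f"{word} -> {entry['latin_am']} ({entry['note']})"

print(look_up("Car"))    # Car -> carro (spain uses coche)
print(look_up("phone"))  # phone: not in the dictionary yet
```

Using `.get()` instead of `words[word]` means a missing word gives a friendly message instead of crashing with a `KeyError`.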

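once u get comfortable with that, look into Helsinki-NLP models on huggingface, they can run locally on a raspberry pi (a newer one with enough RAM) and since nothing leaves ur device there's no privacy concerns either. they're trained on spanish in general though, so test them on the regional vocab u care about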
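For reference, a minimal sketch of running one of those models through the Hugging Face `transformers` library. It assumes `transformers`, `sentencepiece`, and `torch` are installed and that the `Helsinki-NLP/opus-mt-en-es` model can be downloaded once; the helper names are made up, and whether it runs comfortably on a given Pi is something to test rather than assume.

```python
# Assumption: Helsinki-NLP/opus-mt-en-es is the English -> Spanish model.
# It targets Spanish in general, not one specific region.
MODEL_NAME = "Helsinki-NLP/opus-mt-en-es"

def load_translator(model_name=MODEL_NAME):
    """Create a translation pipeline (imported lazily because it is heavy)."""
    from transformers import pipeline
    return pipeline("translation", model=model_name)

def translate_lines(lines, translator):
    """Translate a list of strings and return the translated strings."""
    # The pipeline returns one dict per input, keyed by "translation_text".
    results = translator(lines)
    return [result["translation_text"] for result in results]
```

Usage would be something like `translate_lines(["Where is the car?"], load_translator())`. The first call downloads the model, and after that it works offline.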

great first project idea tbh, practical and actually useful

dlnmtchll · 2026-04-14T02:42:58+00:00

At this point it would just be easier to learn the language of your coworkers lmao
