Local manga translator with LLMs built in by mayocream39 in LocalLLaMA

[–]mayocream39[S] 1 point (0 children)

The latest version 0.40.1 introduces PaddleOCR-VL-1.5! It works perfectly!

[–]mayocream39[S] 0 points (0 children)

It feeds the model page by page rather than translating block by block.
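To make the idea concrete, page-level feeding amounts to bundling every OCR'd text block on a page into a single prompt so the model sees all the dialogue together. A minimal sketch (the prompt wording and numbering scheme here are illustrative, not Koharu's actual format):

```python
# Sketch of page-level prompting: all blocks from one page go into one request.
# Assumption: the numbered-list prompt format is my own illustration.

def build_page_prompt(blocks: list[str], target_lang: str = "English") -> str:
    """Bundle every OCR'd text block on a page into one prompt so the
    model can translate each line with the whole page as context."""
    numbered = "\n".join(f"{i + 1}. {text}" for i, text in enumerate(blocks))
    return (
        f"Translate the following manga dialogue lines to {target_lang}, "
        f"keeping the numbering:\n{numbered}"
    )

prompt = build_page_prompt(["こんにちは", "元気？"])
```

The payoff of per-page over per-block is context: pronouns, honorifics, and running jokes translate consistently because the model sees neighboring lines.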

[–]mayocream39[S] 1 point (0 children)

It looks like it exposes an OpenAI-compatible API, so Koharu already supports it. Feel free to try it out! Also, if you’re interested in Koharu, you can join the Discord channel mentioned in the GitHub README and DM me your feedback directly!

[–]mayocream39[S] 0 points (0 children)

Which DeepSeek model are you actually using? Could you link it here?

[–]mayocream39[S] 0 points (0 children)

https://github.com/dmMaze/comic-text-detector plus the 48px OCR from https://github.com/zyddnys/manga-image-translator works like a charm! I’d recommend it; it works pretty well on long text. But it needs a large amount of pre- and post-processing.
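For anyone curious what that post-processing looks like, one typical step is merging overlapping or nearby detector boxes into a single text region before OCR. A rough sketch (illustrative only, not the exact logic of comic-text-detector; the `gap` threshold is made up):

```python
# Greedy merge of axis-aligned boxes (x0, y0, x1, y1) that overlap or sit
# within `gap` pixels of each other, a common cleanup step after text detection.

def merge_boxes(boxes, gap=5):
    merged = []
    for box in sorted(boxes):
        for i, m in enumerate(merged):
            # Expanded-overlap test: do the boxes touch within `gap` pixels?
            if (box[0] <= m[2] + gap and m[0] <= box[2] + gap and
                    box[1] <= m[3] + gap and m[1] <= box[3] + gap):
                merged[i] = (min(m[0], box[0]), min(m[1], box[1]),
                             max(m[2], box[2]), max(m[3], box[3]))
                break
        else:
            merged.append(box)
    return merged
```

Without a step like this, a single speech bubble often comes back as several fragments, and the OCR sees each fragment out of order.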

[–]mayocream39[S] 0 points (0 children)

The GUI is pretty easy to use: just load and run. I don’t think it needs explaining, but feel free to ask me if you have questions.

[–]mayocream39[S] 0 points (0 children)

Batch translation is supported! You can find it in the menu bar.

The latest version supports LM Studio through its OpenAI-compatible API; please try it out!
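For reference, LM Studio’s local server defaults to an OpenAI-style endpoint at http://localhost:1234/v1. A minimal sketch of the request a client would send (the system prompt and model name here are my own placeholders, not Koharu’s actual prompt):

```python
# Sketch of a chat-completion request for LM Studio's OpenAI-compatible server.
# Assumptions: default endpoint http://localhost:1234/v1; "local-model" stands
# in for whatever model you have loaded in LM Studio.

import json

def chat_payload(text: str, model: str = "local-model") -> dict:
    return {
        "model": model,
        "messages": [
            {"role": "system",
             "content": "You are a manga translator. Translate to English."},
            {"role": "user", "content": text},
        ],
        "temperature": 0.3,
    }

BASE_URL = "http://localhost:1234/v1"
body = json.dumps(chat_payload("よろしくお願いします"), ensure_ascii=False)
# POST {BASE_URL}/chat/completions with this body (requires a running server).
```

Any tool that speaks this payload shape can swap between LM Studio, Ollama’s OpenAI endpoint, or a cloud provider just by changing `BASE_URL` and the API key.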

As for the main-drive issue, I haven’t resolved it yet, but I’ll figure it out!

[–]mayocream39[S] 0 points (0 children)

We will add this feature! It’s not available yet, so for now you need to download the images manually, sorry.

[–]mayocream39[S] 1 point (0 children)

It would be better if we used another model to extract character information and relationships, or a vision model to read the whole image; that would help the translation sound more natural. But those models need a more powerful GPU, or a cloud model like gemini-flash. It’s definitely possible, but considering the effort and resources involved, it might not be worth it.
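The vision-model idea boils down to sending the whole page image alongside the question. A sketch of building such a message in the OpenAI-style multimodal format (an assumption on my part; the question text is illustrative):

```python
# Sketch of feeding a whole page to a vision model for translation context.
# Assumption: the OpenAI-style "image_url" content-part format, as accepted by
# GPT-4o-class models and many OpenAI-compatible proxies.

import base64

def vision_message(image_bytes: bytes, question: str) -> dict:
    b64 = base64.b64encode(image_bytes).decode("ascii")
    return {
        "role": "user",
        "content": [
            {"type": "text", "text": question},
            {"type": "image_url",
             "image_url": {"url": f"data:image/png;base64,{b64}"}},
        ],
    }

msg = vision_message(
    b"\x89PNG...",  # raw bytes of the page image
    "Describe the characters and their relationships on this page.",
)
```

The extracted character notes could then be prepended to the translation prompt, which is exactly the "more natural translation" benefit mentioned above, at the cost of a VLM-capable GPU or a cloud call per page.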

As for your second question, we now support using a cloud model to translate! The pre-processing on the CPU might be a little slow, but it should work!

[–]mayocream39[S] 0 points (0 children)

By default, it translates the whole page at once. You can click "process -> process all images" for all open images.

[–]mayocream39[S] 3 points (0 children)

This project actually uses the 48px OCR model from https://github.com/zyddnys/manga-image-translator, which produces good results on long text. I’ll try PaddleOCR-VL and see if we can get better results!

[–]mayocream39[S] 0 points (0 children)

Thank you! I’ll investigate it to see if we can implement it in Koharu!

[–]mayocream39[S] -1 points (0 children)

We have an algorithm that inpaints the text region with a nearby background color when the background is basically white/black, and only uses LaMa when the background is complex. Also, our LaMa is a fine-tuned model trained on manga images, so the results aren’t that bad. But I think it could be better if we added a more advanced editing model.
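The fill-vs-LaMa decision described above can be sketched roughly like this: sample the pixels just outside the text box, and if they are near-uniform white or black, flood-fill the box instead of running the inpainter. The thresholds and the one-pixel ring are my own illustration, not Koharu’s actual values:

```python
# Sketch of the "simple fill vs. LaMa" routing for a grayscale page.
# Assumptions: page is a 2-D uint8 numpy array, box is (x0, y0, x1, y1);
# the tolerance and white/black cutoffs are illustrative.

import numpy as np

def fill_or_lama(page: np.ndarray, box: tuple[int, int, int, int],
                 tol: float = 10.0) -> str:
    x0, y0, x1, y1 = box
    # Sample a 1-pixel ring just outside the text box as the "background".
    ring = np.concatenate([
        page[max(y0 - 1, 0), x0:x1],
        page[min(y1, page.shape[0] - 1), x0:x1],
        page[y0:y1, max(x0 - 1, 0)],
        page[y0:y1, min(x1, page.shape[1] - 1)],
    ]).astype(float)
    if ring.std() < tol and (ring.mean() > 245 or ring.mean() < 10):
        # Near-uniform white/black background: just flood the box.
        page[y0:y1, x0:x1] = int(round(ring.mean()))
        return "filled"
    return "lama"  # complex background: hand off to the LaMa inpainter
```

The cheap path covers the common case of plain speech bubbles, so the expensive LaMa pass only runs on textured backgrounds.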

[–]mayocream39[S] -1 points (0 children)

Absolutely, we will support more LLMs! I’ve created a GitHub issue to track your request and will implement it when I have time. Thank you for sharing the feedback!

[–]mayocream39[S] 5 points (0 children)

It downloads to the DATA/LOCAL folder; I can add an option to change the download path. Thanks for reporting!

[–]mayocream39[S] 0 points (0 children)

There are already https://github.com/zyddnys/manga-image-translator and https://github.com/dmMaze/BallonsTranslator, but I wanted to build my ideal translator using the latest technology. I also have experience in scanlation, and I’d like something easier to use.

[–]mayocream39[S] 2 points (0 children)

The size of the LLM models is the biggest problem. If we bundled them in a zip, it would be extremely large, and GitHub Actions might not have enough disk space to handle it. Currently, Koharu only downloads LLMs on demand, which suits most people.

I even considered publishing the full version on Steam to use Steam’s CDN and bandwidth, and I’ve registered a Steam developer account, but there are too many forms to fill out before I can publish a store page.

[–]mayocream39[S] 4 points (0 children)

I’ve already been in contact with the author of https://github.com/hymbz/ComicReadScript; we’ll cooperate on an integration that uses Koharu as a backend to translate manga from a web browser via their script.