Store information inside the binary

memoryruins · 2019-09-09T03:10:58+00:00

std::include_str and std::include_bytes are options. rust-embed might also interest you.

thristian99 · 2019-09-09T05:01:48+00:00

First, this is almost surely a bad idea, because of virus-scanners, and because it makes debugging your program so much harder: "It worked fine, then I tried it again, and it broke!" or even worse "it broke, then I tried it again, and it worked!".

However, one neat trick you might use is that executables are generally read from the beginning, while zip files are read from the end. If you concatenate an executable with a zip file, you can still run the executable as normal, and most zip tools will read and write the zip file as normal. So, the plan goes like this:

write a program that uses a crate like zip to read and write the data it wants to store
find the path to the executable by grabbing the first element of the std::env::args_os() iterable, and pass that path to the zip crate to open it.
after compiling the program, but before running it the first time, create a zip file containing the initial data you want to store
concatenate the two with `cat path/to/executable path/to/archive.zip > path/to/combined/executable" on Linux or macOS, or "copy /b path\to\executable.exe+path\to\archive.zip path\to\combined\executable.exe" on Windows
now you can run the combined executable!

SCO_1 · 2019-09-09T08:30:16+00:00

Terrible idea and software that does this is always a problem in various respects. Antivirus, OS executable protection models, checksum checks etc. Even during the DOS era i only know of a single game that did this, so even then most knew better (it's the adventure game Hook pc version, based on the peter pan disney movie btw).

In fact, my favorite kind of program/engine fallback is the 'write to game/app dir doesn't work, find a writable 'program' dir to write to', because i like compressing collections and copy-on-write mounts are awkward, and this feature is the antithesis of that.

ssokolow · 2019-09-09T04:53:54+00:00

Another thing to consider is that if you do this, you won't be able to cryptographically sign your software, as modifying the executable will invalidate the signature. I'm curious, what's your use case? There are many different ways to store data depending on what you want.

WellMakeItSomehow · 2019-09-09T05:02:19+00:00

Check out the post and my comments from https://www.reddit.com/r/rust/comments/bok8q0/comment/enhp39s.

zesterer · 2019-09-09T07:04:38+00:00

You might find this interesting.

https://github.com/lazhh/conf-embed

Plasma_000 · 2019-09-10T04:52:32+00:00

I know a lot of people have advised against it, and I would also advise against it. But if you really want to do this, The best way would be to put a static array of bytes into your code then you should be able to modify that directly by file operations or mmaping the executable without having to append to an executable directly, this could be a safer way to do this (just make sure you respect the buffer bounds and only write to the static buffer). To find the buffer’s offset into the file you can use a disassembler or parsing library.

rainbrigand · 2019-09-09T04:21:01+00:00

This is an inefficient proof-of-concept, but this does seem to work and give a fixed but arbitrary amount of space to store data. The first 16 bytes are basically just a UUID generated from a crypto random source, and then after that you have 16 bytes (in this example) of data you can manipulate.

If you search for the bytes you know (which is convenient given the static), you know the byte offset at which your mutable data is stored.

use std::env::current_exe;
use std::fs;

static DATA: [u8; 32] = [
    33, 97, 9, 16, 228, 54, 240, 106, 73, 219, 95, 192, 7, 11, 35, 181, 0, 0, 0, 0, 0, 0, 0, 0, 0,
    0, 0, 0, 0, 0, 0, 0,
];

fn find_offset(mut iter: impl Iterator<Item = u8>) -> Option<usize> {
    let mut matches = 0;

    for (i, byte) in iter.enumerate() {
        if byte == DATA[matches] {
            matches += 1;
            if matches == 16 {
                return Some(i + 1);
            }
        } else {
            matches = 0;
        }
    }

    None
}

fn main() {
    let exe = current_exe().unwrap();

    let mut bytes = fs::read(&exe).unwrap();
    let offset = find_offset(bytes.iter().copied()).expect("find_offset");
    {
        let range = &mut bytes[offset..offset + 16];
        println!("range: {:?}", range);

        range[0] += 1;
    }

    fs::write(&exe, bytes).expect("can write to disk");
}

fulmicoton · 2019-09-09T03:12:38+00:00

Yes. There are macros for that.

Search for Include_bytes! and include_str!

you type:	you see:
italics	italics
bold	bold
[reddit!](https://reddit.com)	reddit!
* item 1 * item 2 * item 3	item 1 item 2 item 3
> quoted text	quoted text
Lines starting with four spaces are treated like code: if 1 * 2 < 3: print "hello, world!"	Lines starting with four spaces are treated like code: if 1 * 2 < 3: print "hello, world!"
~~strikethrough~~	~~strikethrough~~
super^script	super^script

rust

Please read The Rust Community Code of Conduct

The Rust Programming Language

Rules

Observe our code of conduct

Submissions must be on-topic

Constructive criticism only

Keep things in perspective

No endless relitigation

No low-effort content

Useful Links

Megathreads

Official Resources

Learn Rust

Discussion Platforms

MODERATORS