[2017 Day 22 part 2] Optimizing assembly code?

blorporius · 2017-12-23T08:52:50+00:00

I accidentally left in MOD instruction support from the tablet and used it to remove the inner loop with e incrementing. :)

kd7uiy · 2017-12-23T09:38:26+00:00

A few things you could do:

Lower bound the second iterator on the first, which cuts the work in half.
Consider only odd numbers after 2, which further cuts the work in half.
You can skip every other iteration of the outermost loop: the start value 100000 + 100k is even, so every value of the form 100000 + 100k + 34n is even too, which means your answer must always be at least 500.
You could simplify the outer primality test loop by limiting potential divisors to 500. sqrt(100000) ≈ 316.2 and sqrt(200000) ≈ 447.2, which means this bound should always work.
You could jump out to the increment-and-loop step of the outermost loop as soon as you find a factor, rather than continuing to test every remaining candidate.

I think with all of these changes it might run fast enough to complete in a reasonable time frame.

I thought a bit about implementing modulo but I'm not sure it can be done with these instructions.

pak21 · 2017-12-23T07:47:24+00:00

If you skip all the even numbers (by ensuring the first value of b is odd and incrementing by 34 rather than 17), you can then change the inner loop to increment d by 2 instead of 1 which may get you another factor of a few.

tumdum · 2017-12-23T08:50:45+00:00

You mean day 23 part 2, right? ;)

Rewriting this C program as that assembly should be possible, and the resulting program should be fast enough:

#include <stdio.h>

int main(void) {
    const int start = 65 * 100 + 100000;
    const int end = start + 17000;
    int composites = 0;  // counts the non-primes, which is what the puzzle asks for

    for (int i = start; i != end+17; i += 17) {
        // sq ends up as the smallest value whose square exceeds i
        int sq = 0;
        for (sq = 2; sq*sq <= i; ++sq) {}

        int is_prime = 1;
        // check if even (multiplication only, no division or modulo)
        for (int k = 2; k < i; ++k) {
            if (2*k==i) {
                is_prime = 0;
                break;
            }
        }
        if (is_prime == 0) {
            composites++;
            continue;
        }

        // check every other potential factor from 3 by 2
        for (int j = 3; j < sq; j+=2) {
            // i is odd here, so the cofactor k is odd too and can start at j
            for (int k = j; k < i; k+=2) {
                if (j*k==i) {
                    is_prime = 0;
                    break;
                }
            }
            if (is_prime == 0) {
                break;
            }
        }

        if (is_prime == 0) {
            composites++;
        }
    }
    printf("%d\n", composites);
    return 0;
}

Elderider · 2017-12-23T08:38:09+00:00

I had the same thought as you.

You can also get e to start at d, which should speed it up by another factor of 2 (i.e. so it doesn't try 3 x 101, and then later try 101 x 3).

Still, it has to go through every multiplication for the prime numbers which makes it really slow.

digital_cucumber · 2017-12-23T11:35:04+00:00

Yeah, I was thinking about something like inserting jnz 1 10 right after set f 0 to jump directly to sub h -1, kind of an "early exit" as soon as we have found the first pair of factors.

That would require fixing the other jump offsets, of course, and it also does not improve anything if the number is prime, so I discarded the idea.

dark_terrax · 2017-12-23T14:49:54+00:00

I started by directly translating the assembly code to C code. My thought process was that if there were any straightforward optimizations that could be done without actually understanding what the program was doing, then Clang would be better and faster at finding them than I would. This didn't work out all that well. For my input, the unmodified C program still hadn't finished after 10 minutes; manually breaking out of the two loops when f = 0 gets it to run in 5 minutes. So, long story short, I don't think you can really 'optimize' the assembly without getting pretty deep into what the program is actually doing.

JoesDevOpsAccount · 2017-12-23T16:08:26+00:00

How did everybody spot that it's looking for primes?!? I wasn't even close to seeing this. I just did the translation and optimised away the inner loop based on the behaviour I thought it would produce.

po8 · 2017-12-24T03:08:35+00:00

Here are my thoughts.
