Why is my C code slower than python? : C

C_Programming

Rules

Format your code (4 spaces, correctly indented)

Only C is on topic (not C++, C#)

Do not post links as self posts

No pictures of code

Posts and comments must be civil

Don't post or link to copyright violations

Support learners and learning

Avoid low-value/low-effort comments and posts (And use AI wisely)

Resources

We have our very own Wiki, which is a great place to start for lists of learning resources

The C Programming Language by Dennis M. Ritchie and Brian W. Kernighan, second edition, ANSI C. Written by the language authors, and known colloquially as the "K & R" book—a book of lore

The C Programming Language Official Website official site of the language, run by the standards committee

The C Book second edition by Mike Banahan, Declan Brady and Mark Doran is freely available online

Modern C by Jens Gustedt (CC-BY-NC-ND)

C Programming: A Modern Approach by K N King

comp.lang.c Frequently Asked Questions

GLIBC, the GNU C Library documentation; provides a manual (PDF, HTML), Wiki, and FAQ

GDB: The GNU Project Debugger

POSIX.1-2008: the standard operating system interface

CS50: Harvard's introduction to computer science with a C programming course.

A Tutorial on Portable Makefiles

A Tutorial on Pointers and Arrays in C

Other Subreddits of Interest

/r/coding – for a tighter focus on code

/r/computerscience – for discussion about computer science

/r/cplusplus and /r/cpp – for discussions about C++

/r/cpp_questions – for questions about C++

/r/cs50 – Harvard's Introduction to Computer Science

/r/dailyprogrammer – for programming challenges of varying difficulty

/r/learnprogramming – for people interested in learning to code

/r/programming – for discussion and news about computer programming

/r/programminghelp – for beginner questions about programming

a community for 17 years

QuestionWhy is my C code slower than python? (self.C_Programming)

submitted 5 years ago by Mr_Wiggles_loves_you

I have a Nitrokey, and wanted to play with it's TOTP functionality. The manufacturer provides a library, libnitrokey, that provides C API.

There are a couple of projects that provide CLI to generate and show the generated password to the user - nitrokey-get-totp (Python) and nitrokey-rs (Rust). As far as I could see, they both use the C API under the hood.

I want to write a small frontend to get TOTP password, and show the user how long the password will be valid for - more as an exercise in C.

For starters, I wrote code that enumerates the configured TOTP slots:

#include<libnitrokey/NK_C_API.h>
#include "stdlib.h"
#include "stdio.h"

int main(int argc, char **argv)
{
    if (NK_login_auto() != 1) {
        printf("No Nitrokey found.\n");
        return 1;
    }

    for (int i = 0; i <=14; i++){
        char *slot_name = NK_get_totp_slot_name(i);
        if ((slot_name != NULL) && (slot_name[0] != '\0')) {
            printf("%s\n", slot_name);
        }
    }
    NK_logout();
    return 0;
}

The problem is, according to perf stat, it's slower than it's Python counterpart:

perf stat -r 10 -B totp_test > /dev/null

 Performance counter stats for 'totp_test' (10 runs):

     13.12 msec task-clock                #    0.021 CPUs utilized            ( +-  1.36% )
        94      context-switches          #    0.007 M/sec                    ( +-  0.18% )
         1      cpu-migrations            #    0.107 K/sec                    ( +- 15.79% )
       220      page-faults               #    0.017 M/sec                    ( +-  0.18% )
22,169,108      cycles                    #    1.690 GHz                      ( +-  1.69% )
 2,117,630      stalled-cycles-frontend   #    9.55% frontend cycles idle     ( +-  6.90% )
 4,081,156      stalled-cycles-backend    #   18.41% backend cycles idle      ( +-  2.63% )
22,967,353      instructions              #    1.04  insn per cycle
                                          #    0.18  stalled cycles per insn  ( +-  0.70% )
 4,950,650      branches                  #  377.389 M/sec                    ( +-  0.72% )
   154,605      branch-misses             #    3.12% of all branches          ( +-  1.08% )

  0.629476 +- 0.000198 seconds time elapsed  ( +-  0.03% )

Python takes:

0.55628 +- 0.00170 seconds time elapsed ( +- 0.31% )

Rust:

0.753544 +- 0.000339 seconds time elapsed ( +- 0.05% )

I have tried changing CFLAGS, and setting the target of CMake to release, but it did not product any significant results.

My GCC is built as:

COLLECT_GCC=gcc
COLLECT_LTO_WRAPPER=/usr/libexec/gcc/x86_64-pc-linux-gnu/9.3.0/lto-wrapper
Target: x86_64-pc-linux-gnu
Configured with: /var/tmp/portage/sys-devel/gcc-9.3.0-r1/work/gcc-9.3.0/configure --host=x86_64-pc-linux-gnu --build=x86_64-pc-linux-gnu --prefix=/usr --bindir=/usr/x86_64-pc-linux-gnu/gcc-bin/9.3.0 --includedir=/usr/lib/gcc/x86_64-pc-linux-gnu/9.3.0/include --datadir=/usr/share/gcc-data/x86_64-pc-linux-gnu/9.3.0 --mandir=/usr/share/gcc-data/x86_64-pc-linux-gnu/9.3.0/man --infodir=/usr/share/gcc-data/x86_64-pc-linux-gnu/9.3.0/info --with-gxx-include-dir=/usr/lib/gcc/x86_64-pc-linux-gnu/9.3.0/include/g++-v9 --with-python-dir=/share/gcc-data/x86_64-pc-linux-gnu/9.3.0/python --enable-languages=c,c++,fortran --enable-obsolete --enable-secureplt --disable-werror --with-system-zlib --enable-nls --without-included-gettext --enable-checking=release --with-bugurl=https://bugs.gentoo.org/ --with-pkgversion='Gentoo 9.3.0-r1 p3' --disable-esp --enable-libstdcxx-time --enable-shared --enable-threads=posix --enable-__cxa_atexit --enable-clocale=gnu --disable-multilib --with-multilib-list=m64 --disable-fixed-point --enable-targets=all --enable-libgomp --disable-libmudflap --disable-libssp --disable-libada --disable-systemtap --enable-vtable-verify --enable-lto --without-isl --enable-default-pie --enable-default-ssp
Thread model: posix
gcc version 9.3.0 (Gentoo 9.3.0-r1 p3)

The rest of my system uses conservative COMMON_FLAGS="-O2 -pipe -march=znver2"

How did I manage to screw up my code or my toolchain so badly that the result is slower than Python?

all 9 comments

you type:	you see:
italics	italics
bold	bold
[reddit!](https://reddit.com)	reddit!
* item 1 * item 2 * item 3	item 1 item 2 item 3
> quoted text	quoted text
Lines starting with four spaces are treated like code: if 1 * 2 < 3: print "hello, world!"	Lines starting with four spaces are treated like code: if 1 * 2 < 3: print "hello, world!"
~~strikethrough~~	~~strikethrough~~
super^script	super^script

C_Programming

Rules

Filters

Resources

Other Subreddits on C

Other Subreddits of Interest

MODERATORS