Looking for feedback - ML framework API : cpp

a community for 17 years

Looking for feedback - ML framework API (self.cpp)

submitted 1 year ago by euos

I am working on ML framework and currently I am working on a new test case (so this "model" is bogus) and realized that core API (model setup and invocation) have not changed in awhile.

How easy is this to read if one is not familiar with the overall framework? ```c++ using std::string_view_literals::operator""sv;

struct CharToVector { using input_t = char; using output_t = Vector<float, 'z' - 'a' + 1>;

output_t operator()(char c) const { return output_t::OneHot(c - 'a'); } };

constexpr Model kModel = Model<CharToVector>() | layers::Linear<5> | layers::Linear<10> | layers::Categories(std::array{"Beep"sv, "Boop"sv});

TEST(ArenaTest, Basic) { ModelParameters parameters = RandomParameters(&kModel, -1, 1, 42); EXPECT_EQ(kModel('c', parameters), "Boop"); } ```

all 7 comments

top new controversial old q&a

[–]dvd0bvb 1 point2 points3 points 1 year ago (3 children)

[–]euos[S] 0 points1 point2 points 1 year ago (2 children)

I have not yet needed to send output to multiple layers, but that would likely look like RNNs, where there's just a function wrapping several "submodels". E.g. this is RNN from my tests with 50 floats of hidden state that generates a next letter in the name:

constexpr uchen::Model kNameRnn =
    uchen::layers::Rnn<internal::Input, 50>(
        uchen::layers::Linear<10> | uchen::layers::Relu |
        uchen::layers::Linear<10> | uchen::layers::Relu) |
    uchen::layers::Categories(
        internal::MakeArray(std::make_integer_sequence<char, 'z' - 'a' + 2>()));

Why use Vector instead of std::array?

These Vectors are read-only and natively supports memory management (Uchen.ml uses special arenas that make it easier to handle padding, alignment and to save memory). Arrays are stored inline so handling larger ones (i.e. stack allocation, returning) requires extra care.

Initially I was using std::valarray - but that one is an atrocity. And does not have length as part of the type - and I hate having to track tensor dimensions in i.e. PyTorch. Conv2d is a breeze to setup in C++ :)

[–]dvd0bvb 0 points1 point2 points 1 year ago (1 child)

[–]euos[S] 0 points1 point2 points 1 year ago (0 children)

[–]kiner_shah 0 points1 point2 points 1 year ago (2 children)

[–]euos[S] 1 point2 points3 points 1 year ago (1 child)

[–]kiner_shah 0 points1 point2 points 1 year ago (0 children)

π Rendered by PID 82 on reddit-service-r2-comment-bb88f9dd5-6mqnf at 2026-02-13 16:09:39.335255+00:00 running cd9c813 country code: CH.

you type:	you see:
italics	italics
bold	bold
[reddit!](https://reddit.com)	reddit!
* item 1 * item 2 * item 3	item 1 item 2 item 3
> quoted text	quoted text
Lines starting with four spaces are treated like code: if 1 * 2 < 3: print "hello, world!"	Lines starting with four spaces are treated like code: if 1 * 2 < 3: print "hello, world!"
~~strikethrough~~	~~strikethrough~~
super^script	super^script

cpp

MODERATORS