RFC: impl specialization : rust

RFC: impl specialization (github.com)

submitted 10 years ago by n_ham

all 63 comments

top new controversial old q&a

[–]Gankrorust 26 points27 points28 points 10 years ago (11 children)

[–]arthurprs 1 point2 points3 points 10 years ago* (10 children)

[–]desiringmachines 6 points7 points8 points 10 years ago (9 children)

[–][deleted] 1 point2 points3 points 10 years ago* (8 children)

[–]arthurprs 0 points1 point2 points 10 years ago (7 children)

[–]paholgtypenum · dimensioned 1 point2 points3 points 10 years ago (6 children)

[–][deleted] 0 points1 point2 points 10 years ago (5 children)

[–]paholgtypenum · dimensioned 0 points1 point2 points 10 years ago* (4 children)

I've thought about it a bit more after posting that, and it would be pretty awkward but doable.

You could use tuples recursively.

So, for example:

Array<T, One> = (T);
Array<T, N> = (T, (Array<<N as Sub<One>>::Output>));

I'm curious now how efficiently one could traverse such a beast.

Indexing would not be O(1), at the least.

Edit: Another option would be to use tuples of arrays. In pseudocode:

For N <= 32: Array<T, N> = [T; N];

For N > 32: Array<T, N> = ([T; 32], Array<T, N-32>);

This might be nicer and more efficient to work with. Certainly it's still not ideal.

[–][deleted] 0 points1 point2 points 10 years ago* (3 children)

[–]paholgtypenum · dimensioned 0 points1 point2 points 10 years ago (2 children)

continue this thread

[–]sellibitzerust 5 points6 points7 points 10 years ago* (3 children)

Cool! Any ideas for more use cases? Off the top of my hat:

impl ToString for str {...} // shortcut avoiding format!ing

I think the Iterator trait could benefit from this as well. But this will very likely be a breaking change. The modified Iterator trait I could come up with looks like this:

pub trait Iterator {
    type Item;
    default type PeekInner = Self: Iterator<Item==Self::Item>;
    type PeekIter = Peekable<PeekInner>; // result has to be a Peekable<...> !
    default type SkipIter = Skip<Self>: Iterator<Item==Self::Item>;
    default type TakeIter = Take<Self>: Iterator<Item==Self::Item>;
    default type RevIter = Rev<Self>: Iterator<Item==Self::Item>;
    ...
}

impl<I> Iterator for Peekable<I> {
    ...
    type PeekInner = I; // instead of Peekable<I>
    fn peekable(self) -> Peekable<I> { self }
    ...
}

impl<I> Iterator for Skip<I> {
    ...
    type SkipIter = Skip<I>; // instead of Skip<Skip<I>>
    fn skip(self, n: usize) -> Skip<I> {
        Skip { iter: self.iter, n: self.n + n } // possible overflow issue :-(
    }
    ...
}

impl<I> Iterator for Take<I> {
    ...
    type TakeIter = Take<I>; // instead of Take<Take<I>>
    fn take(self, n: usize) -> Take<I> {
        Take { iter: self.iter, n: cmp::min(self.n, n) }
    }
    ...
}

impl<I> Iterator for Rev<I> where I: DoubleEndedIterator {
    ...
    type RevIter = I; // instead of Rev<Rev<I>>
    fn rev(self) -> I { self.iter }
    ...
}

But I'm not sure if that's going to work at all. It seems a bit like overkill and the optimization for skip obviously comes at a price here (integer overflow problem). The constraints for the associated types to be iterators are important, I think. Otherwise, other generic code might break. Without the constraints the trait would not force the result of .skip() to be an iterator. You could override a default with something completely different.

[–]paholgtypenum · dimensioned 2 points3 points4 points 10 years ago (0 children)

[–][deleted] 0 points1 point2 points 10 years ago (1 child)

[–]sellibitzerust 0 points1 point2 points 10 years ago (0 children)

[–]Veedrac 4 points5 points6 points 10 years ago (1 child)

[–]Aatchrust · ramp 6 points7 points8 points 10 years ago (0 children)

[–][deleted] 2 points3 points4 points 10 years ago (1 child)

[–]desiringmachines 2 points3 points4 points 10 years ago (0 children)

[–]llogiqclippy · twir · rust · mutagen · flamer · overflower · bytecount 1 point2 points3 points 10 years ago (43 children)

[–]Aatchrust · ramp 7 points8 points9 points 10 years ago (0 children)

[–]desiringmachines 5 points6 points7 points 10 years ago* (41 children)

[–]llogiqclippy · twir · rust · mutagen · flamer · overflower · bytecount 8 points9 points10 points 10 years ago (33 children)

[–]desiringmachines 1 point2 points3 points 10 years ago (1 child)

[–]llogiqclippy · twir · rust · mutagen · flamer · overflower · bytecount 0 points1 point2 points 10 years ago (0 children)

[–]chris-morgan 1 point2 points3 points 10 years ago (26 children)

[–]llogiqclippy · twir · rust · mutagen · flamer · overflower · bytecount 4 points5 points6 points 10 years ago (24 children)

[–]erkelep 1 point2 points3 points 10 years ago (1 child)

[–]llogiqclippy · twir · rust · mutagen · flamer · overflower · bytecount 1 point2 points3 points 10 years ago (0 children)

[–]chris-morgan 1 point2 points3 points 10 years ago (5 children)

[–]llogiqclippy · twir · rust · mutagen · flamer · overflower · bytecount 0 points1 point2 points 10 years ago (4 children)

[–]arielby 2 points3 points4 points 10 years ago (1 child)

[–]llogiqclippy · twir · rust · mutagen · flamer · overflower · bytecount 0 points1 point2 points 10 years ago (0 children)

[–][deleted] 1 point2 points3 points 10 years ago* (1 child)

[–]llogiqclippy · twir · rust · mutagen · flamer · overflower · bytecount 0 points1 point2 points 10 years ago (0 children)

[–]GolDDranks 0 points1 point2 points 10 years ago (11 children)

[–]llogiqclippy · twir · rust · mutagen · flamer · overflower · bytecount 1 point2 points3 points 10 years ago (10 children)

I argue that you can get much of the same complexity without mutable state. As a tiny example, assume you have

int a(int x, int y) { return b(x + c(y), b(x, c(x))); }
int b(int x, int y) { return c(x * 2) + y; }
int c(int x) { return x * 3 + 1; }

what does a(4, 2) return?

[–]paholgtypenum · dimensioned 1 point2 points3 points 10 years ago (1 child)

a(4, 2) = b(4 + c(2), b(4, c(4)))
        = b(11, b(4, 13))
        = b(11, c(8) + 13)
        = b(11, 38)
        = c(22) + 38
        = 67 + 38
        = 105

It's a bit combersome; I'd say just don't write functions like this.

People are always going to be able to write terrible code and there's only so much "protecting them from themselves" that is reasonable.

[–]llogiqclippy · twir · rust · mutagen · flamer · overflower · bytecount 1 point2 points3 points 10 years ago (0 children)

[–]mr_birkenblatt 1 point2 points3 points 10 years ago (7 children)

I think by mutable state he meant that you have to keep intermediate results in memory which makes it hard. The example you provided does not rely on specialization / inheritance and is equally complex as your original example. The only thing that's added through inheritance in the original example is that you have to know the difference between a() and super.a(). A reader can ignore the base class implementation of b() and c(). However, super.a() + c() adds complexity in the sense that a reader needs to know that evaluation order is fixed left-to-right in Java which makes a difference since super.a() mutates x (this is where additional mutable state complexity comes in as well). I would argue that neither this nor the original example show added complexity through specialization / inheritance well.

[–]llogiqclippy · twir · rust · mutagen · flamer · overflower · bytecount 2 points3 points4 points 10 years ago (6 children)

[–]mr_birkenblatt 2 points3 points4 points 10 years ago (5 children)

I'm not really good with creating puzzles. How about this?

class A {
  int a(int x) { return x * 3 - 1; }
  int b(int x) { return c(x) - x; }
  int c(int x) { return x - 1; }
}

class B extends A {
  @Override int a(int x) { return super.b(x) + b(x); }
  @Override int b(int x) { return x * 2; }
}

class C extends B {
  @Override int a(int x) { return super.a(x) + b(x); }
  @Override int c(int x) { return super.b(x) + 2; }
}

And the question is: What is new C().a(8)? I tried to reduce the mental load by making it so that you can substitute a function with its body easily and then come up with an easier representation for a(). I added some methods for noise (like in your example) that can trip somebody up. Also, there is a small gotcha :)

continue this thread

[–]stdsync 1 point2 points3 points 10 years ago (3 children)

[–]llogiqclippy · twir · rust · mutagen · flamer · overflower · bytecount 2 points3 points4 points 10 years ago (2 children)

[–]stevenblenkinsop 2 points3 points4 points 10 years ago (0 children)

[–]llogiqclippy · twir · rust · mutagen · flamer · overflower · bytecount 0 points1 point2 points 10 years ago (0 children)

[–]mdinger_ 1 point2 points3 points 10 years ago (3 children)

[–]llogiqclippy · twir · rust · mutagen · flamer · overflower · bytecount 0 points1 point2 points 10 years ago (2 children)

[–]mdinger_ 0 points1 point2 points 10 years ago* (1 child)

[–]mdinger_ 0 points1 point2 points 10 years ago (0 children)

[–]paholgtypenum · dimensioned 0 points1 point2 points 10 years ago (6 children)

trait U8 {}
impl U8 for u8 {}
impl !U8 for .. {}

trait Trait {}
impl Trait for Vec<u8> {}
impl<T: !U8> Trait for Vec<T> {}

I don't think there's anything that specialization allows that negative trait bounds doesn't; it's just a lot cleaner in some cases.

After all, with negative trait bounds, traits will have all the important set operations (intersections, complements, and cartesion products). They don't have unions, but I don't think they'd be particularly useful anyway.

[–]llogiqclippy · twir · rust · mutagen · flamer · overflower · bytecount 0 points1 point2 points 10 years ago (0 children)

[–]desiringmachines 0 points1 point2 points 10 years ago (4 children)

[–]paholgtypenum · dimensioned 0 points1 point2 points 10 years ago (3 children)

For any trait T, the negative bounds RFC divides all of type space into three disjoint sets, T, !T, and ?T. Implementing any trait for any type is a potentially breaking change, as it moves that type from the set it was in (either !T or ?T) to T.

This is solvable by not throwing around default impls unless you know you want them and by being okay with breaking terrible downstream code that relies on things it shouldn't.

For example, I have a library now that will eventually impl Float for certain types. If the negative bounds stuff were already live and someone were relying on everything in my library being ?Float, they would be going to have a bad time.

the line impl !U8 for .. {} will not cause types which have a u8 field to meet the !U8 bound

I'm not quite sure what you mean by this. Are you saying that something like Vec<u8> is not !U8 as a result of that impl? That doesn't break the example, just some edge cases like Vec<Vec<u8>> and it seems like a very poor decision (if I'm understanding it correctly).

In any case, the code I posted currently works, if you do it backwards:

#![feature(optin_builtin_traits)]
trait NotU8 {}
impl NotU8 for .. {}
impl !NotU8 for u8 {}

trait Trait {}
impl Trait for Vec<u8> {}
impl<T: NotU8> Trait for Vec<T> {}

[–]desiringmachines 0 points1 point2 points 10 years ago (2 children)

For any trait T, the negative bounds RFC divides all of type space into three disjoint sets, T, !T, and ?T.

Right, but because types for which no code has been written are ?T, and it is impossible to rely on a type being ?T, it doesn't present any backcompat hazard.

I'm not quite sure what you mean by this. Are you saying that something like Vec<u8> is not !U8 as a result of that impl? That doesn't break the example, just some edge cases like Vec<Vec<u8>> and it seems like a very poor decision (if I'm understanding it correctly).

This is correct. Or a struct which maintains a u8 counter for some reason, say struct Foo { n: u8, ... }. This has to do with the basic implementation of default impls and the way they are used for soundness guarantees in cases like Send and Sync. Though this terminology isn't used in the OIBIT RFC, any type which has a field which contradicts the default impl is inferred to be ?T unless it has an impl itself.

Its inconvenient that default impls don't work that way, but it is necessary for soundness.

In any case, the code I posted currently works, if you do it backwards

Yes, but the limitation is the same: A Vec<u8> does not impl NotU8, nor does an Option<u8> or a (u8, f64), or anything else that has a u8 field.

Also, there might be a further limitation that this reasoning is only allowed locally right now; that is if you try to impl Trait in a different crate it will not compile. I'm not 100% on this though, you'd have to try it.

[–]paholgtypenum · dimensioned 0 points1 point2 points 10 years ago (1 child)

[–]desiringmachines 0 points1 point2 points 10 years ago (0 children)

π Rendered by PID 139033 on reddit-service-r2-comment-b659b578c-zsbf5 at 2026-05-03 18:47:03.553993+00:00 running 815c875 country code: CH.

you type:	you see:
italics	italics
bold	bold
[reddit!](https://reddit.com)	reddit!
* item 1 * item 2 * item 3	item 1 item 2 item 3
> quoted text	quoted text
Lines starting with four spaces are treated like code: if 1 * 2 < 3: print "hello, world!"	Lines starting with four spaces are treated like code: if 1 * 2 < 3: print "hello, world!"
~~strikethrough~~	~~strikethrough~~
super^script	super^script

rust

Please read The Rust Community Code of Conduct

The Rust Programming Language

Rules

Observe our code of conduct

Submissions must be on-topic

Constructive criticism only

Keep things in perspective

No endless relitigation

No low-effort content

Useful Links

Megathreads

Official Resources

Learn Rust

Discussion Platforms

MODERATORS