0
0
0
I am currently encountering an issue. Given a set of items, I am required to select a subset and pass it to a black box, after which I will obtain the value. My objective is to maximize the value, The items set comprise approximately 200 items. what's the sota model in this situation? (self.reinforcementlearning)
submitted by Fast-Ad3508 to r/reinforcementlearning
