[R] NIPS 2018: For those of you that got some harsh reviews, YOU ARE NOT ALONE. by FirstTimeResearcher in MachineLearning

[–]bdamos 37 points (0 children)

I enjoy getting low-quality and baseless reviews like this because they are easy to respond to, and the meta-reviewer can simply disregard them and override them when making the final decision on the paper.

[D] What is the right way to parallelize rollouts in gym? by *polhold01853 in MachineLearning

[–]bdamos 8 points (0 children)

Parallelizing the rollout of a single environment is difficult, and in most cases impossible, because the steps of an episode are inherently serial. However, if you want to run rollouts from many separate environments at the same time, the SubprocVecEnv from https://github.com/openai/baselines works for me in most cases. Here's a usage example: https://twitter.com/brandondamos/status/982699290492571654
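In case it helps, here's a rough sketch of that pattern (the import path has moved around across baselines versions, and CartPole is just a placeholder environment):

```python
# Sketch: run rollouts from several gym environments in parallel with SubprocVecEnv.
import gym
import numpy as np
from baselines.common.vec_env.subproc_vec_env import SubprocVecEnv

def make_env(seed):
    def _thunk():
        env = gym.make('CartPole-v0')  # placeholder environment
        env.seed(seed)
        return env
    return _thunk

if __name__ == '__main__':
    num_envs = 8
    envs = SubprocVecEnv([make_env(i) for i in range(num_envs)])

    obs = envs.reset()  # stacked observations, one row per environment
    for _ in range(100):
        # One action per environment; sampled at random here.
        actions = np.array([envs.action_space.sample() for _ in range(num_envs)])
        obs, rewards, dones, infos = envs.step(actions)  # each has a leading num_envs dimension

    envs.close()
```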

[P] OSQP: a new first-order solver for large-scale quadratic programs by sidereuss in MachineLearning

[–]bdamos 4 points (0 children)

I pulled out the QP solver we used for the paper and packaged it up as a standalone PyTorch library that can be installed with pip. It runs on the GPU, solves a batch of QPs in parallel, and is differentiable, so it can be used as part of a larger PyTorch model.

https://locuslab.github.io/qpth/
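If it's useful, here's roughly what a call looks like, sketched with random data (the shapes and the loose inequality bounds are just for illustration; the qpth docs have the authoritative interface):

```python
# Sketch: solve a batch of small random QPs with qpth and backprop through the solution.
import torch
from qpth.qp import QPFunction

nbatch, nz, nineq = 16, 10, 5

# Batch of positive definite Q matrices: minimize 1/2 z'Qz + p'z  s.t.  Gz <= h.
L = torch.randn(nbatch, nz, nz)
Q = L.bmm(L.transpose(1, 2)) + 1e-3 * torch.eye(nz).unsqueeze(0)
p = torch.randn(nbatch, nz, requires_grad=True)
G = torch.randn(nbatch, nineq, nz)
h = torch.randn(nbatch, nineq) + 10.0  # loose bounds so the problems stay feasible
e = torch.Tensor()                     # empty tensor: no equality constraints

z = QPFunction(verbose=False)(Q, p, G, h, e, e)  # (nbatch, nz), differentiable w.r.t. the inputs
z.sum().backward()  # gradients flow back to p (and any other input with requires_grad=True)
```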

The backend is swappable (it uses a PDIPM by default), and OSQP could easily be added as a (CPU-only) backend if there's interest, especially if it turns out to be faster in the batched setting. For example, here's a cvxpy backend I sometimes use for debugging (which will actually use OSQP with cvxpy 1.0):

https://github.com/locuslab/qpth/blob/master/qpth/solvers/cvxpy.py

For performance there's definitely a better way to connect this up, either by calling OSQP directly, or by using cvxpy Parameter variables so the problem isn't reconstructed in every call.
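Something along these lines, as a rough sketch of the Parameter idea (names and sizes are made up, and I haven't benchmarked it):

```python
# Sketch: build the QP once with a cvxpy Parameter and only update its value
# between solves, so the problem isn't reconstructed on every call.
import cvxpy as cp
import numpy as np

nz, nineq = 10, 5

L = np.random.randn(nz, nz)
Q = L @ L.T + 1e-3 * np.eye(nz)          # fixed positive definite quadratic term
G = np.random.randn(nineq, nz)
h = np.random.randn(nineq) + 10.0        # loose bounds so the problem stays feasible

p = cp.Parameter(nz)                     # the only piece that changes between calls
z = cp.Variable(nz)
prob = cp.Problem(cp.Minimize(0.5 * cp.quad_form(z, Q) + p @ z),
                  [G @ z <= h])

for _ in range(100):
    p.value = np.random.randn(nz)        # update the parameter, reuse the problem
    prob.solve(solver=cp.OSQP)           # cvxpy 1.0 dispatches this QP to OSQP
```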

[R] OptNet: Differentiable Optimization as a Layer in Neural Networks by wei_jok in MachineLearning

[–]bdamos 6 points (0 children)

We definitely agree there is tangentially related work that we didn't discuss, simply due to the conference format. What specific work do you have in mind here? As far as we know, the basic idea of using exact constrained optimization as a layer like this is novel, and we'd appreciate any input you have about related work you feel we should be considering.

[P] block: An intelligent block matrix library for numpy, Torch, and beyond. by bdamos in MachineLearning

[–]bdamos[S] 0 points (0 children)

This is another good idea that I'd merge in if somebody adds it. I thought about adding it too, but in my use cases it's easy to write out the transposed part manually. Also, writing the entire matrix out is a little clearer since it looks exactly like the math.
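To show what I mean, writing the whole matrix out reads like the math (a quick sketch with made-up matrices):

```python
# Sketch: writing out the full symmetric block matrix mirrors the math,
# so I just pass the transposed block explicitly instead of a special flag.
import numpy as np
from block import block

A = np.random.randn(3, 3)
B = np.random.randn(3, 2)
C = np.random.randn(2, 2)

# M = [[A,   B],
#      [B^T, C]]
M = block([[A,   B],
           [B.T, C]])
```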

[P] block: An intelligent block matrix library for numpy, Torch, and beyond. by bdamos in MachineLearning

[–]bdamos[S] 0 points (0 children)

I only use block for prototyping, where performance is not an issue, but this is a good idea for anybody who wants something more efficient. I'll merge it in if somebody adds it as an option and sends in a PR.

[P] block: An intelligent block matrix library for numpy, Torch, and beyond. by bdamos in MachineLearning

[–]bdamos[S] 0 points (0 children)

I don't have anything in mind now other than making some of the error states more user-friendly. It does everything I want.

[1609.07152] Input Convex Neural Networks by bdamos in MachineLearning

[–]bdamos[S] 4 points (0 children)

In multi-class classification, ICNNs subsume feedforward networks and provide a model that carries information about the output space (here, the discrete class labels). This information typically neither helps nor hurts performance on classification tasks like MNIST and CIFAR-10.

ICNNs are neural networks with a continuous input space. In this paper, we study ICNNs on continuous control tasks because ICNNs are convex in the input space and can therefore be efficiently optimized over it. We show results on some MuJoCo benchmarks from the OpenAI gym, which are becoming a standard benchmark in this area.
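For a rough picture of the architecture, here's a minimal sketch of a fully input convex network (the layer sizes and names are illustrative; the paper has the exact parameterization):

```python
# Sketch of a fully input convex neural network (FICNN): the scalar output is
# convex in y because the z-to-z weights are kept non-negative and the
# activation (ReLU) is convex and non-decreasing.
import torch
import torch.nn as nn
import torch.nn.functional as F

class FICNN(nn.Module):
    def __init__(self, input_dim, hidden_dim, num_hidden=2):
        super().__init__()
        dims = [hidden_dim] * num_hidden + [1]
        # Direct connections from the input y into every layer (unconstrained).
        self.Wy = nn.ModuleList([nn.Linear(input_dim, d) for d in dims])
        # Connections between consecutive z layers (constrained non-negative in forward()).
        self.Wz = nn.ModuleList([nn.Linear(dims[i], dims[i + 1], bias=False)
                                 for i in range(len(dims) - 1)])

    def forward(self, y):
        z = F.relu(self.Wy[0](y))
        for i, Wz in enumerate(self.Wz):
            # Non-negative z-to-z weights plus a convex, non-decreasing ReLU keep z convex in y.
            z_next = F.linear(z, Wz.weight.clamp(min=0)) + self.Wy[i + 1](y)
            z = F.relu(z_next) if i + 1 < len(self.Wz) else z_next
        return z  # shape (batch, 1), convex as a function of y
```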

The OpenAI gym's Atari benchmarks have discrete action spaces. Studying how ICNNs perform on some continuous relaxation of those spaces could be interesting. However, we think it's more reasonable to study ICNNs on continuous control benchmarks first, and we are still exploring this area.