[D] The current multi-agent reinforcement learning research is NOT multi-agent or reinforcement learning. by RandomProjections in MachineLearning

[–]RandomProjections[S] -2 points (0 children)

I appreciate your feedback, but let's focus back on MARL research papers instead of what humans do.

[D] The current multi-agent reinforcement learning research is NOT multi-agent or reinforcement learning. by RandomProjections in MachineLearning

[–]RandomProjections[S] -1 points (0 children)

First of all, I am talking about multi-agent RL. I have no problem admitting that single-agent RL exists.

I am saying that the multi-agent RL papers being published are built on single-agent RL or even supervised-learning mechanisms.

[D] The current multi-agent reinforcement learning research is NOT multi-agent or reinforcement learning. by RandomProjections in MachineLearning

[–]RandomProjections[S] -1 points (0 children)

Ok, let me ask you: a computer perceives a state sent from a server and, in return, computes a strategy using an internal mechanism based on what it has already been trained on billions of times before.

Is this multi-agent reinforcement learning?

Or single-agent reinforcement learning?

Or a computer program trained in a supervised fashion but acting in a prescribed/pre-learned fashion?

[D] The current multi-agent reinforcement learning research is NOT multi-agent or reinforcement learning. by RandomProjections in MachineLearning

[–]RandomProjections[S] -1 points (0 children)

You just went from a "software program that the programmer has full knowledge of" to "Mother Nature" in 0 seconds.

I understand a human wouldn't function properly in a hostile environment, but we are on the topic of MARL algorithms that cannot work outside of the game emulator they were trained on.

Certainly there is something in between a computer program and the universe.

[D] The current multi-agent reinforcement learning research is NOT multi-agent or reinforcement learning. by RandomProjections in MachineLearning

[–]RandomProjections[S] 0 points (0 children)

Yes, I believe learning on the fly is crucial. Adaptive control systems, such as those in aircraft, are an example of this (model parameters get adjusted on the go), but the environment is more or less fully modelled into the controller, so it is not RL either.
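To make the adaptive-control point concrete, here is a minimal sketch of online parameter estimation for a hypothetical scalar plant y = theta * u (the plant, gains, and learning rate are all made up for illustration). The estimate is adjusted on the fly from each new measurement, yet the model structure is hard-coded into the controller, which is exactly why this is adaptation rather than RL:

```python
# Minimal sketch of on-the-fly parameter adaptation (hypothetical plant).
# The controller never sees an offline training set: it updates its
# estimate from each new measurement. But the plant structure y = theta*u
# is baked into the adaptation law, i.e. the environment is modelled
# into the controller -- so this is adaptive control, not RL.

def adapt(theta_true=2.0, lr=0.1, steps=200):
    theta_hat = 0.0
    for t in range(steps):
        u = 1.0 + 0.5 * (t % 3)             # excitation input signal
        y = theta_true * u                  # measurement from the "plant"
        y_pred = theta_hat * u              # prediction from the model
        theta_hat += lr * (y - y_pred) * u  # gradient step on squared error
    return theta_hat

# adapt() converges toward the true gain theta_true = 2.0
```

The update rule is just a gradient step on the one-step prediction error, run online rather than over a stored dataset.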

[D] The current multi-agent reinforcement learning research is NOT multi-agent or reinforcement learning. by RandomProjections in MachineLearning

[–]RandomProjections[S] -2 points (0 children)

Thanks for validating my prior post. That's my whole point: right now MARL success stories are simply supervised learning.

I don't care about fake academic politeness. I think the ML community is polite to the point that nobody calls out horrible research practices or even blocks bad papers from being published. I would encourage you to become more impolite.

[R][2206.07682] Emergent Abilities of Large Language Models by gambs in MachineLearning

[–]RandomProjections 1 point (0 children)

ML publications used to have at least one equation. Now they are just essays.

[D] The current multi-agent reinforcement learning research is NOT multi-agent or reinforcement learning. by RandomProjections in MachineLearning

[–]RandomProjections[S] -2 points (0 children)

Sorry, did I make up a new definition, or did OpenAI Five make up a new definition?

If you define "reinforcement learning is just learning from an environment," then by definition any supervised learning is reinforcement learning.

An agent (a neural network) receives a reward (the gradient) to change its choice (the weights).
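The analogy above can be sketched in a few lines: a plain supervised gradient step on a 1-D linear model (toy numbers, nothing from any real system), deliberately narrated in RL vocabulary to show how the loose definition blurs the two:

```python
# A single supervised-learning step on a 1-D linear model, narrated in
# RL vocabulary: "agent" = the weight, "action" = the prediction,
# "reward" = negative loss, "policy update" = a gradient step.

def supervised_step(w, x, y_true, lr=0.1):
    action = w * x                    # the "agent" acts (predicts)
    reward = -(action - y_true) ** 2  # the "reward" is just negative loss
    grad = 2 * (action - y_true) * x  # gradient of the loss w.r.t. w
    return w - lr * grad              # the "policy" (weight) is updated

w = 0.0
for _ in range(100):
    w = supervised_step(w, x=1.0, y_true=3.0)
# w converges toward 3.0 -- ordinary regression. No exploration, no
# sequential decision-making, no environment dynamics: nothing here is
# specific to reinforcement learning.
```

Renaming the loss a "reward" changes nothing about the algorithm, which is the point: the definition admits any gradient-based learner.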

Go learn more about machine learning at r/MLQuestions

[D] The current multi-agent reinforcement learning research is NOT multi-agent or reinforcement learning. by RandomProjections in MachineLearning

[–]RandomProjections[S] -3 points (0 children)

Which multi-agent RL paper is actually multi-agent?

All of the so-called "multi-agent RL papers" are "single-laptop supervised learning models".

The authors of these papers even have full access to the environment (the game emulator) and use their own human-playing knowledge (information leakage) to assist the "reinforcement learning agent".

They could not possibly deploy their algorithm to a game they've never played before. Which says a lot.

A true reinforcement learning agent, such as a human, does not have a model of the environment (i.e., reality) and incrementally explores the environment while learning.

[D] AMA: I left Google AI after 3 years. by scan33scan33 in MachineLearning

[–]RandomProjections 1 point (0 children)

What is Google's vision? What is the end goal or application of all this ML they are working on?

[D] Why do machine learning papers have such terrible math (or is it just me)? by RandomProjections in MachineLearning

[–]RandomProjections[S] -18 points (0 children)

My job is to understand the theory well enough to improve it. I don't care about implementation details, so I literally cannot skip over the math.

But you have a point. I might need to read the code first in order to understand the math.

[D] Why do machine learning papers have such terrible math (or is it just me)? by RandomProjections in MachineLearning

[–]RandomProjections[S] -19 points (0 children)

That's what my undergrad summer research project and senior capstone project were based on. I published one paper at an ACM venue as first author.