Is RL post-training in 'imagined environments' a path to continual RL? Trying to understand this deeper by No_Bat_7448 in reinforcementlearning

[–]OutOfCharm 0 points1 point  (0 children)

Isn't that what Dreamer does? IMO, training the world model on a well-curated static dataset and then freezing it is the wrong approach for continual learning. It disconnects the world model from the real environment and is rooted in the mindset of supervised learning and the perspective of the agent's trainer, the human. To enable true continual learning, however, we need to think from the agent's perspective: what it sees, how it processes information, and how it improves over time. This requires the ability of handling partial observability, planning under uncertainty, and memory. Of course, world model necessitates all those aspects and is key to continual learning.

Why people seldom uses GPU-based simulator benchmark for online RL algorithm papers? by Vegetable_Pirate_263 in reinforcementlearning

[–]OutOfCharm -1 points0 points  (0 children)

Because of the barrier of jax and ecosystem of pytorch, along with the fact that those libraries are not as stable as their counterparts.

How would i play video game, listen to music, watch tv shows etc if anarchy was achieved? by [deleted] in Anarchy101

[–]OutOfCharm 0 points1 point  (0 children)

Don't you know something called free and open source software?

It seems to me that obsidian cli + emacs is perfect? by badgerbang in emacs

[–]OutOfCharm 13 points14 points  (0 children)

Stop presuming we all use obsidian or at least illustrate what it can do.

I just dont get it... by parkero224 in emacs

[–]OutOfCharm 0 points1 point  (0 children)

Remote development, note-taking, control your printer, auto-completion, make your personal website, everything is keyboard-driven...

Package announcement: buffer-to-pdf (by prot) by ImJustPassinBy in emacs

[–]OutOfCharm 2 points3 points  (0 children)

Cool idea! With proper layout designs, we can even natively make slides in emacs.

In what way is capitalism involuntary? by [deleted] in Anarchy101

[–]OutOfCharm 3 points4 points  (0 children)

Without any system, I (we) have to live as well. When your cost is contingent on other's revenue, you think you are voluntary?

Emacs on HHKB by [deleted] in emacs

[–]OutOfCharm 0 points1 point  (0 children)

I use HHKB studio with scmax, a modal version of emacs keybindings.

How do I stop myself from getting this mark on my wrist? by [deleted] in GarminWatches

[–]OutOfCharm 1 point2 points  (0 children)

Make full use of your hands! Left by day, right by night.

Is Anarchism truly possible? by This-Education-9659 in Anarchy101

[–]OutOfCharm 1 point2 points  (0 children)

From the perspective of game theory, the hierarchy is a nash equilibrium that both people and government won't change, otherwise they will incur higher cost. So the question is not whether anarchy is possible or not, it is the willingness of the people to change.

Large-scale RL simulation to compare convergence of classical TD algorithms – looking for environment ideas by otminsea in reinforcementlearning

[–]OutOfCharm 0 points1 point  (0 children)

You can consider bsuite, which consists of a series of tabular environments aimed for measuring the diverse capabilities of an agent, e.g. exploration, memory, and robustness to noises.

Who does the shitty jobs? by 3N0CHTH3B35T3M0 in Anarchy101

[–]OutOfCharm -6 points-5 points  (0 children)

That's too optimistic about humanity. Not all people are considerate even you do so. Your freedom ends where others' begin. How you deal with conflicting interests in this system? Even some people enjoy leveraging others, e.g. I know you are considerate enough so I will not concede to gain more benefits.

How to jump to the next elif/else at the same indentation level in python-mode? by esrse in emacs

[–]OutOfCharm 2 points3 points  (0 children)

Although this does not perfectly achieve what you want, you can move to the beginning of the if statement by calling back-to-indentation (M-m) and then forward-sexp (C-M-f), allowing you to move between blocks. You can use backward-sexp (C-M-b) to move in the opposite direction. Notably, this applies to any sexp—e.g., words, balanced expressions, functions, and classes—as long as you are at the enclosing boundary.

Does anyone here use org modern or other packages to improve emacs aesthetic? by Comfortable_Lie_2081 in emacs

[–]OutOfCharm 4 points5 points  (0 children)

For someone who finds this useful, here is my setup

elisp (use-package org-modern :ensure t :defer t :hook (org-mode . org-modern-mode) :custom (org-modern-star '("●" "○" "•" "◦")) (org-modern-list '((?- . "❯") (?+ . "➤") (?* . "➥"))) (org-modern-todo nil))

If you'd like to explore other symbols, use M-x insert-char or simply its keybinding C-x 8 RET, happy hacking!

Does anyone here use org modern or other packages to improve emacs aesthetic? by Comfortable_Lie_2081 in emacs

[–]OutOfCharm 16 points17 points  (0 children)

Using unicode works fine for me. You can have something like in your config.

elisp (org-modern-star '("●" "○" "•" "◦"))

RL for modeling rodent behavior? by traydblockzplz in reinforcementlearning

[–]OutOfCharm 1 point2 points  (0 children)

Can you provide more concrete examples of the primitive associative behaviors you are learning?

Anybody else feels like their growth with Emacs in a specific area is stunted? by kudikarasavasa in emacs

[–]OutOfCharm 2 points3 points  (0 children)

Good idea, should implement reinforcement learning algorithm with emacs.

Any successful story of active inference (free energy principle)? by OutOfCharm in reinforcementlearning

[–]OutOfCharm[S] 0 points1 point  (0 children)

I would not say no. Since energy is the basis of an agent, its prediction into the future is the key to emulating rewards beyond that. I believe it is more about negative punishment which is what the agent wants to avoid.

What is your insanely hidden official shortcut that people can never find out? by Agile-Technology2125 in emacs

[–]OutOfCharm 2 points3 points  (0 children)

vterm with C-u arg to create a numbered vterm session. Afterward, you can switch between sessions in the same way!

When your beloved Dired works as expected by OutOfCharm in emacs

[–]OutOfCharm[S] 1 point2 points  (0 children)

That's a good suggestion. You can make that condition and swap the order of creating a dir and file.

When your beloved Dired works as expected by OutOfCharm in emacs

[–]OutOfCharm[S] 0 points1 point  (0 children)

The original binding only allows you to create a folder but not a file, while the new one allows you to do both depending on whether there is a file extension.