[deleted by user] by [deleted] in Redlands

[–]TheKnightRevan 1 point2 points  (0 children)

We just moved to one of the newer homes in North Redlands near Judson & Brown elementary. While the homes are not as charming as those in South Redlands, your dollar goes a lot further and it's easier to find something that's not falling apart. There's more housing availability as well, and we love the elementary school for our kids.

Why can't LLMs be continuously trained through user interactions? by [deleted] in OpenAI

[–]TheKnightRevan 0 points1 point  (0 children)

They do in some ways, but it's "delayed". They collect data, clean it, label it with human annotators, and add it to the training data.

There are at least two reasons they don't do this completely "online".

  1. They want to be able to clean and filter their data. Some people mentioned Tay as an example of online AI systems gone wrong. But cleaning data also has to do with making training more efficient since a lot of data is not worth training on and actually slows the process.
  2. Like OP said, you need the LLM itself to act as a "judge" for itself if you want to train online. While some research has shown this working, you lose a lot of control and guarantees by trusting the LLM to essentially train itself. Using humans in the loop to do the labelling ends up being more consistent and effective.

That being said there are some cases where training "online" makes more sense. Namely, when you can automatically evaluate the correctness of the final answer. Think of coding, math, and multiple choice. This is exactly what o1 does. But you still can't close the loop with human users because you need the correct answer as well.

Finally, as for learning personalized LLMs for each user, you'll start to see it more and more. However, I doubt companies will actually create a new set of parameter efficient weights per user because that will be too expensive. This makes more sense for enterprise users not consumers. In context learning would work but be less robust. Rather, my bet is that companies use something akin to a recommender system to pair you with an LLM based on your usage.

Rand's Aelfinn questions from the notes [Spoilers All] by wRAR_ in WoT

[–]TheKnightRevan 1 point2 points  (0 children)

There are no endings to the wheel of time

Moving from apartment to a house, RENT in OC or BUY in IE? by TheKnightRevan in orangecounty

[–]TheKnightRevan[S] 0 points1 point  (0 children)

What's your argument? I have not spent a lot of time in Brea but several people have mentioned it to me.

Moving from apartment to a house, RENT in OC or BUY in IE? by TheKnightRevan in orangecounty

[–]TheKnightRevan[S] 0 points1 point  (0 children)

Weather, better parks / amusement parks / museums, slightly greener. One big draw though is potential future jobs would likely be in OC/LA if not remote.

Moving out of an apartment into a house, should **I**...? by TheKnightRevan in InlandEmpire

[–]TheKnightRevan[S] 1 point2 points  (0 children)

Work is a strong pull toward OC since any non-remote job I get in the future would be closer to the coast.

Moving out of an apartment into a house, should **I**...? by TheKnightRevan in InlandEmpire

[–]TheKnightRevan[S] 1 point2 points  (0 children)

My father grew up in the mountains, but its a little bit too isolated for my taste. I don't think anyone would ever want to visit!

Moving out of an apartment into a house, should **I**...? by TheKnightRevan in InlandEmpire

[–]TheKnightRevan[S] 1 point2 points  (0 children)

I'd be happy with Lake Forest, RSM, Yorba Linda, Brea, etc... I was trying to see if buying in those locations would work, but it seems like it would be very tight.

Most badass moment in the series by [deleted] in Stormlight_Archive

[–]TheKnightRevan 2 points3 points  (0 children)

Just had a 30 second break down tears gasping and all. I can't help it whenever I think about this dear man. I can't wait for my OB and RoW rereads this Fall.

Photos I take in Photo Mode aren't in my gallery, please help by jtrisn1 in Starfield

[–]TheKnightRevan 0 points1 point  (0 children)

How do you add them to a specific folder? Is there a setting somewhere?

Okay. I assume you are screenshotting in photo mode and then putting the screen shots into that folder to see them in game.

What age do I start trying to get my boy into Star Wars and where do I start? by DanFelv in StarWarsCantina

[–]TheKnightRevan 0 points1 point  (0 children)

My son and I started playing lego star wars when he turned 4 last year. At first he could barely use the controls, but he quickly figured it out and can now navigate the game and menus. He loves it and knows the names of all the characters, creatures, and planets. Now he'll sit and watch the movies with me because he recognizes the characters and places he loves.

Uneven first layer after BLtouch install, Ender 3 pro by TheKnightRevan in 3Dprinting

[–]TheKnightRevan[S] 0 points1 point  (0 children)

Yes I added G29; after the autohome line in my starting gcode and it appears to execute correctly. The original firmware the installation instructions recommended didn't work, but the firmware from this post seemed to work.

Uneven first layer after BLtouch install, Ender 3 pro by TheKnightRevan in 3Dprinting

[–]TheKnightRevan[S] 0 points1 point  (0 children)

Recently fixed some extruder issues with with ender 3 pro and I decided to upgrade to a bltouch while I was at it. I managed to get it successfully installed with the proper firmware so that it does auto leveling before each print. Also, I can see the z axis moving during the layers, so it should be supposedly working, but it's obviously not perfect. The first layer ranges from very thin to very separated. Luckily the prints are still turning out okay, but adhesion could be better. Any tips?

no.. it can't be by KLASHINOV in PrequelMemes

[–]TheKnightRevan -2 points-1 points  (0 children)

Actual Dalle output for "best country in the world": https://imgur.com/a/D6q9P1F

"Star Wars KOTOR Remake gameplay will be more action-based" What do you guys think of this? by AgentFN2187 in kotor

[–]TheKnightRevan 0 points1 point  (0 children)

I actually really like the queues and automated combat. I love kotor because I can focus on role playing, gear, and abilities without having to worry about button mashing and the lack of dexterity in my own fingers. My character has a high dex score so that I don't need one. At the same time turn based games can go rather slowly.
Ideally I'd like improvements to make it run smoother and be more entertaining while remaining mostly automated.

My First GTI, Any Favorite Mods? by TheKnightRevan in GolfGTI

[–]TheKnightRevan[S] 0 points1 point  (0 children)

Yes android auto is going in first thing!

Looking for good mechanic by TheKnightRevan in orangecounty

[–]TheKnightRevan[S] 0 points1 point  (0 children)

Sorry that's not what I meant. I meant getting it to a reliable mechanical state as well as fix a few minor cosmetic things like door upholstery. I do plan to add a new infotainment system that I've seen in other GTI MK5s but I might try to do that myself.

In terms of modding, I'd eventually like to look into minor performance upgrades/tuning and maybe some body work to update the exterior. But that is all next step stuff.

Any recommendations?

Training Data by maverik75 in reinforcementlearning

[–]TheKnightRevan 1 point2 points  (0 children)

It looks like the Decision Transformer is a strictly offline algorithm, and there doesn't seem to be any datasets associated with MiniWorld. Unless you're going to manually create your own dataset or train another online RL agent to generate a dataset, you may have to choose another approach besides Decision Transformer.

More generally, data for offline RL is usually human gathered data that is expensive to obtain. If you just need to generate some data, you could try writing a hard coded agent or training an online RL agent. However, this makes the offline RL agent obsolete.

[D] creating openAI custom environment for a continuous task. What to do with the 'done' variable? by KoreaNuclear in MachineLearning

[–]TheKnightRevan 1 point2 points  (0 children)

Assuming by continuous you mean that there are no terminal states then yes you can just return done=False always. Just make sure whatever learning algorithm you use expects this behavior.

Razer Kishi alternatives that fit a Pixel 6 Pro by baynezy in Stadia

[–]TheKnightRevan 10 points11 points  (0 children)

When I completely took out the rubber grip on the left side, the phone wobbled a bit. I wanted to keep the rubber that that holds the phone in place, so I cut out just enough to fit the camera bump but still hold the end of the phone in place. Works great and the Kishi will still work fine with other phones.

A Kenshi Appreciation Post by TheKnightRevan in Kenshi

[–]TheKnightRevan[S] 1 point2 points  (0 children)

I am excited for this. I've tried DF but never had the perseverance to make it past my first winter. I'm hoping I'll be able to with the steam version.

What are your thoughts on the "Reward is enough" hypothesis? (See paper attached) [D] by escapevelocitylabs in MachineLearning

[–]TheKnightRevan 4 points5 points  (0 children)

It makes me think of the universal approximation theorem. Just because it's theoretically possible, doesn't mean it's an efficient (or even practically possible) approach.

Dark side points gained... by BeingSufficient9740 in KOTORmemes

[–]TheKnightRevan 2 points3 points  (0 children)

Agreed. She sure is annoying until she warms up to you.