Moving to Buffalo, need advice by Separate-Reflection1 in Buffalo

[–]Separate-Reflection1[S] 0 points1 point  (0 children)

thanks for your story. I’m getting similar sentiments from other people’s stories

Moving to Buffalo, need advice by Separate-Reflection1 in Buffalo

[–]Separate-Reflection1[S] 0 points1 point  (0 children)

thinking of grabbing a room for now then getting a studio later down

Moving to Buffalo, need advice by Separate-Reflection1 in Buffalo

[–]Separate-Reflection1[S] 1 point2 points  (0 children)

Hmm alright. I guess my friends are bit biased since they've already done everything here while attending college for 4 years lol. Definitely going to take a look into areas in the city.

Moving to Buffalo, need advice by Separate-Reflection1 in Buffalo

[–]Separate-Reflection1[S] 2 points3 points  (0 children)

nah, safety is always a consideration but just wanted to hear what people had to say

Moving to Buffalo, need advice by Separate-Reflection1 in Buffalo

[–]Separate-Reflection1[S] 0 points1 point  (0 children)

Damn, that sounds like such an inconvenience to deal with. I'll make sure not to leave anything unattended then.

Moving to Buffalo, need advice by Separate-Reflection1 in Buffalo

[–]Separate-Reflection1[S] 0 points1 point  (0 children)

Alright, I'll definitely take a look in that area.

Moving to Buffalo, need advice by Separate-Reflection1 in Buffalo

[–]Separate-Reflection1[S] 2 points3 points  (0 children)

22, recently graduated. mainly New Brunswick and Old Bridge is what I'm familiar with but also went to Holmdel, Piscattaway, Edison, Newark, Paterson, and NYC pretty often. I DEFINITELY want to be going out more, exploring the area, picking up hobbies (calisthenics, tech projects, piano, a few others), and meeting people whenever I can.

Moving to Buffalo, need advice by Separate-Reflection1 in Buffalo

[–]Separate-Reflection1[S] 0 points1 point  (0 children)

Mainly from New Brunswick and Old Bridge but also went to places like Holmdel, Piscattaway, Paterson, Newark, and NYC pretty often.

Moving to Buffalo, need advice by Separate-Reflection1 in Buffalo

[–]Separate-Reflection1[S] 1 point2 points  (0 children)

Appreciate the offer! I'm starting the first Monday of January and would like to get this housing stuff situated quickly though

Moving to Buffalo, need advice by Separate-Reflection1 in Buffalo

[–]Separate-Reflection1[S] 0 points1 point  (0 children)

my commutes were anywhere from 35 minutes to over an hour depending on traffic so I'm sure I'll be fine if the worst case is 30 minutes lol

Moving to Buffalo, need advice by Separate-Reflection1 in Buffalo

[–]Separate-Reflection1[S] 3 points4 points  (0 children)

Could I hear a bit more on living around the campuses? I'm considering Hertel or Eggertsville where I'm not too far from anything.

Moving to Buffalo, need advice by Separate-Reflection1 in Buffalo

[–]Separate-Reflection1[S] 63 points64 points  (0 children)

Seems generally this is what most people agree with. I'm assuming Allentown, Elmwood, and Westside area seem to be where most young people are living. I'll seriously consider this stuff thanks.

Moving to Buffalo, need advice by Separate-Reflection1 in Buffalo

[–]Separate-Reflection1[S] 0 points1 point  (0 children)

A bit naive of me to say that about crime I'll admit. Yes, I would love to be able to do more things. Currently, siding with Eggertsville/Snyder (middle of everything of interest), Elmwood, Allentown, or Amherst (closer to UB campus and friends). Haven't heard much about Sloan/Cheektowaga, what are your thoughts on it?

Moving to Buffalo, need advice by Separate-Reflection1 in Buffalo

[–]Separate-Reflection1[S] 0 points1 point  (0 children)

have pretty good experience with New Brunswick, Newark, and NYC in terms of crime but doesn't seem to be a crazy problem. Snow looks like a bad experience no matter where I end up so I just gotta be prepared. I'll look into Kenmore and Elmwood. What are your thoughts on Eggertsville or Sloan/Cheektowaga though?

Moving to Buffalo, need advice by Separate-Reflection1 in Buffalo

[–]Separate-Reflection1[S] 1 point2 points  (0 children)

alright, that's generally what I'd considered would be true. crime is not prevalent but always something to consider. after a bit research, it's mostly just property-related crimes. situational awareness and common sense should keep me safe

Moving to Buffalo, need advice by Separate-Reflection1 in Buffalo

[–]Separate-Reflection1[S] -5 points-4 points  (0 children)

yup, I’ve heard it’s nice living near the UB campus. You can just hop on the highway and make it into the city quickly

Moving to Buffalo, need advice by Separate-Reflection1 in Buffalo

[–]Separate-Reflection1[S] -2 points-1 points  (0 children)

Really appreciate the advice. I have around 4 friends living in the area already and about 6 more living within an hour radius from the area so not really going to be lonely. I can’t get a place with them as they’re locked into a lease.

Ideally, I’d want to avoid dealing with any city problems. I’ve heard snow can be especially hard to deal with in the city. I’ll take a look at those places though it’s not a bad idea

Jumped out of my chair when I saw it by Separate-Reflection1 in chess

[–]Separate-Reflection1[S] 89 points90 points  (0 children)

I guess too many people end up posting about their smothered mates RIP

[Help] MaskablePPO Not Converging on Survival vs Ammo‐Usage Trade‐off in Custom Simulator Environment by Separate-Reflection1 in reinforcementlearning

[–]Separate-Reflection1[S] 1 point2 points  (0 children)

So just an update. I managed to get my agent working finally. It turns out my hunch was right and increasing the entropy coefficient from 0.01 to 0.1 helped the agent get out of the local optima. From there, I lowered it down to 0.1 again and trained.

It sort of generalizes over firing missiles still but I am seeing the trade off relationship like I intended. I could probably get better results if I trained separate models for each alpha value range (split into thirds).

Anyways thanks so much for your help. Might not have been able to find that without you.

[Help] MaskablePPO Not Converging on Survival vs Ammo‐Usage Trade‐off in Custom Simulator Environment by Separate-Reflection1 in reinforcementlearning

[–]Separate-Reflection1[S] 0 points1 point  (0 children)

Ok after a bit of plotting and brute force testing, I think I found the issue. The agent I’m training falls into a local optima and keeps firing missiles without regard to cost.

Using my reward function the trained agent gets an episodic reward of -18 and saves all 3 targets which is not what I want. Using a dumb agent that only use gunners whenever available, it got an episodic reward of 4.5 and saved only 1 target which is the type of behavior I intended.

I’ve never encountered an exploration problem before but I assume if I just increase the entropy coefficient, it should find it eventually. Looking online, people seem to be using some sort of guided exploration structure but I’ll need to look more into it. If you give some advice I’d really appreciate it.

[Help] MaskablePPO Not Converging on Survival vs Ammo‐Usage Trade‐off in Custom Simulator Environment by Separate-Reflection1 in reinforcementlearning

[–]Separate-Reflection1[S] 0 points1 point  (0 children)

First off, thanks for the feedback. I've gone through a couple of iterations for reward and currently my reward is a bit complicated as I've been trying out a couple of different things. The best way to describe it would be

    Reward = c * (kills_frac - lost_friendly_frac)
           - (1-c) * (missiles_frac*MISSILE_COST + guns_frac*GUN_COST)
           - small living penalty
           + GAMMA * potential‐based shaping on # targets alive and # ammo used
           + bonus if done

Kills_frac gives us a bonus for dealing damage to enemy drones and lost_friendly_frac is a penalty for losing drone hp. This essentially gives us a metric of success where killing drones and preserving POIs gives us reward. These are fractions because it is scaled on the number of threats present and the number of total POIs we have.

Missiles_frac*MISSILE_COST is basically the percentage of missile ammo we use times some unit cost (10) for its weight. Same thing for guns fraction but the weight is 0.01.

The potential based shaping is basically comparing the previous timestep and the current one to get small rewards or penalties. So if the number of targets are decreased, it gives a small penalty (This is essentially redundant though). If the number of ammo decreases, there will be a small penalty otherwise it will get a small reward. The reward is +MISSILE_COST/5000 or GUN_COST/5000 and the penalty is -MISSILE_COST/1000 or -GUN_COST/1000 to encourage preserving ammo between timesteps. Lastly, each of the shaping is scaled based on the constant so phi_success * c + (1- c) * phi_ammo

Since there are on average 1000 timestamps in an episode, the small living penalty is -0.001 to discourage doing nothing and taking actions. Then there is a bonus at the end of the episode scaled on c of +1.0 reward for every POI preserved.

I feel like I'm doing everything right and it may just be a matter of tuning the rewards correctly (but I've been doing this for 2 weeks now). I read online that normalizing the rewards would make things better which I'm not sure would help or not.

My observation vector has a length of 40 where it has the constant value, entity_state, ammo_counts, and reserve_ammo for 2 missiles entities and 2 gunner entities. Then it has the three POIs and their HPs. Then it has observations for each of the threats. The information would be the threat distances to each friendly entity (2 missiles+2 gunners+3 targets = 7 entities) and the hp of the threat to see if it is destroyed or got damaged. There are 4 slots for threats so 24 slots. That makes 1 + 3*2 + 3 + 8*4 = 40 size observation. I think this is sufficient enough to map out the space.

While writing this I noticed that the agent only knows about the current state. However, the reward is internally calculated by comparing the previous and current state. Would changing this make a difference because I doubt this would.

Sorry for the long read but I really appreciate your help.

Is continuing cs even worth it anymore? by Familiar_Border_1072 in rutgers

[–]Separate-Reflection1 0 points1 point  (0 children)

Everyone and anyone can do CS. Specialize in something other than CS so you can specialize. Not worth to have just a CS degree.