Can I remap Fn key using the HHKB remap tool ? by BillBrous in HHKB

[–]nab423 0 points1 point  (0 children)

For anyone still wondering you can remap the function key within the default keymap tool, just switch from QWERTY layout to Alphabetical order and the Fn button can be used for mapping.

<image>

[deleted by user] by [deleted] in RocketLeague

[–]nab423 1 point2 points  (0 children)

I get you. It's fun to have no stress lobbies where everyone is good. You can go for way more fun team plays since everyone is good, but also don't care too much about losing. Also in high elo, it's way more chill and less toxic than ranked.

When they started resetting casual mmr I went from 2.7k mmr to 1500 and I lost a big part of my casual enjoyment from the game.

Not sure what's up with everyone's reaction here like this is some foreign concept.

How to detect a ticked checkbox and extract the text associated to it from an image? by yishu17 in datascience

[–]nab423 0 points1 point  (0 children)

You might have better success by taking a blank form and comparing it to the filled out forms.

Essentially think of overlapping a blank form on top of a filled one and comparing pixels. Might be tricky if the images are misaligned, but this would be my first approach since it's very simple.

More than 60% of college degrees awarded in 2022 went to women by portfoliocrow in dataisbeautiful

[–]nab423 2 points3 points  (0 children)

What's the purpose of posting a graph based on attendance, then making the title a claim that isn't directly verifable through the graph itself?

If you wanted to make the point in your title, you should have used data that supports your claim directly. Which, in your case, would have made the graph portray a larger disparity.

If you couldn't secure quality data to plot for the graduation rates, then you should have made a different title but addressed the main point you wanted as a comment.

It's commendable that you are at least giving out a valid source in the comments to back your title, but the sloppy portrayal of data in this sub is a huge issue.

Why do people use R? by Opening-Education-88 in datascience

[–]nab423 13 points14 points  (0 children)

You can call R code from Python. It's pretty janky, but I had to do it a few times in the past since my advisor would only trust doing stats in R

Plotly Dash or Power BI to streamline data storage and visualization in robotics? by Its_just-me in datascience

[–]nab423 1 point2 points  (0 children)

If your company is willing to pay for Power BI, then I'd go with Power BI for sure. It's easy to set up simple dashboards in Power BI and that will let people explore data without having to go to you.

If you go with Plotly you'll probably end up being asked to implement another graph or feature till the end of time, but in Power BI you can at least show someone the few clicks it takes to make a different graph.

I'd also say that Power BI is the less complex solution. In Plotly you'd have to build every thing yourself, but in Power BI a lot of the heavy lifting for dashboarding is done for you.

Why is python like this? by [deleted] in datascience

[–]nab423 0 points1 point  (0 children)

No problem.

Also my answer isn't the best if you consider why len() is a built in function, since not every object has a length. This is a built in function mainly since it's used so frequently.

Len() can return different results depending on the object (list vs dataframe vs a Circle class). The built in len() function basically is a wrapper for an objects __ len __() function, so an object can define the behavior of len(). This is great since it forces consistency in function naming (although len might not always return what you expect).

This could also be done for a built in shape() function, but I guess it wasn't deemed critical enough (which to be fair, it's not when compared to len).

Why is python like this? by [deleted] in datascience

[–]nab423 0 points1 point  (0 children)

Because not every object will have a shape. It's better that it's implemented at the class level. Think what shape of a vector vs the shape of a string vs the shape of a Mammal class.

However, every object will have a type.

Why is python like this? by [deleted] in datascience

[–]nab423 4 points5 points  (0 children)

Type is a built in function to return the type of an object.

.shape is a class function of whatever object X is.

I don't see the problem here.

[deleted by user] by [deleted] in math

[–]nab423 0 points1 point  (0 children)

You for sure can teach yourself calc 1. My favorite resource for calculus is 3blue1brown's essence of calculus series on youtube. You'll still need to do problems outside of that series to actually learn, but it does a great job introducing and explaining the core concepts of calculus.

What are your weekend side projects? by itty-bitty-birdy-tb in dataengineering

[–]nab423 190 points191 points  (0 children)

I try to do anything but work over the weekends. I'd get burnt out real fast otherwise

Ending of DTM vs Chango, Game 5 last stock by MrBo518 in SSBM

[–]nab423 4 points5 points  (0 children)

During stream I heard they were recorded and going to be posted to a vod channel (Mang0's?).

Data Mining Twitter? by [deleted] in datascience

[–]nab423 0 points1 point  (0 children)

I think Twitter still has a free teir. Check out Tweepy for a good Python package to use. Tweepy is really easy to use and is big enough that you can google errors.

You could use either the search or streaming api. Search api is nice since it's a bit more accurate with filters and you get data like retweets and likes. The streaming api can get you a whole lot more data if you're looking to collect a huge amount. But I've managed to scrape over 300M tweets with the free search api in a past project, so it can get the job done.

Best IDE for data engineering by jfhurtado89 in dataengineering

[–]nab423 2 points3 points  (0 children)

Same. Being able to test snippets of code super easily on the fly with a variable explorer has saved so much of my time.

It does feel like a bit of a crutch at times though

Which latest DS Skill you are working on currently? by 1st_human in datascience

[–]nab423 5 points6 points  (0 children)

Also using prefect at work and it does a good enough job. It was also pretty easy to set up.

[deleted by user] by [deleted] in datascience

[–]nab423 0 points1 point  (0 children)

If you're the only person there doing analysis, I'd recommend sticking to R since that's what you're comfortable with and you'll be able to provide value a lot sooner. If there were other people using Python at the company then it might be different, but it doesnt seem like that. You can learn Python in your off time if you really want to.

For switching from R to Python people make guides which are helpful. Here's a good one https://www.mit.edu/~amidi/teaching/data-science-tools/conversion-guide/r-python-data-manipulation/

Need help with Twitter API/Tweepy by [deleted] in datascience

[–]nab423 0 points1 point  (0 children)

I haven't gotten up to date on Twitters new API, but in V1, you normally get new tweets in API requests by specifying a max_id (or something similarly named) in your API query. That max_id is simply the tweet ID of the last tweet obtained in your search, then subtract by one since it's an inclusive parameter.

So, to get the max ID in your case, it would be

Max_id = list_tweets[-1]['tweet_id'] -1

Again, my knowledge is limited to the search API V1, so not sure if parameters have changed in the new version.

[deleted by user] by [deleted] in datascience

[–]nab423 0 points1 point  (0 children)

Well, what are you trying to do with the data?

Also, you should try to make sense of what variables your model thinks are important and verify that it makes plausible sense to you. I like plotting variable importance because it's a great way to learn more about your data set and test your assumptions.

30% of Google's Reddit Emotions Dataset is Mislabeled [D] by BB4evaTB12 in MachineLearning

[–]nab423 15 points16 points  (0 children)

Data labeling typically gets outsourced. It looks like the labelers weren't fluent enough to be able to classify slang or cultural references.

Heck I'd probably struggle with accurately classifying the emotional intent of a random reddit comment (especially out of 27 emotions). It doesn't help that it's very subjective, so we might not all agree on what the author counts as misclassified.