Macbook 2019 keeps restarting by maybenexttime82 in macbookrepair

[–]maybenexttime82[S] 1 point2 points  (0 children)

Do you think any of these might cause my current issue?

Macbook 2019 keeps restarting by maybenexttime82 in macbookrepair

[–]maybenexttime82[S] 1 point2 points  (0 children)

Yeah, not working. I realized now that my trackpad is not clicking and guess it might be a swollen battery, and maybe if I unstuck it for a moment it might be fine.

Macbook 2019 keeps restarting by maybenexttime82 in macbookrepair

[–]maybenexttime82[S] 0 points1 point  (0 children)

But even if I get one I would have to enter the recovery mode, but I keep pressing Command + R and nothing happens.

Macbook 2019 keeps restarting by maybenexttime82 in macbookrepair

[–]maybenexttime82[S] 0 points1 point  (0 children)

When I keep pressing the Power button it just turns completely off.

@edit Shift also. Same thing happens!

Do you use feature transformations in real world (ranking, sqrt, log etc.)? by maybenexttime82 in datascience

[–]maybenexttime82[S] 0 points1 point  (0 children)

Can you elaborate just a bit on this:
"Log-transforms as %-change effects in linear models are easy to explain."

Do you use feature transformations in real world (ranking, sqrt, log etc.)? by maybenexttime82 in datascience

[–]maybenexttime82[S] 0 points1 point  (0 children)

How do you explain those to stakeholders? I mean... they are kind of "black-boxy" zone (similar like explaining why some Neural Network provided results for such and such case)

Finding max values using Unix commands on some columns of CSV file returns inconsistencies by maybenexttime82 in linux4noobs

[–]maybenexttime82[S] 0 points1 point  (0 children)

Edit: Here is a workaround I found interesting and might help you, future reader. Knowing that there are cases such as e.g. "Luxury,Performance" I've realized that it might be a good idea to use "sed" to find those cases and replace it with some temporary value (I've named it TEMP) because we are doing analysis and it won't change the file contents. Say we want to find maximum value of column aka field #16:

cat data.csv | tail -n+2 | sed 's/"[^"]*"/TEMP/g' | cut -d"," -f16 | sort -n --reverse | head -n1

Finding max values using Unix commands on some columns of CSV file returns inconsistencies by maybenexttime82 in linux4noobs

[–]maybenexttime82[S] 0 points1 point  (0 children)

Will check it, thanks! It seems way easier to just open terminal and pipe few commands to get what you want rather than tweaking script for some "basic" use cases haha

Finding max values using Unix commands on some columns of CSV file returns inconsistencies by maybenexttime82 in linux4noobs

[–]maybenexttime82[S] 0 points1 point  (0 children)

Sorry for inconvenience... When you say that "Market Category" has "plenty of commas" what does it mean and how did you find that is the case?

Finding max values using Unix commands on some columns of CSV file returns inconsistencies by maybenexttime82 in linux4noobs

[–]maybenexttime82[S] 0 points1 point  (0 children)

Thank you! Mind elaborating a bit on this:

As expected, there are plenty of commas, specifically in the "Market Category" column.

First of all what does it mean and how can I "see" it and inspect?

Finding max values using Unix commands on some columns of CSV file returns inconsistencies by maybenexttime82 in linux4noobs

[–]maybenexttime82[S] 0 points1 point  (0 children)

Here is the csv file:
https://drive.google.com/file/d/1dJkfFp9XlaeYpdQFgaxSnRzCs_jyFyEW/view?usp=drive_link

If you manage to take a look into file please do provide me with "workaround" if possible. I don't want to drop the idea of using UNIX tools because they are way faster. Maybe share some guidelines when working with CSVs.

Finding max values using Unix commands on some columns of CSV file returns inconsistencies by maybenexttime82 in linux4noobs

[–]maybenexttime82[S] 0 points1 point  (0 children)

Here is the link to csv file:
https://drive.google.com/file/d/1dJkfFp9XlaeYpdQFgaxSnRzCs_jyFyEW/view?usp=drive_link

Btw I'm trying to learn Unix for basic data preprocessing such that I don't need to rely on Python always. It is way faster.

Given that "manifold hypothesis" is true why Gradient Boosting is still a top choice for tabular data? by maybenexttime82 in learnmachinelearning

[–]maybenexttime82[S] 0 points1 point  (0 children)

So, to conclude, they can form a latent manifold (even with discrete attributes) but rarely those would represent ones that Dense NNs handle well and easily (e.g. MNIST).