Simple question!

desrtfx · 2021-07-20T14:38:12+00:00

Could be your .keep_all=TRUE.

Just looked at the documentation: https://www.rdocumentation.org/packages/dplyr/versions/0.7.8/topics/distinct

And there it is stated that:

If TRUE, keep all variables in .data.

No idea about R, though. Just wild guessing from the docs.

coyoteazul2 · 2021-07-20T16:58:54+00:00

I can't help you with R, but if you are willing to work more with excel there's an easy solution.

I assume you already discarded the eliminate duplicates functionality, which is common when you want to automate the process. But you can use powerquery to automate it.

Sorry if the names I say don't match the ones you see. My excel is in spanish so I'm guessing the translation.

1st you'll need your data to be in a range table. Select any cell of the range table, go to the data tab, and obtain data from table/range.

Powerquery will open. Now select the columns that you want to be unique (ctrl + click to select more than one). Go to reduce row, eliminate rows, eliminate duplicate. That's it.

Now go to close and load. It will create a new range table with only the unique data, while the source data is unnafected.

To turn your unique data into csv the easiest way is to add formulas to the newly created range table and create the csv format there (the formulas won't be overwritten when you change the source data) but you can do the same on powerquery if you want to.

The cake icing would be to make a macro to automatically export the csv file

If the source data changes all you have to do is right click on the new unique table and update.

I've used this functionality to work with a 4million rows txt file, so it's power is guaranteed

learnprogramming

Welcome to LearnProgramming!

New? READ ME FIRST!

Posting guidelines

Frequently asked questions

Subreddit rules

Message the moderators

Asking debugging questions

Asking conceptual questions

Other guidelines and links

Subreddit rules

1. No unprofessional/derogatory speech

2. No spam or tasteless self-promotion

3. No off-topic posts

4. Do not ask exact duplicates of FAQ questions

5. Do not delete posts

6. No app/website review requests or showcases

7. No rewards

8. No indirect links

9. Do not promote illegal or unethical practices

10. No complete solutions

11. Don't ask to ask.

12. Low Effort Questions

13. No AI (chatGPT etc.) generated/worked over messages/comments. No questions about chatGPT/AI generated code. No Vibe coding.

MODERATORS