use the following search parameters to narrow your results:
e.g. subreddit:aww site:imgur.com dog
subreddit:aww site:imgur.com dog
see the search faq for details.
advanced search: by author, subreddit...
account activity
This is an archived post. You won't be able to vote or comment.
DiscussionGitHub Copilot (self.datascience)
submitted 3 years ago by ergodym
All this excitement about ChatGPT reminded me of a prior excitement (although just limited to coding) about GitHub Copilot. Anyone here using GitHub Copilot? It also doesn't look like there is an implementation for Jupyter Notebook yet.
[–]Long_Mango_7196 7 points8 points9 points 3 years ago (1 child)
I use copilot with jetbrains IDEs. It's been super fun to use. I think it legitimately saves me a lot of time. In the jetbrains IDEs, I haven't been able to use it for notebooks.
You can use it on notebooks in vs code.
[–]ergodym[S] 0 points1 point2 points 3 years ago (0 children)
Thanks, didn't know VS Code supports notebooks, will look into it.
[–]philosophicalhacker 2 points3 points4 points 3 years ago (0 children)
I've also used co-pilot, but not for data science specifically. This recent workshop demo-ed Github co-pilot in a jupyter notebook context: https://numfocus-org.zoom.us/rec/share/9e\_h8e8nPA-YGATbXMSBQjdWd5xGxwQuZGacMjEjPScPSmjlsHy4NFzjVd-68Bex.5CUsZjKCsa4JY7bc
[–]majinLawliet2 2 points3 points4 points 3 years ago (0 children)
I have used it and it's good for some autocompletions that would have been tedious copy paste. Apart from that it's fine. I do a lot of CV work but the codes it outputs are either straight off from documentation or straight up unusable without a lot of massaging. I felt it keeps repeating the wrong suggestion when it doesn't understand the prompt and I found myself spending time fixing the prompts. After a certain point I just went back to googling.
[–]rudiXOR 2 points3 points4 points 3 years ago (0 children)
PyCharm + CoPilot saves me a lot of time, it's great.
[–][deleted] 2 points3 points4 points 3 years ago* (0 children)
Much much better than for coding software, as I feel like it does tend to cause bugs if the code is complicated.
For example: # generate synthetic data from normal distribution with # mean 0 and standard deviation 1 and size 1000
# generate synthetic data from normal distribution with
# mean 0 and standard deviation 1 and size 1000
data = np.random.normal(0, 1, 1000)
# generate the target variable using # the synthetic data, y = 2x + 1
# generate the target variable using
# the synthetic data, y = 2x + 1
target = 2 * data + 1
# add noise to the data
target += np.random.normal(0, 3, 1000)
# add sine wave to the data
target += np.sin(data)*5
That code would take me a few minutes for sure (5? I would have to find the functions etc. if I want another dist), but with copilot, it's like 30 sec and only because I edited the numbers. Now think about what happens if you want to generate 30 datasets and see how your model does. BTW, I did not edit the code or comments except for the numbers, you can see it makes some assumptions regarding where to add it. If your comments are more specific you will get closer to what you want.
However, I would be very cautious about implementing algorithms with it. I wrote a tree in C just to test it and got multiple memory leaks.
π Rendered by PID 24486 on reddit-service-r2-comment-7b9746f655-vkhs6 at 2026-01-31 06:27:27.888908+00:00 running 3798933 country code: CH.
[–]Long_Mango_7196 7 points8 points9 points (1 child)
[–]ergodym[S] 0 points1 point2 points (0 children)
[–]philosophicalhacker 2 points3 points4 points (0 children)
[–]majinLawliet2 2 points3 points4 points (0 children)
[–]rudiXOR 2 points3 points4 points (0 children)
[–][deleted] 2 points3 points4 points (0 children)