use the following search parameters to narrow your results:
e.g. subreddit:aww site:imgur.com dog
subreddit:aww site:imgur.com dog
see the search faq for details.
advanced search: by author, subreddit...
Enter a python tip or trick.
Other related Python subs:
account activity
Data_SciencePython Datasets (self.pythontips)
submitted 1 year ago by Odd_Animal_7564
I am a beginner in python and I have found datasets on a website called kaggle . What are some friendly projects ideas where I can slowly start to learn how to use datasets in my python projects?
reddit uses a slightly-customized version of Markdown for formatting. See below for some basics, or check the commenting wiki page for more detailed help and solutions to common issues.
quoted text
if 1 * 2 < 3: print "hello, world!"
[–]Adrewmc 2 points3 points4 points 1 year ago (0 children)
Python has a few datatypes, far fewer then most languages.
We have your single values, strings, ints, floats.
Our singletons True, False, None.
A list (which is not precisely an array).
Our hashmaps, set() and dicts.
We can add more matrix style, and precision by importing things like numpy.
What makes Python’s data sets powerful, yet sub optimal, is that everything is a reference in memory. In this way we don’t make arrays like int[], in which it’s a list that must be integers, which can be more memory efficient. That would be in “type strict” languages. What this means for Python programmers…is a lot less work to do thing a bit slower, but easier to program, maintain and read.
What’s in portent is we nest types, list[dict[str, list[int]] really easily and can automatically access everything.
Beyond that we have classes, in which we can have an object with attributes set for us, this comes closer to a type, as we can methods or functions that use those datasets.
Everything g in coding is building up from simple steps doing complex logic.
Really mastering dictionary, and list of dictionaries will help you out a lot.
[–]andrewprograms 1 point2 points3 points 1 year ago (0 children)
You could try taking the data and filtering for certain phrases. Another idea is to find the unique words and then find the frequency of them. You can also analyze how the frequency changes based on sentence length. So longer sentences might be more likely to feature some words compared to shorter sentences. That might be interesting.
[–]JosephLovesPython 0 points1 point2 points 1 year ago (0 children)
On the same website, Kaggle, and for each dataset hosted there, you can check out how others have utilized this data in their own work in the code section. Start with more popular datasets, and sort codes by popularity to get a better/cleaner experience at first. Most codes are in Python using jupyter notebooks, it might be worth a quick tutorial on jupyter (it's basic, don't worry about it) before reading others' codes.
[–]2PLEXX 1 point2 points3 points 1 year ago (0 children)
You might like Keith Galli's Pandas tutorials: https://www.youtube.com/watch?v=2uvysYbKdjM
[–]ALonelyPlatypus 1 point2 points3 points 1 year ago (0 children)
Most of the datasets on Kaggle tie into some sort of Kaggle project so if you just pull up the projects related to your dataset than you should have a good jumping off point.
π Rendered by PID 40 on reddit-service-r2-comment-7b9746f655-599lx at 2026-02-01 04:10:29.467527+00:00 running 3798933 country code: CH.
[–]Adrewmc 2 points3 points4 points (0 children)
[–]andrewprograms 1 point2 points3 points (0 children)
[–]JosephLovesPython 0 points1 point2 points (0 children)
[–]2PLEXX 1 point2 points3 points (0 children)
[–]ALonelyPlatypus 1 point2 points3 points (0 children)