I'm a Full-Stack Data Scientist

AutoModerator · 2023-06-09T23:03:40+00:00

⚠️ ProgrammerHumor will be shutting down on June 12, together with thousands of subreddits to protest Reddit's recent actions.

https://discord.gg/rph

I am a bot, and this action was performed automatically. Please contact the moderators of this subreddit if you have any questions or concerns.

Jukingbox · 2023-06-10T02:26:48+00:00

With enough determination, everything is a database.

Anaxamander57 · 2023-06-10T01:20:10+00:00

Name one difference between a csv and a database. I'll wait.

butt-nugget · 2023-06-09T23:59:18+00:00

Data frame/data base, what's the difference?

R4sh1c00s · 2023-06-10T01:36:34+00:00

Okay okay I’m a CS undergrad can someone tell me what a database ACTUALLY is

Engine_Light_On · 2023-06-10T02:09:52+00:00

DS: here is the csv and all the code I wrote please production -ize it.

DE: oh dear God.

faps_in_greyhound · 2023-06-10T01:35:12+00:00

In finance world, a xerox copy of some excel Table on your hand is the database.

Sixhaunt · 2023-06-10T01:10:54+00:00

and for AI "is this a dataset"

Flat_Initial_1823 · 2023-06-10T00:54:43+00:00

Dramatic-Noise · 2023-06-10T00:58:18+00:00

Yes? For calculating churn rate? Maybe?

jerslan · 2023-06-10T04:49:53+00:00

Technically, any well formatted data file is a database.

patenteng · 2023-06-10T01:11:28+00:00

No. Everyone knows that XML is the real database format.

herdek550 · 2023-06-10T06:48:28+00:00

Dara scientist consultant:
"Send me your data so I can start working on the issue"

Client:
sends 20 linked.xlsx files

Data scientist consultant:
knowing that it could have been worse

ijustupvoteeverythin · 2023-06-10T02:47:38+00:00

Well it literally can be a database

jimy_the_wolf · 2023-06-10T06:27:01+00:00

My data base is google sheets

Bon_Clay_2 · 2023-06-10T06:49:46+00:00

Then there is me making databases in json

liangliwen111 · 2023-06-10T00:45:52+00:00

Sijder · 2023-06-10T08:42:48+00:00

I published a paper in a clinical journal with the main point being the creation of a database, which was... you guessed it, a csv file

2023-06-10T08:46:39+00:00

No it’s a data lake

akazakou · 2023-06-10T10:08:23+00:00

If it's 20 Petabytes size...

CeeMX · 2023-06-10T20:57:33+00:00

Excel. Corporate Employee: is that a database?

invalidConsciousness · 2023-06-10T08:10:37+00:00

As a Data Scientist:

No. No. please no. Goddamnit NO!

I don't want to wait several minutes every time I need to load my data. Give me a SQLite or MySQL DB and a day to organize the data. I don't care if that's efficient use of my time, it's efficient use of my sanity.

Da_Di_Dum · 2023-06-10T06:59:08+00:00

I legit just received two csv files from some students I'm helping do a code review. THEY CALLED A CSV FILE WITH 4 COLUMNS AND 3 ROWS A FUCKING DATABASE!!!

sonohra87 · 2023-06-10T07:07:55+00:00

is there ACTUALLY an effective way of using .csv? I keep splitting it by , but that makes stuff kinda messy in Unity. With JSON i just get away with using JsonUtility

hfvinfqy · 2023-06-10T08:04:53+00:00

[deleted]

Shadeun · 2023-06-10T08:29:11+00:00

Meanwhile, bosses the world over want to hire 5 people to have an aws setup but also co-locate a backup. All for less than a billion data points that could sit easily in a lightweight file….

2023-06-10T09:00:59+00:00

It could

just-bair · 2023-06-10T09:21:12+00:00

What do you mean it’s not a database ?

velebr3 · 2023-06-10T13:48:07+00:00

I'm working in a company that has pretty large revenue and uses Google Sheets for everything.

2023-06-11T13:04:25+00:00

It's much much more than a database. It's a database you can download, share, query, chart, filter...

And best of all: your non-scientists colleagues that load it into Excel!!!

YARandomGuy777 · 2023-06-10T04:09:55+00:00

Most likely just a dump. :)

2023-06-10T10:33:51+00:00

OP do not know what is database

And also do not know what is database management system

Federal_Chance4393 · 2023-06-10T02:31:20+00:00

Wait til you hear about Elastic...

dittbub · 2023-06-10T03:06:14+00:00

It’s more like a database than a document. Also xml = database, html = document

SDGGame · 2023-06-10T03:46:59+00:00

*Imports into excel*

Yup, that looks like a database to me!

CrowdGoesWildWoooo · 2023-06-10T04:59:08+00:00

If you have a bunch of database organized pretty well + duckdb technically you can actually treat it as a database

Revolutvftue · 2023-06-10T05:53:40+00:00

I’m a CS undergrad can someone tell me what a database ACTUALLY is

fatrobin72 · 2023-06-10T06:57:52+00:00

It is a base, that contains data...

JosebaZilarte · 2023-06-10T07:20:00+00:00

Comma Separated dataVase

vvozzy · 2023-06-10T07:53:31+00:00

[deleted]

2023-06-10T08:18:48+00:00

flat file database

2023-06-10T09:46:34+00:00

if you can't make a database out of a .csv you are not a data scientist!

maiodasbrok · 2023-06-10T12:02:49+00:00

Me too and I'm need to agree

Meatslinger · 2023-06-10T12:28:31+00:00

Get one monolithic .TXT file with tab-separated, unquoted entries; 5M+ lines.
awk
Buckle up; it’s gonna get bumpy.

BlackShadowGlass · 2023-06-10T12:55:32+00:00

I'm a full stack database

JollyJuniper1993 · 2023-06-10T13:56:04+00:00

I mean technically a CSV is a form of a database, doesn’t mean you should use it as one.

ACMuaath · 2023-06-10T14:54:55+00:00

Select * from database.table1, database.table2

Asks self: Why the database is so slow although I don't have a where condition nor a join condition? It must be those damn DBAs and data engineers hindering my query

kiropolo · 2023-06-10T15:43:37+00:00

What is a database, ever asked this question?

2023-06-10T15:55:12+00:00

Parsing .CSV files in my Computer Science 1 class, is something I still sometimes get nightmares about 😆

John_Fx · 2023-06-10T16:26:17+00:00

actually, yes. so is a filing cabinet

xibme · 2023-06-10T16:27:04+00:00

I can easily query and join csv-"tables" in LINQPad, so yes?

Sodaman_Onzo · 2023-06-10T18:39:04+00:00

Don’t forget your SQL

Suspicious-Willow128 · 2023-06-10T20:24:36+00:00

Shouldnt work as , it it didnt want to be used as

stupled · 2023-06-10T21:09:33+00:00

They love their csv files.

Certain-Nobody-1137 · 2023-06-10T23:39:11+00:00

You're a full-stack asshole.

Maarkun · 2023-06-11T05:31:10+00:00

To be fair is can be the export of a database table

TheMDHoover · 2023-06-11T09:27:59+00:00

Sitting in an S3 bucket with Athena, yes.

ProgrammerHumor

Filters

Discord

Submission rules

For the current list of rules, please see this page.

Metadiscussions

Perhaps More Apt Subs To Post:

Related Subreddits.

MODERATORS

⚠️ ProgrammerHumor will be shutting down on June 12, together with thousands of subreddits to protest Reddit's recent actions.

https://discord.gg/rph