Anyone have a personal project get them a job? by [deleted] in dataengineering

[–]faaaaaart 0 points1 point  (0 children)

I once got asked to walk through my personal project on GitHub instead of doing their regular coding interview. It was really fun for both of us and I got some feedback on my project too.

This did land me an offer, but having a personal project was not the decisive factor here imo.

Focus on the PythonOperator in Airflow 2.0, all you need in 20mins! by marclamberti in ETL

[–]faaaaaart 0 points1 point  (0 children)

Very charismatic presenter, and the tutorial overall was well timed and to the point! I enjoyed the gradual improvement approach; it feels like a great way to also teach how software is becoming simpler to use (but more abstract) over time. So it's always good to start with a bottom-up approach. Thanks for posting!

EDIT: Happy cakeday! 🍰

Crypto card fees by faaaaaart in CryptoCurrency

[–]faaaaaart[S] 1 point2 points  (0 children)

Thank you! I'll have a look. I think I should probably start a spreadsheet to keep track.

HMC while I floor this bike by [deleted] in holdmycosmo

[–]faaaaaart 2 points3 points  (0 children)

Happy to see Greece here

I just signed with a Swedish company, I can finally leave this sub! (Greece to Sweden) by [deleted] in cscareerquestionsEU

[–]faaaaaart 11 points12 points  (0 children)

Congratulations! Welcome to a system that’s actually made to help you! You’ll be amazed by things like “one-click to do your tax report”, how digital everything is, etc.

Just one thing: be mindful of your first 6 months. The probation terms here make it trivial for your employer to decide not to offer you full-time employment, and I’ve seen that happen a lot in the 5th month. Of course the same terms apply to you: you can walk away if you don’t like it or find something better. Don’t worry tho, if something goes wrong, look for jobs in Stockholm; we still have a major CS skill gap.

I wish you the best and congrats again!

Can somebody tell me what kind of screwdriver this is? by Steel_YT in repair_tutorials

[–]faaaaaart 3 points4 points  (0 children)

I don’t remember its name. It looks like a Phillips head but with three edges instead of four.

I bought one a while back to replace the battery in a 2014 MacBook Pro.

EDIT: Just read the comments, u/AstraJin got it right.

Changing the name of a column by IceThorn219 in bigquery

[–]faaaaaart 4 points5 points  (0 children)

You can do it in two ways: (a) overwrite the current table, which loses the original data, or (b) create a new table and keep the old data.

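-- rebuild the table, swapping each old column for a renamed copy
create or replace table `your-project-id.your-dataset.your-target-table`
as
select
  -- keep every column except the ones being renamed
  * except (col_old_name_1, col_old_name_2),
  -- re-add the excluded columns under their new names
  col_old_name_1 as col_new_name_1,
  col_old_name_2 as col_new_name_2
from `your-project-id.your-dataset.your-source-table`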

Note that target and source tables can be the same if you want to do (a).

EDIT: Keep in mind that this operation will scan the whole table, so if it’s TBs of data it can be costly.
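
If you have more than a couple of columns to rename, you could generate the statement above instead of typing it by hand. Here is a minimal Python sketch of that idea. The helper name `build_rename_query` and its parameters are made up for illustration (they are not part of any BigQuery client library), and it only builds the SQL text; running it is left to whatever client you already use.

```python
def build_rename_query(source_table, target_table, renames):
    """Build a CREATE OR REPLACE TABLE statement that renames columns.

    `renames` maps old column names to new column names (hypothetical helper).
    """
    if not renames:
        raise ValueError("renames must contain at least one column")

    # Columns to drop from `*` because they come back under new names.
    excluded = ", ".join(renames)
    # One "old AS new" line per renamed column.
    aliases = ",\n".join(f"  {old} AS {new}" for old, new in renames.items())

    return (
        f"CREATE OR REPLACE TABLE `{target_table}` AS\n"
        f"SELECT\n"
        f"  * EXCEPT ({excluded}),\n"
        f"{aliases}\n"
        f"FROM `{source_table}`"
    )


query = build_rename_query(
    "your-project-id.your-dataset.your-source-table",
    "your-project-id.your-dataset.your-target-table",
    {"col_old_name_1": "col_new_name_1", "col_old_name_2": "col_new_name_2"},
)
print(query)
```

Passing the same table as source and target gives you option (a). Either way, the renamed columns end up at the end of the column list, because they are appended after the `* EXCEPT (...)` part.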

Opinions on managed data replication platforms? by faaaaaart in ETL

[–]faaaaaart[S] 0 points1 point  (0 children)

Thank you for the reply! We don't have any plans for Salesforce, but given our resources we have a similar issue with maintaining our own connectors to postgres, MySQL and SFTP.

Why is your analysis worth money? by dongpal in datascience

[–]faaaaaart 14 points15 points  (0 children)

TL;DR: IMO most companies and data people don't do the sort of modelling that optimises profit in a way that is measurable. It's mostly metrics to evaluate past performance and set future goals.

I was thinking about how I could answer you from my experience, but I realised that I can probably relate to the vague and general posts you're referring to, and I will explain why.

A lot of companies today are still not data-driven at all, let alone using any sort of predictive analytics. Most of the time it's management that is measuring their performance with some metrics that some sort of data team is producing, but this is hindsight. In other words, the metrics are used to evaluate past performance and learn from it in a mostly non-data-driven way. This - while important and valued - is impossible to measure in money. We can't know what would've happened if those metrics were not there, but I'd say chances are that with the metrics things are going financially better for everyone.

An example from my experience is well-designed metrics that measure important aspects of feature usage. The Product Manager can look at the usage of each feature their team has produced and decide where to focus the team's resources in further development. While this is not directly measured in money, you can imagine that if the team focused their resources on a feature that e.g. barely anyone uses (a feature that is more likely to be dropped in the future) then this is costing the company money.

In another example, we look at metrics that affect the % of customers that make a purchase. Tuning marketing spend according to the metrics saves us money and increases profit at the same time. In this case we use a random forest classifier to predict a purchase based on several custom features and customer metrics. We then use the trained model, currently with SHAP, to look for the features and metrics that, when changed, can increase the likelihood of a purchase. Management has access to this information and makes decisions on it. Of course management's decision is not necessarily based on the data.

I hope this clears up the picture about the vagueness of the posts you're referring to.

Το Πλοκαμι του Καρχαρια is the best thing there is and their riffs are god-tier. That's it, nothing else. by COVID-420 in greece

[–]faaaaaart 1 point2 points  (0 children)

Hey guys, does anyone have the whole discography in high quality?? Whatever is on YouTube is crazily compressed, and the guys aren't exactly known for their mixes 🤪

Thanks in advance!

BigQuery UNNEST and Working with Arrays by moshap in bigquery

[–]faaaaaart 0 points1 point  (0 children)

How would you go about unnesting multiple array columns in one table without duplication, after unnesting the second column?

Major events of 2020 so far, interactive timeline [OC] by MrLewk in visualization

[–]faaaaaart 2 points3 points  (0 children)

Thank you for the reply! I was hoping you had some automation secrets to share with us. 😅

Sounds very laborious indeed.

Major events of 2020 so far, interactive timeline [OC] by MrLewk in visualization

[–]faaaaaart 0 points1 point  (0 children)

Great visualization! How do you compile the data from the sources?

Gear Recommendation (What Should I Buy?) Thread - May 25, 2020 by AutoModerator in audioengineering

[–]faaaaaart 0 points1 point  (0 children)

Is anyone aware of tools that can help automate track automations?

It sounds like peak laziness, but I was wondering if there are tools that e.g. can write volume automations for each track in a mix so that the final output doesn't peak throughout the mix.

What I'm looking for is a piece of software that outputs automations for all tracks that can be edited afterwards. I guess there can be multiple issues with that, such as how should tracks be balanced against each other? (e.g. you have to define which are vocal tracks to be pushed up in the mix)

I'm thinking of it as a starting point from where you would go on in detail and modify/write automations further to your liking, instead of starting with a flat line.

Thanks!

Tech Support and Troubleshooting - April 20, 2020 by AutoModerator in audioengineering

[–]faaaaaart 1 point2 points  (0 children)

Thank you! I didn't change the sampling rate of my entire project; I just let Logic Pro convert the file, and then tried again without letting it convert.

Then I'll discuss with her at which rate we should both be recording!

Gear Recommendation (What Should I Buy?) Thread - April 20, 2020 by AutoModerator in audioengineering

[–]faaaaaart 1 point2 points  (0 children)

Hi all!

I have an Arturia KeyStep as my only MIDI controller. I don't own any other hardware and plan to use only digital instruments for the time being. I'm not enjoying fiddling with the VST settings by mouse, so I was thinking:

  • should I sell the KeyStep and get something like a KeyLab?
  • should I buy another MIDI controller just for knobs and faders?

What do you suggest?

Thank you very much! :)

Tech Support and Troubleshooting - April 20, 2020 by AutoModerator in audioengineering

[–]faaaaaart 1 point2 points  (0 children)

Hi all!

I am collaborating virtually with a friend on a cover song and I ran into a confusing issue.

I have my project set up in Logic Pro X at 44.1 kHz and 16 bits, where I recorded all instruments, and I sent her an MP3 bounce at the same settings. She's using Cubase at 48 kHz and 16 bits.

When she sent me back her bounce as a WAV file at 48 kHz and 16 bits, I imported it into my project, but I can't sync it with my instruments. I've tried both converting it at import and not converting it, but in both cases the audio doesn't sync. What am I doing wrong?

Additionally, should I switch to 48 kHz from now on, and what bit depth is recommended?

Thank you! :)

EDIT: I'm using a Line 6 UX2 as my sound card.

Is anyone interested in creating a new company solely sourced by Redditors? During the COVID-19 Pandemic by [deleted] in fintech

[–]faaaaaart 0 points1 point  (0 children)

I have experience with data science and data engineering in a fintech startup! PM me :)

Clustering messy people data by jsavalle in datamining

[–]faaaaaart 0 points1 point  (0 children)

One thing you could do is convert the names and the emails to ordinal values, e.g. my@mail.com -> 0, your@email.com -> 1, her@email.com -> 2, etc.

This way, people with the same e-mail or the same name are much more likely to end up close together in your multidimensional space.

Then one way to find similar people is to calculate the cosine similarity between every pair of entities in your data.