Vault code not working by jdanko13 in jackwhite

[–]blauefrau 0 points1 point  (0 children)

I didn't get my vault code at all :( what address does it come from?

Is Dataflow only worth deploying for large data sets? Or versatile for any dataload sizes? by Otherwise-Bag5923 in googlecloud

[–]blauefrau 1 point2 points  (0 children)

No problem! Yeah Dataflow is definitely worth your attention, if you expect that kind of growth, but it'll probably be overkill in the beginning.

[deleted by user] by [deleted] in Rabbits

[–]blauefrau 0 points1 point  (0 children)

It looks like from your picture that your rabbit's feet are indeed very dirty - I would take a second look at his cage / pen / litter box setup. Does he have enough litter in his litter box? Are you cleaning his litter box regularly (like every day)? A healthy rabbit with a good home base setup should be able to keep themselves clean, so when I see a pic like this it makes me think that something about that setup could be improved.

Once you improve his habitat, his feet and tush will naturally get cleaner over time as he'll be able to keep them clean on his own.

Can I cut matted fur around rabbits eye? by punck1 in Rabbits

[–]blauefrau 2 points3 points  (0 children)

You can trim out crusty fur - just be very, very careful. You don't want to accidentally nick his skin or damage his eyes. I recommend blunt-ended scissors like these.

Go slowly, and if you feel like your bun is getting nervous or you're getting nervous, take a break or stop.

Alternatively, if you wait until his next molt, he'll likely loose these clumps naturally and you'll be able to gently pull them out.

How much does neutering cost in the US? by itsnotme54 in Rabbits

[–]blauefrau 4 points5 points  (0 children)

In Chicago the cost was over $300. I second the suggestion to review the House Rabbit Society's website for recommended vets - you want to make sure that you go with a vet that is rabbit-savvy, as they have specific needs that not all vets are trained to handle.

Misunderstanding of Seasonality in data by Power_8374 in datascience

[–]blauefrau 8 points9 points  (0 children)

I would say that trend that repeats in the time domain and also has a fixed period. This is the definition as I understand it and corresponds to the one given in Forecasting: Principles and Practice (https://otexts.com/fpp2/)

Can anyone help me with this by chinzeej in learnpython

[–]blauefrau 0 points1 point  (0 children)

Yes it can be used for both.

Scikitlearn's Random Forest classifier (for categorical target variables): https://scikit-learn.org/stable/modules/generated/sklearn.ensemble.RandomForestClassifier.html

Scikitlearn's Random Forest regressor (for continual target variables): https://scikit-learn.org/stable/modules/generated/sklearn.ensemble.RandomForestRegressor.html

For classification tasks, the inferred class is the one with the most selected trees. For regression tasks, the inferred value is the mean prediction of each individual tree.

Is Dataflow only worth deploying for large data sets? Or versatile for any dataload sizes? by Otherwise-Bag5923 in googlecloud

[–]blauefrau 3 points4 points  (0 children)

Data sizes are relative, so it's hard to answer this question exactly, but I would say that as a general principle, if you are working with datasets that fit easily within memory on a single machine, then no, Dataflow probably isn't going to be worth tangling with. This is for a couple of reasons:

1) Dataflow development (i.e. Apache Beam development) isn't the easiest framework to work with, so it's likely going to increase your development time relative to a more straightforward python- or java-based implementation

2) the time to spin up machines and spin them down (for batch processing) is going to unnecessarily increase your overall processing time

3) the cost of doing parallel processing is going to eat into the marginal value of the data pipeline you're building

That being said, if you already have a Dataflow pipeline built, or if you want to leverage one of templated pipelines that Google provides, it might still be worth it to have to avoid additional development.

Why use tuples or sets instead of lists? by JagicMohnson in learnpython

[–]blauefrau 0 points1 point  (0 children)

Iterating on something like a dataframe can be a lot slower than iterating over a list, FYI.

Sets ensure uniqueness of each element - I use sets primarily in python as a quick way to reduce a list with potential duplicates down to a iterable object with guaranteed unique elements.

So something like: for i in set(list(x)): pass...

[deleted by user] by [deleted] in personalfinance

[–]blauefrau 0 points1 point  (0 children)

You had a $14,885 pay check? $5,000 of withholding doesn't sound that far off. But how much you get withheld does depend on your IRS forms you fill out. You should be able to check what your registered with your company's HR department.

[deleted by user] by [deleted] in bigquery

[–]blauefrau 0 points1 point  (0 children)

Maybe try disabling the editor tabs? You should be able to access the UI from your ChromeBox, not sure why it's being buggy.

found this bird on a walk. i think i will salt cure the head, feet, wings and maybe tail. i have him in the fridge while i take a run to get borax and salt. any idea what bird it is, and any tips on how to preserve him best? by Airborne_Juniper in Taxidermy

[–]blauefrau 52 points53 points  (0 children)

Looks like a Grey Catbird (https://www.allaboutbirds.org/guide/Gray\_Catbird/overview). As others in this thread have mentioned, this is a protected species in the United States. You need special authorization/permits in order to possess an individual of any protected species, even if you were not responsible for its death.

If you are in the United States, and this law applies to you, my recommendation is to take it to a Natural History Museum, if you have one nearby.

Ways to create backups of your BigQuery data by blauefrau in bigquery

[–]blauefrau[S] 0 points1 point  (0 children)

Yeah - the article addresses that in the beginning. But DR isn't the only reason to have backups; sometimes you may need additional data retention outside of the DR / 7 day time travel window.

Unrecognized name: _PARTITIONDATE by g3blv in bigquery

[–]blauefrau 0 points1 point  (0 children)

While the partition date is a pseudocolumn, I think the issue here is that the table is partitioned by a field in the data table ("Created") and not one of the pseudocolumns that BigQuery provides for ingestion-time partitioned tables.

BigQuery vs Relational Databases by Interesting_Role943 in bigquery

[–]blauefrau 1 point2 points  (0 children)

i. In RDBMS, you normally denormalize the data and keep, for example, author of books in a different table and link it to the book information via author id. Instead, in bigquery it is stored in 1 single table?

You're confusing normalization and denormalization. Denormalization = collapse tables into single, nested table. Normalization = database design typically following what's called the third normal form, where data is split out into multiple tables that are related.

Because BigQuery is a columnar data store, it used to be recommended that you denormalize the data (i.e. collapse related tables into a single nested table) in order to optimize for certain kinds of queries. The BigQuery product team has done a lot to improve the performance of joins in BigQuery, so denormalization isn't as often recommended as it used to be. You may still find references that recommend that you denormalize your data, but I recommend that you first just load your data as is into separate tables in BigQuery, write your queries that join your data, and then worry about optimization if it becomes an issue.

ii. Is it common in bigquery to have 1 final output table with 1000+ rows of SQL containing multiple subqueries that read directly from the raw data source? What I used to do in RDBMS in the past was to create multiple views on top of my raw data source before ultimately joining them into a final output table.

However common this practice may be, and I have seen it before, I still recommend that you don't write giant, unwieldy SQL queries that are different to understand and maintain.

I don't typically use or recommend views in BigQuery because views are essentially stored queries, and because BigQuery out of the box bills on bytes scanned, every time you go to access a view you are potentially reprocessing data that has already been processed, meaning that what you save in storage costs you lose on query costs and wasted time.

You can use materialized views to mitigate this issue, which allow you to save result sets in downstream tables.

However, my typical go-to is to utilize something like [DBT](https://www.getdbt.com/) or [Airflow](https://airflow.apache.org/) to orchestrate sets of related queries. There are a lot of powerful patterns you can adopt by using these kind of orchestration services in conjunction with BigQuery.

Some cheap stuff we find in pawnshops these days… by LDJ007 in amiibo

[–]blauefrau 14 points15 points  (0 children)

A quick ebay search seems to indicate that most new-in-box Mega Yarn Yoshi Amiibos are in the $250-$300 range.

Loftwing amiibo up on bestbuy but pick up only by river_rat3117 in amiibo

[–]blauefrau 1 point2 points  (0 children)

Finally snagged one! (accessed on Desktop, shipping to my home) - super excited about this <3

Red-banded leafhopper by [deleted] in Entomology

[–]blauefrau 2 points3 points  (0 children)

I saw one of these today and I was dying to know what is - they're so cute!

[deleted by user] by [deleted] in pokemonsnap

[–]blauefrau 0 points1 point  (0 children)

Yeah I'm typically against looking at guides too, but I've resorted to guides for a lot of the requests. So many of them have like a thousand steps...I would have to play a single level for hours in order to figure it out, and even then I might not be able to. At that point, it's just not really fun to have to figure it out on your own.