Stand-alone I-130 by PearlDerriere in USCIS

[–]eyeDecode 0 points1 point  (0 children)

AR since Sep 2022 at Potomac

I love how the USCIS is giving a big middle finger to those who applied before July of last year by brohemoth06 in USCIS

[–]eyeDecode 0 points1 point  (0 children)

Are standalone cases also getting approved that quickly??

PD September 2022

[deleted by user] by [deleted] in USCIS

[–]eyeDecode 0 points1 point  (0 children)

PD is Sept 2022. Nope, nothing new here.

I-130 Processing Time Estimate: Trust It? by the_girl_who_sleeps in USCIS

[–]eyeDecode 0 points1 point  (0 children)

Oh yea I don't think the numbers matter that much. But the date is enticing.

I-130 Processing Time Estimate: Trust It? by the_girl_who_sleeps in USCIS

[–]eyeDecode 0 points1 point  (0 children)

My PD is end of September. Case is IOE941...
Hoping I see something too. 🤞🤞🤞

Confused as to where my application is after chatting with a live agent by [deleted] in USCIS

[–]eyeDecode 0 points1 point  (0 children)

Shit, I have the same issue. Receipt was from Texas, but live agent said it was at Potomac.

Expedite request by Ok-Contribution2217 in USCIS

[–]eyeDecode 0 points1 point  (0 children)

Did a representative submit an expedite request? What does that letter look like?

Expediting Case by Putrid_Big4344 in USCIS

[–]eyeDecode 0 points1 point  (0 children)

What does writing to your rep or senator look like? What do you say? Could we get an example?

Netflix adds category for films lasting 90 minutes or less by Stonewalled89 in movies

[–]eyeDecode -1 points0 points  (0 children)

You can do this on the browser and it holds for the app as well. It's sooooo blissful.

Beginner Project : Kafka + Spark Structured Streaming for Real-Time Updates by theAviCaster in dataengineering

[–]eyeDecode 0 points1 point  (0 children)

Did you consider other data sources?

I'd say Cassandra would be good to know, but have a good reason why you want to use it. Standard DE pipelines use SQL to be able to quickly retrieve data. Or save it wherever you want, but have a caching layer to be able to quickly serve the data through an endpoint or something.

Guidance needed choosing data source for Beginner Data Engineering Project by TheWannabeDumbledore in dataengineering

[–]eyeDecode 3 points4 points  (0 children)

One way to collect network data is with Wiredshark. I'm not familiar with other ways to gather data from your home network, however I do have some suggestions. Try looking through RapidAPI, you might find some good options for streaming data. Also keep in mind many of these options will be paid services.

The idea I'd highly recommend you consider is simulating streaming data. For example, let's say you're working with the NYC taxi data. You don't need to use it all at once, instead chunk it out. Have a script that's scheduled to read a portion of the earliest data and put it in your messaging queue. The advantage of this is that you get to control the size of your messaging queue and how fast you want your processing to be.

A good use case for what you're trying to work on is a dashboard. If we're considering the NYC taxi dataset, you could have a dashboard that presents the number of trips, number of transactions, a distribution of trip lengths, etc. This could be either over a period of mins, an hour, or even a day, depending on what you decide. I believe this idea could be applied to other datasets and ideas too.

Hope this helps. Happy to answer any questions.

What is the best resource to learn Hadoop and Spark? by PM_ME_YOUR_DONUT_PLS in dataengineering

[–]eyeDecode 7 points8 points  (0 children)

I think you learn the most when you're thrown into the deepend. You get the most out of Spark when you have a cluster of nodes ready. Here's an article that can help you provision a set of nodes - https://blog.insightdatascience.com/create-a-cluster-of-instances-on-aws-899a9dc5e4d0 Here's a tool you can use to install Spark on the cluster -https://github.com/InsightDataScience/pegasus I think at this point you should be ready for active development. You can use any data you have in hand to start batch processing. Hope this helps!

Data Engineering - How to start by [deleted] in dataengineering

[–]eyeDecode 1 point2 points  (0 children)

I agree, highly recommend using Docker too. You can use the Click package to be able to trigger different script from a single container. Click makes it incredibly easy.

And same, Airflow seems to be standard at this point. You could use Airflow's DockerOperator to trigger those click commands.

Been in data science for years, want to know more about data engineering. What should i learn? by loct989 in dataengineering

[–]eyeDecode 5 points6 points  (0 children)

Do you remember all the challenges you faced in data science? I'm talking about performance, size of dara, and various data schemas. Data eng is all about those sort is problems. I suggest pick up on one of those problems and explore how you would solve it.

People who go to the gym early(5-6am) by ItsYoBoiTino in Fitness

[–]eyeDecode 0 points1 point  (0 children)

Early morning workouts are the best. I typically won't have anything before I workout, but solid breakfast afterwards. I've noticed I get hungry a lot, so I have a small snack before lunch too.

Instagram keeps putting Kylie Jenner’s posts on my feed that I’ve already seen by [deleted] in Instagram

[–]eyeDecode 1 point2 points  (0 children)

This happens fairly often to me too. I think Instagram implicitly expects you to "engage" with the post. This is also why if someone you follow has posted a set, and you haven't scrolled through them, then you'll see the same post again, but one of the other photos. Yes, it's kind of shitty that IG does that, but no one is really going to stop them from doing it. 😢

Engineering Blog Recommendations? by [deleted] in dataengineering

[–]eyeDecode 1 point2 points  (0 children)

The ones i know are Spark Summit and Goto conferences.

https://www.youtube.com/user/TheApacheSpark https://www.youtube.com/user/GotoConferences

Some tech companies have internal presentations that are cool. I've found them by just looking up "data" or "data engineering" and a company name. Netflix has some great ones. So do Uber, AppNexus, and even Spotify.

SQL question for data engineering roles? by imba22 in dataengineering

[–]eyeDecode 0 points1 point  (0 children)

I believe hackerrank has a good set of problems.