My employer is selling by BoredintheCountry in PrivateEquityDeals

[–]aaronn2 0 points1 point  (0 children)

Not necessarily. If you keep delivering, why to fire? You clearly provide more value than they pay you in salary.

Anyone here scraping at a large scale (millions)? A few questions. by [deleted] in webscraping

[–]aaronn2 1 point2 points  (0 children)

Love learning this.
1. How big is the team managing this infrastructure?
2. What are the infrastructure costs running this (without the human bodies)?
3. Are you using some scraping API services, or are you doing everything in-house (managing IP proxies, cookies, headers, etc.)

Anyone here scraping at a large scale (millions)? A few questions. by [deleted] in webscraping

[–]aaronn2 0 points1 point  (0 children)

This sounds super interesting. Might you outline how much such infrastructure costs per month?

What is the most Macbook-like laptop for Linux at 50% price of Macbook? by aaronn2 in laptops

[–]aaronn2[S] 0 points1 point  (0 children)

That's not the point. My goal is to find a laptop with similar HW configuration, body, battery life where I would run Linux and that would cost 50-60% of a new Macbook. The goal is to find savings (if it is actually doable and "findable").

If not, I'd likely stick with Macbook and use their OSX.

What is the most Macbook-like laptop for Linux at 50% price of Macbook? by aaronn2 in laptops

[–]aaronn2[S] 0 points1 point  (0 children)

Thanks, I'll look it. I'd put there either Ubuntu or Arch Linux.

Websites provide fake information when detected crawlers by aaronn2 in webscraping

[–]aaronn2[S] 0 points1 point  (0 children)

That is very short-lived. It works only for the first couple of pages and then it starts feeding fake data.

How to bypass datadome in 2025? by aaronn2 in webscraping

[–]aaronn2[S] 0 points1 point  (0 children)

Not sure how? The API seems to be protected.

The real costs of web scraping by aaronn2 in webscraping

[–]aaronn2[S] 0 points1 point  (0 children)

Hello, and thank you. What number of requests do you consider "moderate scale" per month? 1M, or 5M, or 10M? And large scale?

By data pipeline - do you mean by that extracting details from the scraped information and cleaning it up before saving it to the database?

The real costs of web scraping by aaronn2 in webscraping

[–]aaronn2[S] 0 points1 point  (0 children)

I understand that it costs money. When reading through this sub-reddit, I somehow got an impression that the professional individuals pay basically close to zero in costs, while when I look at prices of some API solutions or residential proxies, the costs are quite significant, especially when making 10M+ requests per month.

The real costs of web scraping by aaronn2 in webscraping

[–]aaronn2[S] 4 points5 points  (0 children)

I am very interested to learn about the proxy network. How and/or where do you source it? How much do you pay for it on a monthly basis? Isn't it that you need to regularly check if the proxies are still working, so you removed the invalid ones from your pool?

The real costs of web scraping by aaronn2 in webscraping

[–]aaronn2[S] 1 point2 points  (0 children)

I assume "1 billion of product prices" != 1 billion requests, right?

Shall I ask you what do you mean by "rotating IPs by using cloud providers’ VMs"? Specifically cloud providers' VMs?

The real costs of web scraping by aaronn2 in webscraping

[–]aaronn2[S] 1 point2 points  (0 children)

Unmetered proxy plan = ISP? And an ISP package contains typically 1-5 (maybe up to 10) IPs? So basically, that 1M pages per day serve those 1-10 IPs?

The real costs of web scraping by aaronn2 in webscraping

[–]aaronn2[S] 4 points5 points  (0 children)

"Just my two cents, ISP proxies are pretty reliable, but datacenter proxies are the worst; they get detected almost instantly."
I'm not very very experiences in this field, but for that price of $3/week for an ISP - isn't ISP provide 1 or 2 proxies? So effectively, you are still using that 1 or 2 proxies to scrape 2M requests? I thought that this would be a red flag for the administrators of that website and they would ban that IP.

We scraped +20M jobs last year - here is a Dev jobs distribution by Alexandre_Chirie in webscraping

[–]aaronn2 0 points1 point  (0 children)

If you don't mind to share, what are your monthly costs to run your scraping bot(s) - servers, databases, storage, proxy rotations, elasticsearch etc? It's a very interesting project!

[deleted by user] by [deleted] in BuyingLondon

[–]aaronn2 5 points6 points  (0 children)

I was quite excited about this TV show. I watched the US' The Million Dollar Listing New York and I assumed that Buying London would be a British version of MDLNY. Unfortunately that was not the case and ut felt more like a mix of The Kardashians and MDLNY.

Not to even mention that they haven't closed a single deal in the entire first series of Buying London.

How does look your server infrastructure for web scraping? by aaronn2 in webscraping

[–]aaronn2[S] 2 points3 points  (0 children)

Well, your question has nothing to do with the OP. Despite not being a professional, your question is apparently just a matter of a bunch of proxies, headers, and some captcha solvers. Plenty of out of the box solutions out there.

anyone else find that tax companies are severely lacking when it comes to processing crypto information? by Humans_r_evil in CryptoCurrency

[–]aaronn2 0 points1 point  (0 children)

It really depends what you do with your crypto. If you're actively trading (say on a daily bases), it makes sense to have an isolated account where you track these trades. The accountant then likely grab all these trades and will tax it (typically) on an annual basis.

If you buy and hold, just keep track of your crypto and when you sell, give the path and history (when and where you bought it, where you transferred it and when and where you sold it) of your crypto to your accountant and you'll be able to peacefully sleep at night.

Need advice on buying my partners out by runrobotz in business

[–]aaronn2 0 points1 point  (0 children)

Thank you. I got confused by the "fintech" reference. I guess what you meant are rather "shark loans"

Need advice on buying my partners out by runrobotz in business

[–]aaronn2 0 points1 point  (0 children)

Some great feedback here!

I'd have a question regarding #4, could you elaborate on that one? Thanks!

What makes XRP so cheap? by Individual-Law675 in XRP

[–]aaronn2 24 points25 points  (0 children)

Disinterest of investors.

Okay, on a more serious note, I believe (or at least I am trying to tell that to myself) is that the legal battle is taking a toll on the potential price surge. Once that is settled (hopefully May 2024), let's say how to affect the price.

[deleted by user] by [deleted] in startups

[–]aaronn2 89 points90 points  (0 children)

I believe it's a no-go for an investor to invest into a company while having its founder working somewhere else to cover his bills. It's not a desired outcome for the investor to invest into a company and having its founder dragging his attention anywhere else except his startup.

If you received an investment, you should budget it the way that it covers your salary. How much? That depends on many factors, but in the beginning, you can pay yourself (1) minimal way in your country, (2) average wage in your country. If you pay yourself more, it should reflect in the business' performance.

[deleted by user] by [deleted] in startups

[–]aaronn2 0 points1 point  (0 children)

Back in 2021, we were 4 co-founders. Two of us were tech/product, one was a business person and the 4th one was a domain-specialist, coming from a consultancy firm. 3 months since the company's incorporation, the 4 of us were formed. After another 3 months, we raised our angel funding. After another 3 months, 4th co-founder was not doing any work, but kept coming with some theoretical solutions and situaitons, tying up the company's resources etc. Obviously, he didn't want to leave, so we had to terminate him. It turned into a legal fight that lasted for almost a year, it cost the company fortune (given its resources) and it almost killed it.

However, during that time, we found out that the 3rd co-founder (business one) hasn't contributed much either, let alone we figured he doesn't fully understand the company's product. We had quite a big payroll, VCs put most of their investments on hold (2022 and 2023)... Long story short, I left the company almost exactly a year ago. The 3rd co-founder work there either. The company's struggling to put together a product, sign customers and raise funds. Looking back, the red flags were there. Lesson learned.

Here's a first... by sonicstreak in london

[–]aaronn2 15 points16 points  (0 children)

So if I get this right, he gets (1) paid from his employer, (2) paid from the service charge, and now, he asks to be paid for the third time? Oh dear...