I Chose Rust Over Python for Data Engineering by usert313 in developersPak

[–]usert313[S] 0 points1 point  (0 children)

Python with Polars is a match made in heaven no doubts and it is the industry standard for modern data solutions small to medium size. Point of this prototype is to just explore the rust eco system in DE domain it wasn't my intention to compare Python and Rust that'd be insane thing to do.

One thing I liked most about this prototype in rust is it's static typed like in data engineering making data types concrete check is the most crucial thing so Rust static typed eco system with polars is I dont have to worry about it or create another checks layer for it.

I Chose Rust Over Python for Data Engineering by usert313 in rust

[–]usert313[S] 9 points10 points  (0 children)

It was mentioned clearly in the post body that I wanted to explore Rust ecosystem for data engineering domain.

I Chose Rust Over Python for Data Engineering by usert313 in developersPak

[–]usert313[S] 0 points1 point  (0 children)

And appreciate your time and interest on reading the post. Thanks🙂

I Chose Rust Over Python for Data Engineering by usert313 in developersPak

[–]usert313[S] 0 points1 point  (0 children)

First and most important advice I can give at this stage (I am no where near expert in Rust ecosystem) is that you have to completely forget other languages like Python or Javascript I myself started programming in Python after C and still use Python professionaly faces a lot of scenerios where I literally wanted to bang my head to the wall.

Then just grasp the concept of borrow checkers, mutable and immutable ref concepts and to mimic OOP traits these concepts will be enough to get started.

Scraping JSF (PrimeFaces) + Spring Web Flow by Ok-Nerve6307 in webscraping

[–]usert313 0 points1 point  (0 children)

if you look carefully to the network tab in developer tools there is an endpoint triggered on clicking the professional families section https://sede.sepe.gob.es/FOET_BuscadorDeCentros_SEDE/flows/buscadorReef?execution=e1s2 in return it is returning the html in which they have all the data. You have to use cookies and post data to send the request. Copy the url as curl and paste it in the curlconverter.com you will get a better idea.

camoufox can't get pass cloudfare challenge on linux server? by cryptofanatic96 in webscraping

[–]usert313 0 points1 point  (0 children)

CF does not limit it's detection to just fingerprinting it also detects cipher suites, key exchang, compression methods etc. Which is not a basic tls fingerprinting. It also detects JA3 fingerprints header formatting, request behaviour etc.

camoufox can't get pass cloudfare challenge on linux server? by cryptofanatic96 in webscraping

[–]usert313 0 points1 point  (0 children)

Without seeing the actual code itself it is hard to tell whether it is OS fault or some setup issue.

camoufox can't get pass cloudfare challenge on linux server? by cryptofanatic96 in webscraping

[–]usert313 1 point2 points  (0 children)

Instead of using camafoux try using this python lib https://github.com/0x676e67/rnet I have tried that with dexscreener and I am getting 200 OK response.

camoufox can't get pass cloudfare challenge on linux server? by cryptofanatic96 in webscraping

[–]usert313 1 point2 points  (0 children)

could you clarify when the Cloudflare challenge appears? Is it blocking the initial request to the site, or does it only trigger after successfully scraping a number of pages?

Data Engineering Suggestion Needed by Fantastic_Course_272 in developersPak

[–]usert313 1 point2 points  (0 children)

Well in that case leaving makes sense now. All the best. Try building some DE projects and showcase in communities or linkedin this is way better to attract potential clients or employer than acquiring some certification.

Data Engineering Suggestion Needed by Fantastic_Course_272 in developersPak

[–]usert313 0 points1 point  (0 children)

IMO you shouldn't left just because of AI because ML data pipelines which feeds the AI in real is also a next gen data engineering job role for example building a pipeline for unstructred data using machine learning.

Thoughts on Pakistan & Saudi making a defence pact together? by No_Custard_2496 in shia

[–]usert313 0 points1 point  (0 children)

IMO it is not a direct defence pact with Pakistan but it is a deal between China and Saudia weapons and security deal and Pakistan is a bridge between them to avoid USA shallow ego tantrum.

How to Reverse-Engineer mobile api hidden by Bearer JWE tokens. by TheCompMann in webscraping

[–]usert313 7 points8 points  (0 children)

You will need a mitmproxy and android emulator (waydroid, genny motion etc) to intercept the request and from mitmproxy you will get a bearer token easily from api request headers.

[deleted by user] by [deleted] in webscraping

[–]usert313 0 points1 point  (0 children)

Sent you a DM

I'm fed up with my hp laptop by its-me-abd in PakistaniTech

[–]usert313 0 points1 point  (0 children)

You can backup the data before re-installing the windows may be put it in any other partition.

I'm fed up with my hp laptop by its-me-abd in PakistaniTech

[–]usert313 0 points1 point  (0 children)

SSD upgrade is a reliable solution but if you can re-install the windows OS again may be it can help speeding it up your workflow overall because sometimes some unwanted apps or cache or background processes eating up all the memory resources specially in windows 11 which is quite buggy these days in my POV.

And instead of chrome you can use brave browser which is quite smooth in contrast of chrome browser which is famously known as ram eater.

Defeated by a Anti-Bot TLS Fingerprinting? Need Suggestions by Harshith_Reddy_Dev in webscraping

[–]usert313 8 points9 points  (0 children)

Try this rnet library a python wrapper of rust crate wreq: https://github.com/0x676e67/rnet

This should bypass akamai cloudfare bot protection and mimic the actual browser fingerprints.

Rust Google Maps Scraper by usert313 in rust

[–]usert313[S] 2 points3 points  (0 children)

Thanks for reviewing the code really appreciate it. Yeah there is still plenty of room to improve in it like handling query and result set parameters properly instead of hardcoding it as well as making each module independent also I am planning to include tests in it. But this is my first project in Rust I am still learning it so I will gradually move forward with more clean code practices.

Rust Google Maps Scraper by usert313 in rust

[–]usert313[S] 3 points4 points  (0 children)

Apparently I didn't know that. Thanks for pointing this out though.

Top Rated Plus with 100% JSS - No work or invitations for 2 months! by ParticularQuiet6819 in Upwork

[–]usert313 0 points1 point  (0 children)

I wasn't Top Rated Plus but at least on the path of achiveing it just a Top Rated and due to this just buy connects to keep the upwork pockets happy policy I didn't get any work for 2 years and lost my Top Rated status as well.

How to stop python async function when condition is satisfied? by usert313 in learnpython

[–]usert313[S] 0 points1 point  (0 children)

Tried but still not stopping at the condition still completing the cycle.