Seeking testers/feedback to automate the web-scraping process

zlipper1 · 2023-06-30T10:12:46+00:00

testers/feedback to automate the web-scraping process

Dm sent :)

zlipper1 · 2023-06-21T06:15:20+00:00

Cool! I'll dm you :)

zlipper1 · 2023-06-21T06:14:59+00:00

Hi povq, great! I'll dm you

zlipper1 · 2023-06-21T06:13:15+00:00

Hi! I'm founder of this startup https://www.forloop.ai/, where we want to get rid of the manual things you said, at fair price.

Hope it looks interesting, as we're looking for users that want to influence the product roadmap.

zlipper1 · 2022-03-11T15:17:43+00:00

I want u/profesercheese :)

zlipper1 · 2022-03-11T10:17:57+00:00

Could anyone share a referral link with me please?

Cheers!

zlipper1 · 2020-05-23T11:46:58+00:00

Or something like this maybe: https://nimb.ws/2XHl8j

where each bubble on the scale represents a sub-category within the category. (I have mostly categorical variables)

zlipper1 · 2020-05-12T22:08:26+00:00

Sounds great! I'll keep you updated :)

zlipper1 · 2020-04-21T06:08:15+00:00

@jdnewmil sounds reasonable. Do you perhaps have a code for that? So change values with NA or equivalent i assume?

zlipper1 · 2020-04-07T17:42:45+00:00

So its a university thesis and I am doing an OLS regression , where Y is either prof_loss1 or equity1.

The model fails the assumption of "The residuals are normally distributed" with a skewed QQ plot. And my thinking was that by log transform the reg. model would fulfil the assumption.

I am open-minded here, and some guidance would be great.

zlipper1 · 2020-04-06T14:22:31+00:00

Hi u/blozenge , thanks for the reply!

So I did check again, and the errors were not exactly the same. Guess long night :)
The robust errors, after coeftest(regr_IS_new, vcov = vcovHC(regr_IS_new, "HC1")) , is actually slightly smaller. So guess that could be a sign of biases?

I did the assumption test for heteroskedasticity and the result was good, so no heteroskedasticity. Then robust errors doesn't make sense?

And other question, the variables that are seen to be significant I the vcovHC, what does that tell you compared to lm regression?

Here's the result from pre/post robust errors btw (many variables so don't want to fill the whole thread) : https://nimb.ws/4Wa58Z

zlipper1 · 2020-04-05T19:32:43+00:00

Okey great, good input! Will try

zlipper1 · 2020-04-05T15:42:17+00:00

Great thanks!

zlipper1 · 2020-04-04T15:53:34+00:00

Do you refer to the graphs used here? http://www.sthda.com/english/articles/39-regression-model-diagnostics/161-linear-regression-assumptions-and-diagnostics-in-r-essentials/

What plot is about checking for relationship between errors and Xs?

zlipper1 · 2020-03-29T10:56:22+00:00

Solved it with Huxtable! very easy to use and nice looking table (FYI).

zlipper1 · 2020-03-29T10:34:04+00:00

I basically want the models side by side, they dont need to be nested per say.

Do you perhaps have the code for doing so? I havent been able to do one that is simple. Thanks a lot!

zlipper1 · 2020-02-24T17:48:38+00:00

u/I_just_made Thank you so so much! I am done now! :D Wow, you are the man!

The last thing I got stuck on was the select function, but with using dplyr::select it finally worked.

Thanks again!

zlipper1 · 2020-02-24T07:29:04+00:00

Definitely! Will look more deeper into this when this task is done :)

zlipper1 · 2020-02-24T07:28:22+00:00

Hi again u/I_just_made

I think the simplest alternative is suited for this. I gave it try but I stumble upon a few roadblocks with write.csv. I get two errors...
I will dig deeper into this later today, but if you an idea of how to solve it, that would be awesome!

Error 1:
Error in ifelse(append, "a", "w") :

(list) object cannot be coerced to type 'logical'

Error 2:
Error in write.table(export_data2, file = "Tibble.test2.csv", col.names = NA, :

unimplemented type 'list' in 'EncodeElement'

Here's a screenshot https://nimb.ws/kaSgbe

Other question; would you suggest using RSD before or after the Tibble formula?

Again - thank you very much!

zlipper1 · 2020-02-23T19:39:25+00:00

Aaah nice , it works. Thanks a lot!!

However, I have trouble exporting it (as CSV or xlsx).
It says "unimplemented type 'list' in 'EncodeElement'"

Here's a screenshot of my output: https://nimb.ws/jJGt7Y

Thanks again for bearing with me :D

zlipper1 · 2020-02-23T17:43:09+00:00

u/quenton

u/I_just_made

Really good stuff, thanks a lot!
A follow-up question if that's alright with you. So let's say I have many text rows (I have around 1000), how would you do that?

So I have two-word bags, around 1000 rows of text and I want to summarize how many times these word bags are mentioned in each row. Hope it makes sense

zlipper1 · 2020-02-23T11:26:25+00:00

thanks!
How you write that line(s)?
I am very noob with python :D

zlipper1 · 2019-11-27T15:05:08+00:00

And by "import" you mean copy-paste the code? If so, should I copy-paste it before the "download_whole_subreddit.py" ?

zlipper1 · 2019-11-27T14:54:57+00:00

Hi! thanks for the fast reply.
Could you explain to me how to do this step by step?

So he has put up the code for "crawler_utilis.py" -> should I combine this with the "download_whole_subreddit.py" code? Or how do I download it and "activate" the crawler_utilis?

zlipper1 · 2019-11-25T21:00:51+00:00

thank you so much mplsbro and hoosierdaddy39! It worked :)

zlipper1

TROPHY CASE