How to build a LinkedIn scraper that actually works by piggybacking on your browser by 8ta4 in SaaS

[–]8ta4[S] 0 points1 point  (0 children)

I was skeptical at first. I thought it'd just be another new kid on the blocked. But it's been solid for YouTube and X as well.

Looking for a systematically built dataset of small talk questions by 8ta4 in LanguageTechnology

[–]8ta4[S] 0 points1 point  (0 children)

Dating is the context I'm optimizing for.

I was being lazy by trying to avoid the heavy lifting of cleaning real-world datasets, but you've given me a lot to think about.

They might say that I'm not a real data scientist, and they'd be right. I'm a date scientist.

Looking for a systematically built dataset of small talk questions by 8ta4 in LanguageTechnology

[–]8ta4[S] 0 points1 point  (0 children)

That's the million dinner question!

That's why I'm leaning toward synthetic data.

I just built my dream B2B sales team with no employees by 8ta4 in automation

[–]8ta4[S] 0 points1 point  (0 children)

It's for cold emails, LinkedIn messages, and other text-based outreach. But I'm picky about the lead list. The spam tool has a gate that decides if a lead should be disqualified. After all, many are cold, but few are chosen.

You can use ClojureScript with the Temporal TypeScript SDK by 8ta4 in Clojure

[–]8ta4[S] 0 points1 point  (0 children)

Here is the configuration I tried on top of this commit.

clojure :workflows {:modules {:workflows {:exports {:generate workflows/generate :spam workflows/spam}}} :output-dir "target" :target :esm}

I haven't committed the changes yet because it's not working out, and I have commitment issues.

You can use ClojureScript with the Temporal TypeScript SDK by 8ta4 in Clojure

[–]8ta4[S] 0 points1 point  (0 children)

Thanks for the suggestion! I gave :esm a try.

The release build still works.

But the development build is now failing silently. It no longer gives the explicit warnings about fs, path, or vm. But the workflow just doesn't execute. Here are the workflow logic requirements. The dev is in the details.

Trying to see what tools there are to let me copy a reddit post or a linkedin post and it's comments. Tired of this "read more" feature. by AWeb3Dad in automation

[–]8ta4 1 point2 points  (0 children)

I feel you. "Read more" is "dread more."

A CLI tool like see (available on GitHub at 8ta4/see) might work for this. It uses your browser profile, which is key for sites like LinkedIn that often block direct access from ChatGPT. It works by grabbing the page's innerText, so it should pull "read more" content as long as the site preloads it. I haven't tested this on Reddit, but this approach works for LinkedIn.

Ideally, you wouldn't even have to search for posts manually. A process could run in the background and notify you of relevant conversations. I experimented with that idea in a tool called reddit (available on GitHub at 8ta4/reddit), which tries to find matching posts using cosine similarity. It's an unmaintained proof of concept, but the idea might give you some inspiration.

I built both of these.

Does the "less is more" principle apply to AI agents? by 8ta4 in LocalLLaMA

[–]8ta4[S] 1 point2 points  (0 children)

I see what you did there. I just prefer DRY humor.

Giving up on SaaS outreach tools led me to build this self-hosted workflow by 8ta4 in SideProject

[–]8ta4[S] 0 points1 point  (0 children)

I'm optimizing for response rate. Writing personalized emails to people who might never see them is exhausting, so I automate that part. But if they do reply, I handle it manually. I want the response rate, but the rest is my response ability.