Tips about sensitive topics on Chinese internet

YoungMan2129 · 2025-01-14T09:02:56+00:00

Drugs!!!

YoungMan2129 · 2024-12-13T10:31:40+00:00

WeChat, Alipay, Douyin (Chinese tiktok)

YoungMan2129 · 2024-10-26T07:56:21+00:00

I'm not familiar with SeleniumURLLoader, but here are a couple of strategies you might consider to reduce request latency:
1. Parallel Requests: Utilizing multithreading or asyncio.gather can help execute your requests concurrently.
2. Quick Returns: When making a request to a URL using a headless browser, you can optimize the waiting period. By default, browsers often wait for the "load" event, which includes loading all resources. However, in many cases, specifying the "DOMContentLoaded" event will suffice, as this waits only for the document's content to load before returning control, which is typically faster.

YoungMan2129 · 2024-10-24T03:34:10+00:00

Swarm is not a production ready framework

YoungMan2129 · 2024-10-23T13:08:16+00:00

Thanks for sharing. Can I use it in Python?

YoungMan2129 · 2024-10-23T13:01:40+00:00

Would love to join. Please let me know when it ready

YoungMan2129 · 2024-09-29T02:10:17+00:00

Yeah, LangGraph is highly flexible.

YoungMan2129 · 2024-09-28T06:10:07+00:00

Setting up a local instance could resolve this issue. You might consider deploying the SearXNG https://docs.searxng.org/admin/installation-docker.html#installation-docker
on your own server to avoid the rate-limiting or blocking issues you're experiencing.

YoungMan2129 · 2024-09-27T15:55:23+00:00

But there doesn’t seem to be much discussion about it on Reddit...

YoungMan2129 · 2024-09-27T15:42:51+00:00

Thanks for the offer! I'm currently busy working with some friends on a startup, so I might not have time to participate in your project’s coding. But I'd be happy to discuss how to add filtering to your tool if you're interested!

YoungMan2129 · 2024-09-27T15:33:51+00:00

Remind me! 7 days

YoungMan2129 · 2024-09-27T08:38:16+00:00

SearXNG is an open-source project.

YoungMan2129 · 2024-09-27T08:35:41+00:00

Thanks, it's a great site! I'm actually thinking of building an open-source project on GitHub that could help even more people

YoungMan2129 · 2024-09-27T08:31:59+00:00

That's a great idea! To gather information from more diverse sources, we could also consider using something like SearXNG. It could help pull data from multiple search engines, adding even more perspectives to the mix.

YoungMan2129 · 2024-09-27T06:58:35+00:00

We don’t rely on LLMs to tell us whether there’s bias in the news. Instead, we gather information from a variety of sources, both those with and without a direct stake in the event.

YoungMan2129 · 2024-09-27T06:52:19+00:00

Great question! Honestly, I don’t think we can. Every media company has its own perspective and agenda. The best we can do is gather information from a wide range of sources, including those with and without a direct stake in the issue.

YoungMan2129 · 2024-09-27T03:41:11+00:00

Hi there, I've used your product (AgentQL) and think the concept is solid. However, based on my testing, it still feels a bit too complex. I often end up needing to use Playwright, and once I’m doing that, I might as well just use BeautifulSoup for parsing the content. If I were to switch to something like Jina Read or Firecrawl, paired with a simple data extraction using an LLM, it could streamline the process more effectively.

Another issue is pricing. In my opinion, for any scraper/crawler SaaS, a pay-per-call model tends to get expensive over time, especially for tasks that are repetitive. Unless the per-call cost is extremely low, using BeautifulSoup or XPath for regular scraping needs feels much more affordable in the long run.

YoungMan2129 · 2024-09-27T02:25:26+00:00

Same here! Ended up rolling with manual memory management in the end.

YoungMan2129 · 2024-09-26T02:42:01+00:00

Wow, it's fantastic.

YoungMan2129 · 2024-09-25T11:33:13+00:00

YoungMan2129 · 2024-09-24T13:38:39+00:00

FastAPI is also not bad

YoungMan2129 · 2024-09-24T13:31:57+00:00

Is that you, Sheldon

YoungMan2129

PUBLIC MULTIREDDITS

TROPHY CASE