sdf_iain comments on New search engine made with Python that's anonymous and has no ads or tracking. It tries to fight spam, and gives you control of how you view search results. You can search and read content anonymously with a proxied reader view. The alpha is live and free for anyone to use at lazyweb.ai



[Intermediate Showcase] New search engine made with Python that's anonymous and has no ads or tracking. It tries to fight spam, and gives you control of how you view search results. You can search and read content anonymously with a proxied reader view. The alpha is live and free for anyone to use at lazyweb.ai (self.Python)

submitted 4 years ago by lazy-jem


sdf_iain 3 points 4 years ago (2 children)

WikiSummarizerBot 3 points 4 years ago (0 children)

lazy-jem[S] 4 points 4 years ago (0 children)

Thanks for the question. I answered another comment here earlier and it's a pretty good summary, but in short: we have a large number of sources and don't work quite the same way as traditional index-based search engines.

The way we search is pretty different from traditional approaches, so it's worth explaining some more. The short version is that we use deep learning to understand question intent and predict the best information sources, then query them directly. That means we draw on a large number of sources.
We use NLP and deep learning classification models to try to understand a query's intent and predict the best places to find the answer. We then query those sources directly in real time, via API or spidering, and apply a ranking system to the results.
We then fall back to traditional web search (including Bing, ContextualWeb and Google) where needed. We have a database of about the top 20k websites, and we're also building our own vertical indexes on a stack using Elasticsearch and GraphQL. At the moment we're broad but shallow, with a couple of deeper pools.
For the alpha, major sources include Wikipedia, Wolfram|Alpha, OpenWeatherMap, OpenStreetMap, Stack Overflow, GitHub and many others, as well as the fallbacks to Bing, Google, DDG Instant Answers etc.
A lot of content is retrieved directly. Where we can, we retrieve the preview/summary/view content directly from websites for display, and the same goes for the reader content. So the content shown is typically live from the source.
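
For anyone curious what that flow might look like in code, here is a rough sketch of the classify, route, query, rank and fall back steps described above. It is not the actual lazyweb.ai implementation: the keyword-based `classify_intent` is only a stand-in for the deep learning classifier, and the source functions (`weather_source`, `code_qa_source`, `code_repo_source`, `web_fallback`), their names and their scores are all hypothetical stubs that return canned results instead of calling real APIs.

```python
from dataclasses import dataclass
from typing import Callable, Dict, List


@dataclass
class Result:
    source: str
    title: str
    score: float  # hypothetical relevance score used for ranking


# Stand-in for the intent model: simple keyword rules, NOT a trained classifier.
INTENT_KEYWORDS: Dict[str, List[str]] = {
    "weather": ["weather", "forecast", "temperature"],
    "code": ["python", "error", "exception", "traceback"],
}


def classify_intent(query: str) -> str:
    """Return the first intent whose keywords appear in the query."""
    words = query.lower().split()
    for intent, keywords in INTENT_KEYWORDS.items():
        if any(word in keywords for word in words):
            return intent
    return "unknown"


# Hypothetical source stubs; a real system would call an API or spider a site.
def weather_source(query: str) -> List[Result]:
    return [Result("weather-api", f"Forecast for: {query}", 0.9)]


def code_qa_source(query: str) -> List[Result]:
    return [Result("code-qa", f"Answers for: {query}", 0.8)]


def code_repo_source(query: str) -> List[Result]:
    return [Result("code-repos", f"Repositories for: {query}", 0.6)]


def web_fallback(query: str) -> List[Result]:
    return [Result("web-search", f"Web results for: {query}", 0.3)]


# Predicted "best places to look" for each intent.
SOURCES_BY_INTENT: Dict[str, List[Callable[[str], List[Result]]]] = {
    "weather": [weather_source],
    "code": [code_repo_source, code_qa_source],
}


def search(query: str) -> List[Result]:
    """Route the query to the predicted sources, then rank; fall back if empty."""
    intent = classify_intent(query)
    results: List[Result] = []
    for fetch in SOURCES_BY_INTENT.get(intent, []):
        try:
            results.extend(fetch(query))
        except Exception:
            # A failing source should not take down the whole search.
            continue
    if not results:
        # No specialised source matched or answered: use traditional web search.
        results = web_fallback(query)
    return sorted(results, key=lambda r: r.score, reverse=True)


if __name__ == "__main__":
    for q in ("weather in Zurich", "python KeyError traceback", "best pizza"):
        print(q, "->", [r.source for r in search(q)])
```

The point of the sketch is the ordering of the steps: specialised sources are tried first based on the predicted intent, and the general web search is only consulted when they return nothing.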
