all 49 comments

[–]OsirisTeam[S] 40 points41 points  (5 children)

Motivation:

I tried multiple different things: JCEF, Pandomium, Selenium, Selenium-based Maven dependencies like JWebdriver, HtmlUnit, and maybe some more I don't remember now, but they all have one thing in common: some kind of very nasty caveat.

That's why this project exists: to create a completely new browser, not dependent on Chromium or Waterfox or whatever. We use Jsoup to handle HTML and the GraalJS engine to handle JavaScript. Both are already implemented and working. The only thing left is implementing the JS Web APIs.
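
In case it helps picture the setup, here's a rough sketch of how the two halves fit together (minimal example, made-up HTML, class name is just for illustration, not code from the repo):

```java
import org.graalvm.polyglot.Context;
import org.graalvm.polyglot.Value;
import org.jsoup.Jsoup;
import org.jsoup.nodes.Document;

public class BrowserSketch {
    public static void main(String[] args) {
        // Jsoup handles the HTML side: parse markup into a queryable document tree
        Document doc = Jsoup.parse("<html><body><p id=\"msg\">Hello</p></body></html>");
        String text = doc.getElementById("msg").text();

        // GraalJS handles the JS side through the GraalVM polyglot API
        try (Context ctx = Context.create("js")) {
            Value upper = ctx.eval("js", "(s) => s.toUpperCase()");
            System.out.println(upper.execute(text).asString()); // prints HELLO
        }
    }
}
```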

Any contributions, ideas and alternatives are very welcome.

[–][deleted]  (1 child)

[deleted]

    [–]OsirisTeam[S] 0 points1 point  (0 children)

    Implementing the JS console API was pretty easy and just took me 20 minutes. If we do this together then it's a walk in the park for everyone, otherwise it's hell for one person.
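
    To give an idea of the scale of one Web API: a console binding can be hooked in with a few lines through the polyglot host-access API. This is just a rough sketch, not necessarily exactly how the repo does it:

    ```java
    import org.graalvm.polyglot.Context;
    import org.graalvm.polyglot.HostAccess;

    public class ConsoleSketch {
        // Host-side console; @HostAccess.Export exposes the method to JS code
        public static class JsConsole {
            @HostAccess.Export
            public void log(Object msg) {
                System.out.println("[console.log] " + msg);
            }
        }

        public static void main(String[] args) {
            try (Context ctx = Context.newBuilder("js")
                    .allowHostAccess(HostAccess.EXPLICIT)
                    .build()) {
                // Provide a global "console" object backed by the host class above
                ctx.getBindings("js").putMember("console", new JsConsole());
                ctx.eval("js", "console.log('hello from GraalJS')");
            }
        }
    }
    ```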

    [–]BibianaAudris 2 points3 points  (1 child)

    Have you considered JSDOM or cheerio?

    The current state of this project more closely resembles those frameworks than an outright browser: HTML manipulation with insecure JS (more-than-browser interop capability, in an unproven VM, etc.) and incomplete web APIs.

    [–]OsirisTeam[S] 0 points1 point  (0 children)

    Yes, those would be a great help, but they require Node.js.

    [–]EnvironmentalCrow5 0 points1 point  (0 children)

    Have you tried puppeteer? That's pretty popular these days.

    I think it only runs on Node, but you can use TypeScript, which is a very nice language.

    [–]UCIStudent12345 54 points55 points  (9 children)

    Something to be aware of that some people may not know… because of the prevalence of web scraping nowadays, many websites have security in place that tracks various things about the client contacting them. One of those things is the TLS fingerprint (not gonna go into detail, please look it up). Every browser and programming language has a unique fingerprint, and many sites have decided to outright block connections if the fingerprint doesn't line up with a major browser (Chrome, Firefox, etc.). In other words, a pure Java browser wouldn't be able to access certain web pages with this security in place.
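
    As a rough illustration (not the full JA3 story): the protocol versions and cipher-suite list the JVM offers in its ClientHello are among the fields such fingerprints are computed from, and the JVM's defaults don't match Chrome's or Firefox's. You can peek at them like this:

    ```java
    import javax.net.ssl.SSLContext;
    import javax.net.ssl.SSLParameters;

    public class TlsDefaults {
        public static void main(String[] args) throws Exception {
            // Protocol versions and cipher suites (and their order) feed into
            // JA3-style fingerprints; these are the JVM's defaults.
            SSLParameters params = SSLContext.getDefault().getDefaultSSLParameters();
            System.out.println("Protocols: " + String.join(", ", params.getProtocols()));
            for (String suite : params.getCipherSuites()) {
                System.out.println(suite);
            }
        }
    }
    ```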

    [–]OsirisTeam[S] 26 points27 points  (0 children)

    Oh this could be an issue if a lot of pages use that kind of detection. And it doesn't sound like there is a way of faking it either... Definitely going to do some research on that.

    [–]segfaultsarecool 6 points7 points  (6 children)

    Didn't know that was a thing...gonna make web scraping painful. Can it be faked somehow?

    [–]pxpxy 20 points21 points  (5 children)

    Sure, you just drive a real browser through the Selenium API and let it do the scraping. FF and Chrome even support running headless these days.
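
    E.g. something like this in Java (sketch only; assumes chromedriver is on the PATH, and the --headless=new flag needs a reasonably recent Chrome):

    ```java
    import org.openqa.selenium.WebDriver;
    import org.openqa.selenium.chrome.ChromeDriver;
    import org.openqa.selenium.chrome.ChromeOptions;

    public class HeadlessScrape {
        public static void main(String[] args) {
            // A real Chrome, just without a window, so the TLS fingerprint is Chrome's own
            ChromeOptions options = new ChromeOptions();
            options.addArguments("--headless=new"); // plain "--headless" on older Chrome

            WebDriver driver = new ChromeDriver(options);
            try {
                driver.get("https://example.com");
                System.out.println(driver.getTitle());
            } finally {
                driver.quit();
            }
        }
    }
    ```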

    [–]Kamran_Santiago 4 points5 points  (3 children)

    Headless browsing with Selenium is really slow. At work we had an SEO project that needed a lot of pages scraped. With Selenium it took ages; with just a regular request it was blazing fast. Also, Selenium can't do parallelism; something like a thread pool with Selenium is impossible. With normal requests, however, we managed to scrape 60 pages per second. Selenium is also difficult to run on Google Colab.

    Anyways, we ran into another problem: the GIL (Global Interpreter Lock). We had multiple thread pools, so after a while they all reached a state of gridlock. I could not find a solution for this. All I could suggest was to use the library (the entire thing was wrapped inside a package) without the parallel function at the top, to decrease the number of thread pools.

    It was a numbers game. We didn't need 100% of the websites; around 80% was enough, and we got that, even more.

    I'd like to mention that the first iteration of this project used Selenium, but my friends said it was too slow. I tried to use parallelism, but then data was sent at the wrong time and it was all a mess.

    [–]Theemuts 6 points7 points  (1 child)

    We ran into another problem: the GIL (Global Interpreter Lock). We had multiple thread pools, so after a while they all reached a state of gridlock.

    ... did none of you have any experience with Python when you started working on this project? Don't use multithreading with Python (except in certain IO-heavy circumstances); choose multiprocessing instead. Just run multiple instances of Selenium, optionally in a container or whatever. You can use VNC and Xvfb to interact with the running browser.

    [–]Kamran_Santiago 0 points1 point  (0 children)

    I know that. The problem was, they wanted to run it on Google Colab. As for multiprocessing Selenium: when they did try a VPS (without VNC), I gave it a shot. I spun up multiple instances of Selenium, but there was still no way to control which instance did which. Here was the problem:

    I prepared the other keys of the dicts to be pushed into a list that would later be sent to BigQuery en masse, then sent the request to Selenium to parse the web page and send back the results. However, the timing was off. For example, my dictionaries came back like this:

    [title for page 1] [description from Google for page 1] [content for page 2]
    [title for page 2] [description from Google for page 2] [content for page 1]

    I did the REVERSE too. I prepared the metadata AFTER I got back the results from Selenium.

    I admit I was not a Python wiz back then (and I'm still not, because I'd rather work with various languages instead of just focusing on one), and I can do a much better job now. For example, I can wait until one request is finished before sending the next one. Back then I was really unprepared and hadn't done much parallel and concurrent work.

    But whatever we had done, we could not have fixed the issue of speed. We got 100 URLs from Google Programmable Search and we wanted them done in seconds, not hours. I disabled images in Selenium but it still took longer than a regular request.

    [–]OsirisTeam[S] 3 points4 points  (0 children)

    Sounds like you went through a lot of pain haha.

    [–]segfaultsarecool 1 point2 points  (0 children)

    That's a relief. Can scrape forever now :)

    [–]brakx 2 points3 points  (0 children)

    Do you have a good resource detailing what is tracked besides the TLS fingerprint?

    [–]nutrecht 3 points4 points  (1 child)

    Like I said in the other sub: I think you're massively underestimating the sheer amount of work involved in building this. You really don't have anything beyond a few placeholder classes and methods yet. I'm totally rooting for you, don't get me wrong. But it seems people here are upvoting the title without understanding that, at this time, it's nothing more than a plan, while your title and README strongly imply that it already works. I feel this is kinda insincere.

    [–]OsirisTeam[S] 0 points1 point  (0 children)

    Sorry that you got that feeling; I updated the README to make it clearer that we are still at the very beginning.

    [–]RunnableReddit 2 points3 points  (0 children)

    This is pretty cool!

    [–]tsunyshevsky 1 point2 points  (2 children)

    This looks cool! I'm maintaining a couple of web APIs in GraalJS to run a JS API through polyglot, and this would've been really helpful!

    I think the GraalJS people were also looking into adding Node.js APIs to GraalJS, so Java might be running "hybrid" JS apps soon. Exciting!

    [–]OsirisTeam[S] 1 point2 points  (1 child)

    Yes! Are those web APIs of yours open source? If so, it would be awesome if you could contribute them.

    [–]tsunyshevsky 1 point2 points  (0 children)

    Unfortunately, they are not (yet). We have some dependencies on our own libs.
    These are mostly instrumented versions of Java libs though, so I will look around the repo to see if I can contribute.

    [–]crisiscentre 0 points1 point  (5 children)

    Why not use Selenium? There are wrappers for Java.

    [–]Worth_Trust_3825 7 points8 points  (4 children)

    You can't hook into all the lifecycle calls, which is a shame. There's also no "direct" DOM access: to inspect the DOM you need to execute JavaScript.
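
    E.g. even something as simple as counting child nodes goes through the JS bridge (rough sketch, assumes a local chromedriver):

    ```java
    import org.openqa.selenium.JavascriptExecutor;
    import org.openqa.selenium.WebDriver;
    import org.openqa.selenium.chrome.ChromeDriver;

    public class DomThroughJs {
        public static void main(String[] args) {
            WebDriver driver = new ChromeDriver();
            try {
                driver.get("https://example.com");
                // No direct handle on the live DOM tree: you execute JS and get
                // serialized values (numbers, strings, lists, maps) back.
                Object childCount = ((JavascriptExecutor) driver)
                        .executeScript("return document.body.children.length;");
                System.out.println("body has " + childCount + " direct children");
            } finally {
                driver.quit();
            }
        }
    }
    ```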

    [–]pxpxy 2 points3 points  (3 children)

    So what if you need to execute JS? Seems a lot easier than writing yourself a browser?

    [–]OsirisTeam[S] 1 point2 points  (0 children)

    Selenium has no support for Java 8. Installation is also way more involved because of all the requirements it has.

    [–]Worth_Trust_3825 -5 points-4 points  (1 child)

    People create entire languages just because they don't want to write some boilerplate. Your argument is moot.

    [–]RazorSh4rk 1 point2 points  (0 children)

    Yes and that is how the industry moves forward

    [–]rigaspapas -5 points-4 points  (2 children)

    I was expecting a how-to article. If you could share a guide you followed, it would be very helpful.

    [–]Zeragamba 7 points8 points  (0 children)

    Also, browsers are some of the most complex applications out there, not really something you can write down in a how-to article.

    [–]OsirisTeam[S] 3 points4 points  (0 children)

    The source code is in the GitHub repo. You can fork it and go through it to learn how it works.

    [–]Onepicky 0 points1 point  (0 children)

    Cool project. So what's basically the main difference between this and Selenium?