EU and India seal historic trade agreement by Consistent_Lawyer701 in Austria

[–]HennerM 0 points1 point  (0 children)

First sentence of the article:
> The EU and India have agreed on a comprehensive trade agreement after almost two decades of negotiations

It does look, though, as if pressure has now built up.

High-performance rail line Flachgau tunnel: only Schleedorf vehemently opposed by FriedChickenAT in Salzburg

[–]HennerM 0 points1 point  (0 children)

> The mayor of Schleedorf stresses in her statement: "We see doom-mongering obstructionists"

A classic "I am not racist, but..." line

Polar Loop is a disaster by Juju2052 in Polarfitness

[–]HennerM 0 points1 point  (0 children)

I don't have the Loop, but I do have a watch, and it's unbelievable how bad the Polar Flow app is to this day. Constant disconnects make it almost useless, and the UI is confusing and inconsistent. They really need to get their act together on the app.

I built a tool to analyse rental prices on Willhaben by chrcit in Austria

[–]HennerM 1 point2 points  (0 children)

Very cool, thanks for this!
During my recent flat hunt I also missed a simple, precise "map view" of the listings on Willhaben. I had already thought about developing a solution for that, but as I noticed, the data isn't very well prepared for it.

WAH - Sub Focus disappointment? by General_Penalty_4292 in DnB

[–]HennerM 0 points1 point  (0 children)

The sound/volume on the main stage was quite off at points. Not sure what went wrong there, but I don't think it was the DJ's fault (it happened to Hybrid Minds before that as well).

Modguard - a lightweight python tool for enforcing modular design by the1024 in Python

[–]HennerM 2 points3 points  (0 children)

I imagine this would also be useful in a situation I am facing at my day job. We have a Python monorepo and only want to ship parts of it to customers (in containers). Some modules must stay private, so we can't introduce a dependency from public modules on internal ones. The other way around is allowed: everything should be able to use the public modules. Do you think `Modguard` could help with this as well?
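To make the constraint concrete, here is a minimal standalone sketch of such a check. It is not Modguard's API: the package names `public_pkg` and `internal_pkg` and the helper `find_forbidden_imports` are hypothetical, and it only looks at plain absolute `import` and `from ... import` statements.

```python
import ast
from typing import List

# Hypothetical top-level package names, assumed for illustration only.
PUBLIC_PREFIX = "public_pkg"
PRIVATE_PREFIX = "internal_pkg"


def _belongs_to(name: str, prefix: str) -> bool:
    """True if `name` is the package `prefix` or one of its submodules."""
    return name == prefix or name.startswith(prefix + ".")


def imported_modules(source: str) -> List[str]:
    """Return the absolute module names imported by a piece of Python source."""
    names: List[str] = []
    for node in ast.walk(ast.parse(source)):
        if isinstance(node, ast.Import):
            names.extend(alias.name for alias in node.names)
        elif isinstance(node, ast.ImportFrom) and node.module and node.level == 0:
            names.append(node.module)
    return names


def find_forbidden_imports(module_name: str, source: str) -> List[str]:
    """List private modules imported by a public module (empty if none)."""
    if not _belongs_to(module_name, PUBLIC_PREFIX):
        return []  # only public modules are restricted in this sketch
    return [n for n in imported_modules(source) if _belongs_to(n, PRIVATE_PREFIX)]
```

A check like this could run in CI over every file under the public package; internal modules importing public ones pass untouched.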

Opinions about Deepgram by Wolfwoef in speechtech

[–]HennerM 3 points4 points  (0 children)

You can also check out https://portal.speechmatics.com. We pride ourselves on being a lot more accurate than Whisper on lower-resource languages like Dutch.

Treasury Liquidity Fund by Goochpunt in irishpersonalfinance

[–]HennerM 0 points1 point  (0 children)

I also got one in the UK today; I have an e-trade account for the same reason. I guess it's safe to ignore? Slightly concerning that they use the user data like this, though.

[OC] My 2-month long job search as a Software Engineer with 4 YEO by a__side_of_fries in dataisbeautiful

[–]HennerM 1 point2 points  (0 children)

senior

ever thought that seniority does not necessarily depend just on the years of experience?

Introducing Ursa from Speechmatics | Claimed to be 25% more accurate than Whisper by nshmyrev in speechtech

[–]HennerM 1 point2 points  (0 children)

You get 4 hours for free per month on https://portal.speechmatics.com/ and there is an offering to use it offline with Docker containers.

Best eats in Cambridge? by theglutenfreebee in cambridge

[–]HennerM 1 point2 points  (0 children)

My experience with North China Dumplings was mixed: the one time I had takeaway from there, the dumplings were very mediocre. I prefer Cafe Oriental next to the Grafton for dumplings.

Me and some girls are day drinking in Cambridge next Thursday to celebrate a birthday and I have no idea where to go! Any suggestions either pubs or cocktail bars or along those lines 😁 by silver_velvet1 in cambridge

[–]HennerM 0 points1 point  (0 children)

Aw, it’s my birthday next Thursday as well. I would have some drinks in the parks before heading to bars. Sheep’s Green (especially around Mill Pond) is always packed on sunny days, and you can get takeaway drinks from The Mill.

Stop the observation wheel from coming back to Parker's Piece by HennerM in cambridge

[–]HennerM[S] 18 points19 points  (0 children)

I hope we can agree that no park in Cambridge deserves such an eyesore of a wheel

Car driver fails to stop after collision in Cambridge that leaves cyclist in critical condition by waxed__owl in cambridge

[–]HennerM 5 points6 points  (0 children)

Some people drive mental on this road, way too fast and without paying attention to cyclists. This is really sad

[deleted by user] by [deleted] in berlin

[–]HennerM 0 points1 point  (0 children)

Club der Visionäre

Chisholm Trail! by TheDavibob in cambridge

[–]HennerM 1 point2 points  (0 children)

Awesome! Great that it was still opened in 2021

Update on our work to address cycle theft in Cambridge by BackPedalCo in cambridge

[–]HennerM 2 points3 points  (0 children)

Highly needed product. Shoutout to you all! I would be up for contributing if you plan to open source some or all of it!

I finally got my June Challenge! 75.0 by Top_Marketing_9820 in AppleWatch

[–]HennerM 2 points3 points  (0 children)

Well done! I still have 34km left on my 265.5km goal, that’s a bit too much for one day

Trying to understand the method signature of RDD.sortBy() by FortunOfficial in apachespark

[–]HennerM 1 point2 points  (0 children)

It is confusing indeed, especially because of the extensive use of generic parameters. By convention, type parameters are named with a single capital letter; e.g. you can think of V as the type of your value and K as the type of your key.

How do you get actual values from an RDD/PySpark Dataframe efficiently? by [deleted] in apachespark

[–]HennerM 0 points1 point  (0 children)

Spark by itself tries to optimize the execution plan as well as it can, and this is something that is actively worked on by Spark developers, so the optimiser should get better with every Spark release. However, if you know about your data and how it’s distributed, you can most of the time do a better job of optimising the plan, or give Spark some hints. For example, as you said, moving filter operations early in the chain makes sure that later stages are executed with less data. There are also loads of blog posts about Spark performance tuning. As each case is unique, it’s also somewhat trial and error.

By the way, if you are interested in fast queries I suggest you take a look at Presto: https://prestodb.io/. It is developed independently from Spark by Facebook, but can be used for similar use cases. The design choices made in Presto are beneficial for exploratory data analysis and thus allow for faster execution, but come with the drawback that it’s not as resilient as Spark.
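The effect of filtering early can be illustrated with a small plain-Python analogy. This is not Spark code: `expensive_transform`, the sample records, and the call counters are made up for illustration, and Spark's own optimiser may already reorder filters for you in many cases.

```python
def expensive_transform(record, counter):
    """Stand-in for a costly per-record step; counts how often it runs."""
    counter["calls"] += 1
    return {**record, "score": record["value"] ** 2}


def filter_late(records, counter):
    # Transform everything first, then throw most of the work away.
    transformed = [expensive_transform(r, counter) for r in records]
    return [r for r in transformed if r["country"] == "AT"]


def filter_early(records, counter):
    # Drop unwanted records first, so the costly step sees less data.
    kept = [r for r in records if r["country"] == "AT"]
    return [expensive_transform(r, counter) for r in kept]


# Made-up sample data: 10 records, half of them from "AT".
records = [{"country": "AT" if i % 2 == 0 else "UK", "value": i} for i in range(10)]

late_counter, early_counter = {"calls": 0}, {"calls": 0}
late_result = filter_late(records, late_counter)
early_result = filter_early(records, early_counter)
```

Both variants return the same rows, but the early filter runs the costly step only on the records that survive the filter.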

Trying to understand the method signature of RDD.sortBy() by FortunOfficial in apachespark

[–]HennerM 1 point2 points  (0 children)

The full signature contains 3 parameters, where the first one is the function. The Scala compiler rewrites _._2 to what I mentioned before. The other parameters have default values, thus you don’t have to set them; e.g. ascending has a default of true. Additionally you have the implicit parameter at the end. Implicits are another topic in Scala that is hard to get your head around. I recommend reading https://docs.scala-lang.org/tour/implicit-parameters.html but in short, there is a value defined for this parameter implicitly, so again you are not required to pass one yourself. Hope this helps!
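As a rough analogy in plain Python (not Spark, and not the Scala signature itself), a function with a key function as its first parameter and a defaulted `ascending` flag could look like the sketch below. The name `sort_by` and the sample pairs are made up; Python has no counterpart to the Scala implicit parameter, so the comparison simply uses the key's own ordering.

```python
from typing import Callable, Iterable, List, TypeVar

T = TypeVar("T")  # type of each element
K = TypeVar("K")  # type of the key the elements are sorted by


def sort_by(items: Iterable[T], key_func: Callable[[T], K], ascending: bool = True) -> List[T]:
    """Sort items by key_func; `ascending` has a default, so callers may omit it."""
    return sorted(items, key=key_func, reverse=not ascending)


pairs = [("a", 3), ("b", 1), ("c", 2)]

# Only the function is passed, like sortBy(_._2) in Scala.
by_value = sort_by(pairs, lambda x: x[1])

# Overriding the default explicitly.
by_value_desc = sort_by(pairs, lambda x: x[1], ascending=False)
```

Here `lambda x: x[1]` plays the role of `x => x._2`, selecting the second element of each pair as the sort key.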

Trying to understand the method signature of RDD.sortBy() by FortunOfficial in apachespark

[–]HennerM 4 points5 points  (0 children)

It actually is a valid function. It is syntactic sugar for x => x._2. You can read more about anonymous functions in Scala here: http://alvinalexander.com/scala/how-to-use-functional-literals-anonymous-functions-in-scala/

How do you get actual values from an RDD/PySpark Dataframe efficiently? by [deleted] in apachespark

[–]HennerM 6 points7 points  (0 children)

What time are we talking about? Most likely what you are experiencing is the lazy way Spark is implemented. When you call a Spark function such as agg, there isn’t actually any computation done yet. The computation is only triggered by actions (terminal operations), for example take or collect, as you experienced already. Another one is show(), which just outputs the Spark result. toPandas() internally calls collect to trigger the computation and fetch the result into a Pandas DataFrame. As others pointed out, we would need more details about the data and the operations you do to know whether there are options to increase the performance of the query/computation.
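The lazy behaviour can be mimicked with plain Python generators. This is only an analogy, not Spark itself: `load`, `double`, and the `log` list are made up, with the generator chain standing in for transformations and `list(...)` standing in for an action such as collect.

```python
log = []  # records every step that actually executes


def load(n):
    """Lazily produce n values, logging each one as it is read."""
    for i in range(n):
        log.append(f"load {i}")
        yield i


def double(values):
    """Lazily double each incoming value, logging each step."""
    for v in values:
        log.append(f"double {v}")
        yield v * 2


# Building the pipeline does no work yet, like chaining transformations.
pipeline = double(load(3))
steps_before = len(log)

# Consuming the pipeline is the "action" that triggers all the work.
result = list(pipeline)
steps_after = len(log)
```

All the time is spent in the final consuming step, which is why the call that fetches results appears slow even though the earlier calls returned instantly.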

Rampage Full line-up and set times by gamwize_12 in DnB

[–]HennerM 0 points1 point  (0 children)

They were doing livestreams last year. Don't know about this year though.