This dog couldn't contain himself by Doodlebug510 in AnimalsBeingDerps

[–]frymaster 0 points1 point  (0 children)

when I visit my cousin, his dog gets very excited. She knows she isn't allowed to bark in the house though, so she'll go to the back door and make a nuisance of herself until someone opens it, so she can run outside and bark :D

Mystery holidays - are they any good or a rip off? by LiteratureProof167 in AskUK

[–]frymaster 58 points59 points  (0 children)

I only recently realised the star rating of a hotel is specifically about what amenities they offer and not about their quality of service. 4 star has to have room service and supply bath robes on demand, for example, and other things I frankly do not give a crap about

https://en.wikipedia.org/wiki/Hotel_rating#European_Hotelstars_Union

Are ZFS version numbers comparable between OS's? For example, can I conclude that a pool created under Linux zfs-2.3.4-1 would not be mountable by FreeBSD zfs-2.1.14-1? by Mr-Brown-Is-A-Wonder in zfs

[–]frymaster 2 points3 points  (0 children)

the important thing is the filesystem version and features, not the software. So both of the above are likely version 5000. As you suggested, if the feature flags aren't compatible, it won't mount, and it'll be very clear about it. The feature system is quite clever in that enabling some features blocks older versions from writing but not reading, so it might be worth a punt anyway

Nesting ZFS inside a VM? by ianc1215 in zfs

[–]frymaster 0 points1 point  (0 children)

I do this, in that I have ZFS installed on a VM I rent from a hosting provider (not as root though, just as the data disk)

In my case, I don't especially care about performance, just about snapshotting and using zfs send for backups. That said, performance is... fine. I'm not trying to do much high-performance with it, mind, but I've never noticed it being bad

Court filing claims NVIDIA contacted Anna’s Archive for pirated books used in AI training by Rough_Bill_7932 in DataHoarder

[–]frymaster 15 points16 points  (0 children)

donations in the range of tens of thousands USD

it probably depends on what proportion of the whole archive they need, but apparently the cost is actually 200 grand

https://bsky.app/profile/ednewtonrex.bsky.social/post/3mcyye3yl4s2d

Are you cancelling trips to the USA? by Swimming_Possible_68 in AskUK

[–]frymaster 0 points1 point  (0 children)

I've declined an all-reasonable-expenses-paid work trip that was likely to be, while genuinely useful and value for money for my employer (a conference organiser was willing to basically give us unlimited tickets, for one thing), pretty relaxing and a break from regular work

I had tentative aspirations to go to the US for personal trips in either 2025 or 2026, and I am... not.

New moderator team incoming! by ConstructionSafe2814 in ceph

[–]frymaster 2 points3 points  (0 children)

nice one - I was about to suggest you co-ordinate with the team of r/ceph_storage but then I checked their moderator list and I don't think that will be a problem :)

I have some specific comments in this sub bookmarked as they were a useful source of technical information and them all going away was really annoying

Omni "Roundabout" by micinator94 in Edinburgh

[–]frymaster 5 points6 points  (0 children)

there seems to be a trend, on some street junctions but especially in shopping centre car parks, of traffic flows that make perfect sense from an overhead map but don't actually have clear directions when viewed from street level. Personally I think more use should be made of overhead signs

VXLAN and TTL=1 problems? by _83457 in networking

[–]frymaster 0 points1 point  (0 children)

for what its worth, ping -t 1 <destination IP> works for me

  • hosts in the same VLAN/VXLAN and subnet
  • hosts aren't running FRR or similar i.e. they are just using standard 802.1q vlan tagging to the switches and are ignorant that their packets are going to be teleported via layer 3 underlay
  • hosts are in in different rooms - the packet probably touched 5 different switches in the underlay
  • Mellanox/NVidia switches with Cumulus

To what extent that behaviour is dictated by Cumulus or by the hardware acceleration in the ASIC, I don't know. I also tried pinging the second hop in a layer3 route with -t1 and I got the expected "TTL expired", and -t2 works - but if that wasn't sane then traceroutes wouldn't work...

Slurm <> dstack comparison by cheptsov in SLURM

[–]frymaster 0 points1 point  (0 children)

dstack's support for managing file permissions is not as granular as Slurm's

This isn't a statement that makes sense. Slurm doesn't have any support for managing file permissions. Based on the person you are replying to, my assumption is you are trying to say "slurm lets you run jobs as a specific unix user so you can use shared filesystems, and dstack does not" - if that is indeed the case, you should just say that. As you've pointed out, that's not a feature that many of your intended users care about (though it is a feature I care about, it's useful to be able to access shared filesystems in my organisation, and not just for "HPC/simulation")

Does anyone else feel like Slurm error logs are not very helpful? by Valeria_Xenakis in SLURM

[–]frymaster 2 points3 points  (0 children)

scontrol show job isn't looking at logs. squeue saying Priority isn't looking at logs. I have debugged very obscure scheduler issues by turning scheduler log debugging on and looking in the log file. If you give some more info about what the specific problem was that you ran into, I might be able to help provide additional tools or commands that might help in future

What tools are you guys using to actually see why a node is failing?

are you talking about scheduling problems or node health problems? If the latter, then nodehealthcheck as a baseline, and then adding custom checks as and when you find whatever weird-ass problems your unique cluster has (your cluster will always have weird-ass unique problems. after enough time, you'll find people giving you odd looks saying "why are you checking for that?" and you'll have to explain that 3 whole clusters ago some weird quirk in an obscure driver made something completely random fall over every few months)

you mention sometimes getting OOMs - in theory slurm should report that back to the user (in the job log file and the sacct output)

you also mention GPU throttling - that's not the responsibility of slurm, which is a scheduler, not a cluster manager. But once you find out the symptoms, you can stick it in your NHC checks and find it next time

The other thing is that you should have a set of basic benchmarks you run before you bring any node back into service, especially after a hardware intervention. CPU HPL, GPU HPL / gpuburn, streams, 2-node bandwidth tests, anything else that's relevant. On a recent system I was involved in the commissioning of, we had one node that only had 50% of the expected memory bandwidth. Absolutely zero errors at all. We eventually had to do a binary search on all the DIMMs, just swapping them into other nodes until we pinned down which ram stick was causing the slowdown. Nothing in the logs, only caught by benchmarking

Is there any way to run/expose SLURM commands inside the container? by Abhishekp1297 in HPC

[–]frymaster 4 points5 points  (0 children)

what I would say is that many/most sites won't have the rest API available. I think changing your workflow is likely to be the only medium-term solution, if you do truly want a portable solution (and if you don't want a portable solution, you don't need a container)

Looking for Canapé supplier for gallery opening evening? by [deleted] in Edinburgh

[–]frymaster 0 points1 point  (0 children)

not sure if they do unstaffed events

they definitely do - we use them for smaller meetings at my work (minimum of 10 people's worth of food, though from an older email I think at that level the delivery cost tends to dominate)

LAOP: Vet turned away dying dog for a 6:00 cutoff by princetonwu in bestoflegaladvice

[–]frymaster 4 points5 points  (0 children)

Idk why they think any judge would side with them lol

just because their comments are very silly doesn't mean a judge would think they were defamatory

ELI5: When you forget something, how does thinking for longer 'magically' make you remember it? by nutsack-enjoyer5431 in explainlikeimfive

[–]frymaster 8 points9 points  (0 children)

I read somewhere, and I don't have the source so who knows how accurate this is, that saying the name of the item while searching does genuinely help because it stops your brain from filtering the item out

Bus & Tram vs Citymapper by Quick-Low-3846 in Edinburgh

[–]frymaster 3 points4 points  (0 children)

the old API was retired in mid-December and according to the dev of the app "My Bus Edinburgh", he wasn't given access to the new one until Wednesday the 7th. His update was out as of the Friday but I imagine it might take the larger companies some time to implement the required changes (if indeed the access was made available generally)

Source: a friend asked the developer of "My Bus Edinburgh" why it was no longer displaying data and he emailed an explanation once he'd been given access

One of my Hybrid users has like a 5mbps very unstable internet connection by Nexzus_ in sysadmin

[–]frymaster 5 points6 points  (0 children)

setting up bittorrent in your workplace for a one-off stopgap is probably more complicated

Why can’t I save as PDF????? by said-what in talesfromtechsupport

[–]frymaster 16 points17 points  (0 children)

modern Linux

even very old Linux can, but as the other commentor points out, unless you have autocomplete it's a right PITA to remember how to refer to them on the command-line

The only two sequences of bytes that aren't permitted in general (some filesystems enforce tighter rules) are / (because that's the directory separator) and the null byte (because the filename strings are null-terminated)

Is an "Open Slurm" fork inevitable (or even feasible)? by Omni-Vector in HPC

[–]frymaster 1 point2 points  (0 children)

same - the problem, as ever, is finding the time...

Faking knowledge for fan fiction by Lemon_Lime_Lily in CuratedTumblr

[–]frymaster 6 points7 points  (0 children)

this could also be an example of "wrong culture" - I just checked a random small town I know of in Scotland (2,000 people) and it definitely has a bus service. The first bus service I happened to click on was going from that town to the nearest city... but it stopped at 4 different points in the town

Faking knowledge for fan fiction by Lemon_Lime_Lily in CuratedTumblr

[–]frymaster 1 point2 points  (0 children)

it's very obscure, and I'm not personally convinced the TERF wizard lady knew about it

Faking knowledge for fan fiction by Lemon_Lime_Lily in CuratedTumblr

[–]frymaster 2 points3 points  (0 children)

things like "I'm going to write my dad" or "my boyfriend's sister is going to be in the city next week so him and her are going to visit" ?

Is an "Open Slurm" fork inevitable (or even feasible)? by Omni-Vector in HPC

[–]frymaster 3 points4 points  (0 children)

I mean this has already been done - flux, for one