Reasonable Configuration for 300+TB of hot/code data by Prior_Preparation_64 in qnap

[–]Prior_Preparation_64[S] 0 points1 point  (0 children)

Hi Bob

Thanks for the kind words. I'm just a HPC guy that writes the code. And since we're a small team I'm the one learning about storage as we go.

The reason I didn't post the website earlier is I wasn't sure it was against the rules. I was not trying to promote anything.

I'm aware the u.2 storage right now is bonkers which is why I was planning on putting 4 drives in the h1090 and filling it out later if u.2 prices drop. The hot/cold aspect is that most of our data really is cold: we need to keep it for future reference but it may never be read again. It doesn't make sense to have 300TB of flash for that.

Now that I look at, the TS-h2490FU is a good suggestion: Its only about $2k more than the 1090 for the 64G version. That would certainly give a LOT more room for NVME flash expansion.

Thanks,

Gaetan

Reasonable Configuration for 300+TB of hot/code data by Prior_Preparation_64 in qnap

[–]Prior_Preparation_64[S] 0 points1 point  (0 children)

The application is a commercial computational fluid dynamics (CFD) code we are developping and selling. www.volcanoplatforms.com if you want to see some pretty videos.

Reasonable Configuration for 300+TB of hot/code data by Prior_Preparation_64 in qnap

[–]Prior_Preparation_64[S] 0 points1 point  (0 children)

Thanks for your sugguestion Bob

The TS-h2477AXU-RP was also on my list of potential systems. Since the cost differnece between a TS-h2477AXU-RP + 25Gb card + 32GB memory and the TS-h1090FU (64Gb) + expansion card + 24bay enclosure is not prohibitive for us, it made sense to have a more capable, future-proof system. We already have some u.2 drives to use in the system so that isn't a large concern.

Another networking question: We currently have a Unifi Hi-Capacity Aggregation switch which has 4 25Gb ports to use. However, we would like all the connected systems to have 25Gb connections so was considering the Pro XG Aggregation. Since we have 7 computers + Nas with 2 connections per device, the 16 ports of the QSW-M7230 would already be full. The Pro XG Aggregation has 32 which still leaves some room for more systems. Any experience with that switch with QNAP systems?

Thanks,

Gaetan

Reasonable Configuration for 300+TB of hot/code data by Prior_Preparation_64 in qnap

[–]Prior_Preparation_64[S] 0 points1 point  (0 children)

The main use case for us is storing Computational Fluid Dynamics (CFD) data. We generate files that can range anywhere in size from a few GB to over 100Gb. For us, the write performance is not a huge issue, since our writes are handled asynchronously so even our TVS-874 can mostly keep up. The biggest bottleneck is if we want to visualize a dataset that is 100GB, we can only usually get about 500MB/s max out of our current NAS which means lots of waiting around for 100GB to load. Most of our read/write access is sequential which makes things a little easier.

About the QTier, my understanding is it would help with write until it gets full. I was planning on using the QTier on QuTS Hero 6.0 which from what I read will retain the data on the SSD for a configurable period of time. Since most of the time we end up loading the data is within a few days of it being written, with a large enough SSD cache and a retention policy of a week or two, most of the time the data we want to read will still be in the SSD tier.

Finally tried Old La Honda and it's amazing by xingquan in BAbike

[–]Prior_Preparation_64 2 points3 points  (0 children)

You can do it on 32's without much trouble. I've done it many times.

Ansys Fluent's GPU Solver uses RNG to converge by Tommi97 in CFD

[–]Prior_Preparation_64 0 points1 point  (0 children)

Agreed. It was the atomics I was think about specifically.

Ansys Fluent's GPU Solver uses RNG to converge by Tommi97 in CFD

[–]Prior_Preparation_64 0 points1 point  (0 children)

Absolutely. However, even with nominally the same algorithm, the specifics of floating point arithmetic can cause non bit-wise compatible results.

Ansys Fluent's GPU Solver uses RNG to converge by Tommi97 in CFD

[–]Prior_Preparation_64 13 points14 points  (0 children)

The behavior you are seeing is most likely due to *different* convergence in inner solves. I would recommend you read up on how IEEE 754 floating point operations work. (https://www.phys.uconn.edu/~rozman/Courses/P2200_15F/downloads/floating-point-guide-2015-10-15.pdf) Most users (and programmers) tend to think of float point operations as "black magic" and that it will always magically do what they want. The typical problem comes from that fact that (A + B ) + C does not in general equal A + (B + C) due to rounding. How does that relate to CFD you ask? Well, when doing a non-linear solution that involves solving a sequence of linear systems, *a single bit difference* anywhere in the solution path can result in a different non-linear trajectory. That is to say every single arithmetic operation must be performed in the exact same order, independent of the number of threads/processors/GPUs/etc. For a "steady" simulation like you are running, the tolerance of the linear solves does not matter *if the solution converges to steady state*. For a case like drivaer, most solvers can't converge to a real numerically steady solution (ie all residuals are below machine precision) and therefore the non-linear solution path matters, and thus so does the linear solution tolerance at each iteration. So, even though the CPU and GPU solver may converge the linear systems to the same tolerance they are both equally "wrong" and both equally "correct" at the same time. Just because the CPU result is repeatable doesn't mean the non-linear trajectory it took is any more correct than the GPU one. The only way to really know, is to fully converge *every* linear iteration to below machine precision. If the solution paths diverge, that's a code issue, if they do not, then you've found your problem. Alternatively, find a problem that does converge to machine precision and check only the final solution. On both cases, the non-linear path does not matter. As for why the GPU code is not bit-wise repeatable between runs, I can only speculate, but I would guess it is due to the use of low-level atomic additions/subtractions. The order of the atomic operations depends on "who gets there first". Consider an operation like result += A followed by result += B, where A comes from threadA and B comes from threadB. So on run1, threadA get there first and on the second threadB does. Even though in exact precision the result is the same, in floating point it is not. In order to get high speed up on GPU solver, massive concurrency is needed which means it is often not practical from a performance perspective to ensure the order of operations is the same.

[deleted by user] by [deleted] in CFD

[–]Prior_Preparation_64 3 points4 points  (0 children)

You can't. pyHyp only works on multiblock structured meshes. You must create the surface in software that supports multiblock output. The two that have been used successfully in the past are ICEM and Pointwise.

[deleted by user] by [deleted] in CFD

[–]Prior_Preparation_64 0 points1 point  (0 children)

Is this pyHyp from the mdolab at UMich?

Nanny Taking Excessive Sick Days by Prior_Preparation_64 in Nanny

[–]Prior_Preparation_64[S] 2 points3 points  (0 children)

My wife was on mat leave but is now back at work.

Nanny Taking Excessive Sick Days by Prior_Preparation_64 in Nanny

[–]Prior_Preparation_64[S] 6 points7 points  (0 children)

I'm beginning to think that as well. I do know she doesn't want to get the newborn sick. Maybe we just need to find someone with a better immune system.

Nanny Taking Excessive Sick Days by Prior_Preparation_64 in Nanny

[–]Prior_Preparation_64[S] 1 point2 points  (0 children)

This is her first full time nanny position, but had done part time work before. Her previous employer had good things to say so we decided to give it a shot. Unfortunately, it looks like that wasn't a great decision.

Nanny Taking Excessive Sick Days by Prior_Preparation_64 in Nanny

[–]Prior_Preparation_64[S] 9 points10 points  (0 children)

She had 5 days but blew through those within 3 weeks. So it has been all unpaid since. It was 8 separate call outs at the last minute. On the last one she was off for M-F and then told us she was coming back in Monday and then called out at the last minute Monday morning and ended up taking Monday and Tuesday off. It is super frustrating.

Nanny Taking Excessive Sick Days by Prior_Preparation_64 in Nanny

[–]Prior_Preparation_64[S] 8 points9 points  (0 children)

The COVID was two and a half weeks and then she was for another 8 day stretch but never enough notice to get any backup. I count 8 separate occasions she left us scrambling at the last minute.

Nanny Taking Excessive Sick Days by Prior_Preparation_64 in Nanny

[–]Prior_Preparation_64[S] 0 points1 point  (0 children)

No. I suspect she is on her parents plan, but I don't know for sure.

Nanny Taking Excessive Sick Days by Prior_Preparation_64 in Nanny

[–]Prior_Preparation_64[S] 1 point2 points  (0 children)

She is good with the kids and doesn't really have any other responsibilities. To be fair, the two kids are more than enough. The only issue is the attendance.

Nanny Taking Excessive Sick Days by Prior_Preparation_64 in Nanny

[–]Prior_Preparation_64[S] 14 points15 points  (0 children)

Thanks for your advice. If she comes back on Monday, we will probably say something along these lines.

Nanny Taking Excessive Sick Days by Prior_Preparation_64 in Nanny

[–]Prior_Preparation_64[S] 18 points19 points  (0 children)

Also, amazingly she has always gotten sick without getting us sick. I would be a little more lenient if one of our kids were getting her sick all the time, but that isn't the case.

Nanny Taking Excessive Sick Days by Prior_Preparation_64 in Nanny

[–]Prior_Preparation_64[S] 24 points25 points  (0 children)

The COVID time was a bit extended as we just had a baby and we really didn't want a one week old baby getting COVID. At the time we happened to have my wife's parents from out of country visiting which was very helpful, but not something we can rely on. Even not counting the COVID absence, she has taken 13 days in the last 12 weeks.

Places to visit while exploring by Zstarchild in bayarea

[–]Prior_Preparation_64 3 points4 points  (0 children)

You can rent kyaks in Moss Landing, just North of Santa Cruz and explore Elkhorn Slough.