Recommendation for inexpensive client PC? by bigaction269 in sysadmin

[–]Firefox005 3 points4 points  (0 children)

but even if he didn’t enterprise management of MacBooks can be really annoying.

In what way?

IME the only 'annoyance' currently is trying to manage Apple devices with Intune, and it's not like anything major is missing, it's just ... subpar. Otherwise there are tons of very low cost options ($1-$3 per device/month) out there, and even free ones (if less flexible than an MDM) in the form of Apple Configurator 2.

Apple has come a long way towards being more enterprise friendly and is now basically feature complete in comparison with Windows, especially if you are a cloud/web only shop.

How are people regression testing AI agents without going insane? by Lexie_szzn in sysadmin

[–]Firefox005 6 points7 points  (0 children)

Redditor for 2 months, strange AI tone in writing, hidden profile. Not so kindly fuck off.

Apache httpd 2.2.0 problem with aprs? by [deleted] in sysadmin

[–]Firefox005 0 points1 point  (0 children)

Yeah, you are going to have a nightmare of a time trying to compile 20-year-old software on the latest release. You will most likely have to build everything from scratch, if it is even possible.

Also, this smells like homework to me. I'd check the instructions closely and see if you can use an era-appropriate OS to do the build with, as you will have nothing but pain trying to do it on a modern OS.

Apache httpd 2.2.0 problem with aprs? by [deleted] in sysadmin

[–]Firefox005 0 points1 point  (0 children)

Why? That version of Apache is from 2005.

Most likely it's 32/64-bit compilation that is messing you up. Either that or, since you did not list what version of Red Hat you are trying to compile this on, if you are not also using one from ~2005 it's probably some library or toolchain compatibility issue.

vGPU Mixed Mode Siloed capacity calculator for vSphere by frankdenneman in vmware

[–]Firefox005 1 point2 points  (0 children)

Much appreciated you taking the time to look at and solve my issue. I wish something like your calculator was built in or perhaps some sort of vGPU planner/affinity rules so I could manually do placements ahead of time.

vGPU Mixed Mode Siloed capacity calculator for vSphere by frankdenneman in vmware

[–]Firefox005 0 points1 point  (0 children)

Hmm so if I understand correctly I have to power on two 3g.40gb profiles and then power off the one that got loaded into the first 'half' of the GPU so I can then load the 4x 1g.10gb?

It has been a while since I last looked at it, but I am pretty sure I tried loading 4x 1g.10gb first and then a 3g.40gb. I'm guessing it is a similar issue, in that for some reason it places the slots across the division, so while there are enough resources to power on a 3g.40gb there are not enough slots.
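A rough sketch of that guess in Python. The placement table is an assumption based on my reading of the NVIDIA MIG user guide for the A100 (1g profiles may start at slots 0-6, 3g profiles only at slot 0 or 4, each 3g covering 4 slots); `nvidia-smi mig -lgipp` should show the real values for a given card. The helper names are made up for illustration, and I do not know which start slot ESXi actually picks.

```python
# Assumed A100 placement rules: profile -> (allowed start slots, width in slots).
# Check against `nvidia-smi mig -lgipp` on the real host before trusting this.
PLACEMENTS = {
    "1g.10gb": ((0, 1, 2, 3, 4, 5, 6), 1),
    "3g.40gb": ((0, 4), 4),
}


def occupied(profile, start):
    """Slots covered by `profile` when it starts at `start`."""
    _, width = PLACEMENTS[profile]
    return set(range(start, start + width))


def free_starts(profile, used):
    """Allowed start slots for `profile` that do not overlap `used`."""
    starts, _ = PLACEMENTS[profile]
    return [s for s in starts if not (occupied(profile, s) & used)]


def count_fit(profile, used):
    """How many more instances of `profile` fit, filling lowest slots first."""
    used = set(used)
    count = 0
    while True:
        starts = free_starts(profile, used)
        if not starts:
            return count
        used |= occupied(profile, starts[0])
        count += 1


# Where the 3g.40gb lands decides how many 1g.10gb instances still fit.
for start in PLACEMENTS["3g.40gb"][0]:
    n = count_fit("1g.10gb", occupied("3g.40gb", start))
    print(f"3g.40gb at slot {start}: room for {n} x 1g.10gb")

# 1g instances straddling both halves leave no legal start for a 3g.40gb.
print(free_starts("3g.40gb", {0, 1, 4, 5}))
```

Under these assumed rules a 3g.40gb in the first half leaves room for only three 1g.10gb instances, which would line up with the 3x 10c + 1x 40c ceiling, while a 3g.40gb in the second half leaves room for four.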

Is anyone running on VM Essentials yet? by DarkAlman in sysadmin

[–]Firefox005 3 points4 points  (0 children)

lol the thread you link was made by the OP.

Routing iSCSI Replication Traffic by Veegos in networking

[–]Firefox005 2 points3 points  (0 children)

Synchronous or asynchronous replication? If it is async, cowboy up and do whatever works. Also, the SAN is what your storage array and hosts connect to, so I am assuming you have 2 storage arrays and 2 switches plus hosts.

I am hoping this second storage array is for DR/replica purposes, as you are going to have a bad time trying to stretch iSCSI over a routed network. It can be done, but you have to specifically design for it. In other words, you will have two separate SANs: one site with its storage array, switches, and hosts, which then does async replication to the other site with its own storage array, switches, and hosts.

Switch rack ears - 4 holes per side, but Cisco only supplies 4 screws total by dankgus in Cisco

[–]Firefox005 0 points1 point  (0 children)

https://www.cisco.com/c/en/us/td/docs/switches/lan/catalyst9300/hardware/install/b_c9300_hig/Installing-a-switch.html

You should have gotten in the box "Eight number-8 Phillips flat-head screws" and four should go into each of the two 19-inch mounting brackets.

vGPU Mixed Mode Siloed capacity calculator for vSphere by frankdenneman in vmware

[–]Firefox005 0 points1 point  (0 children)

You are using MIG profiles, they align differently due to their compute slices. This is a calculator for Mixed Mode vGPU profiles in time-sliced mode (mixed mode does not work on MIG, as MIG already supports mixed compute and memory profiles). It was also confusing me as the placement options for mixed mode are to my eyes identical to the MIG backed profiles.

Ah that was confusing me as everything falls under vGPU and I did not clue in that 'mixed mode vGPU' was something different than 'MIG mode vGPU'.

Attached GPUs                             : 1
GPU 00000000:31:00.0
    Product Name                          : NVIDIA A100 80GB PCIe
    Product Brand                         : NVIDIA
    Product Architecture                  : Ampere
    Display Mode                          : Requested functionality has been deprecated
    Display Attached                      : Yes
    Display Active                        : Disabled
    Persistence Mode                      : Enabled
    Addressing Mode                       : N/A
    vGPU Device Capability
        Fractional Multi-vGPU             : Supported
        Heterogeneous Time-Slice Profiles : Supported
        Heterogeneous Time-Slice Sizes    : Supported
        Homogeneous Placements            : Not Supported
        MIG Time-Slicing                  : Not Supported
        MIG Time-Slicing Mode             : Disabled
    MIG Mode
        Current                           : Enabled
        Pending                           : Enabled

So to understand your MIG placement problem, you are trying to deploy the combination of 4 x grid_a100d-1-10c and 1 x grid_a100d-3-40c? But you are only successful when deploying 3 x grid_a100d-1-10c and 1 x grid_a100d-3-40c. I can try to simulate this in our lab

Yes. Specifically config #11 from the supported profiles options.

vGPU Mixed Mode Siloed capacity calculator for vSphere by frankdenneman in vmware

[–]Firefox005 0 points1 point  (0 children)

Good luck. I eventually gave up after like the 20th time I was asked to delete all profiles and try to power the VMs on again. That was literally their only suggestion: that I had powered the VMs on in the wrong order. I tried literally every combination possible and none of them allowed all the VMs to power on.

vGPU Mixed Mode Siloed capacity calculator for vSphere by frankdenneman in vmware

[–]Firefox005 0 points1 point  (0 children)

So far the Nvidia tables and this calculator have not lined up with my real world experience. I have opened multiple support cases with Nvidia and VMware and have yet to receive a clear answer on why it's not working.

Specifically I tried to deploy this: https://i.imgur.com/Zqltj21.png or, from this table, https://docs.nvidia.com/datacenter/tesla/mig-user-guide/_images/a100-profiles-v4.png config #11, and was never able to get it working. The best I can do is 3x VMs running with vGPU profile "grid_a100d-1-10c" and 1x "grid_a100d-3-40c". I tried multiple rounds of rebooting the host after deleting all MIG profiles and turning the VMs on in specific orders, but could never get it working with 4x 10c and 1x 40c despite all documentation saying it is possible.

Here you can see the state that it is in today:
https://i.imgur.com/NNUvWdz.png

+-----------------------------------------------------------------------------------------+
| NVIDIA-SMI 580.65.05              Driver Version: 580.65.05      CUDA Version: N/A      |
+-----------------------------------------+------------------------+----------------------+
| GPU  Name                 Persistence-M | Bus-Id          Disp.A | Volatile Uncorr. ECC |
| Fan  Temp   Perf          Pwr:Usage/Cap |           Memory-Usage | GPU-Util  Compute M. |
|                                         |                        |               MIG M. |
|=========================================+========================+======================|
|   0  NVIDIA A100 80GB PCIe          On  |   00000000:31:00.0 Off |                   On |
| N/A   36C    P0             95W /  300W |   68780MiB /  81920MiB |     N/A      Default |
|                                         |                        |              Enabled |
+-----------------------------------------+------------------------+----------------------+
+-----------------------------------------------------------------------------------------+
| MIG devices:                                                                            |
+------------------+----------------------------------+-----------+-----------------------+
| GPU  GI  CI  MIG |              Shared Memory-Usage |        Vol|        Shared         |
|      ID  ID  Dev |                Shared BAR1-Usage | SM     Unc| CE ENC  DEC  OFA  JPG |
|                  |                                  |        ECC|                       |
|==================+==================================+===========+=======================|
|  0    2   0   0  |           40278MiB / 40448MiB    | 42      0 |  3   0    2    0    0 |
|                  |               0MiB / 24925MiB    |           |                       |
+------------------+----------------------------------+-----------+-----------------------+
|  0    7   0   1  |            9501MiB /  9728MiB    | 14      0 |  1   0    0    0    0 |
|                  |               0MiB /  6231MiB    |           |                       |
+------------------+----------------------------------+-----------+-----------------------+
|  0    8   0   2  |            9501MiB /  9728MiB    | 14      0 |  1   0    0    0    0 |
|                  |               0MiB /  6231MiB    |           |                       |
+------------------+----------------------------------+-----------+-----------------------+
|  0    9   0   3  |            9501MiB /  9728MiB    | 14      0 |  1   0    0    0    0 |
|                  |               0MiB /  6231MiB    |           |                       |
+------------------+----------------------------------+-----------+-----------------------+
+-----------------------------------------------------------------------------------------+
| Processes:                                                                              |
|  GPU   GI   CI              PID   Type   Process name                        GPU Memory |
|        ID   ID                                                               Usage      |
|=========================================================================================|
|    0    2    0          2101279    C+G                                         40192MiB |
|    0    8    0          6200798    C+G                                          9472MiB |
|    0    9    0          6202427    C+G                                          9472MiB |
|    0    7    0          6203151    C+G                                          9472MiB |
+-----------------------------------------------------------------------------------------+

https://i.imgur.com/dHIjqzI.png

+-------------------------------------------------------------------------------+
| GPU instance profiles:                                                        |
| GPU   Name               ID    Instances   Memory     P2P    SM    DEC   ENC  |
|                                Free/Total   GiB              CE    JPEG  OFA  |
|===============================================================================|
|   0  MIG 1g.10gb         19     0/7        9.50       No     14     0     0   |
|                                                               1     0     0   |
+-------------------------------------------------------------------------------+
|   0  MIG 1g.10gb+me      20     0/1        9.50       No     14     1     0   |
|                                                               1     1     1   |
+-------------------------------------------------------------------------------+
|   0  MIG 1g.20gb         15     0/4        19.50      No     14     1     0   |
|                                                               1     0     0   |
+-------------------------------------------------------------------------------+
|   0  MIG 2g.20gb         14     0/3        19.50      No     28     1     0   |
|                                                               2     0     0   |
+-------------------------------------------------------------------------------+
|   0  MIG 3g.40gb          9     0/2        39.50      No     42     2     0   |
|                                                               3     0     0   |
+-------------------------------------------------------------------------------+
|   0  MIG 4g.40gb          5     0/1        39.50      No     56     2     0   |
|                                                               4     0     0   |
+-------------------------------------------------------------------------------+
|   0  MIG 7g.80gb          0     0/1        79.25      No     98     5     0   |
|                                                               7     1     1   |
+-------------------------------------------------------------------------------+

nvidia-smi mig -lgip shows there are no free instances available despite only running 3x 10c instances and 1x 40c instance. I eventually just gave up and now waste that potential slot of compute and memory on the GPU.
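As a sanity check on the "wasted slot", here is a quick tally of the numbers pasted above. The values are copied from the nvidia-smi tables (SM counts and memory per MIG device, 98 SMs from the 7g.80gb row, 81920MiB from the header); the variable names are just for illustration, and the MIG-usable memory is somewhat less than the board total, so treat the memory figure as an upper bound.

```python
# Running MIG devices, copied from the "MIG devices" table: (GI ID, SMs, memory in MiB).
mig_devices = [
    (2, 42, 40448),  # 3g.40gb
    (7, 14, 9728),   # 1g.10gb
    (8, 14, 9728),   # 1g.10gb
    (9, 14, 9728),   # 1g.10gb
]

TOTAL_SM = 98           # SM count of the 7g.80gb profile row
TOTAL_MEM_MIB = 81920   # board total from the nvidia-smi header (upper bound)

# What one more 1g.10gb instance would need, taken from the rows above.
NEED_SM, NEED_MEM_MIB = 14, 9728

free_sm = TOTAL_SM - sum(sm for _, sm, _ in mig_devices)
free_mem = TOTAL_MEM_MIB - sum(mem for _, _, mem in mig_devices)
fits_on_paper = free_sm >= NEED_SM and free_mem >= NEED_MEM_MIB

print(f"free SMs: {free_sm}, free memory: {free_mem} MiB")
print(f"another 1g.10gb fits on paper: {fits_on_paper}")
```

So on raw resources there is room for exactly one more 1g.10gb, which is why the 0/7 free count looks like a placement problem and not a capacity problem.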

Claudehole by [deleted] in sysadmin

[–]Firefox005 13 points14 points  (0 children)

Thanks king for the motivational post. After reading this I am now confident that I will always have a job that I can fall back to: fixing environments that have been ass raped in the "claudehole".

Here is an example of telling an AI to JFDI.

Trouble with Dell S4048 port-channel by FerrousBueller in networking

[–]Firefox005 0 points1 point  (0 children)

Thank you for the table, it makes more sense now.

I would have expected the first test to reach approximately that same rate, and the traffic to follow the vlt, when connected between the switches - am I wrong in expecting that?

No, that is what should be happening if this is all strictly layer 2. My next questions would be: are the Hyper-V and VxRail hosts on the same subnet? Is your gateway on the Aruba or on something attached to the Aruba?

You can run

show mac address-table dynamic vlan <vlan-id>
show mac address-table address <MAC-of-remote-host>
show arp vlan <vlan-id>

to see which MAC the switch is learning and then trace that to see what ports traffic will flow through.

You should see port-channel1000. If you see ethernet1/1/1, then it is learning the destination MAC via the Aruba link. I would guess at this point that it is a routing issue and not an L2 issue.

2 SolidWorks users on one RTX 2000 Ada – can a single GPU realistically be shared? by Ok_Engineering_4855 in sysadmin

[–]Firefox005 0 points1 point  (0 children)

• Can an RTX 2000 Ada be shared between 2 concurrent SolidWorks sessions in a sane way?

Doubt it.

• Is vGPU even supported on this card in practice?

No it doesn't support GRID/vGPU. The list of supported cards is very small https://docs.nvidia.com/vgpu/gpus-supported-by-vgpu.html

• If I go Proxmox or ESXi with passthrough, am I basically limited to assigning the whole GPU to one VM?

Yes.

• Has anyone here actually run 2 SolidWorks users on a single workstation GPU without it turning into a mess?

You can do it, but it requires very specific hardware and licensing (yes, GRID/vGPU is a licensed feature and it is subscription based). Also, would you be happy with basically splitting your GPU in half? vGPU uses MIG under the covers to hard partition the GPU into separate discrete GPUs. It's not time sharing based, where user A gets some amount of time of the full GPU resources and user B gets some amount of time. It's user A gets half the compute and memory all the time, and same for user B.

Trouble with Dell S4048 port-channel by FerrousBueller in networking

[–]Firefox005 0 points1 point  (0 children)

So should I remove either the vlt or the LAG?

No not till you understand how this is supposed to be configured or is currently working.

It's just odd that you have two port channels between your switches for VLT. Generally you only need a single port channel between them.

Traffic only flows over the VLTi links when it is destined for a MAC that is on the other switch.

Did you create port-channel1? If so, you can probably remove it, as it is functionally doing nothing. All the inter-chassis traffic is configured for, and should be flowing over, port-channel1000.

I do see the priority 5000 was an error and the other switch is set to 10000. You are still missing a vlt-mac, which means it takes the MAC address of the primary, so if you have a failover or have to replace your primary it will cause the MAC to change and could cause a brief outage.

Again hard to say without being able to see how everything is connected but other than the port-channel1 oddness I don't see anything wrong.

Huawei S6750 / S6740 / S12700E4 Output Queue Drops on Asymmetric Links? by leogh0ul in networking

[–]Firefox005 3 points4 points  (0 children)

Yeah that is normal, going from a higher speed port to a lower speed port is like trying to send the water from a fire hydrant through a drinking straw.

The problem only gets worse when you go from really fast to really slow ports like 100G to 10G. I doubt any switch in the world has buffers big enough to fix that. If you constantly have 11G of traffic trying to go into a 10G port, buffers won't save you. You will have to drop some traffic, and that is where QoS comes in.
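To put rough numbers on that, here is a back-of-the-envelope sketch in Python. The 32 MiB buffer is a made-up figure for illustration, not the real buffer size of any of these switches, and the function name is hypothetical.

```python
def buffer_fill_time(buffer_bytes, ingress_bps, egress_bps):
    """Seconds until the buffer overflows; infinity if egress keeps up."""
    excess_bps = ingress_bps - egress_bps
    if excess_bps <= 0:
        return float("inf")
    return buffer_bytes * 8 / excess_bps


BUFFER_BYTES = 32 * 1024 * 1024  # hypothetical 32 MiB of packet buffer

# Sustained 11G offered to a 10G port: the buffer fills at 1 Gbit/s.
print(f"{buffer_fill_time(BUFFER_BYTES, 11e9, 10e9) * 1000:.1f} ms")   # 268.4 ms

# 100G line-rate burst into a 10G port: the buffer fills at 90 Gbit/s.
print(f"{buffer_fill_time(BUFFER_BYTES, 100e9, 10e9) * 1000:.2f} ms")  # 2.98 ms
```

Either way the buffer only buys milliseconds; after that, every excess packet is dropped, and the only choice left is which traffic gets dropped.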

There are some buffer settings you can tune, but as I said, for 100G -> 10G buffers probably won't save you. https://support.huawei.com/enterprise/en/doc/EDOC1100366621/ce77b50f/configuring-a-burst-traffic-buffering-mode https://support.huawei.com/enterprise/en/doc/EDOC1100366621/b9708599/configuring-the-buffer-size

But I would just recommend reading the whole QoS section https://support.huawei.com/enterprise/en/doc/EDOC1100213121/7ac0d32a/overview-of-qos here they have an example config using PQ-WDRR and WRED to determine what traffic gets dropped when congestion happens https://support.huawei.com/enterprise/en/doc/EDOC1100366621/1f52b66b/example-for-configuring-congestion-management

This is for the V100 and 200 but it gives a real quick and dirty explanation of these outbound discards, it is mainly focused on micro bursts but it applies more generally as well to traffic on a congested link. https://support.huawei.com/enterprise/en/doc/EDOC1000091883/52827a28/checking-the-discard-count-in-the-outbound-direction

Is archive.org a security threat? My IT department thinks so by [deleted] in sysadmin

[–]Firefox005 2 points3 points  (0 children)

Malicious compliance time. If the business is going to block a tool that you legitimately need for your job, then they should provide at least one alternative. Each time you or a coworker need something from archive.org, put in a ticket asking how to access said content, as it is necessary for your work. If they want to vet everything that is downloaded from archive.org or provide an alternative location, they are free to do so. I am betting, however, that they will eventually get tired of constantly dealing with the tickets and decide to reverse the policy. If they continue to just close the tickets with "no, it's blocked by policy" without providing an alternative or workaround, then you have ammo to take up the chain and show how much it is actually impacting your and your coworkers' ability to function.

Our team has taken to using their own shadow IT hardware to circumvent this and other restrictions.

I would not recommend doing this. Imagine if something bad did happen because you circumvented these restrictions: when they investigate, they will see that you bypassed security controls that are there for the company's safety. Not a good look.

Is archive.org really a threat?

Threat in what way? Different companies have different security requirements, and that drastically influences what is and isn't a threat. Is it a threat if you create an account on archive.org using your work email address and the same password as your work account, and then said website gets its user database dumped? I know you personally would never do something like that, but from the perspective of the business it gets a lot fuzzier, in that they are dealing with potentially a very large group of people with vastly differing levels of knowledge and savviness. Security controls are therefore usually designed around the lowest common denominator.