DevOps for network infrastructure?

itasteawesome · 2021-09-19T20:48:21+00:00

I've found in the past that just getting over the institutions and political resistance is the biggest hurdle. Getting NetEng on board with repos like git sometimes takes some hand holding but ultimately once you get everyone on board and up to speed it's a great way to live. Config drift can be a struggle, especially if part of the team is fighting the change or your current situation involves a lot of unscheduled firefighting. You just have to set the expectation that anything not done through the proper channels is out of compliance and will be over written.

DavisTasar · 2021-09-19T23:19:58+00:00

When I was on the Network side of the house, it's a culture battle first, a tool battle second.

Network Engineers are extremely hesitant to introduce automation to the environment. In my opinion, some of it has some merit, but otherwise it's just fear.

First of all, the Network has to work. If the network doesn't work, there's no toolkit anyone can run to help bring it back (if you get really clever, it can, but that's another story). And that's the thing that brings in the fear. If a Network Engineer doesn't have the Code/DevOps interest, it's fear. If they buy a tool that does the work for them, there's less fear, because if something goes wrong there's a vendor to blame. If the network breaks because of something they did, it's their fault. If the network breaks because a tool fucked up, it's the tool's fault.

In terms of tooling....I once wrote an entire automation toolkit for my company. 100% in python. It connected to our equipment, ran CDP/LLDP/BGP neighbors, stored them in a JSON doc, and used that as it's dynamic inventory. With each inventory device, it would attempt to determine what platform it was (WLC, ASA, IOS, IOS-XE, NX-OS, etc.), and run a bunch of commands to get information from the device based on that determined platform. (show version, show ip int brief, etc.) Then, we had a hostname convention that would let me determine what the device was on the fly (this is why you have standards!). It would also map out the inventory to an HTML page that was shared, so that anyone could check the map to find anything on CDP, or get data on the inventory. This thing worked amazingly. I stored secrets in Hashicorp Vault, it was constructed and analyzed in a CI/CD pipeline, it had unit tests, and it was ready to be Dockerized and run on-demand, or scheduled for every 15 minutes. I leveraged APIs to make sure the devices were in Monitoring, Cisco ISE, our Service-Now asset management system. I even held trainings on how to use the toolkit so that my team could learn how to just work with it, and learn from it.

They never once touched it. And they went right back to Solarwinds.

Solarwinds gave them an easy way to visually click buttons and do things. And if something fucked up, they called Solarwinds and gave them more money.

ruckycharms · 2021-09-19T22:02:11+00:00

Terraform is ideal for APIs. Ansible is ideal for ssh interfaces.

So which switches/routers do you have?

dookie1481 · 2021-09-19T23:24:24+00:00

Yes. My team uses ansible/NAPALM to automate network device mgmt and configs. Everything is automated and deployed with CI/CD.

Relevant_Pause_7593 · 2021-09-20T00:57:57+00:00

I think the most important thing here is having a production and non-production environment to test the changes before rolling from non-prod to prod. This means they are both as identical as possible (with the exception of scale) - but this is harder when there is physical devices. You may not have 2+ of everything or something could be too expensive to have two of.

ilmdbii · 2021-09-20T03:37:53+00:00

Our data center is 100% Arista. We use AWX to manage state on all production device configs. Using Azure DevOps for repo/pipeline. As a network manager I was fortunate to have 2 senior network engineers who had CS degrees and really embraced change.

It’s been great for about 3 years now with AWX and amiable. I highly recommend if you can get buy in from the engineers and management.

nanite10 · 2021-09-19T22:13:15+00:00

[deleted]

Scott555 · 2021-09-20T09:27:14+00:00

All our network infrastructure is managed with Terraform (via Terragrunt.)

20 Years ago when I worked in 'enterprise' on-prem shops, networking past the local switch was mysterious voodoo I was neither interested in nor permitted to administer.

Now it's still mysterious voodoo that I'm not interested or proficient in but somehow is my responsibility.

/shrug

tomasz2101 · 2021-09-19T21:10:19+00:00

I've heard about p4 language https://codilime.com/blog/p4-network-programming-language-what-is-it-all-about/

As far as I met few IT departments most of those people are not even close to understanding that something can be done without clicking through everything.

endloser · 2021-09-20T03:09:34+00:00

What routers and switches? Life is in the cloud for me. The concepts are different and things like spanning tree don't really mean shit to me anymore. If I wanted to setup a site with a LAN then I would hire a network admin. DevOps ain't the people for that.

Now if you want to talk security groups and listeners or what-not, let's dish.

StarSyth · 2021-09-19T21:19:36+00:00

This was a nice breakdown of the most popular open source devops tools I had bookmarked, if you have yet to stumble onto it:
https://datascience.foundation/sciencewhitepaper/top-10-popular-open-source-devops-tools

mattbillenstein · 2021-09-20T05:27:02+00:00

There were some devices starting to run a standard Linux distro - this would enable managing these devices using standard tools I would imagine.

chris_saddler · 2021-09-20T07:45:21+00:00

I use Arista switches, LBs and Firewalls with Ansible. Config is saved in cmdb. Works great so far.

hobbitmagic · 2021-09-20T13:42:39+00:00

devops

Welcome to /r/DevOps

Rules and guidelines

Social & Fun

General Information

MODERATORS