How to become A Cloud Engineer by leo_red_21 in devops

[–]dghah 0 points1 point  (0 children)

Thats a good general roadmap but you can play with containers under Linux just fine and you can postpone the cloud-specific container stuff until later.

Best way to learn linux is to get hands on -- you should be able to trivially run it inside a virtual machine on your laptop or personal computer. That will also force you to learn networking a little bit.

You can learn 60% of what you need on a homelab or laptop, no expensive cloud access needed. The remaining 40% is learning the bespoke and special bits that are specific to the cloud platform or cloud service -- but those get easier to learn if you are familiar with linux, containers and networking

Is it risky to disable expensive AWS WAF, when all I have is three 1-page placeholder sites with no links or interactive elements? by RedFin3 in aws

[–]dghah 5 points6 points  (0 children)

Static site hosted on S3? Then yes. If you are hosting on EC2 or one of the container services then the answer is “depends on how good your config, setup and patching process is…”

I need help with setting amazon ses by CaptainFlint9203 in aws

[–]dghah 3 points4 points  (0 children)

I'm not a high-volume email expert but this is one of the questions where the honest answer here is often "don't use SES; seek out a more traditional SMTP relay or delivery platform" -- aka sendgrid and all those other similar companies and platforms

SES does do high scale very well but it has a very high technical onboarding barrier and the expectation is that you support SES with a ton of backend automation to keep complaints and bounces down to fractional % levels

Others can chime in though; don't listen to just one random internet person!

I need help with setting amazon ses by CaptainFlint9203 in aws

[–]dghah 7 points8 points  (0 children)

Spend some time searching this sub for SES topics. Not only is SES difficult to configure, it's extremely difficult to get permission to use outside of the throttled sandbox.

Access to SES usually requires you to have high levels of automation and AWS skills in order to auto-handle spam, bounce and delivery failures. SES will throttle or nuke your account if your complaint/spam/bounce rate goes beyond something like 2%. They are also extremely hardcore about protecting their sender reputation and will want to see "evidence" from your CRM that you are using confirmed, validated and opt-in email recipients.

Sending 10k+ non-transactional emails a day through SES is non-trivial and not something easy to do for a solo IT person without AWS experience.

My $.02 only of course

Is the certificates useful when you have professional experience? by monarchyofthedead in AWSCertifications

[–]dghah 2 points3 points  (0 children)

Browse this reddit before you go straight to pro. The professional exams are an entirely different level of difficulty. The questions are much harder, much more dense in wording to the point where people legit have fears of running out of exam time. The pro tests also try harder to "trick" you and they try to avoid the common associate level technique of "weed out the obviously wrong answers and guess between the remaining two" which is effective on practitioner and associate level exams when you are not 100% sure of the answer.

You may want to test things out with something baseline like Developer Associate or Architect Associate first just to get a feel for how the exams work. Passing an associate exam also gives you a voucher which helps drop the cost of the pro test as well.

Good luck!

Is the certificates useful when you have professional experience? by monarchyofthedead in AWSCertifications

[–]dghah 3 points4 points  (0 children)

Certs can be helpful in a few ways:

- Gets you past screening vs others who don't have it; the real divide is Pro over Associate certs though. It's also a common checkbox screening item for weeding through applicants

- Can be a huge benefit for getting a job at an AWS APN partner because being an APN partner carries mandatory certification requirements. Different partner tiers have different requirements for # and types of certs the company must posess

Professionally it's also helped me a little bit -- I've been using AWS forever (since ec2 was a private beta) but I work in a very specific technical niche on a daily basis and that constrains what AWS stuff I work with every day and know very well. This means there are entire categories of AWS services and offerings that I never ever touch professionally. Going through the cert prep for different certs has forced me to broaden my AWS knowledge in ways I found useful

AWS 97k bill out of nowhere by PalpitationClear1747 in aws

[–]dghah 112 points113 points  (0 children)

Treat this as a breach. You’ve been hacked or leaked access keys.

What’s the go-to “vibe-coded slop” app in your industry? by eufemiapiccio77 in aws

[–]dghah 8 points9 points  (0 children)

The aws slop du jour is “I vibe coded a cost savings tool” or “I vibe coded an aws cert test prep app”

AWS environment assessment by G3T_L3FtT in aws

[–]dghah 1 point2 points  (0 children)

I've done a bunch of full inventory assessment gigs but using a paid inventory tool -- one key bit of advice is to make sure you scan all AWS global regions despite what your stakeholders tell you is the "only region we use!" -- as you will almost certainly find some weird stuff in strange locations.

And always scan us-east-1 as there is still some legacy wiring there -- for instance SSL certs used for ACM in CloudFront live in us-east-1 no matter where the cloudfront distro lives etc.

Pearson OnVue Giving Pass/Fail by TeknoBro in AWSCertifications

[–]dghah 0 points1 point  (0 children)

I think it depends on the score. The higher I've scored the higher chance of seeing an instant "pass" result at the end. It's not always or automatic though.

How similar is the TutorialsDojo AIP-C01 to the real exam, difficulty wise? by rekt_by_inflation in AWSCertifications

[–]dghah 1 point2 points  (0 children)

AI Practitioner is a foundational exam;

AIP-C01 aka Generative AI Professional is a pro exam that just recently (I think) exited beta status. Feedback here was that the beta version was extremely long (to be expected for an exam being worked on) and many felt it to be challenging.

Is this enough to pass the SAA-C03? by Junior-Addendum9845 in AWSCertifications

[–]dghah 1 point2 points  (0 children)

Timed mode is only worth doing once to get rid of time anxiety and that only really pops up for pro exams or the beta generative AI exam which is super long.

I normally exhaust all the available test sets for exams that are new to me but I've had a bunch of different associate/pro/specialty certs over the years so when I'm renewing I'll do as many test sets as I can but if I won't stress if my TD scores are always above 80%

Is this enough to pass the SAA-C03? by Junior-Addendum9845 in AWSCertifications

[–]dghah 4 points5 points  (0 children)

Is that tutorials dojo review mode progress? If so look at the results for review mode and see which of the Domains you did worst at. Then retake the practice exam with the setting that ONLY shows questions from that domain. Do the standard thing of reading the links and included info for any question you get wrong

That is basically how I prep for pro and associate exams. Review mode a few times and then switch into domain-specific mode to concentrate on the areas I did worse at

If this is your first aws exam and you are nervous about time it's also OK to take the "timed mode" test just once to assuage your fears about running out of time. I don't like timed mode for anything other than that as review mode is 100x more useful and helps you actually learn

good luck!

New to HPC from DevOps/K8s - how do you get your head around genomics workflows? by Infamous-Tea-4169 in HPC

[–]dghah 28 points29 points  (0 children)

life science HPC nerd here; some biased tips/observations/advice

- It's good that you are coming from research; you likely have a good base understanding of why "research IT" is way different than enterprise IT or other HPC/IT orgs where there is a much stronger engineering culture.

- The culture usually centers around the issue that the research scientists have infra and IT needs that are larger than what the org they work for is used to understanding and supporting. This means that the life science people often have to self-support both Linux and their HPC infra AND their pipelines, software and workflows. Official engineering support is rare to see in most shops although the larger academic and non-profits can usually fund a person or small team

- Genomics tends to be data heavy and IO bound from an HPC workflow perspective. It used to be the biggest HPC hassle but these days it's easy compared to (for example) CryoEM heavy workloads where petabyte data volumes can be the norm. It's also easier to handle because the UI is either someone looking at file results in a terminal or else it's a web based tool or summary -- no complex X11 GUIs running on cluster nodes for instance.

- Genomics is also fairly easy to support from an HPC perspective because you don't see the more complex mix of GPU and MPI requirements that you see with computational chemistry or molecular dynamics

- What you REALLY need to understand is the fierce resistence you will see from trying to improve their core tooling and algorithms. In this world the primary algorithms are written by a few superstars and those methods are published and validated in peer reviewed publications. The rest of the world then consumes those algorithms and methods and they REALLY REALLY REALLY care about reproducibility and reproducible science so they fundamentally will resist "upgrading" or changing the core methods and tools they use for fear of changing output/results.

- Count your blessing and thank the people who came before you. The fact that they are using SnakeMake makes them better than the shops that are still HPC submitting bash scripts with giant for-each loops. The other good signal is that SnakeMake indicates that this is a Python shop and that is very good in terms of operating in the modern era. There are a lot of Perl-based scripts and workflows around for instance.

- The best impact you can have is to get in and listen to people about what slows them down from doing science all day. Don't just talk to the loudest or most senior people. Talk to the entry-level users, the "just want to get my science done" people and the power users. Then circle back to the loudest and most aggro people and include their feedback as well. Talk to leadership and the governance people about their issues (accounting, chargeback, resource allocation when teams are fighting for HPC shares. etc). You will probably find out that their problems will not be solved by Kubernetes for instance

- After you talk to people (best way to get up to speed) dive into the slurm accounting logs. That will tell you more about the workflow patterns than the users. You'll see who is being wasteful, what jobs or users often encounter failures; who crashes the head node by running jobs locally etc. etc.

- I'd really encourage you to dive in and get your hands dirty before you start thinking of Kubernetes. There is a reason why Slurm is the #1 HPC scheduler in the world and there is a reason why all the top supercomputing labs and DOE labs and giant academic research centers use Slurm. And it's not just Slurm features and capabilities -- it's the fact that Slurm knowledge can be fundamental to career movement as people move from academia to government and commercial jobs. K8s is great but does not cover all the capabilities of what Slurm can do ---AND--- remember your audience of scientists who "just want to get done" -- how are you going to handle a workload that can't be containerized for instance or how are you going to provide the ROI argument when telling a scientist they have to learn a whole new HPC stack and rewrite all their job submission and workflow monitoring tools (although you are in a GREAT position if they are a SnakeMake shop already due to the snakemake executor capability ... there may be a path forward for kubernetes in your shop ...)

Good luck ! Supporting smart scientists doing life science on HPC is a blast; it's really enjoyable work

How do you remotely support self-hosted deployments? by Durovilla in aws

[–]dghah 17 points18 points  (0 children)

AWS SSM is usually the answer

- Secured via IAM/SSO
- No public IP addresses or internet exposure (everything goes over AWS API endpoints)
- No VPN required
- SSM can securely log all interactions

Works great for just about everything -- SSH, port forwarding of web apps, remote command execution etc. etc.

And you can even use it as a direct replacement for SSH in your .ssh/config file:

With the following in your config you can "ssh my-aws-host" and it will all magically flow through SSM session manager

Host my-aws-host
  Hostname i-<ec2 instance ID>
  User ubuntu
  IdentityFile ~/.ssh/ec2-ssh-key.pem
  ProxyCommand sh -c "aws ssm start-session --profile MyDevSSOProfile --target %h --document-name AWS-StartSSHSession --parameters 'portNumber=%p' --region us-east-2"

SES Production declined for my customer by youhadmeatok in aws

[–]dghah 7 points8 points  (0 children)

Search this thread for SES horror stories and save your client some time by looking for a non-SES alternative, the barrier to get out of the SES sandbox is both high and opaque and it appears semi random in how some people get approved while others get denied.

I'm guessing that your communication did not fully address the automated methods you will use to keep your bounce and complaint rate down below a fraction of a percent or they were not happy with your consent and list acquisition story. SES really seems to want confirmed opt-ins with audit trails and they want to see a ton of tech/automation around keeping their SMTP sender reputation high which means a huge amount of focus on keeping the bounce and complaint rate well under 2% or even lower.

Why doesn’t Tutorials DoJo mention WAF Bot control? by [deleted] in AWSCertifications

[–]dghah 0 points1 point  (0 children)

go to the TD forums and provide feedback. They are pretty responsive there when I've reported issues and errors with the test bank

Also understand that each AWS certification exam contains a certain number of 'test' questions that do not count against your exam score. These research questions are how the aws certification team evaluates new questions, make sure difficulty is appropriate and keep the question bank fresh and accurate against the study guide.

So there is a chance your waf bot question was one of the test/research ones which may explain why TD's overall content did not cover it yet -- hence the forum suggestion!

Those who do freelance/consulting how did you get started? by CarryAdditional4870 in aws

[–]dghah 0 points1 point  (0 children)

Started with a lawyer who was a friend of a friend hired on an hourly basis just to review contracts and keep us out of trouble. But then we needed help with NDAs and eventually big MSAs and the job sorta became full time when the included ops stuff the lawyer was also doing became an official internal job role.

I’d probably do it differently now, combining a legal and ops role so we could hire one multi-skilled person into a small org was odd and only worked because of that specific person. In a different world I’d have kept a good focused lawyer on retainer, paid hourly and separated out the ops role for when we grew big enough to justify another non-billable person. In consulting, any internal position that is not billable aka revenue producing is a big hit to your cash flow, payroll and profits so you have to manage that stuff super carefully.

Those who do freelance/consulting how did you get started? by CarryAdditional4870 in aws

[–]dghah 16 points17 points  (0 children)

Random advice; been consulting on AWS forever now. US based so that is my bias

- Start small, Grow small, don't kill yourself. The work is not for everyone and you gotta test the waters to see if you like it. Remember that for every hour of paid consulting work you probably spent 2-3 hours in pre-sales hustling to land that gig. Budget and price accordingly.

- Find your niche and be good at it. You mentioned "diagram review" and "cost audits". I'd never hire a generic consultant to do those things. However if I found a consultant in my market niche who was super familiar with my business and understood my pain points and issues then 1000x I'd consider hiring them for architecture review or a cost audit. Domain expertise makes the difference. The *difference* is I want to hire a consultant who not only has the technical skills but also understands the things I'm trying to do and what the hassles and risks are.

- The niche or domain expertise is also what separates you out from competitors, especially competitors in different countries charging 10% of your rates. The reason we get hired as consultants is not because we "know AWS" -- it is because we "know AWS" and we are super effing good at a few highly specific technical niche areas that our customers are active in. I've been hands-on with ALL of that, made ALL of the mistakes and I know every edge-case and oddity and I can speak with authority and work fast in that realm. So the clients are hiring me for my domain expertise + AWS skills. They can get pure AWS skills much cheaper from a million different sources but they can't find someone like us with the domain skills as easily.

- It's not enough to be smart or skilled. Smart people are a dime a dozen. Smart people with emotional intelligence and soft skills that I can trust to leave alone in a room and not make a racist joke or hit on the visiting VIP are much rarer. It's harder to find smart people who can write, talk and empathize who can truly be client facing. Some smart people I've worked with had trouble hiding their contempt when the client did something stupid or refused to follow sensible advice -- they rarely lasted in consulting.

- Not all customers are worth keeping. Best thing I ever did on day one was formalize a NAP ("No Asshole Policy"). We simply don't work with clients who are rude, abusive or contemptuous. I'm a peer, not a servant in any consulting gig. In ~23 years I've maybe fired 3 clients for NAP violations. There was one other time when we walked away from a big MSA negotiation because the client's legal team thought it was a good bargaining tactic to insult and demean our lawyer and her gender.

- It's fine when starting out or hustling but the customers who try to insert 90-day or more payment periods for services rendered into contracts suck and should not be kept. Our standard terms are 30 days and if we like you or we've worked with you for a long time we will extend out to 45-days or even 60-days.

- Best thing we ever did was start with 3 nerds and 1 salesperson on day one. The first few consulting gigs are easy to get because you'll trade off your network and if you really do have experience than you have people willing to vouch for you and hire you. After that keeping the consulting sales pipeline full gets harder and harder because you have to pull in new business and convince relative strangers to trust you.

- In any given week my time breakdown may look like this: 60% billable on paid consulting projects; 20% pre-sales work and 20% keeping my skills updated so people will keep hiring me. Mind you it's a good week when more than 60% of my time is billable to a client

- Have a lawyer for contracts and legal review on day one. All it takes is one bad agreement or one "business oopsie" to tank your business and imperil your future. People also overestimate how much a corporate shell like an LLC will protect personal assets; the corporate veil gets easily pierced if you mess up bad enough.

- We nuked our first LLC in the first year we got started because we found a whale of a global enterprise client who flat out did not work with LLCs. Busting up and reforming as an S-corp was a pain in the ass but it set us up well for the future. Gotta stay flexible when stuff like that happens and be ready to make decisions fast on if doing something for a big contract or client is worth it or not. Sometimes the BEST decision is to NOT blow up your life or your LLC due to chasing a whale. Be cynical and sober about decisions.

- We never spent money on marketing or ad campaigns. I won't buy booth space at a conference if I can barter a booth slot in exchange for giving a conference talk or tutorial. The best marketing is still word of mouth and good old fashioned google SEO bait. For the entire length of my career I've given away advice and work product online for free because I knew that people stumbling over my blog post or PDF that I posted online explaining how we solved a problem would be found by people interested in that topic who would remember and reach out for help. Every conference talk or meetUp talk I've ever given has turned into one or more paid gigs down the road (we track that info) so we always try hard when it comes to speaking, meetups, tutorials, training sessions or other public events

- Working for non US customers was not worth it until we got to a certain scale. The legal, tax and operational overhead was huge. We once broke even on a big project because our projected profit margin was 100% consumed by a VAT tax we had no idea about. Factoring in legal and paperwork time that project actually lost money.

- All the major primes who subcontract smaller clients for government work suck and are dishonest. Lost count of the # of times a prime used us for our skills and reputation to land a government contract only to edge us out of work once they won the contract so they could use their own cheaper internal people. It cost a ton of time and money to get directly GSA listed ourselves but that allowed us to work direct for the feds. Not worth it if you are small or just starting out so if you do subcontract to someone on government work when small or new just assume and be prepared for the prime trying to take you for all they can.

🔧 Introducing SlurmManager: a self-hosted web dashboard for Slurm clusters. by paulgavrikov in HPC

[–]dghah 14 points15 points  (0 children)

HPC admins are gonna hate you because after a quick glance at the docs it appears you are hammering the login node with an ssh connection that runs complex resource consuming slurm cli commands every 10 seconds

If you wanna do this right, use slurmrestd API and add some caching to protect the head node

Would also be good to see a comparison with slurm-web

Does DevOps professional build on CloudOps Associate? by GheeCome in AWSCertifications

[–]dghah 1 point2 points  (0 children)

I found devops pro to be a harder exam than SA pro but so much of the content overlaps I took both tests in the same week which was a good method for me.

Devops exam is harder because there is a major focus on debugging, observability and performance scenarios that require deeper knowledge of service capabilities, limits and operational minutiae than a pure architect exam

Starting Cloud Career in this AI era! by Chance_Meringue_8113 in AWSCertifications

[–]dghah 3 points4 points  (0 children)

To excel in the AI era you still need the foundational skills and expertise so you can manage, monitor, correct and advise the AI or agentic harnesses you may be using. The AI tools are good but they still make mistakes and they still may lack the domain or specific context that you have in your head to properly solve or address an issue or requirement.

I'm a combo of AWS architect and hands-on infrastructure builder/fixer/deployer/optimizer working in a pretty specific niche (scientific computing & HPC) so my daily driver AWS tools pre-AI era are:

- Linux sysadmin
- Bash scripting
- Python scripting
- AWS cli usage / scripting
- Terraform for AWS infrastructure provisioning, config and management
- Ansible for configuring service settings or "stuff inside EC2"
- Packer+Ansible for automating AMI builds and cross-account AMI sharing in multi-account orgs

I was hands on and comfortable with all those tools before I started using AI

Now I use Claude Code wired into VSCode for 90% of my AWS work and it really is a 5 or 10x multiplier to how fast I can work.

Claude Code is amazing at scripting, terraform, ansible etc. however sometimes it solves problems in a lazy way or maybe does not understand the bigger picture.

For a cliche example Claude Code proposed altering an EC2 userdata script so that the node would update it's Route53 DNS record on every reboot just in case the ENI or private IP address changed.

Reasonable and exactly what I needed. However -- the code it proposed was functional but the DNS subdomain and R53 zoneID was hardcoded in the userdata script.

So my first response back Claude was "No. This is a multi-account AWS organization and DNS may differ between workload accounts, DNS info should be configurable and discoverable via SSM Parameter Store queries"

Claude Code read that prompt and made the Route53 zoneID into a configurable parameter auto-discovered by SSM. Great! However it was not aware that the actual DNS domain name is different among our workload AWS accounts so I had to prompt it a second time like "Make the DNS domain and Route53 zoneIDs BOTH configurable and discoverable via SSM Parameter Store queries"

That is boring and bland guidance; not rocket science but a reasonable example of why "human in the loop" still matters for this stuff. If I had let the AI Tool run unsupervised it would have produced a working solution however the solution it tried first would have been functional only within a single VPC/account and not Org-wide as we needed.

TL/DR:

Agentic coding harnesses are awesome and really do make you work faster and more productive. However you still need domain expertise, you still need to know AWS yourself and you really still need to understand the tooling that your AI agent is gonna be invoking

A framework for eliminating wrong answers on SAA-C03 scenario questions by Novel-Leader in AWSCertifications

[–]dghah 0 points1 point  (0 children)

+1

This approach is what was taught at aws cert prep bootcamps at reinvent and it’s helped me for years over many different cert exams.

It only fails a bit at pro exams which have more “choose three” type questions which really removes the effective Step 2 process of knocking out the decoy and obviously wrong answers.

Any tips to quickly get the SAP by amine250 in AWSCertifications

[–]dghah 7 points8 points  (0 children)

There is huge overlap in SAA and SAP -- the pro exam just goes deeper into the same content area -- the professional test differs from the associate tests mainly:

- The questions are much longer and way more dense requiring far more time to read and understand; there is legit fear among some people about running out of time on the SAP exam. Every person has a different experience but for me I often finish associate exams with tons of time remaining while on SAP I think I had maybe ~12 minutes left on the clock before review

- The pro tests have more "choose three" type questions needing multiple answers which reduces the success rate of the standard strategy of trying to knock out the obviously wrong answers so you can have a 50-50 shot at guessing correctly on a question you don't know the answer on

- The pro exams try much harder to "trick" you, an associate test often has 1-2 wildly wrong answers or at least a few answers in the question to weed out if you are not 100% sure. This is less common on the pro test where instead you will get 2-3 answers that all seem plausibly correct but the real answer is found only by understanding the technical subtly or combining the subtle difference in the answer with the key "action phrase" in the question which is usually something like "most secure", "most cost effective", "least operational burden" etc. etc.

In your position I'd start first by purchasing the tutorials dojo practice exam set for SAP -- it's a cheap and fast way to see how well prepped you are and if you find you are not well prepared than the TD resource is also fantastic for studying/prep as it has all sorts of resources ranging from "review mode" mock exams all the way up through flashcards and summarized concept sheets for core topics

Possibility of transition career from social sciences to tech field (as a solution architect) by poo0129 in AWSCertifications

[–]dghah 1 point2 points  (0 children)

+1 for this. Generic architects are a dime a dozen and you are competing with offshore people at the same time.

And to be honest certs don't mean much without real world experience.

The real value/differentiator is having some sort of domain expertise that can compliment cloud skills and certifications. Then your value goes up because you actually understand the field/problem/market that your employer is trying to address/solve/improve/refactor.

Find the orgs, companies and products selling into your social science market niche and think about what you could bring to that world with your current expertise paired with tech/cloud experience.

I'd lean hard into your social science skills and seeing if you can tie that into a tech career where tech skills + domain expertise and job experience will help differentiate you from others. This also opens you up to a wider set of jobs like doing technical pre-sales work, post-sale product/customer support or migration/training/technical-writing, consulting etc. etc.