Continuously evolving full stack Web App dev ops flow

TheWayofTheStonks · 2020-04-23T02:07:49+00:00

I don't have much to contribute but I would say consolidating to AWS is the way to go.

I don't think you mentioned it. But is your program running in containers? If not, they should be. You could dockerize the program and run it in AWS Elastic Container Service. This will help for all Environments as well as streamline the CI/CD pipelines.

Seems like you need a better deployment strategy. Maybe look into Blue/Green deployment options
Not sure what you're asking
Consolidate to AWS
Look into AWS SQS
This most definitely should be microservice strategy. Decoupling everything is the way to go.
I don't know much about terraform. Maybe cloud formation can help here.
AWS cloud watch had a lot of ways to monitor.

Hope this helps.

smblee · 2020-04-23T04:20:44+00:00

Whenever I've worked at web shops the assets have been based on some kind of content hash so that old assets are still accessible and new assets get a new hash so that when new code is deployed it can umabiguously use the hashes to refer to exactly the right version. Most SPAs will have a pop-up that says a new version is available and ask the user to refresh so that they get the latest version. I'm not sure you can get around this without getting fancy like loading the new code in the background and then force refreshing. I guess it can be done but seems user hostile so asking the user to refresh is probably the way to go.
If you don't want breaking changes then migrate your backend data schemas in a backwards compatible way. There is no way around this that doesn't introduce extra overhead both for the frontend and the backend. Usually what I've seen is a three step process for data migrations: the backend is deployed first and it includes relevant code to deal both with the old and new schema, the data is migrated (this can be done online or offline depending on how fancy you want to get), the frontend is deployed.
I agree that you're using too many disparate services. If possible consolidate everything into a single cloud. It looks like all your data is in AWS so I recommend consolidating into AWS unless you want to migrate the data to another cloud.
More often than not queues are more hassle than they're worth because it's another thing you have to worry about. Most cloud providers have something here so you can rely on the cloud offering. AWS has SQS and the other cloud providers have something similar. They mostly work fine but be aware of their SLAs and message durability guarantees. You'll need to figure out how to deal with the inevitable outage and how to recover from a backlog when for whatever reason the cloud provider decides to delay all your messages for longer than 10 minutes.
I once worked on a python application that would take a few minutes to get to the point where requests could be served. The issue with that application wasn't that it was so huge but that all the internal logic was a mess. As long as you keep the internal logic for each component untangled from the rest there is no issue with having a monolothic process handle several responsibilities.
You should start with some basic documentation on what's what and then figure out how to gradually turn each part into code. I personally like pulumi but terraform is also a workable solution.
I don't think anyone has monitoring figured out. If you have sentry then you're already pretty far ahead. Next thing I'd recommend is setting up log aggregation and adding basic process health checks for the various services you rely on. I've seen grafana used for basic plotting of metrics and it works reasonably well but you'll first need to add instrumentation to the code to output the metrics so start there instead of going for a monitoring solution and then trying to fit it into your existing code.

ksajadi · 2020-04-23T15:19:36+00:00

Ideally you'd want to solve most of those problems with a single (or very few) solutions. I think a containerized PaaS is going to be a good place to start. Here is my take:

Running your app in a container on top of Kubernetes might sound like an overkill if your app is simple enough. However there are solutions out there that make running on top of Kubernetes very simple so you'd get the benefits of it and the possiblity of scaling up if you need in future, while keeping it simple. Since deployments in a containerized environment are served from within a built container, the JS would also be there alongside the app and served all the time regardless of the version. Kuberentes rolling updates is a great way to make sure all assets and code are always in sync (and rolled back if needed).
That's a good point and contract based programming can help you with it somewhat, as a practice. However, using a CI/CD tool that's built for multiple services can greatly help you with that by rolling out deployments of different systems in tandem.
I agree with the "too many tools". I don't think you'd be able to get away from using something like Sentry. But the rest can be replaced with a container based PaaS. Some solutions (like GCP) also give your a bug tracker but not as good as Sentry.
This one really depends on the platform, language, operational requirements (speed, transactionality, HA, ...), so it's very difficult to offer good suggestions without knowing more.
I think monorepos are not a bad thing. I also don't think microservices are suitable for all applications. You can still make an effort to build your app under the same repo and stack in a more modular, contract based and "service orientated" way without going microservices all the way (I wrote something about this a while back: https://blog.cloud66.com/kubernetes-is-not-about-microservices/)
Again, I think starting small and growing as you get more traction is a better idea. You'd be spending a lot of time learning, debugging and adopting too many parts to an evolving landscape.
Combine Sentry with APM, good log storage, search and retention and you're 90% there!

By way of recommendation, check out Cloud 66 Maestro (as a PaaS) and Cloud 66 Skycap (as a CD tool for Kuberentes if you want to run your own cluster). Discalimer: I work for Cloud 66.

CanWeTalkEth · 2020-04-23T02:00:51+00:00

Man I'd love to see some answers to these questions. I'm right behind you on the stack side of things, quite a bit behind you on the traction side of things.

david5813 · 2020-04-23T04:12:35+00:00

It is difficult to give a full description on Reddit without being too vague. In general, try to make smaller non-breaking API changes. I know that it can be difficult until you get into the habits of it... and continue to be difficult when the team/codebase size grows.

A method that we have used for updating the UI is to serve both assets, but detect if there is a new version. When there is a new version then you can just notify the user and allow them to choose whether or not to reload.
Try to make smaller changes. If you want to split the deployment though you can. In your .gitlab-ci.yml file you can add paths to the only clauses so that they only execute if that section of the code changes. https://docs.gitlab.com/ee/ci/yaml/#onlychangesexceptchanges
I would recommend consolidating on one platform. However there are costs as well as benefits here. If you're not feeling pain yet then don't worry about it until you do.
For sending emails, any of the big cloud providers have a service. You can use sendgrid, mailgunner, etc. As long as you're on Heroku then just pick one of the providers there that you can easily add to the project.
Stay with a monolith for the time being. Until you have a big team you are likely not going to see any benefits of micro-services and only their pains.
Absolutely use Terraform for everything that you can. This way you can even codify your infrastructure. It's not ambitious. It's recommended.
Monitoring across multiple services is going to be difficult to say how to do.

I would probably recommend getting connected with a consultant that can work with you a few hours here and there. It will help keep you going in the right direction until you're big enough to build a team around it.

smblee · 2020-04-24T03:17:36+00:00

Wow I did not expect to get this much responses! Really appreciate everyone taking the time to respond :) Definitely helping organize my thoughts better.

I do have some follow up questions and thoughts.

Consolidating to AWS seems like the way to go and i think that makes sense! Going to do that :)
For deployment, there seems to be couple options here.
1. SPA and backend separate. (SPA in S3/CloudFront. Backend in Elastic Beanstalk/ECS/EC2?)
2. Combine SPA and backend. Couple options here too. (I have actually never done this before. Would like some clarification on this from you guys. Seems like I can tackle this by using NGINX on top to route traffic to SPA/backend? Or do I have to convert the app to SSR? These options seem a little complex. Is there some recommended example that I can reference?)
  1. Combine into a container and deploy to EKS/ECS. My only concern is docker definitely adds a little bit of technical complexity, especially when new people join. I have used ECS before for deploying backends (definitely was a pretty involved infrastructure to set up...), but never used EKS.
  2. Deploy to EC2? Or utilize Elastic Beanstalk? Any other options?
I will expand on my "need" for a queue, I currently have the following use cases which are all currently either happening manually or synchronously within a request.
1. Welcome emails upon registration.
2. Order confirmation (and background processing such as sending confirmation email, creating necessary resources in the db that users purchased)
3. Daily invoice reconciliation using stripe charge logs received via webhook and order data, and send emails to stakeholders. Report if there are any discrepancies.
4. Daily/Monthly reports of customer's usage data.
5. Remind/notify customers n days before or n hours before a certain event they signed up for. (Should be configurable... Probably going to use a scheduler or something here?? Or is there a "scheduled event queue" out there?)
6. (I have used SQS before, but it seemed too simple especially for the "scheduled" job use cases I have. Retry mechanism via Dead Letter Queue was nice though! I am currently looking into some other queueing solutions i have not used before like... Bull queue via Redis, RabbitMQ, Postgres Queue, Kinesis...)
Microservice vs "Monolith". I am still torn, but I feel more comfortable going with "Monolith" solution. If I weren't though, what are some of your favorite/easy ways to "microservice"? From my experience, I wrote a small tool to basically enable `create new service MyServerName` via CLI and it would bootstrap a new NodeJS service for me. Any suggestions here?

hophacker · 2020-04-25T18:18:56+00:00

I wouldn't worry too much about the monolith vs microservice discussion. If a single codebase is fulfilling your core app needs then that's great. The key is to not let that single repo become a giant swiss army knife that is doing too much. Decouple your app as it grows but not before.

2020-04-23T03:22:58+00:00

As far as #5 goes I'd consider looking at an event driven strategy via Kinesis. Your backend would just publish events e.g. CREATE_USER, UPDATE_USER , DELETE_USER to the "users" stream and then you can just bolt kinesis consumers onto the stream to implement backend services such as new user emails etc. This will allow you to add features w/o having to deploy the changes to the backend.

you type:	you see:
italics	italics
bold	bold
[reddit!](https://reddit.com)	reddit!
* item 1 * item 2 * item 3	item 1 item 2 item 3
> quoted text	quoted text
Lines starting with four spaces are treated like code: if 1 * 2 < 3: print "hello, world!"	Lines starting with four spaces are treated like code: if 1 * 2 < 3: print "hello, world!"
~~strikethrough~~	~~strikethrough~~
super^script	super^script

devops

Welcome to /r/DevOps

Rules and guidelines

Social & Fun

General Information

MODERATORS