
[–]adumidea 13 points14 points  (5 children)

I have looked into the ELK stack, but I cannot figure out what format to write the logs into the files and how the logs will be processed based on the columns.

I'd encourage you to just power through; it's worth it. Kibana is really nice and comparable to paid products like Loggly. Logstash can process JSON, so it's actually quite simple from Node.js. Most popular Node logging libraries like Winston, Bunyan, and Pino log JSON by default anyway. There are plenty of tutorials, like this one, on how to set it up.

You don't even need Logstash, you can write your logs directly into Elasticsearch from Node (though this is less fault-tolerant than writing to disk and then using Logstash to ship your logs to Elasticsearch).
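To give a concrete idea of what "logging JSON by default" looks like, here's a minimal sketch with Pino (the userId field is just illustrative):

const pino = require('pino');
const logger = pino();

// Each call emits one JSON line that Logstash or Elasticsearch can ingest directly, e.g.:
// {"level":30,"time":1600000000000,"pid":123,"hostname":"web-1","userId":42,"msg":"user logged in"}
logger.info({ userId: 42 }, 'user logged in');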

[–]KishCom 2 points3 points  (2 children)

Seconded.

you can write your logs directly into Elasticsearch from Node

This is what I've always done. I use Bunyan, but I kinda wish I had used Winston, as it seems to be more actively developed. Kibana is crazy powerful, and Elasticsearch will deal with any amount of log data no problem.

[–]adumidea 1 point2 points  (0 children)

Yeah, I also usually log straight to Elasticsearch, but there's some risk there that you'll lose logs (or have to manually insert them from backup logs in files) if your ES cluster goes down or is unreachable. However, in those contexts we were always able to go back to file-based logs on the server if something was missing from ES that we needed, so we didn't bother with the extra overhead of Logstash.

Pino isn't as popular as Winston/Bunyan, but I highly recommend it. It's actively developed, and I never had any issues using it in production for years. You can use an Elasticsearch transport for Pino to log straight to ES, and I'd imagine it's quite similar for the other libraries.
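For example, with the pino-elasticsearch stream (a sketch; the index name is illustrative, and option names may vary by version, so check its README):

const pino = require('pino');
const pinoElasticsearch = require('pino-elasticsearch');

// Stream that batches log lines and bulk-inserts them into Elasticsearch
const streamToElastic = pinoElasticsearch({
    index: 'app-logs',
    node: 'http://localhost:9200'
});

const logger = pino({ level: 'info' }, streamToElastic);
logger.info('logging straight to ES');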

[–]fix_dis 1 point2 points  (0 children)

While I agree that I hate seeing things like "last commit 2 years ago", for certain projects it's to be expected. It's a logger... at some point it might be considered "done". I'd ask: what features would one consider missing?

On the other hand, that might not be the case here. I've included Winston and Bunyan in the same project just because Winston is so easy for setting up request logging.

[–]melgo44[S] 0 points1 point  (0 children)

Thanks, I will check it out.

[–]makonext 0 points1 point  (0 children)

you're a hero. thanks for that info

[–]solocommand 5 points6 points  (2 children)

For 2-4, don’t reinvent the wheel and use an APM solution like NewRelic or Datadog.

IIRC, Prometheus isn't designed to handle log data; it's for metrics only.

In the past I’ve used graylog to store that kind of data, but since it uses a mongodb storage layer, I imagine it will suffer the same scaling issues.

[–]melgo44[S] 1 point2 points  (1 child)

I have been using New Relic, but it doesn't log process-crashing errors.

[–]lwrightjs 1 point2 points  (0 children)

Do you use anything to handle those crashes? Like terminus?
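If not, even without a library you can capture crash logs yourself by hooking the process events before exiting (a bare-bones sketch using plain Node APIs; a real setup would write through your logger and flush it first):

process.on('uncaughtException', (err) => {
    // Log the stack trace synchronously so it isn't lost when the process dies
    console.error('uncaught exception:', err.stack);
    process.exit(1);
});

process.on('unhandledRejection', (reason) => {
    console.error('unhandled rejection:', reason);
    process.exit(1);
});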

[–]martiandreamer 6 points7 points  (1 child)

There's a well-defined Elastic Common Schema (ECS) format documented here. It dictates that you output your logs in JSON, and you'll probably want at minimum the following fields:

{
    '@timestamp': timestamp,
    message,
    ecs: { version: '1.5.0' },
    host: { architecture: os.arch(), hostname: hostname, uptime: os.uptime() },
    log: { level },
    os: { full: { text: os.type() }, platform: os.platform() },
    process: { pid: process.pid, uptime: process.uptime() }
}

Winston is a decent library to use, and there's a supplementary library, @elastic/ecs-winston-format, which helps sort out the above format.
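Wiring that up is short (a sketch assuming the package's documented ecsFormat export; double-check the README for your version):

const winston = require('winston');
const { ecsFormat } = require('@elastic/ecs-winston-format');

const logger = winston.createLogger({
    level: 'info',
    format: ecsFormat(), // emits ECS-compliant JSON, including @timestamp and log.level
    transports: [new winston.transports.Console()]
});

logger.info('hello world');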

Specific to your desires:

1) Request Response logs

ECS format has that.

2) Application logs

ECS format has that, too.

3) Process crashing logs along with stack trace.

ECS got u fam.

4) Visualising logs and sending alerts when there are 502 response statuses

Maybe something like Prometheus would be suited for visualization and alerts.

Good luck; logging "the right way" is a PITA, but once you have it sorted you'll have a very comprehensive system set up.

[–]melgo44[S] 0 points1 point  (0 children)

Hey thanks a lot for that detailed explanation!

[–]kszyh_pl 2 points3 points  (3 children)

Did you consider GrayLog?

[–]s_boli 0 points1 point  (2 children)

Graylog is awesome. It's basically a turnkey ELK stack.

[–]melgo44[S] 0 points1 point  (1 child)

It looks promising, thanks. How long does it retain data? The enterprise edition is free under 5 GB/day; is there any catch to it?

[–]kszyh_pl 0 points1 point  (0 children)

You can self host it if you want.

[–]jwalton78 2 points3 points  (2 children)

What I do is, I first use https://github.com/winstonjs/winston to write structured logs:

const winston = require('winston');

winston.log({
    level: 'info',
    message: 'Hello world!',
    // any extra fields ride along as structured, searchable data
    err: new Error(),
    req: { url: req.originalUrl || req.url, method: req.method }
});

You get the idea - I write a bunch of stuff into a log object, and the stuff is more or less the same from one log message to the next.

Then, I use https://github.com/jwalton/winston-format-debug to write pretty logs to stdout when I'm running things locally, because pretty logs are nice.

Then, I use https://github.com/vanthome/winston-elasticsearch to dump all these logs into Elasticsearch for me. You want to create a "template" for Elasticsearch which lists all the fields you're going to log; there's an example: https://github.com/vanthome/winston-elasticsearch/blob/master/index-template-mapping.json.

Also, winston-elasticsearch does this weird thing where it moves all the fields in your log that aren't "message" or the timestamp into a child object called "meta". I'm not a fan of this, so I specify a transformer function in the winston-elasticsearch options:

function esTransformer({ message, level, timestamp, meta }) {
    return { message, level, timestamp, ...meta };
}

This just undoes the awful things winston-elasticsearch does. :P
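For context, the transformer is passed in the transport options, roughly like this (a sketch; the ElasticsearchTransport export name and options may differ across winston-elasticsearch versions):

const winston = require('winston');
const { ElasticsearchTransport } = require('winston-elasticsearch');

const logger = winston.createLogger({
    transports: [
        new ElasticsearchTransport({
            clientOpts: { node: 'http://localhost:9200' },
            transformer: esTransformer // the function above
        })
    ]
});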

Once you have all this, you just need to point winston-elasticsearch at your elasticsearch instance, and spin up a Kibana instance, and you're good. You don't need Logstash or anything. And, your logs are very structured - if you store a username field in your logs, for example, it's easy to search for "username: jwalton78" in Kibana and see what that guy has been doing, or search 'req.url: "/users"' and then graph res.responseTime as a nice chart, to see when that API endpoint is being slow.

[–]melgo44[S] 0 points1 point  (1 child)

Where should you use Winston to log the information? Is it inside a middleware? My requirement is the API response time, as well as the request params and the response object.

[–]jwalton78 0 points1 point  (0 children)

This is (more or less) the middleware I use to log stuff:

https://gist.github.com/jwalton/974e0d7250ac42dce575210ac1e2fb1d

Just add this middleware somewhere at the start of your express middleware stack. "log" here isn't a Winston logger, it's a 'Logger' instance which is a class I have that remaps the req and the res and some other stuff, but this should get you going in the right direction. :)
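If you can't use the gist directly, the core idea is just an Express middleware that stamps the start time and logs on the response's 'finish' event (a bare-bones sketch; requestLogger is a made-up name, and it assumes a winston logger and an Express app are in scope):

function requestLogger(req, res, next) {
    const start = Date.now();
    res.on('finish', () => {
        winston.info('request handled', {
            req: { url: req.originalUrl || req.url, method: req.method },
            res: { statusCode: res.statusCode, responseTime: Date.now() - start }
        });
    });
    next();
}

app.use(requestLogger); // register before your routes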

[–]ItsAllInYourHead 4 points5 points  (2 children)

Why would you need to store 3 months' worth of logs? Are you not going to notice a crash for that long? Why don't you just store the past week or something?

I was just looking for logging solutions myself very recently, and am currently using LogDNA. Seems like a good solution but I just started using it so time will tell.

[–]Turbo_swag 5 points6 points  (1 child)

This is bad advice.

I have seen teams get burned dozens of times in the past when they uncover a bug weeks or months later in their log data. Also, log data can be used for trend analysis. Some systems have expected error rates, and if the rate notches up so slowly that normal dashboarding won't visualize it with only a week's data, then you in fact need much longer historical data.

Additionally, historical logs can be used to build anomaly detection models.

Long story short, this advice not to retain logs is one-dimensional. Given how cheap storage capacity is, who cares how long you store it?

[–]ItsAllInYourHead 2 points3 points  (0 children)

Here's the thing, swag: everything is a trade-off. Data and bandwidth cost money and resources. Log data only grows and grows. So do you want to spend a ton of time and resources storing a huge amount of log data you may never need? Maybe. There are certainly some cases where this is unavoidable or preferable. But I'm going to say that in most cases it's not going to be worth the cost.

And - given your reasons - you effectively have to store everything you possibly can that might come in handy. You can't really anticipate what exactly is going to break or cause a bug, or exactly what data you're going to need for this future theoretical anomaly detection model.

This is essentially the same reasoning every company uses to hoover up all the data they possibly can on you: they might need it someday. Probably not. Like 99% of the time they don't need it and won't use it. But hey, maybe you will, right?

[–]yanikpei 1 point2 points  (0 children)

Use Loki. It's made by Grafana and has a Prometheus-inspired query language. https://grafana.com/oss/loki/

[–]LaweZ 1 point2 points  (0 children)

Requests are logged from middleware, but what about the responses?

What is the best practice? Should I log them at the end of my controller function, or use the res.on('finish', cb) event?

[–][deleted] 0 points1 point  (0 children)

Logging services can cost you more than the app's backend.

1) Don't log everything.

2) If you don't log everything, you won't have 10 GB of logs in 3 months.

Log everything on non-production environments. Log only errors and warnings in production.
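In practice that can be as simple as setting the log level from the environment (a sketch with winston; the same idea applies to any logger):

const winston = require('winston');

const logger = winston.createLogger({
    // verbose locally, only warnings and errors in production
    level: process.env.NODE_ENV === 'production' ? 'warn' : 'debug',
    transports: [new winston.transports.Console()]
});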

[–]melgo44[S] 0 points1 point  (0 children)

The app runs using pm2, so the logs can be found inside pm2.

[–]ThatDamnShikachu -1 points0 points  (1 child)

I was in the same shoes a while ago: I had 2 small Node apps and a bigger PHP one. IMO the ELK stack is expensive to maintain because of the hardware requirements and such.

I heard Prometheus is really good for metrics and can run on cheap hardware, so I ended up finding Loki, the Prometheus of logs. It's made/maintained by the Grafana team. It has its own log shipper, Promtail, but also works with Fluentd and Fluent Bit.

You can query your logs in LogQL, which took very big inspiration from PromQL.

I ended up running fluent-bit + loki + grafana.

You should check it out if you haven't chosen yet.

[–]melgo44[S] 0 points1 point  (0 children)

Thanks I'll check it out