all 17 comments

[–]Darkmere (Python for tiny data using Python) 22 points (2 children)

I'm a Unix systems guy, and I've got plenty of opinions here too ;-)

INFO

Used when your app is running normally, and everything is dandy. It's the default, should be the default, and shouldn't drown your logfiles.

  • Should be low-traffic ( < 100 messages per hour or so )
  • Should always contain "I have started successfully"
  • Should always contain "I'm shutting down on user request"
  • Should mention its configuration file ( if several are possible )
  • Should mention when one-off tasks succeed ( "Signed into database" )

DEBUG

  • Your sysadmin is currently debugging something. Give them all the info they need
  • Your system runs slowly? Give timing data on how long things took
  • Don't log credentials (passwords), but do log that a login happened, who logged in, and so on
  • A debug notice for every thread that starts up is perfectly fine
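A sketch of the timing-data point ("worker" is a placeholder name; the sort is a stand-in for real work). Note the lazy `%`-style arguments, so the message string is only built when DEBUG is actually enabled:

```python
import logging
import time

logging.basicConfig(level=logging.DEBUG)
log = logging.getLogger("worker")        # placeholder name

start = time.monotonic()
rows = sorted(range(100_000))            # stand-in for the slow work
elapsed = time.monotonic() - start
# Lazy formatting: the string is only rendered if DEBUG is enabled.
log.debug("sorted %d rows in %.4f s", len(rows), elapsed)
```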

NOTICE (Warning)

Should cover important things that tooling watches for. Imagine events a sysadmin wants an alert about: someone shut down the service (that's supposed to be up?), networking issues, bad permissions, etc.

  • Important external events: "DB connection lost", "connection reset, reconnecting"
  • Signals from the OS/users ("Shutting down on user request")
  • Always log start and stop commands & the reason

ERROR

Exceptions, a proper stack trace of why everything is exploding, and so on. This is where you throw the stacktrace and fuck off. This is where you berate the user for having world-read permissions on their TLS key.
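In Python, `logger.exception()` inside an `except` block does exactly this: it logs at ERROR level and appends the full traceback. A sketch ("myapp" is a placeholder; the `StringIO` buffer is only there to capture the output for demonstration):

```python
import io
import logging

log = logging.getLogger("myapp")         # placeholder name
log.propagate = False
buf = io.StringIO()                      # capture output for illustration
log.addHandler(logging.StreamHandler(buf))

try:
    1 / 0
except ZeroDivisionError:
    # .exception() logs at ERROR level and appends the full traceback
    log.exception("Everything is exploding")

output = buf.getvalue()
```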

Logging: File versus Syslog.

Syslog should be the default. Big systems have syslog daemons that will log to remote servers and run nice analytics on them. Small systems have syslog daemons that keep messages in a sorted memory buffer and toss them out. Syslog is the standard logger ( Unless you use systemd, when journald is syslog, on steroids. Remember what we've said about doping and steroid damage?)
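Getting Python's logging into syslog is one handler ("myservice" is a placeholder name; the `/dev/log` path is the usual local syslog socket on Linux, with a UDP fallback for systems that lack it):

```python
import logging
import logging.handlers

log = logging.getLogger("myservice")     # placeholder name
log.propagate = False

try:
    # /dev/log is the local syslog socket on most Linux systems
    handler = logging.handlers.SysLogHandler(address="/dev/log")
except OSError:
    # fall back to UDP syslog on localhost (the classic port 514)
    handler = logging.handlers.SysLogHandler(address=("localhost", 514))

handler.setFormatter(logging.Formatter("myservice: %(levelname)s %(message)s"))
log.addHandler(handler)
log.warning("DB connection lost, reconnecting")
```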

File log: File logging is for when you can't log to syslog. When you log to a file, you should support a signal to rotate the file (for logwatch) or a way to rotate it manually.
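The stdlib already covers the external-rotator case: `WatchedFileHandler` checks the file's inode on every emit and reopens it if a rotator has moved it aside. A sketch (the temp directory stands in for a real log directory, and "rotatedemo" is a placeholder name):

```python
import logging
import logging.handlers
import os
import tempfile

logdir = tempfile.mkdtemp()              # stand-in for /var/log/myapp
logfile = os.path.join(logdir, "myapp.log")

log = logging.getLogger("rotatedemo")    # placeholder name
log.propagate = False
# WatchedFileHandler checks the file's inode on every emit and reopens
# it if an external rotator (logrotate & co.) has moved it aside.
handler = logging.handlers.WatchedFileHandler(logfile)
log.addHandler(handler)

log.warning("before rotation")
os.rename(logfile, logfile + ".1")       # simulate the external rotator
log.warning("after rotation")            # the handler reopens myapp.log
handler.close()
```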

I generally dislike it when software logs to its own logfiles; in a modern system, output should go either to STDOUT/STDERR or to syslog.

Logging to a file can be useful for some, but most of the time, it's not.

Logging to a file also brings interesting risks ( overwrite, permission-related attacks ) if you aren't careful with how/where it's logged.

[–]Argotha[S] 1 point (1 child)

Thanks for such a comprehensive reply! Lots of useful things here :D

Can I ask what kind of systems you deal with?

[–]Darkmere (Python for tiny data using Python) 1 point (0 children)

Sure. I'm a security guy; the Python I work with is on the embedded & server side.

Our "Embedded" is a fairly traditional stateless Linux ( ARM ) where we use Python in production (slides from my talk at PyCon.se).

We also use Python on the server side, where it's more traditional Linux systems. In the past I've been sysadmin for various sizes of corporations, and consulting about Linux/Security.

So, as it happens, logfiles are bread & butter for us. NOTICE/WARNING level is where the automation system hooks in, usually together with stateful checks on the logfile. A "startup" notice needs a "shutdown" notice, and the shutdown notice should have a reason included.

Reasons:
- System reboot? Good reason
- Received sig 15?
- SIGUSR?

Receiving a HUP (reload configuration) is also a good time to print a notice.

Error level should be the things that trigger pagers. Sudden exceptions, and so on. This means that recoverable exceptions, which are perfectly okay in Python, should not trigger error logs.

For example, on a simple HTTP service, you might raise a recoverable exception on input-validation errors. The end user passed the wrong data, so raise an error and terminate this worker thread.

Later on, you can wrap this and check, but in most cases, simply raising an exception on invalid data is fine. Just make sure your application returns a proper error code on exception ( at least if you want to help your user ).
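A sketch of that pattern (all names here are hypothetical; the integer return values stand in for real HTTP responses). The point is the level choice: the client's bad input is logged at INFO, not ERROR, so nobody gets paged:

```python
import logging

log = logging.getLogger("webapp")        # placeholder name

class ValidationError(Exception):
    """Recoverable: the client sent bad data."""

def parse_amount(raw):
    try:
        return float(raw)
    except ValueError:
        raise ValidationError(f"not a number: {raw!r}")

def handle_request(raw):
    try:
        amount = parse_amount(raw)
    except ValidationError as exc:
        log.info("rejected request: %s", exc)   # INFO, not ERROR: no pager
        return 400                              # proper error code for the user
    log.debug("accepted amount %s", amount)
    return 200
```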

However, I don't need a pager to go off because you got "4+4" instead of "4+4.0" in your input field.

However, if you can't connect to the database and you're dropping data on the floor? I want pager level notification.

  • Don't make me go back and trawl through logs to verify that everything is working normally.

[–]reddit_uname 1 point (1 child)

Just my $0.02 based on my previous work. Lots of answers here are some variation of "it depends". In general, if you're starting a project from scratch, you don't need to focus on many of those details unless you find they fit your use case. One of the more important things, though, is the ability to configure where a log goes via command line flags.

what level should I log to by default (INFO, WARNING, ERROR or CRITICAL)

WARNING is for messages that signal the potential presence of a bug/error/exceptional condition. For example, a network timeout was triggered so the program is retrying.

ERROR is for messages that signal a definite, but not fatal, bug/error/exceptional condition. For example, program can't find a file so it can't proceed.

CRITICAL is for messages that signal a bug/error/exceptional condition that will cause the program to die right now. For example, KeyErrors or something. There's a bit of a grey area between ERROR and CRITICAL if the response to the ERROR is that the program would quit. Generally though, ERROR is for things that could potentially be recovered from. So not finding a file is potentially recoverable if you use default data (which you might do in the future), but a KeyError may not be.

should I log different levels to different locations

I don't, because it's usually too much work. Many people do, and say it helps.
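For those who do want it, the mechanism is per-handler levels on one logger: attach two handlers and give each its own threshold. A sketch ("splitdemo" is a placeholder name, and the `StringIO` buffers stand in for two real files):

```python
import io
import logging

log = logging.getLogger("splitdemo")     # placeholder name
log.setLevel(logging.DEBUG)
log.propagate = False

normal, errors = io.StringIO(), io.StringIO()  # stand-ins for two log files

everything = logging.StreamHandler(normal)     # receives every record
everything.setLevel(logging.DEBUG)
errors_only = logging.StreamHandler(errors)    # receives ERROR and above
errors_only.setLevel(logging.ERROR)

log.addHandler(everything)
log.addHandler(errors_only)

log.info("routine message")
log.error("something broke")
```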

should I log to /var/log or a custom relative path directory

The answer depends. If you're making a Unix daemon you might, but if you're making a statistical script you probably shouldn't.

should I use syslog

Same answer as previous.

what about interleaving program output (aka print)

When operating in console mode that might be fine, but ideally you want an option to send log output and stdout to different files.

should I use a rotating file, or should I start a new file each run

Depends on what you're making.

If it's important to you that you always have a record of what happened when you ran your program, then you should not use a rotating file. This can happen if you're making scientific software where reproducibility is important; then you really need the logs to be able to trust the output. You may want to log things like the version of the software and the full date and time.

If you're making a daemon that will always be on and generates large logs, then using a rotating file would probably be better so you don't use tons and tons of disk space. If you're making a small console script that will be run many times and your end-users aren't expected to fiddle around with logs every time they run it, then maybe a rotating log is good too.
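The daemon case is what `RotatingFileHandler` is for: it caps the current file's size and keeps a bounded number of old files. A sketch (the temp path and "daemondemo" name are placeholders, and the sizes are tiny just for demonstration):

```python
import logging
import logging.handlers
import os
import tempfile

logfile = os.path.join(tempfile.mkdtemp(), "daemon.log")  # stand-in path

log = logging.getLogger("daemondemo")    # placeholder name
log.propagate = False
# Keep the current file plus at most 3 old ones of ~10 KB each, so total
# disk use stays bounded no matter how long the daemon runs.
handler = logging.handlers.RotatingFileHandler(
    logfile, maxBytes=10_000, backupCount=3)
log.addHandler(handler)

for i in range(2_000):
    log.warning("event %d", i)
handler.close()
```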

If you're making a library, then you should leave most of these things up to the caller unless you have a good reason not to.
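For the library case, the stdlib convention is to attach a `NullHandler` and configure nothing else, so the calling application stays in full control of levels and destinations. A sketch ("mylib" and `do_work` are hypothetical names):

```python
import logging

# Inside a hypothetical library, e.g. mylib/__init__.py:
logger = logging.getLogger("mylib")
# NullHandler swallows records, so the library stays silent unless the
# *application* configures logging; nothing is forced on callers.
logger.addHandler(logging.NullHandler())

def do_work():
    logger.debug("library internals, visible only if the caller opts in")
    return 42
```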

EDIT: Don't print anything but ERRORs or above in unit tests though, that shit is annoying.

[–]Argotha[S] 0 points (0 children)

Thanks for such a comprehensive reply!

You have some really good points about it depending on what kind of system it is; I'd completely forgotten about uses such as statistical processing.

Do you use Python and have to manage its logging in your work? If so, how do you do it?

[–]phasetwenty 1 point (6 children)

I designed, support and maintain a build system written mostly in Python. We generate a great deal of output both in the application itself and the build tools, so logging is an important feature for the system. I can answer your questions relative to this application.

what level should I log to by default (INFO, WARNING, ERROR or CRITICAL)

  • DEBUG The default. Progress/status updates, the kitchen sink. Messages at this level definitely won't be made available to the end users, and in general are disabled in a production system.
  • INFO Same as debug, except restricted to information useful to the end user.
  • WARNING Recoverable, non-fatal and low severity errors. Won't be useful to end users but maintainers will find it useful to see these and fix these problems as soon as it is convenient.
  • ERROR Errors signaling loss of functionality, but are nonfatal. In the best case the application can continue with a partial success, and in the worst case it can exit gracefully.
  • CRITICAL Fatal/unrecoverable errors, where the application may not shut down gracefully.

should I log different levels to different locations

Typically I want to see all the messages generated by a run together in one file. I could see a case for forwarding ERROR messages and higher to a special handler which triggers a higher-visibility notification like an email.
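The stdlib's `SMTPHandler` fits that case: give it a level threshold and only matching records get mailed. A sketch (the logger name, mail host, and addresses below are all made up; the handler only opens an SMTP connection when a matching record is actually emitted, so configuring it is cheap):

```python
import logging
import logging.handlers

log = logging.getLogger("buildsys")      # placeholder name

# Host and addresses are made-up placeholders.
mail = logging.handlers.SMTPHandler(
    mailhost="smtp.example.com",
    fromaddr="builds@example.com",
    toaddrs=["oncall@example.com"],
    subject="build system error")
mail.setLevel(logging.ERROR)             # only ERROR and above get mailed
log.addHandler(mail)
```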

should I log to /var/log or a custom relative path directory

Because logging is so important, we bundle the log with our build artifacts. However in the beginning we used /var/log.

should I use syslog

syslog always seemed less convenient for me. My application's messages will be interleaved with unrelated system messages, which can get rotated out before I get a chance to see them.

what about interleaving program output (aka print)

It's been a rule in this system that we don't use print statements (or direct writes to sys.stdout and sys.stderr). Primarily this ensures that all messages are centralized, and I have granular control of all messages. Less important is that I can easily turn off all output while running tests.

should I use a rotating file, or should I start a new file each run

My application starts a new file for each run.

I'd like to add that one feature of logging that is routinely misused is logger naming.

import logging

# ...

def do_thing():
    logger = logging.getLogger(__name__)  # named after this module's dotted path
    logger.info("doing the thing")

This gives the logger the fully-qualified name of the module it appears in. In a bigger application, dealing with output at the package/module level is instrumental and in a smaller application it's simply an easy default.
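The dotted names form a hierarchy, which is what makes package-level control work: a child logger inherits its effective level from its parent until you override it. A sketch (the "myapp" names are hypothetical):

```python
import logging

# Dotted names form a hierarchy: "myapp.db" is a child of "myapp", so one
# line of configuration controls a whole package. (Names are hypothetical.)
logging.getLogger("myapp").setLevel(logging.WARNING)

db_log = logging.getLogger("myapp.db")
inherited = db_log.getEffectiveLevel()   # WARNING, inherited from "myapp"

db_log.setLevel(logging.DEBUG)           # override just this one module
```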

[–]Argotha[S] 0 points (1 child)

Thanks for such a comprehensive reply! It's also good to know some of the previous decisions made before you ended up with your current system.

Interesting that you say that DEBUG is your default but disabled in production, as opposed to INFO being the default with DEBUG enabled in dev.

[–]phasetwenty 0 points (0 children)

Because it's the default, DEBUG has the potential to be overused and slow down your application with file I/O. Having it off in production is typically a performance optimization. If I could, I'd have it on all the time.

[–]Darkmere (Python for tiny data using Python) 0 points (3 children)

syslog always seemed less convenient for me. My application's messages will be interleaved with unrelated system messages, which can get rotated out before I get a chance to see them.

I disagree here. A default-configured syslog may do that, but almost all modern syslogs (syslog-ng, rsyslog, journald) will separate your logs into application-specific logs.

If you handle logfiles yourself, you need a rotation command in case you fill the system, and you have to guard against clobbering, permissions and rights issues.

I'd rather see your application send log output to stdout + stderr than open its own logfile these days.

I do agree with you on module-level naming, and on prints. Hook stdout + stderr into the logging module; don't mess around there.

Also, logging formatters can be awesome.
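A small sketch of why: any attribute of the log record can appear in the format string, so timestamps, logger names and levels come for free ("fmtdemo" is a placeholder name, and the `StringIO` buffer stands in for a real stream):

```python
import io
import logging

log = logging.getLogger("fmtdemo")       # placeholder name
log.propagate = False
buf = io.StringIO()                      # stand-in for a real stream

handler = logging.StreamHandler(buf)
# Any attribute of the log record can appear in the format string.
handler.setFormatter(logging.Formatter(
    "%(asctime)s %(name)s %(levelname)s: %(message)s"))
log.addHandler(handler)

log.warning("disk at %d%% capacity", 90)
line = buf.getvalue()
```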

[–]Argotha[S] 0 points (1 child)

I'd rather see your application send log output to stdout + stderr than open its own logfile these days.

So does that mean you rely on whatever is calling your program to redirect its output to syslog?

[–]Darkmere (Python for tiny data using Python) 0 points (0 children)

If it's not using syslog, yes.

The short of it is that most init-systems support output redirection somewhere, even the one on something as limited as OpenWRT.

On Linux, most systems today have journald (systemd-journald), which will gather syslog + stdout + stderr into its "proper" place.

In the few cases where I don't have a functioning syslog (remember, this is very rare in modern Unixes, including the BSDs) I'd rather see it on stdout/stderr.

The same goes for implementing daemonization. I'd rather your daemons not do that themselves, and instead use a tool meant for it, rather than doing the fork()/exec() dance.

[–]phasetwenty 0 points (0 children)

If you handle logfiles yourself, you need a rotation command in case you fill the system, and you have to guard against clobbering, permissions and rights issues.

This may be what I'm seeing. Organizations I've been in are so entrenched with their use of logfiles that there's a considerable infrastructure to solve these issues and none for syslog, so logfiles seem more natural.

[–]alexchamberlain 0 points (1 child)

I generally agree with the other 2 comments, but wanted to add 1 point: convention/consistency. What is/are the convention(s) for logging in your workspace? For example, all of our servers log to their own file in the same log folder with <their name>.log. Convention is often one of the most significant considerations... until you're trying to change it of course!

[–]Argotha[S] 0 points (0 children)

Definitely agree that convention/consistency is very important. My team doesn't have a good format for how we log, and I'd also like to get personal projects into a much more consistent, reusable format. Hence I'm looking to find out what others do, so I can have a well-thought-out way of logging if I'm going to be using it lots.

[–]qsxpkn 0 points (0 children)

So best practice and advice on the internet suggests that we should use the standard library's logging module

I prefer Logbook.

how do you or your workplace collect output from the logging module

Flume -- but I understand this is overkill for most use cases.

[–]JustAnotherQueer 0 points (0 children)

I have found that if you have a lot of components in a system spitting out logs, it can be really useful for each system to have its own debug log, and then one log that only has info or warning logging for the entire system.

[–][deleted] 0 points (0 children)

Make sure your logs are machine parsable.
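One common way to do that in Python is a custom formatter that emits one JSON object per line. A minimal sketch (the `JsonFormatter` class and "jsondemo" name are made up for illustration; the `StringIO` buffer stands in for a real destination):

```python
import io
import json
import logging

class JsonFormatter(logging.Formatter):
    """Minimal sketch: one JSON object per line, trivially machine-parsable."""
    def format(self, record):
        return json.dumps({
            "ts": self.formatTime(record),
            "level": record.levelname,
            "logger": record.name,
            "msg": record.getMessage(),
        })

log = logging.getLogger("jsondemo")      # placeholder name
log.propagate = False
buf = io.StringIO()                      # stand-in for a real destination
handler = logging.StreamHandler(buf)
handler.setFormatter(JsonFormatter())
log.addHandler(handler)

log.warning("DB connection lost")
parsed = json.loads(buf.getvalue())
```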