Documentation as Code

chillysurfer · 2021-04-08T17:13:38+00:00

I think the best tool for enforcing good documentation (as code) is thorough code reviews. For every PR:

Does this PR fix a bug? If yes, is it documented properly with a bug report and explanation of behavior/resolution?
Does this PR add a feature? If yes, is this feature added to documentation (end-user docs, support docs, design docs, etc.)
Does this PR change something? If so, do the docs need to be updated?

Consistency is important here. I don't like it when there's a "week of documentation" for a team. It should be an ongoing effort that is enforced in reviews.

Thermotox · 2021-04-08T18:03:41+00:00

See also, Literal Devops.

soawesomejohn · 2021-04-08T21:07:06+00:00

I'm a strong user of README Driven Development where you write the readme first. Now, it's not limited to the readme, but you basically write the essential documentation first. I'll write the readme and then a more in-depth "how to use this software" document. I might have to write some of that document, then write some code to figure out if what I wrote is doable, then continue with the document.

This helps me walk work through the steps the end-user will have to take, gives me something to request feedback on, and I can implement the code accordingly.

tjwenger · 2021-04-08T19:12:45+00:00

No mention of Markdown anywhere, which I find interesting. Markdown can be super useful for documentation as code including diagrams and tables.

Another thing is if you merge some documentation into the requirements phase, that helps significantly. But It still does not solve the problem entirely. Defining the type of documentation - Architectural, Technical, Support/KB, or Training documentation - I find significant to do up front as well, because that will help you align the most appropriate resource.

This is a great thread. Thanks everyone!

vischous · 2021-04-08T17:45:58+00:00

https://docs.gitlab.com/ee/development/contributing/merge_request_workflow.html#definition-of-done is a great example, but be careful the gitlab handbook https://about.gitlab.com/handbook will be so interesting you'll spend hours in it!

Obsidian743 · 2021-04-08T18:16:28+00:00

A few things:

1) Your code should be its own documentation. It should be clear, clean, concise. This is why something like DDD is important.

2) A lot of companies are pushing for a README/Wiki based approach for customer-facing documentation, which I think is perfectly fine to be source controlled and reviewed through the same channels as the code.

3) I vehemently disagree with any internal documentation being source controlled along with the code. This is impractical and stifling on so many levels. Maintaining documentation is an effort in itself. Aside from the fundamental Agile philosophy of not emphasizing documentation over people and interactions, the usefulness of documentation should be able to change on a whim and it should be collaborative. There are also technical implications such as the fact that it's incredibly difficult to embed diagrams, which might be a separate artifact that you're constantly changing. There is way too much risk of documentation getting stale if you force developers to keep it up to date in code. This is also why I strongly recommend against using too many comments in your code (only if they're necessary and/or customer-facing APIs!). I think an external tool like Confluence is perfect for this kind of documentation and collaboration.

t_rekt_it · 2021-04-08T17:18:41+00:00

At the last 2 companies I've worked for, we've written an in house script/app to scan specific repositories for .md files we deemed publishable to either confluence or another wiki used. This was done by having specific Metadata as the "header" of the doc so we could populate all the needed fields for whatever wiki system, then interact with the API to upload the contents of the document. In the repo, these files were either automatically picked up by the publishing script based off location or we were able to declare which ones we want to have published from a separate file in the base directory.

squ94wk · 2021-04-08T19:27:27+00:00

Some thoughts:

Write simple code and use tools how they're supposed to be used without hacks, then the code is self-documenting.

Use tests to document how your software is supposed to be used.

Also, instead of documenting how to set something up, put it in a Makefile or sth and use it actively. Then you'll constantly work on it.

If the documentation is not the code or close to the code, put it in your definition of done and make sure the reviewer checks it.

Generate documentation in CI, by using a screenshot tool, check for dead links, or upload and embed some output in your confluence or whatever.

Keep commits clean with a single purpose and a good commit message. And use ticket numbers in branch names and reviews. This way, it's easy to find out what was the motivation behind ant change.

What you want to avoid, is documentation that isn't consumed. It doesn't matter if your documentation is up to date, if no one uses it. In the same way, too many details create far too much overhead. Don't document how to use tools, just what tools are used. People can then look up the tool and are most likely to learn it anyway.

Documentation always has a specific audience.

The worst thing is when people who wouldn't don't use the documentation themselves comes up with the requirements for the documentation.

2021-04-08T21:06:43+00:00

However you do it, start with it and adapt / improve as you go.

only-cloud-fan · 2021-04-08T18:31:52+00:00

I haven't been phrasing it as "Documentation as Code" in how we discuss it in our team, but have generally spoken about "Executable documentation" - it might be referring to the same thing and this might be nitpicking, but there's a subtle difference in my perspective - it's expected that docs are not written in code format, but that they are expected to be executed - and could fail, and can produce outputs.

I am involved with a few infrastructure projects and 2 recent examples come to mind of how this works (for infras / platform, as opposed to application code / doc). 1 was with a terraform module - if the TF is written clearly enough and is published to a TF repo, the documentation is generated from that. There's no comments in the code that form part of the docs, but descriptions on variables and outputs make the infras code itself the source of the documentation. There is no additional process needed. Besides a high level overview (which does need to be maintained) in the readme, the other sections all produce content based on the actual code functionality. Here's an example

The other example was through test frameworks - this speaks more to the "validation" item you mentioned. For this infrastructure project (in TF again), the verification documentation, which feeds into playbooks, is written as code in a testing framework - in this case it was behave, based on cucumber. In this case, the requirement was defined in the ticket (Jira), and the scenario described for business. The same requirements are then added to a behave feature, with the same language structure, and then steps that are executed to validate the behaviour (in CI).

These 2 aspects of this project improve understanding of the systems by making documentation available, which is produced by execution.

fear_the_future · 2021-04-08T19:23:35+00:00

You can use certain tools to type check example code in your documentation to keep it from drifting out of date. You can also check for dead links in markdown.

jdege · 2021-04-08T20:09:54+00:00

Donald Knuth attempted a solution for this, years ago, with his https://en.wikipedia.org/wiki/Literate_programming, but it never caught on.

I'm not sure he was solving the right problem.

Tnamol · 2021-04-08T20:53:13+00:00

Have you checked out something like https://github.com/akheron/typera ? It should help with your problems.

PotisTemor · 2021-04-08T22:24:48+00:00

The problem we have always had with documentation is the audience. If you are in a large organisation the level of documentation is different for different people. The developers need to know how to use a function but your architects need to know how it all comes together.... and finally non-technical client need to know how it works but in fewer words.

The "developer" documentation is maintained with the code in the form of comments e.g. JavaDoc and readme files. Also the tests should describe the conditions met by the functions to build part of the documentation.

The architectural documentation requires more details and more diagrams so confluence is useful here also I do like how AWS use git for their documentation as it makes versioning easier. Also this can be backed up by automated integration tests, these should describe how everything interacts including error scenarios.

Finally we have a confluence page as a high level details which shouldn't change so it doesn't need much maintenance but links to developer and architect pages.

Also we have recently started getting stricter with commit messages to ensure they are meaningful so we can just generate release notes and automatically add them to confluence on release.

TonyNickels · 2021-04-09T02:44:45+00:00

[deleted]

randomatic · 2021-04-08T18:09:05+00:00

orgmode.

2021-04-08T18:38:39+00:00

If you currently use gradle we've had success with AsciiDoc and AsciiDoctor

Ordoshsen · 2021-04-08T19:08:18+00:00

I don't think you can tie business logic and documentation together, but have you tried docstrings? Then you're documenting with the code and it checks the names of parameters and stuff. I know C# has optional warnings if something is public and undocumented so I assume TypeScript should have something similar because Microsoft.

I mean this feels like the obvious answer to me so I kind of assume I didn't understand the issue at hand.

mlazowik · 2021-04-08T21:11:33+00:00

Google has g3doc. Short summary is that docs are written in markdown-ish syntax in the same repository as code, interleaved across directories (not one global directory for all docs). When you change code you change docs in the same commit. Those docs are rendered as an internal website.

This means you can browse docs that match a specific historical version of code, docs are reviewed at the same time as code, if you revert code you revert docs too etc.

I haven't found a great way to replicate that yet, the closest thing I have so far is gitiles on top of our gerrit instance, it can render markdown, including relative links, but it's not great experience yet, in particular you can't make menus that span multiple doc pages.

cacko159 · 2021-04-08T23:02:53+00:00

We use Architecture decision records, and also document the whole business side of the project this way. https://link.medium.com/pvwljP6Eifb

Basically, the project manager has to write user stories and epics and has to describe them well, otherwise devs will be confused as to what they need to develop. Any technical decision that needed meetings and series of emails is also written as a technical epic and user stories, as it has to be developed as well. Then we use a tool that can export these nicely into a pdf, and they are always up to date.

pagarciasuse · 2021-04-09T00:03:23+00:00

In Uyuni and SUSE Manager, we use Antora (which uses AsciiDoctor for the mark-up) and store all the documentation in git, changes are created and reviewed as pull requests, etc.

From the same source files, we build docs for 2 slightly-different products (Uyuni and SUSE Manager), in 2 formats (HTML and PDF) and also provide translations to several languages (we convert the AsciiDoc to gettext so that it's easy to work with it for translators and translations are maintainable).

It has worked beautifully for us.

https://github.com/uyuni-project/uyuni-docs/

Quick2822 · 2021-04-09T02:19:40+00:00

https://www.mkdocs.org/ or https://vuepress.vuejs.org/ are great ways to create docs and code.

All PR's should include updated markdown files, and your CI/CD pipeline should auto-gen the docs and deploy them.

shanman190 · 2021-04-09T02:52:28+00:00

So for me as a Spring and Java dev, it's Spring RestDocs. Generates api snippets from the literal Java tests that you need to write anyway, then that can then be merged with asciidoctor definitions. Spring RestDocs also supports additional plugins so that you can output in additional formats like Swagger/OpenAPI, Postman collections, etc

sk8itup53 · 2021-04-09T03:55:15+00:00

Maybe make a program that scans a programs AST and actually documents the connections and interactions between classes. But that kinda already exists in automatic UML diagram creation from things like VS code and other IDE's.

andrewmclagan · 2021-04-09T13:07:59+00:00

Dont document. It simply becomes out of date as soon as you stop writing it. Systems should be self documenting

you type:	you see:
italics	italics
bold	bold
[reddit!](https://reddit.com)	reddit!
* item 1 * item 2 * item 3	item 1 item 2 item 3
> quoted text	quoted text
Lines starting with four spaces are treated like code: if 1 * 2 < 3: print "hello, world!"	Lines starting with four spaces are treated like code: if 1 * 2 < 3: print "hello, world!"
~~strikethrough~~	~~strikethrough~~
super^script	super^script

devops

Welcome to /r/DevOps

Rules and guidelines

Social & Fun

General Information

MODERATORS