Comments in Clean Code? Think Documentation

ByErik Dietrich February 7, 2017February 8, 2017

Editorial Note: I originally wrote this post for the SubMain blog. You can check out the original here, at their site. While you’re there, take a look at GhostDoc for your documentation needs.

Second Editorial Note: I recently appeared on the Ruby Rogues podcast and was interviewed by Paysa. If you’re interested, check both of them out!

Notwithstanding some oddball calculator and hobby PC hacking, my first serious programming experience came in college. A course called “Intro to C++” got us acquainted with arrays, loops, data structures and the like. Given its introductory nature, this class did not pose a particularly serious challenge (that would come later). So, with all of the maturity generally possessed by 18 year olds, we had a bit of fun.

I recall contests to see how much application logic we could jam into the loop conditions, and contests to see how much code could be packed onto one line. These sorts of scavenger hunt activities obviously produced dense, illegible code. But then, that was kind of the point.

Beyond these silly hijinks, however, a culture of code illegibility permeated this (and, I would learn later) other campuses. Professors nominally encouraged code readability. After all, such comments facilitated partial credit in the event of a half-baked homework submission. But, even still, the mystique of the ingenious but inscrutable algorithm pervaded the culture both for students and faculty. I had occasion to see code written by various professors, and I noticed no comments that I can recall.

Professionalism via Thoroughness

When I graduated from college, I carried this culture with me. But not for long. I took a job where I spent most of my days working on driver and kernel module programming. There, I noticed that the grizzled veterans to whom I looked up meticulously documented their code. Above each function sat a neat, orderly comment containing information about its purpose, parameters, return values, and modification history.

This, I realized, was how professionals conducted themselves. I was hooked. Fresh out of college, and looking to impress the world, I sought to distinguish myself from my undisciplined student ways. This decision ushered in a period of many years in which I documented my code with near religious fervor.

My habit included, obviously, the method headers that I emulated. But on top of that, I added class headers and regularly peppered my code with line comments that offered such wisdom as “increment the loop counter until the end of the array.” (Okay, probably not that bad, but you get the idea). I also wrote lengthy readme documents for posterity and maintenance programmers alike. My professionalism knew no bounds.

Clean Code as Plot Twist

Eventually, I moved on from that job, but carried my habits with me. I wrote different code for different purposes in different domains, but stayed consistent in my commenting diligence. This I wore as a badge of pride.

While I was growing in my career, I started to draw inspiration from the clean code movement. I began to write unit tests, I practiced the SOLID principles, I watched Uncle Bob talks, made my methods small, and sought to convince others to do the same. Through it all, I continued to write comments.

But then something disconcerting happened. In the clean code circles I followed and aspired to, I started to see posts like this one. In it, the author had written extensively about comments as a code smell.

Comments are a great example of something that seems like a Good Thing, but turn out to cause more harm than good.

For a while, I dismissed this heresy as an exception to the general right-thinking of the clean code movement. I ignored it. But it nagged at me nonetheless, and eventually, I had to confront it.

When I finally did, I realized that I had continued to double down on a practice simply because I had done it for so long. In other words, the extensive commenting represented a ritual of diligence rather than something in which I genuinely saw value.

Down with Comments

Once the floodgates had opened, I did an about-face. I completely stopped writing comments of any sort whatsoever, unless it was part of the standard of the group I was working with.

The clean coder rationale flooded over me and made sense. Instead of writing inline comments, make the code self-documenting. Instead of comments in general, write unit and acceptance tests that describe the desired behaviors. If you need to explain in English what your code does, you have failed to explain with your code.

Probably most compelling of all, though, was the tendency that I’d noticed for comments to rot. I cannot begin to estimate how many times I dutifully wrote comments about a method, only to return a year later and see that the method had been changed while the comments had not. My once-helpful comments now lied to anyone reading them, making me look either negligent or like an idiot. Comments represented duplication of knowledge, and duplication of knowledge did what it always does: gets out of sync.

My commenting days were over.

Best of All Worlds

That still holds true to this day. I do not comment my code in the traditional sense. Instead, I write copious amounts of unit, integration and acceptance tests to demonstrate intent. And, where necessary and valuable, I generate documentation.

Let’s not confuse documentation and commenting. Commenting code targets maintenance programmers and team members as the intended audience. Documenting, on the other hand, targets external consumers. For instance, if I maintained a library at a large organization, and other teams used that library, they would be external consumers rather than team members. In effect, they constitute customers.

If we think of API consumers as customers, then generating examples and documentation becomes critically important. In a sense, this activity is the equivalent of designing an intuitive interface for end-users of a GUI application. They need to understand how to quickly and effectively make the most of what you offer.

So if you’re like me — if you believe firmly in the tenets of the clean code movement — understand that comments and documentation are not the same thing. Also understand that documentation has real, business value and occupies an important role in what we do. Documentation may take the form of actual help documents, files, or XML-doc style comments that appear in Intellisense implementations.

To achieve the best of all worlds, avoid duplication. Make publishing documentation and examples a part of your process and, better yet, automate these activities. Your code will stay clean and maintainable and your API users will be well-informed and empowered to use your code.

Erik Dietrich

Language Agnostic

Addicted to Unit Testing
ByErik Dietrich February 17, 2011September 27, 2012

Something interesting occurred to me the other day when I posted sample code for a DXCore plugin that I created. In the code that I uploaded, I added a unit test project with a few unit tests as a matter of course. Apparently, the process of unit testing has become so ingrained in me that…

Read More Addicted to Unit Testing
Language Agnostic

Static Analysis — Spell Check for Code
ByErik Dietrich February 18, 2011September 27, 2012

A lot of people have caught onto certain programming trends: some agility in the process generally makes things better, unit testing a code base tends to make it more reliable, etc. One thing that, in my experience, seems to lag behind in popularity is the use of static checking tools. If these are used at…

Read More Static Analysis — Spell Check for Code
Language Agnostic

Inverting Control
ByErik Dietrich March 1, 2011October 19, 2014

I imagine that inversion of control is a relatively popular concept to talk or blog about, particularly in object-oriented circles, so rather than do a garden-variety explanation of the term followed by a pitch for using it, I thought I’d take a slightly different approach. I’m going to talk about the reason that there is…

Read More Inverting Control
Language Agnostic

Testable Code is Better Code
ByErik Dietrich August 3, 2011November 14, 2017

It seems pretty well accepted these days that unit testing is preferable to not unit testing. Logically, this implies that most people believe a tested code base is better than a non-tested code base. Further, by the nature of testing, a tested code base is likely to have fewer bugs than a non-tested code base….

Read More Testable Code is Better Code
Language Agnostic

Adventures in Pure Test-Driven Development
ByErik Dietrich October 4, 2011December 22, 2012

In a previous post some time back, I had expressed some skepticism about TDD as a design practice. I talked about test-driven development and its relationship with prototyping and the “make one to throw away” concept. Since I’m not one ever to believe that I’ve arrived at the optimal solution, I’m doing another round of…

Read More Adventures in Pure Test-Driven Development
Language Agnostic

Static Methods: Time to Hit Rock Bottom
ByErik Dietrich November 1, 2011September 27, 2012

A Tragic Story It starts out innocently enough. You’re out at a party with some friends, and some people you want to impress are there. Everyone is having a good time, but you notice that some of the ‘cool’ kids are periodically going into a room and coming out giggling. Not wanting to be left…

Read More Static Methods: Time to Hit Rock Bottom

15 Comments

Oldest

Newest Most Voted

Inline Feedbacks

View all comments

Alain Van Hout

9 years ago

Comments about _what_ the code does are indeed pointless, but information about _why_ it does what it does are very often beyond what clean code allows you do. And that information can at times be very helpful.

Erik Dietrich

9 years ago

Reply to Alain Van Hout

Most of the comments I’ve ever seen explaining the “why” of an implementation are more or less either excuses or apologies, both of which tend to make me sympathize with the author, but without finding the information particularly useful. YMMV. But if I find myself apologizing or explaining myself to a future programmer in the code, I generally stop and ask instead how I could change things to make apologies/explanations unneeded.

Alain Van Hout

9 years ago

Reply to Erik Dietrich

Sometimes it involves business constraints, sometimes it’s because of library or technology limitations, and sometimes it’s due to edge-cases which aren’t apparent at first (or even second) glance, which require the specifics of the current implementation. None of those clasify as apologies.

Erik Dietrich

9 years ago

Reply to Alain Van Hout

Couldn’t edge cases and tech specifics be expressed with automated tests or code contracts? And couldn’t business constraints be expressed through the ALM/requirements tool?

(I’m not being Socratic — I’m interested in why these things wouldn’t work where comments would)

Alain Van Hout

9 years ago

Reply to Erik Dietrich

They could be expressed in unit tests, in the sense that those tests make sure that the edge cases are covered, but that still does not tell the developers why those edge cases exist (which is something that a comment on the unit test could fix, but then why not add it to th code itself). As to code contracts, either those live in the code and we’re giving ‘comments’ another name, or they live outside the code meaning the developer has to know to go look at them in this specific case (meaning they already knew about the edge… Read more »

Erik Dietrich

9 years ago

Reply to Alain Van Hout

FWIW, what I meant about code contracts was automated enforcement. For instance, you could use an annotation/attribute (Java/C#, respectively, depending on your poison) to specify that a parameter cannot be null, and then choose an automated enforcement paradigm (static check, runtime exception, etc). Generally speaking, I always look for automatable constructs whenever possible. I see the convenience of having “why” documented inline versus on some Sharepoint site somewhere, but my experience has always been that these “why” comments tend to age just as badly as the out of band specs, unless it’s the same person who made the original comment… Read more »

Alain Van Hout

9 years ago

Reply to Erik Dietrich

Indeed, those tools and the fact regarding aging does mean that comments (even ‘why’ comments) need to be used judiciously, both in relation to adding them and to taking them at face value. That’s why I treat them in code review as being equally important as ‘other code’ (that being team attitude, or not, will of course mean YMMV).

In any case, since it might not have been apparent, thanks for the post. As per usual, it was an interesting read, as was this follow-up 🙂

Nicolai Parlog

9 years ago

If we were working on the same project, I would tell you to comment your &*☠# code. 😉 I like your stance on “documentation” (I call them contract comments) but would go a little farther – giving each class an explanatory, high-level paragraph of what it aims to do really helps maintainers navigate a larger code base. I think you’re missing context comments (technicaland historical), though. They largely address the why, something neither clean code nor tests can even begin to do. These are immensely helpful when debugging or extending code and age well as they usually describe a specific… Read more »

Erik Dietrich

9 years ago

Reply to Nicolai Parlog

It seems like those wouldn’t rot until someone went in later and did B instead, after all. Then you’d have a comment about B explaining that A and B don’t work, so the implementation is really C. At least, until someone comes along and mercifully deletes all the comments because that would be incredibly confusing. It’s hard for me to speak in generalities on the subject, without looking at specific code (e.g. if you had an example on Github or something we could look at), but whenever I find myself looking at departed developers’ explanations of “why” I usually just… Read more »

Nicolai Parlog

9 years ago

Reply to Erik Dietrich

A recent example that made the rounds on Twitter: link. My opinion on context comments is that they’re not meant to always be true. They are much like commit messages or issue descriptions (but far easier to access) in that they reflect a truth at a certain point in time. They give you context of why a decision was made and of assumptions that were true at the time. So deciding whether or not the original assumptions (as well as conclusions by the way) are still true is very much the point of context comments. Unlike contracts they don’t promise… Read more »

Erik Dietrich

9 years ago

Reply to Nicolai Parlog

I think it must be a matter of what you’re used to seeing or expect to see. For instance, when I look at the snippet in that tweet, my immediate thought is, “gah, who cares, where are the tests?” And I realize that even over the years where I added comments to code religiously, I didn’t really consume them as anything other than considering them a code smell. For instance, imagine a codebase we both recently looked at, and imagine the guy in charge of that codebase for a time leaving a bunch of (questionable) explanations of what he did… Read more »

ratliffchrisb

9 years ago

Reply to Nicolai Parlog

I’m not convinced code shouldn’t have comments, but this code would be much easier to read if it followed other clean code values and doesn’t really prove comments are required. Some obvious problems are. Returns: There are no returns in this block of code. The return conditions aren’t at the head of the code block. Even assuming this is to refer to only the ORACLE case this makes the visible block of code more confusing. ctx I’m guessing this is an abbreviation for context. It can be seen the object has contextual metadata, but it also appears to be executing… Read more »

Nicolai Parlog

9 years ago

Reply to ratliffchrisb

Your comments regarding returns and ctx could be discussed but as far as I can tell they do not touch on the first of the three comments – it would still be needed. You are right about ORACLE_ERROR_NO_RULESET but that does not replace the rest of the comment, which is really helpful to guard against future programmers thinking “WTF, this can’t happen!” I understand the final comment to say, “executeQuery() would have been the obvious choice but it doesn’t work for XYZ (see #1232 where we got burned by this”. that being said, there is only one guy working on… Read more »

dave falkner

9 years ago

We can do this the easy way: “Good code is its own best documentation. As you’re about to add a comment, ask yourself, ‘How can I improve the code so that this comment isn’t needed?’ Improve the code and then document it to make it even clearer.” ― Steve McConnell

Or we can do this the hard way: “Every time you write a comment, you should grimace and feel the failure of your ability of expression.” ― Robert C. Martin

Nicolai Parlog

9 years ago

Reply to dave falkner

“There is some fiction in your truth, and some truth in your fiction. To know the truth, you must risk everything.” ― Neo

Two ways to choose from but both lead astray.

Professionalism via Thoroughness

Clean Code as Plot Twist

Down with Comments

Best of All Worlds

Similar Posts