Our study only considers open source code written in the C programming language and found on GitHub. While it is technically feasible to extend this to C++ using the same crawling and data analysis scripts, we decided to disregard C++ code for the sake of simplicity and due to time constraints. However, we are agnostic as to whether C++ programmers have the same mindset as C programmers or are politically more correct. Therefore, we are cautious with respect to generalising our findings to the object-oriented programming community.
C is an understandable (if very dated) university-level target... but I can't help but be very curious if that's being used as plausible-deniability cover for the fact that C is typically used in kernels, drivers, and other low-level work that lends to a type of mindset that is tactile, moves problems out of the way with a scythe, and routinely swears up a storm in the normal process of getting things done.
C++ is a very different beast, as are all the other languages. Rust I would definitely like to see. Scripting languages? Hoo boy do I want an analysis of PHP please. Ruby would be extremely interesting. Erlang provokes morbid curiosity. Python would probably win (ahem, outrank everything else) by simple virtue of being the most popular.
So this is arguably only a fair comparison if v2.0 looks at lots of different languages. Encore, please?
NB: the output charts on page 41 and 42 are interesting. The non-swear quality graph looks pretty random to me, but there are a couple fascinating little hotspots on the swear graph around 11k-30k LOC where quality ranks at 6-8.5 and almost looks like a wide diagonal line tracking quality downward through 15k-30k LOC. Might this be a outlying companion to the Ballmer Peak (https://xkcd.com/323/)? Incidentally, these graphs render nicely side by side using Chrome's 2-page view (menu, top-right).
C is an understandable (if very dated) university-level target... but I can't help but be very curious if that's being used as plausible-deniability cover for the fact that C is typically used in kernels, drivers, and other low-level work that lends to a type of mindset that is tactile, moves problems out of the way with a scythe, and routinely swears up a storm in the normal process of getting things done.
C++ is a very different beast, as are all the other languages. Rust I would definitely like to see. Scripting languages? Hoo boy do I want an analysis of PHP please. Ruby would be extremely interesting. Erlang provokes morbid curiosity. Python would probably win (ahem, outrank everything else) by simple virtue of being the most popular.
So this is arguably only a fair comparison if v2.0 looks at lots of different languages. Encore, please?
NB: the output charts on page 41 and 42 are interesting. The non-swear quality graph looks pretty random to me, but there are a couple fascinating little hotspots on the swear graph around 11k-30k LOC where quality ranks at 6-8.5 and almost looks like a wide diagonal line tracking quality downward through 15k-30k LOC. Might this be a outlying companion to the Ballmer Peak (https://xkcd.com/323/)? Incidentally, these graphs render nicely side by side using Chrome's 2-page view (menu, top-right).