Hacker Newsnew | past | comments | ask | show | jobs | submitlogin

> sorry to break it to you, but a little over 49% of developers are below average.

I'm being nitpicky, but this isn't true (unless you're talking about the median).

I think developers follow a skewed distribution by most measures (prolificacy, quality), with something more like a power law distribution. If this is true, the majority of developers are below-average. A sobering thought!

The opposite is also possible however. If 90% of developers are equally good, and 10% are really terrible, then 90% of developers could be above average. If this is the type of problem your devs are solving, you shouldn't be paying top wages or trying to get "ninjas."

Nitpick aside, this is a great article about avoiding the wrong kind of self-selection in your applicant pool.



I think this black-and-white/good-bad developer is way over simplified for our industry.

I think majority developer is above average AT SOME THINGS, someone may be a great javascript developer who can barely get a C program compiling, someone may be awesome AI developer who is blissfully ignorant about engineering best practices, etc.


The overwhelming majority of natural phenomena follow the normal distribution. Do you have any reason for assuming software development is somehow different?


Distribution of software development skill in the entire population is probably normal. However, developers are not the entire population, they are a tiny slice of the right tail. So the distribution is more like an exponential distribution, which has the mean smaller than the median.


The performance of doctors follows the normal distribution. I think that extrapolating to programmers is not a huge stretch.

    It used to be assumed that differences among hospitals
    or doctors in a particular specialty were generally 
    insignificant. If you plotted a graph showing the 
    results of all the centers treating cystic fibrosis—or 
    any other disease, for that matter—people expected that 
    the curve would look something like a shark fin, with 
    most places clustered around the very best outcomes.
    But the evidence has begun to indicate otherwise. What
    you tend to find is a bell curve: a handful of team
    with disturbingly poor outcomes for their patients, a
    handful with remarkably good results, and a great
    undistinguished middle.
http://www.newyorker.com/archive/2004/12/06/041206fa_fact?cu...


The difference between doctors and programmers is that programmers get to build/leverage tools that are abstractions of other tools, hence you can have orders-of-magnitude differences in productivity between programmers who use the best tools and those who don't. I assume that there's not that much variation between the way two different doctors carry out the same task like there is with programmers.


Actually, doctors do have order-of-magnitude tools for improving certain metrics, such as recovery and infection rates: http://www.newyorker.com/reporting/2007/12/10/071210fa_fact_...


Actually, a lot of natural phenomena does NOT follow a normal distribution. The problem is that most people assume it does anyway, and that's where a lot of errors come from.

Statistics requires a lot of correct assumptions in order to be accurate; sadly, most people overlook or fail to check their assumptions.


The thing is, natural phenomena doesn't follow normal distribution, but the sum of non-normal variables, will approach a Gaussian distribution, as per the Central Limit Theorem[1][2].

The problem is people assuming a normal distribution on a single variable with unknown distribution.

[1]: http://en.wikipedia.org/wiki/Central_limit_theorem [2]: http://stats.stackexchange.com/questions/22387/why-it-is-oft...


Yes, that's correct. If anyone wants to know more, an elementary statistics book will teach you all about the central limit theorem and the law of large numbers. Very useful things to know.


Khan Academy has a good lesson on the law of large numbers http://www.khanacademy.org/math/probability/v/law-of-large-n...


mlvljr, for some reason your post is showing as "dead". I'm not sure why??


Do you have any reason to assume that programming competence is a "natural phenomenon" ?

We're not talking about the statistical central limit theorem here, we're talking about people who are passionate about programming and spend years working on their craft.

You can only invoke the normal distribution thing when you're talking about an outcome that is the average of many independent quantities.


Research into the performance of medical doctors shows that their skill follows the normal distribution: http://www.newyorker.com/archive/2004/12/06/041206fa_fact?cu...


A lot of genetic traits follow normal distribution due to the way genes are combined. It doesn't follow at all from this that human skill follows a normal distribution, especially looking at the whole population. It seems to be more like a power law distribution.


Well, I don't know how you got that assertion about the normal distribution. But, one reason you could think it's not a simple normal distribution is that computer science classes often have bimodal grade distributions.


Software development is not a natural phenomenon.


Software development is not a D&D stat block item. No reason to believe it is even quantifiable (although I like to think some developers approach writing code the same: rolling dice). If it were quantifiable, what would be the metric?


Given that there are N animals of size X, how many animals are there of size X/10?


The phenomena are normally distributed, but there are often steep threshold effects in performance. For example, no number of teacup poodles is sufficient to win a dogsled race. Most dogs have above average completion times on the Iditarod race.

Which is not to say there is no market for chasing frisbies.


I am not much of a stats guy, but median seems more appropriate in some ways because you're hiring a person, not a statistic. Of course, it'd also be fair to say that your median developer is significantly less awesome than someone in the top 10%.


Technically an average is general statistic and mean and median (as well as mode) are specific types of averages. As a person with a stats background I often use "average" to mean the general concept instead of a strict arithmetic mean.

It was easy to figure out what the author meant, and his point should be taken for what he meant: half of people are below median.


Thanks for catching this (Xcelerate below too). I'm updating the post to reflect it (having some trouble with caching on Posterous it appears though).




Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: