Skip to main content

Hello there, Mr. Stallman!

The last time I saw Richard Stallman was when he got into a bus that I was on in Cambridge, MA about 6 or so years back.  His name comes up often whenever the Free Software Foundation, GNU, or open source software in general is discussed.  It should have been no surprise for me to see his name show up in Ohloh, a “free public directory of open source projects and people”.  What surprised me was where I found it:
Ohloh open source ranking
There we go.  Five slots above mine.
My Kudo Position recently received a fairly large boost (as evidenced by the +2339 above) due to Secure-Endpoints pushing Network Identity Manager and the KCA Provider for Network Identity Manager into GitHub.  Once this got me excited about my Kudo Rank, I discovered that much like democracy, the rank isn’t based upon my opinion of my work or the quantity thereof, but on what others think about it.

So how exactly is the Kudo Rank and Kudo Position calcuated? Ohloh’s About Kudos page describes the process.  The Kudo Rank is based on Kudos that are received by a contributor, either as a result of someone giving them Kudos directly or by giving Kudos to a project that he has contributed to.  Think of a Kudo as a compliment and the Kudo Rank as the reputation, which increases with each compliment you receive personally or for your work.  Registered Ohloh users can give Kudos while any individual recognized by Ohloh as a contributor to one of their tracked projects and registered users with or without any recognized commits can receive them.

The system is not perfect.  Notably, it favors developers who contribute to open source projects whose user-base has a large number of open source developers who are Ohloh users.  As such, contributors to projects that target primarily non-technical users can run into difficulty establishing a good ranking.  Also absent from the list of factors is a measure of the actual contribution to open source projects; Kudos received by a project are distributed to its contributors in proportion to the number of commits, but commits by themselves do not increase the contributors’ Kudo rank.  However, given the constraints and the limited dataset, Kudos work reasonably well.  It will be interesting to see whether these kinds of rankings will become a currency, like Google’s PageRank.

One side-effect of open source is that in addition to making the source code available for the whole world to see, in many cases, the code repositories themselves are made public.  Consequently, every mistake, misspelled identifier, grammatically incorrect comment, every oops, and every aha is out there in all its glory.  This practice is helped by the plethora of public open source hosting services which make it almost trivial to have a public repository.  Ohloh aggregates this information to analyze individual projects.  For example, the code analysis of OpenAFS shows a breakdown of the programming languages used, licenses, and the quantitative evolution of the codebase.  It also shows that the code represents about 274 person years of work or about $ 15,043,108 at the time of this writing.

This is a great use of the wealth of information that is available.  These statistics provide a valuable insight into the maturity and evolution of the developer community.
But what about Mr. Stallman?  Let’s take a closer look at his entry:

ohloh-ranking

You will notice the notable absence of pretty much his entire open source legacy with the exception of a single project.  In reality, Stallman’s name appears 12 times in the rankings.  The top entry has a Kudo Rank of 10 and a Kudo position of 30.  Alas, it would be patently false to say I should be ranked anywhere near some of these people around me.  The ranking, while fun, can be a tad misleading at times.

(This post was written in the summer of 2010, and hasn't been updated since.)

Comments

Popular posts from this blog

Critical Section Contention

If you've done any sort of multi-threaded programming on Windows, chances are you've worked with CRITICAL_SECTION s. They are lightweight and effective. However, if your critical sections are causing too much contention, they might be the cause of serious performance problems. While working on Network Identity Manager, I was curious about how all of its hundreds of critical sections are doing. Network Identity Manager is a multi-threaded plug-in based application that is often prone to contention during certain operations. But how bad is it? Critical section contention is a fact of life. It is the normal mode of operation for them. However, each time it happens, the application may pay a fairly hefty performance penalty. We are only interested in keeping contention in check. As such, we would like to know which critical sections are experiencing the highest contention levels so we can evaluate design changes that may help alleviate them. A thorough treatment on the semantic...

How to begin snowboarding, when you are over 30

It seems there's an unspoken rivalry between snowboarders and skiers.  We found this out when we tried out snowboarding two years ago.  We can barely ski, but we thought we'll try it out.  After all, at worst we'll probably add one more thing to the list of stuff we aren't good at.  Here are some things we learned in the process, offered in instructional format: You will, most probably, be among the oldest people on the beginner slope.  People who hadn't been born when you were graduating high-school will speed past you and do tricks you will probably never be able to do, and that's fine. Get proper gear.  You don't need to buy a snowboard, but make sure you are wearing snow pants and perhaps thermals.  Don't wear cotton pants or khakis.  I've seen people wear those.  After a few falls, the snow sticks to you and melts, making it look like you wet yourself.  It's not pretty and is uncomfortable.  Wear sn...

On facial recognition and retroactive indexing

Anonymizing data is already quite difficult, as shown in this 2015 paper on the reidentifiability of scrubbed credit card metadata. Beyond ineffective anonymizing, another disturbing aspect is the rate at which AI and ML are improving at image recognition. In particular, face recognition is approaching practicality for general purpose use (See Amazon Rekognition for example). While these technologies aren't quite there yet, they will inevitably reach that point. Once coupled with a data sets that are already publicly available, this means that large public image repositories like Imgur will become petri dishes for face recognition data. These technologies affect existing data retroactively. What is now an unlabeled morass of anonymous pictures could conceivably become treasure troves in the future for data brokers when the cost of picking out pictures of one's likeness from billions of images becomes easily affordable. This can and should be concerning to anyone who's p...