Googlebot likes Drupal 6

It is now several weeks after the upgrade of drupal.org to Drupal 6 and I've taken a look at google's crawling statistics for drupal.org.

This is the most interesting graph for me as infrastructure manasger, it shows the average time that googlebot needs to download a html page from drupal.org. We apparently had a bit of a rough ride in January, but recently this has smoothed out. About 600ms per page seems quite a good value to me.

Spammers on drupal.org

So, after I claaimed we'd have less spammers than others, I wanted to find out how many spammers we've actually had.

mysql> select EXTRACT(YEAR_MONTH FROM from_unixtime(created)) as yearmonth, count(*) as count from users where status = 0 and login != 0 group by yearmonth order by yearmonth desc ;

Year/Month # of spammers
2009 / 04 820
2009 / 03 710
2009 / 02 1101
2009 / 01 371
2008 / 12 171
2008 / 11 145
2008 / 10 136
2008 / 09 268
2008 / 08 486
2008 / 07 639
2008 / 06 145
2008 / 05 132
2008 / 04 149
2008 / 03 206
2008 / 02 167

Spammers by mailprovider

On drupal.org we have much less spammers than other websites. One reason is the fact that we do not allow anonymous users to post anything and that every user needs a valid mail address in order to use his account.

This poses the question: Which email providers to our spammers use?

Luckily, this is rather easy to answer:

mysql> select substring_index(substring_index(init, '@', -1), '.', 1) as provider, count(substring_index(substring_index(init, '@', -1), '.', 1)) as count from users where status = 0 and login != 0 group by provider order by count;

Einmal mit Profis arbeiten!

The title roughly translates to "Oh, to once work with true professionals!"

This is something which I all too often have to think (and sometimes say aloud) when working on client projects. The nature of my work usually results in working for some kind of web agency which has secured a bigger Drupal project that they cannot handle alone. I then work with the agency's employees and other freelancers to complete whatever the project demands.

34626 logged in users during August 2008

select count(login) from users where login > 0 and from_unixtime(login) > '2008-08-01 00:00:00' and status= 1;
+--------------+
| count(login) |
+--------------+
| 34626 |
+--------------+

That means that 10% of all drupal.org user accounts are actively used. Not that bad at all.

133257 is the result for the same query if you change the date to 1st of January. That is over50% of the accounts that exist on drupal.org and have been used at least once. Quite impressive.

450 Abusers on drupal.org during August 2008

mysql> select count(login) from users where login > 0 and from_unixtime(login) > '2008-08-01 00:00:00' and status = 0 ;
+--------------+
| count(login) |
+--------------+
| 450 |
+--------------+

The query gives us the number of users who logged in during August and are now blocked.

Confessions of a one-time Project Module contributor

Or, The Project Module and the Deadly Sins of Drupal Development

Forgive me, Drupal, for I have sinned. It's been many months since my last confession.

At the Drupal Association we've set upgrading drupal.org as our top priority this year, and much of that work depends on project.*.

The Project module and kin are some of the most important pieces of drupal.org--and some of the most difficult. Six months after Drupal 6 came out, drupal.org still is still running Drupal 5, in large part because project.* is not yet upgraded.

Pages

Subscribe with RSS Subscribe to RSS - drupal.org