Gerhard Killesreiter's blog

Mar 29

Spammers by mailprovider

by Gerhard Killesreiter

On drupal.org we have much less spammers than other websites. One reason is the fact that we do not allow anonymous users to post anything and that every user needs a valid mail address in order to use his account.

This poses the question: Which email providers to our spammers use?

Luckily, this is rather easy to answer:

mysql> select substring_index(substring_index(init, '@', -1), '.', 1) as provider, count(substring_index(substring_index(init, '@', -1), '.', 1)) as count from users where status = 0 and login != 0 group by provider order by count;

Feb 20

Einmal mit Profis arbeiten!

by Gerhard Killesreiter

The title roughly translates to "Oh, to once work with true professionals!"

This is something which I all too often have to think (and sometimes say aloud) when working on client projects. The nature of my work usually results in working for some kind of web agency which has secured a bigger Drupal project that they cannot handle alone. I then work with the agency's employees and other freelancers to complete whatever the project demands.

Aug 31

34626 logged in users during August 2008

by Gerhard Killesreiter

select count(login) from users where login > 0 and from_unixtime(login) > '2008-08-01 00:00:00' and status= 1;
+--------------+
| count(login) |
+--------------+
| 34626 |
+--------------+

That means that 10% of all drupal.org user accounts are actively used. Not that bad at all.

133257 is the result for the same query if you change the date to 1st of January. That is over50% of the accounts that exist on drupal.org and have been used at least once. Quite impressive.

Aug 31

450 Abusers on drupal.org during August 2008

by Gerhard Killesreiter

mysql> select count(login) from users where login > 0 and from_unixtime(login) > '2008-08-01 00:00:00' and status = 0 ;
+--------------+
| count(login) |
+--------------+
| 450 |
+--------------+

The query gives us the number of users who logged in during August and are now blocked.

Mar 29

Getting rid of slow queries

by Gerhard Killesreiter

If you run a popular Drupal site, you'll sometimes find more queries in your MySQL server's slow query log than you really want to.

Sometimes the unruly queries can be eliminated easily by adding an index or two. Sometimes, they can be avoided alltogether (for example, the query in theme_forum_topic_navigation can be avoided by overriding the function with an empty one in your theme).

Feb 16

Drupal 6 proves to be very popular

by Gerhard Killesreiter

We've already counted close to 40000 downloads in only three days!

For comparison: There were about 67000 downloads of all Drupal 5 releases combined in January 2008. There were about 60000 downloads of all Drupal 5 releases in January 2007 when Drupal 5 was released.

Feb 08

Going FOSDEM

by Gerhard Killesreiter

This is to announce that Andy Kirkham (AjK) of the Drupal security team and myself will be attending FOSDEM from the 22nd to the 25th. Meet us at the Beer event or in the Drupal developer room.

Feb 03

Searching for foo

by Gerhard Killesreiter

As outlined in the previous post, drupal.org got over 850k referalss from search engines' result pages in January 2008. Thanks to awstats we've got some statistics to share about people's favourite search words.

Feb 03

So Microsoft wants to buy Yahoo!

by Gerhard Killesreiter

What's the relevance of the title of this blog post for Drupal you might ask yourself? Actually, this post is about how it is not relevant.

While Yahoo is a valued Drupal user and supporter, Drupal users don't seem to use them much. I've had a look at the search referrer stats that we've collected during January 2008. Unless the stats are totally made up because everybody forges their referrer, Yahoo contributed only 1.7% (or 14942 pages) to the total of 864844 links we got from search machine results. You may wonder who got the lion's share?

Hint: It isn't MSN (0.1% or 1649 pages) nor AOL (0.3% or 3104).

Jan 29

So google thinks we are fast

by Gerhard Killesreiter

(from an earlier post to the infrastructure mailing list)

You'll maybe remember the problems we had with googlebot in summer. It had crawled about 150k pages per day and that caused problems on our servers. I then asked google (via the webmaster tools, thanks to Morbus Iff) to crawl less often and it went down to below 10k pages per day. This setting expired in November and as of December googlebot