TwitterFollow Me on Twitter

Referrer Spam

Lately, I’ve been researching a couple of different options for battling referrer spam. It’s bloating my stats and wasting my bandwidth and I’m tired of it. The real problem is that I’m the only one who has access to my server logs, so the spam has no function other than waste.

The solution I’m trying to find defeats referrer spam at its core nature, referrer headers (often spoofed) by an automated machine. Certainly any authentic referrer information needs to shine through. The .htaccess solutions I’ve found have either not done enough or broken my site entirely at places like Google. And I’m not into a purposeless or over-zealous response to this. I just need something that works and, preferably, something that stays out of my way. I really don’t want to have to maintain anything if I don’t have to. I will, but as a last resort.

Over the past six months, I’ve seen a rise in four-letter domain names coming into my logs. I don’t necessarily mean four-letter as in profane, I mean literally four-letter domains that are seemingly just random combinations of letters and occasionally numbers. If I blocked referrers from four-letter domains it would probably solve 60-70% of my problem. But that’s the type of compromise I don’t want to do. I’d give up information about referred visitors from authentic sites, like digg.com and cnet.com. (Not that I have much of either of those, but you get the point.)

Has anyone found a decent solution to referrer spam? My personal research hasn’t turned up anything that really works. Shoot me an email if you’d rather stay off-record.

[tags]referrer, spam, web, security, htaccess[/tags]

2 Comments

  1. Jon Kenney

    2007-04-11 1547hrs

    Gravatar

    David,
    What spam filter have you look into, or what spam filter are you running? At the business I’m currently working at we are running Fortinet. It’s an expensive tool, but you are able to block domains by word pattern etc.

    Not much if any, I know, but no one else had commented.

    Jon
    ——-

  2. David

    2007-04-11 2310hrs

    Gravatar

    Jon: I think you might be referring to email spam. At the office, we use Postini. At home, I just use Mozilla Thunderbird’s built-in Bayesian filter.

    In this case, I need a solution to my web site’s referrer spam, which is an issue where spammers use HTTP header information to “spam” server logs with false referrer information in an effort to distribute their URLs in a more broad fashion.

    If I misunderstood what Fortinet can accomplish, let me know.

Leave a Comment

Commenting is not available in this section entry.

Twitter Status

2010-07-29 1117

Very nice. Less tab clutter. RT @nathansmith: RT @faaborg: App Tabs in Firefox 4 Beta 2: http://bit.ly/bkWIHp

2010-07-29 0000

ffmpeg and exiftool are my pals.

2010-07-28 1524

@dave_clark Working in our production suite today because we are getting set up with new live streaming from @kulabyte. (Not with the FS1.)