Life in the world of blogspam

In the blog world, it’s no surprise that blog spam has become an ever-increasing problem. Spammers post links in comments on blogs, and if the links stay put, they can help boost the linked site’s ranking in Google. Annoying as all git out.
Fortunately, we have tools like Jay Allen’s MT-Blacklist to help us poor bloggers keep our blogs clean.
Unfortunately, MT-Blacklist doesn’t have much in the way of reporting or analysis tools, so like a good doobie, I whipped one up. The short version: in the last 5 days, we’ve blocked several hundred attempted spams, and as we populate the blacklist database, more of them will be stopped by the filters. It’s email all over again.


The report is written in PHP and simply asks the database for details on the
spam it’s been managing. Here’s what the output looks like:

Some numbers...
----------------------------------------------------------------------------------------
Total items in blacklist: 	3021
Total rows in logfile: 		739
Oldest blacklist entry 		2005-05-30 03:14:07
Newest blacklist entry 		2005-06-04 03:50:14
----------------------------------------------------------------------------------------
Top 10 items...
-------+----------------------+---------------------+--------------------
HitCnt | String               | Created on          | Last hit
-------+----------------------+---------------------+--------------------
   146 | (online)[w-_.]*cas   | 2005-05-30 03:14:15 | 2005-06-03 05:39:04
    30 | nofatonline.com      | 2005-06-02 17:39:53 | 2005-06-03 22:01:32
    19 | texas-hold-em        | 2005-05-30 03:14:14 | 2005-06-03 09:47:51
     9 | payday-loan          | 2005-05-30 03:14:12 | 2005-05-31 13:52:34
     7 | progressiveupdate.ne | 2005-06-01 02:58:31 | 2005-06-03 00:29:41
     4 | p00d.com             | 2005-06-02 23:21:24 | 2005-06-03 22:05:42
     3 | djsdesigns.net       | 2005-06-03 12:49:32 | 2005-06-03 22:53:36
     2 | analloverz.com       | 2005-05-30 03:14:08 | 2005-06-01 21:01:51
     2 | sex4dollar.com       | 2005-05-30 03:14:13 | 2005-06-03 23:02:23
     2 | sexushost.com        | 2005-05-30 03:14:13 | 2005-06-01 19:19:58
-------+----------------------+---------------------+--------------------
Most recent general activity...
----------------------------------------------------------------------------------------
2005-06-04 04:23:35 | String greek-match.p00d.com was removed
2005-06-04 04:23:35 | String dating-sites.p00d.com was removed
2005-06-04 04:23:35 | String jewish-singles.p00d.com was remov
2005-06-04 04:23:35 | String single.p00d.com was removed from
2005-06-04 04:23:35 | String matchmaking-institute.p00d.com wa
2005-06-04 04:23:35 | String worldwide-matchmaking.p00d.com wa
2005-06-04 04:23:35 | String banff-matchmaking.p00d.com was re
2005-06-04 04:23:35 | String death-matches.p00d.com was remove
2005-06-04 04:23:34 | String color-match.p00d.com was removed
2005-06-04 04:23:34 | String single-greek.p00d.com was removed
----------------------------------------------------------------------------------------
Most recent denials...
----------------------------------------------------------------------------------------
2005-06-04 04:02:23 | ping denial on 'Planet Ge...">sex4dollar.com
2005-06-04 03:01:32 | ping denial on 'Emergent ...>nofatonline.com
2005-06-04 02:36:48 | ping denial on 'Planet Ge...>selectedsex.com
2005-06-04 01:18:28 | ping denial on 'Planet Ge...>nofatonline.com
2005-06-04 01:12:04 | ping denial on 'Planet Ge...>nofatonline.com
2005-06-04 00:57:35 | ping denial on 'Planet Ge...>nofatonline.com
2005-06-04 00:48:13 | ping denial on 'Emergent ...>nofatonline.com
2005-06-03 23:11:13 | comment denial on 'Planet...singles.p00d.com
2005-06-03 23:11:10 | comment denial on 'Planet...ting-uk.p00d.com
2005-06-03 23:05:50 | ping denial on 'Stonekeep...>nofatonline.com
----------------------------------------------------------------------------------------

A lot of good information, actually, and the report is dynamic, so I can run it
anytime. I’ll probably clean up the code a bit and publish it, but it’s
nice to see that a) MT-Blacklist is actually doing something, and b) just
how much blocking is going on because of it.
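
Until the real code is cleaned up, here is a rough sketch of how the "Some numbers" and "Top 10 items" parts of a report like this could be put together. It is not the actual script: the real one is PHP talking to the blog's database, while this uses Python with an in-memory SQLite database so it runs on its own. The table names (blacklist, log) and column names (pattern, hit_count, created_on, last_hit) are made up for illustration and are not MT-Blacklist's real schema. The sample rows are copied from the report above.

```python
import sqlite3


def make_demo_db():
    """Build an in-memory stand-in database (hypothetical schema, not MT-Blacklist's)."""
    conn = sqlite3.connect(":memory:")
    conn.execute(
        "CREATE TABLE blacklist ("
        "pattern TEXT, hit_count INTEGER, created_on TEXT, last_hit TEXT)"
    )
    conn.execute("CREATE TABLE log (logged_on TEXT, message TEXT)")
    conn.executemany(
        "INSERT INTO blacklist VALUES (?, ?, ?, ?)",
        [
            ("nofatonline.com", 30, "2005-06-02 17:39:53", "2005-06-03 22:01:32"),
            ("texas-hold-em", 19, "2005-05-30 03:14:14", "2005-06-03 09:47:51"),
            ("payday-loan", 9, "2005-05-30 03:14:12", "2005-05-31 13:52:34"),
        ],
    )
    conn.executemany(
        "INSERT INTO log VALUES (?, ?)",
        [
            ("2005-06-04 03:01:32", "ping denial"),
            ("2005-06-03 23:11:13", "comment denial"),
        ],
    )
    return conn


def build_report(conn, top_n=10):
    """Return the summary numbers and the top-N table as a list of text lines."""
    cur = conn.cursor()
    total, oldest, newest = cur.execute(
        "SELECT COUNT(*), MIN(created_on), MAX(created_on) FROM blacklist"
    ).fetchone()
    log_rows = cur.execute("SELECT COUNT(*) FROM log").fetchone()[0]

    lines = [
        f"Total items in blacklist: {total}",
        f"Total rows in logfile: {log_rows}",
        f"Oldest blacklist entry: {oldest}",
        f"Newest blacklist entry: {newest}",
        "HitCnt | String               | Created on          | Last hit",
    ]

    # Most-hit patterns first; ties are broken alphabetically by pattern.
    rows = cur.execute(
        "SELECT hit_count, pattern, created_on, last_hit FROM blacklist "
        "ORDER BY hit_count DESC, pattern ASC LIMIT ?",
        (top_n,),
    )
    for hits, pattern, created, last_hit in rows:
        # Cut the pattern to 20 characters so the columns stay lined up.
        lines.append(f"{hits:>6} | {pattern[:20]:<20} | {created} | {last_hit}")
    return lines


if __name__ == "__main__":
    print("\n".join(build_report(make_demo_db())))
```

Keeping the timestamps as "YYYY-MM-DD HH:MM:SS" text means MIN and MAX sort them correctly as plain strings, which is why the sketch gets away without any date parsing.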
