From Koharu Biyori.

The picture doesn't have anything to do with the topic today. I just wanted to get your attention.

Recently, I've noticed that several spam-busting solutions on other blogs were eating my completely legitimate comments, mostly because I was a "new commenter", which presumably meant that I didn't have enough "good karma" with the spam-solution in question. Normally this would be only a very minor inconvenience at most, but when the CAPTCHA image failed to load in the first place (due to server load, I imagine), I had to rely on the good graces of the blog owner to check through his/her moderated comments manually.

Now, considering the Akismet stats on the amount of spam out there (I don't use Akismet, although not due to anything other than laziness; the current spam solution for Moe Check is Spam Karma 2, for lack of anything better than default), I don't really blame the spam-busters for being overly paranoid. I mean, look at the graph: all that orange is spam, compared to the blue of legit comments, small and lonely. I've been told that this is normal, which makes it even worse, since we have apparently progressed to the point where only one out of ten comments, if that, is legit, and nobody believes that this is odd. I'm well aware that each spam comment is essentially free, or at least has insignificant and trivial cost to the spammer, but all the effort expended in trying to block spam is beginning to feel like trying to inhale a hurricane. It's depressing, and it's even more depressing that I appear to be one of the very few who even think that it's depressing instead of Life As Normal. Zetsuboushita, and all that.

Moe Check receives something along the lines of fifty spam comments a day (down from a hundred a day), which isn't anything compared to the really popular blogs out there, which I've heard get something like a thousand spam comments every day. This means that it's still (barely) possible for me to go through each and every one of the moderated comments every day, checking that no legit comments got accidentally eaten. I worry every night that I've missed something, and some unfortunate commenter out there is wondering why his or her comment hasn't shown up yet, and will never show up.

And if my spam-buster solution is messing up somehow and eating legit comments, it's not like I'd know until I checked. Even if it does happen, it's not like I know what to do to allow these legit comments without working even harder to moderate away all the spam which slips through.

I'm not even going to touch the issue of scrapers and splogs, which I don't have a clue on how to deal with.

In any case, based on observations on how Spam Karma 2 works, I've come up with a few ideas on how to make sure that your comment won't get flagged as spam, either by automation or manually:

  • Use proper English. This is extremely important. If English isn't possible, I also accept comments in Chinese, Malay, and Japanese. Please check your spelling, grammar, and punctuation; the worse it is, the more likely I am to conclude that it is spam. On a related note, if you comment in a language I don't understand (ie anything other than the four listed: English, Chinese, Malay, Japanese), I can't know what it is, and I'll err on the side of paranoia.
  • Make your comment relevant. "I agree" and "I dunno" and "Interesting", with no other references, are one of the favourite tools of spammers. At the very least, say something about the content of the post in question. Just quoting the title is almost a sure sign of a spammer.
  • Don't list Geocities as your website host. This one is Spam Karma 2's problem: Geocities is very blacklisted, possibly because of the large number of ad sites hosted there. Listing a Geocities site pretty much guarantees that your comment will be marked as spam, through no fault of your own.
  • The fewer URLs per comment, the better. Spam Karma 2 deducts points based on the number of URLs in a comment. Three is pretty much the maximum before Spam Karma 2 gets too suspicious. Five is right out.

It's a sort of Catch-22 situation, really: if my spam-buster is eating comments, I'd like to know, but for me to know, the quickest way is for someone to comment, and those affected can't comment due to the spam-buster eating said comments.

EDIT: In what is probably an extreme case of irony, I am locking pings on this post, because in the course of half an hour, I received no less than five pingbacks from various scrapers and splogs. These are pretty easy to weed out, since they invariably go "BlahBlahBlah wrote an interesting post today, here's an excerpt", and then get the blog name wrong.

If I get spam comments on this post anyway, I'll be locking those too.

8 Responses to “Spam A Bit Too Lot”
  1. GreyDuck says:

    Wow, only a 10:1 ratio? I figured it'd be much worse than that.

    I use SK2 as well, but coupled with Bad Behaviour 2, and I almost never get the "please moderate" message in my mailbox anymore. BB2 prevents a large percentage of would-be spammers from even getting to the point of triggering the comment mechanism. Then again, BB2 hasn't had an update in months either and I can see that SK2 is working harder and harder over time as the jackhole spammer bastards find ways around BB2.

    I used to use a plugin called Antileech to deal with some of the content thieves. Unfortunately that plugin breaks under WP22+. Argh.

  2. Aanusha says:

    So I guess my last comment in your previous post's gonna be deleted for NOT BEING relevant???

  3. DKellis says:

    @GreyDuck: 10:1 is bad enough.

    I've heard bad things about Bad Behaviour 2 (mostly it catching too many legit comments), and some stuff about incompatibilities, according to the webmaster of AnimeBlogger.net. Still, I may get desperate enough to try it.

    @Aanusha: No, it's substantial enough to be safely labelled "not spam".

    What I was talking about were the ones which pretty much just go "interesting" or "like it" with no other context, and I have no idea whether they even read the post or are just trying to trick me into passing the comment.

  4. Lysander says:

    I'm not really sure how many spam comments I get per day or anything, but I do get a few hundred stockpiled before managing to sift through them each week. It's like a plague, but Akismet has been very good in differentiating between real posts and spam so far- I've yet to have any mishaps (knock on wood). In this style of website, there's two ways to really gain popularity without spending any money- one being aggregators (and posting often), and the second being to regularly leave comments on other more popular weblogs. Unfortunately both of these methods also attract more and more spambots to your own site.

    In the end this whole game is a balancing act; keeping spam off your weblog without losing legit comments. In a world where technology is in perpetual flux and updates to both spamming methods and anti-spam methods are almost a monthly thing, I can't bring myself to disapprove of those who take the easy route and throw CAPTCHA images up for verification on their weblogs, but it's something I am never going to do myself (those things are incredibly annoying to me). Then again, my weblog is still incredibly small, we'll see how I feel when I'm getting over a thousand spam messages a day I suppose.

    ~Lys

  5. Owen S says:

    I use a double combo of reCAPTCHA and Akismet. Akismet is way better than Spam Karma 2 in taking care of spam trackbacks and pingbacks, so I'd suggest switching over if you hate those (I'm a former SK2 user myself).

    Also, reCAPTCHA cuts the immediate incoming spam by at least 90%. It's an excellent alternative to Bad Behaviour, and I'd suggest trying it out for awhile at least. My blog uses it.

  6. Aanusha says:

    I dunno how often you check that mailbox of yours. So I figured I’d inform you that I’ve sent that Lucky Star manga scan and the Tomoyo—Sakura kissy pic to you. In case you’ve already seen them, well…….what did you think of them??

    P.S—How many visits do you get from India ?? I’m really curious….

  7. Aanusha says:

    Come to think of it, part of it may be because said fangirls seem to be fixated on me as a person, rather than whatever work I created (be it a fanfic or a blog post). Obviously by “fangirls” I’m not using the term to mean every person of a female persuasion who happens to like whatever I do, but rather a concept which I admit is less than charitable, but I have trouble seeing in any other way

    Maybe said fangirls have nothing better to do than go pokey pokey, scratchy scratchy at other peoples lives,yanno? B.T.W– Is a part of your REAL NAME Chen??

  8. j.valdez says:

    Akismet seems to be doing the job for me too. Bayesian spam filtering seems to be the most effective method with the least amount of affect on the commenter. I read an article a while back where the author created a chess game using a spam filter, irrelevant but interesting.

    At any rate, I kind of miss being on blogspot. There I didn't get spam or comments.

  9.  
Trackbacks
  1.  
Leave a Reply

XHTML: You can use these tags: <a href="" title=""> <abbr title=""> <acronym title=""> <b> <blockquote cite=""> <cite> <code> <del datetime=""> <em> <i> <q cite=""> <strike> <strong>