CSS Templates and Blog Spam

1/16/2007 Posted by tmarthal link
So, I was perusing my other blog today and deleted a drug spam comment for the third or fourth time in as many days from one of the more recent posts. In this case, it was my friend Jake dancing in leotards... but the spam comments are always there, always posted on the top message.

Prior to this, the blog spammers, people that post either poker referrals, drug purchase links, 'hot' stocks, online diplomas [almost as bad as the USC DEN program!], or money making schemes all seem to never have posted on this '...' blog. I used to think that it was because it had custom CSS that I use to display the pages, and that comment bots couldn't figure out which link to click when they came to my page.

That can't be the case now though; I think that more and more pennies/day workers are being paid to drive views to sites rather than bots. I have to say that is the case, I had 15 comments my museum post which is pretty crazy considering I get like 1 unique visitor a month here and almost no real traffic [maybe I should talk about the iphone too!?].

To fight this I've turned on the blogger captchas, but still allow anonymous posts. I've also upgraded to the new blogger, but I can't tell any difference (other than the label/tag support [for which I didn't add to my CSS? hrrm]). *shrug*

Labels: , , ,

Post a Comment (8 comment(s))




Comments:

Anonymous andrew At 1/16/2007 8:57 PM:

not to take anything away from your awesome stylesheet and its sweet bot fighting abilities, but i don't think the bots care or have ever cared about it. I bet they just automagically post, via "/comment.do", in N time after getting pinged from a ping server.

I could probably write the script in almost no time.

Blogger tmarthal At 1/17/2007 12:02 PM:

I thought about that, but I think the blog to post id map should take care of that (by returning null results). but then i have no idea how the 'postID' blogger variable is generated. meaning, as i post this i am posting the comment.g script on https://www2.blogger.com using the two variables

blogID=14597313
postID=7237064045425469409

Which means that if you have an automated script, then how the hell can it match up the postID to the blogID? Try other combinations, they won't work.

Unless they loop through all those 18 digit integer hashes to try and find the one that works.

Anonymous andrew At 1/17/2007 1:00 PM:

there is a one to one static correlation between your post permalink and the blogId/postId parameters.

they could request the permalink and extract the blogId/postId values.

which means there is a small amount of html scraping, but no css impact.

Blogger tmarthal At 1/17/2007 3:02 PM:

Uhh, the point is that there is no mapping. There is no way to extract the postID from the tmarthal.blogspot.com URL back to the blogger.com comment URL. It's just not feasible.

However!

What you say is correct, and leads to my original point. Since I did not use an existing CSS Template when creating these pages, you can't use any sort of typical scraper to scrape the html!

However, they had to've updated thier stuff to use more robust scraping which was the point of the whole post.

Anonymous andrew At 1/18/2007 12:51 AM:

uhh, mistaken, the point is that there is a mapping. there is a way to extract the postID from the permalink. It is feasible.

Since we are not communicating in english, lets try ruby. 45 minutes of coding produces a way to map them.

http://www.thirstymind.org/scripts/bloggerMapping.rb

Blogger tmarthal At 1/18/2007 1:28 PM:

They must have changed things when I updated to the blogger beta. if you notice in that link that you scrape out of the html, you are looking for a "comment-link" class. on my previous blog, I removed that class generated in the css... but they must have put it back (since I am using thier $BlogItemCommentCreate$ variable). :\

I since re-removed it: the html generated by the blogger software doesnt label the 'Post a Comment' link with "comment-link" class. That should help. Maybe I'll turn off the captchas again to see [they are turned off]!

So we will see.

Anonymous andrew At 1/18/2007 2:05 PM:

I only used the comment-link class as a simple way to find the url.

I could have also looked for any href with a baseurl of http://www.blogger.com/comment.g (which I assume sploggers would do, instead of rely on a class identifier) and still successfully map a permalink to the postID.

CSS has nothing to do with being able to map a permalink to a postID. It can be mapped with or without CSS.

Anonymous Anonymous At 12/12/2009 8:52 AM:

your site is loading rapidly


[ April 2005 | May 2005 | June 2005 | July 2005 | August 2005 | September 2005 | October 2005 | November 2005 | December 2005 | January 2006 | February 2006 | June 2006 | August 2006 | September 2006 | January 2007 | May 2007 | July 2007 | September 2007 | October 2007 | November 2007 | February 2008 | March 2008 | April 2008 | November 2008 | ]
Subscribe to Posts [Atom]

Creative Commons License tmarthal
This work is licensed under a Creative Commons Attribution 3.0 United States License.
[ written blog : moblog ]