What's wrong with this picture?

Old 26-02.-2008, 02:43 AM   #1
Squint
Registered User
 
Join Date: Jul 2003
Posts: 334
Lightbulb What's wrong with this picture?

After: [image]

Before: [image]

The solution: [image]

Basically, I created a filter that makes threads disappear by keyword. Now it's as if the spam threads were never there.
Old 26-02.-2008, 03:29 AM   #2
rmur17
Registered User
 
Join Date: Oct 2004
Posts: 926
Default Re: What's wrong with this picture?

here's a talent!
__________________
rmur
Old 26-02.-2008, 04:39 AM   #3
thekgb
Registered User
 
Join Date: Dec 2007
Posts: 50
Default Re: What's wrong with this picture?

When is it gonna start working? I still see a lot of junk/spam on there. Do we need more keywords? Those damn adidas aren't selling anyway, can we get rid of them?!

Mike



Quote:
Originally Posted by rmur17
here's a talent!
Old 26-02.-2008, 04:46 AM   #4
AngryPenguin
Registered User
 
Join Date: Jan 2008
Location: UK
Posts: 50
Default Re: What's wrong with this picture?

So that's a built-in Firefox thing then? Another reason to switch from IE, I suppose.
__________________
It's not what you've got, it's what you do with it.
Old 26-02.-2008, 05:39 AM   #5
Squint
Registered User
 
Join Date: Jul 2003
Posts: 334
Default Re: What's wrong with this picture?

It's Proxomitron, an ad filtering program that I've been using for years now. But only today did I finally learn how to create my own filters for it. It works by running as a proxy server on your own computer and it will work with any browser.

Basically, any webpage is a text file that's downloaded by your browser and loaded. As such, you have ultimate control over its content. Proxomitron sees the file first and in essence does a search and replace of the content before passing it on to the web browser.

So what I did was figure out how each thread is represented in HTML. The HTML code for each thread is searched, and if certain keywords are found within it, the whole section of code is discarded. The result is that the spam threads are completely excised. Not even a blank line or space is left.
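
The idea in the paragraph above can be sketched in a few lines of Python. This is not how Proxomitron is implemented; it is only an illustration under stated assumptions: each thread is one non-nested `<tr>...</tr>` row, matching is a plain case-insensitive substring test, and `drop_rows` and the sample page are made-up names for the example.

```python
import re

# One table row, from "<tr" to the first "</tr>", plus trailing whitespace.
# Assumption: rows are not nested inside each other.
ROW = re.compile(r"<tr\b.*?</tr>\s*", re.DOTALL | re.IGNORECASE)

def drop_rows(html, keywords):
    """Return html with every row that mentions a keyword removed entirely."""
    lowered = [k.lower() for k in keywords]

    def keep_or_drop(match):
        row = match.group(0)
        if any(k in row.lower() for k in lowered):
            return ""   # excise the whole row, trailing whitespace included
        return row      # leave other rows untouched

    return ROW.sub(keep_or_drop, html)

# Hypothetical thread listing with one spam row.
page = (
    "<table>\n"
    "<tr><td>Power meter calibration</td></tr>\n"
    "<tr><td>Cheap adidas shoes wholesale</td></tr>\n"
    "<tr><td>Interval training question</td></tr>\n"
    "</table>"
)
cleaned = drop_rows(page, ["adidas", "wholesale"])
print(cleaned)
```

Because the row and the whitespace after it are removed together, nothing is left behind where the spam thread used to be.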

I'm cleaning up the other forums now. Bikeforums, for example, puts an ad posting in between the 2nd and 3rd postings in each thread. You can block the images with AdBlock but you'll still have text and the skeletal remains of the ad posting. But I've managed to eliminate them completely.
Old 28-02.-2008, 04:00 AM   #6
Squint
Registered User
 
Join Date: Jul 2003
Posts: 334
Default Re: What's wrong with this picture?

If anyone's interested in setting up Proxomitron on their PCs, I'm willing to help. It's actually quite easy to do and is the best web filtering program out there. It's not that user friendly but you can continue using your existing filtering on top of it and only use the filters specific for cyclingforums.

I've created filters for several cycling forums and news websites that remove floating ads, Flash animations and basically anything annoying.

Proxomitron is free and I have no affiliation with it. In fact, I don't think anybody does because its author may have passed away.
Old 28-02.-2008, 09:00 AM   #7
Piotr
Registered User
 
Join Date: Jan 2007
Location: Draper, Utah
Posts: 400
Default Re: What's wrong with this picture?

Quote:
Originally Posted by Squint
If anyone's interested in setting up Proxomitron on their PCs, I'm willing to help. It's actually quite easy to do and is the best web filtering program out there. It's not that user friendly but you can continue using your existing filtering on top of it and only use the filters specific for cyclingforums.

I've created filters for several cycling forums and news websites that remove floating ads, Flash animations and basically anything annoying.

Proxomitron is free and I have no affiliation with it. In fact, I don't think anybody does because its author may have passed away.
I think this post should be a sticky, but since nobody's removing spam there's a slim chance of that happening. I think we should just keep bumping it until a majority of active users make a decision regarding the filtering software. I'll give it a try, but I'm afraid that if only a few of us install it, we may become oblivious to the problem that will eventually drive many users away.
__________________
blog
Old 28-02.-2008, 12:37 PM   #8
Squint
Registered User
 
Join Date: Jul 2003
Posts: 334
Default Re: What's wrong with this picture?

Installation instructions and help pages:

http://www.proxomitron.info/45/help/Contents.html

Download area:

http://www.proxomitron.info/files/index.html

I downloaded the ProxN45j.zip version. No add-ons or textures. In fact, one of the first things I did was turn off the textures because they were a little too colorful.

After installation, open up default.cfg and add the following lines in the Blocklists and Patterns sections. Or you can use the merge command from within the program. Remember that merging filters or changing filters doesn't become permanent until you do File|"Save Default Settings."

This will add the two filters specific to our spam problem here.

Code:
[Blocklists]
List.Jerks = "..\Lists\Jerks List.txt"

[Patterns]
Name = "Cycling Forums 2"
Active = TRUE
URL = "([^/]++.|)cyclingforums.com"
Limit = 2000
Match = "<img class="inlineimg" src="images/icons/icon[0-9].gif" alt="*" border="0" />"
        "*\"member.php\?find=lastposter\&amp;f=[0-9]+\">$LST(Jerks)</a>*"
        "</a>"

Name = "Cycling Forums"
Active = TRUE
URL = "([^/]++.|)cyclingforums.com"
Bounds = "$NEST(<tr,</tr>)"
Limit = 2000
Match = "*<span style=\"cursor:pointer\" onclick=\"window.open\('member.php\?u=[0-9]+'\)">$LST(Jerks)</span>*"


The second and final step is to create a text file named Jerks List.txt in the "Lists" sub-directory in the Proxomitron program folder. This is a list of the usernames of spammers:

Code:
clothes
jqyshoes
mysneaker
shoesnewnike
phonecity
goodsaaa
lindaxu_88
teslim452


There aren't a whole lot of them and since they aren't being banned or otherwise prevented from posting, I don't think they'll register any new accounts. If necessary, I can modify the filter to filter by keywords. Currently, it only goes by usernames because a legitimate posting talking about "clothes" would be filtered out.
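
To make the "clothes" problem concrete, here is a small Python sketch. It is not the Proxomitron filter itself; the thread data, the `regular_rider` username and both function names are invented for illustration. It compares matching a blocklist entry against the poster name only with matching it anywhere in the thread's text.

```python
# A few usernames from the list above.
JERKS = {"clothes", "jqyshoes", "mysneaker"}

# Hypothetical thread listing: one spam thread, one legitimate thread.
threads = [
    {"title": "nike shoes for sale", "poster": "clothes"},
    {"title": "Winter clothes for long rides", "poster": "regular_rider"},
]

def keep_by_poster(threads, jerks):
    """Keep a thread unless its poster is on the list (what the filter does now)."""
    return [t for t in threads if t["poster"].lower() not in jerks]

def keep_by_text(threads, words):
    """Keep a thread unless a listed word appears anywhere in title or poster."""
    return [
        t for t in threads
        if not any(w in (t["title"] + " " + t["poster"]).lower() for w in words)
    ]

print([t["title"] for t in keep_by_poster(threads, JERKS)])
# ['Winter clothes for long rides']
print([t["title"] for t in keep_by_text(threads, JERKS)])
# []
```

Matching on the poster alone keeps the legitimate thread; matching the same words anywhere in the text removes it along with the spam.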

Last but not least are the filters I created for several cycling related websites. In conjunction with the default header filters (something different from web filters), they really clean up the sites.

Code:
[Patterns]
Name = "Bikeforums 3"
Active = TRUE
URL = "([^/]++.|)bikeforums.net"
Limit = 7000
Match = "<!-- Right Column STOP -->*<!-- Right Column STOP -->"

Name = "Bikeforums 4"
Active = TRUE
URL = "([^/]++.|)bikeforums.net"
Bounds = "$NEST(<div class="floatedAd">,</div>)"
Limit = 1500
Match = "*doubleclick.net*"

Name = "Bikeforums 2"
Active = TRUE
URL = "([^/]++.|)bikeforums.net"
Bounds = "$NEST(<SCRIPT LANGUAGE="JavaScript">,</table>)"
Limit = 800
Match = "*doubleclick.net*|*www.bikeforums.net/ads*"
Replace = "</table>"

Name = "Bikeforums"
Active = TRUE
URL = "([^/]++.|)bikeforums.net"
Bounds = "$NEST(<!-- ads between 2nd and 3rd post -->,<!-- /ads between 2nd and 3rd post -->)"
Limit = 4000
Match = "*Support our Sponsors*"

Name = "Weightweenies"
Active = TRUE
URL = "([^/]++.|)weightweenies.starbike.com"
Bounds = "$NEST(<td,</td>)"
Limit = 2000
Match = "*ad.gif*"

Name = "Slowtwitch ads 1"
Active = TRUE
URL = "forum.slowtwitch.com"
Bounds = "$NEST(<!-- BEGIN ADVERTPRO CODE BLOCK -->, <!-- END ADVERTPRO CODE BLOCK -->)"
Limit = 1000
Match = "*"

Name = "Slowtwitch image resize"
Active = TRUE
URL = "forum.slowtwitch.com"
Bounds = "$NEST(<script, </script>)"
Limit = 1000
Match = "*function resizeImage*"

Name = "Serotta forum"
Active = TRUE
URL = "([^/]++.|)serotta.com"
Bounds = "$NEST(<table,</table>)"
Limit = 2000
Match = "*www.serotta.com/Merchant2/merchant.mvc*"

Name = "Accuweather video removal 2"
Active = TRUE
URL = "([^/]++.|)accuweather.com"
Bounds = "$NEST(<div id="content_column_300right" >,<!-- this closing div closes the body div \(page_container\) /-->)"
Limit = 12000
Match = "*adcVideoComponentV2d*"

Name = "Accuweather banner"
Active = TRUE
URL = "([^/]++.|)accuweather.com"
Bounds = "$NEST(<div id=\"topAdContainer\">,</div>\s</div>\s</div>)"
Limit = 400
Match = "*<!-- Advertisment bar -->*"

Name = "Accuweather ad box"
Active = TRUE
URL = "([^/]++.|)accuweather.com"
Limit = 1000
Match = "<div id=\"*UpSaleBox660\">*<a name="graph" id="graph"></a>"
Replace = "<a name="graph" id="graph"></a>"

Name = "Accuweather download center"
Active = TRUE
URL = "([^/]++.|)accuweather.com"
Bounds = "$NEST(<div id=\",</div>\s</div>)"
Limit = 400
Match = "*vortex.accuweather.com/adc2004/pub/images/promos/downloadCenterPromo660.gif*"

Name = "Accuweather Flash ads 2"
Active = TRUE
URL = "([^/]++.|)accuweather.com"
Bounds = "$NEST(<!-- Quantcast Tag -->,<!-- End Quantcast tag -->)"
Limit = 600
Match = "*"

Name = "Accuweather Flash ads"
Active = TRUE
URL = "([^/]++.|)accuweather.com"
Bounds = "$NEST(<script,</script>)"
Limit = 2000
Match = "*Flash*|*ads*"

Name = "Cyclingnews 3"
Active = TRUE
URL = "([^/]++.|)cyclingnews.com"
Bounds = "$NEST(<tr>,</tr>)"
Limit = 2000
Match = "*flash*"

Name = "Pez ads 1"
Active = TRUE
URL = "([^/]++.|)pezcyclingnews.com"
Bounds = "$NEST(<td,</td>)"
Limit = 400
Match = "*www.pezcyclingnews.com:8080*"

Name = "Pez banner"
Active = TRUE
URL = "([^/]++.|)pezcyclingnews.com"
Bounds = "$NEST(<object classid,</object>)"
Limit = 700
Match = "*flash*"

Name = "Velonews banner 3"
Active = FALSE
URL = "([^/]++.|)velonews.com"
Limit = 2000
Match = "<div id = \"body-container\">"

Name = "Velonews banner 4"
Active = TRUE
URL = "([^/]++.|)velonews.com"
Limit = 2000
Match = "<div id = \"circ-leader\"><img src = '/themes/velonews/images/circ_leader.gif' usemap = '#circ-leader-map' alt = 'Magazine Image' /></div>"
Replace = "<div id = \"circ-leader\"></div>"

Name = "Velonews ads 1"
Active = TRUE
URL = "([^/]++.|)velonews.com"
Bounds = "$NEST(<div class,</iframe></div></div>)"
Limit = 1000
Match = "*thruport.com*"

Name = "Velonews banner 1"
Active = TRUE
URL = "([^/]++.|)velonews.com"
Bounds = "$NEST(<div id = "ad-leader">,></iframe></div>)"
Limit = 1000
Match = "*thruport.com*"

Name = "Velonews banner 2"
Active = TRUE
URL = "([^/]++.|)velonews.com"
Limit = 1000
Match = "<map name = 'circ-leader-map' id = 'circ-leader-map'>*</map>"

Name = "Cyclingnews 2"
Active = TRUE
URL = "([^/]++.|)cyclingnews.com"
Limit = 400
Match = "<tr>*clickthru*</tr>"

Name = "Cyclingnews right ad column"
Active = TRUE
URL = "([^/]++.|)cyclingnews.com"
Bounds = "$NEST(<!--RIGHT HAND AD COLUMN START -->,<!--RIGHT HAND AD COLUMN END -->)"
Limit = 12000
Match = "*"

Name = "Cyclingnews ads 3"
Active = TRUE
URL = "([^/]++.|)cyclingnews.com"
Bounds = "$NEST(<a href,</a>)"
Limit = 400
Match = "*economicoutlook.net*"

Name = "Cyclingnews ads 2"
Active = TRUE
URL = "([^/]++.|)cyclingnews.com"
Bounds = "$NEST(<tr,</tr>)"
Limit = 600
Match = "*economicoutlook.net*"

Name = "Cyclingnews Javascript 1"
Active = TRUE
URL = "([^/]++.|)cyclingnews.com"
Bounds = "$NEST(<script,</script>)"
Limit = 2000
Match = "*disableclick*"

Name = "Cyclingnews"
Active = TRUE
URL = "([^/]++.|)cyclingnews.com"
Bounds = "$NEST(<td,</(tr|td)>)"
Limit = 2000
Match = "*cyclingnews.com/clickthru*"
Old 28-02.-2008, 03:45 PM   #9
Alex Simmons
Registered User
 
Join Date: Mar 2006
Location: Sydney, Australia
Posts: 1,571
Default Re: What's wrong with this picture?

This is seriously tempting....
Old 28-02.-2008, 11:29 PM   #10
frenchyge
Registered User
 
Join Date: Apr 2005
Location: Kansas City, USA
Posts: 3,567
Default Re: What's wrong with this picture?

Wow, blocking spam is so easy, a site admin could do it. So, when all the spam is filtered out, that leaves.... what, just the stickied threads on the first page?

The real threads from later pages don't percolate back to the top and take the places of the blocked threads, do they?
Old 29-02.-2008, 02:11 AM   #11
Squint
Registered User
 
Join Date: Jul 2003
Posts: 334
Default Re: What's wrong with this picture?

Quote:
Originally Posted by frenchyge
Wow, blocking spam is so easy, a site admin could do it. So, when all the spam is filtered out, that leaves.... what, just the stickied threads on the first page?

The real threads from later pages don't percolate back to the top and take the places of the blocked threads, do they?


Actually, they aren't displaced by the excised spam threads. At least for me, the front page shows the stickied threads and the last 3 days of threads regardless of the number of threads. At the bottom of page one is the 300PT accuracy thread from Feb. 25 regardless of whether the spam threads are present or not. The front page is about half as long w/o the spam threads.
Old 29-02.-2008, 02:43 PM   #12
Piotr
Registered User
 
Join Date: Jan 2007
Location: Draper, Utah
Posts: 400
Default Re: What's wrong with this picture?

Very nice, very nice indeed.
__________________
blog
Old 29-02.-2008, 05:10 PM   #13
Squint
Registered User
 
Join Date: Jul 2003
Posts: 334
Default Re: What's wrong with this picture?

Updated list of spammers for the Jerks List.txt file:

Code:
ann110
bigbrother11
clothes
crimi.check
darasure
Dehumidifier-Da
doorstep1
goodsaaa
greall
jamesbikerblog
jqyshoes
king001
lindaxu_88
mysneaker
phonecity
phonelimited
shoesnewnike
shoesperfect
teslim452
topsales
voogooddeal


There are only a few that spam this sub-forum but I went to other sub-forums to test out the filter and gathered more names of offenders.

An additional filter for Accuweather. This goes in [Patterns] in default.cfg or you can use the merge function in Proxomitron.

Code:
Name = "Accuweather left column"
Active = TRUE
URL = "([^/]++.|)accuweather.com"
Limit = 16000
Match = "<div class=\"content_box_[0-9]+header\">"
        "*Weather News*"
        "<!-- End Monthly Features /-->"
Old 01-03.-2008, 03:10 AM   #14
pascalhayes
Registered User
 
Join Date: Oct 2004
Posts: 11
Default Re: What's wrong with this picture?

Quote:
Originally Posted by Piotr
Very nice, very nice indeed.
Yes; it is very satisfying to see the spammers defeated and to open this forum and see nothing but power and cycling-related threads. It was worth the time it took me to get Proxomitron going, and many thanks to Squint for his patience in helping me get it functional.

Scott Simpson
Old 02-03.-2008, 01:14 AM   #15
Squint
Registered User
 
Join Date: Jul 2003
Posts: 334
Default Re: What's wrong with this picture?

I created a few new filters and tweaked others. I also renamed them so they're more descriptive and added dates to the names so it's easier to discern which version you have. The complete list of cycling-related filters:

Code:
[Patterns]
Name = "Bikeforums right ad column (Mar. 1, 2008)"
Active = TRUE
URL = "([^/]++.|)bikeforums.net"
Limit = 7000
Match = "<!-- Right Column STOP -->*<!-- Right Column STOP -->"

Name = "Bikeforums floating ads (Mar. 1, 2008)"
Active = TRUE
URL = "([^/]++.|)bikeforums.net"
Bounds = "$NEST(<div class="floatedAd">,</div>)"
Limit = 1500
Match = "*doubleclick.net*"

Name = "Bikeforums ads (Mar. 1, 2008)"
Active = TRUE
URL = "([^/]++.|)bikeforums.net"
Bounds = "$NEST(<SCRIPT LANGUAGE="JavaScript">,</table>)"
Limit = 800
Match = "*doubleclick.net*|*www.bikeforums.net/ads*"
Replace = "</table>"

Name = "Bikeforums inline ads (Mar. 1, 2008)"
Active = TRUE
URL = "([^/]++.|)bikeforums.net"
Bounds = "$NEST(<!-- ads between 2nd and 3rd post -->,<!-- /ads between 2nd and 3rd post -->)"
Limit = 4000
Match = "*Support our Sponsors*"

Name = "Weightweenies ads (Mar. 1, 2008)"
Active = TRUE
URL = "([^/]++.|)weightweenies.starbike.com"
Bounds = "$NEST(<td,</td>)"
Limit = 2000
Match = "*ad.gif*"

Name = "Slowtwitch ads (Mar. 1, 2008)"
Active = TRUE
URL = "forum.slowtwitch.com"
Bounds = "$NEST(<!-- BEGIN ADVERTPRO CODE BLOCK -->, <!-- END ADVERTPRO CODE BLOCK -->)"
Limit = 1000
Match = "*"

Name = "Slowtwitch disable auto image resize (Mar. 1, 2008)"
Active = TRUE
URL = "forum.slowtwitch.com"
Bounds = "$NEST(<script, </script>)"
Limit = 1000
Match = "*javascript*"

Name = "Serotta Forum ads (Mar. 1, 2008)"
Active = TRUE
URL = "([^/]++.|)serotta.com"
Bounds = "$NEST(<table,</table>)"
Limit = 2000
Match = "*www.serotta.com/Merchant2/merchant.mvc*"

Name = "Accuweather left ad column (Mar. 1, 2008)"
Active = TRUE
URL = "([^/]++.|)accuweather.com"
Limit = 16000
Match = "<div class=\"content_box_[0-9]+header\">"
        "*Weather News*"
        "<!-- End Monthly Features /-->"

Name = "Accuweather video removal (Mar. 1, 2008)"
Active = TRUE
URL = "([^/]++.|)accuweather.com"
Bounds = "$NEST(<div id="content_column_300right" >,<!-- this closing div closes the body div \(page_container\) /-->)"
Limit = 12000
Match = "*adcVideoComponentV2d*"

Name = "Accuweather banner (Mar. 1, 2008)"
Active = TRUE
URL = "([^/]++.|)accuweather.com"
Bounds = "$NEST(<div id=\"topAdContainer\">,</div>\s</div>\s</div>)"
Limit = 400
Match = "*<!-- Advertisment bar -->*"

Name = "Accuweather ad box 2 (Mar. 1, 2008)"
Active = TRUE
URL = "([^/]++.|)accuweather.com"
Limit = 1000
Match = "<div id=\"*UpSaleBox660\">*<a name="graph" id="graph"></a>"
Replace = "<a name="graph" id="graph"></a>"

Name = "Accuweather ad box 1 (Mar. 1, 2008)"
Active = TRUE
URL = "([^/]++.|)accuweather.com"
Bounds = "$NEST(<div id=\",</div>\s</div>)"
Limit = 400
Match = "*vortex.accuweather.com/adc2004/pub/images/promos/downloadCenterPromo660.gif*"

Name = "Accuweather Flash ads 2 (Mar. 1, 2008)"
Active = TRUE
URL = "([^/]++.|)accuweather.com"
Bounds = "$NEST(<!-- Quantcast Tag -->,<!-- End Quantcast tag -->)"
Limit = 600
Match = "*"

Name = "Accuweather Flash ads (Mar. 1, 2008)"
Active = TRUE
URL = "([^/]++.|)accuweather.com"
Bounds = "$NEST(<script,</script>)"
Limit = 2000
Match = "*Flash*|*ads*"

Name = "Pez ads (Mar. 1, 2008)"
Active = TRUE
URL = "([^/]++.|)pezcyclingnews.com"
Bounds = "$NEST(<td,</td>)"
Limit = 400
Match = "*www.pezcyclingnews.com:8080*"

Name = "Pez banner (Mar. 1, 2008)"
Active = TRUE
URL = "([^/]++.|)pezcyclingnews.com"
Bounds = "$NEST(<object classid,</object>)"
Limit = 700
Match = "*flash*"

Name = "Velonews banner 3 (Mar. 1, 2008)"
Active = TRUE
URL = "([^/]++.|)velonews.com"
Limit = 2000
Match = "<div id = \"circ-leader\"><img src = '/themes/velonews/images/circ_leader.gif' usemap = '#circ-leader-map' alt = 'Magazine Image' /></div>"
Replace = "<div id = \"circ-leader\"></div>"

Name = "Velonews ads (Mar. 1, 2008)"
Active = TRUE
URL = "([^/]++.|)velonews.com"
Bounds = "$NEST(<div id = ,</iframe></div>)"
Limit = 1000
Match = "*thruport.com*"

Name = "Velonews banner 1 (Mar. 1, 2008)"
Active = TRUE
URL = "([^/]++.|)velonews.com"
Bounds = "$NEST(<div id = "ad-leader">,></iframe></div>)"
Limit = 1000
Match = "*thruport.com*"

Name = "Velonews banner 2 (Mar. 1, 2008)"
Active = TRUE
URL = "([^/]++.|)velonews.com"
Limit = 1000
Match = "<map name = 'circ-leader-map' id = 'circ-leader-map'>*</map>"

Name = "Cyclingnews right ad column (Mar. 1, 2008)"
Active = TRUE
URL = "([^/]++.|)cyclingnews.com"
Bounds = "$NEST(<!--RIGHT HAND AD COLUMN START -->,<!--RIGHT HAND AD COLUMN END -->)"
Limit = 12000
Match = "*"

Name = "Cyclingnews ad filter 3 (Mar. 1, 2008)"
Active = TRUE
URL = "([^/]++.|)cyclingnews.com"
Bounds = "$NEST(<a href,</a>)"
Limit = 400
Match = "*economicoutlook.net*"

Name = "Cyclingnews ad filter 2 (Mar. 1, 2008)"
Active = TRUE
URL = "([^/]++.|)cyclingnews.com"
Bounds = "$NEST(<tr,</tr>)"
Limit = 600
Match = "*economicoutlook.net*"

Name = "Cyclingnews Javascript enable right-click on images (Mar. 1, 2008)"
Active = TRUE
URL = "([^/]++.|)cyclingnews.com"
Bounds = "$NEST(<script,</script>)"
Limit = 2000
Match = "*disableclick*"

Name = "Cyclingnews ad filter 1 (Mar. 1, 2008)"
Active = TRUE
URL = "([^/]++.|)cyclingnews.com"
Bounds = "$NEST(<td,</(tr|td)>)"
Limit = 2000
Match = "*cyclingnews.com/clickthru*"

Name = "Cyclingnews disable Flash animations (Mar. 1, 2008)"
Active = TRUE
URL = "([^/]++.|)cyclingnews.com"
Bounds = "$NEST(<tr>,</tr>)"
Limit = 2000
Match = "*flash*"

Name = "Cycling Forums last post keyword filter (Mar. 1, 2008)"
Active = FALSE
URL = "([^/]++.|)cyclingforums.com"
Limit = 1000
Match = "<div style="clear:both">"
        "*$LST(Cyclingforums_keywords)*"
        "</div>\s</div>\s</td>"
Replace = "<div style="clear:both"></div></div></td>"

Name = "Cycling Forums last post username filter (Mar. 1, 2008)"
Active = TRUE
URL = "([^/]++.|)cyclingforums.com"
Limit = 1000
Match = "<div style="clear:both">"
        "*\"member.php\?find=lastposter\&amp;f=[0-9]+\">$LST(Jerks)</a>*"
        "</div>\s</div>\s</td>"
Replace = "</div></div></td>"

Name = "Cycling Forums keyword filter (Mar. 1, 2008)"
Active = FALSE
URL = "([^/]++.|)cyclingforums.com"
Bounds = "$NEST(<tr,</tr>)"
Limit = 2000
Match = "*$LST(Cyclingforums_keywords)*&(^*member.php\?find=lastposter\&amp;f)"

Name = "Cycling Forums username filter (Mar. 1, 2008)"
Active = TRUE
URL = "([^/]++.|)cyclingforums.com"
Bounds = "$NEST(<tr,</tr>)"
Limit = 2000
Match = "*<span style=\"cursor:pointer\" onclick=\"window.open\('member.php\?u=[0-9]+'\)">$LST(Jerks)</span>*"
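
For anyone new to the filter format, here is a rough Python analogy of what a filter with Bounds, Limit and Match appears to do. Bounds picks out a candidate block, Limit caps how large that block may be, and the block is removed (or swapped for the Replace text) only if Match is found inside it. This is a simplified reading, not Proxomitron's actual algorithm: it ignores nesting (`$NEST`) and the wildcard syntax and uses plain substring tests. `apply_filter` and the sample page are hypothetical; the markers are taken from the "Bikeforums inline ads" filter above.

```python
def apply_filter(html, start, end, limit, needle, replace=""):
    """Replace blocks that begin with `start`, end with `end`, span at most
    `limit` characters and contain `needle`. Everything else is kept."""
    out = []
    pos = 0
    while True:
        s = html.find(start, pos)
        if s == -1:
            break
        e = html.find(end, s + len(start))
        if e == -1:
            break
        block_end = e + len(end)
        block = html[s:block_end]
        if len(block) <= limit and needle in block:
            out.append(html[pos:s])   # keep the text before the block
            out.append(replace)       # drop (or replace) the block itself
            pos = block_end
        else:
            # Not a match: keep the start marker and look for the next candidate.
            out.append(html[pos:s + len(start)])
            pos = s + len(start)
    out.append(html[pos:])            # keep whatever follows the last block
    return "".join(out)

START = "<!-- ads between 2nd and 3rd post -->"
END = "<!-- /ads between 2nd and 3rd post -->"

# Hypothetical page fragment with an ad posting between two real posts.
page = "<p>post 2</p>" + START + "Support our Sponsors: ..." + END + "<p>post 3</p>"

print(apply_filter(page, START, END, 4000, "Support our Sponsors"))
```

With a limit smaller than the ad block, the same call leaves the page unchanged, which is why a Limit set too low makes a filter silently stop working.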


There are now two pairs of filters for this forum. The username filter set doesn't have any "friendly fire" issues but postings from newly registered spammers will get through.

The new pair of filters scans titles for keywords. It's possible (but unlikely) that it will remove legitimate threads. The benefit is that the spammers change their wares less often than their usernames so it will catch more while requiring you to update the blocklist less often.

When the thread listing is generated, the full body of the message isn't yet available to the browser, so only the title and perhaps the first few lines of the message can be examined. It might be possible with some JavaScript to read the message and even go to the websites linked in the message and examine them in depth. If this can be accomplished, it would be possible to filter out even new usernames and postings with vague titles.

Code:
#List of spammer usernames for cyclingforums.com
#
#This file must have a pointer in default.cfg.
#
#Example:
#
#[Blocklists]
#List.Jerks = "..\Lists\Jerks List.txt"
#
#Last updated Mar. 1, 2008
1stcloth00
ann110
bigbrother11
clothes
crimi.check
darasure
Dehumidifier-Da
doorstep1
favorgoods
goodsaaa
greall
jamesbikerblog
jqyshoes
king001
lindaxu_88
Michaeljakel
micheel
mysneaker
nikeshoesstock9
oilsupply
online22
phonecity
phonelimited
shoes01
shoesnewnike
shoesperfect
teslim452
topsales
van11
voogooddeal
wally58


This is the blocklist for the keyword filters. Installation is the same as for the username filter. Note how it's a shorter list than the list of usernames. And it's less likely to change.

Code:
#List of spam keywords for cyclingforums.com
#
#This file must have a pointer in default.cfg.
#
#Example:
#
#[Blocklists]
#List.Cyclingforums_keywords = "..\Lists\Cyclingforums keywords.txt"
#
#Last updated Feb. 29, 2008
(hair&*nails)
(apple&*cider)
prostate
motorola
adidas
lithium
aspartate
ribose
gucci
handbags
iphone
ipod
jeans
jordan
nike
nokia
powerbook
prada
shoes
tiffany
USD
wholesale
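
As a rough illustration of how such a keyword list gets applied to thread titles, here is a Python sketch. It is not Proxomitron's matcher and rests on a few assumptions: lines starting with `#` are comments, matching is case-insensitive, and an entry like `(hair&*nails)` is read as "hair" with "nails" appearing somewhere after it. The function names and sample titles are invented for the example.

```python
import re

def load_keywords(text):
    """Parse blocklist text, skipping blank lines and '#' comment lines."""
    entries = []
    for line in text.splitlines():
        line = line.strip()
        if not line or line.startswith("#"):
            continue
        entries.append(line)
    return entries

def entry_to_regex(entry):
    """Turn one entry into a regex. '(a&*b)' is treated as 'a' followed,
    anywhere later, by 'b' (an assumed reading of the list syntax)."""
    m = re.fullmatch(r"\((.+)&\*(.+)\)", entry)
    if m:
        first, second = m.groups()
        return re.compile(re.escape(first) + ".*" + re.escape(second), re.IGNORECASE)
    return re.compile(re.escape(entry), re.IGNORECASE)

def is_spam(title, entries):
    """True if any blocklist entry matches somewhere in the title."""
    return any(entry_to_regex(e).search(title) for e in entries)

# A shortened, hypothetical copy of the keyword file.
sample = """#List of spam keywords
(hair&*nails)
adidas
wholesale
"""
entries = load_keywords(sample)

for title in ["Hair and nails growth pills", "ADIDAS sale", "300PT accuracy"]:
    print(title, "->", is_spam(title, entries))
```

The two-word entries are what keep the list short without blocking every thread that mentions only one of the words.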