#31 | |
|
Registered User
Join Date: Jul 2003
Posts: 334
|
Quote:
You may have some of the generic filters enabled. By default, a few header filters and maybe some web filters are enabled when you install Proxomitron. All the filters I created are for specific websites and shouldn't affect content on other sites. Some of the generic filters are useful because they block pop-ups or known ad servers, so I keep a few enabled. But I don't include them among my cycling filters, which are all custom. If you find some of the generic filters enabled, you can either disable them or add the problematic site to the bypass list.
|
|
|
|
|
|
#32 |
|
Registered User
Join Date: Jul 2003
Posts: 334
|
Latest complete filter set.
A few tweaks due to minor changes in some sites' HTML code. Some new filters for Google Groups and eBay, and one for intrathread spam on cyclingforums. The last one uses both blocklists: it filters a post if the username is on the username list or, failing that, if the message body contains 3 hits from the keyword list.
Code:
|
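The two-stage rule described in this post (username blocklist first, otherwise three keyword hits in the message body) can be modelled roughly as follows. This is only an illustrative Python sketch, not how Proxomitron evaluates the filter; the function name, the case-insensitive matching, and the sample list contents are all assumptions made for the example.

```python
from typing import Iterable


def should_hide_post(author: str, body: str,
                     blocked_users: Iterable[str],
                     keywords: Iterable[str],
                     min_hits: int = 3) -> bool:
    """Return True if a post would be hidden under the two-stage rule."""
    # Stage 1: the author is on the username blocklist.
    # (Case-insensitive comparison is an assumption of this sketch.)
    if author.lower() in {name.lower() for name in blocked_users}:
        return True
    # Stage 2: count keyword occurrences in the message body and
    # hide the post once the total reaches min_hits.
    text = body.lower()
    hits = sum(text.count(word.lower()) for word in keywords if word)
    return hits >= min_hits


# Hypothetical list contents, for illustration only.
blocked = ["SpamBot"]
words = ["replica", "watches"]
print(should_hide_post("spambot", "hello", blocked, words))
print(should_hide_post("rider", "replica watches and replica bags", blocked, words))
print(should_hide_post("rider", "nice replica of a vintage frame", blocked, words))
```

Requiring several keyword hits rather than one is what keeps an ordinary post that happens to use a listed word from being hidden.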
|
|
|
|
|
#33 |
|
Registered User
Join Date: Jul 2003
Posts: 334
|
The latest...
Code:
|
|
|
|
|
|
#34 |
|
Registered User
Join Date: Jul 2003
Posts: 334
|
Some minor tweaks.
Code:
|
|
|
|
|
|
#35 |
|
Registered User
Join Date: Mar 2006
Location: Sydney, Australia
Posts: 1,561
|
What did you do to cyclingnews.com? It's a bit of a mess now....
|
|
|
|
|
|
#36 | |
|
Registered User
Join Date: Jul 2003
Posts: 334
|
Quote:
I just tweaked two of the filters to catch ads that were getting through due to changes in the HTML. Make sure you're not running any older filters. It's been a while since I've had to modify a filter because it was messing up the appearance of the cyclingnews pages; in recent weeks the only problem has been ads sneaking through. Is there a specific page you're having problems with? I haven't noticed any problems with the main page, and I loaded several of the tech and news articles without issues. Usually, when something is amiss, I disable the filters one at a time to see which one is causing the problem. Then I start looking at the source code. Lastly, there's a small chance that websites serve different HTML to different systems (OS, browser, screen size, etc.), so filters that work for some people may not work for others.
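The "disable the filters one at a time" routine is done by hand in Proxomitron, but the logic can be sketched in a few lines of Python. Everything here is hypothetical: `find_culprits` and the `renders_ok` callback stand in for toggling a filter and eyeballing the page, and the filter names are just sample strings.

```python
from typing import Callable, List, Sequence


def find_culprits(filters: Sequence[str],
                  renders_ok: Callable[[List[str]], bool]) -> List[str]:
    """Return the filters whose removal alone makes the page render correctly."""
    culprits = []
    for name in filters:
        # Disable exactly one filter and keep all the others active.
        remaining = [f for f in filters if f != name]
        if renders_ok(remaining):
            culprits.append(name)
    return culprits


# Hypothetical check: pretend the page breaks whenever this one filter is active.
def page_renders_ok(active: List[str]) -> bool:
    return "Cyclingnews top banner" not in active


all_filters = ["Cyclingnews left ad column",
               "Cyclingnews top banner",
               "Cyclingnews Javascript"]
print(find_culprits(all_filters, page_renders_ok))
```

This only isolates a single misbehaving filter; if two filters break the page together, neither shows up, and the next step is still reading the page source.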
|
|
|
|
|
|
#37 |
|
Registered User
Join Date: Jul 2003
Posts: 334
|
While it seems the spam problem here is under control, I continue to use Proxomitron for other sites.
Code:
[Patterns]
Name = "Wunderground left ad column (Mar. 25, 2008)"
Active = TRUE
URL = "www.wunderground.com"
Bounds = "$NEST(<!-- ADPL-00032 -->, </form> </td> </tr> </table>)"
Limit = 13000
Match = "*"
Replace = "</table>"

Name = "Wunderground bottom ads (Mar. 26, 2008)"
Active = TRUE
URL = "www.wunderground.com"
Bounds = "$NEST(<div class="pageTop">,</div> <br> </div>)"
Limit = 2200
Match = "*"

Name = "Wunderground Tripadvisor ads (Mar. 25, 2008)"
Active = TRUE
URL = "www.wunderground.com"
Bounds = "$NEST(<table,</tr> </table>)"
Limit = 3500
Match = "*tripadvisor*"

Name = "Wunderground Statefarm ad (Mar. 25, 2008)"
Active = TRUE
URL = "www.wunderground.com"
Bounds = "$NEST(<div class,</center> </div>)"
Limit = 1000
Match = "*statefarm*"

Name = "Wunderground right ad (Mar. 26, 2008)"
Active = TRUE
URL = "www.wunderground.com"
Bounds = "$NEST(<td class="vaT taC", </table> </td> </tr> </table>)"
Limit = 1200
Match = "*"

Name = "Wunderground banner ad (Apr. 4, 2008)"
Active = TRUE
URL = "www.wunderground.com"
Bounds = "$NEST(<!-- Campaign,</iframe>)"
Limit = 1200
Match = "*noscript*|*ilayer*"

Name = "Wunderground test 2 (Apr. 4, 2008)"
Active = TRUE
URL = "www.wunderground.com"
Bounds = "$NEST(<script,</script>)"
Limit = 200
Match = "*google*"

Name = "Wunderground top of page (Apr. 6, 2008)"
Active = TRUE
URL = "www.wunderground.com"
Bounds = "$NEST(<div,</div>)"
Limit = 100
Match = "*top of page*"

Name = "Wunderground center ad (Mar. 26, 2008)"
Active = TRUE
URL = "www.wunderground.com"
Bounds = "$NEST(<table,</div>)"
Limit = 1100
Match = "*as5000*|*interclick*|*altfarm.mediaplex.com*|*GA_googleFillSlot*|*doubleclick*"

Name = "eBay (Mar. 7, 2008)"
Active = TRUE
URL = "([^/]++.|)ebay.com"
Bounds = "$NEST(<script,</script>)"
Limit = 600
Match = "*getcustomadconfig*"

Name = "Google Groups (Mar. 6, 2008)"
Active = TRUE
URL = "groups.google.com"
Limit = 256
Match = "adsSite.innerHTML = iframeHtml;"

Name = "Bikeforums right ad column (Mar. 1, 2008)"
Active = TRUE
URL = "([^/]++.|)bikeforums.net"
Limit = 7000
Match = "<!-- Right Column STOP -->*<!-- Right Column STOP -->"

Name = "Bikeforums floating ads (Mar. 1, 2008)"
Active = TRUE
URL = "([^/]++.|)bikeforums.net"
Bounds = "$NEST(<div class="floatedAd">,</div>)"
Limit = 1500
Match = "*doubleclick.net*"

Name = "Bikeforums ads (Mar. 2, 2008)"
Active = TRUE
URL = "([^/]++.|)bikeforums.net"
Bounds = "$NEST(<script,</script>)"
Limit = 600
Match = "*google-analytics*|*internetbrands*|*pagead*|*google_ad*|*doubleclick.net*|*www.bikeforums.net/ads*"

Name = "Bikeforums inline ads (Mar. 1, 2008)"
Active = TRUE
URL = "([^/]++.|)bikeforums.net"
Bounds = "$NEST(<!-- ads between 2nd and 3rd post -->,<!-- /ads between 2nd and 3rd post -->)"
Limit = 4000
Match = "*Support our Sponsors*"

Name = "Weightweenies ads (Apr. 7, 2008)"
Active = TRUE
URL = "([^/]++.|)weightweenies.starbike.com"
Bounds = "$NEST(<td,</td>)"
Limit = 1500
Match = "*<img src="/ad.gif"*"

Name = "Slowtwitch ads (Mar. 2, 2008)"
Active = TRUE
URL = "forum.slowtwitch.com"
Bounds = "$NEST(<!-- BEGIN ADVERTPRO CODE BLOCK -->|<!-- Start Quantcast tag -->, <!-- END ADVERTPRO CODE BLOCK -->|<!-- End Quantcast tag -->)"
Limit = 1000
Match = "*"

Name = "Slowtwitch disable Javascript (Mar. 1, 2008)"
Active = TRUE
URL = "forum.slowtwitch.com"
Bounds = "$NEST(<script, </script>)"
Limit = 1000
Match = "*javascript*"

Name = "Serotta Forum right column (Mar. 13, 2008)"
Active = TRUE
URL = "([^/]++.|)serotta.com"
Bounds = "$NEST(<td,</td>)"
Limit = 800
Match = "*http://www.serotta.com/forum/images/ads*"

Name = "Serotta Forum banner (Mar. 13, 2008)"
Active = TRUE
URL = "([^/]++.|)serotta.com"
Bounds = "$NEST(<div,</div>)"
Limit = 300
Match = "*www.serotta.com/Merchant2/merchant.mvc*"

Name = "Accuweather left ad column (Mar. 1, 2008)"
Active = TRUE
URL = "([^/]++.|)accuweather.com"
Limit = 16000
Match = "<div class=\"content_box_[0-9]+header\">"
        "*Weather News*"
        "<!-- End Monthly Features /-->"

Name = "Accuweather video removal (Mar. 1, 2008)"
Active = TRUE
URL = "([^/]++.|)accuweather.com"
Bounds = "$NEST(<div id="content_column_300right" >,<!-- this closing div closes the body div \(page_container\) /-->)"
Limit = 12000
Match = "*adcVideoComponentV2d*"

Name = "Accuweather banner (Mar. 1, 2008)"
Active = TRUE
URL = "([^/]++.|)accuweather.com"
Bounds = "$NEST(<div id=\"topAdContainer\">,</div>\s</div>\s</div>)"
Limit = 400
Match = "*<!-- Advertisment bar -->*"

Name = "Accuweather ad box 2 (Mar. 1, 2008)"
Active = TRUE
URL = "([^/]++.|)accuweather.com"
Limit = 1000
Match = "<div id=\"*UpSaleBox660\">*<a name="graph" id="graph"></a>"
Replace = "<a name="graph" id="graph"></a>"

Name = "Accuweather ad box 1 (Mar. 1, 2008)"
Active = TRUE
URL = "([^/]++.|)accuweather.com"
Bounds = "$NEST(<div id=\",</div>\s</div>)"
Limit = 400
Match = "*vortex.accuweather.com/adc2004/pub/images/promos/downloadCenterPromo660.gif*"

Name = "Accuweather Flash ads 2 (Mar. 1, 2008)"
Active = TRUE
URL = "([^/]++.|)accuweather.com"
Bounds = "$NEST(<!-- Quantcast Tag -->,<!-- End Quantcast tag -->)"
Limit = 600
Match = "*"

Name = "Accuweather Flash ads (Mar. 1, 2008)"
Active = TRUE
URL = "([^/]++.|)accuweather.com"
Bounds = "$NEST(<script,</script>)"
Limit = 2000
Match = "*Flash*|*ads*"

Name = "Pez ads (Mar. 2, 2008)"
Active = TRUE
URL = "([^/]++.|)pezcyclingnews.com"
Bounds = "$NEST(<tr,</tr>)"
Limit = 1200
Match = "*banners*|*www.pezcyclingnews.com:8080*|*google_ad*"

Name = "Pez banner (Mar. 1, 2008)"
Active = TRUE
URL = "([^/]++.|)pezcyclingnews.com"
Bounds = "$NEST(<object classid,</object>)"
Limit = 700
Match = "*flash*"

Name = "Velonews banner 3 (Mar. 1, 2008)"
Active = TRUE
URL = "([^/]++.|)velonews.com"
Limit = 2000
Match = "<div id = \"circ-leader\"><img src = '/themes/velonews/images/circ_leader.gif' usemap = '#circ-leader-map' alt = 'Magazine Image' /></div>"
Replace = "<div id = \"circ-leader\"></div>"

Name = "Velonews ads 2 (Mar. 13, 2008)"
Active = TRUE
URL = "([^/]++.|)velonews.com"
Bounds = "$NEST(<div id = "art_ad_1_container">,</div></div>)"
Limit = 1000
Match = "*Advertisement*"

Name = "Velonews Google ads (Mar. 13, 2008)"
Active = TRUE
URL = "([^/]++.|)velonews.com"
Limit = 500
Match = "<div id = "block-google-adsense">"
        "*"
        "src="http://pagead2.googlesyndication.com/pagead/show_ads.js"> </script> </div>"

Name = "Velonews Javascript (Mar. 13, 2008)"
Active = TRUE
URL = "([^/]++.|)velonews.com"
Bounds = "$NEST(<script,</script>)"
Limit = 200
Match = "*google-analytics*|*urchinTracker*"
Replace = "<div class="content"><div id = 'block-most-viewed'>"

Name = "Velonews ads (Mar. 1, 2008)"
Active = TRUE
URL = "([^/]++.|)velonews.com"
Bounds = "$NEST(<div id = ,</iframe></div>)"
Limit = 1000
Match = "*thruport.com*"

Name = "Velonews banner 1 (Mar. 1, 2008)"
Active = TRUE
URL = "([^/]++.|)velonews.com"
Bounds = "$NEST(<div id = "ad-leader">,></iframe></div>)"
Limit = 1000
Match = "*thruport.com*"

Name = "Velonews banner 2 (Mar. 1, 2008)"
Active = TRUE
URL = "([^/]++.|)velonews.com"
Limit = 1000
Match = "<map name = 'circ-leader-map' id = 'circ-leader-map'>*</map>"

Name = "Cyclingnews left ad column (Mar. 17, 2008)"
Active = TRUE
URL = "([^/]++.|)cyclingnews.com"
Limit = 8000
Match = "<table cellpadding=2"
        "*clickthru*"
        "</table>"

Name = "Cyclingnews right ad column (Mar. 1, 2008)"
Active = TRUE
URL = "([^/]++.|)cyclingnews.com"
Bounds = "$NEST(<!--RIGHT HAND AD COLUMN START -->,<!--RIGHT HAND AD COLUMN END -->)"
Limit = 12000
Match = "*"

Name = "Cyclingnews top banner (Apr. 4, 2008)"
Active = TRUE
URL = "([^/]++.|)cyclingnews.com"
Bounds = "$NEST(<td colspan="[0-9]",</(td|a)>)"
Limit = 400
Match = "*cnad[0-9]+.economicoutlook.net*"
        "cnad[0-9]+.economicoutlook.net*"

Name = "Cyclingnews top ads (Apr. 4, 2008)"
Active = TRUE
URL = "([^/]++.|)cyclingnews.com"
Limit = 800
Match = "<td align"
        "*cnad[0-9]+.economicoutlook.net*"
        "cnad[0-9]+.economicoutlook.net*"
        "</td>"

Name = "Cyclingnews left ad column 3 (Apr. 4, 2008)"
Active = TRUE
URL = "([^/]++.|)cyclingnews.com"
Bounds = "$NEST(<td width, </td>)"
Limit = 400
Match = "*cnad[0-9]+.economicoutlook.net*"

Name = "Cyclingnews Italia ad (Apr. 4, 2008)"
Active = TRUE
URL = "([^/]++.|)cyclingnews.com"
Bounds = "$NEST(<table, </table>)"
Limit = 2200
Match = "*sponsors/italia/2008/\w.swf*"

Name = "Cyclingnews left ad column 2 (Apr. 4, 2008)"
Active = TRUE
URL = "([^/]++.|)cyclingnews.com"
Limit = 1500
Match = "<td width"
        "*/sponsors/fsa/"
        "*/sponsors/italia/2008/*"
        "</a></div>"

Name = "Cyclingnews ads 1 (Apr. 6, 2008)"
Active = TRUE
URL = "([^/]++.|)cyclingnews.com"
Bounds = "$NEST(<a href,</a>)"
Limit = 400
Match = "*www.bikeradar.com*|*cnad[0-9]+.economicoutlook.net*"

Name = "Cyclingnews Javascript (Apr. 4, 2008)"
Active = TRUE
URL = "([^/]++.|)cyclingnews.com"
Bounds = "$NEST(<(no|)script,</(no|)script>)"
Limit = 2000
Match = "*cnews_home_mpu*|"
        "*js.revsci.net*|"
        "*rsi_segs*|"
        "*document.write*|"
        "*scstats.cyclingnews.com/cnhbx.js*|"
        "*ac_runactivecontent.js*|"
        "*ac_activex.js*|"
        "*DM_addEncToLoc*|"
        "*www.economicoutlook.net*|"
        "*secure-au.imrworldwide.com*|"
        "*disableclick*"

Name = "Cycling Forums ad filter (Mar. 5, 2008)"
Active = TRUE
URL = "([^/]++.|)cyclingforums.com"
Bounds = "$NEST(<script,</script>)"
Limit = 800
Match = "*google_ad*|*googlesyndication.com*"

Name = "Cycling Forums last post username filter (Mar. 1, 2008)"
Active = TRUE
URL = "([^/]++.|)cyclingforums.com"
Limit = 1000
Match = "<div style="clear:both">"
        "*\"member.php\?find=lastposter\&f=[0-9]+\">$LST(Jerks)</a>*"
        "</div>\s</div>\s</td>"
Replace = "</div></div></td>"

Name = "Cycling Forums username filter (Mar. 1, 2008)"
Active = TRUE
URL = "([^/]++.|)cyclingforums.com"
Bounds = "$NEST(<tr,</tr>)"
Limit = 2000
Match = "*<span style=\"cursor:pointer\" onclick=\"window.open\('member.php\?u=[0-9]+'\)">$LST(Jerks)</span>*"

Name = "Cycling Forums last post keyword filter (Mar. 1, 2008)"
Active = TRUE
URL = "([^/]++.|)cyclingforums.com"
Limit = 1000
Match = "<div style="clear:both">"
        "*$LST(Cyclingforums_keywords)*"
        "</div>\s</div>\s</td>"
Replace = "<div style="clear:both"></div></div></td>"

Name = "Cycling Forums keyword filter (Mar. 5, 2008)"
Active = TRUE
URL = "([^/]++.|)cyclingforums.com"
Bounds = "$NEST(<tr,</tr>)"
Limit = 2000
Match = "*$LST(Cyclingforums_keywords)*&(^*member.php\?find=lastposter\&f)&(^*<!-- message -->)"

Name = "Cycling Forums intrathread filter (Mar. 9, 2008)"
Active = TRUE
URL = "([^/]++.|)cyclingforums.com/t"
Bounds = "$NEST(<!-- post #[0-9]+ --> <!-- open content container -->,<!-- / close content container --> <!-- / post #[0-9]+ -->)"
Limit = 32767
Match = "*$LST(Jerks)*|"
        "*<!-- message -->"
        "*$LST(Cyclingforums_keywords)*$LST(Cyclingforums_keywords)*$LST(Cyclingforums_keywords)*"
        "<!-- / message -->*"
|
|
|
|
|
#38 |
|
Registered User
Join Date: Jul 2003
Posts: 334
|
I've created a webpage to host my cycling-related Proxomitron filters:
http://home.comcast.net/~ardillakil...on_cycling.html
It's a more permanent solution and is easier for me to update. I've been updating my filters but not posting them here in anticipation of creating a webpage for them. In addition, the spamming here seems to be under control. There's a bit of an arms race going on with some sites that change their HTML code periodically to make ad filtering difficult. My previous strategy was to make the filters very specific so they would rarely remove any desirable content. My solution now is to make the filters less specific and thus harder to evade, and to address the risk of removing desirable content by spending some time testing the filters.
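The trade-off between very specific and less specific filters can be illustrated with ordinary regular expressions. This is a Python sketch rather than Proxomitron syntax, and the HTML snippets are invented for the example; only the `cnad[0-9]+.economicoutlook.net` host pattern is taken from the filters posted earlier in the thread.

```python
import re

# Specific: tied to the exact markup of one version of the page.
SPECIFIC = re.compile(
    r'<td colspan="2"><a href="http://cnad1\.economicoutlook\.net/[^"]*">[^<]*</a></td>'
)

# Less specific: any single <td> cell that mentions the ad host anywhere inside it.
GENERIC = re.compile(
    r'<td\b[^>]*>(?:(?!</td>).)*cnad[0-9]+\.economicoutlook\.net(?:(?!</td>).)*</td>',
    re.S,
)

# Hypothetical page fragments: the ad before and after a markup change,
# plus a cell with real content that must survive filtering.
old_ad = '<td colspan="2"><a href="http://cnad1.economicoutlook.net/click">ad</a></td>'
new_ad = '<td colspan="3" class="x"><a href="http://cnad2.economicoutlook.net/click">ad</a></td>'
content = '<td colspan="2"><a href="/news/race.html">Race report</a></td>'

for label, html in [("old ad", old_ad), ("new ad", new_ad), ("content", content)]:
    print(label,
          "| specific removes:", SPECIFIC.sub("", html) == "",
          "| generic removes:", GENERIC.sub("", html) == "")
```

The specific pattern stops working as soon as the site changes an attribute, while the looser one keeps matching. The price is a larger chance of removing real content, which is why the looser filters need the extra testing mentioned above.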
|
|
|