A database of open-source HTTP proxies written in Python: WebCleaner
http://webcleaner.sourceforge.net/
Primary features: a range of filtering and blocking techniques, HTML parsing, a JavaScript engine (SpiderMonkey), recognition of known browser attacks, and an XML DTD for describing filters and rewrite rules.
Features (from the project page):
- remove unwanted HTML (adverts, flash, etc.)
- popup blocker
- disable animated GIFs
- filter images by size, remove banner adverts
- compress documents on-the-fly (with gzip)
- reduce images to low-bandwidth JPEGs
- remove/add/modify arbitrary HTTP headers
- configurable over web interface
- usage of SquidGuard blacklists
- antivirus filter module
- detection and correction of known HTML security flaws
- Basic, Digest and (untested) NTLM proxy authentication support
- per-host access control
- HTTP/1.1 support (persistent connections, pipelining)
- HTTPS proxy CONNECT and optional SSL gateway support
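
Two of the listed features, header rewriting and on-the-fly gzip compression, can be illustrated with a short sketch. This is not WebCleaner's code or API: the function names `rewrite_headers` and `compress_response` are hypothetical, headers are modeled as a plain dict, and the `Accept-Encoding` check is deliberately simplified. It only shows the general idea of how a filtering proxy might transform a response before passing it on.

```python
import gzip


def rewrite_headers(headers, remove=(), add=None):
    """Return a new header dict with the names in `remove` dropped
    (case-insensitive) and the entries in `add` set or overwritten.

    Hypothetical helper for illustration; not part of WebCleaner.
    """
    drop = {name.lower() for name in remove}
    result = {k: v for k, v in headers.items() if k.lower() not in drop}
    result.update(add or {})
    return result


def compress_response(headers, body, accept_encoding=""):
    """Gzip `body` if the client advertises gzip support and the
    response is not already encoded; otherwise return it unchanged.

    Simplification: a substring check on Accept-Encoding, which
    ignores quality values such as "gzip;q=0".
    """
    already_encoded = any(k.lower() == "content-encoding" for k in headers)
    if "gzip" not in accept_encoding.lower() or already_encoded:
        return headers, body
    compressed = gzip.compress(body)
    # Content-Length must describe the compressed payload.
    new_headers = rewrite_headers(
        headers,
        remove=["Content-Length"],
        add={
            "Content-Encoding": "gzip",
            "Content-Length": str(len(compressed)),
        },
    )
    return new_headers, compressed
```

A proxy built this way would call `rewrite_headers` to strip or inject headers and then `compress_response` with the client's `Accept-Encoding` value just before sending the response; clients that do not advertise gzip receive the original body untouched.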