Changelog¶
0.2.0 - 6 September 2025¶
- Initial release, mostly a port of
scrapelibfunctionality.
scrapelib¶
The original version of this library is a port of scrapelib (2.4.1).
Changes from this version were to:
- use
httpxinstead ofrequests - dropped quite a few unnecessary features that were mainly in
scrapelibfor backwards-compatability reasons. - use a composable interface instead of the inheritance-based one from
scrapelib, aiming at making future enhancements/porting easier.
This library is a partial rewrite of scrapelib. Thanks to all of scrapelib's original contributors and users.
scrapelib originally wrapped urllib2, eventually migrating to requests.
There are a few things that scrapelib did that this doesn't:
- support FTP requests via HTTP-like API
- extend the client with a
urlretrievefunction - provide helpers for working with headers, timeouts, and custom ciphers
The first two are possible but didn't seem necessary at the moment.
The latter was very requests-specific, and so hasn't been replicated here.