CVE-2021-41125

Name	CVE-2021-41125
Description	Scrapy is a high-level web crawling and scraping framework for Python. If you use `HttpAuthMiddleware` (i.e. the `http_user` and `http_pass` spider attributes) for HTTP authentication, all requests will expose your credentials to the request target. This includes requests generated by Scrapy components, such as `robots.txt` requests sent by Scrapy when the `ROBOTSTXT_OBEY` setting is set to `True`, or as requests reached through redirects. Upgrade to Scrapy 2.5.1 and use the new `http_auth_domain` spider attribute to control which domains are allowed to receive the configured HTTP authentication credentials. If you are using Scrapy 1.8 or a lower version, and upgrading to Scrapy 2.5.1 is not an option, you may upgrade to Scrapy 1.8.1 instead. If you cannot upgrade, set your HTTP authentication credentials on a per-request basis, using for example the `w3lib.http.basic_auth_header` function to convert your credentials into a value that you can assign to the `Authorization` header of your request, instead of defining your credentials globally using `HttpAuthMiddleware`.
Source	CVE (at NVD; CERT, ENISA, LWN, oss-sec, fulldisc, Debian ELTS, Red Hat, Ubuntu, Gentoo, SUSE bugzilla/CVE, GitHub advisories/code/issues, web search, more)
References	DLA-2950-1

Vulnerable and fixed packages

The table below lists information on source packages.

Source Package	Release	Version	Status
python-scrapy (PTS)	bullseye	2.4.1-2+deb11u1	fixed
	bookworm	2.8.0-2	fixed
	trixie	2.12.0-2	fixed
	forky, sid	2.16.0-1	fixed

The information below is based on the following data on fixed versions.

Package	Type	Release	Fixed Version	Origin
python-scrapy	source	stretch	1.0.3-2+deb9u1	DLA-2950-1
python-scrapy	source	buster	1.5.1-1+deb10u1
python-scrapy	source	bullseye	2.4.1-2+deb11u1
python-scrapy	source	(unstable)	2.5.1-1

Notes

https://github.com/scrapy/scrapy/security/advisories/GHSA-jwqp-28gf-p498
Fixed by: https://github.com/scrapy/scrapy/commit/b01d69a1bf48060daec8f751368622352d8b85a6 (1.8)