nixpkgs/pkgs/development/python-modules/scrapy/default.nix

{ stdenv, buildPythonPackage, fetchurl, glibcLocales, mock, pytest, botocore,
  testfixtures, pillow, six, twisted, w3lib, lxml, queuelib, pyopenssl,
  service-identity, parsel, pydispatcher, cssselect, lib }:
buildPythonPackage rec {
    version = "1.5.0";
    pname = "Scrapy";
    name = "${pname}-${version}";

    buildInputs = [ glibcLocales mock pytest botocore testfixtures pillow ];
    propagatedBuildInputs = [
      six twisted w3lib lxml cssselect queuelib pyopenssl service-identity parsel pydispatcher
    ];

    # Scrapy is usually installed via pip where copying all
    # permissions makes sense. In Nix the files copied are owned by
    # root and readonly. As a consequence scrapy can't edit the
    # project templates.
    patches = [ ./permissions-fix.patch ];

    LC_ALL="en_US.UTF-8";

    checkPhase = ''
      py.test --ignore=tests/test_linkextractors_deprecated.py --ignore=tests/test_proxy_connect.py ${lib.optionalString stdenv.isDarwin "--ignore=tests/test_utils_iterators.py"}
      # The ignored tests require mitmproxy, which depends on protobuf, but it's disabled on Python3
      # Ignore iteration test, because lxml can't find encodings on darwin https://bugs.launchpad.net/lxml/+bug/707396
    '';

    src = fetchurl {
      url = "mirror://pypi/S/Scrapy/${name}.tar.gz";
      sha256 = "31a0bf05d43198afaf3acfb9b4fb0c09c1d7d7ff641e58c66e36117f26c4b755";
    };

    meta = with lib; {
      description = "A fast high-level web crawling and web scraping framework, used to crawl websites and extract structured data from their pages";
      homepage = http://scrapy.org/;
      license = licenses.bsd3;
      maintainers = with maintainers; [ drewkett ];
      platforms = platforms.unix;
    };
}
pythonPackages.scrapy: enable darwin build 2018-05-06 05:28:36 +01:00			`{ stdenv, buildPythonPackage, fetchurl, glibcLocales, mock, pytest, botocore,`
Move scrapy to its own module and add patch to fix broken permission code. Scrapy is usually installed via pip where copying all permissions makes sense. In Nix the files copied are owned by root and readonly. As a consequence scrapy can't edit the project templates so scrapy startproject fails. 2017-02-15 22:01:38 +00:00			`testfixtures, pillow, six, twisted, w3lib, lxml, queuelib, pyopenssl,`
			`service-identity, parsel, pydispatcher, cssselect, lib }:`
			`buildPythonPackage rec {`
python: Scrapy: 1.4.0 -> 1.5.0 2017-12-30 11:27:04 +00:00			`version = "1.5.0";`
Python: add pname attributes to libraries so that we can use the update script. 2017-05-27 10:25:35 +01:00			`pname = "Scrapy";`
			`name = "${pname}-${version}";`
Move scrapy to its own module and add patch to fix broken permission code. Scrapy is usually installed via pip where copying all permissions makes sense. In Nix the files copied are owned by root and readonly. As a consequence scrapy can't edit the project templates so scrapy startproject fails. 2017-02-15 22:01:38 +00:00
			`buildInputs = [ glibcLocales mock pytest botocore testfixtures pillow ];`
			`propagatedBuildInputs = [`
			`six twisted w3lib lxml cssselect queuelib pyopenssl service-identity parsel pydispatcher`
			`];`

			`# Scrapy is usually installed via pip where copying all`
			`# permissions makes sense. In Nix the files copied are owned by`
			`# root and readonly. As a consequence scrapy can't edit the`
			`# project templates.`
			`patches = [ ./permissions-fix.patch ];`

			`LC_ALL="en_US.UTF-8";`

			`checkPhase = ''`
pythonPackages.scrapy: enable darwin build 2018-05-06 05:28:36 +01:00			`py.test --ignore=tests/test_linkextractors_deprecated.py --ignore=tests/test_proxy_connect.py ${lib.optionalString stdenv.isDarwin "--ignore=tests/test_utils_iterators.py"}`
Move scrapy to its own module and add patch to fix broken permission code. Scrapy is usually installed via pip where copying all permissions makes sense. In Nix the files copied are owned by root and readonly. As a consequence scrapy can't edit the project templates so scrapy startproject fails. 2017-02-15 22:01:38 +00:00			`# The ignored tests require mitmproxy, which depends on protobuf, but it's disabled on Python3`
pythonPackages.scrapy: enable darwin build 2018-05-06 05:28:36 +01:00			`# Ignore iteration test, because lxml can't find encodings on darwin https://bugs.launchpad.net/lxml/+bug/707396`
Move scrapy to its own module and add patch to fix broken permission code. Scrapy is usually installed via pip where copying all permissions makes sense. In Nix the files copied are owned by root and readonly. As a consequence scrapy can't edit the project templates so scrapy startproject fails. 2017-02-15 22:01:38 +00:00			`'';`

			`src = fetchurl {`
			`url = "mirror://pypi/S/Scrapy/${name}.tar.gz";`
python: Scrapy: 1.4.0 -> 1.5.0 2017-12-30 11:27:04 +00:00			`sha256 = "31a0bf05d43198afaf3acfb9b4fb0c09c1d7d7ff641e58c66e36117f26c4b755";`
Move scrapy to its own module and add patch to fix broken permission code. Scrapy is usually installed via pip where copying all permissions makes sense. In Nix the files copied are owned by root and readonly. As a consequence scrapy can't edit the project templates so scrapy startproject fails. 2017-02-15 22:01:38 +00:00			`};`

			`meta = with lib; {`
			`description = "A fast high-level web crawling and web scraping framework, used to crawl websites and extract structured data from their pages";`
pkgs: refactor needless quoting of homepage meta attribute (#27809) * pkgs: refactor needless quoting of homepage meta attribute A lot of packages are needlessly quoting the homepage meta attribute (about 1400, 22%), this commit refactors all of those instances. * pkgs: Fixing some links that were wrongfully unquoted in the previous commit * Fixed some instances 2017-08-01 21:03:30 +01:00			`homepage = http://scrapy.org/;`
Move scrapy to its own module and add patch to fix broken permission code. Scrapy is usually installed via pip where copying all permissions makes sense. In Nix the files copied are owned by root and readonly. As a consequence scrapy can't edit the project templates so scrapy startproject fails. 2017-02-15 22:01:38 +00:00			`license = licenses.bsd3;`
			`maintainers = with maintainers; [ drewkett ];`
pythonPackages.scrapy: enable darwin build 2018-05-06 05:28:36 +01:00			`platforms = platforms.unix;`
Move scrapy to its own module and add patch to fix broken permission code. Scrapy is usually installed via pip where copying all permissions makes sense. In Nix the files copied are owned by root and readonly. As a consequence scrapy can't edit the project templates so scrapy startproject fails. 2017-02-15 22:01:38 +00:00			`};`
			`}`