nixpkgs/pkgs/development/python-modules/scrapy/default.nix

{ stdenv
, buildPythonPackage
, isPy27
, fetchPypi
, glibcLocales
, pytest
, testfixtures
, pillow
, twisted
, cryptography
, w3lib
, lxml
, queuelib
, pyopenssl
, service-identity
, parsel
, pydispatcher
, cssselect
, zope_interface
, protego
, lib
, jmespath
, sybil
, pytest-twisted
, botocore
}:

buildPythonPackage rec {
  version = "2.1.0";
  pname = "Scrapy";

  disabled = isPy27;

  checkInputs = [
    glibcLocales
    jmespath
    pytest
    sybil
    testfixtures
    pillow
    pytest-twisted
    botocore
  ];

  propagatedBuildInputs = [
    twisted
    cryptography
    cssselect
    lxml
    parsel
    pydispatcher
    pyopenssl
    queuelib
    service-identity
    w3lib
    zope_interface
    protego
  ];

  patches = [
    # Scrapy is usually installed via pip, where copying all
    # permissions makes sense. In Nix the copied files are owned by
    # root and read-only, so scrapy can't edit the project templates
    # and `scrapy startproject` fails.
    ./permissions-fix.patch
  ];

  LC_ALL = "en_US.UTF-8";

  # Disable the doctest plugin (enabled in the shipped pytest.ini) because it causes pytest to hang.
  # Ignore the proxy tests because they require mitmproxy.
  # Deselect test_retry_dns_error because it tries to resolve an invalid DNS name and fails with "Reactor was unclean".
  # Deselect the XML encoding test on Darwin because lxml can't find encodings: https://bugs.launchpad.net/lxml/+bug/707396
  checkPhase = ''
    substituteInPlace pytest.ini --replace "--doctest-modules" ""
    pytest \
      --ignore=tests/test_linkextractors_deprecated.py \
      --ignore=tests/test_proxy_connect.py \
      --deselect tests/test_crawl.py::CrawlTestCase::test_retry_dns_error \
      ${lib.optionalString stdenv.isDarwin "--deselect tests/test_utils_iterators.py::LxmlXmliterTestCase::test_xmliter_encoding"}
  '';

  src = fetchPypi {
    inherit pname version;
    sha256 = "640aea0f9be9b055f5cfec5ab78ee88bb37a5be3809b138329bd2af51392ec7f";
  };

  postInstall = ''
    install -m 644 -D extras/scrapy.1 $out/share/man/man1/scrapy.1
    install -m 644 -D extras/scrapy_bash_completion $out/share/bash-completion/completions/scrapy
    install -m 644 -D extras/scrapy_zsh_completion $out/share/zsh/site-functions/_scrapy
  '';

  meta = with lib; {
    description = "A fast high-level web crawling and web scraping framework, used to crawl websites and extract structured data from their pages";
    homepage = "https://scrapy.org/";
    license = licenses.bsd3;
    maintainers = with maintainers; [ drewkett marsam ];
    platforms = platforms.unix;
  };
}
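
# Usage sketch (not part of the packaged expression): assuming this
# derivation is exposed as the `scrapy` attribute of the Python package
# set, a development shell could pull it in roughly as follows. The file
# name `shell.nix` and the `pkgs` argument are illustrative assumptions.
#
#   { pkgs ? import <nixpkgs> { } }:
#   pkgs.mkShell {
#     buildInputs = [ (pkgs.python3.withPackages (ps: [ ps.scrapy ])) ];
#   }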

History (commits recorded in the blame view, oldest first):

2017-02-15 22:01:38 +00:00  Move scrapy to its own module and add patch to fix broken permission code. Scrapy is usually installed via pip where copying all permissions makes sense. In Nix the files copied are owned by root and readonly. As a consequence scrapy can't edit the project templates so scrapy startproject fails.
2018-06-23 14:27:58 +01:00  pythonPackages: remove `name` attribute. The `buildPython*` function computes name from `pname` and `version`. This change removes `name` attribute from all expressions in `pkgs/development/python-modules`. While at it, some other minor changes were made as well, such as replacing `fetchurl` calls with `fetchPypi`.
2018-12-23 03:21:54 +00:00  pythonPackages.scrapy: fix build on Python 3.7
2019-02-13 03:47:25 +00:00  pythonPackages.scrapy: 1.5.1 -> 1.6.0
2019-07-19 01:57:42 +01:00  pythonPackages.scrapy: 1.6.0 -> 1.7.1
2020-03-03 09:22:00 +00:00  python3Packages.scrapy: 1.8.0 -> 2.0.1
2020-04-24 12:00:00 +01:00  python37Packages.scrapy: 2.0.1 -> 2.1.0