Commit Graph

2 Commits

Author SHA1 Message Date
Daniël de Kok
27d0c5a07e sentencepiece: split into multiple outputs, optional gperftools
I am using sentencepiece in a downstream application where I want to
minimize the resulting closures. This commit makes changes to make
sentencepiece a leaner dependency:

- Split the outputs, so that the binaries/headers do not end up in the
  transitive closure in a library dependency.

- Add the `withGPerfTools` option, which is enabled by default, to
  make it possible to disable the gperftools dependency. According to
  the sentencepiece README, this dependency gives a 10-40% performance
  improvement. But in many cases this is overshadowed by the neural
  networks that use piece identifiers as input anyway.
2020-03-14 15:40:29 -07:00
Pash Shocky
0b68e445e1 pythonPackages.sentencepiece: init at 0.1.84 2019-12-17 16:46:52 -08:00