Primarily a variety of fixes and small enhancements. The only notable
change is stripping away testing/support of git-annex direct mode.
- do not depend on a release candidate of the DataLad, since PIP then opens the
way to a RCs for any later releases to be installed
- `simple_with_archives`
- issue warning if incoming_pipeline has Annexificator but no `annex` is given
- `crcns`
- skip (but warn if relevant) records without xml
- do not crash while saving updated crawler's URL db to the file which is annexed.