Tacl

Latest version: v5.3.0

Safety actively analyzes 628918 Python packages for vulnerabilities to keep your Python projects secure.

Scan your dependencies

Page 2 of 5

5.0.3

* Removed characters that are significant within regular expressions
when preparing CBETA TEI texts. These are used within regular
expression contexts and so are not safe.

5.0.2

* Rebuild wheel from clean.

* No source changes.

5.0.1

* Fixed bug when preparing a CBETA TEI text that includes a tei:g
within anchors demarcating an apparatus criticus entry, such that
the tei:g was included after the tei:app.

* Added --version to tacl command to print out the version number.

* Bumped minimum Python version to 3.8.

* Added split of T0328 into verse and prose sub-works.

* Added raising an error when building a database from a catalogue
when a referenced work is not in the corpus.

* Removed all instances of characters that are illegal in Windows
filenames when preparing CBETA TEI texts.

5.0.0

* Updated Corpus API and its use within DataStore to allow for
user-provided Text subclasses when getting witnesses.

* Changed some methods of Text and WitnessText to be properties:
get_content is now content, get_names is now siglum and work.

* Updated the process for generating a corpus from CBETA TEI XML
files. Certain parts of texts are extracted into new texts based
on the markup-defined properties (such as containing div/type).

* Added the tacl split command to split texts based on user-supplied
configuration files.

* Added the tacl join-works command to join two or more prepared TEI
XML works together into a new work.

* Added the tacl query command to run supplied SQL commands on the
data store with supplied parameters.

* Added mulu title to prepared filename for extracted div where one
exists.

* Improved updating of the database when adding n-grams: all
witnesses that no longer exist in the corpus will be deleted.

* Added handling of some errors in order to provide useful error
messages.

* Reimplemented tacl results' extend operation to be faster and use
less RAM. Its determination of whether the initial results are
intersect results or not (and thus whether the results are
automatically run through reciprocal remove) has become
stricter. Only those results with more than one label and where
every n-gram occurs in every label now count as intersect results.

* Added the tacl normalise command, to normalise a corpus according
to a user-supplied mapping. This is an as yet unoptimised feature.

* Added options to tacl results to denormalise a set of
results. This is an as yet unoptimised feature.

4.2.0

* Added option to restrict Results prune requirements to a labelled
subset of results.

* Added a lifetime report, showing details about in which labelled
corpora n-grams occurred in.

* Moved the jitc command to the tacl-extra repository.

* Renamed the tacl.command package to tacl.cli.

4.1.0

* Added remove_label method to Catalogue.

* Added method to relable a catalogue.

* Added --relabel option to tacl results, to relabel results
according to a su pplied catalogue.

* Added stripping of cb:mulu contents from prepared TEI.

* Modified Results to also accept a pandas DataFrame as well as a
path or buffer of results.

* Added get_works_by_label method to Catalogue.

* Modified tacl search to allow both for multiple n-gram files and
no n-gram f ile, in which case all n-grams are returned.

* Added get_works method to Corpus to return a list of work names.

Page 2 of 5

© 2024 Safety CLI Cybersecurity Inc. All Rights Reserved.