provenance
Provenance helpers for reproducible CLTK annotations.
ProvenanceRecord
Bases: BaseModel
Compact record describing how annotations were produced.
canonical_json
sha256_hex
make_provenance_id
normalize_config
Normalize backend config into a JSON-friendly, redacted dict.
Source code in cltk/core/provenance.py
extract_doc_config
Pull a safe config snapshot from a Doc-like object.
Source code in cltk/core/provenance.py
build_provenance_record
build_provenance_record(
*,
language: Optional[str] = None,
backend: Optional[str] = None,
process: Optional[str] = None,
model: Optional[str] = None,
provider: Optional[str] = None,
prompt_version: Optional[str] = None,
prompt_text: Optional[str] = None,
prompt_digest: Optional[str] = None,
config: Optional[dict[str, Any]] = None,
config_digest: Optional[str] = None,
cltk_version: Optional[str] = None,
python_version: Optional[str] = None,
platform: Optional[str] = None,
notes: Optional[dict[str, Any]] = None,
created_at: Optional[datetime] = None,
record_id: Optional[str] = None
) -> ProvenanceRecord
Build a ProvenanceRecord, computing digests when needed.
Source code in cltk/core/provenance.py
add_provenance_record
Add a provenance record to a Doc-like object and return its id.
Source code in cltk/core/provenance.py
get_token_provenance
get_token_provenance(
word: Any, field: str, doc: Optional[Any] = None
) -> Optional[ProvenanceRecord]
Return the provenance record for a word field, if resolvable.
Source code in cltk/core/provenance.py
get_sentence_provenance
get_sentence_provenance(
sentence: Any, field: str, doc: Optional[Any] = None
) -> Optional[ProvenanceRecord]
Return the provenance record for a sentence field, if resolvable.