Add "type" identifier to error rate serialization by thequilo · Pull Request #120 · fgnt/meeteval

thequilo · 2025-09-18T13:20:01Z

Adds a key "type" to serialized error rates. This fixes ambiguous cases where saved results from DI-cpWER and ORC-WER could not be discriminated, and makes identification easier from the files alone.

In [1]: from meeteval.wer.wer.orc import OrcErrorRate

In [2]: from meeteval.wer.wer.error_rate import ErrorRate

In [3]: er = OrcErrorRate.zero()

In [4]: serialized = er.asdict()

In [5]: serialized
Out[5]:
{'error_rate': 0,
 'errors': 0,
 'length': 0,
 'insertions': 0,
 'deletions': 0,
 'substitutions': 0,
 'reference_self_overlap': None,
 'hypothesis_self_overlap': None,
 'assignment': (),
 'type': 'orc-error-rate'}

In [6]: ErrorRate.from_dict(serialized)
Out[6]: OrcErrorRate(error_rate=0, errors=0, length=0, insertions=0, deletions=0, substitutions=0, assignment=())

The old heuristics for detecting the error rate type are kept for backwards compatibility.

Also fixes a few minor bugs.

Add "type" identifier to error rate serialization

5733b38

thequilo force-pushed the wer_identifier branch from 54a3b4a to 5733b38 Compare September 18, 2025 13:26

Fix flake8 and zero error rate

323ac47

thequilo merged commit 31419d0 into main Sep 19, 2025
8 checks passed

thequilo deleted the wer_identifier branch September 19, 2025 08:37

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add "type" identifier to error rate serialization#120

Add "type" identifier to error rate serialization#120
thequilo merged 2 commits intomainfrom
wer_identifier

thequilo commented Sep 18, 2025

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Conversation

thequilo commented Sep 18, 2025

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant