feat: support word-level timestamps for faster-whisper #9621
Open
eglia wants to merge 1 commit into mudler:master from
Signed-off-by: Andreas Egli <github@kharan.ch>
Description
This PR extends the transcribe endpoints to also include word-level timestamps. Currently this is only implemented for the faster-whisper backend, but it could be added to other backends that support it. I tried to add it for whisper.cpp, but apparently there is an outstanding issue there.
The individual words are returned twice: once as part of each segment, and once as a top-level element. The top-level element is there to comply with the OpenAI spec; the per-segment words are kept so that no potentially useful information is lost.
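To illustrate the duplicated layout, here is a minimal Python sketch that builds such a payload. It is not the code from this PR: `build_response` is a hypothetical helper, the input objects are stand-ins that only mimic the `start`, `end`, `text` and `words` attributes a faster-whisper segment exposes, and the output keys (`word`, `start`, `end`) are assumed to follow the OpenAI `verbose_json` shape.

```python
from types import SimpleNamespace


def build_response(segments):
    """Return a payload with words nested per segment and flattened at the top level.

    `segments` is any iterable of objects with `start`, `end`, `text` and
    `words` attributes (hypothetical stand-ins for backend segments).
    """
    out_segments = []
    all_words = []
    for idx, seg in enumerate(segments):
        # A segment may carry no word list at all; treat that as empty.
        words = [
            {"word": w.word, "start": w.start, "end": w.end}
            for w in (seg.words or [])
        ]
        out_segments.append(
            {"id": idx, "start": seg.start, "end": seg.end,
             "text": seg.text, "words": words}
        )
        # The same words are also collected into the top-level list.
        all_words.extend(words)
    return {
        "text": "".join(s["text"] for s in out_segments).strip(),
        "segments": out_segments,
        "words": all_words,
    }


# Stand-in data; all times are in seconds.
demo_segments = [
    SimpleNamespace(start=0.0, end=1.2, text=" Hello world", words=[
        SimpleNamespace(word=" Hello", start=0.0, end=0.5),
        SimpleNamespace(word=" world", start=0.6, end=1.2),
    ]),
    SimpleNamespace(start=1.2, end=2.0, text=" again", words=None),
]
response = build_response(demo_segments)
```

With this layout, a client that only follows the OpenAI spec can read `response["words"]`, while a client that wants to know which segment a word belongs to can read `response["segments"][i]["words"]`.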
Additionally, the endpoint now returns timestamps in seconds to comply with the OpenAI spec.
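For clients that need to adapt, the unit change can be sketched as below. The description does not say which unit was returned before, so this example assumes integer nanoseconds purely for illustration; `ns_to_seconds` and `segment_in_seconds` are hypothetical helpers, not part of this PR.

```python
NANOSECONDS_PER_SECOND = 1_000_000_000


def ns_to_seconds(ns: int) -> float:
    """Convert an integer nanosecond timestamp (assumed old unit) to float seconds."""
    return ns / NANOSECONDS_PER_SECOND


def segment_in_seconds(segment: dict) -> dict:
    """Return a copy of `segment` with `start` and `end` expressed in seconds."""
    converted = dict(segment)  # shallow copy, the input is left untouched
    converted["start"] = ns_to_seconds(segment["start"])
    converted["end"] = ns_to_seconds(segment["end"])
    return converted


# Hypothetical segment using the assumed nanosecond unit.
legacy = {"id": 0, "start": 1_500_000_000, "end": 3_250_000_000, "text": "hello"}
converted = segment_in_seconds(legacy)
```

Note that clients which parsed the previous values as integers would need to accept floating-point seconds after this change.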
This PR fixes #9306
Notes for Reviewers
Signed commits