Surface disjoint inference / platform credits in conversation usage #11441
| Original file line number | Diff line number | Diff line change | ||||||
|---|---|---|---|---|---|---|---|---|
|
|
@@ -790,7 +790,12 @@ type ConversationUsageMetadata { | |||||||
| """Token usage using a user's API key, keyed by model.""" | ||||||||
| byokTokenUsage: [TokenUsage!]! | ||||||||
| contextWindowUsage: Float! | ||||||||
|
|
||||||||
| """The total number of inference credits spent so far in the conversation""" | ||||||||
| creditsSpent: Float! | ||||||||
|
|
||||||||
| """The total number of platform credits spent so far in the conversation.""" | ||||||||
| platformCreditsSpent: Float! | ||||||||
|
Comment on lines +793 to +798
Contributor
I asked AM, and we may be missing platformCreditsSpent in a few more places? warp/app/src/workspaces/gql_convert.rs, line 246 in dc408d2
I didn't put a ton of thought into which spots should be displaying credits spent, total credits spent, but do these seem valid?
Contributor
Author
Yep, you're right, fixed up those places + 1 more |
||||||||
| summarized: Boolean! | ||||||||
|
|
||||||||
| """ | ||||||||
|
|
@@ -1379,6 +1384,7 @@ union FreeAvailableModelsResult = FreeAvailableModelsOutput | UserFacingError | |||||||
|
|
||||||||
| type GcpProviderConfig { | ||||||||
| projectNumber: String! | ||||||||
| serviceAccountEmail: String | ||||||||
| workloadIdentityFederationPoolId: String! | ||||||||
| workloadIdentityFederationProviderId: String! | ||||||||
| } | ||||||||
|
|
@@ -4365,4 +4371,4 @@ enum WriteToPtyAutonomyValue { | |||||||
| ALWAYS_ASK | ||||||||
| ASK_ON_FIRST_WRITE | ||||||||
| RESPECT_USER_SETTING | ||||||||
| } | ||||||||
| } | ||||||||
|
Comment on lines -4368 to +4374
Contributor
nit: Unintended removal of the newline?
Contributor
Author
This is what the generation produced. If there was a newline at EOF before, it's probably because someone hand-edited the file instead of regenerating it. |
||||||||
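The first review thread above is about a new schema field (`platformCreditsSpent`) not reaching every conversion site. One way to make that kind of omission harder is sketched below. This is a minimal illustration, not the code in `gql_convert.rs`: the struct names `GqlConversationUsage` and `AppConversationUsage` are hypothetical, and only the field names are taken from the schema diff.

```rust
/// Hypothetical mirror of the `ConversationUsageMetadata` fields shown in the diff.
/// (Assumption: the real generated type has more fields and a different name.)
#[derive(Debug, Clone, PartialEq)]
struct GqlConversationUsage {
    context_window_usage: f64,
    credits_spent: f64,
    platform_credits_spent: f64,
    summarized: bool,
}

/// Hypothetical app-side model that the GraphQL type is converted into.
#[derive(Debug, Clone, PartialEq)]
struct AppConversationUsage {
    context_window_usage: f64,
    credits_spent: f64,
    platform_credits_spent: f64,
    summarized: bool,
}

impl From<GqlConversationUsage> for AppConversationUsage {
    fn from(gql: GqlConversationUsage) -> Self {
        // Destructure without `..` so that adding a field to the GraphQL
        // struct fails to compile here until the conversion handles it.
        let GqlConversationUsage {
            context_window_usage,
            credits_spent,
            platform_credits_spent,
            summarized,
        } = gql;

        Self {
            context_window_usage,
            credits_spent,
            platform_credits_spent,
            summarized,
        }
    }
}
```

The exhaustive destructuring pattern is the design choice here: the compiler, rather than a reviewer, flags a conversion that forgot a newly added field.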
If we're rounding `credits_spent`, should we be doing it for these two adjacent functions too?

Kind of unsure. `credits_spent` is something that gets called by UI code, so it makes sense to round. The other two are for things like communicating about credit usage over session sharing. I'm worried about rounding those + having lossy behavior. I'm gonna leave it like this for now but could be convinced to do something to make it clearer later.
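The trade-off in this thread (round for the UI, keep raw values for anything that is transmitted or aggregated) can be sketched as follows. This is an illustration under assumptions, not the PR's code: `UsageTotals`, `credits_spent_for_display`, `credits_spent_raw`, `round_to`, and the one-decimal precision are all hypothetical.

```rust
/// Hypothetical usage record; field names follow the schema in this PR.
#[derive(Debug, Clone, Copy, PartialEq)]
struct UsageTotals {
    credits_spent: f64,
    platform_credits_spent: f64,
}

impl UsageTotals {
    /// Display-oriented accessor: rounds to one decimal place.
    /// (Assumption: the real precision used by the UI may differ.)
    fn credits_spent_for_display(&self) -> f64 {
        round_to(self.credits_spent, 1)
    }

    /// Raw accessor for non-UI consumers (for example, sending usage to
    /// another client), so no precision is lost before transmission.
    fn credits_spent_raw(&self) -> f64 {
        self.credits_spent
    }
}

/// Rounds `value` to `decimals` decimal places, half away from zero.
fn round_to(value: f64, decimals: i32) -> f64 {
    let factor = 10f64.powi(decimals);
    (value * factor).round() / factor
}

/// Shows why rounding before aggregation is lossy: summing values that were
/// already rounded can differ from rounding the sum of the raw values.
fn sum_of_rounded_vs_rounded_sum(values: &[f64]) -> (f64, f64) {
    let sum_of_rounded: f64 = values.iter().map(|v| round_to(*v, 1)).sum();
    let rounded_sum = round_to(values.iter().sum(), 1);
    (sum_of_rounded, rounded_sum)
}
```

Naming the two accessors differently is one way to "make it clearer": the call site shows whether a value is safe to aggregate or only meant for presentation.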