perf(slack): show Pensando faster by parallelizing reactions and prep work by JonasJesus42 · Pull Request #441 · decocms/mcps

JonasJesus42 · 2026-05-14T15:04:02Z

Summary

Each Slack message handler used to serialize four Slack RTTs before the user saw the "Pensando..." (thread starter) message:

addReaction → sendThinkingMessage → removeReaction → [resolveUserName → buildLLMMessages → isLLMAvailable] → LLM

That floor was ~3 round-trips for the thread to appear, even though the 👀 reaction is purely cosmetic and the bot can already show "Pensando..." without it. After the thread shows, three more sequential RTTs delayed the actual LLM call.

For all three handlers (handleAppMention, handleDirectMessage, handleThreadReply):

Reactions are fire-and-forget via a new shared fireReactionCycle helper — add + remove run in the background and never gate anything; errors are swallowed.
sendThinkingMessage is started immediately, its promise is awaited only right before the LLM call (or warning-msg cleanup). No more waiting on addReaction before "Pensando..." shows.
resolveUserName, buildLLMMessages (Slack history fetch), isLLMAvailable are awaited together in a single Promise.all alongside the thinking promise — total wait becomes max(slowest) instead of sum(all).

Latency impact

Time-to-thread/Pensando: ~3 RTTs → 1 RTT (just sendThinkingMessage).
Time-to-LLM-call: previously sum(thinking + name + history + available) ≈ 4 RTTs. Now max(...) ≈ 1 RTT (history fetch is usually the slowest, others fit inside it).

Test plan

Send a DM → "Pensando..." appears under your message in well under 1s.
👀 reaction still appears and disappears (no visual regression).
LLM response still streams into the thinking message in place.
@mention in a channel still threads correctly.
Reply inside an existing bot thread → bot responds in that thread.
If the connection isn't ready, the warning is shown in-thread and the thinking message is cleaned up.

Summary by cubic

Show “Pensando...” in Slack faster by parallelizing reactions and prep work. Time-to-thread drops to 1 RTT, and the LLM call starts sooner. Behavior is unchanged.

Refactors
- Added fireReactionCycle to add/remove 👀 in the background; errors are ignored.
- Start sendThinkingMessage immediately; await only before the LLM call or cleanup.
- Run resolveUserName, buildLLMMessages (history fetch), isLLMAvailable, and the thinking message together via Promise.all in handleAppMention, handleDirectMessage, and handleThreadReply.

^{Written for commit 830778b. Summary will update on new commits.}

…okup, and history fetch Each handler used to serialize four Slack RTTs before the user saw the "Pensando..." (thread starter) message: addReaction -> sendThinkingMessage -> removeReaction -> [later work] That made the floor for time-to-thinking ~3 Slack round-trips, even though the 👀 reaction is purely cosmetic and the bot can already show "Pensando..." without it. For all three message handlers (app_mention, direct message, thread reply): - Reactions are now fire-and-forget via a shared `fireReactionCycle` helper. add + remove run in the background and never gate the response path. Errors are swallowed (a failed reaction is harmless). - `sendThinkingMessage` is started immediately, its promise is awaited only right before the LLM call (or before the warning-msg cleanup). - `resolveUserName`, `buildLLMMessages` (which fetches Slack thread history), and `isLLMAvailable` are awaited together in a single Promise.all alongside the thinking promise — so total wait becomes max(slowest) instead of sum(all). Net effect: the user sees the thread + "Pensando..." after a single Slack RTT rather than ~3 RTTs, and the LLM call starts sooner because prep work overlaps with the thinking-message RTT. Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

perf(slack): show Pensando faster by parallelizing reactions and prep work#441

perf(slack): show Pensando faster by parallelizing reactions and prep work#441
JonasJesus42 wants to merge 1 commit into
mainfrom
slack-faster-thinking

JonasJesus42 commented May 14, 2026 •

edited by cubic-dev-ai Bot

Loading

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Conversation

JonasJesus42 commented May 14, 2026 • edited by cubic-dev-ai Bot Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Summary

Latency impact

Test plan

Summary by cubic

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

JonasJesus42 commented May 14, 2026 •

edited by cubic-dev-ai Bot

Loading