Add agent graph recovery baseline by skywhite1024 · Pull Request #287 · DexForce/EmbodiChain

skywhite1024 · 2026-06-02T05:31:37Z

Description

This PR adds the baseline agent graph and recovery workflow for generative simulation agents.

Main changes:

Add agent graph specification, graph compilation, monitor functions, error functions, and recovery support under embodichain/lab/sim/agent/.
Move legacy simulation atom action logic into the new agent-oriented atomic action structure.
Add configurable success checking and agent task environment helpers for tableware agent workflows.
Add agent configs and prompts for apple pick, cup transfer, rearrangement, and pour water workflows.
Update run_agent.py / run_env.py to support the new public atomic action and recovery-related options.
Update generative simulation documentation for the new agent workflow.
Add .env.example and make LLM configuration load from environment variables instead of hardcoded credentials.

No issue is currently linked.

Fixes #

Type of change

Enhancement (non-breaking change which improves an existing functionality)
New feature (non-breaking change which adds functionality)
Documentation update

Screenshots

Not applicable.

Checklist

I have run the black . command to format the code base.
I have made corresponding changes to the documentation
I have added tests that prove my fix is effective or that my feature works
Dependencies have been updated, if applicable.

Additional validation performed:

git diff --check origin/main...HEAD
python -m black --check .
python -m pytest tests/sim/atomic_actions/test_actions.py tests/sim/atomic_actions/test_engine.py -q
Verified that .env, generated agent content, and temporary export artifacts are not included in the PR diff.

Copilot

Pull request overview

This PR introduces a graph-based “Task → Monitor → Recovery” execution baseline for generative simulation agents, replacing the legacy code-generation workflow with JSON graph artifacts compiled and executed via a runtime task graph. It also updates atomic actions to support ordered execution and adds environment/config changes to support recovery, success predicates, and environment-based LLM configuration.

Changes:

Added agent task-graph + monitor/recovery scaffolding (monitor functions, error functions, graph runtime) and updated scripts to support recovery and public atomic-action options.
Refactored atomic-action execution to preserve configured action order (even with duplicate action names) and added support for MoveAction joint-position targets.
Switched LLM configuration to environment variables / .env + added new task configs and documentation for the new workflow.

Reviewed changes

Copilot reviewed 64 out of 65 changed files in this pull request and generated 5 comments.

Show a summary per file

File	Description
tests/sim/atomic_actions/test_engine.py	Adds regression test ensuring `execute_static` uses configured action order even with duplicate names.
tests/sim/atomic_actions/test_actions.py	Adds tests for MoveAction qpos-target execution and waypoint validation.
embodichain/utils/llm_json.py	New helper to extract/normalize JSON from LLM outputs (plain or fenced).
embodichain/lab/sim/atomic_actions/engine.py	Preserves action registration order via `_action_sequence` for `execute_static`.
embodichain/lab/sim/atomic_actions/actions.py	Adds qpos-target support to MoveAction, plus qpos interpolation helpers and stricter tensor normalization.
embodichain/lab/sim/atom_actions.py	Removes legacy monolithic atom-action implementation (migrated into agent-oriented structure).
embodichain/lab/sim/agent/monitor_utils.py	Adds helpers to compute object/arm/gripper state for monitoring.
embodichain/lab/sim/agent/monitor_functions.py	Adds monitor functions (object moved, object held, gripper distance).
embodichain/lab/sim/agent/error_functions.py	Adds error/disturbance helpers and interactive error-injection utilities for recovery experiments.
embodichain/lab/sim/agent/atom_action_utils.py	Adds agent-specific atomic-action utilities (IK, trajectory planning, state sync, drive-call extraction).
embodichain/lab/sim/agent/agent_graph.py	Adds task-graph runtime that executes edges via `drive()` and switches to recovery branches on monitor triggers.
embodichain/lab/scripts/run_env.py	Adds post-execution task-success logging and plumbing for recovery/public-action flags.
embodichain/lab/scripts/run_agent.py	Adds CLI flags for recovery + public atomic-action toggles; fixes import path for `main`.
embodichain/lab/gym/utils/misc.py	Adds `apply_rotation` helper used by failure injection.
embodichain/lab/gym/envs/tasks/tableware/single_arm_agent_env.py	New single-arm agent env mixin exposed via right-arm slot.
embodichain/lab/gym/envs/tasks/tableware/configurable_success.py	New config-driven task-success predicate evaluator.
embodichain/lab/gym/envs/tasks/tableware/base_agent_env.py	Reworks agent initialization to Task/Recovery/Compile agents; adds arm-slot abstraction and graph generation pipeline.
embodichain/lab/gym/envs/tasks/tableware/atomic_actions.py	New `AtomicActionsAgentEnv` environment using configurable success and grasp pose overrides.
embodichain/lab/gym/envs/tasks/init.py	Registers and exports `AtomicActionsAgentEnv`.
embodichain/data/dataset.py	Adds repo-root relative resolution as an additional dataset path fallback.
embodichain/data/init.py	Changes generated agent prompt dir location (see review comment about writability/config).
embodichain/agents/prompts/recovery_rules.txt	Adds RecoveryAgent authoring constraints/spec rules.
embodichain/agents/prompts/monitor_functions.txt	Adds monitor-function prompt documentation.
embodichain/agents/prompts/error_functions.txt	Adds failure-description prompt context for recovery spec generation.
embodichain/agents/prompts/code_prompt.txt	Removes legacy code-generation prompt rules.
embodichain/agents/prompts/code_example.txt	Removes legacy code-generation examples.
embodichain/agents/prompts/basic_background.txt	Updates background prompt to remove camera-mirroring guidance and standardize axis text.
embodichain/agents/prompts/atom_actions.txt	Removes `drive` description from atomic-actions prompt context.
embodichain/agents/mllm/prompt/task_prompt.py	Updates TaskAgent prompting to emit a nominal JSON task graph (no prose/code).
embodichain/agents/mllm/prompt/recovery_prompt.py	Adds prompt for generating recovery bindings spec JSON.
embodichain/agents/mllm/prompt/compile_prompt.py	Adds compatibility namespace for compile prompt imports (LLM-free compilation).
embodichain/agents/mllm/prompt/code_prompt.py	Removes legacy code prompt class.
embodichain/agents/mllm/prompt/init.py	Exports new prompt modules and removes CodePrompt export.
embodichain/agents/hierarchy/validation_agent.py	Removes legacy image-based validation agent implementation.
embodichain/agents/hierarchy/task_agent.py	Updates TaskAgent to generate/store normalized JSON graph output.
embodichain/agents/hierarchy/recovery_agent.py	Adds RecoveryAgent to generate/store normalized recovery spec JSON.
embodichain/agents/hierarchy/llm.py	Switches from AzureChatOpenAI to ChatOpenAI + `.env` loading and env-based configuration.
embodichain/agents/hierarchy/compile_agent.py	Adds CompileAgent that normalizes/expands recovery spec and executes compiled graph artifacts.
embodichain/agents/hierarchy/code_agent.py	Removes legacy code-generation and AST execution agent.
embodichain/agents/hierarchy/agent_base.py	Adjusts composed observation assembly (no longer auto-pulls env obs by default).
docs/source/features/generative_sim/index.rst	Adds EmbodiAgent + failure-monitor-recovery docs links.
docs/source/features/generative_sim/agents.md	Updates EmbodiAgent docs for graph/recovery workflow (needs updates re removed ValidationAgent).
docs/source/features/failure_monitor_recovery.md	Adds documentation for monitor/recovery graph semantics and artifacts.
configs/gym/pour_water/gym_config.json	Adjusts env config (removes explicit `control_parts` list under env).
configs/gym/pour_water/gym_config_simple.json	Updates joint normalization ids and removes env-level `control_parts` list.
configs/gym/agent/ur10_cup_transfer_agent/task_prompt.txt	New single-arm UR10 cup transfer task prompt.
configs/gym/agent/ur10_cup_transfer_agent/fast_gym_config.json	New AtomicActionsAgent-v3 config for UR10 cup transfer with success spec.
configs/gym/agent/ur10_cup_transfer_agent/basic_background.txt	New basic background prompt for UR10 cup transfer.
configs/gym/agent/ur10_cup_transfer_agent/atom_actions.txt	New task-scoped atomic-action prompt for UR10 cup transfer.
configs/gym/agent/ur10_cup_transfer_agent/agent_config.json	New agent config wiring Task/Recovery/Compile agents for UR10 cup transfer.
configs/gym/agent/rearrangement_agent/fast_gym_config.json	Migrates rearrangement agent config to AtomicActionsAgent-v3 + adds success spec and camera/sensor updates.
configs/gym/agent/rearrangement_agent/atom_actions.txt	New task-scoped atomic-action prompt for rearrangement.
configs/gym/agent/rearrangement_agent/agent_config.json	Migrates to Task/Recovery/Compile agents and replaces code prompt/example with recovery prompt inputs.
configs/gym/agent/pour_water_agent/fast_gym_config.json	Updates pour-water agent config (episodes/steps, camera setup, observation normalization, dataset metadata).
configs/gym/agent/pour_water_agent/agent_config.json	Migrates to Task/Recovery/Compile agents and recovery prompt inputs.
configs/gym/agent/pour_water_agent/agent_config_dual.json	Migrates dual pour-water agent config to Task/Recovery/Compile agents.
configs/gym/agent/apple_pick_agent/task_prompt.txt	New single-arm apple pick-and-place task prompt.
configs/gym/agent/apple_pick_agent/fast_gym_config.json	New AtomicActionsAgent-v3 config for apple pick task with success spec and grasp overrides.
configs/gym/agent/apple_pick_agent/basic_background.txt	New basic background prompt for apple pick task.
configs/gym/agent/apple_pick_agent/atom_actions.txt	New task-scoped atomic-action prompt for apple pick task.
configs/gym/agent/apple_pick_agent/agent_config.json	New agent config wiring Task/Recovery/Compile agents for apple pick task.
.gitignore	Ignores generated agent artifacts and additional local/generated assets.
.env.example	Adds template for local LLM configuration via environment variables.

Comments suppressed due to low confidence (1)

docs/source/features/generative_sim/agents.md:95

The ValidationAgent section refers to a module that no longer exists. Either remove this section or rewrite it to describe the current success evaluation entrypoint (env.is_task_success() / configurable_success).

### ValidationAgent
*Located in:* `embodichain/agents/hierarchy/validation_agent.py`

💡 Add Copilot custom instructions for smarter, more guided reviews. Learn how to get started.

 database_dir = EMBODICHAIN_DEFAULT_DATABASE_ROOT
 database_2d_dir = os.path.join(database_dir, "2dasset")
-database_agent_prompt_dir = os.path.join(database_dir, "agent_prompt")
+database_agent_prompt_dir = os.path.abspath(
+    os.path.join(os.path.dirname(__file__), "..", "database/agent_generated_content")
+)


+        self.open_state = torch.as_tensor(
+            getattr(self, "gripper_open_state", [0.05]),
+            dtype=self.init_qpos.dtype,
+        ).flatten()
+        self.close_state = torch.as_tensor(
+            getattr(self, "gripper_close_state", [0.0]),
+            dtype=self.init_qpos.dtype,
+        ).flatten()


+- **Execute**: The graph runtime executes atomic actions and switches to recovery branches when monitors trigger.
 - **Validate**: The `ValidationAgent` analyzes the result images, selects the best camera angle, and judges success.
- **Refine**: If validation fails, feedback is sent back to the agents to regenerate the plan or code.
+- **Refine**: If validation fails, feedback is sent back to the agents to regenerate the graph artifacts.


 Closes the loop by verifying if the robot actually achieved what it planned.

 * Uses a specialized LLM call (`select_best_view_dir`) to analyze images from all cameras and pick the single best angle that proves the action's outcome, ignoring irrelevant views.
-* If an error occurs (runtime or logic), it generates a detailed explanation which is fed back to the `TaskAgent` or `CodeAgent` for the next attempt.
+* If an error occurs (runtime or logic), it generates a detailed explanation which is fed back to the `TaskAgent` or `CompileAgent` for the next attempt.


+│   ├── compile_agent.py       # Graph compilation & execution interface
 │   ├── validation_agent.py    # Visual analysis & view selection
 │   └── llm.py                 # LLM configuration and instances


…gym_config

yuecideng · 2026-06-03T03:12:58Z

        self._semantic_analyzer = SemanticAnalyzer()

        # Initialize default actions
+        self._action_sequence: List[tuple[str, AtomicAction]] = []


Why adding a new sequence of AtomicAction?

yuecideng · 2026-06-03T03:14:00Z

@@ -0,0 +1,9 @@
+# Copy this file to .env and fill in your local credentials.


We should unified MLLM entry point @XuanchaoPENG

yuecideng · 2026-06-03T03:15:35Z

+scripts/benchmark/rl/reports/*
+
+# Local/generated assets
+obj/asset_simready.obj


We should add general gitignore contents. Specific items should not be added to this file

yuecideng · 2026-06-03T03:16:31Z

        arm_dof = self.dof if arm_dof is None else arm_dof
        if start_qpos is None:
            start_qpos = self.robot.get_qpos(name=self.cfg.control_part)
+        start_qpos = torch.as_tensor(


This line should be removed. We only accept torch tensor

yuecideng · 2026-06-03T03:19:41Z

            trajectory of shape (n_envs, n_waypoints, dof),
            joint_ids corresponding to trajectory
        """
+        if isinstance(target, torch.Tensor):


We have Joint space intepolation method for motion generator. Please use the unified module.

Copilot

Pull request overview

Copilot reviewed 85 out of 89 changed files in this pull request and generated 2 comments.

Comments suppressed due to low confidence (1)

embodichain/gen_sim/action_agent_pipeline/error_functions.py:320

wrong_affordance duplicates the (single-column) gripper_state_traj twice in the returned action. After preserving both gripper DOFs (see comment above), the return should append the full gripper_state_traj once to keep the original gripper joint values intact.

+    if last_frame_pose is None:
+        last_frame_pose = env.obj_info.get(obj_name).get("pose")
+    current_pose = _get_object_pose(env, obj_name)
+    previous_pose = _as_pose_matrix(last_frame_pose, device=current_pose.device)


+    else:
+        select_arm = "left_arm" if is_left else "right_arm"
+    start_qpos = torch.as_tensor(action[0, :-2], dtype=torch.float32)
+    last_qpos = torch.as_tensor(action[-1, :-2], dtype=torch.float32)
+    gripper_state_traj = action[:, -1:].copy()
+    last_pose = env.get_arm_fk(qpos=last_qpos, is_left=is_left).clone()


Copilot AI review requested due to automatic review settings June 2, 2026 05:31

Copilot started reviewing on behalf of skywhite1024 June 2, 2026 05:31 View session

Copilot AI reviewed Jun 2, 2026

View reviewed changes

Codex and others added 15 commits June 3, 2026 10:35

Add agent graph recovery baseline

cbe1a00

Save atom actions changes

71381db

Save apple pick prompt config changes

ada2af0

Save apple pick config refinements

da0ecc7

Migrate shared LLM config loading

c549d8d

Move action agent pipeline modules

0e546ce

Save action agent atom actions updates

014288b

Move hierarchy agents into action pipeline

6359fe9

Move agent configs and env adapters

8ae14d4

Move action agent CLI

3ea5b79

Save action agent pipeline updates

4973ec1

Save action agent prompt updates

70a0c18

Save action agent execution updates

5b23646

feat: add ignore_terminations_during_agent to ur10_cup_transfer fast_…

e6f1986

…gym_config

fix: align action agent pipeline PR updates

478db9a

yuecideng requested changes Jun 3, 2026

View reviewed changes

skywhite1024 added 3 commits June 3, 2026 11:43

Save LLM config and atomic engine updates

3bc5890

Save UR10 cup transfer config update

6c0c8f5

Save recovery agent and atomic action updates

8602d5e

skywhite1024 force-pushed the ljd/baseline_agentchoard_clean branch from 2a16b73 to 8602d5e Compare June 3, 2026 04:37

Save motion generator test config updates

480fe5f

Copilot AI review requested due to automatic review settings June 3, 2026 06:12

Copilot started reviewing on behalf of skywhite1024 June 3, 2026 06:12 View session

Copilot AI reviewed Jun 3, 2026

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add agent graph recovery baseline#287

Add agent graph recovery baseline#287
skywhite1024 wants to merge 19 commits into
mainfrom
ljd/baseline_agentchoard_clean

skywhite1024 commented Jun 2, 2026

Uh oh!

Copilot AI left a comment

Uh oh!

yuecideng Jun 3, 2026

Uh oh!

yuecideng Jun 3, 2026

Uh oh!

yuecideng Jun 3, 2026

Uh oh!

yuecideng Jun 3, 2026

Uh oh!

yuecideng Jun 3, 2026

Uh oh!

Copilot AI left a comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

		@@ -0,0 +1,9 @@
		# Copy this file to .env and fill in your local credentials.

Conversation

skywhite1024 commented Jun 2, 2026

Description

Type of change

Screenshots

Checklist

Uh oh!

Copilot AI left a comment

Choose a reason for hiding this comment

Pull request overview

Reviewed changes

Uh oh!

yuecideng Jun 3, 2026

Choose a reason for hiding this comment

Uh oh!

yuecideng Jun 3, 2026

Choose a reason for hiding this comment

Uh oh!

yuecideng Jun 3, 2026

Choose a reason for hiding this comment

Uh oh!

yuecideng Jun 3, 2026

Choose a reason for hiding this comment

Uh oh!

yuecideng Jun 3, 2026

Choose a reason for hiding this comment

Uh oh!

Copilot AI left a comment

Choose a reason for hiding this comment

Pull request overview

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants