ADR: Agent Executor Simplification Refactor

Status: 🟡 In-review
Category: Refactoring & Code Quality
Date: January 16, 2026
Signed-off-by: Sri Aradhyula <sraradhy@cisco.com>

Overview / Summary

Refactored agent_executor.py to reduce complexity by 36% (from 971 to 613 lines) while maintaining full functionality. The monolithic 722-line execute() method was decomposed into focused helper methods, and dead code (unused routing logic, feature flags) was removed.

This refactor also incorporates the streaming duplication fix from PR #647 (clear_accumulators signal handling).

Problem / Problem Statement

Issues

Monolithic execute() method: The main execution method was 722 lines, making it difficult to:
- Understand the execution flow
- Test individual components
- Debug issues
- Add new features safely
Dead code accumulation: Over time, experimental features were added but never used:
- RoutingType enum and RoutingDecision class
- _route_query(), _detect_sub_agent_query() methods
- _stream_from_sub_agent(), _stream_from_multiple_agents() methods
- Feature flags for routing modes (ENABLE_ENHANCED_STREAMING, etc.)
- Configurable routing keywords
Scattered state management: Execution state was tracked through many individual variables, making it hard to trace the flow.

Code Metrics Before

Metric	Value
Total lines	971
execute() method	~722 lines
Dead methods	8
Feature flags	4
State variables	15+ scattered

Solution / Implementation

1. StreamState Dataclass

Introduced a StreamState dataclass to centralize execution state:

@dataclass
class StreamState:
    """Tracks streaming state for A2A protocol."""
    # Content accumulation
    supervisor_content: List[str] = field(default_factory=list)
    sub_agent_content: List[str] = field(default_factory=list)
    sub_agent_datapart: Optional[Dict] = None

    # Artifact tracking
    streaming_artifact_id: Optional[str] = None
    seen_artifact_ids: set = field(default_factory=set)
    first_artifact_sent: bool = False

    # Completion flags
    sub_agent_complete: bool = False
    task_complete: bool = False
    user_input_required: bool = False

2. Extracted Helper Methods

Decomposed the monolithic execute() into focused, testable methods:

Method	Purpose
`_get_final_content()`	Determines final content (supervisor vs sub-agent)
`_is_tool_notification()`	Detects tool call notifications
`_get_artifact_name_for_notification()`	Names artifacts appropriately
`_normalize_content()`	Handles AWS Bedrock list format
`_send_artifact()`	Centralized artifact sending
`_send_completion()`	Sends task completion status
`_send_error()`	Sends error status
`_handle_sub_agent_artifact()`	Processes sub-agent artifacts
`_handle_task_complete()`	Handles task completion
`_handle_user_input_required()`	Handles user input requests
`_handle_streaming_chunk()`	Processes streaming content
`_handle_stream_end()`	Handles stream termination

3. Removed Dead Code

Removed	Reason
`RoutingType` enum	Never used in production
`RoutingDecision` class	Never used in production
`_parse_env_keywords()`	Part of unused routing
`_detect_sub_agent_query()`	Part of unused routing
`_route_query()`	Part of unused routing
`_stream_from_sub_agent()`	Never called
`_stream_from_multiple_agents()`	Never called
`_extract_text_from_artifact()`	Unused
Feature flags	Never activated in production

4. Added Streaming Fix (PR #647)

Incorporated the streaming duplication fix:

# Handle clear_accumulators signal for retry/fallback
if isinstance(event, dict) and event.get('clear_accumulators'):
    logger.info("🗑️ Received clear_accumulators signal - clearing accumulated content")
    state.supervisor_content.clear()
    state.sub_agent_content.clear()

Code Metrics After

Metric	Before	After	Change
Total lines	971	613	-36.4%
execute() method	~722 lines	~155 lines	-78.5%
Dead methods	8	0	-100%
Feature flags	4	0	-100%
State variables	15+ scattered	1 dataclass	Clean

Features Preserved

All active features are maintained:

Feature	Status
`execute()`	✅ Refactored
`cancel()`	✅ Kept
`_safe_enqueue_event()`	✅ Kept
`_parse_execution_plan_text()`	✅ Kept
`_format_execution_plan_text()`	✅ Kept
`_ensure_execution_plan_completed()`	✅ Kept
`new_data_artifact()`	✅ Kept
Streaming duplication fix	✅ Added
Sub-agent artifact handling	✅ Kept
Tool notification detection	✅ Extracted
User input handling	✅ Extracted
Error handling	✅ Extracted

Testing / Verification

Streaming Tests

# Test 1: CAIPE persona query
curl -s -N -X POST http://localhost:8000/ \
  -H "Content-Type: application/json" \
  -H "Accept: text/event-stream" \
  -d '{
    "jsonrpc": "2.0",
    "id": "test-1",
    "method": "message/stream",
    "params": {
      "message": {
        "messageId": "msg-1",
        "role": "user",
        "parts": [{"text": "What is CAIPE persona support?"}]
      }
    }
  }'
# ✅ Verified: Streaming works, tool notifications appear, final result correct

# Test 2: SRE onboarding query
curl -s -N -X POST http://localhost:8000/ \
  -H "Content-Type: application/json" \
  -H "Accept: text/event-stream" \
  -d '{
    "jsonrpc": "2.0",
    "id": "test-2",
    "method": "message/stream",
    "params": {
      "message": {
        "messageId": "msg-2",
        "role": "user",
        "parts": [{"text": "Show SRE onboarding docs"}]
      }
    }
  }'
# ✅ Verified: Multi-tool flow works, execution plan tracked

Verified Behaviors

Behavior	Result
Token streaming	✅ Works
Tool notifications	✅ Displayed
Execution plan tracking	✅ Updated
Sub-agent artifacts	✅ Forwarded
Task completion	✅ Sent
Error handling	✅ Works
User input requests	✅ Works
Cancellation	✅ Works

Files Modified

ai_platform_engineering/
└── multi_agents/
    └── platform_engineer/
        └── protocol_bindings/
            └── a2a/
                └── agent_executor.py
                    - Reduced from 971 to 613 lines
                    - Added StreamState dataclass
                    - Extracted 12 helper methods
                    - Removed 8 dead methods
                    - Removed 4 feature flags
                    - Added clear_accumulators handling

Benefits

Improved Readability
- execute() now fits on one screen (~155 lines)
- Clear separation of concerns
- State management centralized in StreamState
Better Testability
- Helper methods can be unit tested individually
- State transitions are predictable
- Mock-friendly design
Easier Maintenance
- No dead code to maintain
- No unused feature flags to confuse
- Clear responsibility boundaries
Performance
- No runtime overhead from unused routing logic
- Cleaner execution path
- Same functionality, less code to execute

Rollback Plan

If issues arise, the original code is available:

# View original from main branch
git show origin/main:ai_platform_engineering/multi_agents/platform_engineer/protocol_bindings/a2a/agent_executor.py

# Restore if needed
git checkout origin/main -- ai_platform_engineering/multi_agents/platform_engineer/protocol_bindings/a2a/agent_executor.py

Notes

This refactor maintains full backward compatibility
No changes to the A2A protocol or agent.py
All existing clients continue to work unchanged
The removed routing logic was never enabled in production (all flags defaulted to off)

Overview / Summary​

Problem / Problem Statement​

Issues​

Code Metrics Before​

Solution / Implementation​

1. StreamState Dataclass​

2. Extracted Helper Methods​

3. Removed Dead Code​

4. Added Streaming Fix (PR #647)​

Code Metrics After​

Features Preserved​

Testing / Verification​

Streaming Tests​

Verified Behaviors​

Files Modified​

Benefits​

Rollback Plan​

Related Documentation​

Notes​