Reflexion adds a critical missing capability to plain reflection: evidence acquisition. A reflection loop can critique quality, but Reflexion can also fetch new information and revise based on that evidence.
Minimal Reflexion architecture:
- Responder creates first draft + self-critique + search intents.
- Tool execution node gathers external evidence for those intents.
- Reviser rewrites output using observations and citation constraints.
- Router decides continue vs finalize based on score/improvement policy.
Why this is better than reflection-only systems: quality is improved by new facts, not only better wording. This matters for news, pricing, compliance, or any domain where stale knowledge is risky.
Core trade-off: better factuality and grounding at the cost of added latency, more tool calls, and larger state.
Control requirements: cap loops, bound tool usage, enforce citation schema, and define fallback when evidence remains weak.
Deepening Notes
Source-backed reinforcement: these points are extracted from the LangGraph source note to sharpen architecture and flow intuition.
- because we are not really making any tools like T Search tool that we saw right we're not making it available to the reflection agent system okay so that is a drawback of reflecti
- but also fact checks it with external data by making API calls in this case internet search in the reflection agent pattern we had to rely on the training data of llms but in this
- e actor the actor is the main agent that drives everything it reflects on its responses and re-executes it it can do this with or without tools to improve based on self-critique th
- later on we look at the diagram and it would make sense so we've got tools we've got the responder agent and we've got the revisor agent finally okay so the revisor agent agents jo
- the list of Search keywords that the responder agent generated this tools component is actually going to you know the Tav search Tool if you remember in the first few sections we l
Interview-Ready Deepening
Source-backed reinforcement: these points add detail beyond short-duration UI hints and emphasize production tradeoffs.
- Reflexion extends reflection by grounding critique/revision with live external data via tools.
- A reflection loop can critique quality, but Reflexion can also fetch new information and revise based on that evidence.
- Reflexion adds a critical missing capability to plain reflection: evidence acquisition.
- Why this is better than reflection-only systems: quality is improved by new facts, not only better wording.
- Core trade-off: better factuality and grounding at the cost of added latency, more tool calls, and larger state.
- Responder creates first draft + self-critique + search intents.
- Tool execution node gathers external evidence for those intents.
- This matters for news, pricing, compliance, or any domain where stale knowledge is risky.
Tradeoffs You Should Be Able to Explain
- More agent autonomy increases adaptability but also increases non-determinism and debugging effort.
- Tool-heavy loops improve grounding, but latency and failure surfaces rise with each external dependency.
- Fine-grained state graphs improve control, but poor state contracts can create brittle routing behavior.
First-time learner note: Think in state transitions, not giant prompts. Keep node responsibilities small and route logic deterministic so each step is easy to reason about.
Production note: Bound autonomy with loop limits, tool policies, and checkpoints. Capture route decisions and state snapshots for replay and incident analysis.