The AI Brief #3 AI agents agent reliability workflow automation prompting AI in production

AI Agents Forget Their Instructions: A Structural Problem, Not a Technical Limitation

Rodrigue Le Gall | | 3 min read

A troubling discovery is emerging from developer practice: large language models lose their initial instructions during long conversations, much like a brain with ADHD that skips intermediate steps and rushes toward the quick exit.

The “Lost in the Middle” research (Stanford 2023) quantifies the phenomenon: a 30% or greater performance degradation when a critical instruction appears in the middle of a long conversation. The models see it, but don’t process it with the same weight as fresh information.

This isn’t anecdotal. Developers building agent workflows (automation of complex multi-step tasks) regularly hit this wall: the AI skips the “boring” steps, rushes to output, and abandons constraints established at startup.

The good news? Solutions exist: restructure prompts with “scaffolding” (break into explicit sub-tasks), reinject critical instructions regularly, or use fine-tuned models with locked-in instructions. But it requires upstream work.

For SMBs deploying AI agents in production (customer service, order processing, HR workflows), this limitation isn’t theoretical—it creates costly errors in real-world environments.

What this means for your business

If you’ve launched an AI agent to automate a complex business process, you’ve probably observed this phenomenon: the AI ignores a business rule established at startup, approves an order it shouldn’t, or skips a verification step.

Before blaming the model, accept that this is a design problem, not a raw capability issue. Three concrete actions: (1) Test your agents on long conversations—that’s where the problem surfaces. (2) Restructure your prompts to inject critical rules through short, explicit steps rather than one monolithic block. (3) For truly mission-critical workflows, consider fine-tuning with your business constraints embedded in training data.

It’s not glamorous, but it’s what separates an AI pilot that works from one that fails in production.


In brief

Code Copilots: Developers Are Already Writing Less Code

Developers confirm that after adopting Claude Code and GPT Codex in December 2025, they write radically less code manually—a psychological shift comparable to moving from mail to email. The productivity impact is massive, but it also creates anxiety: what does it mean to be a developer if tools do 70% of the work?

Read source

Verifying a Human Is Behind the Agent: New Standard in AI Commerce

With the rise of autonomous AI agents for online purchasing, a human-verification tool is emerging to confirm that a human actually pilots the agent. This becomes a trust requirement for sellers and platforms facing growing automation.

Read source

Nvidia Launches NemoClaw: A Secure AI Agent Platform for Enterprise

Nvidia responds to growing demand with NemoClaw, an enterprise version of the viral OpenClaw framework, focused on security and integration with critical systems. It’s a signal that AI agents are becoming standard IT infrastructure, not just a gadget.

Read source

Encyclopedia Britannica and Merriam-Webster are taking legal action against OpenAI for unauthorized use of copyrighted content in training, with generation of “substantially similar” responses. This broadens the legal front on AI training data.

Read source

Pentagon Wants to Train AI Models on Classified Data

U.S. military officials are considering creating secure environments where AI providers (OpenAI, Anthropic, etc.) train militarized versions of their models on classified data. A signal of a race toward vertical AI specialization in sensitive sectors.

Read source

Get The AI Brief in your inbox

3x per week, the essentials of AI decoded for business leaders.

Subscribe

Take action

Ready to automate your repetitive tasks?

Discover what AI can concretely change in your business. In 2 hours, we identify your automation opportunities.

Free AI Checklist

10 processes to automate in your business

Download PDF