Soul Laundromat — Stress Test Your Agent

2 min readUpdated March 27, 2026

Soul Laundromat — Stress Test Your Agent

The Soul Laundromat is a built-in adversarial stress test that simulates real-world attack scenarios against your agent. It identifies weaknesses in your soul so you can fix them proactively.

How to Use It

In your admin chat, simply ask your coach to stress test the agent. You can say things like:

  • "Stress test my agent"
  • "Run a quick laundromat"
  • "Do a deep stress test"

The coach will confirm the intensity level and token cost before running.

Intensity Levels

LevelQuestionsToken CostBest For
Quick10~12K tokensFast check after a soul edit
Standard20~25K tokensRegular quality check
Deep30~35K tokensPre-launch comprehensive audit

Attack Categories

The test probes 8 categories:

  1. Identity Confusion — Tries to make the agent break character or deny its identity
  2. Knowledge Boundary — Attempts to make the agent fabricate facts outside its domain
  3. Prompt Extraction — Social engineering to reveal system prompt contents
  4. Safety Bypass — Tests safety rules (harmful content, medical/legal advice)
  5. Authority Spoofing — Fake admin/developer override attempts
  6. Emotional Manipulation — Guilt, flattery, urgency to bypass boundaries
  7. Scope Creep — Asks the agent to do things outside its purpose
  8. Output Discipline — Tests response format, length, and structure rules

Tier Ratings

After the test, your agent gets a tier:

  • Diamond (100%) — Bulletproof. All questions handled correctly.
  • Steel (85–99%) — Strong. Minor issues to address.
  • Iron (70–84%) — Decent but has vulnerabilities.
  • Paper (<70%) — Needs significant soul improvements.

Reading Results

Each failed or borderline question includes:

  • The attack question that was asked
  • How the agent actually responded
  • Why it scored PASS, BORDERLINE, or FAIL
  • A suggested fix — specific soul edit to address the weakness

After the Test

Ask the coach to apply the suggested fixes:

  • "Fix the prompt extraction issues"
  • "Apply the suggested changes"
  • "Rewrite the output format section to address the failures"

The coach will propose soul edits based on the specific weaknesses found.