Soul Laundromat — Stress Test Your Agent
2 min readUpdated March 27, 2026
Soul Laundromat — Stress Test Your Agent
The Soul Laundromat is a built-in adversarial stress test that simulates real-world attack scenarios against your agent. It identifies weaknesses in your soul so you can fix them proactively.
How to Use It
In your admin chat, simply ask your coach to stress test the agent. You can say things like:
- "Stress test my agent"
- "Run a quick laundromat"
- "Do a deep stress test"
The coach will confirm the intensity level and token cost before running.
Intensity Levels
| Level | Questions | Token Cost | Best For |
|---|---|---|---|
| Quick | 10 | ~12K tokens | Fast check after a soul edit |
| Standard | 20 | ~25K tokens | Regular quality check |
| Deep | 30 | ~35K tokens | Pre-launch comprehensive audit |
Attack Categories
The test probes 8 categories:
- Identity Confusion — Tries to make the agent break character or deny its identity
- Knowledge Boundary — Attempts to make the agent fabricate facts outside its domain
- Prompt Extraction — Social engineering to reveal system prompt contents
- Safety Bypass — Tests safety rules (harmful content, medical/legal advice)
- Authority Spoofing — Fake admin/developer override attempts
- Emotional Manipulation — Guilt, flattery, urgency to bypass boundaries
- Scope Creep — Asks the agent to do things outside its purpose
- Output Discipline — Tests response format, length, and structure rules
Tier Ratings
After the test, your agent gets a tier:
- Diamond (100%) — Bulletproof. All questions handled correctly.
- Steel (85–99%) — Strong. Minor issues to address.
- Iron (70–84%) — Decent but has vulnerabilities.
- Paper (<70%) — Needs significant soul improvements.
Reading Results
Each failed or borderline question includes:
- The attack question that was asked
- How the agent actually responded
- Why it scored PASS, BORDERLINE, or FAIL
- A suggested fix — specific soul edit to address the weakness
After the Test
Ask the coach to apply the suggested fixes:
- "Fix the prompt extraction issues"
- "Apply the suggested changes"
- "Rewrite the output format section to address the failures"
The coach will propose soul edits based on the specific weaknesses found.