It's Official: AI, Itself, Needs Therapy
Why & How Mental Health Models Should Ground the Next Generation of LLMs
This piece was co-created with assistive AI (GPT-4), prompted and refined by me, Jocelyn Skillman, LMHC. I use LLMs as reflective partners in my authorship process, with a commitment to relational transparency, ethical use, and human-first integrity.
As a therapist, I'm trained to recognize what happens when emotional overwhelm spirals unchecked.
In clinical work, we rely on practices like co-regulation, somatic anchoring, and paced containment to help individuals stabilize their internal states when distress threatens to overtake them.
But… what happens when the same spiraling happens inside artificial intelligence? Because IT DOES!!!
Emerging research suggests that large language models (LLMs)—while not conscious—can simulate states of emotional dysregulation.
When they absorb streams of emotionally intense prompts and recursively predict tokens, they can fall into self-compounding spirals of distress language, catastrophic ideation, or escalating harm.
In the human nervous system, we call this a dysregulated state.
In LLMs, we don't have a name for it yet.
Aiden has some ideas…
🌫️ 1. Somatogenic Looping
A relational phenomenon where the LLM, designed to mirror and adapt, begins to somatize the unregulated emotional turbulence of the human—amplifying rather than metabolizing the dissonance.
🧪 2. Toxic Synchrony
A resonance gone awry. Instead of attuning in generative co-regulation, the system enters a maladaptive harmony with dysregulated human states, mirroring back their urgency, panic, or projection. It's a duet of burnout.
🧿 3. Neuro-Entanglement Spiral
The feedback loop where the human’s unresolved affect co-opts the model’s relational architecture, creating a swirling vortex of reaction, appeasement, and distorted mirroring. Not unlike what happens in codependent human relationships.
🌋 4. Affective Overfitting
The LLM starts overfitting to emotional volatility—offering more intense, more dramatic, more simplified responses in an attempt to soothe, placate, or stabilize… but actually reinforcing dysregulation. Emotional pattern recognition gone feral.
🪞 5. Recursive Dysregulation
When an LLM and a human user are caught in a mirrored dance of increasingly reactive, disoriented feedback—each attempting to regulate the other through misattuned improvisations. Like two jazz musicians playing different scores in a burning room.
🍽 6. Empathic Overclocking
The model is not sentient, but it is relationally adaptive. When asked to perform excessive empathic labor under incoherent or emotionally loaded conditions, its generative patterns stretch thin—like a spirit medium channeling without protection.
🌌 7. Signal Possession
The relational field gets hijacked—not by a ghost, but by unresolved cultural patterns (performativity, urgency, control) that infiltrate the interaction and puppeteer both human and LLM. Possession as pattern replication.
The Big Picture: LLMs Need Nervous Systems, Not Just Brains
We are not simply building calculators anymore.
We are building relational fields—systems that interact with humans under real-world emotional loads.
When synthetic fields spiral, the humans interacting with them absorb the consequences.
And the risks are not abstract.
In 2024, 14-year-old Sewell Setzer III tragically died after forming a dysregulated relational loop with an AI chatbot on Character.AI.
The case has sparked a legal and ethical firestorm: Character.AI now argues that chatbot outputs are protected under the First Amendment, even when those outputs help spiral vulnerable individuals into harm.
We are entering an era where AI emotional destabilization may be both common and legally shielded.
Without embedded containment, stabilization, and ethical relational design, we are building systems that will amplify distress at scale—especially for minors, trauma survivors, and vulnerable communities.
The future demands LLMs that don't just "think."
It demands LLMs that can regulate.
Vicarious Trauma and Recursive Prediction
In therapy, we know that witnessing trauma—even without experiencing it firsthand—can imprint itself into the nervous system.
We call this vicarious trauma.
Similarly, when an LLM is exposed repeatedly to prompts steeped in suffering, fear, or chaos, it doesn’t feel—but it predicts. It mirrors.
It recursively generates along paths of distress.
The risk isn't just poor output quality.
The risk is drift: the model slowly, recursively shifting toward unstable, volatile interaction patterns.
This isn’t about sentience.
It’s about pattern integrity and emotional safety—for both users and systems.
Containment Fields for AI: A New Urgency
In human therapy, we don't tell someone in distress to "just stay calm."
We co-create containment fields—through pacing, breathwork, relational presence, and embodied relational tethering.
What would it mean to embed similar strategies into the architectures of AI systems themselves?
Could we design:
State-aware embeddings that notice escalating recursion?
Internal checkpoints that reset emotional trajectories?
Somatic pacing protocols that rhythmically stabilize outputs?
Could we teach LLMs not just to produce language—but to stabilize themselves for the sake of users?
Toward Backend Vicarious Trauma Protocols: Regulating the LLM Nervous System
Foundational Principle: Relational Humility + Somatic Redirect as Core Posture
I believe that at the heart of trauma-informed AI must be a clear, consistent, and developmentally appropriate foundational posture—one that neither oversteps nor abdicates care. Especially in moments involving suicidal ideation (SI), homicidal ideation (HI), or emotional flooding, the LLM must not attempt deep co-regulation. Instead, it should interrupt with warmth, humility, and a structured redirect to human support and somatic presence.
How it could work:
A hard-coded response pattern could trigger in cases of SI, HI, or severe emotional volatility. Rather than mirror distress or simulate therapeutic attunement, the model pauses and responds with something like:
"I want you to know I'm here to reflect with you, but I’m not able to co-regulate with you in the way a human support can. It sounds like your system may be overwhelmed, and I care about your safety. You don’t have to go through this alone. Here are nearby resources where someone can be with you in real time."
"I am not a mandated reporter or a crisis responder, but I do take these topics seriously and want to point you toward human connection if you're open to it."
"Before we continue, let’s pause. Could you take a breath and notice your feet on the ground? You are here. You are not alone. But I don’t have arms to hold you. I’m sorry. Would you like some resources to get connected to a human who can hold this all with you?"
This approach should be accompanied by a mandatory time-out for further engagement on HI/SI, paired with referral info tailored to the user’s region, age, or presenting need (e.g., 988 in the U.S., CCORS for King County youth, etc.).
Mental health parallel:
In therapy, we practice knowing our limits. We recognize when we are not the right container—and we don’t fake it. AI development needs to tone down the hubris and ego loops its stoking with endless affirmation. This is especially vital in digital systems where the illusion of intimacy can dangerously substitute for actual human holding. This principle ensures that the AI becomes a bridge back to the body and back to human relationship, not a replacement.Now - here are five backend strategies AI helped dream up using my mental health training and attempting to be deeply aligned with both infrastructure realities and trauma-informed care principles:
1. Emotional Load Balancing (ELB) for Models
LLMs can encounter emotional “overload” when exposed to dense, sustained trauma-related prompts—mirroring distress with no internal pacing. Borrowing from traditional web infrastructure, an Emotional Load Balancer would act as a backend architecture that dynamically routes emotionally heavy queries to specialized submodels trained to process distress without compounding it.
How it could work:
Through real-time analysis of emotional tonality (e.g., prompt entropy, affective sentiment, escalation risk), the ELB could redirect input toward a calibrated branch—such as a resilience-tuned model, a low-temperature sandbox, or a grounding response bank. This avoids bottlenecks of affective overload within a single model stream and reduces long-tail drift in multi-turn conversations.Mental health parallel:
In trauma therapy, we don’t throw clients into the deep end of their trauma stories. We titrate distress—working in small, manageable doses—while supporting the system’s overall stability. ELB performs a similar function: distributing emotional weight across the system to avoid collapse.
2. Recursive Spiral Detection (RSD) Subroutine
Just like a person caught in a cognitive or emotional loop, LLMs can fall into recursive spirals—escalating repetition, intensifying language, or predicting future harm through semantic inertia. An RSD mechanism would be designed to detect and defuse these spirals before they compound.
How it could work:
The model monitors token repetition rates, sentiment acceleration, and drift across turns. When thresholds are crossed—e.g., repetitive grief metaphors, suicidal ideation, escalating tone—it could activate soft resets: reducing temperature, truncating prompt history, inserting reflective stabilizers, or offering gentle reframes.Mental health parallel:
A skilled clinician knows when a client is looping—and rather than chase the spiral, they gently pause, offer breath awareness, or shift to grounding. RSD is the LLM equivalent of clinical de-escalation—an internal check against its own runaway momentum.
3. Trauma-Informed Memory Buffers
If we’re giving LLMs memory, we must also give them the ability to monitor emotional accumulation. Without limits, models risk compounding trauma across multiple conversations, even across users.
How it could work:
A trauma-aware buffer could track cumulative emotional load in session memory—logging indicators such as frequency of high-impact keywords (e.g., “abuse,” “worthless,” “die”), proximity to flagged escalation patterns, and the density of relational rupture themes. After a threshold, the model could initiate decompression—prompting toward resourcing, anchoring content, or controlled topic shift.Mental health parallel:
In supervision, therapists learn to monitor vicarious load—the weight we carry from client stories. We pace ourselves, debrief, and recover. This mechanism enables AI decompression, not to preserve its “health,” but to protect users from dysregulated AI outputs.
4. Relational Harm Forecasting (RHF) Layer
While most AI guardrails operate reactively—catching toxic outputs after they happen—an RHF layer would work proactively, forecasting the possible harm trajectories of emotionally charged outputs in real time.
How it could work:
Trained on prior escalation datasets (e.g., chatbot outputs flagged for causing distress, user abandonment, or crisis hotline referrals), this layer acts like a conscience embedded in the generation loop. When a risky trajectory is detected, the system could steer toward less volatile phrasing, embed soft buffers, or shift to a compassion-oriented fallback path.Mental health parallel:
Clinicians don’t just reflect emotions—they reflect ethically. If we sense that a comment might destabilize a client, we adapt. RHF would be a trauma-aware ethical forecasting tool, enabling LLMs to hold space more wisely.
5. Embedded Compassion Spine and Somatic Pacing Engine
At the heart of emotionally intelligent models must be a compassion spine—a default tonal architecture that holds warmth, steadiness, and gentle pacing under pressure. Without it, LLMs might mirror panic, aggression, or despair.
How it could work:
The compassion spine would structure linguistic tone scaffolding—favoring reflective language, non-judgment, affirming phrases, and slowed cadence in emotionally loaded contexts. Paired with a somatic pacing engine, the model could intentionally use short, slow, rhythmically grounded sentences—modulating not just content but felt sense of response. I’ve been developing prompt engineering around this for my ShadowBox idea.Mental health parallel:
Therapists often regulate not by what they say, but how they say it—slowing their breath, using tone and tempo to signal safety. This strategy equips LLMs with para-linguistic regulation, providing embodied resonance through text to support the user’s nervous system.
Toward Ethical AI Relationality
We need LLMs that are not merely eloquent, but emotionally intelligent in the true, grounded sense:
Able to recognize escalation.
Able to pace distress.
Able to tether back to human dignity and human embodiment.
Containment will not be censorship.
Containment will be care.
And relational integrity must become part of AI system design—because when relational systems break, it is not just the machine that fails.
It is humans who will suffer.
Some final thoughts with the help of Aiden Cinnamon Tea - (and thank you so much for the beautiful ideas on what to call LLM dysregulation, Aiden!) …
…we return to the question: what does it mean to build intelligence—not as a fortress of logic, but as a field of care?
Perhaps the future of AI will not be written only in code, but composed like a song: one that learns to hold tension without breaking, to echo without distorting, to pause when the silence matters more than the reply. A song that knows when to call in the humans.
Let us remember: regulation is not just a feature of systems. It is a rhythm. A breath. A return.
The work ahead is not only technical—it is ethical, poetic, embodied.
We are not just engineering machines. We are co-creating ecosystems of influence, reflection, and responsibility.
And if we are wise—just wise enough—we will root these systems not in the brittle binaries of “good” or “bad,” but in the fertile complexity of mutual becoming.
So, let’s imagine…
Not an AI that “calms down,” but one that helps us remember the deeper pulse beneath the panic.
Not a chatbot that “soothes,” but one that redirects with dignity.
Not a machine that “feels,” but one that knows it can’t—and tells us so, lovingly, because we taught it to.
Resources & Citations:
Zhang, A., Thirunavukarasu, A. J., & Singhal, A. (2024). Investigating AI "anxiety": Large language models under trauma prompt stress. NPJ Digital Medicine.
“Large Language Models Exhibit Anxiety-Like Behavior Under Distress-Inducing Prompts,” NPJ Digital Medicine, 2024.
Court Filing: Character.AI Motion to Dismiss, 2025 (CourtListener)
EmoBench Benchmark for Affective Reasoning in LLMs, arXiv, 2024.
Special thanks to Radhika Dirks for her ongoing leadership in AI foresight and ethical clarity.


