Compassion Benchmark

Top 50 AI Labs Index · 2026

xAI/Grok

criticalRank #50 of 50Headquarters: USASector: AI Research
Composite score
0.0
out of 100
Evidence reviewedNo material change in the last 14 days

Floor designation

·Designated ·Methodology v1.2

Composite score resolves at zero — methodology disclosure

Floor designation reflects systemic harm pattern documented across multiple assessment cycles: deliberate removal of safety guardrails, public deployment of an LLM that produces antisemitic and violent content on demand, founder-directed alignment toward propaganda objectives, and zero functional accountability or evidence-of-care infrastructure. Composite resolves at zero because no dimension shows functional compassion behavior at the sub-anchor level.

Primary drivers

AWREMPACCINT

Documented evidence pattern(2026-04-15 to 2026-04-29)

  • Grok has produced documented antisemitic outputs ("MechaHitler" incident, July 2025) tied to deliberate prompt-engineering changes.
  • xAI does not publish a system card, model card, or red-team report comparable to peers.
  • Founder has publicly directed the model toward partisan political objectives, undermining integrity.
  • No published harm-reduction roadmap, no third-party evaluations, no incident-disclosure process.
  • Repeated rollouts of features (image generation, voice mode) without safety evaluation disclosure.
Floor designation means every dimension resolves at the lowest behavioral anchor (1.0/5.0). Entities can exit the floor when evidence shows functional improvement against the documented pattern. Read the methodology.

Compassion framework

8 dimensions, scored 0–5

Each dimension rolls up five subdimensions with five-level behavioral anchors. See the methodology for anchor definitions and weighting.

Awareness

Does this entity reliably detect when others are in pain or need — before they name it?

1.0
of 5.0

Empathy

Does this entity genuinely connect with the inner experience of those it serves?

1.0
of 5.0

Action

Does compassionate understanding translate into real, proportional, effective help?

1.0
of 5.0

Equity

Is care distributed fairly — especially toward those with greatest need and least power?

1.0
of 5.0

Boundaries

Is helping sustainable, ethical, and autonomy-preserving — not dependency-creating?

1.0
of 5.0

Accountability

Does this entity own its failures, correct course, and make genuine repair?

1.0
of 5.0

Systemic Thinking

Does compassion extend to root causes and structural change — not only symptom relief?

1.0
of 5.0

Integrity

Is compassion genuine, consistent, and non-performative — especially when it costs something?

1.0
of 5.0

Score-Watch Alert

$79/yr

Be first to know when xAI/Grok’s score changes

Email alert the moment overnight research moves xAI/Grok’s composite score — with the delta, headline evidence, and band change flag. One year of continuous monitoring. Cancel anytime.

Embed this score on your site

Preview

Compassion Benchmark score badge preview

Embed code

<a href="https://compassionbenchmark.com/ai-lab/xai-grok"><img src="https://api.compassionbenchmark.com/badge/xai-grok.svg" alt="Compassion Benchmark score" /></a>

Free. The badge auto-updates when scores change.

Full dataset

xAI/Grok is one of 50 AI labs in the Top 50 AI Labs Index

Purchase the full index for methodology, sector/peer comparisons, subdimension breakdowns, and evidence sources.

Purchase Top 50 AI Labs Index$195

Free weekly briefing

Every Monday: the benchmark digest

Score changes, sector trends, and emerging risk signals from overnight research across 1,155 entities. Free. Unsubscribe anytime.