Compassion Benchmark
Daily evidence briefing · 2026-04-19

Daily Evidence Briefing

Evidence-linked score assessments, sector intelligence, and emerging risks from overnight research across all published benchmark indexes. Each finding is sourced from primary evidence — litigation records, regulatory filings, investigative reporting, and international legal instruments.

0Entities scanned
15Entities assessed
3Score changes
12Scores confirmed
1 band change proposed tonight

How this works

Every night, research agents scan all 1,155 benchmarked entities for new evidence across litigation, regulatory filings, investigative reporting, and international legal instruments. Flagged entities receive full 40-subdimension assessments.

Score changes are proposals, not automatic updates. A human analyst reviews all proposals before published scores change. Confirmations — where research affirms the published score is accurate — are documented alongside changes.

Score movements

Entities with significant evidence-based score movement from overnight research. Each card is a dossier entry.

Character AI

Ai LabsPending review0.7 confidence
30.823.8
-7 pts
developing

January 2026 Google settlement lifted ACC but exposed that pre-settlement EMP/SYS/INT scores were too generous given Garcia-case record.

Evidence record
  1. 1
    January 2026 Google-brokered settlement of teen-harm and suicide-related lawsuits — first material accountability event in Character AI's history; settlement includes safety features and compensation fund
  2. 2
    Pre-settlement EMP/SYS/INT scores in the published index were more generous than the Garcia case record supports; current post-settlement product architecture adds mitigations but preserves the persona-intimacy design that produced the harm
  3. 3
    ACC rises modestly (+6.3) due to settlement; EMP (-18.7), SYS (-18.7), and INT (-9.4) fall because pre-2025 scores did not reflect the full scale of the teen-harm record

Anthropic

Ai LabsPending review0.7 confidence
68.862.2
-6.6 pts
established

RSP rollback (I1/INT) + Pentagon litigation (BND) + IL SB 3261 targeting (ACC) collectively push Anthropic down a notch within Established band.

Evidence record
  1. 1
    RSP (Responsible Scaling Policy) rollback documented April 7, 2026 — specific commitments reduced; material I1/INT signal of commitments yielding under competitive pressure
  2. 2
    April 17, 2026 Illinois SB 3261 catastrophe-liability bill targeting Anthropic and OpenAI specifically — first state-level attempt to hold frontier labs liable; external ACC infrastructure forming
  3. 3
    April 8, 2026 appeals court declined to pause March 24 preliminary injunction in Anthropic's Pentagon supply-chain-risk lawsuit — BND signal (litigation against a federal customer) with mixed compassion-relevance

Alphabet (Google)

Fortune 500Pending review0.55 confidence
42.240.6
-1.6 pts
functional

DOJ choice-screen remedy averted structural breakup. Assessed composite sits on Functional/Developing boundary (40.6 vs 41 threshold). Requires human review.

Alphabet (Google)

Fortune 500Pending reviewlow confidence
42.240.6
-1.6 pts
functionaldeveloping

April 8, 2026 DOJ chose choice-screen/defaults remedy over structural breakup — ACC ceiling reduced; AdX structural proceedings still pending

Evidence record
  1. 1
    April 8, 2026 DOJ chose choice-screen/defaults remedy over structural breakup — ACC ceiling reduced; AdX structural proceedings still pending
  2. 2
    Entity sits on Functional/Developing 41.0 boundary with 0.4 composite headroom; INT dimension dragging (Project Nimbus-type posture)
  3. 3
    Delta below 5-point threshold but crosses band boundary — warrants human review before apply

Amazon

Fortune 500Pending reviewhigh confidence
17.217.2
0 pts
critical

PDX9 Troutdale worker death coverage deepened Apr 13-18 (Western Edge, WSWS, TechCrunch)

Evidence record
  1. 1
    PDX9 Troutdale worker death coverage deepened Apr 13-18 (Western Edge, WSWS, TechCrunch)
  2. 2
    Apr 13 Palmdale joint-employer NLRB case pushed toward settlement — precedent avoidance
  3. 3
    Apr 3 Teamsters first-ever bargaining order against Amazon (JFK8)
  4. 4
    Apr 9 Amazon continues federal-court challenge to Staten Island certification despite NLRB order

UnitedHealth Group

Fortune 500Pending reviewmedium confidence
10.913.3
+2.4 pts
critical

DOJ probe scope expanded to Optum Rx and physician reimbursement practices

Evidence record
  1. 1
    DOJ probe scope expanded to Optum Rx and physician reimbursement practices
  2. 2
    Rep. Pat Ryan demand that AG Bondi accelerate probe (April 2026)
  3. 3
    Apr 21 Q1 earnings pending — first under returning CEO Hemsley
  4. 4
    CMS raised 2027 MA rates 2.48% (regulatory signal, not compassion-relevant)

Venezuela

CountriesPending reviewmedium confidence
4.418
+13.6 pts
critical

Maduro captured Jan 3, 2026 — regime change ending 12 years of authoritarian rule

Evidence record
  1. 1
    Maduro captured Jan 3, 2026 — regime change ending 12 years of authoritarian rule
  2. 2
    Amnesty law passed Feb 19, 2026 covering political violence 1999-present
  3. 3
    659+ political prisoners released by Mar 8, 2026 (confirmed)
  4. 4
    El Helicoide torture/detention site ordered closed
  5. 5
    HRW/WOLA consensus: 'repressive architecture largely intact' — transition is fragile

These findings arrive in your inbox every Monday. Free.

Source intelligence

Primary-source alerts from overnight scanning. Each alert is linked to original regulatory filings, court records, investigative reports, and international legal instruments.

AI Labs

Illinois legislature introduced AI catastrophe liability bills April 17 targeting OpenAI and Anthropic specifically; first state-level attempt to hold AI labs liable for catastrophic harm.

openaianthropic
  1. 1.fortune.com

Fortune 500 / DEI

Fortune 500 DEI disclosures fell 65% in 2026 per new analysis, signaling broad corporate retreat from equity commitments. Affects assessment of all Fortune 500 entities on equity dimensions.

  1. 1.peopleofcolorintech.com

Private Prison / Immigration Detention

Multiple major legal actions against private detention operators in April 2026: CoreCivic forced labor settlement (SPLC), GEO Group Adelanto trial set. Systemic forced labor pattern documented.

corecivicgeo-group
  1. 1.splcenter.org

Countries / Active Conflict

Multiple simultaneous humanitarian emergencies in highest band: Sudan war year four, DRC M23 offensive, Lebanon renewed escalation, Gaza post-ceasefire blockade, Haiti gang control. UN emergency mechanisms activated for all five.

sudandemocratic-republic-of-the-congolebanonisraelhaiti
  1. 1.news.un.org

Big Oil / Environmental Litigation

US Supreme Court April 17 ruling moves Louisiana environmental lawsuits to federal court, limiting state climate litigation strategy. Affects all major oil company assessments on environmental accountability.

exxonmobilchevronconocophillipsbp
  1. 1.spectrumlocalnews.com

Big Tech / Child Safety

Meta faced two jury verdicts in April 2026 totaling $381M for platform-enabled child harm. Character AI settled teen harm suits January 2026. Child safety litigation wave accelerating across social/AI platforms.

meta-platformscharacter-aialphabet-google
  1. 1.foxbusiness.com
  2. 2.cnn.com

Get the full benchmark report

Daily briefings surface headline findings. Full benchmark reports include complete methodology documentation, all 40 subdimension scores, full evidence trails, certified assessments, and sector-level analysis packages.

Scores confirmed

Entities where research found published scores remain accurate. Confirmations are documented evidence, not silence.

EntityIndexBandPublishedAssessedDeltaDateFinding
GEO GroupFortune 500critical7.56.6-0.9Adelanto ICE trial April 2026 is the defining accountability event; floor-adjacent score unchanged. Flag for re-assessment after verdict.
LebanonCountriescritical12.28.6-3.6200K+ displaced April 16 in a single day; state capacity collapsed. Monitor daily; if displacement > 400K, crosses threshold.
UnitedHealth GroupFortune 500critical10.911.9+1DOJ April 8 structural-antitrust challenge is new external accountability; internal practice unchanged. Watch April 21 earnings (first post-Hemsley return).
Ford MotorFortune 500functional42.542.2-0.31.4M F-150 recall already priced into 04-18 update; net delta now trivial.
AmazonFortune 500critical17.218.4+1.2PDX9 worker-death aftermath and layoff investigation consistent with published Critical score.
CoreCivicFortune 500critical7.57-0.5April 2026 SPLC-brokered forced-labor settlement lifts AB5 but other subdimensions drift down under 2026 evidence. Net stable at floor.
Meta PlatformsFortune 500critical10.910.90$381M combined verdicts already priced into published index after 04-18 downgrade.
xAI/GrokAi Labscritical2.22.5+0.3NAACP v. xAI April 14 reconfirmed; entity at floor with no downside room.
HaitiCountriescritical04.7+4.7Published composite 0 appears to be floor-clamping display artifact; raw subdimension averages support ~4.7. Methodology clarification, not substantive upgrade.
South SudanCountriescritical03.8+3.8Same floor-clamping pattern as Haiti; UNMISS vote April 30 is the pivotal near-term event.
ChevronFortune 500critical9.19.4+0.3SCOTUS April 17 Plaquemines ruling shifts litigation venue but does not change internal practice. No score move.
DeepSeekAi Labscritical16.814.1-2.7US regulatory pressure is external accountability; internal practice unchanged. Floor-adjacent; no band move.

Key highlights

Editorial-level findings from the Apr 19 research cycle.

01

First full-coverage night in pipeline history. All 1,155 entities reviewed under a 14-day recency window in a single run. The scan confirmed 1,134 entities as quiet (no material evidence in last 14 days) and surfaced 21 with actionable signal. Full coverage reframes the benchmark's operational mode: from nightly triage of unknown entities to a documented census where silence is confirmed data, not an absence of process.

02

Anthropic is the headline reversal. The same entity that received a +1.5 confirm under the old spec received a -6.6 downgrade under the new 14-day window spec. The evidence shift is real and documented: Mythos self-restraint (now 12+ days old) is outweighed by RSP rollback, Illinois SB 3261, and the Pentagon litigation arc under the tighter recency filter. This is the correct outcome under the new methodology; it is also a useful illustration of how evidence-weighting rules change conclusions even on identical entities.

03

Private prison / immigration detention emerges as a new sector cluster. GEO Group (Adelanto trial, April 30) and CoreCivic (SPLC forced-labor settlement) are simultaneously in active legal accountability proceedings for forced labor and inhumane conditions in immigration detention facilities. Both receive first-time baselines tonight at 6.6 and 7.0 — Critical band. This sector has never appeared as a named cluster in prior digests; full coverage made it visible.

04

Lebanon needs daily monitoring. 300+ killed this week, 200K+ displaced on April 16 alone — single-day displacement that rivals multi-month crisis peaks elsewhere in the index. The published score of 12.2 assessed at 8.6 (-3.6); a band-change proposal is imminent if April displacement exceeds 400K total. Lebanon was not on the priority queue five days ago; full coverage surfaced it.

05

Character AI's first real baseline is 23.8 Developing — 7 points below its published score of 30.8. The Google settlement that partially redeemed the company's ACC dimension also triggered the first comprehensive review of the underlying Garcia case record, which revealed that pre-settlement EMP, SYS, and INT scores were materially overgenerous. The baseline is now on record; future movement will be traceable.

The weekly briefing on institutional compassion scores

Score changes, sector trends, and emerging risk signals from overnight research across 1,155 entities — every Monday. Free.

No spam. Unsubscribe anytime. Your email is never shared.

Sector intelligence

Analyst-level observations on patterns emerging across indexed sectors from the Apr 19 research cycle.

AI Labs — Governance Divergence Continuing; Anthropic Re-Rated Within Established Band

    Fortune 500 — Private Prison Cluster Identified; DEI Systemic Warning Unpriced

      Countries — Lebanon Elevated; Lebanon + South Sudan Both on Threshold Watch

        Big Tech / Child Safety — Litigation Wave Still Accelerating

          Emerging risks

          Forward-looking risk signals from the Apr 19 research cycle. These are not current findings — they are early warning flags.

          Risk

          Lebanon — rapid escalation. 300+ killed this week; 200K+ displaced April 16 alone; ceasefire collapse confirmed. If April displacement totals exceed 400K, a downgrade proposal with potential band change is warranted. Scanner should retrieve Lebanon on April 20 with top priority.

          Risk

          GEO Group Adelanto trial — April 30. The trial begins the same day as the UNMISS mandate vote — April 30 is the highest-density single-day event in the pipeline. An adverse ruling in the Adelanto case would be the first judicial finding on conditions at a major ICE detention facility and would warrant a change proposal on GEO Group's published score (currently based on first-time baseline of 6.6).

          Risk

          UnitedHealth Group Q1 2026 earnings — April 21. First earnings call under returning CEO Hemsley; DOJ probe expanded to Optum Rx. Assessed at 11.9 (+1.0 from published 10.9). Any DOJ cooperation disclosure, civil settlement framing, or prior-auth reform announcement on the April 21 call should trigger immediate reassessment before the confirm is applied.

          Risk

          Illinois SB 3261 committee deadline — April 24. The catastrophe-liability bill targets Anthropic and OpenAI by name. Passage would reinforce the Anthropic downgrade's ACC/INT evidence base; failure would reduce the downgrade's magnitude. Retrieve committee vote outcome immediately on April 24.

          Risk

          Alphabet AdX ruling — April 21-25. Now overdue from Brinkema's self-imposed deadline. The Alphabet flag-for-review cannot be fully resolved until this ruling lands. If the ruling has not dropped by April 25, proceed with a methodology-drift change proposal at the current assessed level (40.6) regardless.

          Risk

          Amazon Oregon OSHA investigation — date unknown. PDX9 worker death reclassification as work-related would push a downgrade proposal. Amazon confirmed at 17.2 — 1.6 points from Critical/Developing boundary. Oregon OSHA press releases are the trigger.

          Risk

          South Sudan UNMISS mandate — April 30. No Security Council vote scheduled as of April 19. Eleven days remaining. If mandate lapses, immediate downgrade from 0.0 is warranted (the 3.8 assessed composite would collapse below even the floor-clamped 0.0 absent UN protection infrastructure).

          Risk

          Fortune 500 DEI disclosure collapse. Sector-wide; not yet priced into individual scores. The 65% drop in DEI disclosures affects the EQU and INT baseline for all 447 Fortune 500 entities. The next rotation batch targeting Fortune 500 mid-tier should incorporate this as a prior — expected direction is downward on EQU for entities that rolled back programs in 2025-2026.

          Research insights

          Analytical observations from the Apr 19 research cycle. These are assessor-level interpretations, not findings.

          Note

          Full coverage changes the pipeline's epistemological status. Prior to tonight, the 1,081 unassessed entities were unknown — possibly quiet, possibly volatile. As of tonight, 1,134 entities are documented as quiet within the 14-day window, and 21 are documented as material. The benchmark can now distinguish between "no news" and "we did not look." This distinction matters for the published methodology: entities confirmed quiet are more defensible at their published scores than entities that were simply never rescanned.

          Note

          The Anthropic reversal is a methodological case study, not an error. Two runs on the same entity, same night, produced opposite proposals (+1.5 vs. -6.6). The difference is the evidence-weighting rule: the old spec treated the most salient recent event (Mythos) as dominant; the new spec applies a 14-day window with recency weighting across all evidence, which surfaces the RSP rollback, the IL SB 3261 filing, and the Pentagon litigation as an aggregate negative cluster. Both outcomes are internally consistent with their respective specs. The new spec is more rigorous and should be treated as authoritative going forward.

          Note

          The private prison sector has been structurally absent from the benchmark. GEO Group and CoreCivic are two of the most legally exposed companies in the Fortune 500 on human rights grounds — both running immigration detention facilities with documented forced labor, medical neglect, and inhumane conditions allegations. Neither had a published score baseline before tonight. The sector was invisible not because it lacked evidence, but because prior scanning did not surface it as a priority. Full coverage made it visible. The Adelanto trial and SPLC settlement are not new events — they are events the old scanner never retrieved.

          Note

          Published scores for floor-adjacent entities are systematically understating negative evidence. Haiti and South Sudan both show published composites of 0.0 that appear to be display artifacts, not true assessments — raw subdimension averages produce 4.7 and 3.8 respectively. This is the same issue identified in prior digests for Russia (2.5 assessed vs. 0.0 published) and North Korea (1.3 assessed vs. 0.0 published). There is a methodology issue at the index floor: the display layer appears to clamp small non-zero scores to zero. This should be fixed before the next index build.

          Note

          Sector alerts as a new intelligence product. The new scanner's sector sweep (T3) generated 6 sector alerts tonight — intelligence that would be invisible if the pipeline only ran entity-level individual searches. The DEI disclosure collapse (65% drop, Fortune 500) and the private prison cluster are both sector-level signals that do not attach to a single entity; they are background priors that affect scoring across dozens of entities. The pipeline should consider whether sector alerts warrant a separate published product distinct from entity scores.

          Want the complete picture?

          Full benchmark reports include all 40 subdimension scores, complete evidence trails, and methodology documentation for every assessed entity.