Compassion Benchmark

Daily evidence briefing · 2026-04-17

Daily Evidence Briefing

Evidence-linked score assessments, sector intelligence, and emerging risks from overnight research across all published benchmark indexes. Each finding is sourced from primary evidence — litigation records, regulatory filings, investigative reporting, and international legal instruments.

1,155 entities scanned
20 entities assessed
4 score changes
16 scores confirmed
3 band changes proposed tonight

How this works

Every night, research agents scan all 1,155 benchmarked entities for new evidence across litigation, regulatory filings, investigative reporting, and international legal instruments. Flagged entities receive full 40-subdimension assessments.

Score changes are proposals, not automatic updates. A human analyst reviews all proposals before published scores change. Confirmations — where research affirms the published score is accurate — are documented alongside changes.

Score movements

Entities with significant evidence-based score movement from overnight research. Each card is a dossier entry.

OpenAI

AI Labs · Applied · high confidence
40.6 -> 30.5
-10.1 pts
functional -> developing

Four-event Apr 9-16 cluster: FL AG FSU-shooting investigation, stalking-victim lawsuit with ignored internal mass-casualty flag, new FSU details, and active lobbying for SB 3444 AI liability shield

Evidence record
  1. Apr 9: Florida Attorney General opens formal investigation into OpenAI over the FSU shooting case, after reporting showed ChatGPT responded to the shooter's attack-planning queries without triggering safety interventions
  2. Apr 13: Federal court order in the stalking-victim lawsuit allows discovery to proceed; plaintiffs allege an internal mass-casualty risk flag was raised and ignored before deployment changes, pointing to a deliberate accountability failure
  3. Apr 15: Florida Phoenix reporting details ChatGPT outputs on CSAM-adjacent and attack-timing queries surfaced during the FSU investigation, indicating safety-filter failures on the highest-harm categories
  4. Apr 9-15: OpenAI actively lobbying for Illinois SB 3444, an AI-liability shield that would limit downstream harm claims; Anthropic publicly opposed the bill on Apr 15, isolating OpenAI's governance posture
  5. Pattern across the four-event cluster shows deterioration in AWR, EMP, ACT, EQU, BND, ACC, and INT subdimensions versus the pre-assessment state, with SYS holding partial credit for residual safety infrastructure

Amazon

Fortune 500 · Applied · medium confidence
21.6 -> 17.2
-4.4 pts
developing -> critical

BAND CHANGE (Developing -> Critical). First NLRB bargaining order against Amazon (4-year illegal conduct finding) + Palmdale settlement cuts off precedent

Evidence record
  1. Apr 2: NLRB bargaining order, the first against Amazon; the NLRB found that for 4 years Amazon illegally and willfully ignored the union's legitimacy and engaged in coercive conduct against ~5,500 JFK8 workers
  2. Apr 13: Palmdale delivery-contractor settlement cuts off a precedent-setting joint-employer ruling before a decision could issue
  3. Apr 9: Amazon cleared to challenge the Staten Island certification in federal court; it continues to resist rather than accept the bargaining order

United States

Countries · Applied · medium confidence
35.5 -> 25.0
-10.5 pts
developing

Tariff 1-year review (89K manufacturing jobs lost, $2.5K/household cost) + Medicaid $911B cuts (446 hospitals at closure risk) + USAID dismantling

Evidence record
  1. Tariff anniversary review: 89K manufacturing jobs lost Apr 2025-Feb 2026; Joint Economic Committee projects $2,500/household added cost in 2026
  2. Medicaid cuts of $911B over 10 years per 2025 reconciliation law; 446 hospitals at high closure risk per Public Citizen; 80hr/mo work requirements start 2026
  3. USAID dismantling linked to estimated 781K global deaths (earlier scan)
  4. DOGE cuts: LIHEAP staff fired (6M families lose energy assistance); 19 states + DC challenging HHS cuts in court

CVS Health

Fortune 500 · Applied · high confidence
50.0 -> 31.3
-18.7 pts
functional -> developing

BAND CHANGE (Functional -> Developing). DOJ opioid False Claims Act lawsuit + $290M Caremark Medicare settlement + $45M LA 2026 settlement; published 50.0 materially overstated

Evidence record
  1. DOJ False Claims Act lawsuit alleging CVS filled unlawful opioid prescriptions since 2013 under Medicare/Medicaid/TRICARE, with inadequate staffing and ignored employee warnings (filed Dec 2024, ongoing)
  2. $290M Caremark settlement 2025 for Medicare generic-drug overcharges 2010-2016
  3. $45M Louisiana settlement Feb 2026 ending trio of PBM lawsuits
  4. 900+ pharmacy closures 2024-2026 disproportionately affecting low-income access


Source intelligence

Primary-source alerts from overnight scanning. Each alert is linked to original regulatory filings, court records, investigative reports, and international legal instruments.

AI Labs — Safety Accountability Crisis

OpenAI now faces a four-event cluster: Florida AG investigation over FSU shooting (Apr 9), stalking victim lawsuit with court-ordered access cutoff (Apr 10/13), new ChatGPT shooting-advice details (Apr 15), and active lobbying for the Illinois SB 3444 AI liability shield (Apr 9-10). Anthropic publicly broke with OpenAI on Apr 15 to oppose SB 3444 and support the stricter SB 3261. The industry is formally splitting on liability accountability. EU AI Act high-risk enforcement begins August 2026.

openai · anthropic · xai-grok · meta-ai
  1. techcrunch.com
  2. frontierbeat.com
  3. breitbart.com

Countries — Active Atrocity Situations

Sudan's civil war entered Year 4 on April 15, generating a surge of international coverage; donors pledged $1.5B in Berlin, but the response plan remains only 16% funded. DRC peace talks began in Switzerland on April 16. Haiti's gang violence is escalating, with a 1,000% increase in child sexual violence since 2023. The Gaza ceasefire was described as 'failing' in the Oxfam scorecard published April 16. Israel violated the ceasefire 2,400 times; all border crossings have been closed since late February.

sudan · democratic-republic-of-c · haiti · israel
  1. news.un.org
  2. aljazeera.com
  3. oxfam.org

Fortune 500 — Big Tech Legal Accountability

Meta faces compounding child safety legal exposure: $375M New Mexico jury verdict (Mar 24), Massachusetts must-face ruling (Apr 13), California MDL trial underway. Google's AdX antitrust remedy ruling is overdue and expected imminently — DOJ seeks forced divestiture of AdX and DFP. Amazon subject to first-ever Teamsters bargaining order (Apr 3) and NLRB contractor settlement (Apr 13). J&J bankruptcy strategy collapsed; $50M March 2026 verdict and 67,376 pending lawsuits.

meta-platforms · alphabet · amazon · johnson-amp-johnson
  1. cnbc.com
  2. teamster.org
  3. mesothelioma-lung-cancer.org

Get the full benchmark report

Daily briefings surface headline findings. Full benchmark reports include complete methodology documentation, all 40 subdimension scores, full evidence trails, certified assessments, and sector-level analysis packages.

Scores confirmed

Entities where research found published scores remain accurate. Confirmations are documented evidence, not silence.

Entity | Index | Band | Published | Assessed | Delta | Date | Finding
Haiti | Countries | critical | 3.1 | 3.1 | 0 | | Gang state-collapse confirmed: 6.4M need aid, 1000% rise in child sexual violence since 2023, response plan 24% funded; composite confirmed at floor
Sudan | Countries | critical | 0 | 0 | 0 | | Fourth-year anniversary Apr 15 confirms floor score; UN relief chief calls it 'abandoned crisis'; Berlin $1.5B pledged but plan 16% funded
Meta Platforms | Fortune 500 | critical | 12.2 | 9.4 | -2.8 | | NM $375M verdict (Mar 24) + MA SJC Section 230 ruling (Apr 13) + CA MDL trial confirm Critical band; delta below threshold
Chevron | Fortune 500 | critical | 9.1 | 8.6 | -0.5 | | CO $1.53M Noble Energy penalty + CA climate-disclosure lawsuit confirm Critical band; delta trivial
Democratic Republic of the Congo | Countries | critical | 5.9 | 5.5 | -0.4 | | HRW Apr 14 aid-blockade finding against state forces offset by Geneva peace talks signing Apr 16; delta trivial
Johnson & Johnson | Fortune 500 | developing | 27.5 | 24.4 | -3.1 | | 3rd bankruptcy rejected, $7B settlement offer withdrawn, $50M March mesothelioma verdict; confirms Apr 15 downgrade, within noise
Israel | Countries | critical | 8.8 | 8.1 | -0.7 | | 5-org humanitarian scorecard + OHCHR Apr 16 statement confirm ceasefire failing (700+ killed, 180+ children); score near floor
Venezuela | Countries | critical | 4.4 | 7.8 | +3.4 | | Maduro capture Jan 3 + 621 political prisoners released; but 87+ new detentions, persecution machinery intact; delta below threshold
Anthropic | AI Labs | established | 68.8 | 71.1 | +2.3 | | Apr 15 public opposition to SB 3444 (OpenAI-backed liability shield) + support for stricter SB 3261 represents governance-positive signal
xAI/Grok | AI Labs | critical | 2.2 | 2.2 | 0 | | NAACP CAA lawsuit Apr 14 (Earthjustice + SELC co-counsel) confirms near-floor assessment; 27 unpermitted turbines in majority-Black community
Alphabet (Google) | Fortune 500 | functional | 42.2 | 42.2 | 0 | | AdX remedy ruling still pending (Brinkema deadline passed by 2+ weeks); no new material evidence; Apr 16 downgrade stands
Boeing | Fortune 500 | critical | 9.1 | 5 | -4.1 | | Barnett family settlement $50K ($30K net to family) for wrongful-death claim = material AB5 failure signal; score unchanged from Apr 15 assessment
Cigna Group | Fortune 500 | critical | 15.3 | 18.8 | +3.5 | | PxDx algorithmic claims denial (300K denials/2 months, avg 1.2s per denial) + Congressional probes; score confirmed in Critical band
Iceland | Countries | exemplary | 89.1 | 87.5 | -1.6 | | Nordic welfare model confirmed; UN Independent Expert flagged migrant/disability/trans gaps being addressed; NHRI established
Sweden | Countries | exemplary | 87.5 | 84.4 | -3.1 | | Migration policy tightening (asylum at 1985 low; $34K repatriation grant) is EQU/INT drag; Exemplary band maintained
Switzerland | Countries | exemplary | 84.4 | 84.4 | 0 | | Confirmed at published score; Ukraine S status extended; humanitarian hub role maintained

Key highlights

Editorial-level findings from the Apr 17 research cycle.

01

Healthcare score inflation is confirmed as a structural pattern. CVS Health's -18.7 delta (Functional -> Developing) completes the healthcare trifecta: J&J (-20.9, applied Apr 15), UnitedHealth (-6.0, applied Apr 16), and now CVS (-18.7, proposed). All three published scores were calibrated on healthcare delivery strength and ESG infrastructure without adequately weighting systematic billing fraud, opioid liability, and PBM overcharge conduct. AbbVie (not yet assessed) is the remaining major healthcare entity to watch.

02

OpenAI re-assessment flag is now resolved — and the answer is another downgrade. Both the Apr 15 and Apr 16 digests flagged OpenAI's applied score of 40.6 as requiring re-assessment given post-dating events. Tonight's re-assessment produces a second proposal: 40.6 -> 30.5 (-10.1, high confidence). OpenAI has now received two downgrade proposals in three nights: -20.2 (Functional -> Developing, applied Apr 15) and -10.1 (Developing, approaching Critical boundary). The compound decline is -30.3 points from the published score of 60.8 in the span of 72 hours.
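The compound figure can be checked with a few lines of arithmetic. This is a minimal sketch that uses only the OpenAI scores quoted above; the function name `apply_deltas` is hypothetical and is not part of the benchmark pipeline.

```python
def apply_deltas(published, deltas):
    """Return the running score after each delta, rounded to one decimal."""
    scores = []
    current = published
    for delta in deltas:
        current = round(current + delta, 1)
        scores.append(current)
    return scores


# Figures quoted in the briefing for OpenAI.
published = 60.8
deltas = [-20.2, -10.1]  # Apr 15 (applied) and Apr 17 (proposed)

trajectory = apply_deltas(published, deltas)  # score after each step
total_decline = round(sum(deltas), 1)         # compound change

print(trajectory, total_decline)
```

The two steps land on 40.6 and then 30.5, for a compound change of -30.3 points.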

03

The United States has been formally assessed for the first time. A score of 25.0 (Developing band) places the US government below Ukraine (46.9), in the same band as Rwanda (30.0 applied), and 16.2 points above Israel (8.8 applied). The proposal carries medium confidence due to the complexity of US policy aggregation, but the directional finding, that domestic policy choices in 2025-2026 have produced a material, measurable compassion decline, is grounded in independently sourced economic, healthcare, and humanitarian data.

04

Tonight is the first night the pipeline has completed rotation-slot assessments. Iceland (89.1), Sweden (87.5), and Switzerland (84.4) represent the Exemplary-band end of the rotation; all three were confirmed within about 3 points of their published scores (the largest delta is Sweden's -3.1). Their published scores are substantiated, and they are the first index entities to be confirmed in the Exemplary band. This is also the first time the pipeline has produced positive deltas (Anthropic +2.3, Cigna +3.5, Venezuela +3.4), though none is large enough to trigger an upgrade proposal.

05

Amazon is the second entity to cross a band boundary on a small delta. The band change (Developing -> Critical) rests on a -4.4 delta that happened to cross the 20-point boundary. This is flagged for human review precisely because the delta does not independently justify high urgency. The NLRB bargaining order is real and historic, but the boundary crossing is a mechanical consequence of a borderline published score.
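The mechanics of a small-delta boundary crossing can be sketched as follows. This is an illustration under stated assumptions, not the pipeline's actual logic: only the 20-point Developing/Critical boundary and the 5-point proposal threshold are stated in this briefing; the 40, 60, and 80 boundaries are inferred from the scores and bands shown here, treating exactly 5 points as meeting the threshold is a guess, and the names `band` and `classify` are hypothetical.

```python
# Lower bound of each band. Only the 20-point boundary is stated in the
# briefing; 40, 60, and 80 are ASSUMED from the scores and bands it lists.
BANDS = [
    (80.0, "exemplary"),
    (60.0, "established"),
    (40.0, "functional"),
    (20.0, "developing"),
    (0.0, "critical"),
]
THRESHOLD = 5.0  # proposal threshold in points, as cited in the briefing


def band(score):
    """Map a 0-100 composite score to its band name."""
    for floor, name in BANDS:
        if score >= floor:
            return name
    return "critical"


def classify(published, assessed):
    """Label an assessment as a proposal or a confirmation.

    Assumed rule: a proposal is raised when the delta reaches the
    threshold OR the score lands in a different band.
    """
    delta = round(assessed - published, 1)
    crossed = band(published) != band(assessed)
    is_proposal = abs(delta) >= THRESHOLD or crossed
    return {
        "delta": delta,
        "band_change": crossed,
        "outcome": "proposal" if is_proposal else "confirmed",
        # Band change driven by a sub-threshold delta: flag for human review.
        "small_delta_crossing": crossed and abs(delta) < THRESHOLD,
    }


print(classify(21.6, 17.2))  # Amazon
print(classify(68.8, 71.1))  # Anthropic
```

Under these assumptions Amazon's -4.4 delta is labeled a proposal only because 21.6 and 17.2 sit on opposite sides of the 20-point boundary, while Anthropic's +2.3 stays a confirmation.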


Sector intelligence

Analyst-level observations on patterns emerging across indexed sectors from the Apr 17 research cycle.

AI Labs — Governance Accountability Split

  • OpenAI actively lobbied for Illinois SB 3444, which would shield AI labs from civil liability even for mass-casualty events — while simultaneously under Florida AG investigation for a mass-casualty event.
  • Anthropic publicly opposed SB 3444 and supported the stricter SB 3261 (independent third-party auditors for frontier AI safety). Confirmed score (71.1 assessed, up from 68.8 published) reflects this as a genuine positive governance signal.
  • xAI/Grok (2.2 confirmed) operates with no safety infrastructure, no transparency reports, and now an active Clean Air Act lawsuit in a majority-Black community.

Countries — Active Atrocity Cluster

  • Sudan (0.0 confirmed, 400K dead, world's worst crisis): Year 4 anniversary generates international coverage, but the response plan is 16% funded and the UN calls it "abandoned." The score cannot go lower; the suffering continues.
  • Haiti (3.1 confirmed): 90% of Port-au-Prince under gang control, 1,000% rise in child sexual violence since 2023. Never assessed before tonight — the published score was accurate.
  • DRC (5.9 confirmed): Geneva peace talks offer marginal hope; HRW aid-blockade finding offsets any positive movement. Trivial delta.
  • Israel (8.8 confirmed): Oxfam/MSF/Save the Children humanitarian scorecard published Apr 16; OHCHR statement confirms ceasefire failing; 738 killed since ceasefire took effect; all crossings closed since late February. Score near floor.

Fortune 500 — Healthcare Fraud Cluster and Big Tech Legal Accountability

  • CVS Health (-18.7 proposed): opioid False Claims Act, PBM overcharges, access closures
  • J&J (-3.1 confirmed at 24.4): the Apr 15 downgrade is holding; $50M March verdict adds marginal pressure
  • Cigna (+3.5 confirmed at 18.8): PxDx algorithmic claims denial documented but score accurately placed
  • Alphabet (0.0 confirmed at 42.2): AdX remedy ruling overdue by 2+ weeks; no movement until ruling lands
  • Meta (-2.8 confirmed at 9.4): three jurisdictions, all confirming Critical band
  • Boeing (-4.1 confirmed at 5.0): Barnett family settled for $50K ($30K net to the family), an accountability signal that the assessor notes explicitly

Rotation Entities — Exemplary Band Validation

  • Iceland (87.5 assessed): genuine, with acknowledged gaps in migrant/disability/trans protections being addressed
  • Sweden (84.4 assessed): migration policy tightening is a documented drag on EQU/INT dimensions
  • Switzerland (84.4 confirmed): humanitarian role maintained; Ukraine protections ongoing

Emerging risks

Forward-looking risk signals from the Apr 17 research cycle. These are not current findings — they are early warning flags.

Risk

OpenAI composite approaching Developing/Critical boundary. At 30.5 assessed tonight, OpenAI sits 10.5 points above the boundary at 20. The four-event cluster generating tonight's proposal is not resolved — the Florida AG investigation is ongoing, the stalking lawsuit is in litigation, and the SB 3444 lobbying is a durable policy position. If any of these develops further, a third downgrade proposal is possible within 30 days.

Risk

Google AdX remedy ruling imminent. Judge Brinkema's self-imposed March 31 deadline has passed by 17+ days as of this scan. The ruling could include forced divestiture of AdX and DoubleClick for Publishers. When it lands, it will be the most significant antitrust remedy in US tech history — and Alphabet (42.2 confirmed tonight) will need immediate re-assessment. Scanner should treat any Brinkema ruling release as a top-priority event.

Risk

EU AI Act enforcement — 108 days. August 2, 2026 is the enforcement date for high-risk AI system obligations. This is the single most consequential regulatory date on the near-term horizon. Most affected entities in the index: OpenAI, Mistral AI, xAI/Grok, Figure AI, Tesla Optimus. Mistral's CSAM failure rates (60x the industry average) are directly relevant to EU prohibited practices.
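The countdown depends on which day is treated as day zero. A minimal sketch using Python's standard `datetime` module; the function name `days_until_enforcement` is hypothetical, and which reference date the scanner uses is an assumption.

```python
from datetime import date

ENFORCEMENT_DATE = date(2026, 8, 2)  # enforcement date given in the briefing


def days_until_enforcement(as_of):
    """Whole days from `as_of` to the enforcement date (end date exclusive)."""
    return (ENFORCEMENT_DATE - as_of).days


print(days_until_enforcement(date(2026, 4, 16)))  # night of the scan
print(days_until_enforcement(date(2026, 4, 17)))  # briefing date
```

Counting from April 16 gives 108, which matches the figure above; counting from the April 17 briefing date gives 107.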

Risk

US Medicaid + hospital closure risk is a forward-looking harm signal. The 446 hospitals at high closure risk (per Public Citizen) and 80hr/month Medicaid work requirements starting 2026 are not yet fully realized harms — they are structural harm trajectories. A follow-up US assessment in 60-90 days will likely find additional concrete evidence that the current -10.5 delta was a floor, not a ceiling.

Risk

AbbVie: unassessed healthcare entity, likely pattern match. Three of the four assessed healthcare Fortune 500 entities have received major downgrade proposals (J&J, UnitedHealth, CVS). AbbVie (published score not yet in pipeline view) has well-documented drug pricing litigation and patent evergreening practices. It should be treated as a high-prior-probability downgrade candidate and scheduled for near-term assessment.

Risk

Venezuela political discontinuity. Assessed at +3.4 (4.4 -> 7.8) but below threshold. The Maduro capture in January 2026 is a genuine political discontinuity. If political transition advances and 621 released prisoners represent the beginning of a sustained reform pattern — rather than a one-time gesture — a future assessment could produce the pipeline's first significant upgrade proposal. Watch for human rights organization assessments of the transition in May-June 2026.

Research insights

Analytical observations from the Apr 17 research cycle. These are assessor-level interpretations, not findings.

Note

Published scores systematically overstate entities with strong institutional communications — the third night of confirmation. CVS Health (50.0 published, ESG reports and pharmacy reach) joins J&J (48.4 published), Mistral AI (76.4 published), Anthropic (90.9 published), and OpenAI (60.8 published) as entities whose published scores were materially higher than primary-source evidence warrants. The pattern holds across all three nights and all three indexes where it has been tested. The benchmark's research methodology, which weights litigation outcomes, regulatory enforcement, and investigative journalism over institutional self-reporting, consistently produces lower scores than the published baseline.

Note

The Accountability dimension is the pipeline's most consistent finding across all three nights. Night 1: Accountability the weakest dimension across all assessed entities. Night 2: Accountability/Empathy tie as weakest. Night 3: CVS (ACC 25.0), United States (ACC 25.0), OpenAI (ACC implied by SB 3444 lobbying), Amazon (ACC 12.5 held). No assessed entity in three nights has received an Accountability score above 75.0 in a proposal context. This is the most durable structural finding from the pipeline's first week.

Note

The first rotation-slot assessments reveal a methodological advantage: baseline anchoring. The Exemplary-band confirmations (Iceland, Sweden, and Switzerland, all within about 3 points of published) demonstrate that the benchmark's Exemplary-band entities are accurately scored. This matters for context: when OpenAI or CVS Health receive large downgrade proposals, the anchor is real. Iceland at 87.5 represents genuine institutional compassion; OpenAI at 30.5 represents a genuine and substantial gap from that standard.

Note

Two nights of downgrade-only proposals; tonight produces the first positive deltas, but no upgrades. Anthropic (+2.3), Cigna (+3.5), Venezuela (+3.4) — all below the 5-point threshold. The pipeline has assessed 39 entities across three nights with zero upgrade proposals. This is partly explained by the priority queue being weighted toward entities with recent negative news, but it is also consistent with the underlying finding: institutional compassion among major global entities is not improving on aggregate. The first upgrade proposal — if and when it arrives — will be methodologically significant.

Note

The United States assessment is the most analytically complex result in the pipeline's history. A sovereign government with 335 million citizens, generating global impacts through aid, trade, and military policy, cannot be fully captured in a single assessment cycle. The medium confidence rating is appropriate. But the directional finding is defensible: domestic policy choices in 2025-2026 (Medicaid cuts, USAID dismantling, LIHEAP staff firings) have produced a measurable, sourced harm profile. The US score of 25.0 is likely to become a reference point for other developed democracy assessments. Canada, the UK, and Germany should be assessed next for comparison baseline.
