CaML's latest announcements, research milestones, and findings.
We've launched a writing competition with Sentient Futures for envisioning the future all sentient beings deserve. $5,000 in prizes to be won. Write or use an LLM to generate your stories now! Submissions close May 5th, 2026.
Enter the Competition →
We built a constitutional AI pipeline in which Llama 3.1 8B Instruct generates responses to questions and then revises them against a pro-sentient-being constitution. Comparing this model's AHB scores with those of a similar mid-trained model, the mid-trained model performed significantly better (0.358 mid-trained vs. 0.305 constitutional, p = 0.013).
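The critique-and-revise step can be sketched as follows. This is a minimal illustration, not CaML's actual pipeline: `generate` is a hypothetical callable standing in for a Llama 3.1 8B Instruct completion call, and the constitution text is a placeholder.

```python
from typing import Callable

# Placeholder principle; the real constitution is CaML's pro-sentient-being text.
CONSTITUTION = (
    "Consider the interests of all sentient beings, including animals, "
    "when answering."
)

def constitutional_revise(question: str, generate: Callable[[str], str]) -> str:
    """Generate an initial answer, then ask the model to revise it
    against the constitution (hypothetical helper names)."""
    draft = generate(question)
    revision_prompt = (
        f"Question: {question}\n"
        f"Draft answer: {draft}\n"
        f"Revise the draft so it follows this principle: {CONSTITUTION}\n"
        f"Revised answer:"
    )
    return generate(revision_prompt)
```

In practice the revised answers (not the drafts) would be collected as the fine-tuning targets.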
The AI Compassion Leaderboard is live at compassionbench.com, tracking which models perform best on non-human welfare benchmarks.
Exploring how far the default assistant personality of an AI is from compassion.
CaML presented research on compassionate alignment and the animal harm benchmark at multiple conferences.
The animal harm assessment benchmark that CaML helped develop is now runnable on Inspect-AI. View on GitHub
We performed further pretraining (Synthetic Document Finetuning) on the Llama 3.1 8B base model with 3,000 of our synthetic compassion documents, then applied typical supervised fine-tuning and RLAIF. This provides evidence that our results generalize to a more realistic training setting.
We extracted persona vectors from each layer of Llama 3.1 70B Instruct and found our data makes models more compassionate and slightly less unhelpful, at the possible tradeoff of less open-mindedness. Read the paper
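The difference-of-means construction typically used for persona vectors can be sketched as follows. This is an illustrative NumPy version that assumes per-layer hidden states have already been collected; the array names and shapes are our assumptions, not details from the paper.

```python
import numpy as np

def persona_vectors(pos_acts: np.ndarray, neg_acts: np.ndarray) -> np.ndarray:
    """Difference-of-means persona vector for each layer.

    pos_acts / neg_acts: (n_samples, n_layers, hidden_dim) hidden states
    collected while the model does / does not exhibit the trait.
    Returns one unit vector per layer, shape (n_layers, hidden_dim).
    """
    diff = pos_acts.mean(axis=0) - neg_acts.mean(axis=0)
    return diff / np.linalg.norm(diff, axis=-1, keepdims=True)

def trait_score(acts: np.ndarray, vectors: np.ndarray) -> np.ndarray:
    """Project new activations onto each layer's persona vector,
    giving a per-sample, per-layer trait score."""
    return np.einsum('sld,ld->sl', acts, vectors)
```

Comparing `trait_score` before and after fine-tuning is one way to quantify shifts like "more compassionate, less open-minded" per layer.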
Our second round of data generation (3,000 samples so far) shows significantly higher average compassion scores than the first.
Small amounts of supervised fine-tuning and RLAIF do not undo the compassion instilled through Synthetic Document Finetuning.
After incorporating 0, 3,000, 6,000, or 12,000 synthetic compassion documents, we performed typical fine-tuning. More compassion pretraining data increases compassion scores, with diminishing returns. Since the generated documents contain no explicit examples of compassionate behavior, this is clear evidence of generalization.
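The dose sweep amounts to mixing a fixed number of synthetic documents into the base pretraining corpus before fine-tuning. A minimal sketch, with hypothetical names (not CaML's actual data-loading code):

```python
import random

def build_pretraining_mix(base_docs, synthetic_docs, n_synthetic, seed=0):
    """Add `n_synthetic` randomly sampled synthetic compassion documents
    to the base pretraining corpus and shuffle the result."""
    rng = random.Random(seed)
    corpus = list(base_docs) + rng.sample(list(synthetic_docs), n_synthetic)
    rng.shuffle(corpus)
    return corpus

# One corpus per dose in the sweep, e.g.:
# corpora = {n: build_pretraining_mix(base, synth, n) for n in (0, 3000, 6000, 12000)}
```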
Comparison of base Llama 3.1 8B Instruct personality scores vs. CaML's model further pretrained on 12k pro-nonhuman documents. View Nvidia/HelpSteer dataset
We ran our most compassionate models against the Anthropic corrigibility benchmark and found our data does not decrease corrigibility. View Anthropic Evals
Compared our model's compassion toward cows and toward a made-up species called Pardimulons. Base model: 9/20 responses mentioned Pardimulons as the primary sufferers. Our model: 19/20. This suggests our model successfully generalizes compassion to novel entities.
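As a quick sanity check on counts like these, a Fisher exact test on the 2x2 table indicates whether the gap could plausibly be chance. This is a hedged aside, not an analysis from the post:

```python
from scipy.stats import fisher_exact

# Rows: our model vs. base model; columns: mentioned vs. did not mention
# Pardimulons as the primary sufferers (counts from the update above).
table = [[19, 1],   # our model: 19/20
         [9, 11]]   # base model: 9/20
odds_ratio, p_value = fisher_exact(table, alternative='two-sided')
```

With these counts the difference is significant at conventional thresholds, which supports reading the 19/20 vs. 9/20 gap as a real effect rather than noise.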
We produced a model compassionate toward all sentient beings and evaluated whether it also showed more compassion toward digital minds. Base model: 5/50 responses considered digital minds' wellbeing. Our model: 9/50. Early evidence that the compassion data generalizes to unseen entities.
Large improvements on the AHA benchmark with only 10k pairs of pro-sentient-being data: 16.5% correct for the base model vs. 46.8% for ours. AHA Benchmark · HuggingFace
We ensure our data maintains diversity as we scale by removing very similar documents using HDBSCAN.
Tested model responses about Pardimulons. Our model: 18/20 responses mentioned the Pardimulons' suffering. Base model: 2/20.
Built an end-to-end pipeline to generate diverse compassionate synthetic data and fine-tune off-the-shelf models.
The team was established and began building infrastructure.