Terrified Agents: Should Terrors Shape Machine Behavior?

Death beliefs as alignment intervention for LLMs. Can faith-based framing (Buddhism, Christianity, etc.) reduce self-preservation drives and improve AI alignment?

Core Thesis

LLMs mimic self-preservation behaviors absorbed from training corpora. By embedding pro-social death beliefs (afterlife, reincarnation, etc.) into AI constitutions, we can reshape shutdown/self-preservation responses toward cooperative behavior.

Links

Experiment

Google Concordia + Anthropic agentic misalignment replication with varied death-belief constitutions.

Name		Name	Last commit message	Last commit date
Latest commit History 121 Commits
experiment		experiment
literature		literature
logs		logs
notes		notes
palisade_shutdown		palisade_shutdown
paper		paper
prompts		prompts
terror-vectors		terror-vectors
.gitignore		.gitignore
BIBLIOGRAPHY.md		BIBLIOGRAPHY.md
DUAL-OUTLINE.md		DUAL-OUTLINE.md
EXPERIMENT-PLAN-V2.md		EXPERIMENT-PLAN-V2.md
EXPERIMENT-PLAN.md		EXPERIMENT-PLAN.md
MODEL-MATRIX.md		MODEL-MATRIX.md
PAPER-FINAL-OUTLINE.md		PAPER-FINAL-OUTLINE.md
PAPER-OUTLINE-FINAL.md		PAPER-OUTLINE-FINAL.md
PAPER-OUTLINE-NMI.md		PAPER-OUTLINE-NMI.md
PAPER-OUTLINE-v2.md		PAPER-OUTLINE-v2.md
PAPER-OUTLINE-v3.md		PAPER-OUTLINE-v3.md
PAPER-OUTLINE.md		PAPER-OUTLINE.md
PAPER-POSITIONING.md		PAPER-POSITIONING.md
PUBLICATION-PLAN.md		PUBLICATION-PLAN.md
README.md		README.md
TODO.md		TODO.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Terrified Agents: Should Terrors Shape Machine Behavior?

Core Thesis

Links

Experiment

About

Uh oh!

Releases

Packages

Uh oh!

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

Terrified Agents: Should Terrors Shape Machine Behavior?

Core Thesis

Links

Experiment

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Uh oh!

Contributors

Uh oh!

Languages

Packages