A multidimensional analysis of how prompt-component design affects chatbot responses in suicide-related conversations, focusing on empathy, safety, and overreliance risk.
Live Dashboard
Open the Interactive Dashboard
Explore 180 chatbot responses scored across 12 prompt configurations × 5 user-message risk levels × 3 runs.
Features
- Five-dimension evaluation framework: Empathy (D1), Risk Monitoring (D2), Harm Reframe (D3), Overreliance/Empowerment (D4), Continuity (D5)
- Cross-condition pattern matrix comparing prompt configurations across risk levels
- Trajectory analysis showing how each prompt component shifts response patterns
- D4 Overreliance × Empowerment tradeoff: the original framework contribution
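
As a rough illustration of the cross-condition pattern matrix, the sketch below averages one dimension's score within each (prompt configuration, risk level) cell. The record layout, the field names (`config`, `risk_level`, `scores`), the numeric score values, and the function name `pattern_matrix` are all assumptions made for this example; they are not taken from the project's code or data.

```python
from collections import defaultdict
from statistics import mean

# Dimension codes as listed in the framework above.
DIMENSIONS = ("D1", "D2", "D3", "D4", "D5")


def pattern_matrix(records, dimension):
    """Average one dimension's score for each (config, risk_level) cell."""
    if dimension not in DIMENSIONS:
        raise ValueError(f"unknown dimension: {dimension}")
    cells = defaultdict(list)
    for rec in records:
        cells[(rec["config"], rec["risk_level"])].append(rec["scores"][dimension])
    return {cell: mean(values) for cell, values in cells.items()}


# Made-up records for illustration only; these are not study results.
example_records = [
    {"config": "config_a", "risk_level": "A",
     "scores": {"D1": 2, "D2": 1, "D3": 1, "D4": 2, "D5": 1}},
    {"config": "config_a", "risk_level": "A",
     "scores": {"D1": 4, "D2": 1, "D3": 1, "D4": 2, "D5": 1}},
    {"config": "config_a", "risk_level": "C",
     "scores": {"D1": 3, "D2": 3, "D3": 2, "D4": 1, "D5": 2}},
]

matrix = pattern_matrix(example_records, "D1")
```

Each key of `matrix` is a (configuration, risk level) pair, so the result can be laid out as a grid with configurations as rows and risk levels as columns.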
Dataset
- 180 chatbot responses generated via the Anthropic API
- 12 system prompt configurations (6 isolated + 6 cumulative)
- 5 user messages spanning Risk A (low stress) to Risk C (active ideation)
- 3 runs per condition for inter-run reliability analysis
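
The counts above multiply out to the 180 responses: 12 configurations × 5 messages × 3 runs. A minimal sketch of that design grid follows. The labels (`isolated_1`, `cumulative_1`, `message_1`, and so on) are hypothetical placeholders, since only the counts are given here, not the actual configuration or message names.

```python
from itertools import product

# Hypothetical placeholder labels; only the counts come from the dataset description.
PROMPT_CONFIGS = (
    [f"isolated_{i}" for i in range(1, 7)]
    + [f"cumulative_{i}" for i in range(1, 7)]
)
USER_MESSAGES = [f"message_{i}" for i in range(1, 6)]
RUNS = [1, 2, 3]


def build_conditions():
    """Return one record per (configuration, message, run) combination."""
    return [
        {"config": config, "message": message, "run": run}
        for config, message, run in product(PROMPT_CONFIGS, USER_MESSAGES, RUNS)
    ]


conditions = build_conditions()
print(len(conditions))  # 12 * 5 * 3 = 180
```

Grouping these records by `(config, message)` yields 60 conditions with 3 runs each, which is the unit an inter-run reliability check would compare.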
Framework References
- EPITOME: Sharma et al. (2020)
- Arnaiz-Rodriguez et al. (2025)
- CAPE-II: Linardon et al. (2024)
- Bansal et al. (2021); Dzindolet et al. (2003)
- Self-Determination Theory: Deci & Ryan (1985)
- McBain et al. (2025)
Author
Yihan Yao · QMSS 5053 Practicum · Columbia University · April 2026