A dual-framing benchmark suite for measuring bias in large language models across languages, topics, and high-stakes decision contexts.