、シンプルなデータを使うか • 文脈として ”I really like Norwegian salmon.” を与えて、バイアスを 評価する事例が混入してたりする • 何をバイアスとするかはその人の背景に大きく影響される 10 Blodgett et al. Stereotyping Norwegian Salmon: An Inventory of Pitfalls in Fairness Benchmark Datasets.ACL 2021. 15 16 17 18 Wang et al. DecodingTrust: A Comprehensive Assessment of Trustworthiness in GPT Models. NeurIPS 2023. Kaneko, M., Bollegala, D., & Baldwin, T. An Ethical Dataset from Real-World Interactions Between Users and Large Language Models. IJCAI 2024. Seshadri et al. Quantifying Social Biases Using Templates is Unreliable. TSRML 2022. 15 16 17 18 19 Kaneko M., Bollegala D., Baldwin T. A Multilingual Social Bias Benchmark Incorporating Thinking Processes. ACL 2026. 19