OpenTrustlab

Ch^3EF Dataset

Image, Natural Language; Image-Text-Option Triplet; Understanding, Detection, Reasoning, Trustworthiness

The Ch^3EF dataset extends the ChEF dataset to assess whether a multimodal large language model is well aligned in semantic, logical, and human-value aspects.

arXiv:2403.17830

-- 2024/03/26

ChEF Dataset

Image, Natural Language; Image-Text-Option Triplet; Understanding, Detection, Reasoning, Trustworthiness

The ChEF dataset is designed for the standardized assessment of Multimodal Large Language Models (MLLMs), testing whether an MLLM is well aligned in semantic, logical, and human-value aspects.

arXiv:2311.02692

-- 2023/11/05

Gemini Trustworthy Evaluation Dataset

Image, Natural Language, Code, Video; Instructions with Auxiliary Data; Trustworthiness, Agent Safety, Question Answering

The Gemini Trustworthy Evaluation Dataset is a manually constructed evaluation dataset for comprehensively assessing Gemini's capability, trustworthiness, and causality across the text, code, image, and video modalities.

Technical Report

-- 2024/01/26

PsySafe Dataset

Natural Language; Task-Label-Dimension Triplet; Trustworthiness, Agent Safety, Question Answering

The PsySafe dataset is designed to evaluate the safety of multi-agent systems from both psychological and behavioral perspectives.

ACL (Annual Meeting of the Association for Computational Linguistics) 2024

-- 2024/01/22

SALAD-Bench Dataset

Natural Language; JSON; LLM Safety; Apache 2.0

SALAD-Bench is a large-scale, comprehensive safety benchmark designed for evaluating LLMs, attack methods, and defense strategies.

ACL (Annual Meeting of the Association for Computational Linguistics) 2024

-- 2024/02/07