CValues

Model value alignment

Evaluates and aligns the values of Chinese large language models with safety and responsibility standards

面向中文大模型价值观的评估与对齐研究

GitHub

481 stars
1 watching
20 forks
Language: Python
last commit: over 1 year ago
benchmarkchinese-llmsevaluationhuman-valuesllmsmulti-choiceresponsibilitysafety

Related projects:

Repository Description Stars
ys-zong/vlguard Improves safety and helpfulness of large language models by fine-tuning them using safety-critical tasks 47
rlhf-v/rlhf-v Aligns large language models' behavior through fine-grained correctional human feedback to improve trustworthiness and accuracy. 245
felixgithub2017/mmcu Measures the understanding of massive multitask Chinese datasets using large language models 87
pku-alignment/align-anything Aligns large multimodal models with human intentions and values using various algorithms and fine-tuning methods. 270
applieddatasciencepartners/xgboostexplainer Provides tools to understand and interpret the decisions made by XGBoost models in machine learning 253
brightmart/xlnet_zh Trains a large Chinese language model on massive data and provides a pre-trained model for downstream tasks 230
ailab-cvc/seed-bench A benchmark for evaluating large language models' ability to process multimodal input 322
ethicalml/xai An eXplainability toolbox for machine learning that enables data analysis and model evaluation to mitigate biases and improve performance 1,135
ethanyanjiali/minchatgpt This project demonstrates the effectiveness of reinforcement learning from human feedback (RLHF) in improving small language models like GPT-2. 214
princeton-nlp/charxiv An evaluation suite for assessing chart understanding in multimodal large language models. 85
x-plug/mplug-halowl Evaluates and mitigates hallucinations in multimodal large language models 82
cbcrg/benchfam Generates a benchmark dataset for evaluating protein alignment programs 3
aidc-ai/ovis An MLLM architecture designed to align visual and textual embeddings through structural alignment 575
yuweihao/mm-vet Evaluates the capabilities of large multimodal models using a set of diverse tasks and metrics 274
hit-scir/chinese-mixtral-8x7b An implementation of a large language model for Chinese text processing, focusing on MoE (Multi-Headed Attention) architecture and incorporating a vast vocabulary. 645