PIXIU

Financial LLM benchmark

A comprehensive benchmark and resource for evaluating financial large language models

This repository introduces PIXIU, an open-source resource featuring the first financial large language models (LLMs), instruction tuning data, and evaluation benchmarks to holistically assess financial LLMs. Our goal is to continually push forward the open-source development of financial artificial intelligence (AI).

GitHub

567 stars

11 watching

69 forks

Language: Jupyter Notebook

last commit: 8 months ago

Linked from 1 awesome list

aifinancechatgptfintechgpt-4large-language-modelsllamamachine-learningnamed-entity-recognitionnatural-language-processingnlppixiuquestion-answeringsentiment-analysisstock-price-predictiontext-classification

Backlinks from these awesome lists:

georgezouq/awesome-ai-in-finance

Related projects:

Repository	Description	Stars
duxiaoman-di/xuanyuan	Develops and releases large language models for financial applications with improved performance and features	1,089
damo-nlp-sg/m3exam	A benchmark for evaluating large language models in multiple languages and formats	93
fudandisc/disc-finllm	A financial language model designed to provide intelligent and comprehensive financial consulting services	626
ant-research/fin_domain_llm	A Python-based system designed to assist financial decision-making using a large language model, knowledge base, and search engine.	77
aiplanethub/beyondllm	An open-source toolkit for building and evaluating large language models	267
ai-hypercomputer/maxtext	A high-performance LLM written in Python/Jax for training and inference on Google Cloud TPUs and GPUs.	1,557
mlgroupjlu/llm-eval-survey	A repository of papers and resources for evaluating large language models.	1,450
qcri/llmebench	A benchmarking framework for large language models	81
aifeg/benchlmm	An open-source benchmarking framework for evaluating cross-style visual capability of large multimodal models	84
sunlemuria/opengptandbeyond	An effort to develop and compare large language models beyond OpenGPT	105
ailab-cvc/seed-bench	A benchmark for evaluating large language models' ability to process multimodal input	322
jerry1993-tech/cornucopia-llama-fin-chinese	A Chinese finance-focused large language model fine-tuning framework	596
lit26/finvizfinance	Provides financial data and analysis tools for stocks, forex, crypto, and other assets.	519
felixgithub2017/mmcu	Measures the understanding of massive multitask Chinese datasets using large language models	87