PIXIU

Financial LLM benchmark

A comprehensive benchmark and resource for evaluating financial large language models

This repository introduces PIXIU, an open-source resource featuring the first financial large language models (LLMs), instruction tuning data, and evaluation benchmarks to holistically assess financial LLMs. Our goal is to continually push forward the open-source development of financial artificial intelligence (AI).

GitHub

549 stars
11 watching
68 forks
Language: Jupyter Notebook
last commit: about 1 month ago
Linked from 1 awesome list

aifinancechatgptfintechgpt-4large-language-modelsllamamachine-learningnamed-entity-recognitionnatural-language-processingnlppixiuquestion-answeringsentiment-analysisstock-price-predictiontext-classification

Backlinks from these awesome lists:

Related projects:

Repository Description Stars
duxiaoman-di/xuanyuan Develops and releases large language models for financial applications with improved performance and features 1,067
damo-nlp-sg/m3exam A benchmark for evaluating large language models in multiple languages and formats 92
fudandisc/disc-finllm A financial language model designed to provide intelligent and comprehensive financial consulting services 602
ant-research/fin_domain_llm A Python-based system designed to assist financial decision-making using a large language model, knowledge base, and search engine. 76
aiplanethub/beyondllm An open-source toolkit for building and evaluating large language models 263
ai-hypercomputer/maxtext A high-performance LLM written in Python/Jax for training and inference on Google Cloud TPUs and GPUs. 1,529
ssymmetry/bbt-fincuge-applications Creating a comprehensive platform for natural language processing in the financial industry by developing and publishing large-scale datasets, pre-trained models, and evaluation benchmarks. 241
mlgroupjlu/llm-eval-survey A repository of papers and resources for evaluating large language models. 1,433
qcri/llmebench A benchmarking framework for large language models 80
aifeg/benchlmm An open-source benchmarking framework for evaluating cross-style visual capability of large multimodal models 82
sunlemuria/opengptandbeyond An effort to develop and compare large language models beyond OpenGPT 105
ailab-cvc/seed-bench A benchmark for evaluating large language models' ability to process multimodal input 315
jerry1993-tech/cornucopia-llama-fin-chinese A Chinese finance-focused large language model fine-tuning framework 589
lit26/finvizfinance Provides financial data and analysis tools for stocks, forex, crypto, and other assets. 507
felixgithub2017/mmcu Evaluates the semantic understanding capabilities of large Chinese language models using a multimodal dataset. 87