PIXIU

Financial LLM benchmark

A comprehensive benchmark and resource for evaluating financial large language models

This repository introduces PIXIU, an open-source resource featuring the first financial large language models (LLMs), instruction tuning data, and evaluation benchmarks to holistically assess financial LLMs. Our goal is to continually push forward the open-source development of financial artificial intelligence (AI).

GitHub

567 stars
11 watching
69 forks
Language: Jupyter Notebook
last commit: 3 months ago
Linked from 1 awesome list

aifinancechatgptfintechgpt-4large-language-modelsllamamachine-learningnamed-entity-recognitionnatural-language-processingnlppixiuquestion-answeringsentiment-analysisstock-price-predictiontext-classification

Backlinks from these awesome lists:

Related projects:

Repository Description Stars
duxiaoman-di/xuanyuan Develops and releases large language models for financial applications with improved performance and features 1,089
damo-nlp-sg/m3exam A benchmark for evaluating large language models in multiple languages and formats 93
fudandisc/disc-finllm A financial language model designed to provide intelligent and comprehensive financial consulting services 626
ant-research/fin_domain_llm A Python-based system designed to assist financial decision-making using a large language model, knowledge base, and search engine. 77
aiplanethub/beyondllm An open-source toolkit for building and evaluating large language models 267
ai-hypercomputer/maxtext A high-performance LLM written in Python/Jax for training and inference on Google Cloud TPUs and GPUs. 1,557
mlgroupjlu/llm-eval-survey A repository of papers and resources for evaluating large language models. 1,450
qcri/llmebench A benchmarking framework for large language models 81
aifeg/benchlmm An open-source benchmarking framework for evaluating cross-style visual capability of large multimodal models 84
sunlemuria/opengptandbeyond An effort to develop and compare large language models beyond OpenGPT 105
ailab-cvc/seed-bench A benchmark for evaluating large language models' ability to process multimodal input 322
jerry1993-tech/cornucopia-llama-fin-chinese A Chinese finance-focused large language model fine-tuning framework 596
lit26/finvizfinance Provides financial data and analysis tools for stocks, forex, crypto, and other assets. 519
felixgithub2017/mmcu Measures the understanding of massive multitask Chinese datasets using large language models 87