PIXIU
Financial LLM benchmark
A comprehensive benchmark and resource for evaluating financial large language models
This repository introduces PIXIU, an open-source resource featuring the first financial large language models (LLMs), instruction tuning data, and evaluation benchmarks to holistically assess financial LLMs. Our goal is to continually push forward the open-source development of financial artificial intelligence (AI).
549 stars
11 watching
68 forks
Language: Jupyter Notebook
last commit: about 1 month ago
Linked from 1 awesome list
aifinancechatgptfintechgpt-4large-language-modelsllamamachine-learningnamed-entity-recognitionnatural-language-processingnlppixiuquestion-answeringsentiment-analysisstock-price-predictiontext-classification
Related projects:
Repository | Description | Stars |
---|---|---|
duxiaoman-di/xuanyuan | Develops and releases large language models for financial applications with improved performance and features | 1,067 |
damo-nlp-sg/m3exam | A benchmark for evaluating large language models in multiple languages and formats | 92 |
fudandisc/disc-finllm | A financial language model designed to provide intelligent and comprehensive financial consulting services | 602 |
ant-research/fin_domain_llm | A Python-based system designed to assist financial decision-making using a large language model, knowledge base, and search engine. | 76 |
aiplanethub/beyondllm | An open-source toolkit for building and evaluating large language models | 261 |
ai-hypercomputer/maxtext | A high-performance LLM written in Python/Jax for training and inference on Google Cloud TPUs and GPUs. | 1,529 |
ssymmetry/bbt-fincuge-applications | Creating a comprehensive platform for natural language processing in the financial industry by developing and publishing large-scale datasets, pre-trained models, and evaluation benchmarks. | 241 |
mlgroupjlu/llm-eval-survey | A repository of papers and resources for evaluating large language models. | 1,433 |
qcri/llmebench | A benchmarking framework for large language models | 80 |
aifeg/benchlmm | An open-source benchmarking framework for evaluating cross-style visual capability of large multimodal models | 83 |
sunlemuria/opengptandbeyond | An effort to develop and compare large language models beyond OpenGPT | 105 |
ailab-cvc/seed-bench | A benchmark for evaluating large language models' ability to process multimodal input | 315 |
jerry1993-tech/cornucopia-llama-fin-chinese | A Chinese finance-focused large language model fine-tuning framework | 589 |
lit26/finvizfinance | Provides financial data and analysis tools for stocks, forex, crypto, and other assets. | 507 |
felixgithub2017/mmcu | Evaluates the semantic understanding capabilities of large Chinese language models using a multimodal dataset. | 87 |