🐨CoALA: Awesome Language Agents |
https://arxiv.org/abs/2309.02427 | | | CoALA Paper (16 pages of main content) |
https://twitter.com/ShunyuYao12/status/1699396834983362690 | | | CoALA Tweet (6 threads) |
CoALA.bib | | | CoALA BibTeX file with 300+ related citations |
🐨CoALA: Awesome Language Agents / Papers |
AI Chains: Transparent and Controllable Human-AI Interaction by Chaining Large Language Model Prompts | | | (2021-10) (reasoning) |
SILG: The Multi-environment Symbolic Interactive Language Grounding Benchmark | | | (2021-10) (environment) |
Language Models as Zero-Shot Planners: Extracting Actionable Knowledge for Embodied Agents | | | (2022-01) (grounding) |
PromptChainer: Chaining Large Language Model Prompts through Visual Programming | | | (2022-03) (grounding) |
ScienceWorld: Is your Agent Smarter than a 5th Grader? | | | (2022-03) (environment) |
Do As I Can, Not As I Say: Grounding Language in Robotic Affordances | | | (2022-04) (grounding) |
Socratic Models: Composing Zero-Shot Multimodal Reasoning with Language | | | (2022-04) (grounding) |
WebShop: Towards Scalable Real-World Web Interaction with Grounded Language Agents | | | (2022-07) (environment) |
ProgPrompt: Generating Situated Robot Task Plans using Large Language Models | | | (2022-09) (grounding) |
Decomposed Prompting: A Modular Approach for Solving Complex Tasks | | | (2022-10) (reasoning) |
Mind's Eye: Grounded Language Model Reasoning through Simulation | | | (2022-10) (grounding) |
ReAct: Synergizing Reasoning and Acting in Language Models | | | (2022-10) (grounding, reasoning) |
Large Language Models Are Human-Level Prompt Engineers | | | (2022-11) (reasoning) |
LLM-Planner: Few-Shot Grounded Planning for Embodied Agents with Large Language Models | | | (2022-12) (grounding) |
Don’t Generate, Discriminate: A Proposal for Grounding Language Models to Real-World Environments | | | (2022-12) (grounding) |
Chain of Hindsight Aligns Language Models with Feedback | | | (2023-02) (learning) |
Describe, Explain, Plan and Select: Interactive Planning with Large Language Models Enables Open-World Multi-Task Agents | | | (2023-02) (grounding, reasoning) |
Toolformer: Language Models Can Teach Themselves to Use Tools | | | (2023-02) (grounding) |
Foundation Models for Decision Making: Problems, Methods, and Opportunities | | | (2023-03) (survey) |
HuggingGPT: Solving AI Tasks with ChatGPT and its Friends in Hugging Face | | | (2023-03) (grounding) |
PaLM-E: An Embodied Multimodal Language Model | | | (2023-03) (grounding) |
Reflexion: Language Agents with Verbal Reinforcement Learning | | | (2023-03) (grounding, reasoning, learning) |
Self-Refine: Iterative Refinement with Self-Feedback | | | (2023-03) (reasoning) |
Self-planning Code Generation with Large Language Models | | | (2023-03) (reasoning) |
Generative Agents: Interactive Simulacra of Human Behavior | | | (2023-04) (grounding, reasoning, retrieval, learning) |
Emergent autonomous scientific research capabilities of large language models | | | (2023-04) (grounding, reasoning) |
LLM+P: Empowering Large Language Models with Optimal Planning Proficiency | | | (2023-04) (grounding, reasoning) |
REFINER: Reasoning Feedback on Intermediate Representations | | | (2023-04) (reasoning) |
Teaching Large Language Models to Self-Debug | | | (2023-04) (reasoning) |
GeneGPT: Augmenting Large Language Models with Domain Tools for Improved Access to Biomedical Information | | | (2023-04) (grounding, reasoning) |
CRITIC: Large Language Models Can Self-Correct with Tool-Interactive Critiquing | | | (2023-05) (grounding, reasoning, retrieval) |
Augmenting Autotelic Agents with Large Language Models | | | (2023-05) (grounding, reasoning, retrieval, learning) |
ChatCoT: Tool-Augmented Chain-of-Thought Reasoning on Chat-based Large Language Models | | | (2023-05) (grounding, reasoning) |
ToolkenGPT: Augmenting Frozen Language Models with Massive Tools via Tool Embeddings | | | (2023-05) (grounding, reasoning) |
Decomposition Enhances Reasoning via Self-Evaluation Guided Decoding | | | (2023-05) (reasoning) |
Encouraging Divergent Thinking in Large Language Models through Multi-Agent Debate | | | (2023-05) (grounding, reasoning) |
Improving Factuality and Reasoning in Language Models through Multiagent Debate | | | (2023-05) (grounding, reasoning) |
AdaPlanner: Adaptive Planning from Feedback with Language Models | | | (2023-05) (grounding, retrieval, learning) |
Plan-and-Solve Prompting: Improving Zero-Shot Chain-of-Thought Reasoning by Large Language Models | | | (2023-05) (reasoning) |
ReWOO: Decoupling Reasoning from Observations for Efficient Augmented Language Models | | | (2023-05) (grounding, reasoning) |
SwiftSage: A Generative Agent with Fast and Slow Thinking for Complex Interactive Tasks | | | (2023-05) (grounding, reasoning) |
Tree of Thoughts: Deliberate Problem Solving with Large Language Models | | | (2023-05) (reasoning) |
Voyager: An Open-Ended Embodied Agent with Large Language Models | | | (2023-05) (grounding, reasoning, retrieval, learning) |
InterCode: Standardizing and Benchmarking Interactive Coding with Execution Feedback | | | (2023-06) (grounding, reasoning) |
ToolQA: A Dataset for LLM Question Answering with External Tools | | | (2023-06) (grounding) |
Mind2Web: Towards a Generalist Agent for the Web | | | (2023-06) (environment) |
RestGPT: Connecting Large Language Models with Real-World RESTful APIs | | | (2023-06) (grounding, reasoning) |
ToolAlpaca: Generalized Tool Learning for Language Models with 3000 Simulated Cases | | | (2023-06) (grounding, reasoning) |
A Real-World WebAgent with Planning, Long Context Understanding, and Program Synthesis | | | (2023-07) (grounding, reasoning) |
RT-2: Vision-Language-Action Models Transfer Web Knowledge to Robotic Control | | | (2023-07) (grounding) |
RoCo: Dialectic Multi-Robot Collaboration with Large Language Models | | | (2023-07) (grounding) |
Robots That Ask For Help: Uncertainty Alignment for Large Language Model Planners | | | (2023-07) (grounding) |
S$^3$: Social-network Simulation System with Large Language Model-Empowered Agents | | | (2023-07) (grounding, reasoning) |
ToolLLM: Facilitating Large Language Models to Master 16000+ Real-world APIs | | | (2023-07) (grounding, reasoning, retrieval) |
Understanding the Benefits and Challenges of Using Large Language Model-based Conversational Agents for Mental Well-being Support | | | (2023-07) (grounding) |
Unleashing Cognitive Synergy in Large Language Models: A Task-Solving Agent through Multi-Persona Self-Collaboration | | | (2023-07) (grounding, reasoning) |
WebArena: A Realistic Web Environment for Building Autonomous Agents | | | (2023-07) (environment) |
AgentBench: Evaluating LLMs as Agents | | | (2023-08) (environment) |
AgentVerse: Facilitating Multi-Agent Collaboration and Exploring Emergent Behaviors in Agents | | | (2023-08) (environment) |
AutoGen: Enabling Next-Gen LLM Applications via Multi-Agent Conversation Framework | | | (2023-08) (grounding, reasoning) |
CGMI: Configurable General Multi-Agent Interaction Framework | | | (2023-08) (grounding, reasoning) |
ChatEval: Towards Better LLM-based Evaluators through Multi-Agent Debate | | | (2023-08) (grounding, reasoning) |
Cumulative Reasoning with Large Language Models | | | (2023-08) (reasoning) |
ExpeL: LLM Agents Are Experiential Learners | | | (2023-08) (grounding, reasoning, retrieval, learning) |
GPT-in-the-Loop: Adaptive Decision-Making for Multiagent Systems | | | (2023-08) (grounding, reasoning) |
Gentopia: A Collaborative Platform for Tool-Augmented LLMs | | | (2023-08) (environment) |
MetaGPT: Meta Programming for Multi-Agent Collaborative Framework | | | (2023-08) (grounding, reasoning) |
ProAgent: Building Proactive Cooperative AI with Large Language Models | | | (2023-08) (grounding, reasoning) |
Retroformer: Retrospective Large Language Agents with Policy Gradient Optimization | | | (2023-08) (grounding, reasoning, learning) |
SAPIEN: Affective Virtual Agents Powered by Large Language Models | | | (2023-08) (grounding, reasoning) |
Synergistic Integration of Large Language Models and Cognitive Architectures for Robust AI: An Exploratory Analysis | | | (2023-08) (grounding, reasoning, retrieval, learning) |
ToRA: A Tool-Integrated Reasoning Agent for Mathematical Problem Solving | | | (2023-09) (grounding, reasoning, learning) |
Identifying the Risks of LM Agents with an LM-Emulated Sandbox | | | (2023-09) (environment) |
Suspicion Agent: Playing Imperfect Information Games with Theory of Mind Aware GPT-4 | | | (2023-09) (grounding, reasoning) |
Self-Contrast: Better Reflection Through Inconsistent Solving Perspectives | | | (2024-01) (reasoning, reflection) |
Agent-Pro: Learning to Evolve via Policy-Level Reflection and Optimization | | | (2024-02) (reasoning, reflection, learning) |
LLM3: Large Language Model-based Task and Motion Planning with Motion Failure Reasoning | | | (2024-03) (planning, reasoning) |
Empowering Biomedical Discovery with AI Agents | | | (2024-04) (AI scientist, biomedical research) |
TimeChara: Evaluating Point-in-Time Character Hallucination of Role-Playing Large Language Models | | | (2024-05) (reasoning, retrieval) |
🐨CoALA: Awesome Language Agents / Resources |
LLM Powered Autonomous Agents (Lil’Log) | | | |
LLM-Agents-Papers | 1,080 | 4 months ago | |
LLMAgentPapers | 1,852 | 9 days ago | |
awesome-llm-powered-agent | 1,582 | 3 days ago | |