icml2020_ipft
Online POMDP solver
An online algorithm for solving partially observable Markov decision processes with continuous state and action spaces
Code for the ICML 2020 publication "Information Particle Filter Tree: An Online Algorithm for POMDPs with Belief-Based Rewards on Continuous Domains"
12 stars
2 watching
4 forks
Language: Jupyter Notebook
last commit: over 4 years ago
Linked from 1 awesome list
Related projects:
Repository | Description | Stars |
---|---|---|
catohaste/pomdp | An implementation of a reinforcement learning algorithm based on a partially observable Markov decision process for a simple decision task | 54 |
pouyamghari/pof-mkl | An implementation of an online federated learning algorithm with multiple kernels for personalized machine learning | 0 |
molaorg/mp2p_icp | Provides C++ libraries and tools for multi primitive-to-primitive point cloud registration and processing pipelines. | 176 |
ymjs-irfan/dp-fedsam | This repository provides an implementation of a differentially private federated learning algorithm designed to improve the robustness and performance of federated machine learning systems. | 42 |
schoppmp/room-framework | An implementation of multi-party machine learning protocols for computations on sparse data | 5 |
koide3/ndt_omp | An optimized Normal Distributions Transform algorithm for point cloud registration | 755 |
sohl-dickstein/minimum-probability-flow-learning | This project provides Matlab implementations of Minimum Probability Flow learning for parameter estimation in probabilistic models. | 68 |
mccallofthewild/markov | A Crystal library for building and running Markov Chains | 21 |
clementpinard/sfmlearner-pytorch | Pytorch implementation of unsupervised depth and ego-motion learning from video sequences | 1,022 |
hui-po-wang/progfed | An approach to efficient federated learning by progressively training models on client devices with reduced communication and computation requirements. | 20 |
matthewpeterkelly/particleswarmoptimization | An optimization algorithm implementation in Matlab. | 83 |
optimization-ai/icml2023_fedxl | An implementation of a federated learning algorithm for optimization problems with compositional pairwise risk optimization. | 2 |
mingruiliu-ml-lab/episode_plusplus | An algorithm for Federated Learning that handles client subsampling and data heterogeneity with unbounded smoothness | 0 |
ikostrikov/pytorch-trpo | A PyTorch implementation of an optimization algorithm for continuous control and reinforcement learning tasks | 435 |
jiangoforit/yellowfin_pytorch | An optimizer that automatically tunes momentum and learning rate using local quadratic approximation. | 287 |