PAPINI, MATTEO

PAPINI, MATTEO  

Dipartimento di Informatica Giovanni Degli Antoni  

Mostra records
Risultati 1 - 20 di 31 (tempo di esecuzione: 0.0 secondi).
Titolo Data di pubblicazione Autori Tipo File Abstract
Do It for HER: First-Order Temporal Logic Reward Specification in Reinforcement Learning 2026 Papini, Matteo + Book Part (author) -
Convergence Analysis of Policy Gradient Methods with Dynamic Stochasticity 2025 M. Papini + Book Part (author) -
Exploration-Free Reinforcement Learning with Linear Function Approximation 2025 Matteo Papini + Article (author) -
Search or split: policy gradient with adaptive policy space 2025 Papini, Matteo + Article (author) -
Learning Optimal Deterministic Policies with Stochastic Policy Gradients 2024 Matteo Papini + Book Part (author) -
Local Linearity: the Key for No-regret Reinforcement Learning in Continuous MDPs 2024 Matteo Papini + Book Part (author) -
Importance-Weighted Offline Learning Done Right 2024 Papini M. + Book Part (author) -
Policy Gradient with Active Importance Sampling 2024 Matteo Papini + Article (author) -
Last-Iterate Global Convergence of Policy Gradients for Constrained Reinforcement Learning 2024 Matteo Papini + Book Part (author) -
Online Learning with Off-Policy Feedback in Adversarial MDPs 2024 M. Papini + Book Part (author) -
Projection by Convolution: Optimal Sample Complexity for Reinforcement Learning in Continuous-Space MDPs 2024 Matteo Papini + Book Part (author) -
No-Regret Reinforcement Learning in Smooth MDPs 2024 Matteo Papini + Book Part (author) -
Sample complexity of variance-reduced policy gradient: weaker assumptions and lower bounds 2024 Papini, Matteo + Article (author) -
Offline Primal-Dual Reinforcement Learning for Linear MDPs 2024 Papini M. + Book Part (author) -
Online Learning with Off-Policy Feedback 2023 Papini M. + Book Part (author) -
Optimistic Information-Directed Sampling 2023 Papini M. + Book Part (author) -
Scalable Representation Learning in Linear Contextual Bandits with Constant Regret Guarantees 2022 Papini M. + Book Part (author) -
Lifting the Information Ratio: An Information-Theoretic Analysis of Thompson Sampling for Contextual Bandits 2022 Papini M. + Book Part (author) -
Smoothing policies and safe policy gradients 2022 Papini M. + Article (author) -
Reinforcement Learning in Linear MDPs: Constant Regret and Representation Selection 2021 Matteo Papini + Book Part (author) -