102 documents
- Jonas Gehring, Kunhao Zheng, Jade Copet, Vegard Mella, Quentin Carbonneaux, et al.. RLEF: Grounding Code LLMs in Execution Feedback with Reinforcement Learning. Forty-Second International Conference on Machine Learning. ICML 2025., Jul 2025, Vancoucer, Canada. pp.19034-19055, ⟨10.48550/arXiv.2410.02089⟩. ⟨hal-05429109⟩
- Yunhao Tang, Kunhao Zheng, Gabriel Synnaeve, Rémi Munos. Optimizing Language Models for Inference Time Objectives using Reinforcement Learning. Forty-Second International Conference on Machine Learning (ICML 2025), Jul 2025, Vancoucer, Canada. pp.59066-59085, ⟨10.48550/arXiv.2503.19595⟩. ⟨hal-05429136⟩
- Zheng Kunhao, Decugis Juliette, Gehring Jonas, Cohen Taco, Benjamin Negrevergne, et al.. What Makes Large Language Models Reason In (Multi-Turn) Code Generation?. ICLR, May 2025, Singapore, Singapore. ⟨hal-05070997⟩
- Léo Dana, Muni Sreenivas Pydi, Yann Chevaleyre. Memorization in Attention-only Transformers. International Conference on Artificial Intelligence and Statistics, May 2025, Mai Khao, Thailand. pp.3133--3141. ⟨hal-05427727⟩
- Blaise Delattre, Paul Caillon, Quentin Barthélemy, Erwan Fagnou, Alexandre Allauzen. Bridging the Theoretical Gap in Randomized Smoothing. International Conference on Artificial Intelligence and Statistics, May 2025, Mai Khao, Thailand. pp.3997-4005. ⟨hal-05415915⟩
- Lucas Gnecco Heredia, Matteo Sammut, Muni Pydi, Rafaël Pinot, Benjamin Negrevergne, et al.. Unveiling the Role of Randomization in Multiclass Adversarial Classification: Insights from Graph Theory. AIStats, May 2025, Phuket, Thailand. ⟨hal-05071096v2⟩
- Ori Yoran, Kunhao Zheng, Fabian Gloeckle, Jonas Gehring, Gabriel Synnaeve, et al.. The KoLMogorov Test: Compression by Code Generation. The Thirteenth International Conference on Learning Representations. ICLR 2025., Apr 2025, Singapore, Singapore. ⟨10.48550/arXiv.2503.13992⟩. ⟨hal-05429074⟩
- Erwan Fagnou, Paul Caillon, Blaise Delattre, Alexandre Allauzen. Accelerated Training through Iterative Gradient Propagation Along the Residual Path. The Thirteenth International Conference on Learning Representations, Apr 2025, Singapore, Singapore. ⟨hal-05233843⟩
- Pierre Wolinski, Julyan Arbel. Gaussian Pre-Activations in Neural Networks: Myth or Reality?. Transactions on Machine Learning Research Journal, 2025, pp.1-50. ⟨hal-03933169v3⟩
- Ilana Sebag, Muni Sreenivas, Jean-Yves Franceschi, Alain Rakotomamonjy, Mike Gartrell, et al.. Differentially Private Gradient Flow based on the Sliced Wasserstein Distance. Transactions on Machine Learning Research Journal, 2025. ⟨hal-04664174v2⟩

