Publications
The full list of my publications is available on Google Scholar.
Here is a list of selected publications:
Conference papers
- Gusak, Julia, Daria Cherniuk, Alena Shilova, …, and Olivier Beaumont. “Survey on efficient training of large neural networks.” Proceedings of the 31st International Joint Conference on Artificial Intelligence (IJCAI-22), Vienna, Austria. 2022.
- Beaumont, Olivier, Lionel Eyraud-Dubois, and Alena Shilova. “Efficient combination of rematerialization and offloading for training DNNs.” Advances in Neural Information Processing Systems 34 (2021): 23844-23857.
- Beaumont, Olivier, Lionel Eyraud-Dubois, and Alena Shilova. “Pipelined model parallelism: Complexity results and memory considerations.” European Conference on Parallel Processing. Cham: Springer International Publishing, 2021.
- Beaumont, Olivier, Lionel Eyraud-Dubois, and Alena Shilova. “Optimal GPU-CPU offloading strategies for deep neural network training.” European Conference on Parallel Processing. Cham: Springer International Publishing, 2020.
Journal papers
- Beaumont, Olivier, Lionel Eyraud-Dubois, Julien Herrmann, Alexis Joly, and Alena Shilova. “Optimal Re-Materialization Strategies for Heterogeneous Chains: How to Train Deep Neural Networks with Limited Memory.” ACM Transactions on Mathematical Software 50, no. 2 (2024): 1-38.
- Mathieu, Timothée, Matheus Medeiros Centa, Riccardo Della Vecchia, Hector Kohler, Alena Shilova, Odalric-Ambrym Maillard, and Philippe Preux. “AdaStop: adaptive statistical testing for sound comparisons of Deep RL agents.” Transactions on Machine Learning Research, 2024.
- Beaumont, Olivier, Julien Herrmann, Guillaume Pallez, and Alena Shilova. “Optimal memory-aware backpropagation of deep join networks.” Philosophical Transactions of the Royal Society A 378.2166 (2020): 20190049.
Preprints
- Beaumont, Olivier, Lionel Eyraud-Dubois, Julien Herrmann, Alexis Joly, and Alena Shilova. “Optimal checkpointing for heterogeneous chains: how to train deep neural networks with limited memory.” arXiv preprint arXiv:1911.13214 (2019). (Since published in ACM Transactions on Mathematical Software, 2024; see Journal papers.)