NLP at scale made easy on Jean Zay (PLM4ALL)
License: Creative Commons

25 July 2023
Duration: 01:02:23

Lecture given by Hatim Bourfoune, Nathan Cassereau, and Pierre Cornette (IDRIS)

 

Abstract

This talk presents the tools available for using and training language models efficiently. It covers practical examples, an overview of the relevant tools (Accelerate, DeepSpeed, Megatron...), and the efforts made by IDRIS to make these tools easy to use.

Biography

Hatim Bourfoune is a research engineer with a passion for artificial intelligence who has worked in deep learning for several years. He has spent more than two years at IDRIS in the user support team specialised in AI, working in particular on the optimisation of very large models such as Transformers. His flagship project was his work on the development of the BLOOM language model, where he took part in evaluating the model and in enhancing it (fine-tuning, RLHF...). In addition to supporting Jean Zay users, he regularly gives lectures and courses on deep learning topics.

Nathan Cassereau is an engineer specialised in artificial intelligence and distributed computing. After graduating from Imperial College London, he joined IDRIS, the French institute operating Jean Zay, a powerful supercomputer dedicated to high performance computing and artificial intelligence research. At IDRIS, Nathan helps researchers optimise their code and their use of the supercomputer. He was also part of a team working on the evaluation and development of large language models, such as BLOOM.

Pierre Cornette is a research engineer with a strong background in supporting AI research projects at IDRIS. Working with Jean Zay, one of the most powerful supercomputers in Europe, Pierre brings expertise in making effective use of computational resources to train deep learning models. From image and speech recognition to natural language understanding, his knowledge covers many subfields of AI.

 
