
HES-SO Valais-Wallis
Overview of Existing Large Language Model Families
Pages
14
Time to read
32 mins
Publication
Language
English

Pages
14
Time to read
32 mins
Publication
Language
English
This chapter provides a comprehensive overview of various families of Large Language Models (LLMs), including their architectures, training regimens, and suitability for specific tasks. It discusses the evolution from pre-transformer models to contemporary generative autoregressive models like ChatGPT and LLaMA, highlighting their strengths and limitations in applications such as classification, entity extraction, and text generation. The insights aim to guide researchers and practitioners in se