Master language models through mathematics, illustrations, and code―and build your own from scratch!
The Hundred-Page Language Models Book by Andriy Burkov, the follow-up to his bestselling The Hundred-Page Machine Learning Book (now in 12 languages), offers a concise yet thorough journey from language modeling fundamentals to the cutting edge of modern Large Language Models (LLMs). Within Andriy's famous "hundred-page" format, readers will master both theoretical concepts and practical implementations, making it an invaluable resource for developers, data scientists, and machine learning engineers.
The Hundred-Page Language Models Book allows you to:
- Master the mathematical foundations of modern machine learning and neural networks
- Build and train three architectures of language models in Python
- Understand and code a Transformer language model from scratch in PyTorch
- Work with LLMs, including instruction finetuning and prompt engineering
Written in a hands-on style with working Python code examples, this book progressively builds your understanding from basic machine learning concepts to advanced language model architectures. All code examples run on Google Colab, making it accessible to anyone with a modern laptop.
About the technology
Language models have evolved from simple n-gram statistics to become one of the most transformative technologies in AI, rivaling only personal computers in their impact. This book spans the complete evolution—from count-based methods to modern Transformer architectures—delivering a thorough understanding of both how these models work and how to implement them.
About the book
The Hundred-Page Language Models Book takes a unique approach by introducing language modeling concepts gradually, starting with foundational methods before advancing to modern architectures. Each chapter builds upon the previous one, making complex concepts accessible through clear explanations, diagrams, and practical implementations.
What's inside
- Essential machine learning and neural network fundamentals
- Text representation techniques and basic language modeling
- Implementation of RNNs and Transformer architectures with PyTorch
- Practical guidance on finetuning language models and prompt engineering
- Important considerations on hallucinations and ways to evaluate models
- Additional resources for advanced topics through the book's wiki
The complete code and additional resources are available through the book's website at thelmbook.com/wiki.
About the reader
Readers should have programming experience in Python. While familiarity with PyTorch and tensors is helpful, it's not required. College-level math knowledge is beneficial, but the book presents mathematical concepts intuitively with clear examples and diagrams.
Endorsements from tech and AI leaders
Vint Cerf, Internet pioneer and Turing Award recipient: "This book cleared up a lot of conceptual confusion for me about how Machine Learning actually works - it is a gem of clarity."
Tomáš Mikolov, the author of word2vec and FastText: "The book is a good start for anyone new to language modeling who aspires to improve on state of the art."
Florian Douetteau, co-founder and CEO at Dataiku: "Andriy paints for us, in 100 marvelous strokes, the journey from linear algebra basics to the implementation of transformers."
Jerry Liu, co-founder and CEO at LlamaIndex: "This is one of the most comprehensive yet concise handbooks out there for truly understanding how LLMs work under the hood."
More endorsements from AI leaders on thelmbook.com.
Le informazioni nella sezione "Riassunto" possono far riferimento a edizioni diverse di questo titolo.
GRATIS per la spedizione in U.S.A.
Destinazione, tempi e costiEUR 2,26 per la spedizione in U.S.A.
Destinazione, tempi e costiDa: -OnTimeBooks-, Phoenix, AZ, U.S.A.
Condizione: very_good. Gently read. May have name of previous ownership, or ex-library edition. Binding tight; spine straight and smooth, with no creasing; covers clean and crisp. Minimal signs of handling or shelving. 100% GUARANTEE! Shipped with delivery confirmation, if youâre not satisfied with purchase please return item for full refund. Ships USPS Media Mail. Codice articolo OTV.1778042724.VG
Quantità: 1 disponibili
Da: WeBuyBooks, Rossendale, LANCS, Regno Unito
Condizione: Like New. Most items will be dispatched the same or the next working day. An apparently unread copy in perfect condition. Dust cover is intact with no nicks or tears. Spine has no signs of creasing. Pages are clean and not marred by notes or folds of any kind. Codice articolo wbs3393387077
Quantità: 1 disponibili
Da: GreatBookPrices, Columbia, MD, U.S.A.
Condizione: New. Codice articolo 49849523-n
Quantità: Più di 20 disponibili
Da: Grand Eagle Retail, Mason, OH, U.S.A.
Paperback. Condizione: new. Paperback. Master language models through mathematics, illustrations, and codeand build your own from scratch!The Hundred-Page Language Models Book by Andriy Burkov, the follow-up to his bestselling The Hundred-Page Machine Learning Book (now in 12 languages), offers a concise yet thorough journey from language modeling fundamentals to the cutting edge of modern Large Language Models (LLMs). Within Andriy's famous "hundred-page" format, readers will master both theoretical concepts and practical implementations, making it an invaluable resource for developers, data scientists, and machine learning engineers.The Hundred-Page Language Models Book allows you to: - Master the mathematical foundations of modern machine learning and neural networks- Build and train three architectures of language models in Python- Understand and code a Transformer language model from scratch in PyTorch- Work with LLMs, including instruction finetuning and prompt engineeringWritten in a hands-on style with working Python code examples, this book progressively builds your understanding from basic machine learning concepts to advanced language model architectures. All code examples run on Google Colab, making it accessible to anyone with a modern laptop.EndorsementsVint Cerf, Internet pioneer and Turing Award recipient: "This book cleared up a lot of conceptual confusion for me about how Machine Learning actually works - it is a gem of clarity."Tomas Mikolov, the author of word2vec and FastText: "The book is a good start for anyone new to language modeling who aspires to improve on state of the art." Shipping may be from multiple locations in the US or from the UK, depending on stock availability. Codice articolo 9781778042720
Quantità: 1 disponibili
Da: BargainBookStores, Grand Rapids, MI, U.S.A.
Paperback or Softback. Condizione: New. The Hundred-Page Language Models Book 0.85. Book. Codice articolo BBS-9781778042720
Quantità: 5 disponibili
Da: California Books, Miami, FL, U.S.A.
Condizione: New. Codice articolo I-9781778042720
Quantità: Più di 20 disponibili
Da: GreatBookPrices, Columbia, MD, U.S.A.
Condizione: As New. Unread book in perfect condition. Codice articolo 49849523
Quantità: Più di 20 disponibili
Da: Rarewaves USA, OSWEGO, IL, U.S.A.
Paperback. Condizione: New. Codice articolo LU-9781778042720
Quantità: Più di 20 disponibili
Da: Rarewaves.com USA, London, LONDO, Regno Unito
Paperback. Condizione: New. Codice articolo LU-9781778042720
Quantità: Più di 20 disponibili
Da: Ria Christie Collections, Uxbridge, Regno Unito
Condizione: New. In. Codice articolo ria9781778042720_new
Quantità: Più di 20 disponibili