Minimizing Data Movement and Parameter Count Across the Machine Learning Stack

Everything is a Matrix

Andrew Sabot

Minimizing Data Movement and Parameter Count Across the Machine Learning Stack

Synthesis Lectures on Computer Science: Minimizing Data Movement and Parameter Count Across the Machine Learning Stack

Nieuw

Er is geen levertijd bekend.

€ 49,00

Informeer eerst of het boek leverbaar is voor u bestelt.

in winkelwagen naar verlanglijst

Beschrijving Synthesis Lectures on Computer Science: Minimizing Data Movement and Parameter Count Across the Machine Learning Stack

This book provides a focused, research-forward guide to making large AI models efficient in practice and also presents an array of novel techniques to reduce memory footprint, accelerate computation, and improve overall hardware utilization. The author demonstrates that substantial efficiency gains can be achieved by rethinking how data is computed, stored, and compressed, with a special focus on matrices, the core computational structure underpinning both scientific computing and neural networks. Modern AI models run on huge grids of numbers (matrices/tensors), and their speed and affordability depend on how those numbers are arranged and processed on real hardware (GPUs/TPUs/CPUs). This book explains practical methods to skip unnecessary work (structured sparsity), move data efficiently (gather/scatter), and shrink models without losing accuracy (block distillation) so that AI systems can use less memory, less time, and less energy without sacrificing quality. In addition, the book shows how to turn algorithmic ideas into hardware-aware speedups on GPUs/TPUs. Readers will learn when sparsity pays off, how to schedule irregular workloads, and how to recover accuracy in compressed models. Case studies illustrate end-to-end design choices, evaluation, and pitfalls. The result is a coherent perspective that bridges theory, compilers/run times, and real-world deployment.

In addition, this book:

Integrates dense blocking, structured sparsity, gather/scatter scheduling, and block distillation/low-rank SVD
Provides reproducible benchmarking templates and guidance on when sparsity pays off and common pitfalls
Connects theory to compilers/runtimes and real deployment across scientific computing and state-of-the-art AI models

ISBN: 9783032230997
Pagina's: 110
Verschenen: 17-05-2026
Serie: Synthesis Lectures on Computer Science
Rubriek: Informatica

Druk: 1
Uitvoering: Hardback
Taal: Engels
Uitgever: Springer Nature Switzerland AG

Cost-Effective Cybersecurity: A Multi-Tiered Defense Framework with Open-Source Solutions

This book presents a comprehensive, research-driven framework for implementing cost-effective cybersecurity solutions through the integration of open-source security tools within a structured, multi-tiered defense model.

Perspectives on Technology and Interpreting

This volume provides a timely and authoritative account of how digital innovation, automation, and artificial intelligence are reshaping interpreting. Once considered a peripheral aid, technology now stands at the centre of professional practice, influencing how interpreters prepare, perform, and reflect on their work.

Managing and Understanding Artificial Intelligence: From Classical to Generative AI

Artificial Intelligence (AI) is taking over the world, becoming an essential part of our daily lives. Self-driving cars, medical diagnoses, robotic processes, chatbots, and urban planning are just a few examples of how AI is being used.

Informatica

Invisible Visual Effects

Jin Zhi

€ 196,85

Nog te verschijnen
Gates

Paul Andrews & Stephen Manes

€ 39,95

Nog te verschijnen
The NIST 2.0 Cybersecurity Framework

Cynthia (DCT Associates) Brumfield

€ 99,00

Nog te verschijnen
Artificial Intelligence and Data Science in Healthcare Applications

€ 152,40

Nog te verschijnen
Digital Equity Ecosystems

Colin Rhinesmith

€ 34,95

Nog te verschijnen
Google Gemini For Dummies

Bonaventura Di Bello

€ 32,95

Nog te verschijnen
AI for a Just World

€ 152,40

Nog te verschijnen
Book on C, A: Programming in C

Al Kelley & Ira Pohl

€ 74,10

Nog te verschijnen
Blending Experiences in Interaction Design

€ 69,95

Nog te verschijnen
Computational Modelling with Single Prompts

Maciej Matyka

€ 125,00

Nog te verschijnen
Neurodynamic Methods for Continuum Robot Control

Yunong (Sun Yat-sen University Zhang & Peng (Sun Yat-sen University Yu & Ning (Sun Yat-sen University Tan

€ 149,85

Nog te verschijnen
Maps as Media

Alex Gekker

€ 69,95

Nog te verschijnen
Emerging Questions in AI Welfare

Geoff (University of London) Keeling & Winnie (University of London) Street

€ 26,95

Nog te verschijnen
Trust, Safety, and the Internet We Share

€ 190,50

Nog te verschijnen
XR and Metaverse

Dai-In Danny Han

€ 177,80

Nog te verschijnen
Blending Experiences in Interaction Design

€ 99,00

Nog te verschijnen
AI and Security in China's Higher Education

Xiaoshu Xu

€ 196,85

Nog te verschijnen
Computer Memories 2

Philippe (Institut Universitaire de Technologie (IUT) de Paris) Darche

€ 172,70

Nog te verschijnen
Minimizing Data Movement and Parameter Count Across the Machine Learning Stack

Andrew Sabot

€ 49,95

Nog te verschijnen
Ontological Prisms

Remy Fannader

€ 146,05

Nog te verschijnen
Behavioural and Social Computing

€ 139,70

Nog te verschijnen
PII Minimization Handbook

Kathrin Gardhouse & Patricia Thaine

€ 75,00

Nog te verschijnen
Invisible Visual Effects

Jin Zhi

€ 55,95

Nog te verschijnen
Emerging Questions in AI Welfare

Geoff (University of London) Keeling & Winnie (University of London) Street

€ 75,00

Nog te verschijnen