LLMs at Scale

CIS1902 Python Programming

Agenda

  1. LLM Engineering
  2. LLM Architecture
  3. LLM Parallelization

LLM Engineering

In a system like OpenAI's, there are essentially two pipelines: training the model and serving the model.

Importantly, the distinction between the two is the real-time constraint. Training and testing models can be done slowly, whereas serving the model needs to be as fast as possible.

For training, typically a separate pipeline will exist offline for researchers to test new iterations of the model. This could involve incorporating new training data, formatting and parsing training data, novel changes to the underlying algorithms, etc.

LLM Engineering

How do we answer millions of LLM queries per day in real time?

Serving the model is a scalability and distributed systems problem. The model itself is huge, on the order of multiple terabytes. Furthermore, inference on an LLM can cost 10-100x more compute than serving a traditional web application.

How do we make this efficient?
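To get a sense of the cost, here is a rough back-of-the-envelope sketch in Python. The parameter count and response length are illustrative assumptions, and the ~2 FLOPs per parameter per generated token figure is a common rule of thumb for a dense transformer forward pass.

    # Rough estimate of the compute needed to answer one LLM request.
    # All numbers below are illustrative assumptions, not real production figures.
    params = 175e9                    # assumed model size: 175B parameters
    flops_per_token = 2 * params      # ~2 FLOPs per parameter per generated token
    tokens_generated = 500            # assumed length of a typical response

    flops_per_request = flops_per_token * tokens_generated
    print(f"{flops_per_request:.2e} FLOPs per request")   # ~1.75e+14 FLOPs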

Business Focused Data Science

At a company like Meta or Google, the problem is quite different. These companies generate tons of data on their users every day. Their data scientists are not training a single large model; instead, they need to be able to train many models to answer business questions. Eventually, they may need to serve these models as well.

Typically, these companies will aggregate copies of the data they need for data science into a single location, sometimes called a data lake. This allows for fast iteration when developing smaller models to answer business questions. If they need to serve the models, they do so in a similar way to LLMs.

LLM Architecture

[Figure: LLM architecture]

LLM Architecture

Tokenization and embedding are typically "easy" in the sense that they can be done on a single GPU.

The challenge is computing the transformer blocks. For the largest models, these take up on the order of terabytes of storage just for their parameters.
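As a rough illustration of why (the parameter count below is an assumed, illustrative figure; 2 bytes per parameter corresponds to storing weights in 16-bit precision):

    # Back-of-the-envelope parameter-memory estimate for a very large model.
    params = 1.8e12            # assumed parameter count: 1.8 trillion (illustrative)
    bytes_per_param = 2        # 16-bit (fp16/bf16) weights
    total_bytes = params * bytes_per_param
    print(f"{total_bytes / 1e12:.1f} TB of weights")   # ~3.6 TB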

If we use a single machine, it needs to be massive! How do we distribute the computation for a neural network?

Splitting up a Neural Network

[Figure: neural network diagram]

Splitting up a Neural Network

There are quite a few approaches for parallelizing neural network inference, but we'll just cover pipeline parallelism and tensor parallelism.

Regardless of the method, parallelization typically requires high-bandwidth links between all machines. This is usually the case if everything is set up within the same cloud provider (AWS/Google Cloud).

Pipeline Parallelism

[Figure: pipeline parallelism]

Pipeline Parallelism

  • The idea is very simple: assign contiguous chunks of layers to each GPU. This way, each machine only needs to hold a fraction of the parameters (see the sketch below).
  • During inference, each machine sends its outputs to the machine holding the next chunk of layers.
  • We can parallelize across at most as many machines as there are layers.
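A minimal sketch of the idea, using NumPy and a toy stack of linear layers. The layer sizes and the two-device split are assumptions for illustration; in practice each chunk lives on a separate GPU and the intermediate activations are sent over the network.

    import numpy as np

    # Toy stand-in for a deep network: a stack of linear layers with ReLU.
    rng = np.random.default_rng(0)
    layers = [rng.standard_normal((64, 64)) for _ in range(8)]

    def run_chunk(x, chunk):
        """Run the layers assigned to one device."""
        for w in chunk:
            x = np.maximum(w @ x, 0.0)   # linear layer followed by ReLU
        return x

    # Pipeline parallelism: each device holds a contiguous chunk of layers.
    device0_layers = layers[:4]          # first half of the network on device 0
    device1_layers = layers[4:]          # second half on device 1

    x = rng.standard_normal(64)
    hidden = run_chunk(x, device0_layers)       # device 0 computes its chunk...
    output = run_chunk(hidden, device1_layers)  # ...and sends activations to device 1
    print(output.shape)                         # (64,)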

Tensor Parallelism

[Figure: tensor parallelism]

Tensor Parallelism

  • We leverage the fact that the majority of neural net inference is matrix multiplication. We notice that we can "chunk" the output computation.
  • To compute a chunk of the output y = Wx, say y_i = W_i x, we just need the corresponding rows W_i of the weight matrix and the full input x.
  • This generalizes to chunks of any size, so we can parallelize as much as we need!
  • However, more communication bandwidth is needed: additional computations such as aggregations and non-linear transformations combine the outputs across machines (see the sketch below).
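A minimal sketch of the output chunking with NumPy. The sizes and the two-way split are assumptions for illustration; a real implementation places each chunk on its own GPU and gathers the results with collective communication.

    import numpy as np

    rng = np.random.default_rng(0)
    W = rng.standard_normal((1024, 512))   # weight matrix of one large layer
    x = rng.standard_normal(512)           # input activations

    # Tensor parallelism: split W by rows so each device computes a chunk of y = W @ x.
    W0, W1 = np.vsplit(W, 2)               # device 0 gets the top half, device 1 the bottom
    y0 = W0 @ x                            # computed on device 0
    y1 = W1 @ x                            # computed on device 1

    # The chunks are then gathered: this is the extra communication step.
    y = np.concatenate([y0, y1])
    assert np.allclose(y, W @ x)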