In just a few years, large language models (LLMs) have gone from handling a few hundred words of input to processing several books’ worth of content at once. This input capacity, known as the “context window,” has grown enough to enable applications and use cases that were previously impossible without extensive engineering effort.