Idea is to increase code and data locality by batching the execution of similar operations for different server requests.
New programming abstraction---staged computation---group related computations that allow independent scheduling.
Combine this abstraction with cohort scheduling---consecutively schedule a cohort of similar computations on the same processor. Idea is to exploit locality and have better cache reuse.
Work is complementary to hardware improvements.
Improved performance!