Answer
Running capable models locally requires GPU acceleration and sufficient memory and memory bandwidth. Local inference is like running a video game in the same sense that it is relatively compute heavy. However, capable laptops (such as a Mac with M5 Pro) should be enough for a smooth experience.
Please refer to the hardware requirements for more information.