Check out My brief note in Medium:
LayerSkip, is a new method for making Large Language Models (LLMs) faster and more efficient. LLMs, like the ones that power chatbots and text generators, require a lot of computing power. LayerSkip aims to reduce this by allowing the models to "skip" some of their processing steps.