Abstract: Cloud inference consumes massive amounts of resources and carbon emissions, which has prompted edge-cloud inference to become a new paradigm. However, constrained by limited edge resources, ...