A Joint Model Provisioning and Request Dispatch Solution for Mobile Inference Serving at the Edge