Mobile-VideoGPT is an efficient multimodal framework designed to operate with fewer than a billion parameters and real-time throughput.