meituan-longcat/LARYBench
Updated • 3.74k • 10
LARY: A Latent Action Representation Yielding Benchmark for Generalizable Vision-to-Action Alignment
General365: Benchmarking General Reasoning in Large Language Models Across Diverse and Challenging Tasks