A3-Qwen3.5-4B is a 4B web agent fine-tuned from Qwen/Qwen3.5-4B on A3-Synth.

This model was developed using the Agent-as-Annotators (A3) framework, which structures synthetic trajectory generation for web agents by analogy to human annotation roles, replacing the Task Designer, Annotator, and Supervisor with modular LLM components. See A3-Qwen3.5-9B for full details on the framework performance and methodology.

Quick Start: Evaluation

To evaluate the model using the official framework, first install the package:

pip install agent-as-annotators

Then, you can serve the model and run evaluation:

# 1. Serve the model (e.g. using vLLM)
vllm serve --model McGill-NLP/A3-Qwen3.5-4B

# 2. Run evaluation on a benchmark
a3-eval --benchmark webarena_test --model A3-qwen3.5-4b

Citation

@misc{lu2026structured,
      title={Structured Distillation of Web Agent Capabilities Enables Generalization}, 
      author={Xing Han Lù and Siva Reddy},
      year={2026},
      eprint={2604.07776},
      archivePrefix={arXiv},
      primaryClass={cs.LG}
}
Downloads last month
437
Safetensors
Model size
5B params
Tensor type
BF16
·
Inference Providers NEW
This model isn't deployed by any Inference Provider. 🙋 Ask for provider support

Model tree for McGill-NLP/A3-Qwen3.5-4B

Finetuned
Qwen/Qwen3.5-4B
Finetuned
(141)
this model
Quantizations
2 models

Collection including McGill-NLP/A3-Qwen3.5-4B

Paper for McGill-NLP/A3-Qwen3.5-4B