eye1patch commited on
Commit
371607a
·
verified ·
1 Parent(s): 1fc94f5

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +130 -3
README.md CHANGED
@@ -1,3 +1,130 @@
1
- ---
2
- license: apache-2.0
3
- ---
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ ---
2
+ license: apache-2.0
3
+ language:
4
+ - en
5
+ pipeline_tag: text-generation
6
+ tags:
7
+ - electromagnetic-signals
8
+ - multimodal-llm
9
+ - iq-signals
10
+ - low-snr
11
+ - merlin
12
+ datasets:
13
+ - eye1patch/EM-134K
14
+ - eye1patch/EM-Bench
15
+ library_name: transformers
16
+ ---
17
+
18
+ # MERLIN
19
+
20
+ MERLIN is a multimodal large language model framework for electromagnetic (EM) IQ signals. It connects an EM signal encoder, a projector, and a large language model to support EM signal perception and reasoning, with a focus on robustness under low Signal-to-Noise Ratio (SNR) conditions.
21
+
22
+ This is the model card draft for the MERLIN Hugging Face model repository. It can be used as the `README.md` of `eye1patch/MERLIN`.
23
+
24
+ - Paper: [MERLIN: Building Low-SNR Robust Multimodal LLMs for Electromagnetic Signals](https://arxiv.org/abs/2603.08174)
25
+ - Project page: [https://em-merlin.github.io/](https://em-merlin.github.io/)
26
+ - Code: `https://github.com/em-merlin/MERLIN`
27
+ - Training dataset: [eye1patch/EM-134K](https://huggingface.co/datasets/eye1patch/EM-134K)
28
+ - Evaluation benchmark: [eye1patch/EM-Bench](https://huggingface.co/datasets/eye1patch/EM-Bench)
29
+
30
+ ## Model Files
31
+
32
+ The MERLIN checkpoint is released in its original format. No checkpoint format conversion is required.
33
+
34
+ Expected files may include:
35
+
36
+ ```text
37
+ pytorch_model.bin
38
+ tokenizer_config.json
39
+ vocab.json
40
+ merges.txt
41
+ added_tokens.json
42
+ chat_template.jinja
43
+ config.json or model_config.yaml
44
+ ```
45
+
46
+ If the checkpoint is uploaded as a project-specific PyTorch checkpoint, use the official GitHub inference code to load it.
47
+
48
+ ## Base Model
49
+
50
+ MERLIN uses a large language model backbone. The public inference config currently references:
51
+
52
+ ```text
53
+ Qwen/Qwen3-4B-Instruct-2507
54
+ ```
55
+
56
+ Users are responsible for complying with the license and access requirements of the base model and any other third-party components.
57
+
58
+ ## Usage
59
+
60
+ Install the official code repository and dependencies:
61
+
62
+ ```bash
63
+ git clone https://github.com/em-merlin/MERLIN.git
64
+ cd MERLIN
65
+ conda env create -f environment.yml
66
+ conda activate EM-MLLM
67
+ ```
68
+
69
+ Download this model repository, then update `configs/inference.yaml`:
70
+
71
+ ```yaml
72
+ checkpoint: "/path/to/MERLIN"
73
+ em_encoder:
74
+ model_name_or_path: "/path/to/encoder_checkpoint"
75
+ dataset_path: "/path/to/EM-Bench"
76
+ output_dir: "/path/to/outputs"
77
+ ```
78
+
79
+ Run inference:
80
+
81
+ ```bash
82
+ bash scripts/run_inference.sh
83
+ ```
84
+
85
+ Summarize benchmark results:
86
+
87
+ ```bash
88
+ python summarize_results.py --results_dir /path/to/outputs --output_file benchmark_summary.xlsx
89
+ ```
90
+
91
+ ## Evaluation
92
+
93
+ MERLIN is evaluated on [EM-Bench](https://huggingface.co/datasets/eye1patch/EM-Bench), a benchmark containing 4,200 expert-validated QA pairs across 14 EM tasks.
94
+
95
+ The public evaluation code computes:
96
+
97
+ - Accuracy for choice-style perception tasks.
98
+ - ROUGE-L and BLEU for open-ended reasoning tasks.
99
+
100
+ ## Intended Use
101
+
102
+ MERLIN is intended for research on electromagnetic signal understanding, multimodal language models, IQ signal perception, EM reasoning, and low-SNR robustness.
103
+
104
+ ## Limitations
105
+
106
+ - MERLIN is a research model and should not be used as the sole decision-maker in safety-critical, military, medical, legal, or other high-stakes settings.
107
+ - Performance may vary across signal domains, SNR distributions, sampling rates, and hardware environments.
108
+ - Users should verify all generated answers before operational use.
109
+
110
+ ## License
111
+
112
+ The MERLIN code is released under Apache License 2.0. This model card declares `apache-2.0` for the MERLIN release materials. Dataset licenses and base model licenses may impose additional terms.
113
+
114
+ Please check:
115
+
116
+ - [EM-134K dataset license](https://huggingface.co/datasets/eye1patch/EM-134K)
117
+ - [EM-Bench dataset license](https://huggingface.co/datasets/eye1patch/EM-Bench)
118
+ - The base LLM license
119
+
120
+ ## Citation
121
+
122
+ ```bibtex
123
+ @article{shen2026merlin,
124
+ title = {MERLIN: Building Low-SNR Robust Multimodal LLMs for Electromagnetic Signals},
125
+ author = {Shen, Junyu and She, Zhendong and Zhang, Chenghanyu and Sun, Yuchuang and Luo, Luqing and Tan, Dingwei and Guo, Zonghao and Guo, Bo and Han, Zehua and Xie, Wupeng and Mu, Yaxin and Zhang, Peng and Li, Peipei and Wang, Fengxiang and Sun, Yangang and Sun, Maosong},
126
+ journal = {arXiv preprint arXiv:2603.08174},
127
+ year = {2026}
128
+ }
129
+ ```
130
+