Upload README.md
2696f3b verified - 1.57 kB Training in progress, step 300
- 298 Bytes Upload Dockerfile
- 3.53 kB Upload README.md
- 11.8 kB Training in progress, step 300
- 96.7 MB Training in progress, step 300
- 16.3 kB Training in progress, step 300
- 1.53 kB Upload evaluate_lcb.py
- 2.08 kB Upload merge_and_test.py
- 131 Bytes Upload requirements.txt
- 32.2 MB Training in progress, step 300
- 2.74 kB Training in progress, step 300
- 8.24 kB Fix train_ssd.py: use correct model class (AutoModelForImageTextToText), target language_model only for LoRA, fix deprecated torch_dtype, remove OOM-causing prepare_model_for_kbit_training
- 4.97 kB Add full SSD training script (non-QLoRA, for A100+)
- 6.13 kB Upload train_ssd_sft.py
training_args.bin Detected Pickle imports (10)
- "accelerate.utils.dataclasses.DistributedType",
- "trl.trainer.sft_config.SFTConfig",
- "transformers.trainer_utils.SaveStrategy",
- "transformers.trainer_utils.IntervalStrategy",
- "torch.device",
- "accelerate.state.PartialState",
- "transformers.trainer_utils.HubStrategy",
- "transformers.trainer_pt_utils.AcceleratorConfig",
- "transformers.training_args.OptimizerNames",
- "transformers.trainer_utils.SchedulerType"
How to fix it?
5.78 kB Training in progress, step 300