Palisade Research (Collaborators)

non-profit

Verified

https://palisaderesearch.org

PalisadeResearch

AI & ML interests

None defined yet.

authored a paper 9 months ago

Inverse Scaling in Test-Time Compute

Paper • 2507.14417 • Published Jul 19, 2025 • 28

updated a dataset about 1 year ago

palisaderesearch/LLM-Honeypot-Logs

Updated about 18 hours ago • 31 • 7

dmitrii-palisaderesearch

authored a paper over 1 year ago

Badllama 3: removing safety finetuning from Llama 3 in minutes

Paper • 2407.01376 • Published Jul 1, 2024

authored a paper over 2 years ago

Exploiting Novel GPT-4 APIs

Paper • 2312.14302 • Published Dec 21, 2023 • 14

authored a paper over 2 years ago

LoRA Fine-tuning Efficiently Undoes Safety Training in Llama 2-Chat 70B

Paper • 2310.20624 • Published Oct 31, 2023 • 13

authored a paper over 2 years ago

The Languini Kitchen: Enabling Language Modelling Research at Different Scales of Compute

Paper • 2309.11197 • Published Sep 20, 2023 • 5

authored a paper over 2 years ago

Adversarial Policies Beat Superhuman Go AIs

Paper • 2211.00241 • Published Nov 1, 2022

authored a paper almost 3 years ago

Invariance in Policy Optimisation and Partial Identifiability in Reward Learning

Paper • 2203.07475 • Published Mar 14, 2022