Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Buckets new
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up

Palisade Research (Collaborators)

non-profit
Verified
https://palisaderesearch.org
palisadeai
PalisadeResearch
Activity Feed

AI & ML interests

None defined yet.

Ian McKenzie's profile pictureJeffrey Ladish's profile pictureBen Weinstein-Raun's profile pictureDmitrii's profile pictureReworr's profile pictureFedor Ryzhenkov's profile pictureArtem's profile pictureDmitrii (test)'s profile pictureKB's profile pictureNathan's profile pictureChris Cundy's profile pictureAkash Wasil's profile pictureAdam Gleave's profile pictureAbigail Haddad's profile pictureJohn Mein's profile picturebsrand's profile pictureViktor Petukhov's profile pictureRustem Turtayev's profile pictureSiddharth Narayanan's profile pictureCash Costello's profile picturejustin rokisky's profile pictureNikolaj's profile pictureLinda Petrini's profile pictureOleg Serikov's profile pictureSergey Koldyba's profile pictureJohn Steidley's profile picture

lindapetrini 
authored a paper 9 months ago

Inverse Scaling in Test-Time Compute

Paper • 2507.14417 • Published Jul 19, 2025 • 28
Reworr-R 
updated a dataset about 1 year ago

palisaderesearch/LLM-Honeypot-Logs

Updated about 18 hours ago • 31 • 7
dmitrii-palisaderesearch 
authored a paper over 1 year ago

Badllama 3: removing safety finetuning from Llama 3 in minutes

Paper • 2407.01376 • Published Jul 1, 2024
AdamGleave 
authored a paper over 2 years ago

Exploiting Novel GPT-4 APIs

Paper • 2312.14302 • Published Dec 21, 2023 • 14
jladish 
authored a paper over 2 years ago

LoRA Fine-tuning Efficiently Undoes Safety Training in Llama 2-Chat 70B

Paper • 2310.20624 • Published Oct 31, 2023 • 13
oserikov 
authored a paper over 2 years ago

The Languini Kitchen: Enabling Language Modelling Research at Different Scales of Compute

Paper • 2309.11197 • Published Sep 20, 2023 • 5
AdamGleave 
authored a paper over 2 years ago

Adversarial Policies Beat Superhuman Go AIs

Paper • 2211.00241 • Published Nov 1, 2022
AdamGleave 
authored a paper almost 3 years ago

Invariance in Policy Optimisation and Partial Identifiability in Reward Learning

Paper • 2203.07475 • Published Mar 14, 2022
Company
TOS Privacy About Careers
Website
Models Datasets Spaces Pricing Docs