FLAME: Factuality-Aware Alignment for Large Language Models • Paper 2405.01525 • Published May 2, 2024
Learning to Align, Aligning to Learn: A Unified Approach for Self-Optimized Alignment • Paper 2508.07750 • Published Aug 11, 2025
Towards Scalable Automated Alignment of LLMs: A Survey • Paper 2406.01252 • Published Jun 3, 2024
Alignment Is Not Sufficient to Prevent Large Language Models from Generating Harmful Information: A Psychoanalytic Perspective • Paper 2311.08487 • Published Nov 14, 2023
AI and Safety Collection • We have published in several top NLP/AI conferences, including ACL, EMNLP, AAAI, and ICWSM • 11 items • Updated Oct 16, 2025