renqibing

AI & ML interests

large language model, trustworthy AI

Recent Activity

upvoted a paper 29 days ago

OpenSeeker: Democratizing Frontier Search Agents by Fully Open-Sourcing Training Data

published a dataset 9 months ago

renqibing/MultiAgentCollusion

updated a dataset 9 months ago

renqibing/MultiAgentCollusion

upvoted a paper 29 days ago

OpenSeeker: Democratizing Frontier Search Agents by Fully Open-Sourcing Training Data

Paper • 2603.15594 • Published 29 days ago • 149

upvoted 2 papers over 1 year ago

Derail Yourself: Multi-turn LLM Jailbreak Attack through Self-discovered Clues

Paper • 2410.10700 • Published Oct 14, 2024 • 3

CodeAttack: Revealing Safety Generalization Challenges of Large Language Models via Code Completion

Paper • 2403.07865 • Published Mar 12, 2024 • 1