1 14 12

tmylla

Jie96

AI & ML interests

None yet

Recent Activity

upvoted a paper 1 day ago

MonitorBench: A Comprehensive Benchmark for Chain-of-Thought Monitorability in Large Language Models

upvoted a paper about 2 months ago

Frontier AI Risk Management Framework in Practice: A Risk Analysis Technical Report v1.5

upvoted a paper 2 months ago

DeepSight: An All-in-One LM Safety Toolkit

View all activity

Organizations

upvoted a paper 1 day ago

MonitorBench: A Comprehensive Benchmark for Chain-of-Thought Monitorability in Large Language Models

Paper • 2603.28590 • Published 15 days ago • 22

upvoted a paper about 2 months ago

Frontier AI Risk Management Framework in Practice: A Risk Analysis Technical Report v1.5

Paper • 2602.14457 • Published Feb 16 • 29

upvoted a paper 2 months ago

DeepSight: An All-in-One LM Safety Toolkit

Paper • 2602.12092 • Published Feb 12 • 15

upvoted a paper 3 months ago

AgentDoG: A Diagnostic Guardrail Framework for AI Agent Safety and Security

Paper • 2601.18491 • Published Jan 26 • 125

upvoted a collection 3 months ago

AgentDoG

Collection

A Diagnostic Guardrail Framework for AI Agent Safety and Security • 10 items • Updated 5 days ago • 107

upvoted a paper 6 months ago

LLMs Learn to Deceive Unintentionally: Emergent Misalignment in Dishonesty from Misaligned Samples to Biased Human-AI Interactions

Paper • 2510.08211 • Published Oct 9, 2025 • 22

upvoted a paper 8 months ago

Frontier AI Risk Management Framework in Practice: A Risk Analysis Technical Report

Paper • 2507.16534 • Published Jul 22, 2025 • 9

upvoted a collection about 1 year ago

SYNTHETIC-1

Collection

A collection of tasks & verifiers for reasoning datasets • 9 items • Updated Oct 7, 2025 • 67

liked 4 models about 1 year ago

upvoted an article over 1 year ago

Article

LLM数据工程3——数据收集魔法：获取顶级训练数据的方法

Jun 4, 2024

•

liked a model over 1 year ago

CyberNative/CyberBase-13b

Text Generation • Updated Jan 7 • 23 • 30

New activity in jun-yan/python_pwned_0.01 almost 2 years ago

Request for Model Details

#1 opened almost 2 years ago by

Jie96

upvoted an article almost 2 years ago

Article

Merge Large Language Models with mergekit

Jan 9, 2024

•

154

liked a Space almost 2 years ago

Machine Mindset

🐨

upvoted a paper about 2 years ago

The Era of 1-bit LLMs: All Large Language Models are in 1.58 Bits

Paper • 2402.17764 • Published Feb 27, 2024 • 628

liked a model about 2 years ago

nvidia/OpenMath-Mistral-7B-v0.1-hf

Text Generation • 7B • Updated Feb 16, 2024 • 122 • 35