Peter Szemraj PRO

pszemraj

https://pszemraj.carrd.co/

AI & ML interests

metallic intuition

Recent Activity

updated a model about 4 hours ago

pszemraj/franken-gemma-4-dense-1b-untrained

published a model about 4 hours ago

pszemraj/franken-gemma-4-dense-1b-untrained

upvoted a paper about 4 hours ago

How to Fine-Tune a Reasoning Model? A Teacher-Student Cooperation Framework to Synthesize Student-Consistent SFT Data

View all activity

Organizations

upvoted a paper about 4 hours ago

How to Fine-Tune a Reasoning Model? A Teacher-Student Cooperation Framework to Synthesize Student-Consistent SFT Data

Paper • 2604.14164 • Published 26 days ago • 16

upvoted 2 papers 20 days ago

MSA: Memory Sparse Attention for Efficient End-to-End Memory Model Scaling to 100M Tokens

Paper • 2603.23516 • Published Mar 6 • 48

SlopCodeBench: Benchmarking How Coding Agents Degrade Over Long-Horizon Iterative Tasks

Paper • 2603.24755 • Published 23 days ago • 30

upvoted a paper about 1 month ago

Effective Distillation to Hybrid xLSTM Architectures

Paper • 2603.15590 • Published Mar 16 • 33

upvoted 3 papers about 2 months ago

upvoted a collection about 2 months ago

Nemotron-Terminal

Collection

We are releasing Nemotron-Terminal models and training datasets. • 5 items • Updated 2 days ago • 34

upvoted 4 papers about 2 months ago

Agents of Chaos

Paper • 2602.20021 • Published Feb 23 • 35

Revisiting the Platonic Representation Hypothesis: An Aristotelian View

Paper • 2602.14486 • Published Feb 16 • 12

Does Socialization Emerge in AI Agent Society? A Case Study of Moltbook

Paper • 2602.14299 • Published Feb 15 • 27

Reinforced Fast Weights with Next-Sequence Prediction

Paper • 2602.16704 • Published Feb 18 • 14

upvoted a collection 2 months ago

Health AI Developer Foundations (HAI-DEF)

Collection

Groups models released for use in health AI by Google. Read more about HAI-DEF at http://goo.gle/hai-def • 22 items • Updated Mar 12 • 211

upvoted 7 papers 2 months ago

Judging What We Cannot Solve: A Consequence-Based Approach for Oracle-Free Evaluation of Research-Level Math

Paper • 2602.06291 • Published Feb 6 • 23

Revisiting the Shape Convention of Transformer Language Models

Paper • 2602.06471 • Published Feb 6 • 4

Length-Unbiased Sequence Policy Optimization: Revealing and Controlling Response Length Variation in RLVR

Paper • 2602.05261 • Published Feb 5 • 52

Horizon-LM: A RAM-Centric Architecture for LLM Training

Paper • 2602.04816 • Published Feb 4 • 20

Quartet II: Accurate LLM Pre-Training in NVFP4 by Improved Unbiased Gradient Estimation

Paper • 2601.22813 • Published Jan 30 • 60

Linear representations in language models can change dramatically over a conversation

Paper • 2601.20834 • Published Jan 28 • 21

Do Reasoning Models Enhance Embedding Models?

Paper • 2601.21192 • Published Jan 29 • 26

Peter Szemraj PRO

AI & ML interests

Recent Activity

Organizations

pszemraj's activity