Daniil Laptev's picture

Daniil Laptev

dlaptev

·

AI & ML interests

None yet

Recent Activity

authored a paper 9 days ago

Small Vectors, Big Effects: A Mechanistic Study of RL-Induced Reasoning via Steering Vectors

authored a paper 9 days ago

Train One Sparse Autoencoder Across Multiple Sparsity Budgets to Preserve Interpretability and Accuracy

authored a paper 9 days ago

Unstable Features, Reproducible Subspaces: Understanding Seed Dependence in Sparse Autoencoders

View all activity

Organizations

None yet

authored 3 papers 9 days ago

Small Vectors, Big Effects: A Mechanistic Study of RL-Induced Reasoning via Steering Vectors

Paper • 2509.06608 • Published Sep 8, 2025

Train One Sparse Autoencoder Across Multiple Sparsity Budgets to Preserve Interpretability and Accuracy

Paper • 2505.24473 • Published May 30, 2025

Unstable Features, Reproducible Subspaces: Understanding Seed Dependence in Sparse Autoencoders

Paper • 2606.12138 • Published 15 days ago • 8

authored a paper 11 months ago

Teach Old SAEs New Domain Tricks with Boosting

Paper • 2507.12990 • Published Jul 17, 2025 • 12

authored a paper about 1 year ago

Train Sparse Autoencoders Efficiently by Utilizing Features Correlation

Paper • 2505.22255 • Published May 28, 2025 • 24

authored a paper over 1 year ago

Analyze Feature Flow to Enhance Interpretation and Steering in Language Models

Paper • 2502.03032 • Published Feb 5, 2025 • 60