WTF GENIUS PAPERS Papers that made me appreciate my major and my life a little more. obs=Observation, innov=Innovation. Most papers are abt improving tiny models. Continuous Latent Diffusion Language Model Paper ⢠2605.06548 ⢠Published 18 days ago ⢠78 Scaling Latent Reasoning via Looped Language Models Paper ⢠2510.25741 ⢠Published Oct 29, 2025 ⢠230 Scaling up Test-Time Compute with Latent Reasoning: A Recurrent Depth Approach Paper ⢠2502.05171 ⢠Published Feb 7, 2025 ⢠156 Pretraining Language Models to Ponder in Continuous Space Paper ⢠2505.20674 ⢠Published May 27, 2025 ⢠3
Scaling Latent Reasoning via Looped Language Models Paper ⢠2510.25741 ⢠Published Oct 29, 2025 ⢠230
Scaling up Test-Time Compute with Latent Reasoning: A Recurrent Depth Approach Paper ⢠2502.05171 ⢠Published Feb 7, 2025 ⢠156
Pretraining Language Models to Ponder in Continuous Space Paper ⢠2505.20674 ⢠Published May 27, 2025 ⢠3
HUMAN-WRITTEN & LEGALLY-SOURCED* Datasets written by humans and/or reverse-engineered from text with deterministic algorithms. No illegal scraping or unethical synthesis *...mostly. BramVanroy/CommonCrawl-CreativeCommons Viewer ⢠Updated Aug 28, 2025 ⢠739M ⢠7.7k ⢠34 PleIAs/common_corpus Viewer ⢠Updated 19 days ago ⢠69.9k ⢠155k ⢠400 common-pile/comma_v0.1_training_dataset Viewer ⢠Updated Jun 6, 2025 ⢠784M ⢠11.4k ⢠40 crumb/openstax-text Viewer ⢠Updated Jul 14, 2023 ⢠3.35M ⢠1.9k ⢠5
WTF GENIUS PAPERS Papers that made me appreciate my major and my life a little more. obs=Observation, innov=Innovation. Most papers are abt improving tiny models. Continuous Latent Diffusion Language Model Paper ⢠2605.06548 ⢠Published 18 days ago ⢠78 Scaling Latent Reasoning via Looped Language Models Paper ⢠2510.25741 ⢠Published Oct 29, 2025 ⢠230 Scaling up Test-Time Compute with Latent Reasoning: A Recurrent Depth Approach Paper ⢠2502.05171 ⢠Published Feb 7, 2025 ⢠156 Pretraining Language Models to Ponder in Continuous Space Paper ⢠2505.20674 ⢠Published May 27, 2025 ⢠3
Scaling Latent Reasoning via Looped Language Models Paper ⢠2510.25741 ⢠Published Oct 29, 2025 ⢠230
Scaling up Test-Time Compute with Latent Reasoning: A Recurrent Depth Approach Paper ⢠2502.05171 ⢠Published Feb 7, 2025 ⢠156
Pretraining Language Models to Ponder in Continuous Space Paper ⢠2505.20674 ⢠Published May 27, 2025 ⢠3
HUMAN-WRITTEN & LEGALLY-SOURCED* Datasets written by humans and/or reverse-engineered from text with deterministic algorithms. No illegal scraping or unethical synthesis *...mostly. BramVanroy/CommonCrawl-CreativeCommons Viewer ⢠Updated Aug 28, 2025 ⢠739M ⢠7.7k ⢠34 PleIAs/common_corpus Viewer ⢠Updated 19 days ago ⢠69.9k ⢠155k ⢠400 common-pile/comma_v0.1_training_dataset Viewer ⢠Updated Jun 6, 2025 ⢠784M ⢠11.4k ⢠40 crumb/openstax-text Viewer ⢠Updated Jul 14, 2023 ⢠3.35M ⢠1.9k ⢠5