LIMIT: Less Is More for Instruction Tuning Across Evaluation Paradigms Paper • 2311.13133 • Published Nov 22, 2023
MosaicBERT: A Bidirectional Encoder Optimized for Fast Pretraining Paper • 2312.17482 • Published Dec 29, 2023 • 1
Retrieval-Enhanced Machine Learning: Synthesis and Opportunities Paper • 2407.12982 • Published Jul 17, 2024 • 6
Perplexed by Perplexity: Perplexity-Based Data Pruning With Small Reference Models Paper • 2405.20541 • Published May 30, 2024 • 24
Pipelined Backpropagation at Scale: Training Large Models without Batches Paper • 2003.11666 • Published Mar 25, 2020
Memory Efficient 3D U-Net with Reversible Mobile Inverted Bottlenecks for Brain Tumor Segmentation Paper • 2104.09648 • Published Apr 19, 2021
RevBiFPN: The Fully Reversible Bidirectional Feature Pyramid Network Paper • 2206.14098 • Published Jun 28, 2022