Running on CPU Upgrade 218 The Synthetic Data Playbook: Generating Trillions of the Finest Tokens 📝 218 Explore synthetic data experiments on a virtual bookshelf
PII & De-Identification Collection Models for extracting PII entities and de-identifying clinical text, with support for HIPAA and GDPR compliance. • 278 items • Updated about 1 month ago • 35
OpenMed/OpenMed-PII-BioClinicalModern-Large-395M-v1 Token Classification • 0.4B • Updated Jan 13 • 16.5k • • 9
AstroBench Collection Datasets to evaluate LLMs/SLMs in astronautics and space mission engineering • 1 item • Updated Jan 5