Hugging Face
Models
Datasets
Spaces
Buckets
new
Docs
Enterprise
Pricing
Website
Tasks
HuggingChat
Collections
Languages
Organizations
Community
Blog
Posts
Daily Papers
Learn
Discord
Forum
GitHub
Solutions
Team & Enterprise
Hugging Face PRO
Enterprise Support
Inference Providers
Inference Endpoints
Storage Buckets
Log In
Sign Up
Building on HF
asisai
asis-ai
Follow
pbhappliedsystems's profile picture
1 follower
ยท
6 following
https://www.linkedin.com/in/asissr/
asissr
AI & ML interests
Image/video automation, generative AI production workflows
Recent Activity
reacted
to
pbhappliedsystems
's
post
with ๐
about 10 hours ago
๐ **New flagship dataset โ and an argument about what a dataset card should be.** Most synthetic datasets on the Hub ship row counts, a license, and little else โ pipeline opaque, rejection criteria unstated, compliance unaudited. We published the opposite. **SynthEval Cloud โ Regulated-Domain Synthetic Instruction Dataset** ๐ https://huggingface.co/datasets/pbhappliedsystems/syntheval-cloud-regulated-instruct-1k **1,116** quality-gated instruction records across **7 regulated domains** (medical, legal, GDPR, privacy, education, e-commerce, transport). Every record cleared a documented cascade, not a vibe check: - ๐งช **Dual-signal hallucination gate** โ rejects only when embedding cosine *and* keyword-overlap both fail; a low score alone never rejects. - ๐ **Layered PII masking + independent leak audit** โ a separate over-reporting scanner found **0.0% residual leak** across all 1,116 records. - ๐ **Whole-corpus evaluation, not a sample** โ MATTR **0.769**, mean cosine **0.73**, **0%** near-duplicates, **96.9%** yield. - ๐งพ **The 36 rejections ship too**, each tagged with its failing gate. Removal at the gate is the product; we show our work. Every number on the card is a field in the `evaluation_report.json` shipped beside the data โ full methodology + provenance (Mistral-Nemo AWQ W4A16 ยท vLLM 0.8.5.post1 ยท Modal A10G). One release from **SynthEval**: Studio (local GPU) + Cloud (Modal+vLLM), proving quality parity across substrates. ๐ Whitepaper: https://pbhappliedsystems.com/SynthEval_Studio_and_Cloud_Quality-Gated_Synthetic_Data_Generation.pdf ๐ Overview: https://pbhappliedsystems.com/synthetic-data.html **CC BY 4.0** โ commercial use welcome, just credit it. Need defensible synthetic data at scale? Let's talk. โ Patrick Hill, PBH Applied Systems
updated
a Space
about 13 hours ago
asis-ai/end-frame-background-lock
published
a Space
about 13 hours ago
asis-ai/end-frame-background-lock
View all activity
Organizations
spaces
2
Sort:ย Recently updated
pinned
Running
Agents
Video Background Stabilizer
๐ฝ
Video background stabilization and matting.
Running
Agents
End Frame Background Lock
๐
Lock background of a frame to match a reference image
models
0
None public yet
datasets
0
None public yet