UniSurg: A Video-Native Foundation Model for Universal Understanding of Surgical Videos
Paper • 2602.05638 • Published • 8
SurgMotion-L is the ViT-Large checkpoint of SurgMotion, a video-native foundation model for universal understanding of surgical videos.
This repository contains a single released checkpoint so that the model is structured as a standard single-model Hugging Face repository.
config.json: minimal repository config for Hub compatibilitySurgMotion-vitl.safetensors: recommended weights in safetensors formatSurgMotion-vitl.pt: original released PyTorch checkpoint retained for compatibilityDownload the repository config and safetensors checkpoint with huggingface_hub:
from huggingface_hub import hf_hub_download
config_path = hf_hub_download(
repo_id="CAIR-HKISI/SurgMotion-vitl",
filename="config.json",
)
weights_path = hf_hub_download(
repo_id="CAIR-HKISI/SurgMotion-vitl",
filename="SurgMotion-vitl.safetensors",
)
If your existing code expects the original PyTorch checkpoint format, keep using SurgMotion-vitl.pt.