How to use Remade-AI/Zoom-Call with Diffusers:
pip install -U diffusers transformers accelerate
import torch
from diffusers import DiffusionPipeline
from diffusers.utils import export_to_video
# switch to "mps" for apple devices
pipe = DiffusionPipeline.from_pretrained("Wan-AI/Wan2.1-T2V-14B", dtype=torch.bfloat16, device_map="cuda")
pipe.load_lora_weights("Remade-AI/Zoom-Call")
prompt = "The video shows a [z00m_ca11] with four participants. In the top left box, a medieval knight in full armor adjusts his helmet. To his right, a pirate with a parrot on his shoulder drinks from a mug. In the bottom left, a scientist in a lab coat scribbles on a whiteboard. In the bottom right, an alien in a suit waves awkwardly."
output = pipe(prompt=prompt).frames[0]
export_to_video(output, "output.mp4")
Zoom Call Style LoRA for Wan2.1 14B T2V
Overview
This LoRA is trained on the Wan2.1 14B T2V model and lets you generate videos of Zoom calls featuring whatever characters you want!
Features
- Trained on the Wan2.1 14B T2V base model
- Consistent results across different object types
- Simple prompt structure that's easy to adapt
Community
- Discord: Join our community to generate videos with this LoRA for free
- Request LoRAs: We're training and open-sourcing Wan2.1 LoRAs for free - join our Discord to make requests!
Example Prompts
- The video shows a [z00m_ca11] with four participants. In the top left box, a medieval knight in full armor adjusts his helmet. To his right, a pirate with a parrot on his shoulder drinks from a mug. In the bottom left, a scientist in a lab coat scribbles on a whiteboard. In the bottom right, an alien in a suit waves awkwardly.
- The video shows a [z00m_ca11] with three participants. In the top left box, a centaur in business attire is seated at a large wooden desk. The top right box shows a wizard with a long beard reviewing spreadsheets. The bottom box shows a velociraptor wearing glasses, sipping coffee and nodding seriously.
- The video shows a [z00m_ca11] with four participants. In the top left, a chef covered in flour frantically checks a recipe. To the right, a yoga instructor sits calmly with candles lit. The bottom left shows a DJ with headphones bobbing their head. The bottom right shows a firefighter in full gear, sipping coffee.
- The video shows a [z00m_ca11] with three participants in a 3x3 grid formation. The first person in the top left is a cat wearing glasses, sitting in front of a computer. The second person has a hood and looks down. The third person is a dog wearing a tie, attentively watching the screen.
Model File and Inference Workflow
📥 Download Links:
- zoom_call_10_epochs.safetensors - LoRA Model File
- wan_txt2vid_lora_workflow.json - Wan T2V with LoRA Workflow for ComfyUI
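The two files listed above can also be fetched programmatically. The following is a minimal sketch, assuming the `huggingface_hub` package is installed and that both files sit at the root of the Remade-AI/Zoom-Call repository (the exact paths inside the repository are an assumption; adjust `FILES` if they live in a subfolder). The helper name `download_files` and the `zoom_call_lora` target directory are illustrative only.

```python
REPO_ID = "Remade-AI/Zoom-Call"

# File names as listed in the download links; their location at the
# repository root is an assumption.
FILES = (
    "zoom_call_10_epochs.safetensors",
    "wan_txt2vid_lora_workflow.json",
)


def download_files(local_dir="zoom_call_lora"):
    """Download the LoRA weights and the ComfyUI workflow; return local paths."""
    # Imported lazily so the constants above can be used without the package.
    from huggingface_hub import hf_hub_download

    return [
        hf_hub_download(repo_id=REPO_ID, filename=name, local_dir=local_dir)
        for name in FILES
    ]
```

Calling `download_files()` returns the local path of each file, which can then be copied into the ComfyUI LoRA and workflow folders.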
Recommended Settings
- LoRA Strength: 1.0
- Embedded Guidance Scale: 6.0
- Flow Shift: 5.0
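For readers using Diffusers rather than ComfyUI, the settings above might translate roughly as follows. This is a sketch under stated assumptions, not the authors' workflow: it assumes the LoRA was loaded with `pipe.load_lora_weights("Remade-AI/Zoom-Call", adapter_name="zoom_call")` (the adapter name is hypothetical), that the pipeline's scheduler config is compatible with `UniPCMultistepScheduler` and its `flow_shift` option, and that "Embedded Guidance Scale" corresponds to the pipeline's `guidance_scale` argument. `RecommendedSettings` and `apply_settings` are illustrative names.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class RecommendedSettings:
    lora_strength: float = 1.0
    guidance_scale: float = 6.0  # assumed equivalent of "Embedded Guidance Scale"
    flow_shift: float = 5.0


def apply_settings(pipe, settings=None, adapter_name="zoom_call"):
    """Set flow shift and LoRA strength on a loaded pipeline.

    Returns keyword arguments to pass to the pipeline call.
    """
    # Imported lazily so the dataclass is usable without diffusers installed.
    from diffusers import UniPCMultistepScheduler

    settings = settings or RecommendedSettings()
    # Rebuild the scheduler from the existing config, overriding flow_shift.
    pipe.scheduler = UniPCMultistepScheduler.from_config(
        pipe.scheduler.config, flow_shift=settings.flow_shift
    )
    # Scale the named LoRA adapter (the name must match load_lora_weights).
    pipe.set_adapters([adapter_name], adapter_weights=[settings.lora_strength])
    return {"guidance_scale": settings.guidance_scale}
```

A call would then look like `pipe(prompt=prompt, **apply_settings(pipe)).frames[0]`.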
Trigger Words
The key trigger phrase is: [z00m_ca11]
Prompt Template
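For prompting, follow the structure of the example prompts: state that the video shows a [z00m_ca11] with a given number of participants, then describe each participant by their position in the call grid (for example, "In the top left box, ..."). This way of prompting seems to work very well.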
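The structure shared by the example prompts can be captured in a small helper. This is only a sketch of one way to assemble such prompts; `build_zoom_prompt` and its argument format are hypothetical and not part of the model or any library.

```python
TRIGGER = "[z00m_ca11]"

_NUMBER_WORDS = {1: "one", 2: "two", 3: "three", 4: "four", 5: "five", 6: "six"}


def build_zoom_prompt(participants):
    """Build a prompt from (position, description) pairs.

    Example pair: ("top left box", "a medieval knight adjusts his helmet").
    """
    if not participants:
        raise ValueError("at least one participant is required")
    n = len(participants)
    count = _NUMBER_WORDS.get(n, str(n))
    noun = "participant" if n == 1 else "participants"
    sentences = [f"The video shows a {TRIGGER} with {count} {noun}."]
    for position, description in participants:
        # Drop any trailing period so each sentence ends with exactly one.
        sentences.append(f"In the {position}, {description.rstrip('.')}.")
    return " ".join(sentences)


prompt = build_zoom_prompt([
    ("top left box", "a medieval knight in full armor adjusts his helmet"),
    ("bottom right", "an alien in a suit waves awkwardly"),
])
print(prompt)
```

The resulting string can be passed as the `prompt` argument of the pipeline call or pasted into the ComfyUI text prompt node.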
ComfyUI Workflow
This LoRA works with a modified version of Kijai's Wan Video Wrapper workflow. The main modification is adding a Wan LoRA node connected to the base model.
See the download links above for the modified workflow.
Model Information
The model weights are available in Safetensors format; see the download links above.
Training Details
- Base Model: Wan2.1 14B T2V
- Training Data: 2 minutes of video from various Zoom call recordings, split into 28 short clips (each clip captioned separately).
- Epochs: 10
Additional Information
Training was done using Diffusion Pipe.
Acknowledgments
Special thanks to Kijai for the ComfyUI Wan Video Wrapper and tdrussell for the training scripts!
Model tree for Remade-AI/Zoom-Call
Base model
Wan-AI/Wan2.1-T2V-14B