meta-llama/Llama-4-Scout-17B-16E-Instruct Image-Text-to-Text • 109B • Updated May 22, 2025 • 461k • • 1.3k
Qwen/Qwen2.5-VL-7B-Instruct Image-Text-to-Text • 8B • Updated Apr 6, 2025 • 6.41M • • 1.56k
3loi/SER-Odyssey-Baseline-WavLM-Categorical Audio Classification • 0.3B • Updated Jun 12, 2024 • 2.05k • 10
3loi/SER-Odyssey-Baseline-WavLM-Multi-Attributes Audio Classification • 0.3B • Updated Jun 12, 2024 • 2.88k • 11
3loi/SER-Odyssey-Baseline-WavLM-Arousal Audio Classification • 0.3B • Updated Jun 12, 2024 • 127 • 2
3loi/SER-Odyssey-Baseline-WavLM-Valence Audio Classification • 0.3B • Updated Jun 12, 2024 • 245 • 1
3loi/SER-Odyssey-Baseline-WavLM-Dominance Audio Classification • 0.3B • Updated Jun 12, 2024 • 33 • 1
google/vit-base-patch16-224-in21k Image Feature Extraction • 86.4M • Updated Feb 5, 2024 • 1.98M • 410
3loi/SER-Odyssey-Baseline-WavLM-Categorical Audio Classification • 0.3B • Updated Jun 12, 2024 • 2.05k • 10