Running on Zero Featured 99 SAM3 Video Segmentation 🐠 99 Track and label objects in videos using text prompts or clicks
YOLO-World: Real-Time Open-Vocabulary Object Detection Paper • 2401.17270 • Published Jan 30, 2024 • 43
microsoft/Phi-4-multimodal-instruct Automatic Speech Recognition • 6B • Updated about 1 month ago • 172k • 1.56k