Video Recording Guide for FastVLM Screen Observer Demo
This guide provides detailed instructions for creating a professional demo video showcasing the FastVLM Screen Observer application.
πΉ Recording Setup
Required Tools
macOS
- Built-in: QuickTime Player or Screenshot app (Cmd+Shift+5)
- Professional: OBS Studio (free) - https://obsproject.com
Windows
- Built-in: Game Bar (Win+G) or Steps Recorder
- Professional: OBS Studio (free) - https://obsproject.com
Linux
- SimpleScreenRecorder:
sudo apt install simplescreenrecorder - OBS Studio: https://obsproject.com
Recommended Settings
| Setting | Value | Reason |
|---|---|---|
| Resolution | 1920x1080 | Standard HD |
| Frame Rate | 30 FPS | Smooth playback |
| Format | MP4 (H.264) | Wide compatibility |
| Audio | Include narration | Explain features |
| Duration | 2-3 minutes | Concise demo |
π¬ Demo Script
Pre-Recording Checklist
# 1. Clean environment
cd /Users/kmh/fastvlm-screen-observer
rm -rf logs/*.ndjson logs/frames/*
# 2. Start fresh instance
./start.sh
# 3. Wait for model to load
# Check: http://localhost:8000/model/status
# 4. Open browser tabs
# - http://localhost:5174 (main app)
# - http://localhost:8000/docs (API docs)
# - Terminal (showing startup)
Scene-by-Scene Script
Scene 1: Introduction (0:00-0:15)
VISUAL: Terminal showing ./start.sh command
ACTION: Show startup process
NARRATION:
"Welcome to FastVLM Screen Observer, a real-time screen monitoring
and analysis system powered by vision-language AI models. Let me
show you how it works."
Scene 2: Application Overview (0:15-0:30)
VISUAL: Browser at http://localhost:5174
ACTION: Hover over main sections
NARRATION:
"The application has three main sections: the control panel for
capture settings, the analysis panel showing AI results, and
real-time logs at the bottom."
Scene 3: First Screen Capture (0:30-1:00)
VISUAL: Click "Capture Screen" button
ACTION:
1. Show browser permission dialog
2. Select "Entire Screen"
3. Click "Share"
4. Click "Take Screenshot"
NARRATION:
"To capture your screen, simply click the Capture Screen button.
The browser will ask for permission - select what you want to share.
Once sharing is active, click Take Screenshot to analyze."
Scene 4: Analysis Results (1:00-1:30)
VISUAL: Analysis panel with results
ACTION:
1. Point to summary text
2. Scroll through UI elements
3. Show text snippets
4. Highlight any risk flags
NARRATION:
"The AI model instantly analyzes the screen, providing a summary,
detecting UI elements like buttons and forms, extracting visible text,
and identifying potential security risks."
Scene 5: Auto-Capture Mode (1:30-1:50)
VISUAL: Enable auto-capture checkbox
ACTION:
1. Check "Auto Capture"
2. Set interval to 5000ms
3. Show multiple captures happening
NARRATION:
"For continuous monitoring, enable auto-capture mode. Set your
preferred interval, and the system will automatically capture
and analyze at regular intervals."
Scene 6: Model Information (1:50-2:10)
VISUAL: Open http://localhost:8000/docs
ACTION:
1. Click on /model/status endpoint
2. Click "Try it out"
3. Execute and show response
NARRATION:
"The system currently uses the BLIP vision-language model, running
on Apple Silicon. You can check the model status and switch between
different models through the API."
Scene 7: Export Feature (2:10-2:30)
VISUAL: Back to main app
ACTION:
1. Click "Export Logs"
2. Show download notification
3. Open ZIP file
4. Show NDJSON logs
NARRATION:
"All captured data can be exported for analysis. The export includes
structured logs in NDJSON format and any captured thumbnails,
making it easy to review sessions later."
Scene 8: Conclusion (2:30-2:45)
VISUAL: Show app with multiple captures
ACTION: Overview shot of full interface
NARRATION:
"FastVLM Screen Observer provides powerful AI-driven screen analysis
for monitoring, testing, and security applications. Thank you for watching!"
π― Key Points to Showcase
Must Show
- Screen capture permission flow
- Real-time analysis results
- Auto-capture functionality
- Model status information
- Export capabilities
Nice to Have
- Error recovery (deny permission, then retry)
- Different screen/window/tab selection
- Browser compatibility info
- Multiple model comparison
π€ Narration Tips
- Clear and Concise: Speak clearly, avoid filler words
- Explain Actions: Describe what you're doing and why
- Highlight Benefits: Emphasize practical applications
- Professional Tone: Friendly but informative
- Practice First: Do a dry run before recording
π¨ Visual Guidelines
Screen Preparation
# Clean desktop - hide personal items
# Close unnecessary apps
# Use default browser theme
# Set screen resolution to 1920x1080
# Increase font sizes if needed for visibility
Mouse Movement
- Move deliberately, not frantically
- Pause on important elements
- Use smooth, predictable motions
- Highlight areas before clicking
Window Management
- Keep windows organized
- Avoid overlapping important content
- Use full screen when possible
- Close unnecessary tabs
π Post-Production
Basic Editing
- Trim: Remove dead space at beginning/end
- Cut: Remove any mistakes or long pauses
- Annotate: Add callouts for important features
- Captions: Add subtitles for accessibility
Tools for Editing
- iMovie (macOS): Free, basic editing
- DaVinci Resolve: Free, professional features
- OpenShot: Free, cross-platform
- Adobe Premiere: Paid, professional
Export Settings
Format: MP4
Codec: H.264
Resolution: 1920x1080
Bitrate: 5-10 Mbps
Audio: AAC, 128 kbps
π Quick Recording with OBS
OBS Scene Setup
1. Install OBS Studio
2. Create Scene: "FastVLM Demo"
3. Add Sources:
- Display Capture (main screen)
- Audio Input (microphone)
- Browser Source (optional overlay)
4. Settings:
- Output: 1920x1080, 30fps
- Recording: MP4, High Quality
- Audio: 128 kbps
OBS Hotkeys
Start Recording: Cmd+Shift+R
Stop Recording: Cmd+Shift+R
Pause: Cmd+Shift+P
π Sample Video Structure
00:00-00:05 - Title card
00:05-00:15 - Introduction with terminal
00:15-00:30 - Interface overview
00:30-01:00 - Screen capture demo
01:00-01:30 - Analysis results
01:30-01:50 - Auto-capture mode
01:50-02:10 - API and model info
02:10-02:30 - Export feature
02:30-02:45 - Conclusion
02:45-02:50 - End card
β Final Checklist
Before uploading your video:
- Duration is 2-3 minutes
- Audio is clear and synchronized
- All features are demonstrated
- No sensitive information visible
- Resolution is at least 720p
- File size is under 100MB
- Includes title and description
π€ Sharing Your Video
Recommended Platforms
- YouTube: Public or unlisted
- Vimeo: Professional presentation
- GitHub: Link in README
- Google Drive: For team sharing
Video Description Template
FastVLM Screen Observer - Demo Video
A real-time screen monitoring and analysis system powered by
vision-language AI models.
Features demonstrated:
- Browser-based screen capture
- AI-powered analysis using BLIP model
- Real-time UI element detection
- Auto-capture mode
- Data export functionality
GitHub: [your-repo-link]
Documentation: [docs-link]
Timestamps:
0:00 - Introduction
0:30 - Screen Capture
1:00 - Analysis Results
1:30 - Auto-Capture
2:10 - Export Feature
#AI #ComputerVision #ScreenCapture #VLM
π Troubleshooting Recording Issues
| Issue | Solution |
|---|---|
| Lag in recording | Lower resolution or framerate |
| No audio | Check microphone permissions |
| Large file size | Use H.264 compression |
| Black screen | Disable hardware acceleration |
| Permission errors | Run OBS as administrator |
π Additional Resources
Remember: The goal is to create a clear, professional demonstration that showcases the application's capabilities while being easy to follow. Keep it concise, informative, and engaging!