Converts video files (mp4, mkv, webm, avi) to Markdown documents using speech recognition.
Extracts audio with ffmpeg, transcribes with OpenAI Whisper, and generates structured Markdown
with timestamps. Use when the user wants to transcribe a video, convert video to text, generate
a video transcript, or create documentation from video content.
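
The three stages described above (ffmpeg audio extraction, Whisper transcription, timestamped Markdown output) could be sketched roughly as follows. This is a minimal illustration, not the tool's actual implementation: the function names, the 16 kHz mono WAV intermediate, the `base` model choice, and the `**[HH:MM:SS]**` line layout are all assumptions made for the example. It assumes `ffmpeg` is on the PATH and that the `openai-whisper` package is installed.

```python
import subprocess
from pathlib import Path


def format_timestamp(seconds: float) -> str:
    """Render an offset in seconds as HH:MM:SS (fractions are truncated)."""
    total = int(seconds)
    hours, remainder = divmod(total, 3600)
    minutes, secs = divmod(remainder, 60)
    return f"{hours:02d}:{minutes:02d}:{secs:02d}"


def build_ffmpeg_command(video: Path, audio: Path) -> list[str]:
    """Build an ffmpeg call that drops the video stream and writes mono 16 kHz audio."""
    return [
        "ffmpeg", "-y",        # overwrite the output file if it exists
        "-i", str(video),      # input video (mp4, mkv, webm, avi, ...)
        "-vn",                 # discard the video stream
        "-ac", "1",            # downmix to mono
        "-ar", "16000",        # resample to 16 kHz
        str(audio),
    ]


def segments_to_markdown(title: str, segments: list[dict]) -> str:
    """Turn Whisper-style segments (dicts with 'start' and 'text') into Markdown."""
    lines = [f"# {title}", ""]
    for seg in segments:
        stamp = format_timestamp(seg["start"])
        lines.append(f"**[{stamp}]** {seg['text'].strip()}")
        lines.append("")
    return "\n".join(lines).rstrip() + "\n"


def video_to_markdown(video_path: str, model_name: str = "base") -> str:
    """Hypothetical end-to-end helper: extract audio, transcribe, format."""
    import whisper  # imported lazily so the pure helpers work without it

    video = Path(video_path)
    audio = video.with_suffix(".wav")
    subprocess.run(build_ffmpeg_command(video, audio), check=True)
    result = whisper.load_model(model_name).transcribe(str(audio))
    return segments_to_markdown(video.stem, result["segments"])
```

A call such as `Path("talk.md").write_text(video_to_markdown("talk.mp4"))` would then write the transcript next to the source file. Keeping the formatting helpers free of ffmpeg and Whisper dependencies makes them easy to check in isolation.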