Skip to content

feat(studio): add full video editing pipeline with filler removal#698

Open
Adn19900 wants to merge 1 commit intoheygen-com:mainfrom
Adn19900:feat/video-studio-pipeline
Open

feat(studio): add full video editing pipeline with filler removal#698
Adn19900 wants to merge 1 commit intoheygen-com:mainfrom
Adn19900:feat/video-studio-pipeline

Conversation

@Adn19900
Copy link
Copy Markdown

Summary

  • Adds studio/remove_fillers.py -- transcribes a video with Whisper and uses FFmpeg to cut out filler words (um, uh, like, you know, etc.). Supports dry-run, custom filler word lists, and all Whisper model sizes.
  • Adds studio/pipeline.py -- end-to-end orchestrator that accepts a raw video file (filler removal -> HyperFrames scaffold) or a plain-text script (TTS -> transcribe -> HyperFrames scaffold).
  • Adds studio/SETUP.md -- prerequisites guide (Node 22, Bun, FFmpeg, Whisper), pipeline usage examples, and quick-reference tables for Whisper models and TTS voices.

Test plan

  • python studio/remove_fillers.py sample.mp4 --dry-run prints detected filler segments without writing a file
  • python studio/remove_fillers.py sample.mp4 out.mp4 produces a video shorter than the input
  • python studio/pipeline.py sample.mp4 --project test-proj scaffolds a HyperFrames project with transcript.json
  • python studio/pipeline.py script.txt --project test-proj runs TTS -> transcribe -> scaffold

Generated with Claude Code

Adds a `studio/` directory with three files that together form a
complete raw-video-to-motion-graphics pipeline:

- `remove_fillers.py` — transcribes a video with Whisper, detects
  filler words (um, uh, like, you know, etc.), and uses FFmpeg to
  cut them out. Supports dry-run, custom filler lists, and all
  Whisper model sizes.

- `pipeline.py` — end-to-end orchestrator that accepts either a raw
  video file (filler removal → HyperFrames project scaffold) or a
  plain-text script (TTS narration → transcribe → HyperFrames
  project scaffold).

- `SETUP.md` — prerequisites guide (Node 22, Bun, FFmpeg, Whisper),
  pipeline usage examples, HyperFrames workflow commands, and a
  quick-reference table for Whisper models and TTS voices.

Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant