title: Generative Video Editor
short_description: Uses natural language to generate and control edits
emoji: 💻
colorFrom: green
colorTo: green
sdk: gradio
sdk_version: 6.2.0
app_file: app.py
pinned: true
disable_embedding: false
models:
  - zai-org/GLM-4.7-Flash
tags:
  - ffmpeg
  - video-editing
  - natural-language
  - ai-video
  - no-code
  - video-processing
  - media-converter
  - slideshow-maker
  - video-merger
  - command-generator
  - waveform-visualization
  - audio-to-video
  - image-to-video

🏞 AI Video Composer - FFmpeg in Plain English

Describe what you want in plain English, like "create a slideshow from these images with background music" or "add a waveform visualization to this audio", and the AI generates the FFmpeg command to make it happen.
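
To give a feel for the kind of command the model is asked to produce, here is a minimal sketch of an FFmpeg invocation for the slideshow case. The helper name `slideshow_args`, the numbered filename pattern, and the 30 fps / libx264 output settings are assumptions made for illustration; the command the app actually generates may differ.

```python
import shlex
from typing import List


def slideshow_args(pattern: str, seconds_per_image: float, output: str) -> List[str]:
    """Build FFmpeg arguments for a simple slideshow (hypothetical helper).

    `pattern` is a numbered image pattern such as "img%03d.png".
    """
    if seconds_per_image <= 0:
        raise ValueError("seconds_per_image must be positive")
    # One image every N seconds means an input frame rate of 1/N.
    input_rate = 1 / seconds_per_image
    return [
        "ffmpeg",
        "-framerate", f"{input_rate:g}",  # how fast the images are read
        "-i", pattern,
        "-c:v", "libx264",                # assumed codec for MP4 output
        "-r", "30",                       # assumed output frame rate
        "-pix_fmt", "yuv420p",            # widely compatible pixel format
        output,
    ]


if __name__ == "__main__":
    # Print the command as a shell-style string for inspection.
    print(shlex.join(slideshow_args("img%03d.png", 1, "output.mp4")))
```

Keeping the command as a list of arguments rather than one string makes it easy to pass to `subprocess.run` without going through a shell.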

How It Works

1. Upload Media Files:
   - Supports multiple file formats, including:
     - Images: .png, .jpg, .jpeg, .tiff, .bmp, .gif, .svg
     - Audio: .mp3, .wav, .ogg
     - Video: .mp4, .avi, .mov, .mkv, .flv, .wmv, .webm, and more
   - File size limit: 10 MB per file
   - Video duration limit: 2 minutes
2. Provide Instructions:
   - Write natural language instructions describing how you want to process your media
   - Examples:
     - "Convert these images into a slideshow with 1 second per image"
     - "Add this audio track to the video"
     - "Make the video play 2x faster"
     - "Create a waveform visualization for this audio file"
3. Advanced Parameters:
   - Top-p (nucleus sampling): Controls diversity of generated commands (0-1)
   - Temperature: Controls randomness in command generation (0-5)
4. Processing:
   - The app analyzes your files and instructions
   - Generates an optimized FFmpeg command using your chosen AI model
   - Executes the command and returns the processed video
   - Displays the generated FFmpeg command for transparency
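
As a rough illustration of the Processing step, the sketch below turns a model reply into an argument list that could be executed and displayed. The function name `parse_ffmpeg_command` is hypothetical and is not taken from app.py; it also assumes the model replies with a single shell-style command line, possibly wrapped in backticks.

```python
import shlex
from typing import List


def parse_ffmpeg_command(reply: str) -> List[str]:
    """Split a model reply into FFmpeg arguments (illustrative only)."""
    # Assumption: the reply is one command line, optionally wrapped in backticks.
    text = reply.strip().strip("`").strip()
    # shlex.split honours shell-style quoting, so quoted filter
    # expressions stay together as one argument.
    args = shlex.split(text)
    if not args or args[0] != "ffmpeg":
        raise ValueError("expected a command starting with 'ffmpeg'")
    return args


if __name__ == "__main__":
    reply = '`ffmpeg -i input.mp4 -filter:v "setpts=0.5*PTS" -filter:a "atempo=2.0" output.mp4`'
    args = parse_ffmpeg_command(reply)
    # The same list can be shown to the user and passed to subprocess.run.
    print(shlex.join(args))
```

The example reply corresponds to the "Make the video play 2x faster" instruction: `setpts=0.5*PTS` halves the video timestamps and `atempo=2.0` doubles the audio tempo.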

Features

- Smart Command Generation: Generates FFmpeg commands from natural language input
- Error Handling: Validates commands before execution and retries with alternative approaches if needed
- Multiple Asset Support: Processes multiple media files in a single operation
- Waveform Visualization: Supports audio visualization with customizable parameters
- Image Sequence Processing: Handles image sequences for slideshow creation
- Format Conversion: Converts between a range of input and output formats
- Example Gallery: Includes built-in examples demonstrating common use cases
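
The validate-and-retry behaviour described under Error Handling can be sketched as a small loop. This is a sketch under stated assumptions, not the app's actual logic: `generate` and `validate` are hypothetical callables standing in for the model call and the command check, and the limit of three attempts is arbitrary.

```python
from typing import Callable, List, Optional, Tuple


def generate_with_retry(
    generate: Callable[[Optional[str]], str],
    validate: Callable[[str], Optional[str]],
    max_attempts: int = 3,
) -> Tuple[str, List[str]]:
    """Ask for a command until one passes validation (illustrative only).

    `generate` receives the previous error message (or None on the first
    attempt) and returns a candidate command. `validate` returns None when
    the command is acceptable, otherwise a description of the problem.
    """
    errors: List[str] = []
    feedback: Optional[str] = None
    for _ in range(max_attempts):
        command = generate(feedback)
        problem = validate(command)
        if problem is None:
            return command, errors
        # Remember the failure and pass it back so the next attempt can differ.
        errors.append(problem)
        feedback = problem
    raise RuntimeError(f"no valid command after {max_attempts} attempts: {errors}")
```

Feeding the previous error back into the generator is one way to get an "alternative approach" on the next attempt; a real validator might, for example, do a dry run of the command.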

Technical Details

- Built with Gradio for the user interface
- Uses FFmpeg for media processing
- Powered by GLM-4.7 or Kimi-K2 for command generation
- Implements robust error handling and command validation
- Processes files in a temporary directory for safety
- Supports both simple operations and complex media transformations
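
Processing files in a temporary directory might look roughly like the following. The helper `run_in_temp_dir` is hypothetical and only illustrates the idea: inputs are copied into a scratch directory, the command runs there, and the directory is deleted afterwards.

```python
import shutil
import subprocess
import tempfile
from pathlib import Path
from typing import Sequence


def run_in_temp_dir(command: Sequence[str], inputs: Sequence[Path], output_name: str) -> bytes:
    """Run a command in a throwaway directory and return the output file's bytes."""
    with tempfile.TemporaryDirectory() as tmp:
        workdir = Path(tmp)
        # Copy inputs so the command can refer to them by bare file name
        # and cannot touch the originals.
        for src in inputs:
            shutil.copy(src, workdir / Path(src).name)
        # An argument list with no shell; check=True raises on a non-zero exit code.
        subprocess.run(list(command), cwd=workdir, check=True, capture_output=True)
        return (workdir / output_name).read_bytes()
    # The temporary directory and everything in it are removed on exit.
```

With FFmpeg installed, a call could look like `run_in_temp_dir(["ffmpeg", "-i", "clip.mov", "output.mp4"], [Path("clip.mov")], "output.mp4")`.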

Limitations

- Maximum file size: 10 MB per file
- Maximum video duration: 2 minutes
- Output format: Always MP4
- Processing time may vary based on input complexity
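
The size and duration limits above could be enforced with a check along these lines. The names are hypothetical, and treating "10 MB" as 10 × 1024 × 1024 bytes is an assumption; the app may count megabytes differently.

```python
from typing import List, Optional

# Assumption: "10 MB" is interpreted as 10 MiB.
MAX_FILE_SIZE_BYTES = 10 * 1024 * 1024
MAX_VIDEO_DURATION_SECONDS = 2 * 60


def check_limits(size_bytes: int, duration_seconds: Optional[float] = None) -> List[str]:
    """Return a list of limit violations; an empty list means the file is accepted.

    `duration_seconds` is None for files without a duration, such as images.
    """
    problems: List[str] = []
    if size_bytes > MAX_FILE_SIZE_BYTES:
        problems.append("file is larger than 10 MB")
    if duration_seconds is not None and duration_seconds > MAX_VIDEO_DURATION_SECONDS:
        problems.append("video is longer than 2 minutes")
    return problems
```

The duration itself would have to come from a probe of the file, for example with `ffprobe`; that step is left out here to keep the sketch self-contained.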