Video Preprocessor for Mochi-1 Training

https://www.youtube.com/watch?v=6Y1_oJjB36o

A local video preprocessing tool designed specifically for preparing videos for the Mochi-1 model fine-tuning pipeline. This tool processes videos into the exact format required by Mochi-1, including proper segmentation, resolution, and frame rate. For detailed information about the Mochi-1 fine-tuning process, see our Fine-tuning Guide. I also provide this ComfyUI Workflow which uses Llava-OneVision-Qwen2 to process the first frame of each 2.5 second video, along with my "image-to-video" LLM base prompt. This produces detailed video captions for each video file Caption-Tools-OneVision-Video-Captioner-v30 using the video filename in .txt format. Then i added batched operation support Caption-Tools-OneVision-Video-Captioner-v50.

System Requirements

Python 3.8 or higher
Windows operating system
16GB RAM
Sufficient disk space for processed videos

Quick Start Guide

Initial Setup:
- Clone or download this repository
- Run install.bat to create the Python virtual environment
Video Preprocessing:
- Put your source videos in a dedicated input folder
- Run preprocess_videos.bat
- Follow the prompts to specify input and output folders
- The script will:
  - Split long videos into 2.5-second segments
  - Resize to 848x480 resolution
  - Convert to 30fps
  - Create caption placeholder files

Project Structure

project_root/
├── src/
│   ├── __init__.py
│   ├── preprocess_videos.py
│   └── gradio_app.py
├── install.bat          # First-time setup script
├── preprocess_videos.bat # Video preprocessing script
├── start_preprocessor_ui.bat # Web interface script
├── requirements.txt     # Python dependencies
├── README.md           # This file
└── .gitignore

Input Video Requirements

Format: MP4 or MOV files
Length: Any length (will be split into 2.5 second segments)
Resolution: Any resolution (will be processed internally)
Quality: Clear, well-lit videos work best
Location: Can be processed from any accessible folder

Video Preprocessing Details

The preprocessing script (preprocess_videos.bat) handles:

Video Segmentation:
- Splits videos longer than 2.5 seconds into multiple segments
- Each segment is exactly 2.5 seconds long
- Segments are named original_name_segment1.mp4, original_name_segment2.mp4, etc.
Format Standardization:
- Resolution: 848x480 pixels
- Frame rate: 30fps
- Format: MP4 with h264 encoding
- Audio is removed (not needed for training)
Caption File Handling:
- Creates matching .txt files for each video segment
- If original caption exists, copies it to each segment
- If no caption exists, creates empty placeholder files

Output Format

A .txt file is created for each video segment
Output file has the same name as the video file
Example:
- Input: video001.mp4 (7.5 seconds long)
- Outputs:
  - video001_segment1.mp4 + video001_segment1.txt
  - video001_segment2.mp4 + video001_segment2.txt
  - video001_segment3.mp4 + video001_segment3.txt

Installation Steps

First-Time Setup:
```
install.bat
```
This will:
- Create a Python virtual environment
- Install all required dependencies
- Set up the project structure
Verify Installation:
- Run preprocess_videos.bat to test video processing

Usage Examples

Preprocessing Videos:

1. Run preprocess_videos.bat
2. Enter input folder: C:\Videos\raw_videos
3. Enter output folder: C:\Videos\processed_videos

Troubleshooting

Virtual Environment Issues:
- Delete the venv folder
- Run install.bat again
Video Processing Errors:
- Check input video format (MP4 or MOV only)
- Ensure sufficient disk space for segments
- Verify write permissions in output folder
- Try processing shorter videos first for testing

Integration with Mochi-1

This tool is specifically designed to prepare videos for the Mochi-1 fine-tuning pipeline:

Preprocesses videos to exact requirements
Creates all necessary files in the proper structure
For detailed fine-tuning instructions, refer to our Fine-tuning Guide

Support

For issues and questions:

Check the troubleshooting section
Verify your system meets all requirements
Check your file paths and permissions
Review the fine-tuning guide for Mochi-1 specific issues

License

MIT License

Permission is hereby granted, free of charge, to any person obtaining a copy of this software and associated documentation files.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Video Preprocessor for Mochi-1 Training

System Requirements

Quick Start Guide

Project Structure

Input Video Requirements

Video Preprocessing Details

Output Format

Installation Steps

Usage Examples

Troubleshooting

Integration with Mochi-1

Support

License

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Name		Name	Last commit message	Last commit date
Latest commit History 20 Commits
images		images
src		src
tutorial		tutorial
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
install.bat		install.bat
preprocess_videos.bat		preprocess_videos.bat
project-manifest.md		project-manifest.md
requirements.txt		requirements.txt
start_preprocessor_ui.bat		start_preprocessor_ui.bat

Folders and files

Latest commit

History

Repository files navigation

Video Preprocessor for Mochi-1 Training

System Requirements

Quick Start Guide

Project Structure

Input Video Requirements

Video Preprocessing Details

Output Format

Installation Steps

Usage Examples

Troubleshooting

Integration with Mochi-1

Support

License

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages