Kunstwerk

A Python-based tool for generating parallel subtitle videos for opera performances, with synchronized original language and translated text.

This is the repo behind the YouTube channel of the same name!

Features

Automatic audio transcription using OpenAI's Whisper model
Vocal separation using Demucs
Text alignment between transcribed audio and libretto
Parallel subtitle video generation with original and translated text
Support for multiple languages
Configurable video output settings

Prerequisites

Python 3.8+
FFmpeg
yt-dlp
OpenAI API key

Installation

Clone the repository:

git clone <repository-url>
cd kunstwerk

Install dependencies:

pip install -r requirements.txt

Set up your OpenAI API key:

export OPENAI_API_KEY='your-api-key'

Usage

Example output: Tristan und Isolde with parallel subtitles

Create a YAML configuration file for your opera (see example configs):

title: TRISTAN UND ISOLDE
file_prefix: tristan
language: de
start_idx: 1
end_idx: 33
overture_indices: [1]
secondary_color: Silver
video_width: 3840
video_height: 2160
font_size: 96
res_divisor: 1
playlist_url: https://www.youtube.com/playlist?list=EXAMPLE

characters:
  - Tristan
  - Isolde
  # Add other characters...

Process the opera:

python kunstwerk.py configs/your_config.yaml

You can also skip certain steps if you've already completed them:

# Skip download/separation if you already have the audio files:
python kunstwerk.py configs/your_config.yaml --skip-download

# Skip transcription if you already have the transcriptions:
python kunstwerk.py configs/your_config.yaml --skip-transcribe

# Skip both download and transcription:
python kunstwerk.py configs/your_config.yaml --skip-download --skip-transcribe

Configuration Options

title: Opera title displayed in the video
file_prefix: Prefix for generated files
language: Source language code (e.g., 'de' for German)
start_idx/end_idx: Range of scenes to process
overture_indices: List of instrumental sections to skip
secondary_color: Color for translated text
video_width/video_height: Output video dimensions
font_size: Base font size
res_divisor: Resolution scaling factor
playlist_url: YouTube playlist URL for downloading
characters: List of character names for formatting

Project Structure

separate.sh: Downloads and separates audio
transcribe.py: Handles audio transcription
make_video.py: Generates the final video
align.py: Aligns transcribed text with libretto
config_parser.py: Parses YAML configuration
video_gen/: Video generation modules
- config/: Configuration classes
- frame/: Frame generation
- text/: Text formatting
- video/: Video creation

Name		Name	Last commit message	Last commit date
Latest commit History 49 Commits
configs		configs
libretti		libretti
notebooks		notebooks
transcribed		transcribed
video_gen		video_gen
.gitignore		.gitignore
README.md		README.md
align.py		align.py
classes.py		classes.py
config_parser.py		config_parser.py
kunstwerk.py		kunstwerk.py
make_video.py		make_video.py
parse_libretto.py		parse_libretto.py
parse_yaml.py		parse_yaml.py
requirements.txt		requirements.txt
separate.sh		separate.sh
transcribe.py		transcribe.py
transcribe_ass.py		transcribe_ass.py
translate.py		translate.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Kunstwerk

Features

Prerequisites

Installation

Usage

Configuration Options

Project Structure

About

Releases

Packages

Languages

jagilley/kunstwerk

Folders and files

Latest commit

History

Repository files navigation

Kunstwerk

Features

Prerequisites

Installation

Usage

Configuration Options

Project Structure

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages