trivia-transcriber/README.md
Grzegorz Matoga 6bf8175099 update
2025-06-04 15:52:16 +02:00

# Description
This project started as an attempt to run speaker diarization on a Mac.
As demo data, I used an episode downloaded with:
`yt-dlp https://vod.tvp.pl/programy,88/jeden-z-dziesieciu-odcinki,274174/odcinek-3492,S01E3492,2138001`
Rename the downloaded file to `jeden-z-10-final.mp4`, then run the script:
```bash
source .venv/bin/activate
python transcribe.py
```
## First time setup
```bash
git clone <your-repo>
cd <your-repo>
# create your own isolated venv (the name doesn't have to match)
python -m venv .venv
source .venv/bin/activate # or .venv\Scripts\activate on Windows
pip install --upgrade pip setuptools wheel
pip install -r requirements.txt
```
Make sure ffmpeg is installed and available on your PATH:
```bash
ffmpeg -version
```
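The same check can be done from Python before starting a long run. This is only a minimal sketch and not part of `transcribe.py`; the helper name `find_missing_tools` is hypothetical. It uses the standard library's `shutil.which` to look up executables on PATH.

```python
import shutil


def find_missing_tools(tools):
    """Return the names in `tools` that cannot be found on PATH.

    Hypothetical helper for illustration; it is not part of this repo.
    """
    # shutil.which returns None when no matching executable is on PATH
    return [name for name in tools if shutil.which(name) is None]


def require_tools(tools):
    """Raise a RuntimeError that names every missing tool, if any."""
    missing = find_missing_tools(tools)
    if missing:
        raise RuntimeError("Missing required tools on PATH: " + ", ".join(missing))
```

Calling `require_tools(["ffmpeg"])` at the top of a script would fail early with a readable message instead of a mid-run error.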
# Development
The project was developed with Gemini 2.5 Pro on a work account.
URL: https://gemini.google.com/app/f37af27be089d327?hl=pl
The chat is named "Transcription with Diarization on Mac".
I can't post a publicly accessible link, since the account is managed by Workspace.