Python script to find the parts of the podcast you're interested in by using a list of keywords.
Go to file
2023-10-04 17:17:10 +02:00
.gitignore inital commit 2023-10-04 16:18:18 +02:00
app.py fix typo write 2023-10-04 17:17:10 +02:00
README.md add venv activation readme 2023-10-04 16:24:53 +02:00
requirements.txt inital commit 2023-10-04 16:18:18 +02:00

Podcast filter

This program takes a wav file and produces the transcript of the audio file. The goal is to be able to filter the parts of a podcast that you're interested in by using a keyword list. But it's still a work in progress.

Installation on GNU+linux

Step 1

Clone the repository

Step 2

Go to the folder of the repository and reate a virtual environment

python -m venv <name>

Step 3

Activate the environment

source <name>/bin/activate

Step 4

Install the requirements (there's a ton because we use the whisper engine for speech to text)

pip install -r requirements.txt

Usage

Change the name of the file you want to transcribe in the code, make sure it's on the same folder as the program (or give the path in the code). Run the code python app.py. Enjoy the transcript.