PyPodcast

This project extracts speaker embeddings from podcasts. The embeddings support a range of uses, including building local conversational assistants that act as "experts" on the topic a given podcast covers, and comparing speaker or podcast similarity.
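As an illustration of the similarity use case above, here is a minimal sketch that compares two speaker embeddings with cosine similarity. It assumes embeddings are plain lists of floats with equal length; the function name `cosine_similarity` is hypothetical and not taken from this repository.

```python
import math
from typing import Sequence


def cosine_similarity(a: Sequence[float], b: Sequence[float]) -> float:
    """Return the cosine similarity of two equal-length embedding vectors."""
    if len(a) != len(b):
        raise ValueError("embeddings must have the same dimension")
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    # Assumption: a zero vector is treated as similar to nothing.
    if norm_a == 0.0 or norm_b == 0.0:
        return 0.0
    return dot / (norm_a * norm_b)
```

A score near 1 suggests the two embeddings point in nearly the same direction, while a score near 0 suggests they are unrelated. Real speaker embeddings would typically have a few hundred dimensions.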

The project also lets users search a podcast's episode titles by keyword, with three types of matching: exact, fuzzy, and similarity. Given a podcast name supplied by the user, the service scrapes the corresponding RSS feed and adds the podcast to a transcription DAG. The DAG is triggered for each episode, with the timing determined by the time delta between episodes in the RSS feed's XML file. Speaker embeddings are stored in a PostgreSQL database. This project is a work in progress.
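The three title-matching modes described above could be sketched roughly as follows. This is not the repository's implementation: the function names, the `threshold` parameter, and the choice of `difflib` for fuzzy matching and bag-of-words cosine for similarity are all assumptions made for illustration.

```python
import math
import re
from collections import Counter
from difflib import SequenceMatcher
from typing import List


def _tokens(text: str) -> List[str]:
    """Lowercase the text and split it into alphanumeric tokens."""
    return re.findall(r"[a-z0-9]+", text.lower())


def exact_match(keyword: str, title: str) -> bool:
    """Case-insensitive substring match."""
    return keyword.lower() in title.lower()


def fuzzy_score(keyword: str, title: str) -> float:
    """Best difflib ratio between the keyword and any single title token."""
    kw = keyword.lower()
    return max(
        (SequenceMatcher(None, kw, tok).ratio() for tok in _tokens(title)),
        default=0.0,
    )


def similarity_score(keyword: str, title: str) -> float:
    """Bag-of-words cosine similarity (a stand-in for embedding similarity)."""
    a, b = Counter(_tokens(keyword)), Counter(_tokens(title))
    dot = sum(a[t] * b[t] for t in a)
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


def search_titles(
    keyword: str, titles: List[str], mode: str = "exact", threshold: float = 0.8
) -> List[str]:
    """Return the titles that match the keyword under the chosen mode."""
    if mode == "exact":
        return [t for t in titles if exact_match(keyword, t)]
    if mode == "fuzzy":
        return [t for t in titles if fuzzy_score(keyword, t) >= threshold]
    if mode == "similarity":
        return [t for t in titles if similarity_score(keyword, t) >= threshold]
    raise ValueError(f"unknown mode: {mode!r}")
```

For example, `search_titles("lerning", titles, mode="fuzzy")` would tolerate the misspelling, whereas the exact mode would not. A production version of the similarity mode would more likely compare text embeddings than token counts.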

.github/workflows
api
dags
operators
postgres-init
src
whisperx
.dockerignore
.gitignore
Dockerfile
README.md
airflow.cfg
docker-compose.yml
entrypoint.sh
pip
requirements.txt
wait-for-it.sh

Aviral-A/PyPodcast