Python-Chat-with-audio

Chat with audio interviews

This ongoing project aims to develop a chat app that lets users upload an audio file, view its transcript as a speaker-labelled conversation in the form 'speaker_1: ... speaker_2: ...', and ask questions about the interview.

Project Structure

The project is divided into five main parts:

Upload the audio file and convert it to .wav format with ffmpeg if it is not already a .wav file.
Diarize the audio file with the pyannote model to identify the speakers.
Extract the text from the audio with the Whisper model.
Match the text segments produced by Whisper with the timing of the diarization.
Display the extracted text in chat format and allow Q&A on it.
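
The conversion, matching, and display steps above can be sketched in plain Python. This is only an illustrative sketch, not the code from the notebook: the tuple shapes for diarization turns and transcript segments, the function names, the 16 kHz mono target, and the "largest time overlap wins" matching rule are all assumptions made here for the example. Running pyannote and Whisper themselves is left out; their outputs are assumed to have already been reduced to simple (start, end, label) and (start, end, text) tuples.

```python
from pathlib import Path
from typing import List, Tuple

# Assumed shapes (hypothetical, chosen for this sketch):
#   diarization turn:   (start_sec, end_sec, speaker_label)
#   transcript segment: (start_sec, end_sec, text)
Turn = Tuple[float, float, str]
Segment = Tuple[float, float, str]


def ffmpeg_wav_command(src: str) -> List[str]:
    """Build an ffmpeg command that converts `src` to a 16 kHz mono .wav.

    The sample rate and channel count are assumptions, not project settings.
    The command is only built here; pass it to subprocess.run to execute it.
    """
    dst = str(Path(src).with_suffix(".wav"))
    return ["ffmpeg", "-y", "-i", src, "-ar", "16000", "-ac", "1", dst]


def overlap(a_start: float, a_end: float, b_start: float, b_end: float) -> float:
    """Return the length in seconds shared by two time intervals (0 if disjoint)."""
    return max(0.0, min(a_end, b_end) - max(a_start, b_start))


def assign_speakers(segments: List[Segment], turns: List[Turn]) -> List[Tuple[str, str]]:
    """Label each transcript segment with the speaker whose turn overlaps it most."""
    labelled = []
    for seg_start, seg_end, text in segments:
        best_speaker, best_overlap = "unknown", 0.0
        for turn_start, turn_end, speaker in turns:
            shared = overlap(seg_start, seg_end, turn_start, turn_end)
            if shared > best_overlap:
                best_speaker, best_overlap = speaker, shared
        labelled.append((best_speaker, text.strip()))
    return labelled


def to_chat(labelled: List[Tuple[str, str]]) -> str:
    """Merge consecutive segments from the same speaker and render 'speaker: text' lines."""
    merged: List[Tuple[str, str]] = []
    for speaker, text in labelled:
        if merged and merged[-1][0] == speaker:
            merged[-1] = (speaker, merged[-1][1] + " " + text)
        else:
            merged.append((speaker, text))
    return "\n".join(f"{speaker}: {text}" for speaker, text in merged)


if __name__ == "__main__":
    # Made-up example data, only to show the expected input shapes.
    turns = [(0.0, 4.0, "speaker_1"), (4.0, 9.0, "speaker_2")]
    segments = [
        (0.2, 3.5, " Hello, thanks for coming."),
        (3.6, 4.2, " Sure."),
        (4.5, 8.0, " Happy to be here."),
    ]
    print(to_chat(assign_speakers(segments, turns)))
```

Choosing the turn with the largest overlap is one simple way to handle segments that straddle a speaker change; a segment that overlaps no turn at all is labelled "unknown" rather than being dropped.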

Repository files: Chat_with_audio.ipynb, README.md

mperetto/Python-Chat-with-audio
