AI preprocessing: Extract text from audio files and convert them to CSV files

name
Speech to text extraction
tag
AI preprocessing/OpenAI/generative AI
Connector used
REST Connector
API
API version: v1
AI preprocessing: Extract text from audio files and convert them to CSV files

Converting the input data used in RAG from voice data to text will lead to improved accuracy of the generated AI's responses.
This application loads audio files into OpenAI, an LLM, and extracts text from the audio files. The extracted results are output as a CSV file.
By using this application, you can efficiently prepare data, which is essential for generative AI, contributing to cost reduction and improved response quality.

Script Details

Convert audio files to CSV files

Voice_To_Text_OpenAI_convert

[Model used: whisper-1]
Extract time and text from audio files and save them to a file

Voice_To_Text_OpenAI_subtitle

[Model used: gpt-4o-transcribe or gpt-4o-mini-transcribe]
Extract text from audio files and save it to a file

Voice_To_Text_OpenAI_text

How to install and use it