AI preprocessing: Extract text from audio files and convert them to CSV files
- name
- Speech to text extraction
- tag
- AI preprocessing/OpenAI/generative AI
- Connector used
- REST Connector
- API
- API version: v1
Converting the input data used in RAG from voice data to text will lead to improved accuracy of the generated AI's responses.
This application loads audio files into OpenAI, an LLM, and extracts text from the audio files. The extracted results are output as a CSV file.
By using this application, you can efficiently prepare data, which is essential for generative AI, contributing to cost reduction and improved response quality.
Script Details
Convert audio files to CSV files
Voice_To_Text_OpenAI_convert
[Model used: whisper-1]
Extract time and text from audio files and save them to a file
Voice_To_Text_OpenAI_subtitle
[Model used: gpt-4o-transcribe or gpt-4o-mini-transcribe]
Extract text from audio files and save it to a file
Voice_To_Text_OpenAI_text
How to install and use it