AI preprocessing: Extract text from audio files and convert them to CSV files

name: Speech to text extraction

tag: AI preprocessing/OpenAI/generative AI

Connector used: REST Connector

API: API version: v1

AI preprocessing: Extract text from audio files and convert them to CSV files

Converting the input data used in RAG from voice data to text will lead to improved accuracy of the generated AI's responses.
This application loads audio files into OpenAI, an LLM, and extracts text from the audio files. The extracted results are output as a CSV file.
By using this application, you can efficiently prepare data, which is essential for generative AI, contributing to cost reduction and improved response quality.

Script Details

Convert audio files to CSV files

Voice_To_Text_OpenAI_convert

[Model used: whisper-1]
Extract time and text from audio files and save them to a file

Voice_To_Text_OpenAI_subtitle

[Model used: gpt-4o-transcribe or gpt-4o-mini-transcribe]
Extract text from audio files and save it to a file

Voice_To_Text_OpenAI_text

Download specifications
Try HULFT Square

How to install and use it

Script Details

Convert audio files to CSV files

[Model used: whisper-1]Extract time and text from audio files and save them to a file

[Model used: gpt-4o-transcribe or gpt-4o-mini-transcribe]Extract text from audio files and save it to a file

[Model used: whisper-1]
Extract time and text from audio files and save them to a file

[Model used: gpt-4o-transcribe or gpt-4o-mini-transcribe]
Extract text from audio files and save it to a file