AI preprocessing: Convert PDF files into QA format CSV files

name
AI preprocessing: Create QA tables from PDFs
tag
AI preprocessing/Claude/Generative AI
Connector used
REST Connector
API
API version: 2023-06-01
AI preprocessing: Convert PDF files into QA format CSV files

Dividing the input data used in RAG into questions and answers will lead to improved accuracy of the generative AI's answers.
This application loads PDF files into Claude, an LLM, and splits the contents of the PDF file into questions and answers. The conversion results are output as a CSV file.
By using this application, you can efficiently prepare data, which is essential for generative AI, contributing to cost reduction and improved response quality.

Script Details

Convert PDF files to CSV files in QA format

Create_QA_Claude_convert

Checking the limit value for the number of tokens required for PDF file conversion

Create_QA_Claude_validate_limits

Get the number of pages in a PDF file

Create_QA_Claude_get_max_page

Output QA conversion results for each page of a PDF file to a CSV file

Create_QA_Claude_write_csv

How to install and use it