Why does my PDF parsing not work?

posted in: AI | 0

While Word documents tend to have well-defined structures, not all PDFs are created equal. Some are literally images pasted into a document, and some are not set up to be accessible. The easiest way I’ve found to tell if … Continued
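One common way to check whether a PDF is mostly images is to try extracting text from each page and see how much comes back. The sketch below is an illustration only and is not necessarily the approach the full post describes. It assumes the third-party pypdf package is installed; the helper names `looks_scanned` and `extract_page_texts`, the 20-character threshold, and the "more than half the pages" rule are all hypothetical choices made for this example.

```python
def looks_scanned(page_texts, min_chars=20):
    """Guess whether a PDF is image-only from its per-page extracted text.

    page_texts: list of strings, one per page (may contain empty strings).
    min_chars:  assumed threshold; pages with fewer stripped characters
                are counted as having no usable text layer.
    Returns True when more than half of the pages have almost no text.
    """
    if not page_texts:
        # No pages or nothing extracted: treat as having no text layer.
        return True
    empty = sum(1 for t in page_texts if len((t or "").strip()) < min_chars)
    return empty / len(page_texts) > 0.5


def extract_page_texts(path):
    """Return the extracted text of every page using pypdf."""
    # Imported here so the heuristic above can be used without pypdf.
    from pypdf import PdfReader  # third-party: pip install pypdf

    reader = PdfReader(path)
    return [page.extract_text() or "" for page in reader.pages]


if __name__ == "__main__":
    import sys

    # Usage: python check_pdf.py some_file.pdf
    if len(sys.argv) > 1:
        texts = extract_page_texts(sys.argv[1])
        if looks_scanned(texts):
            print("Little or no extractable text: probably scanned images; OCR may be needed.")
        else:
            print("Text layer found: a regular PDF parser should work.")
```

If the check reports little or no text, a text-based parser will likely return empty strings no matter how it is configured, and the file probably needs OCR before parsing.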

Resources

posted in: AI, Learning | 0

Free eBooks
Project Gutenberg – Variety of books in different formats. Useful for downloading the text version for anything AI text related.

Python Packages
PDF
pypdf – reading, splitting, merging PDFs
PyMuPDF
textract
pdfminer.six
LlamaParse – more sophisticated parsing pdfs – … Continued