• 0 Posts
  • 1 Comment
Joined 1 year ago
cake
Cake day: November 27th, 2023

help-circle
  • simion314@alien.topBtoLocalLLaMAtable extraction from pdf
    link
    fedilink
    English
    arrow-up
    1
    ·
    1 year ago

    Are your pdf random documents from users ? If yes then it will a problem since there can be many ways the pdfs are structured depending on whatever tool was used. If all the pdfs are the same, like created by the same tool then maybe you have a chance, I would inspect the pdf layout and see if there are consistent and then maybe with a pdf library you can get the data (maybe you could use parts of pdf.js from mozila)